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Foreword 


This project started in July 2010 when Anil Chandy and Sandra Fabiani from 
Springer approached me to take on the editorship of a handbook series on 
manufacturing engineering. The initial thought that came to me was a colossal 
task involving hundreds of people and taking tens of years to complete. The 
decision of the topics to be included is equally daunting. We met a couple of 
times in person and over the Internet to size up the scope and contact a number of 
potential volume editors. After many attempts and sounding out prominent authors, 
I managed to convince three section editors to take on this task, which was still a 
long way from the 12 section editors planned. In May 2011, I approached the 
Executive Director of SIMTech, Dr Lim Ser Yong, for his help and a joint 
presentation together with Anil and Sandra was made to his research group leaders 
on the significance of such a project. He gracefully agreed. Much to our joy, five 
section editors agreed and were appointed in 2012, followed by another two editors 
in 2013. The last two overseas editors joined in mid-2013. In April 2014, we finally 
saw the project through to completion and the handbook is ready to roll out, even 
though it has been a lengthy journey! 
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Preface 


Innovation and manufacturing capabilities are well known to be the wealth creator 
of any nation which has strong advanced manufacturing technologies for making 
high- value-added products and is able to compete globally. 

Manufacturing is evolving continuously, engulfing more technologies than 
several decades ago. The rapid development of Internet technology, computer 
science, materials research, microelectronics, and biosciences has propelled 
manufacturing activities far beyond mere product fabrication. Manufacturing 
technology has now entered into the realm of intelligent product creation, and yet 
at affordable prices, and is highly compatible with environmental concerns. 

Manufacturing knowledge has been created by both the academia and industry, 
but unfortunately a great deal of information is scattered over a myriad of published 
papers, reports, and books - some are publicly available, while others remain 
proprietary information and are well guarded by the organizations which 
created them. 

The raison d’etre of the Handbook of Manufacturing Engineering and Technology 
is to gather the fundamental and evolving technologies in manufacturing engineering 
from many experts and practitioners in an attempt to cover as many fields as 
possible in common manufacturing activities. The collated materials will be updated 
frequently to capture the latest developments. The six volumes of the handbook cover 
the following topics: 

Volume 1 - Forming and Joining 

• Materials Forming: Forming of Polymer and Composite Materials 

• Metal Forming 

• Materials Joining 

Volume 2 - Machining and Tolerancing Systems 

• Machining 

• Tolerancing Systems 

Volume 3 - Nanomanufacturing and Non-traditional Machining 

• Nanomanufacturing Using Ion Beam Technology 

• Non-traditional Machining Processes 
Volume 4 - Robotics and Automation 

Volume 5 - Additive Manufacturing and Surface Technology 

• Additive Manufacturing: Rapid Prototyping, Tooling, and Manufacturing 

• Surface Technology 
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Preface 


Volume 6 - Product Life Cycle and Manufacturing Simulation 

• Product Life Cycle and Green Manufacturing 

• Manufacturing Simulation and Optimization 

It is hoped that these volumes provide useful assistance for both academia and 
industry with regard to the needed reference and basic knowledge of each process. 
What is more important is that the knowledge will be updated continuously to keep 
abreast with the state-of-the-art developments in the world of manufacturing 
research and practice. 

Andrew Y. C. Nee, DEng, PhD 
August 2014 Singapore 
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and their applications. The general concepts that are readily used in the field of 
polymer and polymer composite are discussed. This book chapter can provide 
basic understanding on polymer and polymer composite for newcomers. Then, 
the physical and mechanical properties of polymer and polymer-composites 
are described. Discussions on reinforced polymer composite highlighting on 
fabrication and characterization of polymer composite are provided, and 
particular importance is placed on the use. Discussions on the various 
nanofillers in polymer composites and their modification using various tech- 
niques have been focused on in this book chapter. 


Introduction 

Many of the terms, definitions, and concepts used in polymers are not generally 
found in other branches of science, these need to be understood in order to fully 
discuss the fundamentals of polymers. Application of polymer composites is 
increasingly important in many different industries, like aerospace, automotive 
sectors, and areas dealing with corrosion and construction because they are strong, 
resistant to damage, and easy to install. The global composites market is expected to 
reach about $62.6 bn in 2012 (Source: The Composites Market 2012-2022: Glass 
Fibre, Carbon Fibre and Aramid Fibre report, Visiongain 2012). In addition, 
applications are also found in fuel cells, sensors, electromagnetic interference 
shielding, human implants, and scaffolds. 

This book chapter highlights both basic fundamentals and advances toward the 
understanding of properties of polymers and polymer composites holding various 
aspects. This chapter also deals with basic characteristics of polymers and polymer 
composites. In addition, current application and future trend in advances of the 
polymer and its composite will be discussed. 


Polymer 

The name “polymer” gives an idea of the structure of the materials. The term 
“polymer” arrived from Greek words in which poly means many and mer means 
unit. A polymer is a very large molecule formed by successive linking of many 
monomers into a chain or network structure. The concept of the polymer emerged in 
the 1920s presented by Hermann Staudinger who received the Nobel Prize in 1953. 
Generally, a polymer has a chain or network structure made of a carbon backbone 
with hydrogen. In addition other elements such as O, N, F, Si, S can be arranged on 
it. Some of the very common polymers are polyethylene (PE), polypropylene (PP), 
polystyrene (PS), and polyvinylchloride (PVC) (Billmeyer 2007; Fried 1995; Mark 
and Kroschwitz 1985). 

The simple structure of polyethylene, for example can be written as given in Fig. 1. 
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Fig. 1 Simple structure of the polyethylene polymer 


Monomer 

Monomers are generally simple forms of organic molecules which can react and 
give larger molecules in the form of polymer. Figure 2 shows the monomer 
structure of different polymers. 


Polymerization Reactions 

Polymerization is the process by which polymers are made or “polymerized”. In 
polymerization, chemical reactions occur between small simple hydrocarbon 
monomers. Generally there are two main types of polymerization reactions. One 
is addition polymerization or chain growth and the other is condensation or step 
growth polymerization. 


Addition (Chain Growth) Polymerization 


Addition polymerization involves a rapid “chain reaction” of chemically activated 


monomers. 


It occurs mainly in three stages which are initiation, propagation, and termina- 
tion. For example, in the case of polyvinylchloride (PVC), the polymerization of 
vinyl chloride (monomer), initiation can come from a free radical generated on 
vinyl chloride monomer by initiator. Free radical can also act to initiate and 
terminate the reaction. This process generally produces linear structures but can 
produce network structures. A variety of initiators may be used, for example 
a peroxide or azide containing molecule can be activated by thermal or light. 
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Fig. 2 Structure of some common monomers used for preparation polymer 
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Fig. 3 A simple mechanism of addition (chain growth) polymerization process 


Atom-transfer radical-polymerization (ATRP), reversible addition — fragmentation 
chain transfer polymerization (RAFT), and group transfer polymerization GTP are 
examples of addition (chain growth) polymerization. Figure 3 shows a simple 
mechanism of an addition (chain growth) polymerization process from monomer 
to polymer. 
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Fig. 4 Synthesis of phenol formaldehyde resins (polymer) through (step growth or stepwise) 
polymerization process 


Condensation (Step Growth or Stepwise) Polymerization 

In condensation polymerization, individual chemical reactions between reactive 
functional groups of the monomer that occur one step at a time, are slower than 
addition polymerization. Whether the final product of the polymer will be linear or 
network depends on the number of functionality of the monomer (a functionality of 
two generally gives linear, whereas functionality of three gives network type/ 
crosslink). Ring-opening polymerization (ROP) and polycondensation are exam- 
ples of condensation (step growth or stepwise) polymerization processes. Figure 4 
shows a simple process, in which phenol and formaldehyde form phenol formal- 
dehyde resins (polymer) through a (step growth or stepwise) polymerization 
process. 

There are many techniques available for polymerization, as given in Fig. 5. 
In addition, polymerization processes are also classified in bulk, suspension, solution, 
and emulsion. 


Properties of Polymers 
Structure of Polymer 

A polymer structure can be classified into three possible molecular structures based 
on architecture which are linear polymer, branched polymer, and crosslinker 
polymer which is shown in Fig. 6. 

(a) Linear polymer: A linear polymer consists of a long chain of monomers which 
are covalently bonded to each other. Some common examples for linear poly- 
mers are high density polyethylene (HDPE), PVC, nylon, polyester, and 
PAN etc. 

(b) Branch polymer: A branched polymer has small branches covalently attached 
to the main chain. Some common examples are low density polythene (LDPE), 
glycogen, and starch. 

(c) Crosslinker polymer: In cross-linked polymers, polymer chains are linked to each 
other through a covalent bond. Cross linking results in a giant macromolecule 
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Fig. 5 Polymerization techniques 



Crosslinked network 


Fig. 6 (a) Linear chain; (b) branched molecule; (c) three-dimensional crosslinked network 
molecules in which chains are linked to each other through covalent bonds 

with a three-dimensional network structure. In elastomers, crosslink density is 
low or loosely bonded, while thermosets have high density crosslink networks, 
which make it hard, rigid, and brittle in nature. Bakelites and malamine formal- 
dehyde resins are some example of Crosslinker polymer. 

Another classification of polymers is based on the chemical type of the monomers 
used during polymerization process (Fig. 7). Homopolymers are made from the same 
monomers whereas copolymers are made of more than one different monomer 
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Block copolymer opolymer 
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Fig. 7 (a) Homopolymer; (b) random copolymer; (c) alternating copolymer; (d) block copoly- 
mer; (e) graft copolymer 


repeating units. In addition, depending on the arrangement of the types of units in the 
polymer chain, they can also be classified as random, alternating, block, and graft 
polymers. In random copolymers two or more different monomer units are organized 
randomly in polymer chain, whereas in alternating copolymers repeating units of the 
different monomers are arranged in alternating sequences. In block copolymers a long 
series of the same monomer is followed by a long chain of another monomer. Graft 
copolymers consist of a polymer chain made from one type of monomer with 
branches which are made from another type or similar type of monomer. 

Homopolymers can be classified based on tacticity into isotactic, syndiotactic, 
and atactic polymers (Figs. 8, 9, and 10). 

Isotactic Polymer 

In isotactic polymer all the substituents are located on the same side of the polymer 
backbone. Polypropylene synthesise using Ziegler-Natta catalysis is an isotactic 
polymer. Isotactic polymers are generally semicrystalline in nature, e.g., isotactic 
polystyrene. 

Syndiotactic Polymer 

In syndiotactic polymers the substituents are arranged in alternate positions along 
the polymer back bone. Polystyrene synthesized in the presence of metallocene 
catalysis during polymerization gives syndiotactic polystyrene which is crystalline 
in nature and has a melting point of about 161 °C, e.g., syndiotactic polystyrene. 
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Fig. 8 Isotactic polymer 



Fig. 9 Syndiotactic polymer 



Fig. 10 Atactic polymer 


Atactic Polymer 

When substituents are positioned randomly along the polymer chain they are 
known as atactic polymers. This kind of structure is generally formed by free- 
radical mechanisms. Polyvinylchloride and polystyrene are generally atactic in 
nature. Due to their random nature atactic polymers are usually amorphous. 

Tacticity of the polymers is technically very important for application. For 
example polystyrene (PS) can exist in atactic or syndiotactic form and shows 
very different properties in the different structures. In atactic PS, polymer chains 
stack in an irregular fashion, cannot crystallize and form a glass, whereas 
syndiotactic PS is a semi crystalline material. This is a very general example for 
many polymers. Tacticity also affects other physical properties, such as melting 
temperature and solubility. 

Besides tacticity another classification, based on head-tail configuration of vinyl 
polymers, should be taken into account when considering polymer defects (Fig. 11). 
In a head to tail configuration all monomers are normally linked in regular polymer 
units in which the substituent group on “p” position is separated by three carbon 
atoms whereas it is two and four carbon atoms for head to head and tail to tail 
configuration, respectively. 
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Head-Head Head-Tail Tail-Tail 

Fig. 11 Classification, based on head-tail configuration of monomer during polymerization 



Random organized: 
Amorphous region 

Lamellae structure; 
High crystallinity region 


Fig. 12 Schematic model of a spherulite contains crystalline region and amorphous region. The 
lamellae part is the crystalline region, the other part is the amorphous region 


1 ^ 

Generally nuclear magnetic resonance (NMR proton or C NMR), X-ray 
powder diffraction (XRD), secondary ion mass spectrometry (SIMS), and vibration 
spectroscopy (FTIR) characteristics techniques are used to measure the tacticity of 
the vinyl polymer. 

Microscopic Structure 

Properties of polymeric materials are very much affected by its microscopic 
arrangement of molecules or chains. Polymers can have an amorphous or partially 
crystalline/semicrystalline structure. Generally in amorphous polymers, molecules 
or chains are arranged in a random manner. In semi-crystalline polymer, molecular 
chains are partially aligned and chains fold together and form ordered regions 
known as lamellae which compose larger spheroidal structures named “spherulite”, 
an example of which is shown in Fig. 12. Formation of spherulite is controlled by 
many factors such as the number of nucleation sites, structure of the polymer 
molecules, cooling rate, etc. These factors control the diameter size of spherulite 
and may vary from a few micrometers to millimeters. Spherulites show higher 
density, hardness, and brittleness as compared to disordered polymers. The lamellae 
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Syndiotactic polymer 




Atactic polymer 


Fig. 13 Structure of syndiotactic and atactic polystyrene 


are connected by amorphous regions which provide certain elasticity and impact 
resistance. The degree of crystallinity is estimated by different analytical methods 
such as DSC, XRD, etc. Crystallinity of polymer typically ranges from 10 % to 
80 %. That is why crystallized polymers are often called “semicry stalline”. 

The properties of semicrystalline polymers are determined by the degree of 
crystallinity, size, and orientation of the molecular chains. Cooling rates, chain 
structure, and mer chemistry, side branching and chain regularity (isotactic or 
syndiotactic) are some of the factors which control the degree of crystallinity of 
the polymer. In general, crystallinity of the polymer increases with slow cooling 
rate, simple chain and mer structure, and less branching. 

Polymer structure and intermolecular forces are two major reasons for high 
crystallinity or high amorpharsity of a polymer. For example if the polymer chain 
is regular and orderly, it will turn into crystals easily. If it is not, it will not. To better 
understand, let’s compare the structure of atactic in syndiotactic form of polysty- 
rene when the R group is a phenyl ring (Fig. 13). 

Figure 13 shows that the syndiotactic polystyrene is very orderly, with the 
phenyl groups falling on alternating sides of the chain. This means it can pack 
very easily into crystals. However, atactic styrene does not have such an order. The 
phenyl groups come on any side of the chain they want and the chains cannot fit for 
well packing which leads to highly amorphous character in the atactic polystyrene. 
Other atactic polymers like PMMA (poly (methyl methacrylate)) and PVC (poly 
(vinyl chloride)) are also amorphous. In general, stereo-regular polymers like 
isotactic PP (polypropylene) and polytetrafluoroethylene are highly crystalline. 

PE is another good example and can be crystalline or amorphous. Linear PE is 
nearly 100 % crystalline where branched PE just cannot pack the way the linear PE 
can, so it is highly amorphous (Fig. 14). 
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Linear PE Branched PE 


Fig. 14 Structure of linear and branched PE 



HO HO H 

Fig. 15 In nylon the carbonyl oxygen and amide hydrogen form a hydrogen bond. This allows the 
chains to form the fiber in orderly fashion 


Crystallinity and Intermolecular Forces 

Intermolecular forces can play a major role in polymer crystallinity. For example, 
crystallinity of nylon is due to the internal force. It can be seen from Fig. 15, that the 
polar amide groups of nylon chains are strongly attracted to each other through strong 
hydrogen bonding. This strong binding results in crystalline behaviors of the nylon. 

Another example of this is poly (ethylene terephthalate) also known as polyesters. 

In this case polar ester groups of the poly(ethylene terephthalate) hold polyester 
into crystals. In addition pi-pi stacking of the phenyl ring is in orderly fashion, 
making the crystal even stronger (Fig. 16). Polypropylene, syndiotactic polysty- 
rene, nylon, Kevlar, and Nomex are some examples of highly crystalline polymers 
whereas poly (methyl methacrylate), atactic polystyrene, and polycarbonate are 
some examples of highly amorphous polymers. 

There is a way to find out how much of a polymer sample is amorphous or 
crystalline. Generally, differential scanning calorimetric (DSC) and X-ray diffrac- 
tion (XRD) are the instruments used to determine the crystalline or amorphous 
property of the polymer. The crystallinity can affect the physical and thermal 
properties of polymers. For example, density, mechanical strength, heat resistance, 
and creep resistance increases with an increase in crystallinity. 


Melting and Glass Transition Temperatures of Polymers 

The glass transition is the reversible transition in polymeric materials from a hard 
and relatively brittle state into a molten or rubber-like state. The term melting 
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R„ 




The polar ester group of polyester, 
allow chain to organize, responsible 

for crystallinity. 


The phenyl ring of polyester 
undergoes n-n (pi-pi) stacking, 

leads crystallinity. 


Fig. 16 The polar group and pi-pi stacking can affect the crystallinity of the polymer 


temperature for polymers, suggests a transition from a crystalline or semi- 
crystalline phase to a solid amorphous phase. Though the abbreviation of melting 
temperature is T m , it should more precisely be called crystalline melting tempera- 
ture. Crystalline melting is only discussed for crystalline or semi-crystalline 
polymers among synthetic polymers. Thermoset polymers are closely densely 
cross-linked in the form of a network, degrade upon heating, cannot be reused 
(e.g., crosslinked polyisoprene rubber); while thermoplastics, which do not contain 
cross-links, will melt upon heating, and can be recycled for reuse, e.g., polypro- 
pylene, polyethylene, PMMA, etc. 

Glass transition is a phenomenon that occurs at a specific temperature known as 
glass transition temperature (T g ), when amorphous materials or amorphous regions 
within semi crystalline materials go from a hard and relatively brittle state into a 
rubbery like state or vice versa. This is a reversible phenomenon which very much 
depends on the nature of the polymer. 

T m and T s usually characterize, respectively, the upper and lower temperature 


limits for applications of semi-crystalline polymers. T g may also describe the upper 
use temperature for amorphous materials. 

T m and T g , are much affected by molecular weight (MW) of polymers, presence 


of secondary bonding, chain flexibility/chain stiffness, density of branching, and 
thickness of the lamellae. The melting temperature of a polymer can be over a range 
of temperatures due to the variation of MW, and generally increases with increasing 
molecular weight (MW); whereas polar side groups, ether or amide linkages on the 
main chain, double bonds, aromatic groups, and crystallinity increase the melting 
temperature. Presence of the bulky, large size non polar groups, branching may 
lower T m and T g because it will decrease crystallinity thickness of the lamellae 
(crystallizing the solid at a low temperature or annealing just below T m will do this) 
and increase the rate of heating. 


Viscosity 

Viscosity is an important property and one of the key issues of the polymer that 
needs to be considered during manufacturing materials, it is the measured 
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Fig. 17 State the viscosity, (r|) behavior of polymers in these various regions 


resistance of the material which is being deformed by either shear stress or 
tensile stress. Viscosity very much depends on temperature. It is the proportion- 
ality constant between the shear stresses and the velocity gradient and can be 
represent as, 

Shear stress (t) = Viscosity (r|) x Velocity gradient (dv/dy). 

Figure 17 can state the viscosity (r|) behavior of polymers in different various 
regions. 

Mechanical Properties of Polymeric Materials 

Viscoelasticity, as a property of materials, is a combination of viscous and elastic 
behavior. It is both time dependent and temperature dependent. 

When a polymer is subjected to a step constant stress, polymeric materials 
experience a time-dependent increase in strain. This phenomenon is known as 
viscoelastic creep. It is temperature dependent and tests are conducted in the 
same manner as for metals. 

Creep modulus is a parameter to quantify this behavior of polymeric materials. 

E c (t) = o 0 /e(t), 

where e(t) is time dependent strain and a G is constant stress at a particular 
temperature. 

Stress relaxation, which is also a result of viscoelasticity, describes how poly- 
mers relieve stress under constant strain, like viscoelastic creep, it is also time 
dependent. Relaxation modulus is a common parameter to quantify this behavior of 
polymeric materials and can be given as: 
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E r = c(t)/e 0 , 

where a(t) = measured time dependent stress and e G is constant stress at a particular 
temperature. 

Tensile modulus or elastic modulus or just “modulus” of polymer is identical to 
Young’s modulus for metals. The value of tensile modulus tends to be much lower 
for polymers compared to metals. In the case of semi-crystalline polymers the 
tensile modulus arises from the combination of the modulus of the crystalline and 
the amorphous regions. Similarly, tensile strength, impact strength, and fatigue 
strengths of polymers are defined in the same way as for metals. In general these 
values are much lower for polymers. 

Ductility values are usually much higher for polymers than metals whereas 
fatigue curves are the same as for metals, and some polymers may or may not 
have fatigue limits. The values tend to be lower than for metals and very much 
affected by loading frequency. Tear strength which is related to the tensile strength, 
is the energy required to remove a cut specimen that has a standard geometry. 
Hardness of the polymer is determined by measuring the resistance to penetration, 
scratching, or marring the surface. Durometer and Barcol are common instruments 
used for hardness tests for polymers. 

Polymer properties are very sensitive to temperature and generally with increas- 
ing temperature, tensile strength, elastic modulus decreases whereas ductility 
decreases. Besides temperature, the properties of polymer are very much affected 
by environment, e.g., moisture, oxygen, UV radiation, organic solvents, etc. 


Deformation of Polymers 

Elastic and plastic deformations of polymers are general properties that are expe- 
rienced every day. Elastic deformations in thermoplastics, is similar to a metal 
spring which upon stretching shows uncoiling but returns to its original shape when 
the stretch force is released. In polymer, elastic deformation behavior comes from 
coil polymer chains which uncoil upon stretching, and the chains revert to their 
original conformations when forces are removed. This is a reversible process. 
During elastic deformation, primary bonds are being stretched but not broken. 

Plastic deformations come from the chains moving past one another, secondary 
bonds are being broken and reformed. However, when enough force is applied, the 
primary covalent-bonds within the chains are also broken. It is not a reversible 
process like elastic deformation. A polymer chain containing a double bond or 
bulky group will restrict ability of the chain to rotate freely, making the material 
more rigid. 

Typical stress-strain curves for the three different types of polymers are shown in 

Fig. 18. 

Figure 18a, shows the properties of brittle polymer which fail during elastic 
deformation. Figure 1 8b shows rubber like elasticity property of polymer whereas 
Fig. 18c shows the typical stress-strain curves of plastic polymer. 
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Fig. 18 Typical stress-strain curves for the three different types of polymers 


Application of Polymers 
Commodity Polymer 

Initially polymers had major applications in manufacturing commodity goods 
(some of them are shown in Table 1) but with time they expanded to include 
engineering trough polymer matrix composites (PMCs), which will be discussed 
later. 

Amphiphilic Block Copolymer 

In addition to the above application, it has many other applications with a special 
kind of polymer known as amphiphilic block which is exploited in various fields. 
Amphiphilic block copolymers consist of distinct hydrophilic and hydrophobic 
segments which are able to form micelles in appropriate solvents. The formation 
of micelle depends on the nature of block, solvent, and concentration (Fig. 19). The 
size and shape of micelles depend on the chemical structure of block, molecular 
weight of each block, number of aggregation, and nature of solvent (Hadjichristidis 
et al. 2002; Raez et al. 2003; Yan et al. 2001a; Cao et al. 2002). Recent progresses in 
the synthetic techniques have led to the successful synthesis of a wide range of 
block copolymers containing different types of core and corona blocks with desired 
properties. Micelles of different shapes and sizes can be obtained. The aggregates 
can be in the form of rods, spheres or vesicles depending on various factors 
including the type of solvent and aggregation number (Antonietti et al. 1995; 
Zhang and Eisenberg 1995; Shen et al. 1999; Zhang et al. 2000; Alexandridis and 
Lindman 2000; Discher and Eisenberg 2002; Jain and Bates 2003; Wang 
et al. 2003; Liu et al. 2003). 

Amphiphilic block copolymers are multipurpose useful materials. In recent 
years, block copolymers have found wide application in many areas. These are 
also used as a vehicle for controlling as well as targeting the release of vector 
agents, for both hydrophobic and hydrophilic (Gref et al. 1994; Allen et al. 1999). 
Thus these are exploited for applications in drug delivery (Qiu et al. 2009; 
Lavasanifar et al. 2002; Jones et al. 2003; Riley et al. 2003; Tang et al. 2003), 
tissue engineering, cosmetic, water treatment, and industrial waste treatment 
(Hadjichristidis et al. 2002). Biocompatible block copolymers such as PEO 
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Table 1 Name of some polymers and their characteristics and applications 


Polymer 

Acetal resin 


Monomer/polymer unit 



Acrylonitrile 

butadiene 

styrene 




Cellulose 

acetate 


Cellulose 
acetate butyrate 


Cellulose 

nitrate 


Chlorinated 

rubber 



Cyanoacrylate 

resin 




Epoxy resin 


Ethyl cellulose 




R = H or CH 2 CH 3 


Characteristics 


Uses 


Good tensile 
strength and 
stiffness; 
resistant to 
chemical and 
abrasion 


Inexpensive, 
strong, resilient 
and tough 


Susceptible to 
sunlight, heat and 
high humidity 


In mechanical parts 
communic ation 
equipment, 
Videocassettes, 
cosmetic containers, 
pipes and plumbing 
parts 

In automobile parts 
and fittings, 
telephones,, pipes and 
conduits, luggage, 
boats, toys, and 
bottles 

Photographic him, 
transparent sheeting 
and fibers 


Photographic him, 
varnishes and 
moldings 


Burns with a 
bright, violent 
hame; smells of 
nitrogen oxides 


Susceptible to 
ultraviolet light 
and contact with 
alkaline materials 


It has high 
strength, good 
abrasion and 
chemical 
resistance, low 
water absorption 


Tough, hexible, 
transparent him 
can be prepared 


Lacquer, fabric dope, 
adhesives, paint 


Used in paints, 
varnishes, adhesives, 
inks and paper 
coatings 

In gluing glass, 
ceramics and other 
hard materials. In 
suture skin and weld 
crowns 

It is used in adhesive, 
fills, printed circuit 
boards, molded 
products and enamel 
surface coatings 


Food containers; 
hot-melt adhesives, 
inks, and as protective 
coatings for paper and 
textiles 


(< continued ) 
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Table 1 (continued) 


Polymer 

Fluorocarbon 


Monomer/polymer unit 

F F F F 
I I I I 

-c-c-c-c - 

I I I I 
F F F F 


Hydroxyethyl 

cellulose 


Hydroxypropyl 

cellulose 




R = H or CH 2 CH(OH)CH 


Melamine 

formaldehyde 



Methyl cellulose 




Nylon 

(polyamide) 


R = H or CH 3 




Phenol 

formaldehyde 

resin 



Polycarbonate 




Characteristics 


Uses 


Serviceable 
temperature 
range — —20 to 
205; resistant to 
heat and 
chemical 

Discolors and 
becomes 
insoluble with 
thermal aging 


Mainly used in 
aerosols 


Emulsifier, stabilizer, 
thickener, film former 


Excellent 
photochemical 
stability, it has 
poor thermal 
stability and 
discolors with 
age 

Chemicals and 
heat resistant 
hard, durable 
glossy film 

Good stability 
with minimal 
discoloration or 
decrease in 
weight 

Good tensile and 
flexural strengths, 
elasticity, and 
wear resistance 
and low water 
absorption 

Inexpensive, 
resistant to 
chemical and 
heat, sunlight 
causes 

discoloration 


High dielectric 
strength, 
extremely tough, 
strong 
mechanical 
properties; strong 
UV absorbent 


Emulsifier, stabilizer, 
thickener, film former 
in foods, cosmetics, 
paint removers, paints 
and glazes; also used 
as a sizing agent for 
paper 

Decorative 
homeware, circuit 
breakers, paints and 
enamels 

In sizing paper, as an 
adhesive in textile and 
paper conservation 


Fibers, paints, films, 
foams, and molded 
parts 


Fibers, adhesives, 
plywood, textile 
industry, leather 
processing, paper, 
foams, chemical 
resistant coatings, 
printed circuit boards 

Widely used in 
unbreakable 
windows, bank 
screens, police 
shields, helmet visors, 
and household 
appliances 


(< continued ) 


20 


A. Chaurasia et al. 


Table 1 (continued) 


Polymer 

Monomer/polymer unit 

Characteristics 

Uses 

Polycyclo 

hexanone 

- 


0 

\CH- 

n 

Oxidation with 
time makes it 
brittle and less 
soluble 

Used in picture 
varnishes and for 
retouching 

Polyester 


Inexpensive, easy 
to fabricate, 
versatile, 
resistance to 
chemicals 

Mainly for sheets, 
films, autos and boats, 
pipping boxes 

Polyethylene 

glycol 

H 

Jn 

Very tacky and 
susceptible to dirt 

Pore former, solvents, 
plasticizers 

Polyethylene 

vinylacetate 

-+CH-CH% 

0 

l 

c=o 

1 

ch 3 

Clear, tough, 
crack 

resistant and 
flexible at low 
temperatures 

Used as paper 
coatings, shrink-wrap, 
and hot melt 
adhesives 

Polyethylene, 
high density 

< 

x-o-x 

I 

x-o-x 

i 

Inexpensive, 

tough, 

lightweight, 

flexible and 

chemical 

resistance 

Containers, 
packaging films, 
fibers, pipes, toys, 
bowls, and bottles 

Polyethylene, 
low density 

< 

x-o-x 

1 

x-o-x 

1 

Soft and more 
flexible with very 
lower tensile 
strength 

Sheeting, films, paper 
coatings, toys, bags 
and packaging 

Polyimide 

-i 

/ 0 On 

/ II II 

L R— c— N— c- 

( R , 

} 

n 


Adhesives, binders, 
fibers; flame-retardant 
clothing 

Polyisoprene 

-E-ch 2 

/ C=C N 

H 3 C H 

Serviceable 
temperature 
range —57 to 
110 


Polymethyl 
acrylates 
(acrylic resins) 


o^och 3 

n 

Excellent optical 
clarity, good 
weather stability, 
food chemical 
resistance 

Paints, coatings, 
adhesives, fabrics, 
textile and leather 
finishes, windows, 
optical lenses, glass- 
substitute 

Polypropylene 


ch 3 

-ch-ch 2 - 

n 

Excellent stress 
and scratch 
resistance, good 
chemical and 
heat resistant, 
lightweight 

Bottles, fishnets, pipe, 
clothing, vapor barrier 
films, road signs, 
carpet, artificial grass, 
laminates, food 
packages, furniture, 
and photographic 
enclosures 


(< continued ) 
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Table 1 (continued) 


Polymer 

Polystyrene 


Monomer/polymer unit 



Polyurethane 




ii ii 

-0-(CH 2 )-0-C-NH-(CH 2 )-NH-C- 


Polyvinyl 

acetate 



Polyvinyl 

alcohol 



Polyvinyl 

butyraldehyde 


Polyvinyl 

chloride 






Polyvinylidene 

chloride 


Cl ' 

c— 



Polyxylylene 




Characteristics 

Inexpensive; 
good stability, 
stiffness, and 
impact strength; 
susceptible to UV 
light 

Excellent 

hardness, gloss, 

and resistance to 

weathering, 

abrasion, 

chemical 

resistant 


Odorless, 

tasteless, 

nontoxic, slow 

burning, 

lightweight, 

colorless 

Elastomeric 

properties 

Tough, flexible, 
weather-resistant 


Resistant to 
ignition, 
corrosion and 
stains 

High strength and 

abrasion 

resistance, 

dimensionally 

stable, good 

durability 

Impermeable to 
gas and 
susceptible to 
moisture, weather 
deteriorates in 


Uses 

In food industry; used 
in insulation, toys, 
appliances, cabinets, 
containers, and 
furniture 


Elastomer, sealants, 
adhesives, furniture, 
mattresses, laminates, 
carpet cushions, 
soundproofing, 
flotation devices, 
packaging, and 
filtration 

Latex house paints, 
artists media and 
common household 
white glues 


As an adhesive, films, 
finishes 

Used as shatterproof 
safety-glass interlayer 


Gramophone records, 
sheeting, gaskets, 
tubing, raincoats, 
waterproof coatings 

Packaging, barrier 
films, fibers 


As a coating to 
improve mechanical 
strength and 
flexibility 


Siloxane 


R 

/O x | / 
^ Si 

R 


UV light 

Serviceable 
temperature 
range — —70 to 
200; heat 
resistance 


Used in electrical 
appliances and 
boards, aerospace, 
gaskets, molds 


(< continued ) 
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Table 1 (continued) 


Polymer 

Monomer/polymer unit 

Characteristics 

Uses 

Sodium 

carboxymethyl 

Cellulose 

RO' 

RO. 

r °. is 0 ? 

OR 

OR 

R 

R= H or 
O 

'^^ONa 

n 

Good stability 
with negligible 
discoloration or 
weight loss 

Used in detergents, 
food product and as a 
sizing agent for 
textiles and paper 

Soluble nylon 

0 

1 

H 

Becomes 
insoluble and 
shrinks with time 

As an adhesive, 
coating and sizing 
agent in paper 
industry to add 
strength to wet paper 
and consolidate 
friable pigments 

Urea 

formaldehyde 

resin 

N— ChL— N-CH„ 

i 2 1 2 

' C=0 

1 

HN— CH„— N 

^ 1 
1 
1 
i 

Susceptible to 
heat, acids, and 
alkalis 

Foams, insulation, 
coatings and 
adhesives 


Block 


Polar Block 
Non-polar 

Fig. 19 Schematic diagram 
of formation of micelles in 
different solvents 



Non-polar 



(Nojiri et al. 1990), polycaprolactone-b-poly(ethylene oxide), are particularly 
promising in the field of drug and gene delivery. 

The synthesis of metal or metal oxide nanoparticles in block copolymer micelles 
has brought substantial interest as a result of their unique properties. Block copol- 
ymer micelles filled with nanoparticles have shown special catalytic (Seregina 
et al. 1997; Bronstein et al. 2005; Klingelhofer et al. 1997; Mayer and Mark 
1997; Jaramillo et al. 2003), magnetic, electrical (Platonova et al. 1997; 
Rutnakompituk et al. 2002), and optical (Diana et al. 2003) properties. 

For block copolymers, the selectivity toward the core-forming block is important 
because of a physico-chemical interaction between the polymer core and the metal 
precursor that accelerates the metal incorporation into the micelle. A polymer block 
containing particular functional groups forming the miceller core can be loaded 
with some specific metal compounds. The micelles can often serve as a nano- 
reactor for nanoparticles formation. It may also play a role in stabilizing the 
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nanoparticles as the core forming polymer block that can be considered to exist in a 
quasi-solid state (as the core forming block is insoluble in the selective solvent) 
(Bronstein et al. 2005). 

Block copolymers are used as colloidal stabilizers to synthesize metallo- 
nanoparticle with controlled shape and size. They can provide an environment 
that can be used to prevent corrosion and leakage of heavy metals. They can also 
protect catalytic nanoparticles from deactivation (Antonietti et al. 1995; Mayer and 
Mark 1997; Forster and Antonietti 1998). 

Using this concept, block copolymer micelles containing a polystyrene core and 
a functional corona have been used to produce gold nanoparticles (Moller 
et al. 1996). Gold (Au)-labeled micelles have been incorporated using di-blocks 
of poly(2-vinylpyridine) (P2VP) (Bronstein et al. 2005)/poly(4-vinylpyridine) 
(P4VP) (Sidorov et al. 2004) and poly(ethylene oxide) (PEO) in water. The size 
of gold nano-particles obtained is dependent on the nature and number of unit of the 
two blocks. For example the size of gold nanoparticle is 1 — 4 nm for P2VP135-b- 
PEO350 whereas gold nanoparticles formed from P4VP28-b-PE045 ranged in size 
from 5 to 10 nm. The advantage of gold-labeled micelles with poly (ethylene oxide) 
PEO corona is that they allow for their preparation in water, an environmentally 
friendly medium. Zubarev et al. reported a stimulating approach for controlling the 
interfacial assembly of nanoparticles (e.g., gold nanoparticles) in self-assembled 
nanostructures of block copolymers, starting from gold nanoparticles covalently 
attached to V-shaped heteroarm chains (Zubarev et al. 2006). The supramolecular 
self-assembly of heteroarm star polymers leads to the precise location of gold 
nanoparticles at the core-shell interfaces of rod-like micelles or vesicles (Zubarev 
et al. 2006). Park and co-workers investigated the assembly of CdS or CdSe/ZnS 
quantum dots in vesicles or nanorods of PAA-based block copolymers (Sanchez- 
Gaytan et al. 2007). The application of block polymer is also reported for 
nanolithography for the development of biological performance of mineral oil, in 
biological and pharmaceutical applications (Spatz et al. 1999a; Loginova 
et al. 2004; Jeong et al. 1997; Otsuka et al. 2003). The synthesis of metal or 
semiconductor nanoparticles in the cores of amphiphilic block copolymer micelles 
in organic solvents was reported by many research groups (Antonietti et al. 1995; 
Moffitt et al. 1995; Saito et al. 1993). Using block copolymer micelle cores as nano 
reactors allows the synthesis of mono and bimetallic nanoparticles. The size of the 
nanoparticles, among other factors, depends on the particular reducing conditions 
as depicted in Fig. 20. 

Different morphology of the nanoparticles significantly changes the catalytic 
properties of such systems, even though the size may be similar. The crosslinking of 
block copolymer micelles provides an additional degree of stabilization to 
nanoparticles and allows the modification of micelle morphology (Lu et al. 2001; 
Underhill and Liu 2000; Yan et al. 2001b). The possibility of using nanospheres as 
nanoreactors for inorganic nanoparticles was demonstrated by the formation of iron 
oxide magnetic particles (Yan et al. 2001b) and catalytic Pd nanoparticles 
(Lu et al. 2001). The spatial distribution of nanoparticles in block copolymers 
was also investigated by many researchers (Hadjichristidis et al. 2002; 
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Fig. 20 Metal nanoparticles with different morphology could be obtained depending on the 
nature of reducing agent 


Antonietti et al. 1995; Jinnai et al. 2006). The selective separation of nanoparticles 
among blocks was achieved as a result of the presence of functional groups in a 
selected block (Bronstein et al. 2005; Sidorov et al. 2004). A large number of 
amphiphilic block copolymers forming micelles with a functionalized core in an 
organic medium are available but when aqueous solutions are favored, the choice of 
block copolymers is very limited. Here, nanoparticle formation is normally 
more complicated as the pH of the medium should be taken into consideration. 
A few examples of such block copolymers include poly(ethylene oxide)-block- 
poly(2-vinylpyridine) (PEO-b-P2VP), polybutadiene-block-poly (ethyeleneoxide) 
(PB-b-PEO), poly styrene-block-poly (2-vinylpyridine) -block-poly (ethyelene oxide) 
(PS-b-P2VP-b-PEO), and poly-[methoxyhexa(ethylene glycol) methacrylate] -block- 
[2-(diethylamino)ethyl methacrylate] (PHEGMA-b-PDE AEM A) . P2VP and 
PDEAEMA are examples of pH sensitive block as their core forming capability 
depends on pH (Spatz et al. 1999a). For example, at pH below 5, PEO-b-P2VP 
forms a molecular solution in water, whereas with a further decrease of pH, the 
PEO-b-P2VP forms a micellar solution. No micellar decomposition took place during 
incorporation of metal compounds or metal nanoparticle into micelle solution due to 
interaction with metal species (Bronstein et al. 1999). Morfit et al. reported the 
formation of spherical assemblies of CdS containing block copolymer using reverse 
micelles in aqueous solution (Moffitt et al. 1998). These stable assemblies were 
created by the slow addition of water into mixtures of the reverse micelles formed 
by poly sty rene-b-poly (aery lie acid) diblock chains. This resulted in micelles 
containing quantum-confined CdS nanoparticles. This method allows the relocation 
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of the CdS nanoparticles formed in the micelle cores in organic medium to aqueous 
medium without the loss of stability or nanoparticle aggregation (Moffitt et al. 1998). 

Organization of nanocrystals has been demonstrated by taking advantage of 
block copolymer self-assembly (Chaurasia et al. 2011). The poly(styrene)-b-poly 
(2-vinyl-pyridine) (PS-b-P2VP) diblock copolymer was widely utilized as a tem- 
plate for synthesis of hexagonally arranged gold nanoparticles of controlled size 
(Mela et al. 2007; Spatz et al. 1999b). Non-spherical gold (Antonietti et al. 1996) 
and cobalt (Platonova et al. 1997) nanoparticles are prepared using poly(styrene)- 
block-poly(4-vinylpyridine) (PS-b-P4VP) diblock copolymer. However, synthesis 
of nanoparticles of various metal oxides, e.g., Ti0 2 (Li et al. 2005), Si0 2 (Kim 
et al. 2004), and Fe 2 0 3 (Bennett et al. 2005) with the help of various diblock 
copolymers have been reported. An attempt has been made to prepare organized 
ZnO nanoparticles using a Zn(CH 2 CH 3 ) 2 precursor in a PS-b-P2VP block copoly- 
mer. However, the procedures are difficult partly due to the highly reactive and 
moisture sensitive nature of the precursor, which was traditionally used for metal 
organic chemical vapor deposition (MOCVD) (Braun et al. 2010). Yoo 
et al. reported synthesis of ZnO nano-arrays using oxygen plasma treatment of 
PS -b-P4 VP/zinc chloride film cast from toluene. ZnO nanoparticles with consider- 
ably larger particle size of about 16 nm diameter were obtained. Alok et al. reported 
a facile method for synthesis and organization of ZnO quantum dots (QDs) in 
various morphology using zinc acetate as precursor and PS-P4VP as a template 
(Chaurasia 2012). 


Polymers Composites 

Composite material is a material, composed or made of two or more distinct phases 
(matrix phase and dispersed phase), having significantly different bulk properties 
from those of any of the constituents (Campo 2008; Alok et al. 2011; Katz and 
Milewski 1978; Rosato 2004; Biron 2007). 


Matrix Phase 

Matrix phase is the primary phase, having a continuous character, usually a softer 
and ductile phase. It helps to hold the dispersed phase. A key role of the matrix is to 
serve as a binder of the fibers with desired shape and protect them from mechanical 
or chemical damages. Composite materials can be categorized based on the matrix 
material (metal, ceramic, and polymer) and the material structure. 

Based on the nature of the matrix material of the composite it can be classified 
into metal matrix composites (MMC), ceramic matrix composites (CMC), and 
polymer matrix composites (PMC). This book chapter is about discussion on 
polymer matrix composites (PMC). 

Unsaturated polyester (UP) and epoxiy (EP) are examples of thermoset whereas 
polycarbonate (PC), polyvinylchloride, nylon, and polysterene are thermoplastic. 
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Dispersed (Reinforcing) Phase 

Dispersed phase is the secondary phase of composite, embedded or dispersed in the 
matrix in a discontinuous manner. In the composite, the mainly dispersed phase is a 
greater load carrier than the matrix; therefore it is also known as the reinforcing 
phase. When small additives like metal alloys, doped ceramics or polymers mix in 
dispersed phases they are not considered as composite materials since their bulk 
properties are similar to those of their base constituents. (Physical properties of 
steel and pure iron are almost the same.) 

According to the classification of composites, PMC is the material consisting of 
polymer matrix and reinforcing dispersed phase (glass, carbon, steel or Kevlar 
fibers). PMCs are widely used due to their simple fabrication methods and low cost. 

Reinforcement of polymers with a strong fibrous network permits fabrication of 
PMC which provides advantage over non-reinforced polymers in terms of mechan- 
ical properties. Reinforced PMCs provide better tensile strength, high stiffness, 
high fracture toughness, good abrasion resistance, good puncture resistance, and 
good corrosion resistance. The main disadvantages of PMCs are associated with 
low thermal resistance and high coefficient of thermal expansion. 

Properties of PMCs are determined by properties of the fibers, orientation of the 
fibers, concentration of the fibers, and properties of the matrix. 

Two types of polymers are used as matrix materials for fabrication composites, 
one is thermosets which are generally epoxies or phenolics and the other is 
thermoplastics (LDPE, HDPE, polypropylene (PP), nylon, and acrylics, etc.). 

Fiberglass, carbon fiber, and Kevlar (aramid) fibers are widely used to 
make PMCs. 


Fiberglass: Glass Fiber Reinforced Polymers 

PMC is reinforced by glass fibers (fiberglass is a common name). Use of glass as 
reinforcing fibers in PMSc shows better corrosion resistance and high tensile 
strength, which may go up to 590 psi. The making of glass fiber reinforced polymer 
composite is simple and needs only low-cost technology. 

Glass fibers are made of molten silica-based or other formulation glass, from 
which glass extruded and then gathered to strands. The strands are used for 
preparation of yarns, rovings, woven fabrics, and mat glass fiber products (Fig. 21). 

Different kinds of glass fibers are used for making PMCs depending on the end 
requirement. For excellent electrical insulator PMCs, the most popular and inexpen- 
sive E-glass fibers are used. The designation letter “E” means “electrical” (E-glass is 
insulator). The composition of E-glass which is an excellent insulator ranges from 

50-57 % Si0 2 , 10-17 % A1203, 17-26 % CaO, and 9-14 % B203. For high strength 
PMCs, S-Glass is used. It has applications in military and aerospace areas. S-glass is 
generally made of silica (Si0 2 ), magnesia (MgO), and alumina (A1 2 0 3 ). 

Besides S-glass and E-glass, there are S + R-glass and C-glass. S + R glass is 
strongest and most expensive and has a diameter half of that of E-glass. C-glass is 
used for preparation of corrosion and chemical resistant PMCs, widely used for 
manufacturing storage tanks, pipes, and other chemical resistant equipment. 
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Fig. 21 General picture of glass fiber woven roving, Kevlar, and carbon fiber sheet 


The widely used matrix materials for manufacturing fiberglass-PMCs are unsat- 
urated polyesters (UP), epoxies (EP), nylon (polyamide), polycarbonate (PC), 
polystyrene (PS), and polyvinylchloride (PVC). 

Orientations in the reinforcing glass fibers in fiberglass layers of fiberglass- 
PMCs are very important as they greatly affect the anisotropy behavior of the 
final materials. Fiberglass normally contains between 42 % and 72 % glass fibers by 
concentration. 

Glass fiber reinforced polymer matrix composites are manufactured by open 
mold processes, closed mold processes, and the pultrusion method. 

Fiberglass-PMCs show excellent features in terms of high strength-to-weight 
ratio, high modulus of elasticity-to-weight ratio, corrosion resistance, insulating 
properties, thermal resistance (with respect to polymer matrix). 

Fiberglass materials are used for manufacturing, surfboards, gliders, kit cars, 
sports cars, microcars, karts, bodyshells, boats, kayaks, flat roofs, lorries, K21 
infantry fighting motor vehicles, minesweeper hulls, pods, domes and architectural 
features where a light weight is necessary, high end bicycles, body-parts for 
automobiles, such as the Anadol (Anadol was Turkey’s first passenger vehicle), 
Reliant (Reliant was a British car manufacturer), Airbus A3 20 (A3 20 is a short- to 
medium-range, narrow-body, commercial passenger jet airliners manufactured by 
Airbus), and radome ((combination of word of radar and dome) is a weatherproof 
enclosure that protects a microwave antenna). Fiberglass reinforced plastics (FRP), 
also known as glass reinforced plastics (GRP) are a modern composite material, 
used in chemical plant equipment manufacturing like tanks and vessels by hand 
lay-up and filament winding processes using BS4994-British Standard. (BS4994 is 
a British standard related to this application which still remains a key standard for 
the specification design and construction of vessels and storage tanks using 
reinforced plastics.) 

Besides the above applications, fiberglass materials are also used in 
UHF-broadcasting antennas, large commercial wind turbine blades, velomobiles 
(a bicycle car), and printed circuit boards used in electronics that consist of 
alternating layers of copper and fiberglass, which is technically known as FR-4. 
FR-4 is a grade designation given to glass-reinforced epoxy laminate sheets, tubes, 
rods and printed circuit boards (PCB). Glass fiber composite is also used in 
preparation of RF coils used in MRI scanners. 
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Carbon Fiber Reinforced Polymer Composites 

Carbon fiber-PMCs are similar to fiberglass-PMCs, in which carbon fibers are used 
instead of fiberglass as reinforced materials in polymer matrix. The reinforcing 
dispersed phase may be in the form of carbon fibers, commonly woven into a cloth. 

Carbon fibers are used in continuous or discontinuous form during PMCs 
manufacturing. It is expensive compared to glass fiber but has high specific 
mechanical properties to weight, with a very high modulus of elasticity, which 
can match that of steel in terms of tensile strength which may go up to more than 
1,000 ksi (7 GPa). In addition it possesses very low density of 114 lb/ft 3 (1,800 

Q 

kg/m ) and high chemical inertness. These properties make carbon fibers one for 
potential reinforcement. The disadvantage of carbon (graphite) fibers is being 
brittle, accountable for a catastrophic mode of failure. 

Some of the various types of carbon fibers available are “ultra-high modulus” 
(UHM), “high modulus” (HM), “intermediate modulus” (IM), “high tensile” (HT), 
and super high tensile (SHT). 

“Ultra-high modulus” (UHM) carbon fibers has modulus of elasticity of about 
65,400 ksi (450 GPa) whereas “high modulus” (HM) has modulus of elasticity is in 
the range 51,000-65,400 ksi (350-450 GPa). For “intermediate modulus” (IM) has 
modulus of elasticity in the range 29,000-51,000 ksi (200-350 GPa). For “high 
tensile” (HT) carbon fibers generally have tensile strength of 436 ksi (3 GPa) and 
modulus of elasticity of 14,500 ksi (100 GPa) whereas for super high tensile (SHT) 
carbon fiber, tensile strength is about 650 ksi (4.5 GPa). 

Carbon fibers are manufactured by PAN-based carbon fibers, the most well-liked 
type of carbon fibers. Polyacrylonitrile (PAN) is used as a precursor for preparing 
PAN-based carbon fibers. In this method, the polyacrylonitrile precursor goes 
through several steps to become a carbon fiber (thermal oxidation at 200 °C, 
carbonization in nitrogen atmosphere at 1,200 °C for several hours, and graphiti- 
zation at 2,500 °C). Coal tar or petroleum asphalt is used as the precursor in pitch- 
based carbon fibers. 

Epoxy, polyester, and nylon are among some of the polymers used as a matrix 
for preparation of carbon fiber based PMCs. This composite is generally prepared 
by open mold, closed mold, and the pultrusion method. Carbon fiber reinforced- 
PMCs are light in weight, show high strength, high modulus elasticity, high fatigue, 
good electrical conductivity, corrosion resistance, good thermal- stability, and low 
impact resistance. 

Carbon fiber reinforced-PMCs are widely used in automotive, marine and 
aerospace applications, golf clubs, skis, tennis racquets, fishing rods, light weight 
bicycle frames, artificial light weight legs, etc. 

Kevlar (Aramid) Fiber Reinforced Polymers 

Kevlar fibers were originally developed as a replacement for steel in automotive 
tires, because of its high tensile strength-to- weight ratio, by this measure it is five 
times stronger than steel on an equal weight basis. Kevlar is a trade name, registered 
by DuPont Co. in 1965. It is an aramid fiber and its chemical name is poly- 
para-phenylene terephthal amide. It is synthesized from 1 ,4-phenylene-diamine 
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(para-phenylenediamine) and terephthaloyl chloride through a condensation reac- 
tion in solution from. 

Apart from high tensile strength, it has very high modulus of elasticity, very low 
elongation breaking point, very low coefficient of thermal expansion, high chemical 
inertness, high fracture and high cut resistance, textile processability, excellent flame 
resistance, and toughness. It also shows high impact resistance and low density. 

The disadvantages of Kevlar are its ability to absorb moisture, it is difficult to 
cut, and has low compressive strength. 

There are several grades of Kevlar which are developed for various applications. 
Kevlar 29 - high strength, (~3,700 MPa) used for bullet-proof vests, composite 
armor reinforcement, helmets, cars, etc. Kevlar 49 has a high modulus about 
132 GPa, high strength (~3,810 MPa), low density, and is used in aerospace, 
automotive, and marine applications. Kevlar 149 which has an ultra-high modulus 
(about 187 GPa), high strength (about 3,600 MPa), low density highly crystalline 
fibers, and is used for composite aircraft components. 

The name of some other modified Kevlar are Kevlar K100 (colored version of 
Kevlar), Kevlar K119, Kevlar K129, Kevlar AP (has 15 % higher tensile strength 
than K-29), Kevlar XP (lighter weight resin). Kevlar KM2 is used as enhanced 
ballistic resistance for armor applications. Most of those Kevlar fibers are used in 
aerospace armor areas where mechanical, chemical properties, and weight play an 
important role. 

UV degradation is the main drawback of Kevlar fiber and the ultraviolet present 
in sunlight degrades and decomposes Kevlar, so it needs protection during outdoors 
application. 

However a combination of Kevlar and carbon fibers, a hybrid fabric, further 
improves their properties and give very high tensile strength, high impact, and 
abrasion resistance. 

Epoxies (EP), vinylester, and phenolics (PF) are the most widely used matrix 
materials for manufacturing Kevlar (aramid) fiber reinforced-PMCs. Kevlar (ara- 
mid) fiber reinforced-PMCs are manufactured by open mold processes, closed mold 
processes, and the pultrusion method. 


Effect of Length and Orientation of Reinforcing Material on PMCs 

For fibrous composites, dispersed phase in form of fibers improves strength, 
stiffness and fracture toughness of the material, there is hindered crack growth in 
the directions normal to the fiber, and strength increases significantly when the 
fibers are arranged in a particular direction (preferred orientation) and a stress is 
applied along the same direction. In general PMCs strength is higher in long-fiber 
compared to that of short-fiber. 

Short-fiber reinforced composites, consisting of dispersed phase in the form of 
discontinuous fibers, has a limited ability to distribute the load but is able to share 
the load. In addition orientation of the fibers in composite also decides the end 
properties of PMCs. 
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Short-fiber can exist in random preferred orientation in composites, whereas 
long-fiber reinforced composites consist of a reinforced matrix that can exist in the 
form of continuous fibers with unidirectional or bidirectional orientation. 


Particulate Composites 

Besides reinforcing fibers, a different kind of particle is used to make polymer 
composite. Choice of particle as a reinforcing agent depends on the end use of 
composites. Particulate composites consist of a matrix reinforced with a 
dispersed phase in the form of particles. Effect of the dispersed particles on the 
composite properties depends on the particles size. Very small particles, 
(less than 0.25 pm in diameter) finely distributed in the matrix, prevent the 
deformation of the material by restricting the dislocations movement. This 
strengthening effect is similar to the metal alloy’s “age hardening”. It is clearly 
found that, for a given particle amount, the mechanical strength of composite 
increases with decreasing particle size. As for example, mechanical properties or 
strength of kaolin filled nylon 6,6 composites increase with decreasing mean 
particle size (Bradley 1999). There is a large improvement in tensile strength with 
decreasing particle size. This indicates that the strength increases with increasing 
surface area of the filled particles through a more efficient stress transfer 
mechanism (Fu et al. 2008). 

However, it is noted that for particles with size larger than 100 nm, the com- 
posite strength is reduced with increasing particle loading whereas for nanoparticle 
particles, with size 10 nm or lower, the strength of particle composites trend is 
reversed with loading. To conclude, particle size and amount of loading clearly has 
a significant effect on the strength of particulate-filled polymer composites. 

Interface adhesion quality, between particle (reinforcing) and polymer matrix on 
fiber-reinforced composites is very important, control the strength and toughness of 
PMCs. The adhesion strength at the interface decides the load transfer between the 
components. However the Young’s modulus is not affected by this interfacial 
adhesion quality because, for small loads or displacements, debonding is not yet 
reported. Evaluation of adhesion between two different materials can be done by 
comparing surface properties of the particle (reinforcing) with respect to the 
polymer matrix. The basic mechanisms related to polymer surface are responsible 
for adhesion at the molecular level. The strength of micro-particle-filled composites 
either decreases or increases with particle content. This can be explained by 
interfacial adhesion, between particle and matrix, which significantly affects the 
strength of particulate composites. Effective stress transfer is the key factor, 
contributes to the strength of two-phase composite materials. The stress transfer 
at the particle-polymer interface is inefficient for weakly bonded particles which 
leads to discontinuity in the form of de-bonding. As a result composite strength 
decreases with increasing particle loading. However, for well-bonded or compatible 
particles addition into a polymer matrix will lead to an increase in strength 
especially for nanoparticles with high surface areas. 
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For example, it is reported that the interface bonding strength between alumina 
nanoparticles and vinyl ester resin shows decreased strength due to particle agglom- 
eration, but when functionalization alumina nanoparticles are used this leads to a 
strong interfacial bonding between particle and matrix. This significantly increases 
both the modulus and strength of the composite. 

In general, therefore, quality of interfacial adhesion between particles and matrix 
has a very significant effect on composite fracture toughness. Strong adhesion leads 
to high toughness in thermoplastic matrices but not necessarily in thermosetting 
matrices due to different failure mechanisms. To summarize, the strength of 
particulate composites is determined not only by particle size and quality of 
interfacial adhesion between particle and matrix but also by the amount of particle 
loading. Various trends in composite strength have been observed due to the 
interplay between these three factors (particle size, amount of loading, and interfa- 
cial adhesion), which cannot always be separated. 

Use of hard particles such as ceramic particles prevent wear and abrasion of 
particulate composites and allow materials designed to work in high temperature 
applications, whereas copper and silver particles provide composites with high 
electrical conductivity matrices; for refractory use tungsten and molybdenum are 
used as dispersed phase to work in high temperature electrical applications. 


Laminate Composites 

Laminate composites are made when a fiber reinforced composite consists of 
several layers with different fiber orientations, it is also called multilayer (angle- 
ply) composite. 

These layers are arranged in different anisotropic orientations as a matrix 
reinforced with a dispersed phase in the form of sheets. It directs the increased 
mechanical strength where mechanical properties of the material are low. Scheme 1 
shows the various techniques for the preparation of polymer matrix composites. 


Advances of Polymer Composite and Future Trend 

As discussed previously, conventional reinforced fibers such as glass, carbon fiber 
have been used to make PMCs. Superior properties of carbon nanotubes (CNTs) 
such as low-weight, very high aspect ratio, high electrical conductivity, elastic 
moduli in the TPa range, and much higher fracture strain make them an attractive 
candidate over conventional reinforced fibers and CNTs, and they are being used to 
replace conventional reinforced fibers to achieve advanced functional composites 
which provide excellent properties in terms of strength, aspect-ratio, and thermal 
and electrical conductivity (Breuer and Sundararaj 2004; Spitalsky et al. 2010; 
Seymour 1990; Mylvaganam and Zhang 2007). 

Although CNTs came to light more than a decade ago, preparation of satisfac- 
tory polymer composites by CNTs has faced difficulties due to several challenges 
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Scheme 1 Various techniques for the preparation of polymer matrix composites 


such as purification, dispersion, alignment, and adhesion of CNTs. In addition the 
limitations in interfacial load transfer must be overcome. CNTs are a bit expensive 
compared to conventional reinforced fiber, but it is worth it to get PMCs which are 
armed with superior properties, and it is only a matter of time to produce low cost 
CNTs (Liu et al. 2012; Sahoo et al. 2010). 

This chapter will discuss the strategies taken by researchers to counter the above 
challenges, giving particular attention to the CNT-polymer composites 
(CNT-PMCs). 

Purification and yield of CNTs are fundamental challenges in terms of cost and 
time. Generally soot produced through sublimation and recombination for CNTs 
preparation, contains inherent contaminants and needs a purification process. Most 
of the adopted processes, to get rid of inherent contaminants, were time consuming 
and produced small quantities. Latter, an efficient purification method was devel- 
oped using coiled polymer to extract CNTs from their accompanying material with 
a high yield. In this method, a toluene containing poly(m-phenylene-co-2,5- 
dioctoxy-p-phenylenevinylene) solution was used to purify nanotube soot by a 
low power ultrasonic sonicated bath. It allowed inherent contaminants (solid 
material amorphous carbon) to settle to the bottom of the solution and nanotube 
composite suspension was then decanted. 

Dispersion of nanotubes plays a crucial role in controlling the final properties of 
CNT-PMCs. The effective use of CNTs depends on the uniform dispersion of CNTs 
into matrix without reducing their aspect ratio. However, CNTs tend to remain in 
form stabilized bundles due to van der Waals attraction and very low solubility in 
solvents. 

To overcome the dispersion problem, many mechanical/physical methods were 
adopted such as high shear mixing, solution mixing, melt mixing, electrospinning, 
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ultrasonication, anionic, cationic, and nonionic surfactants surfactant addition 
(Pang et al. 2010). In addition in-situ polymerization and chemical modification 
to functionalize CNTs has been performed. These methods have drawn much 
attention to the preparation of high performance CNT-polymer composites. 

In particular, functionalizing nanotube ends with carboxylic groups were 
reported quite some time ago. Recently, various functionalized CNTs were used 
to graft polymerization through anionic, ATRP, radical polymerization, click 
chemistry, and the preparation of CNT composites employing hyperbranched poly- 
mers to achieve good CNT dispersion to get polymer grafted CNT materials with 
improved dispersion ability (Sahoo et al. 2010). 

Some of the scheme for chemical modification for CNT is given in Fig. 22. 

In mechanically reinforced composites, one of the most important issues is the 
interfacial stress transfer as discussed previously. It is responsible for interface 
failures in shear stress condition. CNTs have an inherent smooth non-reactive 
surface which limits the interfacial bonding between the CNT and the polymer 
chains that limits stress transfer. One of the approaches to overcome the above 
problem is chemical modification and functionalization of CNTs as stated previ- 
ously, which can give better bonding sites to the polymer matrix, supported by 
computational calculation. 

A simple method was followed for integrating CNTs into epoxy polymer via 
chemical functionalization of CNTs. First SWNTs were treated with oxidizing 
agent, e.g., concentrated H 2 S0 4 /HN0 3 mixture, which generated -COOH 
and -OH group on CNTs surface (Sahoo et al. 2010). These functionalized CNTs 
were then dispersed in solvents like N, N-dimethylformamide/tetrahydrofuran; 
epoxy resin/polymer (Forney and Poler 2011). If needed a curing agent was 
added. This leads either to formation of covalent bonds between CNTs and 
polymer or better dispersion of CNT in the polymer of CNT-PMCs (Geng 
et al. 2008). It is reported that composite with 1 wt% functionalized CNTs 
showed an increase of 1 8 % and 24 % in tensile strength and modulus respectively 
over the polymer composites with unfunctionalized CNTs and a 30 % increase 
in tensile modulus over pure polymer (epoxy resin) (Mylvaganam and Zhang 2007 ; 
Geng et al. 2008). 

In addition, the pi bond present in CNTs structure interaction can be used to 
make pi-pi compatible CNT-PMCs by choosing a suitable polymer as a matrix. An 
example of such a CNT-polymer composite is SWNT-polypropylene composite 
which is made by combining the uniformly dispersed nanotube/solvent mixture 
with polypropylene matrix/solvent mixture to form nanotube/solvent/matrix mix- 
ture. The final composite, with 1 wt% of CNTs, showed more than 50 % increase in 
tensile strength. This substantial increase in strength was believed to be due to an 
effect of pi-pi interaction which leads to uniform dispersion of CNTs in the matrix 
material (Mylvaganam and Zhang 2007). 

The final properties of the CNT-PMCs can be controlled by using a different 
polymer and tuning the conditions used in making the composite. 

Various CNT-polymer composites have been reported to tune electrical proper- 
ties of composites depending on application as different applications need specific 
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Fig. 22 Scheme for covalent functionalization of CNTs 
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levels of conductivity, such as for electronic goods, semiconductor components, 
and circuit boards (Heeder et al. 2011). 

CNTs are excellent candidates for the fabrication of electrically conducting 
composites due to their high aspect ratio and high electrical conductivity. The 
electrical conductivity of individual CNTs has been measured and found to be on 
the order of 106 S/m. The maximum electrical conductivity of SWCNT films has 
been reported to be in the range of 104-105 S/m due to the contact resistance 
between the individual nanotubes. Therefore, the range of electrical conductivity of 
CNT/polymer composites has tremendous potential, and can be tuned to the 
electrical conductivity of CNT/polymer composites by varying the amount and 
dispersion of CNTs in the composites considering other factors too. The 
CNT/polymer composites can be used for a variety of applications including 
electrostatic dissipation (<10 -4 S/m), electrostatic painting (10 -4 — 101 S/m), 
electromagnetic interference (EMI) shielding (>101 S/m), printable circuit wiring, 
and transparent conductive coatings. CNTs are being used as fillers for electrically 
conductive adhesives because of their high aspect ratio, high electrical conductiv- 
ity, and high oxidation resistance. CNT/composites are widely used in photovoltaic 
devices and light-emitting diodes. CNT-conducting polymer composites have a 
potential application in supercapacitors. The PANI/MWNTs composites electrodes 
showed much higher specific capacitance (328 F g -1 ) than pure PANI electrodes 
(Sahoo et al. 2010). 

Again, the electrical conductivity of CNT/polymer composites is widely defined 
by the percolation theory. The common factors affecting the percolation threshold 
of electrical conductivity are similar to mechanical properties such as dispersion, 
alignment, aspect ratio, degree of surface modification of CNTs, types and molec- 
ular weights of the matrix polymer, and composite processing methods. 

Due to the superior mechanical and thermal properties of CNT/polymer com- 
posites, they have drawn great attention to the applications in high end areas such as 
aerospace and defense industries. The most possible application comes from 
substituting the metal composite with the significantly lighter weight CNT/polymer 
composites in the design of airframes which requires materials with low density, 
high strength, and modulus. O’Donnell et al. reported that CNT reinforced polymer 
composites can show a profitable effect on the commercial aircraft business due to 
lighter weight (less fuel consumption). CNTs can be used as additional filler to the 
carbon fiber-reinforced polymer (CFRP) composite to enhance its interlaminar 
fracture characteristics (Sahoo et al. 2012). 

Potential application of PMCs composite can be found in sensing important 
materials that are critical to the environment, space missions, industrial, agricul- 
tural, and medical applications. Detection of N0 2 and CO is important to monitor 
environmental pollution whereas sensing of NH 3 is required in industrial and 
medical environments. Sensors based on individual SWNTs/polymer were demon- 
strated for sensing N0 2 or NH 3 . During sensing, it is found that the electrical 
resistance profile of a semi-conducting SWNT changed significantly upon exposure 
of N0 2 or NH 3 gas (Penza et al. 2009). The existing electrical sensor materials 
including carbon black polymer composites operate at high temperatures for 
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substantial sensitivity whereas the sensors based on SWNT exhibited a fast 
response and higher sensitivity at room temperature. A nanotube-poly 
(dimethylsiloxane) polymer composite film that can be used to form nanosensor, 
contains at least one conductive channel comprising an array of substantially 
aligned carbon nanotubes embedded in a polymer matrix, can be used to determine 
a real time physical condition of a material, and can be exploited in monitoring the 
physical condition of wing or chassis of a flying airplane or space shuttle 
(Mylvaganam and Zhang 2007). 

Composites of conjugated polymers are becoming increasingly used for solar 
cells because of their great expectations for cheap energy conversion. In addition, 
low weight, flexibility, and inexpensive preparation procedures of polymer com- 
posites for solar cell application make them more attractive than crystalline inor- 
ganic semiconductors for future applications. There are many reports on the 
performance of polythiophene/fullerene solar cells with a hole-collecting buffer 
layer that was made using composite films of functionalized multi-walled carbon 
nanotube (f-MW CNT), poly (3 ,4-ethylenedioxythiphene) :poly (styrenesulfonate) 
(PEDOT:PSS), and ZnO nano particle. P3HT:PCBM bases solar cell reported 
efficiency as high as 5 %. 

Among various polymer electrolyte membranes, Nation is the most suitable 
candidate for the fabrication of fuel cell membranes owing to its remarkable ionic 
conductivity and chemical Nafion-based membranes have a high production cost, 
low conductivity at low humidity and/or high temperature, loss of mechanical 
stability at high temperature, elevated methanol permeability, and restricted oper- 
ation temperature. The higher methanol permeability not only decreases the fuel 
cell efficiency, but also the cathode performance. These problems can be overcome 
by the incorporation of CNTs into the Nation membrane to improve the mechanical 
stability, the proton conductivity and to decrease methanol permeation of the 
Nation membrane. Choi’s research group prepared functionalized MWCNTs (oxi- 
dized and sulfonated MWCNTs) with reinforced Nation nanocomposite mem- 
branes for PEM fuel cell (Lee et al. 2011; Liu et al. 2012). 

Fullerenes, a family of carbon allotropes, were discovered in 1985 by Robert 
Curl, Harold Kroto, and Richard Smalley. Spherical fullerenes are also called 
buckyballs. The structures of fullerene (C60) and methanofullerene phenyl-C61- 
butyric-acid-methyl-ester (PCBM) are shown in Fig. 23. 

Fullerene (C60) has attracted continuous attention since its discovery due to its 
exceptional physical and chemical properties. Fullerene-containing materials have 
shown wide and promising applications in the field of superconductors, ferro- 
magnets, lubrications, photoconductors, and catalysts (Prato 1997; Wudl 1992, 
2002 ). 

Organization of fullerene (C60) and its derivatives into nanostructures within 
polymer systems has potential applications in solar cells and biomedicine (Chen 
et al. 2009; Po et al. 2010; Sariciftci et al. 1993; Orfanopoulos and Kambourakis 
1995). For example, interesting results on polymer solar cells were reported using a 
blend of fullerene derivative and a block copolymer poly(4-vinyl pyridine) of poly 
(3 -hexyl thiophene) (P3HT-b-P4VP) (Sary et al. 2010). It was also found that 
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Fig. 23 Structures of (a) 
fullerene (C60); (b) 
methanofullerene phenyl- 
C6 1 -butyric-acid-methyl- 
ester (PCBM) 




MeO 



fullerene in aqueous solution can generate singlet oxygen under photo-irradiation 
which has implications in the studies of biomedical and environmental science 
(Orfanopoulos and Kambourakis 1995; Anderson et al. 1994). One of the biolog- 
ically most relevant features of C60 is the ability to function as a “free radical 
sponge” and quench various free radicals more efficiently than conventional anti- 
oxidants (Krusic et al. 1991). Fullerene has widespread applications ranging from 
drug-delivery and tissue-scaffolding systems to consumer products (Markovic and 
Trajkovic 2008), and it has been explored in the area of biological chemistry, such 
as enzyme inhibition, antiviral activity, DNA cleavage, and photodynamic therapy 
(Boutorine et al. 1994). However, many of C60 potential applications have been 
seriously hampered (Zhu et al. 1997; Ravi et al. 2005) by its extremely low 
solubility in water. Derivatization of the fullerene molecule with various functional 
groups and other solubilization procedures such as surfactants or long chain poly- 
mers (Ford et al. 2000; Mehrotra et al. 1997) is done through covalent interactions. 
Alok et al. fabricated functionally unmodified C60-containing nanostructures via a 
combination of an amphiphilic block copolymer P4VP8-b-PEO105-b-P4VP8 self- 
assembly and charge-transfer complexation between fullerenes and P4VP segments 
in organic solvent (Alok et al. 2011). 

Recently, the development of graphene-based polymer nanocomposites has 
become a new direction of research in the area of polymer nanocomposites. 
Graphene is an allotrope of carbon with a two-dimensional structure in which sp“ 
bonded carbon atoms are densely packed in a honeycomb crystal lattice into a one- 
atom-thick planar sheet. Graphene possesses high thermal conductivity, superior 
mechanical strength, and excellent electronic conductivity. As compared with 
CNTs, graphene has become a relatively cheap nanomaterial because its synthesis 
procedure is much simpler than those methods used for synthesis of carbon 
nanotubes. It has been reported that the improvement in mechanical properties of 
polymers by adding graphene is much more efficient than that by nanoclay or other 
nanofillers. Therefore, graphene is considered as a good choice of nanofillers for 
making advanced polymeric nanocomposites (Sahoo et al. 2012). 

The unique structure and high surface area of graphene sheets allow them to be 
used as composite fillers in fuel-cell and solar cell applications. Among various 
polymer electrolyte membranes, Nation (Nation is a sulfonated tetrafluoroethylene 
based fluoropolymer-copolymer) has received significant attention due to its 
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remarkable ionic conductivity, and chemical and mechanical stability. However, the 
disadvantages of Nation-based membranes are high production cost, low conductivity 
at low humidity and/or high temperature (over 100 °C), loss of mechanical stability at 
high temperature (around 100 °C), elevated methanol permeability, and restricted 
operation temperature. PEM fuel-cell performance can be improved by increasing the 
proton conductivity of the membrane by incorporating 10 wt% of sulfonic acid- 
functionalized graphene oxide (GO) into the Nation matrix. These experimental 
results suggest that the functionalized GO/Nafion nanocomposites offer significant 
promise as electrolyte membranes for PEMFC applications (Sahoo et al. 2012). 

When Nation membrane is replace by poly (ethylene oxide) (PEO), which leads 
to a GO/PEO membrane, it shows proton conductivity of 0.09 S cm -1 , at 60 °C, and 

o 

a power density of 53 mW cm in a hydrogen PEMFC. It is due to partially 
existing -COOH groups on the GO in the form of -COO - and H + at room 
temperature. This provides better ionic/protonic conductivity in the PEO/GO com- 
posite membrane (Sahoo et al. 2012). 

Organic photovoltaic cells (OPV) are of great interest as a potential source of 
renewable energy and as a promising alternative to traditional inorganic solar cells 
due to their light weight, ease of manufacturing, compatibility with flexible sub- 
strates, and low cost. Currently, the most successful OPV cells are fabricated with a 
BHJ architecture based on poly(3-octylthiophene) (P30T) as the donor and the 
fullerene derivative 6,6-phenyl C60 butyric acid methyl ester (PCBM) as the 
acceptor. Graphene based PMC has great potential to be used as an acceptor for 
photovoltaic devices due to its excellent electron transport properties and extremely 
high carrier-mobility. Most research focuses on replacing or cooperating with 
PCBM of polymer-based OPVs because its electron mobility is high and its energy 
level can be tuned easily through controlling its size, layers, and functionalization. 
However, the power conversion efficiency (PCE) values of the OPVs reported in 
the literature are only slightly higher than 1 %, indicating that graphene is still far 
from being qualified to act as this kind of material (Sahoo et al. 2012). 

Numerous functional filler, reinforcements, function polymer, and fabrication 
techniques are being exploited by researchers to achieve various kinds of PMCs 
armed with various properties. 


Summary 

As is known, polymers or polymer composites have various applications. However, 
there is still a wide scope to explore various paths and ideas to improvement of 
properties like high-strength, light weight, high performance composites, and 
electronics to make more convenient, sophisticated, customized tools or products 
for future application. One of the ways to do this is by synthesizing/modifying 
polymers, reinforced fibers, functional fillers, and improving or inventing new 
techniques for making sophisticated products in the future. 

For example, currently although there are various reinforcing materials, glass 
fibers and carbon fibers are used most in preparation of high performance 
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polymercomposite, but CNT-filled polymers show potential applications due to 
improved properties, such as high-strength, light weight, and high performance 
composites; until now, there has not been much industrial successes showing their 
advantage over traditional carbon fibers. Because of their nanometer scale and high 
aspect ratio, CNTs usually form stabilized bundles due to van der Waals interac- 
tions, despite various methods such as melt processing, solution processing, and 
in- situ polymerization which are used to counter these problems. However, there 
are still opportunities and challenges to be found in order to improve dispersion and 
interfacial properties. The mechanical properties of CNT/polymer nanocomposites 
may be compromised between the carbon-carbon bond damage and the increased 
CNT-polymer interaction due to the CNT functionalization. Similarly, electrical 
conductivity of a CNT/polymer nanocomposite is determined by the negative effect 
of carbon-carbon bond damage and the positive effect of the improved CNT 
dispersion. In either case, the choice and control of tailored functionalization sites 
for chemical modification of CNTs are extremely necessary. It is also necessary to 
understand the mechanisms involved in the methods used to improve the properties 
of CNT/polymer composites. This will be helpful to select the appropriate polymers 
and CNTs as well as maximum adhesion at the CNT-polymer interfaces. Another 
problem associated with CNT is its high cost. It is one of the major hurdles to accept 
CNT as a generous reinforcing agent over traditionally existing reinforcing agents 
like carbon fiber and glass fiber. So, bringing down the manufacturing cost of CNT is 
one of the aspects toward wide industrial acceptance of CNT as a reinforcing agent. 

Similar kinds of problems or challenges or scope for improvement are also 
present in other polymer composite systems for specific use. Other versatile valu- 
able applications are found with block copolymers. They are exploited for appli- 
cations in drug delivery, tissue engineering, cosmetics, water treatment, and 
industrial waste treatment. Block copolymer micelles are used in synthesis of 
metal or metal oxide nanoparticles which have shown special catalytic, magnetic, 
electrical, and optical properties. 

The future trend of polymer or polymer composite is or will be decided by 
market needs or demand and current/future research progress. 
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Abstract 

► Chapter 3, “Polymer Surface Treatment and Coating Technologies” mainly 
discusses the extensive studies which have been carried out on properties and 
applications of polymer and polymer nanocomposites in the field of 
bioelectronics. It also highlights on some of the interesting engineering applica- 
tions such as high-performance composites used in aerospace application. In 
addition to that, we briefly talked about biodegradable as well as biocompatible 
polymers which have gained significant attention due to its widespread use in the 
preparation of biocomposites for various biomedical as well as agricultural 
applications. Next part of the discussion emphasizes on conducting polymer 
composite mainly on carbon nanotube (CNT)/polymer composite because of 
continuous interest in the use of polymers (conjugate) for fabrication of numerous 
light and/or foldable electronic devices and they are also extremely promising 
candidates for sensor applications. It also focused on the application of polymer 
and polymer nanocomposites for packaging areas. The main advantages of plas- 
tics as compared with other packaging materials are that they are lightweight and 
low cost and have good processability, high transparency and clarity, as well as 
good barrier properties with respect to water vapor, gases, and fats. Our discussion 
on polymer composite ends with its utility in automotive applications. Because 
they are lightweight and due to their property tailorability, design flexibility, and 
processability, polymers and polymer composites have been widely used in 
automotive industry to replace some heavy metallic materials. 


Introduction 

Polymer nanocomposites have drawn extensive attention due to the scope of tuning 
of phyisicochemical properties of the materials for high and specific applications. 
Preparation of polymer nanocomposites gives choice to choose appropriate poly- 
mer and type of nanofiller and processing parameter depends on applications. 
Polymer composites have wide use, from high-end application to low-end applica- 
tion. In this chapter, studies have been carried out on properties and applications of 
polymer nanocomposites in the field of bioelectronics besides highlighting on 
engineering applications which include high-performance composites even used 
in aerospace application. 


2 Properties and Applications of Polymer Nanocomposite 
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Polymer Nanocomposites in the Bio Field 

In recent years, nano-biocomposites from biodegradable biocompatible polymers 
have gained significant attention for a wide range of biomedical applications owing 
to their U.S. Food and Drug Administration (FDA) approval for most of polymers. 
Nano-biocomposites obtained by adding biopolymers or nanofillers often result in 
improved material properties without having any toxic products (Bordes et al. 2009; 
Alok et al. 2011). Such eco-friendly biodegradable polymers are mainly destined to 
biomedical applications, drug/protein delivery tuning, and formulating biomedical 
devices. Biopolymers can be either chemically synthesized or biosynthesized from 
microorganisms. Figure 1 gives a classification with four different categories of 
biopolymers, depending on the synthesis method employed (Averous and 
Boquillon 2004). Biopolymers obtained from microorganisms, agro resources, 
and biotechnology are from renewable resources. 


Biodegradable Polyester Biocomposites 

Many types of polyester have been the predominant choice for materials in bio- 
compatible and biodegradable drug delivery systems. Polyesters such as PLGA, 


Biodegradable 

polyesters 


I 


Biomass products 

(agro resources) 



Polysaccharides 

(chitosan/chitin, 

sta rches) 


Proteins and 
lipids (collagen 
gealtin) 


Po I y hydroxy - 
alkanoate (PHA) 



butyrate (PHBJ 


I 

Biotechnology 

(bio-derived 

monomers) 


1 

Petrochemical 

products (synthetic 
monomers) 


Polylactides (PLAJ 


Polyca pro lactone 

(PCI) 


Polyesteramine 

(PEA) 


Aliphatic co- 
polyesters (PB5A) 


Aromatic co- 
polyesters (PBAI) 


Fig. 1 Classification of the biopolymers 
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PGA, PL A, PCL, etc., are polymers that have gained significant attention for a wide 
range of applications in the biomaterial field. Owing to their biodegradability and 
biocompatibility, polyesters have been widely used as carriers for drug, protein, and 
gene delivery. Fine-tuning of the drug release from polyesters has been extensively 
studied for many years (Agarwal 2012). 

Homopolymers and copolymers derived from glycolic acid or glycolide, lactic 
acid or lactide, and e-caprolactone are studied extensively for more than a decade 
for vast biomedical applications. Among all polyesters, aliphatic polyesters are the 
most investigated degradable polymer for biomedical applications and have been 
used in sutures, drug delivery devices, and tissue engineering scaffolds. Polyesters 
are of utmost interest in biomedical applications because these biomaterials can be 
broken down and resorbed without removal or surgical revision (Fong et al. 2011). 
Polyesters are susceptible to acid, base-catalyzed hydrolysis, or enzymes present in 
the body. Fine-tuning of mechanical and drug delivering properties makes poly- 
esters a natural choice towards tissue fixation and controlled drug delivery appli- 
cations (Vert et al. 1992). 

Poly(L-lactic acid) (PLLA) is the most prevalent in this category, and though 
reports of the use of PLLA can be found in the 1960s, exceptional amount of work 
has been performed and published recently. PLLA is the product resulting from 
polymerization of L,L-lactide (also known as L-lactide). Being able to degrade into 
innocuous lactic acid, PLLA has widespread applications in sutures, drug delivery 
devices, prosthetics, scaffolds, vascular grafts, bone screws, pins, and rods or as 
plates. Strong mechanical properties and degradation into innocuous end product 
are the main reasons for such variety of applications (Shikinami et al. 2005). PLLA 
is U.S. Food and Drug Administration-approved for a variety of applications and is 
available commercially in a variety of grades. Some of the commercially available 
products are NatureWorks (Cargill, USA), Lacty (Shimadzu, Japan), PDLA (Purac, 
the Netherlands), PLA (Galactic/Total, Belgium), and Ecoloju (Mitsubishi, Japan) 
(Bordes et al. 2009). Some studies suggest the potential use of PLLA as a bone 
reinforcement material. The mechanical properties of neat PLLA might not be 
enough for high load-bearing applications. This explains the need to incorporate 
different elements like oriented fibers, HAP, or clays to form nanocomposites. This 
result in an increase in the flexural modulus and strength, which corresponds with 
bone replacement implants (Shikinami et al. 2005). 

Poly(glycolic acid) (PGA) is another aliphatic biodegradable polyester with 
applicability in the field of biomaterials. However, unlike PLLA, high water 
solubility of PGA and fast hydrolysis on exposure to aqueous conditions affect 
the mechanical properties adversely. Thus, water solubility and its high melting 
point limit the use of PGA in bionanocomposites. 

Another biodegradable biocompatible FDA-approved polyester commonly 
known as poly DL-lactide/glycolide or poly(lactide-co-glycolide) (PLGA), which 
degrades by hydrolysis of its ester linkages in the presence of water into lactic acid 
and glycolic acid (Fig. 2; Steele et al. 2012). 

PLGA has been used to deliver chemotherapeutic s, proteins, vaccines, antibi- 
otics, analgesics, anti-inflammatory drugs, and siRNA. Most often PLGA is 
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My DClactidc/glycolidc D,L lactic acid 


Fig. 2 Hydrolysis of Poly DL-lactide/glycolide into lactic and glycolic acid which later enters 
Kreb’s cycle through pyruvic and oxalic acid respectively 

fabricated into microspheres, microcapsules, nanospheres, or nanofibers to facili- 
tate controlled delivery of drugs. PLGA offers several advantages as delivery 
devices, for example, site-specific/localized drug delivery through surface 
functionalization and control of drug release from the matrix by changing its 
monomer’s ratio, molecular weight, and terminal end groups (Huang et al. 2013). 
Surface functionalization has been done for various purposes such as PEG to evade 
secondary immune response, folic acid for tumor targeting, and acrylates for 
bioadhesion (Tables 1 and 2). 

Poly(s-caprolactone) (PCL) is a biodegradable and nontoxic aliphatic polyester 
exhaustively studied as the biomedical nanocomposite. PCL is obtained by the ring- 
opening polymerization of e-caprolactone in the presence of metal alkoxides 
(aluminum isopropoxide, tin octoate, etc.). PCL shows a very low Tg (—61 °C) 
and a low melting point (65 °C), which limits some applications. Therefore, PCL is 
generally blended or modified (e.g., copolymerization, cross-linking). The copoly- 
merization is commonly done with other lactones such as glycolide, lactide, and 
poly(ethylene oxide) (PEO) or by nanofiller incorporation in order to wide range of 
properties as per application. The rubbery state because of low Tg permits the 
diffusion of this polymer species at body temperature, thus making it a promising 
candidate for controlled release and soft tissue engineering (Vert et al. 1992). This 
is important for the preparation of long-term implantable devices, as its degradation 
is even slower than that of polylactide. 


Polypeptide-Based Nanocomposites 

A wide range of possibilities in materials design and application are provided by 
polypeptide nanocomposites as ability to adopt specific secondary, tertiary, and 
quaternary structures, a drawback of synthetic polymers (Hule and Pochan 2007). 
Specific sites of the polypeptide backbone can be modified by incorporating 
specific amino acid functionality with desired activity. The increased mechanical 
properties and thermal properties of nanocomposites with addition of fillers are 
comparable to other widely explored biomedical devices and biomaterials. The 
secondary conformation of the nanocomposite matrix can be affected by the 
molecular weight of the polypeptide. Polylysine (e-poly-L-lysine, PLL) is a small 
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Table 2 Common poly(lactide-co-glycolide) products 


Product 

Bio-applications 

Vicryl™ (Ethicon) 

Suture 

Polysorb™ (Covidien) 


Vicryl™ mesh (Ethicon) 

Hernia and soft tissue mesh 

Vypro™ mesh (Ethicon) 


Sepramesh™ (Bard) 


Codman® Ethisorb Dura patch™ (Ethicon) 

Dura repair 

Polygraft Truht™ BGS (Smith & Nephew) 

Bone void filler 

Dermagraft™ (Advanced Biohealing Inc.) 

Wound healing scaffolds (ulcers) 

Rapisorb and Rapisorb cranial clamp (Synthes) 

Fixation 

Lupron Depot® (Abbot Labs) 

Drug delivery (prostate cancer) 


natural homopolymer of the essential amino acid L-lysine that is produced by 
bacterial fermentation (Chaurasia et al. 2012; Gao et al. 2003; Ramanathan 
et al. 2004; Tasis et al. 2006; Iijima 1991; Ajayan et al. 1994). L-lysine residues 
generally constitute homopolypeptide 8-polylysine, in which epsilon (e) refers to 
the linkage of the lysine molecules. a-Poly lysine improves cell adhesion, hence 
commonly used to coat tissue cultureware. Several secondary structures of PLL are 
the random coil, a-helix, or P-sheet in aqueous solution, and transitions can be 
easily achieved using pH, temperature, salt concentration, or use of cosolvent. 
These different conformations and transitions can be studied using circular dichro- 
ism (CD), FTIR, and Raman spectroscopy. PLL preferentially forms P-sheet struc- 
ture irrelevant to nanocomposite film formation method at high concentrations 
(Hule and Pochan 2007). These secondary structures of polypeptides aid the design 
of new nanomaterials for specific desired applications in the biomedical arena. Such 
nanocomposites with addition of fillers give strength to matrix, and potential 
applications include drug delivery matrices, tissue engineering scaffolds, and 
bioengineering materials (Fong et al. 2011). 


Other Types of Nano-Biocomposites 

Polyhydroxyalkanoates (PHAs) are linear polyesters produced in nature by bacte- 
rial fermentation of sugar or lipids (Aldor and Keasling 2003; Li et al. 2007a). They 
can be either thermoplastic or elastomeric material with wide melting range from 
40 °C to 180 °C. Polyhydroxyalkanoates (PHA)-based nano-biocomposites are 
useful in making bioplastic because of their biodegradability but possess some 
drawbacks, such as brittleness and poor thermal stability (Aldor and Keasling 
2003). This limits their application, and hence, often they are intercalated with 
clay. PHA/clay nano-biocomposites are prepared by solvent intercalation and/or 
melt intercalation processes. PHA-based nano-biocomposites have a wide range of 
applications in sutures, cardiovascular patches, stents, guided tissue repair / 
regeneration devices, articular cartilage repair devices, vein valves, bone marrow 
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scaffolds, meniscus regeneration devices, ligament and tendon grafts, and in wound 
dressings and as hemo stats (Park et al. 2005; Reddy et al. 2003). 

Poly(urethane urea) (PUU) block copolymers are used in ventricular assist devices 
and total artificial hearts as blood sacs. Polyurethane ureas (PUUs) are prepared with 
the same conditions as polyurethanes, with the difference that diamines are used 
instead of diols as chain extenders, which constitute the hard segments with improved 
mechanical and thermal properties. The higher cohesion of urea linkages develops 
stronger, three-dimensional hydrogen bonding in PUU (Oprea et al. 2013). One of the 
main disadvantages of PUUs in medical devices is their relatively high permeability 
to air and water vapor. The majority of the component of copolymer is the soft 
segment called the poly(tetramethylene oxide), which is responsible for the perme- 
ability and diffusive properties of the polymer. Addition of organically modified 
layered silicates overcomes the permeability and diffusive drawbacks while still 
maintaining the desired biocompatibility and mechanical properties of the nano- 
biocomposites (Xu et al. 2000). This additional silicate imparts increase in barrier 
properties because of intercalated clay layers in the polymer matrix with increases in 
the modulus and strength of the nanocomposite. 

More recently polymer-based nanocapsules are being used to design drug 
delivery system with improved solubility, bioavailability, and controlled release 
for a specific target. Polymer-based nanocapsules provided stability of drug mole- 
cules from degradation by external factors such as light or by enzymatic attack in 
their transit through the digestive tract (Mosqueira et al. 2001; Ourique et al. 2008). 

Polymer modification has been done in order to obtain more hydrophilic surfaces 
or polymer coatings to attain favorable behavior regarding active substance stability 
in the case of encapsulation (Mora-Huertas et al. 2010). 

In Fig. 3 micelles, formed from amphiphilic block copolymers (ABCs), with 
cores and coronas have been demonstrated as a powerful tool for cell imaging, 
disease diagnosis, and delivery of various water-insoluble materials (including 
quantum dots, magnetic nanoparticles, and drugs). 

For example, PEG-PCL and PEG-PLA, are some of the block copolymers, used 
for nano-encapsulation. This type of polymers are used to encapsulate hydrophobic/ 
hydrophilic drug or active ingredients depends on application and process. 

The tri-block copolymers, PC1-PEG-PC1 [poly(s-caprolactone)-poly(ethylene 
glycol)-poly(e-caprolactone)], were used to encapsulate and deliver ibuprofen. The 
release profile of ibuprofen was significantly affected by the block length of the 
copolymer composition and the extent of loading. The in vitro profile shows a 
sustained release of 10 % loading ibuprofen from 3 to 15 days. Release profile 
depends on the ratio of e-caprolactone to ethylene glycol-derived subunits in 
copolymer chains. With 5 to 20 wt% ibuprofen loading, release was continued for 
2-24 days for copolymer whose 8-caprolactone molar ratio to ethylene glycol- 
derived subunits was 2.49 (Yu and Liu 2005). 

Another example of the effect of nano -encapsulation was reported on reverse 
multidrug resistance in tumor cells when PEG-PCL was used. This study shows a 
novel drug delivery system, where an anticancer drug, doxorubicin, was encapsu- 
lated by polyethyleneglycol-polycaprolactone (PEG-PCL) using solvent 
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Fig. 3 Micelles, formed from amphiphilic block copolymers, with cores and coronas have been 
demonstrated as a powerful tool for cell imaging, disease diagnosis, and delivery of various water 
insoluble materials (including quantum dots, magnetic nanoparticles, and drugs) 


evaporation method. The size of doxorubicin-loaded polymer nanocomposite was 
about a diameter of 36 nm and a zeta potential of +13.8 mV. The encapsulation 
efficiency of doxorubicin was 48.6 % =b 2.3 %. This drug/polymer nanocomposite 
showed sustained release profile, increased uptake, and cellular cytotoxicity, as 
well as decreased efflux of doxorubicin in adriamycin-resistant K562 tumor cells 
(Diao et al. 2011). 

Block copolymer nanocomposites are used for bioimaging. One of the primary 
conditions applicable in bioimaging is micelle-encapsulated superparamagnetic 
nanocomposites, which should be dispersible and stable in aqueous medium besides 
other criteria. 

It was reported that amphiphilic poly (8-carpolactone)-£/<9c/:-poly (ethylene gly- 
col) copolymers were linked to a fluorophore, 2,1,3-benzothiadiazole (BTD). This 
resulted in new type of bioimaging agent. The polymers form micelles in aqueous 
solutions with average diameters of 45 nm and 78 nm depending on the polymer 
structures. So, neutral and hydrophobic biocompatible emitters can be made by 
using the block copolymer in a rational way (Tian et al. 2010). 
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Polymer Nanocomposites in the Field of Electronics 
Conducting Polymer Composite 


There has been continuous interest in the use of conjugated polymers for the 
fabrication of numerous light and/or foldable electronic devices, for example, 
electrochromic displays, microelectronic devices, protective coatings, and 
rechargeable batteries. The motivation behind this intense interest is because of 
their unique electronic and optical properties, ability to be chemically tuned, most 
importantly their lightweight/foldable mechanical properties, processability, and 
low cost. They are also extremely a promising candidate for sensor applications 
because of their conductivity and electrochemical activity that are extremely 
sensitive to molecular interactions, which provide excellent signal transduction 
for molecular detection (Gardner et al. 1992; Janata and Josowicz 2003; Chaurasia 
et al. 2012). The uprising popularity of conducting polymer-based sensors lies in 
the fact that only specific chemicals can trigger a drastic conductance change. By 
functionalizing polymer molecules, it can be made more specific (Gao et al. 2003; 
Ramanathan et al. 2004). However, the limitations such as relative low conductivity 
and low mechanical and chemical stabilities restrict its use for some practical 
applications. 


Potential Electronic Properties and Application of Various Carbon 
Nanotube (CNT)/Polymer Composites 

Nowadays, tremendous efforts have been made to prepare polymer and carbon 
nanotube composites (because of remarkable electrical as well as thermal conduc- 
tivities and the superior mechanical properties of carbon nanotubes (CNTs)) with 
the aim of synergistically combining the merits of each individual (Tasis 
et al. 2006; Iijima 1991; Ajayan et al. 1994; Dai and Mau 2001; Zengin 
et al. 2002; Cochet et al. 2001; Sainz et al. 2005; Moniruzzaman and Winey 
2006) component. To form a perfect polymer/CNT composite with much more 
enhanced functionality, in situ polymerization of the desired monomers in the 
presence of carbon nanotubes would be expected to show best results compared 
to the post-mixing approaches (Cochet et al. 2001; Sainz et al. 2005; Li et al. 2003). 
But, well dispersion of the carbon nanotubes into solution is a must for in situ 
polymerization. There are reports of different dispersal approaches which impart 
different surface chemistries and electronic structures to the carbon nanotubes such 
as polymer wrapping, (Zheng et al. 2003; Dalton et al. 2000; Star et al. 2001) 
noncovalent adhesion of small molecules, (Dai and Mau 2001; Chen et al. 2002) 
and acidic oxidation (Wang et al. 2005). The key to the expected improvement in 
the nanocomposites depends critically on the monomer-nanotube interfacial chem- 
ical and electronic interactions during polymerization and polymer-nanotube inter- 
facial interactions after polymerization. There are reports on the impacts of the 
surface chemistry and electronic structure of carbon nanotubes on the kinetics of 
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polymerization and the electronic performances of the obtained composites 
(Cheung et al. 2009). Because of their well-documented unique surface chemistry 
and electronic structures, they have used single- stranded DNA dispersed and 
functionalized single- walled carbon nanotubes (ss-DNA-SWNTs) as a representa- 
tive example. They have also discussed the multiple roles of ss-DNA-SWNTs 
during and after the in situ polymerization in the fabrication of highly conductive 
self-doped polyaniline/SWNT composites. The applications of these 
nanocomposites cover the wide area of biosensing and flexible electronics which 
will be discussed below. 

Nowadays, there is increasing efforts for the use of single-walled carbon nano- 
tube (SWNT) networks as sensing materials and conductive flexible electrodes due 
to its specific advantages. Fabrication of SWNT films can be done quite easily by 
various room temperature solution-based processes, such as spray coating, 
(Kaempgen et al. 2005; Artukovic et al. 2005) inkjet printing, (Kordas 
et al. 2006; Simmons et al. 2007) deposition by a layer-by-layer approach, (Shim 
et al. 2007; Kovtyukhova and Mallouk 2005) and deposition through a filter 
(Wu et al. 2004; Zhang et al. 2006a). Due to statistical averaging effects, the 
obtained network electrodes are highly reproducible and exhibit percolation-like 
electrical conductivity. A number of applications including electrodes for solar 
cells, (Rowell et al. 2006) organic light-emitting diodes, (Li et al. 2006) smart 
windows, (Gruner 2006) sensors (Ferrer-Anglada et al. 2006), and transparent 
transistors (Artukovic et al. 2005; Chaurasia et al. 2012) where SWNT networks 
can be very useful. But, all the experimentally measured conductivities of the 
SWNT networks are considerably lower than the conductivity of a SWNT rope 
(axial conductivity around 10,000-30,000 S/cm) (Thess et al. 1996). It has also 
been noted that the conductivity of the SWNT networks decreases as the temper- 
ature drops (Bekyarova et al. 2005). The existence of high junction resistance and 
tunneling barriers between nanotubes (which dominate the overall film conductiv- 
ity in the network) is the result of its low conductivity and the strong temperature 
dependence conductivity. Therefore, it can be highly expected that decreasing the 
inter-tube resistance and lowering the number of these high-resistance junctions 
could increase the conductivity of the network. Actually, Lee and co-workers 
(Geng et al. 2007) reported that contact junctions can be improved by treating 
SWNT networks with a 12 M HN0 3 which helps to remove the insulating surfac- 
tant in SWNT network, and indeed, it improves the conductivity of the SWNT 
network by 2.5 times. They have also observed the dramatic decrease in the 
percolation threshold to the greatly reduced contact resistance between the tubes 
in the SWNT network. 

Electrical Conductivity CNTs Base Nonconducting Polymer Composites 

As CNTs exhibit high aspect ratio and high electrical conductivity, they are 
excellent candidates for fabrication of electrically conducting nanocomposites. 
While the electrical conductivity of individual carbon nanotubes has been measured 

z 

to be in the order of 10 S/m, (Baughman et al. 2002) the maximum electrical 
conductivity of SWCNT films has been reported to be in the range of 10 4 -10 5 S/m 
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Table 3 Electrical properties of different CNT/epoxy composites 


CNT type 

V c (vol%) 

p 

Maximum conductivity (S/m) 

References 

SWCNT 

0.074 

1.3 

1.2 x 10 3 @ 0.2 wt% 

Shi et al. (2006) 

MWCNT- 1 

0.01 

— 

5.0 x 10 1 @ 0.1 wt% 

Nemati et al. (2013) 

MWCNT-2 

0.0025 

1.2 

2.0 @1.0 wt% 

Nemati et al. (2013) 


(Ericson et al. 2004; Sreekumar et al. 2002) due to the contact resistance between 
the individual carbon nanotubes in the films. Therefore, the range of electrical 
conductivity of CNT/polymer composites is reported to be tremendously wide. On 
the other hand, this wide range advises that it is possible to control the electrical 
conductivity of CNT/polymer composites by varying the amount and degree of 
dispersion of CNTs in the composites. The CNT/polymer composites can be used 
for a variety of applications including electrostatic dissipation (<10 -4 S/m), elec- 
trostatic painting (10 -4 ~ 10 1 S/m), electromagnetic interference (EMI) shielding 
(>10 1 S/m), printable circuit wiring, and transparent conductive coatings. 

Again, the electrical conductivity of CNT/polymer composites is widely defined 
by the percolation theory. The percolation theory predicts that there is a critical 
volume fraction at which nanocomposites containing conducting fillers in insulat- 
ing polymers become electrically conductive. According to this theory, a c = A 
(V — V c ) p , where a c is the conductivity of a composite, V is the CNT volume 
fraction, V c is the CNT volume fraction at the percolation threshold, and A and p 
are constant. So far, there are several publications documented on the progress of 
electrical conductivity of different CNT/polymer composites (Shaffer and Windle 
1999; Sandler et al. 1999, 2003). The percolation threshold has been reported to 
range from 0.0025 vol% (Sandler et al. 2003) to several vol%. Therefore, it is 
difficult to draw definite conclusions about the mechanism of electrical conductiv- 
ity of CNT/polymer composites from the literature. This is because the reported 
levels of CNT loading to achieve a percolation threshold vary widely. The electrical 
conductivity and percolation threshold of different CNT/epoxy composite systems 
are shown in Table 3. It seems that different systems give a wide range of 
percolation values. However, even for the same system, for example, SWCNT/ 
epoxy composites (V c = 0.0025 ~ 0.1 %), (Sandler et al. 2003) a wide variation in 
percolation value was observed. 

The mechanism for percolation threshold for electrical conductivity of 
CNT/polymer composites is determined by numerous factors, and a number 
of publications have reported the factors affecting the percolation mechanism of 
CNT/polymer composites. The common factors affecting the percolation threshold 
of electrical conductivity are dispersion, (Sandler et al. 2003; Li et al. 2007b) 
alignment, (Choi et al. 2003; Du et al. 2003) aspect ratio, (Li et al. 2007b; Bai 
and Allaoui 2003; Bryning et al. 2005) degree of surface modification (Georgakilas 
et al. 2002) of CNTs, types and molecular weights of the matrix polymer, 
(Pan et al. 2010; Ramasubramaniam et al. 2003) and composite processing methods 
(Liu et al. 2008). The aligned CNTs in epoxy decrease the percolation threshold by 
one order of magnitude compared to entangled nanotubes (Sandler et al. 2003). 
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The electrical conductivity of SWCNT/epoxy composites with SWCNTs aligned 
under a 25 -T magnetic field was increased by 35 % compared to similar 
nanocomposites without magnetically aligned SWCNTs (Choi et al. 2003). 

In contrast, Du et al. (2003) found that the electrical conductivity of 
CNT/PMMA composite with 2.0 vol% CNTs was 10“ 10 S/cm with the aligned 
CNTs in the matrix and 10 -4 S/cm with unaligned CNTs. This indicates that the 
alignment of the CNTs in the composite decreased the electrical conductivity. The 
reason is that there are fewer contact points between the CNTs when they are highly 
aligned in the composites, so CNT-aligned composites require more nanotubes to 
reach the percolation threshold. The aspect ratio of CNTs has a tremendous 
influence on the percolation threshold of CNT/polymer composites without chang- 
ing other important parameters, such as the polymer matrix or the dispersion and 
aggregation state of the CNTs. On the other hand, it is well known that chemical 
functionalization may disrupt the extended conjugation of nanotubes and hence 
reduce the electrical conductivity of functionalized CNTs. For example, silane- 
functionalized CNT/epoxy composites showed a lower electrical conductivity than 
that of the untreated CNT composites at the same nanotube content 
(Ma et al. 2007). 

Cho et al. (2005) reported that the electrical conductivity of the acid- treated 
MWCNT composites was lower than that of the untreated MWCNT composites at 
the same content of MWCNTs. This is attributed to the increased defects in the 
lattice structure of carbon-carbon bonds on the nanotube surface as a result of the 
acid treatment. In particular, the severe modification of carbon nanotubes may 
significantly lower their electrical conductivity. However, there are several publi- 
cations reporting that the functionalization of CNTs can improve the electrical 
conductivity of the nanocomposites (Tamburri et al. 2005). Tamburri et al. (2005) 
found that the functionalization of SWCNTs with -COOH and -OH groups 
enhanced the nanocomposites ’ electrical conductivity as compared to the use of 
untreated SWCNTs. 

Electrical Conductivity CNTs Base Conducting Polymer Composites 

Most of the reported conducting polymer/carbon nanotube composites show 
conductivity enhancement over polymeric materials but much lower electronic 
performance compared to CNT films alone (Bekyarova et al. 2005). Few years 
back, Sun et al. (Wang et al. 2008) demonstrated that bulk-separated metallic 
SWNTs show superior performance than the as-produced nanotube sample in 
conductive polymer composites which can be obtained by blending with poly 
(3-hexylthiophene) and also poly (3, 4-ethylene dioxythiophene):poly (styrene 
sulfonate). They did not show the performance of films prepared from SWNT 
alone as a control experiment. It was evidenced by Blanchet et al. (2004) that the 
percolation threshold of a SWNT network was drastically downshifted by replacing 
the insulating dispersing reagents in the network with a conducting polymer. 
But, the conductivity of the SWNT network was not increased by the replacement 
after percolation. However, they have fabricated a water-soluble and highly 
conductive self-doped polyaniline/SWNT composite (Fig. 4; Ma et al. 2006a) by 
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Fig. 4 (a) A proposed SWNT/conducting polymer composite network. A thin skin of conducting 
polymer wraps around and along each of the SWNTs. (b) An individual composite nanotube 
coated with a thin layer of functionable conducting polymer. The two “turning knobs” are 
incorporated to adjust the property of the polymer layer for sensitive and selective molecular 
detection, (c) A conducting polymer junction in this network (Cheung et al. 2009) 


the in situ polymerization of a thin skin of PAB A (poly 3-aminophenylboronic acid) 
along and around ss-DNA-SWNTs. 

They have also measured the thickness of the polymer layer on the carbon 
nanotube using transmission electron microscopy (TEM) and it was found to be 
around 1-3 nm (Fig. 5c; Ma et al. 2006b). The thin conducting polymer layer 
remarkably improves the contacts between the tubes and hence acts as a “conduc- 
tive glue” which effectively assembles the SWNTs into a conductive network 

(Fig. 5b). 

SWNTs with a PAB A layer (referred to composite-SWNT networks) can be 
fabricated by vacuum filtration and dip coating (same methods used for SWNT 
alone) (Hu et al. 2004). Also a post mixture can be produced by simple mixing the 
pre-formed PAB A with the same amount of ss-DNA-SWNTs. However, this post- 
mixing process is not as effective as the premixing one in terms of interlinking the 
tubes (Fig. 5d). Furthermore, it has also been observed that the morphology of the 
post-mixture composite (Fig. 5d) is akin to the ss-DNA-SWNT network alone 
(Fig. 5a), except some large particles or aggregates, which could be due to the 
presence of neat PAB A which has not being uniformly mixed with the SWNTs. 
Surprisingly, they have found that the percolation threshold of the SWNT networks 
increased by threefold and the conductivity of the post-mixture network signifi- 
cantly decreased (Fig. 6a). 

Finally, the conductance of the post-mixture composite is five orders of magni- 
tude lower than the network formed from the in situ polymerized PABA composite 
and is three orders of magnitude lower than the network prepared from SWNT 
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Fig. 5 AFM images of the third layers of the films prepared from (a) ss-DNA/SWNTs, (b) in situ 
polymerized composite, (d) post-mixture and (e) “seed” composite. The concentration of SWNT 
in all these samples is 10 mg/L. TEM images of in situ polymerized composite (c), “seed” 
composite (f) and post-mixture (g) (Kobayashi et al. 1984) 


alone (Fig. 6a). They have also found from Fourier transform infrared (FTIR) 
spectroscopic study that the structure of the PABA layer in the composite prepared 
by in situ polymerization is very different from that of the neat PABA (Fig. 7a) and 
also drastically different from the PABA in the composite formed by post-mixing 
with the pre-formed neat PABA (Fig. 7b, purple curve). The FTIR peak of PABA in 
the composite formed by in situ polymerization at 1,120 cm -1 has been assessed by 
MacDiarmid et al. (1994, Yan et al. 2007) as the “electronic-like band” and is 
considered to be a measure of the degree of delocalization of electrons. 

Therefore, it can be considered as a characteristic peak of polyaniline conduc- 
tivity. These results are the strong indication towards the conclusion that the PABA 
has much higher conductivity and existed in the more stable and conductive 
emeraldine state (Zengin et al. 2002; Sainz et al. 2005), compared to the neat 
PABA and the post-mixture PABA which were in the nonconductive pernigraniline 
state (Ma et al. 2006a; Wang et al. 2005). They have also prepared another PABA 
composite by in situ polymerization in the presence of pre-oxidized ss-DNA/ 
SWNTs (“seed” method) (Zhang et al. 2004; Zhang and Manohar 2004). The 
intensity of the “electronic-like” FTIR peak is slightly lower than that of the 
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Fig. 6 (a) Conductance of in situ polymerized composite ( red square), ss-DNA/SWNTs ( black 
dot), post-mixture {purple triangle), and “seed” composite {green triangle) as a function of layers 
of the composites and ss-DNA/SWNTs. The conductance was measured by a two-probe approach. 
Each data point presented here was an average of 18 pairs of electrodes on five silicon chips, (b) 
Conductance of in situ polymerized composite {red square) and ss-DNA/SWNTs {black dot) 
measured by a four probe approach. The conductance of “seed” composite and post-mixture was 
beyond the sensitivity of the measurement setup. Each data point presented here was an average of 
10 measurements (Kobayashi et al. 1984) 
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Fig. 7 Normalized Fourier-transform IR spectra of (a) in situ polymerized composite {red) and 
pure PABA {blue); (b) “seed” composite {green) and post-mixture {purple ) (Kobayashi 
et al. 1984) 


PABA in the in situ polymerized composite with the intact ss-DNA-SWNTs, but 
much higher than the neat PABA and the post-mixture PABA composite (Fig. 7b). 
Though the percolation threshold of the composite formed by the seed approach is 
threefold higher than the in situ composite but similar to the post-mixture compos- 
ite, the conductance after the threshold is four and six orders of magnitude lower 
than the SWNT network alone and the in situ composite with intact SWNTs, 
respectively (Fig. 6a). The morphology of the seed composite film has been studied 
using AFM and noticed that the PABA in the composite did not interlink the 
nanotubes into a conductive network. Instead, serious aggregation of the nanotubes 
into large particles (Fig. 5e, f) has been induced by PABA. The aggregation 
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mechanism is still not well understood yet and currently under investigation. The 
author thinks that it might be related to defects along the tubes (formed by the 
pre-oxidation process) which remarkably weaken the mechanical strength of the 
carbon nanotube. Thus, it can be concluded that not only the polymer’s molecular 
structure but also the arrangement or distribution of the carbon nanotubes in the 
composites dictates the overall percolation behavior and macroscopic electronic 
property of the composites. It is also important to mention that the fabrication 
process significantly impacts the electronic and molecular structure of the PABA 
formed in the composites as well as the arrangement or lateral distribution of the 
carbon nanotubes in the composites. To effectively optimize the fabrication param- 
eters and ensure the formation of SWNT networks in a controllable fashion for a 
variety of potential applications, it is crucial to understand these reaction charac- 
teristics. Furthermore, it is worth mentioning that ss-DNA-SWNTs played multiple 
roles during in situ fabrication of conducting polymer nanocomposites. First, it 
functioned as catalytic molecular templates during in situ polymerization of ABA 
as the polymerization process might be 4,500 times faster (Ma et al. 2008a). Addi- 
tionally, the quality of the resulting PABA was also drastically improved, observed 
by the fact that the backbone of the self-doped polyaniline had longer conjugated 
length as fewer short oligomers were produced and they existed in the more stable 
and conductive emeraldine state, which in turn can be exploited to produce 
conducting polymer composite materials with a much more enhanced electronic 
performance. Secondly, the ss-DNA-SWNTs also worked as unique conductive 
poly anionic doping agents in the resulting poly aniline film with enhanced conduc- 
tivity and redox activity both in low pH and neutral pH solutions. In addition, it also 
acted as active stabilizers after the polymerization. The final advantage will be that 
the large surface area of the carbon nanotubes greatly enhanced the density of the 
functional groups available for sensitive detection of the target analytes. 

A wide variety of conducting polymers such as poly aniline (PANI), poly(diphe- 
nylamine) (PDPA), polypyrrole, poly thiophene, etc., are currently used in different 
applications including metallic interconnects in circuits, electromagnetic radiation 
shielding, and chemical sensors (Stutzmann et al. 2003). The conductivity of such 
polymers arises due to the existence of charge carriers and mobility of those charge 
carriers along the bonds of the polymer chains. These polymers also show chemical 
selectivity, which makes them act as ideal candidate for the immobilization of gas 
molecules, and exhibit highly reversible redox behavior with a distinguishable 
chemical memory. So, these conducting polymers can potentially act as a gas 
sensor. Let us define gas sensor first. It is a device which detects the presence of 
different gases in an area, especially those gases which might be harmful to human 
beings or animals. The fabrication of ammonia gas sensors via a scanned-tip 
electrospinning method (Fig. 8) using a single 10-camphorsulfonic acid (HCSA) 
doped PANI/poly (ethylene oxide) (PEO) nanofiber with a diameter of 100-500 nm 
on gold electrodes has been reported by Craighead et al. (Liu et al. 2004) in 2004. 
The characterization of the well-defined single fiber material and the sensor 
response has been thoroughly studied. They have demonstrated that the sensor 
showed a rapid and reversible resistance change upon exposure to NH 3 gas at 


2 Properties and Applications of Polymer Nanocomposite 


61 



H.V. Power Swiitrf 




a 





jr 


Syringe- Pump 


\+ 


/ 

/ 


/ 


y-* 


Cull^ror 



Fig. 8 (a) A schematic of electrospinning process, (b) A SEM image of typical electrospun fibers 
(Ding et al. 2009) 


concentrations as low as 500 ppb via the protonation and deprotonation of PANE 
The performance of nanofiber sensors, for example, response time, can be estimated 
by considering the diffusion of ammonia into the fiber and the reaction of ammonia 
with doped PANI. Another aspect was the correlation between response times with 
fiber diameter. Indeed, the response times with various diameters refer to radius- 
dependent differences in the diffusion time of ammonia gas into the fibers. Manesh 
et al. have prepared another type of ammonia gas sensor with a detection limit of 
1 ppm (Manesh et al. 2007) using electrospun PDPA/poly(methyl methacrylate) 
(PMMA) nanofibers as sensing materials. They have demonstrated that the changes 
in resistance of the nonwoven membrane showed linearity with the concentration of 
ammonia in the range of 10-300 ppm. Additionally, the detection target was 
expandable and can be expanded from ammonia to other amines according to 
Gong et al. (2008) (Fig. 9) using PANI nanotubes which can be easily made 
using electrospun PVA fiber mat membrane as the template. The small diameter, 
high surface areas and porous nature of the PANI nanotubes gave considerably 
better performance with regard to both time response and sensitivity. They have 
also observed that the responses follow the orders: (C 2 H 5 ) 3 N > NH 3 > N 2 H 4 . 
However, the PANI nanotubes showed higher sensitivity and quicker response to 
(C2H5) 3 N compared with PANI prepared without a template. In addition, a 
reasonable reproducibility has been observed in case of the reversible circulation 
response change of PANI nanotubes. However, Li et al. (Ji et al. 2008) described 
coaxial PANI/PMMA composite nanofibers using the electro spinning technique 
and an in situ polymerization method. The responses of the gas sensors based on 
these PANI/PMMA composite nanofibers towards triethylamine (TEA) vapor were 
investigated at RT, and it was found out that the sensors showed a sensing 
magnitude as high as 77 towards TEA vapor of 500 ppm. Furthermore, the 
responses were linear, reproducible, and reversible towards TEA vapor concentra- 
tions ranging from 20 to 500 ppm. Additionally, it was revealed that the concen- 
tration of doping acids only brought changes in resistance of the sensor without 
affecting its sensing characteristics. For example, the gas sensor with a doping acid 
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Fig. 9 (a) Resistance change of PANI nanotubes exposed to 100 ppm of different gases (NH 3 , 
N 2 H 4 , and (C 2 H 5 ) 3 N). (b) Response of PANI prepared by using PVA fiber mats as the template 
and without a template upon exposure to 100 ppm of (C 2 H 5 ) 3 N. (c) The reversible circulation 
response change of PANI nanotubes upon exposure to 100 ppm of (C 2 H 5 ) 3 N (Gao et al. 2008) 


(toluene sulfonic acid) exhibited the highest sensing magnitude, which can be 
explained by understanding its sensing mechanism and the interactions of TEA 
vapor with doping acids. 

There are reports of use of polymer as brushes as a composite, intended to make 
many applications; a significant advantage of polymer brushes compared to other 
surface modification methods is their mechanical and chemical stability, accompa- 
nied by a high level of synthetic flexibility towards the introduction of functional 
groups. This is in contrast to the physisorbed, non-bound polymer films where 
chemical modification by using wet chemistry is difficult to conduct. Additionally, 
it is now possible to grow brushes on virtually every surface (flat surfaces, particles, 
or macromolecules), to any thickness, of every composition, incorporating a mul- 
titude of functional groups and containing series of blocks. More recent applica- 
tions of polymer brushes include nano-pattemed surfaces (Shah et al. 2000), 
photochemical devices (Whiting et al. 2006), new adhesive materials (Raphael 
and De Gennes 1992), protein-resistant biosurfaces, (Saha et al. 2012) 
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i PTBA brush 

Fig. 1 0 Optical micrographs of patterned surfaces: ( left image) 10-pm features in a continuous polymer 
brush showing regions of poly(te/t-butyl acrylate) (dark) and poly (acrylic acid) (light) and (right image) 
interaction of a water droplet with 200- pm features showing an unusual wetting profile and preferential 
interaction with poly(acrylic acid) brush domains (J. Am. Chem. Soc. 2000, 122, 1844-1845.) 


chromatographic devices, (van Zanten 1994) lubricants, (Joanny 1992) polymer 
surfactants, (Milner 1991) polymer compatibilizers, eight and many more. 

One of the most attractive applications of surface -initiated polymerizations is the 
formation of nano-patterned surfaces by soft lithography techniques that combine 
microcontact printing (pCP) and graft polymerization. An elegant example is that 
of Hawker et al. who combined photolithography with nitroxide-mediated “living” 
free radical polymerization to obtain patterned polymer brushes with well-defined 
hydrophobic and hydrophilic domains (Fig. 10). They extended this concept to 
synthesize patterned polymer layers by aqueous ATRP (Vidal et al. 1980). 


Conducting Polymer Composites Made from Polymer Brushes 

Recently, Huck and coworkers (Kong et al. 2007) have shown that charge- 
transporting polymer brushes (poly triphenyl amine acrylate) can be used for a 
variety of organic electronic device fabrications using composite methodology. 
These polymer brush films contain a greater level of ordering at the molecular level 
and display higher charge mobility than spin-coated films of the same polymer, 
which was attributed to the controlled polymer brush architecture and morphology. 
As, for example, when CdSe nanocrystals (with diameter in the range of 2. 5-2. 8 nm) 
subjected into the polymer bmsh layers form a polymer composite (Fig. 11), its 
photovoltaic quantum efficiencies of up to 50 % (Snaith et al. 2005). In another 
report, Advincula and coworkers (Fulghum et al. 2008) successfully grafted hole- 
transporting PVK (poly(vinyl carbazole)) brushes on transparent ITO electrodes. 
Using cyclic voltammetry, the PVK bmsh was electrochemically cross-linked to 
form a conjugated polymer network film. Covalent linkage of PVK led to a direct 
electroluminescent PLED device, in which the electroluminescent polymer layer can 
be simply solution-cast onto the modified ITO. 
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Fig. 1 1 Top image : PTPAA 
brushes and bottom image : 
Cartoon of inferred structure 
for CdSe nanocrystal 
infiltrated polymer brush 
photovoltaic device (From 
bottom to top ) ITO-coated 
glass slide modified by 
surface attachment of a 
bromine end-caped 
trichlorosilane self- 
assembled-monolayer (SAM) 
0 blue squares ), polymer 
brushes grown from the SAM 
(red lines), CdSe nanocrystals 
infiltrated into the brush 
network exhibiting some 
degree of phase separation in 
the plane of the film (small 
black circles ), and caped with 
an aluminum cathode (Nano 

Lett. 2005, 5, 1653.) 



A more ambitious challenge in surface science is the design of smart surfaces 
with dynamically controllable properties (Lahann et al. 2003). Such surfaces have 
characteristics that can be changed or tuned in an accurate and predictable manner 
by using an external stimulus. Recently, Huck and co workers have shown that 
wetting properties of surfaces modified with cationic polyelectrolyte brushes 
strongly depend on the nature of the counter ion (Fig. 12). Coordination of poly- 
electrolyte brushes bearing quaternary ammonium groups (QA + ) with sulfate anions 
resulted in highly hydrophilic surfaces, (Moya et al. 2005) whereas coordination of 
similar brushes with C10 4 rendered the surface hydrophobic (Azzaroni et al. 2005). 

Recent research has focused on the Cu(I) -catalyzed, highly specific, and efficient 
formation of 1,2, 3 -triazoles via the 1,3-dipolar cycloaddition of azides and terminal 
alkynes (“click” chemistry) (Feldman et al. 2004). This methodology has been used 
to modify surfaces of solid metals and cells, because the reaction provides high yields 
and stereospecificity and proceeds under mild conditions (Tomoe et al. 2002; Lewis 
et al. 2002), Click chemistry also has been used for functionalizing polymers in 
solution (Sumerlin et al. 2005; Gao et al. 2005). Research in nanobiotechnology and 
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Counterion Exchange 


Fig. 12 Top image : PMETAC brushes. Bottom image : Change in the wetting characteristics of 
PMETAC brushes (height, h ~20 nm) after exchanging the two contrasting counterions, TFSI (a) 
and polyphosphate (PP) (b). (c) Representation of 0 A as a function of counter ion (PP and TFSI). 
The plot depicts the reversible behavior of PMETAC brushes over repeated cycles of TFSI and PP 
counter ion exchange. On the right the chemical structures of both counter ions are represented 
(Angew. Chem. Int. Ed. 2005, 44, 4578.) 


biomedical sciences often involves the manipulation of interfaces between 
man-made surfaces and biomolecules (and cells), which generally requires the 
construction of surfaces that present chemically active functional groups from 
non-biofouling supporting materials. Choi and coworkers (Lee et al. 2007) used 
“click” chemistry to couple azide groups at the terminal of the non-biofouling 
polymeric film of poly(oligoethylene glycol methacrylate) with incoming molecules 
of interest containing terminal acetylenes (Fig. 13). As a model for bioconjugation, 
biotin was immobilized onto the poly(oligoethylene glycol methacrylate) film via 
click chemistry, and biospecific recognition of streptavidin was demonstrated. 
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Fig. 13 Schematic description of the attachment of biotin to polymer brush via click chemistry 
[acetylene group-containing biotin compound: biotin-PEO-LC-N-pentynoate (1)] 


Polymer Nanocomposites for Engineering Applications 

Polymer nanocomposites, which comprise of additives/fillers and polymer matri- 
ces, are considered to be an important group of relatively inexpensive materials for 
many engineering applications. The polymer matrices may refer to all type of 
polymers including thermoplastics, thermosets, elastomers, and even polymer 
blends. Two or more materials are usually combined to produce composites 
which possess properties that are unique and cannot be obtained by each material 
alone. For example, high-modulus carbon fibers or silica particles are added into a 
polymer to produce reinforced polymer composites that exhibit significantly 
enhanced mechanical properties including strength, modulus, and fracture tough- 
ness. Therefore, due to their unique and superior properties as well as ease of 
production at low cost, polymer-based composites are currently important engi- 
neering materials with many applications which include high-performance com- 
posites even used in aerospace application, filled elastomers for damping, electrical 
insulators, thermal conductors, and other special applications in which a particular 
superior property is needed. 

Special materials with extraordinary properties are chosen to create composites 
with desired properties; for example, high-modulus but brittle carbon fibers are 
added to low-modulus polymers to create a stiff and lightweight composite with a 
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Fig. 14 Schematic of nanofillers (Schadler 2004) 

reasonable degree of toughness. In recent years, although the highest level of 
optimization of composite properties with traditional micrometer- scaled fillers 
has been reached, a large bundle of opportunities has been opened to overcome 
the limitations of traditional polymer composites by using newly available 
nanometer-scaled fillers - polymer nanocomposites in which the filler 
is <100 nm in at least one dimension as shown in Fig. 14 (Schadler 2004). 

Unlike traditional polymer composites containing microscale fillers, when small 
amounts of nanoscale fillers, such as carbon nanotube tubes (CNTs) or nanoclay 
flakes, are incorporated into a polymer system, the properties of such 
nanocomposites can be largely modified even at an extremely low content of filler 
due to a very short distance between the fillers. Although polymer nanocomposites 
can be considered as a new class of materials, some nanofillers, such as carbon 
black (Donnet et al. 1993) and fumed silica, (Sumita et al. 1984; Kuriakose 
et al. 1986) have been used for more than a century for the fabrication of 
polymer-based nanocomposites. However, the research and development of poly- 
mer nanocomposites have greatly increased only in the recent years for the follow- 
ing reasons. 

First of all, some of the polymer nanocomposites showed unpredicted com- 
binations of properties (LeBaron et al. 1999). For example, the incorporation of 
equiaxed nanoparticles in thermoplastics, and particularly in semicrystalline 
thermoplastics, increases the yield stress, the tensile strength, and Young’s 
modulus (Sumita et al. 1983) compared to pure polymer. A volume fraction of 
only 0.04 mica-type silicates (MTS) in epoxy increases the modulus below the 
glass transition temperature by 58 % and the modulus in the rubbery region by 
450 % (Messersmith and Giannelis 1994). In addition, the permeability of water 
in poly(e-caprolactone) decreases by one order of magnitude with the addition of 
4.8 % silicate by volume (Messersmith and Giannelis 1995). Yano et al. (1993) 
showed a 50 % decrease in the permeability of polyimides at a 2.0 wt% 
loading of MTS. Many of these nanocomposites are optically transparent 
and/or optically active. 
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The second reason for the large increase in research and development efforts was 
related to the discovery of carbon nanotubes by Iijima in the early 1990s (Iijima 
1991). Although there is an argument that carbon nanotubes have been observed 
since the 1960s, (Colbert and Smalley 2002) it was only in the mid-1990s that they 
became popular to the researchers and the scientists around the world, due to the 
ability to produce them in the quantities required for property evaluation of 
composites. The unpredictably sophisticated properties of these carbon nanotubes, 
especially their superior mechanical and electrical properties over those of the 
traditional fillers, offer an exciting potential for new composite materials. 

Finally, recent significant development in chemistry, such as click chemistry, 
(Kolb et al. 2001) has brought remarkable control over the chemical modification of 
carbon nanotubes and the in situ process for polymer nanocomposites. It has also 
created an almost unrestricted possibility to control the interfacial interaction 
between the polymer matrix and the filler. 


Carbon Nanotube/Polymer Composites 

One of the well-known nanofillers for polymer nanocomposites is the carbon 
nanotubes. Due to the outstanding properties of CNTs, they have become an 
attractive candidate for scientists to develop advanced polymer nanocomposites 
with multifunctional features. The first polymer nanocomposites using CNTs as 
fillers were reported by Ajayan et al. in 1994 (Ajayan et al. 1994). Since then, there 
has been a lot of research dealing with fabrication of CNT/polymer composites. On 
the basis of extraordinary physical properties as well as the large aspect ratio of 
CNTs, most of CNT/polymer composites show unexpected properties in many 
aspects such as mechanical, electrical, and thermal properties. In general, the 
homogeneous dispersion, alignment, and content of CNTs in polymer matrices 
are the key parameters to enhance the physical properties of CNT/polymer 
composites. 

Like other nanofillers, the dispersion state of CNTs in their matrices seems to be 
an important factor that determines the physical properties of the resultant 
CNT/polymer composites. However, it is well known that CNTs tend to form stable 
aggregates or bundles in a polymer matrix due to the very strong van der Waals 
forces among them (Sahoo et al. 2010). Therefore, they are difficult to be separated 
into individual nanotubes and dispersed homogeneously in a polymer matrix, which 
hampers the mechanical and electrical properties of fabricated nanocomposites. 

Many research efforts have been made in the production of CNT/polymer 
composites for functional and structural applications (Liu et al. 2008; Thostenson 
et al. 2001; Coleman et al. 2006). However, even after a number of decades of 
research, the potential for CNTs as reinforcing fillers has been brutally restricted 
due to the drawback associated with dispersion of entangled CNTs during a 
fabrication process and poor interfacial interaction between CNTs and a polymer 
matrix. As CNTs are characteristic of small diameter in nanometer scale with high 
aspect ratio (>1,000) and thus extremely large surface area, their nature of poor 
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dispersion in a polymer matrix is rather different from that of other conventional 
fillers, such as spherical particles and carbon fibers. Therefore, the agglomeration of 
CNTs in a polymer matrix can be considered as the main reason for the reduced 
mechanical, thermal, and electrical properties of their nanocomposites as compared 
with theoretical predictions based on individual CNTs. The critical challenge is, 
therefore, how to incorporate individual CNTs, or at least relatively thin CNT 
bundles or disentangled CNTs, into a polymer matrix. In other words, dispersion 
of CNTs not only is a geometrical problem due to the length and size of the CNTs 
but also relates to a method for how to separate individual CNTs from CNT 
agglomerates and stabilize them in a polymer matrix to avoid secondary agglom- 
eration (Ma et al. 2010). In addition, the most suitable processing conditions are 
required for the efficient transfer of either mechanical load or electrical charge 
among individual carbon nanotubes in a polymer matrix towards a successful 
fabrication of CNT/polymer composites (Sahoo et al. 2010). 

As mentioned above, the agglomeration of CNTs in a polymer matrix and the 
poor interfacial interaction between the CNTs and the polymer molecules are the 
most critical issues in the fabrication of CNT/polymer composites. Fortunately, 
there are several possibilities to improve the dispersion of CNTs in polymer 
matrices such as solution mixing, melt blending, and in situ polymerization 
methods. Moreover, several methods are also available to enhance the interaction 
between the CNTs and the polymer molecules. Especially, the surface modification 
of CNTs is an effective way to prevent carbon nanotube aggregation by improving 
their chemical compatibility with the polymer matrixes, which helps CNTs to 
disperse better and stabilize within a polymer matrix. There are mainly two 
approaches for surface modification of CNTs, namely, physical modification 
(noncovalent functionalization) and chemical modification (covalent 
functionalization) . 

For example, CNTs are used in the development of the stiff and lightweight 
polymer nanocomposites. CNT/polymer composites show considerably improved 
mechanical properties even at a low CNT content. For example, with an addition of 
0.5 wt% MWCNTs, the tensile strength and modulus for high-density polyethylene 
nanocomposite films remarkably increased by ~30 % and ~20 %, respectively 
(Zhang et al. 2006b). CNTs can also be used as a nucleating agent for crystallization 
of polymers. Several groups have studied the crystallization of polypropylene in the 
presence of CNTs (Valentini et al. 2003; Manchado et al. 2005; Bhattacharyya 
et al. 2003). Assouline et al. (2003) studied the non-isothermal crystallization of 
MWCNT/isotactic polypropylene (iPP) composites. The crystallization behavior of 
MWCNT/iPP composite was significantly different from that of the neat iPP. With 
an addition of 1.0 wt% MWCNTs into iPP, the crystallization rate was increased 
with evidence of fibril crystal growth rather than spherulite growth. Many research 
groups have observed the improved thermal stability in CNT/polymer composites. 
For example, Kashiwagi et al. reported that the addition of MWCNTs into poly- 
propylene enhanced the thermal stability of PP both in nitrogen and in air. Besides, 
the MWCNTs could significantly reduce the heat release rate of PP. Generally, the 
thermal stability of the CNT/polymer composites increases due to the higher 
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thermal conductivity of MWCNTs that facilitates heat dissipation within the com- 
posites (Huxtable et al. 2003). The results show a great potential for the use of 
CNTs as a flame retardant for polymer materials. 

Therefore, in the following sections, thermal and mechanical properties and 
applications of CNT/polymer composites will be covered. 


Thermal and Rheological Properties of CNTs Base Polymer 
Composites 

The glass transition temperature (T g ) is a measure of the thermal energy required to 
allow polymer motion involving 10-15 monomeric units and corresponds to the 
softening of a polymer. Park et al. (2002) reported that T g did not change for their in 
situ polymerized SWCNT/polyimide composites. The SWCNT/PMMA composites 
produced by the coagulation method have the same T g over a wide range of 
nanotube loadings (Du et al. 2004). Therefore, it can be concluded that the addition 
of CNTs does not significantly change the glass transition temperature in 
CNT/polymer composites, because in the absence of strong interfacial bonds and 
at low nanotube loadings, the majority of polymer molecules are locally 
constrained only by other polymer molecules but not by CNTs. 

On the other hand, in larger districts, carbon nanotubes do obstruct the motion of 
polymer molecules as measured by rheology. Rheological (or dynamic mechanical) 
measurements at low frequencies probe the longest relaxation time of a polymer 
which corresponds to the time required for an entire polymer molecule to change its 
conformation. Du et al. (2004) found that, although it has little effect on polymer 
motion at the length scales comparable to or less than an entanglement length, the 
presence of CNTs has a substantial influence at large length scales corresponding to 
an entire polymer chain. The storage modulus, G\ at low frequencies becomes 
almost independent of frequency as CNT loading increases. This shows a transition 
from a liquid-like behavior (which has short relaxation times) to a solid-like 
behavior (in which the relaxation times will be infinite) with increasing CNT 
loading. By plotting G’ versus CNT loading and fitting with a power law function, 
they reported that the rheological threshold of these nanocomposites was ~0.12 wt%. 
This rheological threshold could be attributed to a hydrodynamic CNT network that 
impedes the large-scale motion of polymer molecules. A similar phenomenon has 
previously been observed in nanoclay /polymer composites by Krishnamoorti and 
Giannelis (1997). They reported that a network of nanoscale fillers restrains poly- 
mer relaxations, leading to a solid-like or nonterminal rheological behavior. There- 
fore, any factor that changes the morphology of the CNT network will influence the 
low-frequency rheological properties of their nanocomposites. 

The factors influencing the polymer chain mobility are the aspect ratio of CNTs, 
dispersion and alignment of CNTs in the polymer matrix, and the molecular weight of 
the polymer matrix. Du et al. (2004) reported that higher aspect ratio, better dispersion 
and less alignment of the CNTs, and longer polymer chains would result in more 
restraint on the mobility of the polymer chains, i.e., the onset of a solid-like behavior 
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Fig. 1 5 Optical micrographs using cross-polarizers of (a) pure PP and (b) a 0.8 wt% SWCNT/PP 
composite (Bhattacharyya et al. 2003) 


occurs at lower nanotube contents. In addition to these factors, the content, size, and 
interfacial properties of CNTs are expected to influence rheological properties of 
CNT/polymer composites. For example, at a fixed loading, nanotubes with smaller 
nanotube diameters and larger aspect ratios will produce a network with smaller mesh 
size and larger surface area/volume, which might restrain polymer motion to a greater 
extent (Du et al. 2004). Experimental results support this hypothesis. Lozano 
et al. (2001) observed a rheological threshold of 10-20 wt% in carbon nanofiber/ 
polypropylene composites in which the diameter of the carbon nanofiber is 
~ 150 nm. The rheological threshold is ~1.5 wt% in MWCNT/poly carbonate com- 
posites and only 0.12 wt% for the SWCNT/PMMA system (Du et al. 2004). Even if 
these three systems have different polymer matrices as well as their states of disper- 
sion are unclear, the diameters among carbon nanofibers, MWCNT, and SWCNT 
differ by orders of magnitude. It can be concluded that if the diameter of filler 
decreases, the filler loading required for a solid-like behavior increases significantly. 

The constraints imposed by CNTs on polymer matrices in nanocomposites are 
also evident in the polymer crystallization behavior. Bhattacharyya et al. (2003) 
studied crystallization in 0.8 wt% SWCNT/PP composites using optical micros- 
copy (with cross-polars) and differential scanning calorimetry (DSC). From Fig. 15, 
the spherulite size in PP is much larger than that in SWCNT/PP composites. The 
authors also reported that upon cooling, the SWCNT/PP composites began their 
crystallization at the temperature which was about 11 °C higher than that for PP’s 
crystallization, suggesting that nanotubes acted as nucleating sites for PP crystal- 
lization. They also observed that both melting and crystallization peaks in the 
nanocomposite are narrower than those in neat PP. Therefore, they proposed that 
higher thermal conductivity of the CNT as compared to that of the polymer at least 
in part should be responsible for the sharper but narrower crystallization and 
melting peaks, as heat would be more evenly distributed in the nanocomposite 
samples containing CNTs. 
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Mechanical Properties CNTs Base Polymer Composites 

CNTs exhibit excellent mechanical properties with Young’s modulus as high as 
1.2 TPa and tensile strength of 50-200 GPa (Qian et al. 2002). The combination of 
these exceptional mechanical properties along with the low density, high aspect 
ratio, and high surface area makes CNTs an ideal candidate of reinforcing fillers for 
fabrication of stiff and lightweight nanocomposites. Both SWCNTs and MWCNTs 
have been utilized for reinforcing thermoplastic polymers, such as polyethylene, 
polypropylene, polystyrene, nylon, and polycarbonate, as well as thermosetting 
polymers, including epoxy, polyurethane, and phenol-formaldehyde resins. Gen- 
erally the CNT-reinforced nanocomposites can be considered as particulate com- 
posites or short fiber composites with the filler dimensions on the nanometer scale 
and a high aspect ratio. Therefore, the mechanics of CNT/polymer composites is 
governed by that of particulate composites or short fiber composites. On the other 
hand, unlike the macroscopic particulate composites, mechanical properties of 
CNT/polymer composites mainly depend on the dispersion state of nanofillers, 
apart from the properties of filler and matrix themselves. In addition to dispersion, 
there are other important factors that determine an effective reinforcement of CNTs 
in nanocomposites: they include a high aspect ratio, alignment, and interfacial 
interactions between CNTs and polymer matrix. The aspect ratio must be suffi- 
ciently large to maximize the load transfer between CNTs and the matrix and, thus, 
to achieve enhanced mechanical properties. For example, polystyrene 
nanocomposites reinforced with well-dispersed 1.0 wt% CNTs of a high aspect 
ratio had more than 35 % and 25 % increases in elastic modulus and tensile strength, 
respectively (Qian et al. 2000). Similar promising results have also been reported, 
(Coleman et al. 2006; Jiang et al. 2007) but other reports demonstrated only modest 
improvements in modulus and strength. For example, the impact resistance and 
fracture toughness of the CNT/epoxy composites containing CNTs of a larger 
aspect ratio were improved much better than those of the CNT/epoxy composites 
containing CNTs of a smaller aspect ratio (Hernandez-Perez et al. 2008). However, 
the corresponding tensile modulus and strength showed very limited improvements 
of less than 5.0 %, probably due to weak bonds between the CNTs and the matrix 
molecules as well as agglomeration of CNTs. In reality, the dispersion is known as 
the foremost important issue in producing CNT/polymer composites. Many differ- 
ent techniques, including the functionalization of CNTs and processing of 
CNT/polymer composites, have been employed for CNT dispersion, as discussed 
in sections “Other Types of Nano-Biocomposites.” A good dispersion not only 
makes more filler surface area available for bonding with a polymer matrix but also 
prevents the aggregated filler from acting as a stress concentrator that is detrimental 
to mechanical performance of nanocomposites (Liu and Wagner 2005). 

However, to obtain a uniform CNT dispersion in nanocomposites, some param- 
eters, such as CNT content in nanocomposites, length and entanglement of CNTs, 
as well as viscosity of matrix, are still needed to optimize. There were many reports 
(Ma et al. 2007, 2008b, 2009) showing that there is a critical CNT content in the 
matrix below which the strengthening effect for CNT/polymer composites 
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increases with increasing CNT content. Above this critical CNT content, however, 
the mechanical strengths of CNT/polymer composites decrease, and in some cases, 
they decrease below those of the neat matrix materials. These observations can be 
attributed to (i) the problems associated with uniform dispersion of CNTs at high 
CNT contents and (ii) lack of polymerization reactions that are adversely affected 
by the high CNT content for an in situ process. The latter effect becomes more 
pronounced when functionalized CNTs are employed to produce CNT/polymer 
composites. To a large extent, the technique employed for CNT dispersion can 
influence the mechanical properties of CNT/polymer composites. 

It should be noted that the definition of a dispersion state of CNTs in a polymer 
matrix is totally dependent on the magnification or scale used for the analysis. 
According to the study by Li et al. (2007b) using the term uniform or good 
dispersion to evaluate the CNT dispersion without any distinctive description 
may simply be misleading or inaccurate. This is because, for the conventional 
composites, uniform or good dispersion generally refers even distribution of fillers 
in a matrix medium without aggregation. However, for CNT/polymer composites, 
dispersion has two major aspects: (i) disentanglement of CNT bundles or agglom- 
erates, which is referred as the nanoscale dispersion, and (ii) uniform distribution of 
individual CNTs or their agglomerates throughout the nanocomposites, which is a 
micro- and macroscale dispersion. From geometric consideration, the difference 
between random orientation and alignment of CNTs can result in significant 
changes in various properties of nanocomposites. 

The storage moduli of the polystyrene composite films containing random and 
oriented CNTs were 10 % and 49 % higher than the unreinforced bulk polymer, 
respectively (Thostenson and Chou 2002). The alignment can be regarded as a special 
case of CNT dispersion. A few techniques, including mechanical stretching (Jin 
et al. 1998), melt-spinning (Fomes et al. 2006), dielectrophoresis, and application 
of an electrical or magnetic field (Park et al. 2006; Steinert and Dean 2009), have been 
employed during the composite fabrication to align CNTs in a polymer matrix. The 
degree of CNT alignment in the composite can be governed by two factors: (i) aspect 
ratio of CNTs and (ii) CNT content. A smaller diameter of CNT can enhance the 
degree of CNT alignment due to the greater extensional flow, and a higher CNT 
content decreases their alignment because of the CNT agglomeration and restrictions 
in motion from neighboring CNTs (Desai and Haque 2005). While alignment is 
necessary to maximize the strength and modulus, it is not always beneficial because 
the aligned nanocomposites have very anisotropic mechanical properties, i.e., the 
mechanical strengths along the alignment direction can be enhanced, whereas these 
properties are sacrificed along the direction perpendicular to this orientation. 

In addition, the interfacial properties between CNTs and matrix molecules play 
an essential role for mechanical properties of such nanocomposites. A strong 
interfacial adhesion corresponds to high mechanical properties of nanocomposites 
through enhanced load transfer from matrix to CNT. Chemical and physical 
functionalizations of CNTs have proven to enhance the interfacial adhesion. 
Table 4 summarizes the effects of CNT functionalization on the mechanical 
properties of CNT/polymer composites made from thermoplastic polymers. 
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These results indicate clearly that functionalization of CNTs can greatly enhance 
the modulus, strength, as well as fracture resistance of CNT/polymer composites. 

Polymer Nanocomposites for Packaging Applications 
Introduction 

Due to their superior physical, thermal, and mechanical properties, plastics have 
developed to be the most important class of packaging materials. In Europe, 
packaging is the largest market for plastics accounting for nearly half of all plastics 
processed. The main advantages of plastics as compared with other packaging 
materials are that they are lightweight and low cost and have good processability, 
high transparency and clarity, as well as good barrier properties with respect to 
water vapor, gases, and fats. It is estimated that global plastic packaging materials 
and products market will reach US$262.6 billion by 2015. 

The huge use of plastics, however, brings in more and more environmental 
problems; in particular, plastic packaging consists largely of single-use, disposable 
items. Additionally, plastics are generally made from petroleum, which resources 
are finite and fast depleting. So, in order to reduce the negative impact to our 
environment, it is necessary to turn the packaging industry green by using more 
environment- friendly products. 

Green packaging is broadly defined as packaging that is designed to lessen 
environmental impact throughout the whole life cycle; while maintaining account- 
able performance, it includes packaging with recycled content, reusable packaging, 
and degradable packaging. Figure 16 shows the green packaging demand by type 
(The Freedonia Group & Inc 2011) and Fig. 17 shows the green packaging demand 
by market (The Freedonia Group & Inc 2011). 

GREEN PACKAGING DEMAND BY TYPE, 2009 

($34.5 billion) 


Reusable 



Recycled 

Content 

90% 


Fig. 16 Green packaging 
demand by type 
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Fig. 17 Green packaging 
demand by market 


GREEN PACKAGING DEMAND BY MARKET, 2009 

($34.5 billion) 

Other 



Polymer Nanocomposites for Packaging Applications 


Polymer nanocomposites, a new class of materials, have shown great potential to 
enhance the physical, thermal, mechanical, and processing characteristics at low 
filler loading. In the packaging industry, the use of polymer nanocomposites will 
not only increase the properties of the packaging polymer materials but also offer 
additional functions to the packaging. 

Among the polymer nanocomposites for packaging, nanoclay is one of the 
nanofiller mostly used and studied. The nanoclay usually not only increases thermal 
and mechanical properties but also increases barrier properties to moisture, sol- 
vents, chemical vapors, gases such as 0 2 , and flavors. Basilia et al. synthesized 
recycled polyethylene terephthalate (RPET)/organic modified nanoclay (OMMT) 
nanocomposites by direct melt intercalation method. The mechanical properties 
increased greatly with the nanofiller fraction as shown in Fig. 18 (Basilia 
et al. 2002). 

Hamzehlou and Katbab (2007) also found that modified nanoclay can increase 
both tensile strength and tensile modulus of the recycled PET (RPET), and perme- 
ability of the thin films prepared from RPET/nanocomposites to oxygen gas was 
also reduced significantly compared with both virgin and neat RPET (Table 5). 

Emamifar et al. (2011) prepared low-density polyethylene (LDPE) films 
containing Ag and ZnO nanoparticles by melt mixing in a twin screw extruder. 
The presence of the nanoparticles increases the antimicrobial activity of 
L. plantarum; reduced numbers of L. plantarum were observed (p < 0.05) in 
nanocomposite packages of orange juice containing nanosilver and nano-ZnO. 

Alamri and Low (2012) reported on water absorption behavior of nanosilicon 
carbide-filled recycled cellulose fiber (RCF)-reinforced epoxy eco- 
nanocomposites. Water absorption was found to decrease gradually due to the 
presence of n-SiC as shown in Fig. 19. It was believed that the high aspect ratio 
nature of the nanofiller enhances the barrier properties of the materials by creating 
tortuous pathways for water molecules to diffuse into the composites. Maximum 
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Tensile strength 


Modulus 


Fig. 18 Mechanical properties of RPET-PHIL (OMMT) system at various clay loadings 


Table 5 0 2 permeation for recycled PET and its nanocomposites 


Sample 

3 2 1 1 

0 2 permeation (cm m day bar ) 

FPET 

25.33 

Processed PET 

28.14 

PET + 1 wt% clay 

19.60 

PET + 3 wt% clay 

21.70 

PET + 5 wt% clay 

7.890 

RPET 

32.40 

Processed RPET 

36.80 

RPET + 1 wt% clay 

21.80 

RPET + 3 wt% clay 

22.04 

RPET + 5 wt% clay 

8.540 


water uptake of RCF/epoxy composites filled with 5 wt% n-SiC decreases by 47.5 
% compared to unfilled RCF/epoxy composites. 

Biopolymers are promising materials for green packaging applications. Bio- 
polymers are polymers derived from renewable biomass sources, such as vegetable 
fats and oils, cornstarch, pea starch, or microbiota. Some biopolymers are designed 
to be biodegradable that are capable of being decomposed by bacteria or other 
living organisms in either anaerobic or aerobic environments. Most biodegradable 
polymers are actually designed to be compostable, which means it degrades to 
carbon dioxide, water, inorganic compounds, and biomass at a rate consistent with 
known industrial composting conditions. Typical biopolymers from renewable 
resources and with biodegradable property are poly lactic acid or polylactide 
(PL A), starch, and poly hydroxy butyrate (PHB). 
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Fig. 19 Water absorption 
curves of n-SiC-filled 
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Fig. 20 Illustration of the grafting of PL A onto 3 -aminopropyltriethoxy silane (APS)- 
functionalized silica surface 


There are many reports that nanofillers increase barrier, thermal, and mechanical 
properties of biopolymers which are usually have poor mechanical properties, high 
hydrophilicity, and poor processability (Tang and Alavi 2012; Dean et al. 2008; 
Park et al. 2004). Wu et al. (2013) reported grafting polymerization of polylactic 
acid (PLA) on the surface of nano-Si0 2 (Fig. 20) and studied properties of PLA/ 
PLA-grafted Si0 2 nanocomposites. It was found that PLA-grafted Si0 2 can accel- 
erate the cold crystallization rate and increase the degree of crystallinity of PLA. 
Shear rheology testing indicated that PLA/PLA- grafted Si0 2 nanocomposites have 
the typical homopolymer-like terminal behavior at low-frequency range even at a 
content of PLA-grafted Si0 2 of 5 wt%. 

Li and Sun (2011) prepared surface -grafted MgO (g-MgO) by in situ melt 
polycondensation of lactic acid and surface-hydroxylated MgO nanoparticles and 
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Fig. 21 Mechanical properties of PCL/nanoclay nanocomposites as a function of nanoclay 
fraction 


then prepared poly(lactic acid) (PLA) nanocomposites through thermal 
compounding of PLA and g-MgO/MgO nanoparticles. It was found that 
PLA/g-MgO nanocomposites exhibited higher tensile strength than neat PLA and 
PLA/g-MgO nanocomposites with g-MgO fraction lower than 0.05 % show 
increased thermal stability. 

Polymer foams, a type of lightweight materials, are very important packaging 
materials; it provides protection for the products with controllable performance. 
Generally, uniformity of cell sizes, surface quality, thermal and dimensional sta- 
bility, and mechanical properties including strength and shock absorption are 
among the important properties determining the applications of the polymer 
foams. Using nanofillers into polymer foam will enhance significantly the perfor- 
mance of the foams as packaging materials. 

Hu et al. (Liu et al. 2010) prepared nanoclay-filled biodegradable poly 
(e-caprolactone) (PCL) nanocomposites foam with chemical foaming agent. It 
was found that Young’s modulus of the nanocomposites increased with increasing 
clay fraction, and elongation at break of the nanocomposites increased with 
increasing clay fraction at low nanoclay fraction, but decreased at high nanoclay 
fraction higher than 10 wt% due to the agglomeration of the nanoclays (Fig. 21). 

Istrate and Chen (2012) also studied nanoclay-filled poly (e-caprolactone) (PCL) 
foams with a blowing agent. The nanoclay was firstly treated with chemical 
blowing agents, and the polymer/treated nanoclay nanocomposites were prepared 
by solution mixing; the pores were foamed by thermal degradation of the blowing 
agent. The blowing agent played dual roles in this approach: formation of 
bubbles and facilitation of clay exfoliation. With nanoclay fraction as low as 
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2.2 and 2.9 wt%, the compressive modulus and stress at 10 % strain of the porous 
polymer were found to increase by up to 152 % and 177 %, respectively. Strain improved 
by up to 69 %, while thermal degradation temperature was also greatly increased. 

Lee et al. (2008) prepared tapioca starch-poly(lactic acid) nanocomposite foams 
with four different types of nanoclays (Cloisite 10 A, Cloisite 25 A, Cloisite 93 A, 
and Cloisite 15 A) by melt mixing with extrusion. It was found that the extent of 
intercalation depended greatly on the nanoclay types, and accordingly, the glass 
transition temperatures, melting temperatures, and unit density, bulk spring index, 
bulk compressibility, Young’s modulus, water absorption index, and water solubil- 
ity index were all influenced significantly with the types of the nanoclays. 


Summary 

With the deteriorating environment and fast-depleting resources, traditional cost- 
effective packaging materials are no longer a guaranteed competitive advantage. 
Polymer nanocomposites with much improved physical, thermal, mechanical prop- 
erties and value-added functions will be among the main packaging materials in the 
future. Bio-based polymers, which are derived from renewable resources and 
biodegradable which possess the ability to degrade into small molecules upon 
bioactive environment exposure, are a promising polymer matrix for polymer 
nanocomposites in green packaging. 


Polymer Nanocomposites for Automotive Applications 
Application of Polymer Materials in Automotive Industries 

Because they are lightweight and due to their property tailorability, design flexi- 
bility, and processability, polymers and polymer composites have been widely used 
in automotive industry to replace some heavy metallic materials. Table 6 
(Szeteiova) and Fig. 22 (Polymers in the automotive industry polymotive 2005) 
show the typical polymers used in a car. 

The cars with more components made from polymer composites are lighter. 
Figure 23 (APME 1999) shows the typical weight saving that can be made in 
various car parts when using plastics to substitute conventional materials. 

Cars with more polymers are lighter; in turn, chassis, drive trains, and transmis- 
sion parts can all be made lighter as a result of having to support a lower overall car 
weight. The benefits from the weight reduction are the improved efficacy of fuel 
and less greenhouse gases release. It has been estimated that for every 10 % 
reduction in a vehicle’s total weight, fuel consumption reduces by 7 %. This also 
means that for every kilogram of weight reduced in a vehicle, there is about 20 kg of 
carbon dioxide reduction (Frost & Sullivan 2005). It is estimated that the polymers 
will account for 18 % of average vehicle weight by 2020, up from 14 % in 2000 as 
shown in Fig. 24 (A.T. Kearney Inc 2012). 
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Table 6 Plastics used in a typical car 


Component 

Main types of plastics 

Weight in av. car (kg) 

Bumpers 

PS, ABS, PC/PBT 

10.0 

Seating 

PUR, PP, PVC, ABS, PA 

13.0 

Dashboard 

PP, ABS, SMA, PPE, PC 

7.0 

Fuel systems 

HDPE, POM, PA, PP, PBT 

6.0 

Body (incl. panels) 

PP, PPE, UP 

6.0 

Under-bonnet components 

PA, PP, PBT 

9.0 

Interior trim 

PP, ABS, PET, POM, PVC 

20.0 

Electrical components 

PP, PE, PBT, PA, PVC 

7.0 

Exterior trim 

ABS, PA, PBT, POM, ASA, PP 

4.0 

Lighting 

PC, PBT, ABS, PMMA, UP 

5.0 

Upholstery 

PVC, PUR, PP, PE 

8.0 

Liquid reservoirs 

PP, PE, PA 

1.0 

Total 


105.0 


The plastic components have been used from simple interior in the early stage to 
interior, exterior, powertrain, chassis, engines, electrical systems, and fuel systems 
in the present. The enormous growth of polymer components in automotives 
accompanies the demand for high-performance polymer materials to meet the 
requirements of the components; one of the advanced materials developed to 
apply in automotive sectors is polymer nanocomposites, which was firstly used 
by Toyota Motor Co. in 1991 to produce timing belt covers as a part of the engine 
for their Toyota Camry cars (Polymer nanocomposites drive opportunities in the 
automotive sector). 

Polymer nanocomposites, which are among the most widely watched technology 
areas in the plastics arena, are a new class of materials containing nanoparticles or 
nanofillers dispersed in the polymer matrix. The commonly used nanofillers for 
polymer reinforcement include nanoclays, carbon nanotubes, carbon nanofibers, 
nanosilica, nano-oxides, and polyhedral oligomeric silsesquioxanes; they usually 
can improve a wide range of properties of polymers at low filler fraction owing to 
their size and shape. 


Approaches of Fabricating Polymer Nanocomposites 

There are several ways to fabricate polymer nanocomposites; the commonly used 
are in situ polymerization, solution mixing, and melt compounding. 

In situ polymerization of polymer nanocomposites includes emulsion, 
emulsifier-free emulsion, miniemulsion, and dispersion polymerization; nanofillers 
are usually added directly to the liquid monomer, and a polymerization then starts 
either thermally or chemically. Significant property improvement can be achieved 
due to good interaction between the nanofiller and polymer matrix. Figure 25 
(Liang et al. 2008) shows the organic-modified nanoclay-filled nylon 
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Polymers breakdown for the 
BMW 1 Series (total 219 kg) 


Fig. 22 Polymer breakdown for the BMW 1 series 

6 nanocomposites formed by in situ polymerization, Fig. 26 (Liang et al. 2008) 
shows the barrier property, and Table 7 (Liang et al. 2008) tabulates the mechanical 
properties and heat distortion temperature (HDT) of the nanocomposites as a 
function of the filler fraction. 

The solution mixing to prepare the polymer nanocomposites may consist of 
several steps: (a) dissolving polymer matrix into an appropriate solvent to make a 
solution, (b) dispersing the nanofiller into the solution to make a suspension, and 
(c) casting the new mixture to evaporate the solvent to produce final 
nanocomposites. A method through solution blending and then compression mold- 
ing is also usually used by obtaining the mixture powders after procedure (b) and 
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Fig. 24 Polymers will account for 18 % of average vehicle weight by 2020 

compression molding into the mixture powders into the desired panel or prototype. 
One of the advantages for the solution mixing is the molecular level of mixing. 
Figure 27 (Bhattacharya and Chaudhari 2013) shows the mechanical properties 
of nanosilica-filled polyamide composites prepared by formic acid mixing. 
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Fig. 25 Nylon 6 nanocomposite formed through in situ polymerization with 12-aminododecanoic 
acid modified montmorillonite (ADA-MONT, Nanomer® I.24TL) 
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Fig. 26 Oxygen transmission rates (OTR) of nylon 6 nanocomposite from in situ polymerization 

(65 % RH) 
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Table 7 Mechanical and thermal properties of nylon 6 nanocomposites 


Nanoclay ADA-MONT 

(wt%) 

Flexural modulus 
(MPa) 

Tensile modulus 
(MPa) 

HDT (°C) 

0% 

2,836 

2,961 

56 

2 % 

4,326 (53 %) 

4,403 (49 %) 

125 

(123 %) 

4% 

4,578 (61 %) 

4,897 (65 %) 

131 

(134 %) 

6 % 

5,388 (90 %) 

5,875 (98 %) 

136 

(143 %) 

8 % 

6,127 (116 %) 

6,370 (115 %) 

154 

(175 %) 


60 



-30 

% Cone, of silica nano in PA film 

Fig. 27 Comparative tensile properties of nanocomposite films 


The composite film exhibits an increased tensile strength with an increase in silica 
content. However, composite film containing 1.0 wt% nanosilica exhibits much 
lower tensile properties as compared to the neat polymer due to poor particle 
distribution. 

In the melt compounding process, nanofillers are mixed with the polymer matrix 
at the molten state of polymers in the absence of any solvents. The dispersion of the 
nanofillers depends largely on the thermodynamic interaction between the polymer 
chains and the nanofillers. Comparing to other techniques, melt compounding is 
simple, versatile, and suitable for mass production; the resultant nanocomposites 
usually have high purity as the process is essentially free of contaminations. 
Figures 28 and 29 show the mechanical and thermal properties of the polypropylene 
(PP) nanocomposites as a function of graphene fraction (Pingan et al. 2011). The 
graphene used in the study was firstly coated with polypropylene latex and then 
melt-blended with PP matrix. 
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Fig. 28 Typical stress-strain curves of PP/PP latex as a function of PP latex loading level with 
detailed data presented in the figure 


Fig. 29 Thermal 
conductivity of PP and its 
nanocomposites as a function 
of grapheme loading 




Graphene (vol %) 


The graphene sheets were well dispersed in the PP matrix and considerable 
enhancement of the mechanical and electrical properties of PP was achieved by 
incorporating very low loading of graphene. By addition of only 0.42 vol% of 
grapheme, about 75 % increase in yield strength and 74 % increase in the Young’s 
modulus of PP were achieved. 
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Toughened Polymer Nanocomposites Prepared by Melt 
Compounding 

Tensile strength and impact strength of materials are among the most important 
properties for the materials applied in automotives. There are a few reports that 
show nanoclays improve both tensile strength and impact strength for the 
nanocomposites prepared by melt compounding (Liu and Wu 2002). However, 
the increase of the impact strength is usually only achieved at low nanoclay 
fraction; the further increase in the nanoclay fraction results in decrease in impact 
strength as shown in Fig. 30 (Kelnar et al. 2005). 

Nanoclays usually can increase the tensile strength and modulus of the polymers 
due to their rigid inorganic nature, nanoscale dimension, and huge adequate inter- 
facial contact area between the nanoclay and the polymer matrix. The increment of 
impact strength at low nanoclay fraction is perhaps caused by formation of submi- 
cron voids within the intra-gallery of clay layers under impact loading, which 
prevents crack propagation. On the other hand, nanoclays, particularly exfoliated 
nanoclays, actually act as stiff fillers which hinder the mobility of the surrounding 
chains of polymers and thus reduce the impact strength of polymers. Moreover, the 
size range of individual clay layers is in nanometer scale, which is perhaps too small 
to provide toughening via mechanisms like crack bridging. 

The common approach to improve the impact strength of polymers is to add 
elastomers with long molecular chains. The cavitation of elastomer particles 
followed by plastic deformation of the matrix is usually the main toughening 
mechanism in the polymer composites. Nevertheless, the soft nature of elastomers 
generally decreases the tensile strength of polymers. In order to obtain polymer 
composites with improved impact strength without sacrificing mechanical strength, 
ternary polymer nanocomposites with the presence of nanofillers such as nanoclay 
and elastomer have been designed. It has been found that the mixing sequence of 
the polymer, elastomer, and nanofiller have great effect on the properties of the 



Fig. 30 Tensile strength and impact strength of nanocomposites in dependence on the clay 
content (■) PA6 matrix, (□) PA6/EPR-MA 95/5 matrix. PA6: polyamide 6, EPR-MA: maleated 
(0.6 %) ethylene-propylene elastomer 
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nanocomposites, particularly the impact strength of the composites. It has been 
found that for polyamide 66 or nylon 66 ternary nanocomposite filled with nanoclay 
of Cloisite® 30B and elastomer of styrene-ethylene/butylene-styrene tri-block 
copolymer grafted with 1.84 wt% of maleic anhydride (SEBS-g-MA) (Dasari 
et al. 2005), the location of the nanoclay in the nanocomposites is different with 
different mixing sequences based on micro structure study. The two-step 
mixing-blending nylon 66 and nanoclay initially and later mixing with SEBS-g-MA 
is the preferred blending sequence to maximize the notched impact strength due to 
the maximum amount of the exfoliated nanoclay in the nylon 66 matrix. The 
presence of nanoclay in SEBS-g-MA elastomer phase reduces the cavitation ability 
of SEBS-g-MA particles. For the polyamide 6 nanocomposites filled with nanoclay 
of Cloisite® 93 A and elastomer of maleic anhydride-grafted-poly(ethylene-octene) 
(POE-g-MA), it has been found that the one-step compounding of PA 6 with the 
nanoclay and the elastomer shows the synergetic effect of the two types of the fillers 
in improving the tensile modulus and impact strength (Yu 2012; Fig. 31). 

As compared to PA 6, the impact strength of the nanocomposite is remarkably 
increased by 96.3 % in one-step mixing, while the impact strength remains the same 
in two-step mixing-blending of PA 6 with elastomer first and then blending with 
nanoclay. The nanoclay used in the study has a higher affinity to the PA 6 than to 
the POE-g-MA; the nanoclays disperse mainly in the PA 6 phase to enhance the 
tensile property in the one-step process. 

On the basis of the study on the extrusion sequence, the properties of the ternary 
composites are further optimized by investigating the filler fraction. Figure 32 
(Yu 2012) shows the tensile and impact properties for PA 6 composites with 
optimized filler fractions. The results are obtained with drying of the test pieces 
in the oven before testing to eliminate the effect of moisture absorbed on the 
mechanical properties of the composites. 

With the optimization in concentration of the fillers, PA 6 composites with much 
improved impact strength have been obtained; the tensile strength and modulus, at 
the same time, are not sacrificed. With 7.5 % of nanoclay and 10 % of toughening 
agent, the impact strength is increased by 1 10.0 %, modulus is increased by 9.2 %, 
and tensile strength is similar to that of neat PA 6. 


Green Composites for Automotive Applications 

With the drive of lightweight materials with super performance, improved fuel 
efficiency, and less C0 2 emissions in automotive industry, the usage of polymer 
nanocomposites in a car will continue to grow. Recently, there is increasing interest 
about green composites with growing environmental awareness. Green composites 
are composites that are designed to reduce environmental burden through their life 
cycles. The examples of green composites are natural fiber-reinforced polymer 
composites and biopolymer composites. Biopolymer composites are composites 
in which the polymer matrix is bio-based or biodegradable or bio-based and 
biodegradable. The research on application of green composites in automotives is 
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Fig. 31 Optimization in ternary composite processing (c = nanoclay, t = POE-g-MA) 
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Fig. 32 PA 6 composites with much improved impact strength (T = POE-g-MA) 
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Fig. 33 Use of natural fibers for composites in the German automotive industry from 1999 
to 2005 

as early as 1990s, the first car component made from natural fiber composites is 
believed to be a door quarter panels made of a LoPreFin PP/PET/natural fiber 
composite appeared on the 1999 Saab 9S. Figure 12 shows use of natural fibers for 
composites in the German automotive industry from 1999 to 2005 (Karus 
et al. 2006; Fig. 33). 

Both natural fiber and biopolymers generally are hydrophilic, easily absorb 
moisture, and have relatively poor processability and low mechanical properties. 
These disadvantages restrict their application, particularly as exterior automotive 
components. Research and development are required to overcome these obstacles 
to allow more green composites which are eco-friendly, lightweight, and cost- 
effective to be applied in automotive sector. One of the approaches to modify the 
properties of natural fiber-reinforced polymers and biopolymers is incorporated 
with nanofillers. Chieng et al. (2012) reported that with the addition of only 0.3 wt% 
of graphene nanoplatelet, the tensile strength and elongation at break of poly(lactic 
acid)/epoxidized palm oil blend increased by 26.5 % and 60.6 %, respectively. 
Nemati et al. (2013)studied mechanical properties of wood plastic composites made 
from wood flour, recycled polystyrene, and nanoclay. The obtained results indi- 
cated that the tensile strength and flexural strength were increased by raising 
nanoclay content in the composites as shown in Fig. 34. 

Guigo et al. (2009) prepared lignin and natural fiber nanocomposites filled with 
sepiolite or organically modified nanoclay by extrusion. It was found that the 
incorporation of 2 % or 5 % w/w of sepiolite does not influence the mechanical 
and thermal behavior compared to the reference lignin/natural fibers composite, 
while nanoclay-based nanocomposites have shown improved properties. Shi 
et al. (2006) investigated the effect of single-walled carbon nanotubes (SWNTs) 
and functionalized SWNTs (F-SWNTs) on electrical and mechanical properties of 
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Fig. 34 Effect of nanoclay content on flexural strength of wood plastic composites 


Fig. 35 Electrical 
conductivity as a function of 
nanotube concentration for 
SWNT and F-SWNT 
cross-linked nanocomposites. 
A value of 0.03 wt% is 
estimated for the electrical 
percolation threshold of 
SWNT nanocomposites using 
the scaling law 





unsaturated, biodegradable polymer poly (propylene fumarate) (PPF). It was found 
that nanocomposites with 0.1 wt% F-SWNTs loading resulted in a threefold 
increase in both compressive modulus and flexural modulus and a twofold increase 
in both compressive offset yield strength and flexural strength when compared to 
pure PPF networks, whereas the use of 0.1 wt% SWNTs gained less than 37 % 
mechanical reinforcement. The SWNT also increased significantly the electrical 
conductivity of the PPF polymer matrix as shown in Fig. 35. 

It has been proven that that the addition of nanofillers is an effective way to 
improve properties of neat polymers; the green polymer nanocomposites are con- 
sidered to be the next-generation materials for automotive and other industries. 
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Summary 


Herein, we have highlighted on the facts why polymer nanocomposites becoming 
popular among researchers, be it in industry or academics. The main reason is its 
widespread scope of tuning of physicochemical properties of the materials for high 
end (eg. in aerospace) as well as low end applications which ranges from bio to 
electronic and commodity to automobile. Furthermore, polymer nanocomposites 
become dearer due to its wide flexibility in preparation. In addition, it gives choice 
to choose appropriate polymer, type of nanofiller and processing parameter 
depending on applications. 
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Abstract 

An overview of surface modification and coating techniques for plastics is 
presented for changing the surface properties to meet the performance require- 
ments in a variety of applications. Surface modification and coatings are utilized 
for purposes of adhesion, wettability, biocompatibility, scratch and abrasion 
resistance, chemical resistance, barrier properties, and more. Methods for mod- 
ification include physical processes, such as surface roughening and abrading; 
liquid chemical processes, such as acid etching; and reactive gas chemical 
processes. The reactive gas chemical processes covered include corona, flame, 
and low -temperature plasma. Surface degradation from reactive gas exposure is 
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presented with respect to the sources, chemical mechanisms, and methods for 
characterization. Coatings for plastics, including paints, functional coatings, and 
metallization, are summarized. 


Introduction 

Selection of suitable materials for an application not only requires consideration of 
bulk properties, but surface characteristics play a vital role in their successful 
application. For plastics, surface treatments and coatings can improve the perfor- 
mance in existing applications, as well as enable the further expansion of plastics 
for new applications. Surface modification and coating is often performed to 
improve adhesion, tailor hydrophobic or hydrophilic properties, increase scratch 
and abrasion resistance, provide decoration, impart electrical conductivity, improve 
biocompatibility, increase barrier properties, reduce friction, enhance resistance to 
chemicals, and other goals to target specific applications. Surface modification 
technologies include physical treatments, wet chemical treatments, and dry process 
treatments and coatings, such as plasma, corona, and flame. Coating techniques 
include paint and functional coatings, metallization, printing and decorating, lam- 
ination, and chemical vapor deposition. 

Surface treatments and coatings enable the expansion of plastics into applica- 
tions otherwise dominated by glass, metal, and ceramic. For applications tradition- 
ally using glass, optical quality plastics offer light weight, greater impact resistance, 
and higher thermal insulation, which can impact energy savings and environmental 
performance in the long term. However, in order to achieve the required optical 
clarity, scratch and abrasion resistance, and exposure to weather (depending on the 
application), a coating or layers of coatings are required for plastics used for glass 
replacement (Tolinski 2009). Metallization of plastics allows the use of plastics in 
place of metal components, as well as enhances the decorative possibilities for 
plastics. Especially in the growing electronics industry, metalized plastics are 
replacing metal components for connectors, electromagnetic shielding, cell phones, 
and medical devices. 

The types of surface treatment or coating selected for a given application depend 
on the target properties and specifications, the substrate material, the final use and 
environment in which the part will be applied, all costs associated with the process, 
and the ability to integrate the treatment with the existing manufacturing processes. 
When a product traditionally produced from glass, metal, or ceramic is replaced by 
plastic, oftentimes, new quality control tests are required to capture new failure 
modes that can occur. This especially holds true for applications in which the 
plastic replacement has a surface treatment or coating. For example, adhesion 
tests, accelerated environmental exposure, chemical resistance, optical properties, 
scratch and abrasion, stress, and other such tests are critical to evaluate the 
successful application of surface treated or coated plastics. 
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Greater environmental and safety considerations as well as cost sensitivities are 
shaping the technologies for surface treatment and coatings for plastics. Wet 
chemical treatments that often involve the significant use of harsh chemicals and 
chemical waste are being replaced by dry processes that offer lower environmental 
impact. For example, hexavalent chromium, used in wet chemical treatment of 
plastics prior to metal plating, is carcinogenic when inhaled and requires the use of 
controls and personal protective equipment to restrict exposure of workers. The 
European Union prohibits the use of hexavalent chromium in electronics, according 
to the Restriction of Hazardous Substances Directive. Recent research is focused on 
using more environmentally friendly chemicals for surface pretreatment prior to 
metal plating (Nagao et al. 2006). 

In plastic coatings, radiation curing and water-based formulations have an 
increasingly greater role with better environmental impact. Radiation curing offers 
less solvent use, lower energy consumption, rapid curing times, and greater surface 
hardness and scratch resistance than traditional thermally cured coatings. Despite 
the advantages offered, radiation-cured coatings still accounted for a small fraction 
of the total coatings market in 2010 (IHS Chemical Report 201 1). It is believed that 
the high material costs and investment in new equipment are preventing greater 
penetration of radiation-cured technology for coatings. Powder coating is a fast- 
growing coating technique for coating conductive materials, in which high- 
performance coatings are produced without the use of solvent, reducing VOCs. 
Recently, powder coating technologies have been adapted for use on plastics using 
a surface treatment that makes the substrates temporarily conductive using a 
technique with no VOCs or hazardous by-products (Stay 2012). 

Table 1 summarizes available surface treatments for plastics regarding purposes 
and manufacturing processes. Greater detail on surface treatments and coatings are 
provided in the proceeding sections. 


Table 1 Summary of surface treatment processes for plastics 


Plastic Surface 
Treatment 

Purposes 

Manufacturing processes 

Surface 

Modification 

Wettability, adhesion, printability, 
paintability, bonding, compatibility with 
contacting material 

Mechanical roughening, 
chemical exposure and 
etching, corona, flame, plasma 

General 
Coatings for 
Plastics 

Durability, UV-protection, chemical 
resistance, scratch resistance, electrical 
conductivity, other 

Paint, themal-cure coating, 
radiation-cure coating, plasma 
deposition 

Metallization 

Decoration, reflectivity, electrical 
conductivity, EM shielding 

Vacuum metalizing, arc and 
flame spraying, plating 

Printing, 
Decorating, 
and Polishing 

Aesthetics and decoration, surface 
smoothing 

Printing, in-mold decoration, 
physical and chemical 
polishing, vapor polishing 
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Surface Treatment and Modification of Plastics 

Surface treatment and modification techniques are often applied to plastics prior to 
a coating or finishing process to remove contaminants, generate surface reactive 
sites, and create new surface functional groups compatible with the subsequent 
process chemicals. The low surface energy and smoothness of plastic surfaces 
oftentimes present problems for adhesion of a coating, a metalized layer, decora- 
tion, or other finishing processes. Additionally, surface impurities, such as addi- 
tives, and dust and grime buildup due to static charge may also impact adhesion, 
finishing, or subsequent application of the plastic part. Cleaning the surface with 
mild solvents, such as isopropanol, and antistatic processing of the plastic part may 
not be sufficient to remove impurities and therefore, a surface modification method 
is oftentimes applied. 

Several types of techniques have been developed to modify the surface of the 
plastic in the submicron region to enhance the surface area and sites for bonding, 
increase surface energy, create reactive sites or targeted chemical functional groups 
on the surface, and remove surface impurities. Surface modification techniques can 
be classified into physical processes, such as surface roughening; liquid chemical 
surface modification, such as acid etching; and reactive gas processes, such as 
corona, flame, and plasma. Each type of surface modification process has advan- 
tages and drawbacks which must be considered for a given application, the type of 
plastic, and the cost and commercial considerations. 


Physical Surface Treatments 

Physical surface treatments of plastics are useful for bonding a plastic to an adhesive 
or coating by increasing the surface area of the plastic and creating greater sites for 
mechanical interlocking. Typical physical surface treatments include surface rough- 
ening, such as with sand paper or emery cloth, adhesive abrading, and media blasting. 
The effect of the physical treatment depends on the type of abrading material, the 
plastic, the original surface quality, and the process parameters. Surface roughening 
is a simple process that can be performed prior to applying an adhesive or coating to 
increase adhesion strength. Surface roughening can be followed by degreasing with a 
solvent, and recommendations for the types of physical treatments and solvent 
cleaning depend on the polymer used (Smart Adhesives). Surface roughening can 
be performed in the presence of an adhesive, referred to as “adhesive abrading,” in 
which two surfaces are abraded and adhesive coated prior to bonding and curing. 
Adhesive abrading has shown to increase the bond strengths on PTFE by approxi- 
mately 700 % (Henkel 2011). Media blasting is a type of physical process that 
involves propelling small pellets of solid materials to the plastic surface driven by 
a stream of pressurized gas. The propelled materials can include sand, metallic shot, 
nutshells, plastic pellets, dry ice crystals, or others (Izzo 2000). 

While physical surface treatments are relatively simple and low cost, they are 
not effective for many types of plastics and adhesive or coating. For many 
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applications, increasing surface area is not enough for the adhesion requirements, 
and chemical surface modification is required. The next sections outline chemical 
surface modification processes, in which chemical reactions drive the surface 
changes to bring about the desired surface properties. Chemical surface treatments 
are performed in the liquid phase, as well as the reactive gas or plasma phase. 


Liquid Chemical Surface Treatments 

Wet chemical surface modification involves exposing the surface of a polymer to a 
chemical or chemical mixture in the liquid phase. The chemicals react with 
molecules on the surface to create new surface functional groups. Chemicals in 
the liquid phase can penetrate pores more effectively than other types of surface 
modification techniques. However, significant amounts of chemicals and chemical 
waste are involved in wet chemical surface modification for large-scale use, 
compared to reactive gas surface treatments. 

Acid etching of polymer surfaces involves application of acid to a surface of a 
polymer to induce surface oxidation and increase the surface energy. It typically is 
applied to enhance the bond strength to an adhesive or to a metal surface or coating. 
Chromic acid is commonly used for acid etching of plastics, which introduces 
oxidized functional groups, such as hydroxyl, carbonyl, and carboxylic acid. In 
addition, the process may alter the surface morphology and increase the surface 
area and, thus, increase the sites for mechanical interlocking. The type of plastic, 
the etch time, and the process temperature can affect the degree of oxidation and the 
etch depth. For instance, polypropylene shows an increase in etch depth with 
increasing etch time and temperature, while the degree of oxidation and etch 
depth increase with etch time for polyethylene (Henkel 2011). For polypropylene, 
chromic acid etching was shown to increase the adhesion level of the surface with 
an epoxy adhesive (Sheng et al. 1995). 

Other types of liquid surface treatments include the use of relatively milder 
chemicals, for example, iodine treatment of nylon, which increases the surface 
crystallinity and enhances adhesion to a metal coating. Sodium and sodium com- 
pounds have been shown effective at treating polymer surfaces for biomedical use, 
including reducing inflammation and infection induced by synthetic materials used 
inside the body, such as polypropylene meshes (Regis et al. 2012). 

Many of the liquid chemical surface treatments have considerable drawbacks. 
For example, wet chemical treatment typically involves many additional processing 
steps such as washing, rinsing, and drying. Furthermore, wet processes produce a 
considerable amount of waste, oftentimes requiring hazardous waste disposal. 
Regulations are driving manufacturing away from using some of the chemicals 
for liquid surface treatment. On the other hand, reactive gas and plasma discharge 
processes provide versatile, reproducible, and environmentally benign methods for 
surface modification of plastics. In addition, reactive gas or plasma processes can be 
applied to modify very thin surface layers or deposit single- or multilayer coatings 
without altering the bulk characteristics of materials. 
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Reactive Gas Surface Treatments 

Reactive gas and plasma processes (dry processes) involve the generation of a 
luminous gas or plasma that contains high-energy neutrals and ionized species. The 
major types of reactive gas processes that have been employed to modify plastic 
surfaces include corona discharges, flame treatments, low-pressure nonequilibrium 
plasmas, and atmospheric nonequilibrium plasmas (Yasuda 2005; Wertheimer 
et al. 2002; Wolf 2010; Bardos and Barankova 2010). Reactive gas processes 
have been used for the past several decades to chemically modify the surfaces of 
plastic parts to increase wettability, enhance adhesion to inks, coatings, and adhe- 
sives and to bring about compatibility with a chemical or contacting material in 
subsequent processes and the final application. The processes are effective at 
modifying the surface of the plastic while keeping the bulk properties unchanged. 
In addition, dry processes typically have a much milder environmental impact than 
liquid chemical processes with respect to hazardous waste and emissions. The most 
dry surface treatment processes can produce very effective outcomes using only 
oxygen, nitrogen, or inert noble gases. When other, less inert reactive gases are 
required to create highly functional tailored surface chemistries, inherently low 
flow rates of the process gases produce minimal effluent. 

Reactive gas modification processes involve the exposure of the plastic surface 
to energetic species with much greater energy than that of the covalent bonds of the 
organic polymer molecules. This exposure to high-energy species can result in 
removal of organic surface contaminants, surface cross-linking, reaction with 
species in the air to form new surface functional groups, and surface ablation and 
damage. Therefore, for a given plastic and final application, consideration must be 
made on the types of treatments, process conditions, and potential drawbacks in 
conjunction with the goals of the final application. In addition, the polymer addi- 
tives may affect the treatability of the surface, stability of the treated surface, and 
the compatibility of the treated surface with the final coating or application. 

Corona Discharges 

Corona treatment is a type of atmospheric pressure air electric discharge that is the 
most widely used surface treatment method for plastic surface modification. 
Figure 1 shows a corona process for treating polymer films. 

Most extrusion lines for films use in-line corona processes for surface treatment 
prior to printing. The basic components of corona processes include a power supply 


Fig. 1 Image of corona 
treatment equipment courtesy 
of Enercon Industries 
Corporation 
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and a treatment device. The power supply converts standard utility power to a single 
phase, higher frequency power that is supplied to the treatment device. The 
treatment device consists of two electrodes, one at high potential that is separated 
from the substrate surface by air, which is referred to as the “air gap.” The other 
electrode is at ground potential and is usually the surface on which the substrate is 
placed. When the voltage is applied to the electrode, energy is transmitted to the air, 
creating small filamentary discharges that consist of ionized air. High voltage is 
required to ionize air and corona processes typically operate around 10 kV. A 
substrate scans through the ionized air, contacting the energetic species that break 
surface covalent bonds to create radicals, cross-link surface macromolecules, and 
oxidize surface functional groups. Oxidation can create new surface polar groups to 
depths greater than 10 nm, such as hydroxyl, carbonyl, amide, and carboxylic acid, 
resulting in an increase in surface energy and wettability (Wolf 2010). 

Corona treatment is widely used to treat the surface of webs or rolls of plastic 
film. Many advances have been made in corona processing equipment and 
configuration since the invention in 1951 (Vetaphone). The types of rolls used 
today include bare rolls, covered rolls, and universal rolls. Bare rolls do not have 
an electrically insulating coating on one or more of the electrodes. Covered 
rolls have dielectric materials covering the electrode. A variety of dielectric 
materials are available, including silicone, Hypalon®, epoxy, ceramic, and 
glass. The choice of the type of roll covering for a specified plastic and applica- 
tion depend on the properties of the covering, including dielectric strength, 
dielectric constant, resistance to heat, ozone, cleaning tools, wear, good heat 
dissipation, low surface porosity, and, of course, cost (Wolf 2010). Universal 
rolls consist of proprietary ceramic coatings which is useful for treating a wide 
variety of materials. 

The extent and effectiveness of corona surface treatment is dependent on the 
watt density, line speed, and the substrate surface. Consideration of the factors that 
affect watt density should be made when sizing a corona system. The watt density is 
proportional to the power and inversely proportional to the station size (web width). 
The response of a material to corona treatment depends on the type of material and 
the surface quality (surface energy, crystallinity, etc.). For example, some poly- 
esters show a significant increase in surface tension for corona treatment using low 
watt densities. On the other hand, polyethylene requires moderate watt densities, 
and polypropylene requires relatively high watt densities for significant increase in 
surface tension (Markgraf). 

While corona treatment is used successfully to increase wettability and adhesion 
of plastics to inks, coatings, and adhesives, some effects of corona can be disad- 
vantageous for certain applications and plastics. These potential negative effects 
include detrimental increase in surface roughness, creation of pinholes, and insta- 
bility of the surface changes over time. The filamentary discharges in the corona 
process create discrete point locations of high energy in the air gap, which can bring 
about damage to the surface and heterogenous surface treatment effects. In addi- 
tion, significant surface morphology changes, and damage of surface macromole- 
cules can occur from reaction with such high-energy, nonuniform ionized air. 
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Fig. 2 Image of flame 
treatment attached to robot 
courtesy of FTS Technologies 


Flame Treatment 

Flame plasma treatment involves a flame generated by combustion of a gaseous 
fuel, such as methane or propane that is applied to the surface of a substrate. 
Exposure of a plastic to a flame brings about surface oxidation, producing increased 
wettability and surface energy for enhanced adhesion and printability. Flame 
treatment processes were initially used in the 1950s, and one of the first applications 
was improving the printability of low-density polyethylene (LDPE) (Brewis and 
Mathieson 1999). Flame treatment offers some distinct advantages over corona, 
including uniformity of treatment, a smoother surface after treatment, and a more 
stable surface for a longer time after treatment. In addition, flame treatment brings 
about oxidation to depths of only 5-10 nm, while corona can affect the plastic to 
depths much greater than 10 nm (Wolf 2010). Figure 2 shows an example of a flame 
process attached to a robot to treat three-dimensional parts. 

Flame treatment enhances adhesion by increasing the surface energy of a 
polymer surface through the surface oxidation. Treatment can result in the creation 
of new polar functional groups, such as hydroxyl, carbonyl, carboxyl, and in some 
cases, nitrogen groups. For polyolefins, flame treatment can introduce 5-15 % 
oxygen to the surface (Brewis and Mathieson 1999). It is believed that the chemical 
mechanisms involve hydrogen abstraction initially to form surface radicals, 
followed by reaction of the radicals with oxygen atoms and molecules (Strobel 
et al. 1996). The surface radicals formed can react with other molecules in the flame 
plasma or the polymer surface to form a variety of functional groups. 

The main exothermic reaction in the flame plasma discharge for the combustion 
of methane is as follows: 

CH 4 + 2 O 2 — » CO 2 + H 2 O in the presence of N 2 

Flame plasmas also contain highly reactive side products and species from 
combustion, including ethers, esters, carbonyls, carboxyls, and hydroxyls (Wolf 
2010). All of these species can react with the surface of the polymer during 
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treatment to create new surface functional groups. In addition, vaporized chemicals 
can be added to the surface of the plastic during treatment to add specific functional 
groups. One such system developed by FTS Technologies was tested on 
DaimlerChrysler’s 2004 AN Dodge Dakota Wheel Flare prior to painting (Brynolf). 
The patented ATmaP™ technology exceeded DaimlerChrysler requirements and 
demonstrated superior performance over the control sample. 

The effects of the treatment are influenced by the design of the equipment, the 
operational parameters of treatment, and the substrate polymer surface. The opera- 
tional parameters that affect the level of treatment include the thermal output and 
flame temperature, the stoichiometric ratio of air/fuel, the total gas flow rate, the 
specific power (W/area), the gap between the substrate and the flame (“air gap”), and 
the flame size and shape. The design of the burner can influence the thermal output 
and flame temperature, as well as the flame size and shape. The main type of burner 
used for flame treatment is a ribbon burner, which is designed for high heat release and 
a continuous flame. Modifications with orientation, width, and depths of the ribbons 
can affect the flame geometry and size, while the flame shape is affected by the depth 
and width of the stacked ribbons and flats (Wolf 2010). An alternative burner is the 
enhanced velocity (EV) burner, which was designed for high mass flow and velocity 
(Markgraf 2004). Compared to the standard ribbon burner, the EV burner can produce 
increased surface energy and adhesion while allowing for a greater air gap. 

Plasma Surface Modification 

In contrast to corona discharge, low-temperature plasma discharges can produce 
uniform surface treatments with controllability and flexibility to minimize damage 
and create numerous possible surface functionalities. Plasma discharges can chem- 
ically modify a polymer surface by surface functionalization using reactive gas 
plasmas, in which new surface functional groups are created, or by surface cross- 
linking, which includes the CASING effect (cross-linking via activated species of 
inert gases) with a plasma of a noble gas (Gilliam and Yu 2008; Strobel et al. 2003). 
The reactions that occur on the polymer surface during a plasma treatment involve 
the production of surface free radical sites that can react with the surrounding 
polymer molecules and the plasma-phase species. Radical sites that are not con- 
sumed during plasma exposure can be quenched by components in the ambient air 
that become incorporated into the polymer surface, mainly oxygen, moisture, and 
nitrogen, upon exposure to atmosphere. 

Various types of plasma processes for manufacturing are used for surface 
modification of plastics. The types of processes are basically distinguished by the 
operating pressure, electrode and power configuration, and energy parameters. 
Low-pressure plasma processes require a vacuum chamber with pumps and can 
be operated as batch or semi-batch continuous processes. For semi-batch continu- 
ous operation, the system is equipped with loading and unloading stations that can 
be quickly isolated from the main chamber for pumping down to operating pressure 
or increasing to the atmospheric pressure. Alternatively, plasmas operating under 
atmospheric pressure are also used for many plastic surface treatment applications. 
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In one such application, atmospheric plasma is used to pretreat polypropylene 
(PP) parts of tumble dryers to bring about strong adhesion and a durable seal 
(MIELE: Quality assurance on tumble dryers through plasma). 

The choice of power includes direct current or alternating current, in which the 
frequency may vary from the kHz range to microwave (MW) frequencies. In 
general, the higher frequency brings about a greater degree of ionization in the 
plasma. High frequency in the microwave region results in the generation of heat 
that may be destructive to plastics. The configuration of electrodes, such as capac- 
itive or inductive coupling, addition of a magnetic field, and the overall geometry of 
the chamber can result in varying degrees of effects of treatment. Plasma processes 
can involve capacitive coupling or inductively coupling of the electrodes. Within 
the vacuum range, higher pressure operation requires less operational costs and is 
more forgiving of leaks of the ambient into the chamber. In order to operate with 
higher pressure vacuum, higher power and frequency is needed. Operation in the 
radio frequency (RF) range generally offers the best combination of pressure and 
power (Yasuda 2005). 

Advantages of using low pressure include low flow rate of gases, the flexibility 
to use a variety of types of gases, and the uniformity of the surface treatment. 
Disadvantages of low-pressure processes generally relate to the manufacturability 
aspects, including the limitation of part size and shape based on the size of the 
process chamber, the high capital investment cost, and maintenance of the vacuum 
pumps and peripheral equipment. In contrast, atmospheric pressure plasma pro- 
cesses do not require expensive vacuum equipment or chamber, are less energy 
intensive, and can be easily placed on existing plastic processing lines. 
Low-pressure processes entail considering capital costs and requiring expensive 
pumps and vacuum equipment. Atmospheric pressure plasmas, on the other hand, 
offer considerable decrease in capital and working cost requirements and provide a 
true in-line manufacturing process for plasma treatment. Not every outcome pro- 
duced using a vacuum plasma process can be replicated with atmospheric plasma, 
however, and thorough background review should take place before selecting a 
suitable atmospheric plasma process. 

Chemical Effects of Reactive Gas Treatment on Polymeric Surfaces 

This section provides an overview of the chemical effects of reactive gas treatment 
on polymeric surfaces. In this section, the term “plasma” refers to any reactive gas 
discharge, including corona, flame, and low-temperature plasma. Low-temperature 
discharges contain many reactive species including ions, electrons, free radicals, 
metastable neutral species, ultraviolet (UV) and vacuum ultraviolet (VUV) pho- 
tons, and ground-state neutrals. Once a plastic substrate is placed in a plasma 
environment, the surface is subject to continuous bombardment by these plasma 
species that can react with the surface elements, change the surface chemistry, and 
modify the surface characteristics. Figure 3 illustrates the interactions of the various 
plasma species with the plastic surface. Energy transfers from the plasma species to 
the polymer can cause ablation of hydrogen or side-group species or chain scission, 
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depending on the energy levels and the polymer structure. Free radicals that exist in 
reactive gas plasma can diffuse to the polymer surface and cause various chemical 
reactions, including abstraction of hydrogen or other side-group species, and 
incorporation of chemical species into the polymer. 

Four main effects of reactive gas treatment on polymer surface include surface 
cleaning, ablation or etching, surface chemical functionalization, and cross-linking. 
Each of the effects is present to some degree in a reactive gas process; the extent 
and degree of the effects depend on the process, energy and frequency, gas 
chemistry, reactor design, and operating parameters (Yasuda 2005). Surface 
cleaning involves the removal of organic contamination of the surface from addi- 
tives in the plastic, previous processing steps, ambient plant air, or contact with 
another surface. Ablation and etching of surface material can increase the surface 
area, create reactive surface sites, and remove a weak boundary layer that can be 
detrimental for adhesion. Cross-linking and branching of surface macromolecules 
can cohesively strengthen the surface layers. Modification of the surface chemical 
structure involves the creation of new surface functional groups, which opens up 
numerous possibilities for surface properties. 

Ablation and etching of polymeric surfaces can increase the surface area, 
enhancing adhesion, and create reactive sites for bonding with a subsequent chem- 
ical or material. During ablation and etching, elements or small molecules are 
removed from surface macromolecules and surface radicals are generated. Chem- 
ical functionalization reactions can occur from the reactions of the surface radical 
sites with reactive gas species in the plasma. The surface radical sites can be created 
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through ablation of hydrogen, removal of small molecules from the polymer chain 
side groups, or chain scission as shown below: 

Hydrogen ablation: RH — > R» + H* 

Side — group ablation: RX — ■» R* + X» 

Chain scission: R 1 R 2 — » Ri • + R 2 # 

The radicals created on the polymer surface from ablation and etching can then 
react and form covalent bonds with plasma-phase species. As a result, new func- 
tional groups are created on the surface. In addition, when the treated polymer is 
exposed to the atmosphere, any reactive sites that remain on the surface may react 
with oxygen, moisture, and nitrogen in the air. Plasma treatment of polymers can 
introduce a wide variety of functional groups on the polymer surface depending on 
the chemical precursors or mixtures of gas or vapor added to the plasma. Conse- 
quently, numerous surface properties are possible, and tailoring of the surface 
properties is performed through the tuning of plasma process and conditions. 
Surface oxidation from plasma treatment can create various oxygen groups in the 
polymer to increase surface energy, improve wettability, and enhance adhesion to a 
subsequent coating or material (Strobel et al. 2003; Momose et al. 1992). Other 
process gases and vapors can include fluorine-containing chemicals for surface 
fluorination (Jama et al. 1999; Rangel et al. 2003) to increase hydrophobicity of a 
polymer and nitrogen-containing chemicals for surface nitradation (Klemberg- 
Sapieha et al. 1991; Tatoulian et al. 2004) to create basic groups for dyeability, 
printability, or biocompatibility. 

In the 1960s, Hansen and Schonhorn introduced the CASING effect theory 
(cross-linking via activated species of inert gases) for polymer surfaces exposed 
to ions and metastable species of inert gas plasmas (Schonhorn and Hansen 1967). 
In addition to noble gas plasma species, investigations have shown that VUV/UV 
photons and electrons that exist in a plasma can induce surface cross-linking on a 
polymer (Fozza et al. 1999). Surface cross-linking produces many desirable effects 
on the polymer, including stability of the polymer surface, improved adhesion, 
improved surface bond strength, and resistance to solvents, heat, and moisture 
(Gilliam and Yu 2008). Thus, surface cross-linking of a polymer is an important 
process that can be applied to many mechanically weak polymers to enhance their 
surface properties. 

Cross-linking of a polymer surface from inert gas plasmas and VUV/UV irradi- 
ation occurs through ablation of hydrogen or other atoms in the side groups from the 
interactions with metastable noble gas atoms, noble gas ions, electrons, or photons 
to produce radical sites on the polymer surfaces. The radicals can form bonds with 
other radicals on nearby macromolecules, resulting in a cross-linked network on the 
surface layers of the polymer. Photons can be absorbed into the polymer surface and 
induce cross-linking to depths where other plasma species are physically inhibited. 
For plasmas that do not contain gas or plasma species that can bond with the 
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polymer molecules, the formation of new functionalities on the radical sites is 
inhibited, and cross-linking with other polymer radicals is the significant conse- 
quence in noble gas plasmas. Polymers treated with plasma always contain 
unreacted or residual surface free radicals that can incorporate oxygen, moisture, 
and nitrogen upon exposure to atmosphere. Consequently, noble gas plasmas 
without reactive gas addition can result in a stable, cross-linked top layer with 
mainly oxygen and some nitrogen functional groups anchored at the surface. 

Depending on the susceptibility of the polymer to plasma exposure, the presence 
of additives at the surface, and the energy levels and extent of exposure, plasma 
treatment could result in polymer degradation and surface instability. Surface 
instability can result from the degradation of the top layer of macromolecules 
into nonvolatile, low molecular weight oxidized material (LMWOM), which typ- 
ically can be removed from the polymer surface with a polar solvent (Strobel 
et al. 2003; Weikart and Yasuda 2000). The LMWOM gives rise to a loosely 
bonded weak material surface that can be detrimental for surface stability and 
adhesion. Hence, the polymer chemical structure, the energy levels and abundance 
of the reactive plasma species, the wavelengths and intensities of the photons, and 
the exposure time can determine the relative occurrences of ablation and etching, 
chemical surface functionalization, cross-linking, and degradation. 

Most polymers are susceptible to oxidative degradation, which generally occurs 
by a free radical mechanism that is initiated by plasma ablation reactions and yields 
peroxy and hydroperoxy intermediates (Fried 1995). The peroxy/hydroperoxy route 
to degradation is a mechanism that can lead to the formation of LMWOM on 
plasma-treated polymer surfaces (Fried 1995). The peroxy pathway to LMWOM 
formation begins with the formation of a radical on the polymer surface that reacts 
with various plasma species to form peroxy intermediates, as shown in the reactions 
below: 


Peroxy formation: R* + 02(or H 2 O 2 ) — ► ROO* (+H 2 ) 

Hydroperoxy formation: ROO» + RiH —> ROOH + Ri • 

R • + H0 2 -► ROOH 

Decomposition of the hydroperoxy group leads to the formation of an alkoxy 
radical, which degrades to form LMWOM as follows: 

Hydroperoxy decomposition: ROOH — > RO* -f- OH 

Alkoxy degradation: Ri — C — C(0 • ) — R 2 — ■» Ri — C • + R 2 — C = O 

Strobel et al. (2003) have offered an alternative theory to the peroxy mecha- 
nisms, arguing that the peroxy pathways to LMWOM are too slow to cause the 
significant LMWOM damage that has been reported in the time frame of a plasma 
treatment. They concluded that reactions involving atomic oxygen and ozone to 
form alkoxy radicals are the major routes to the formation of LMWOM on polymer 
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Plasma Species Reactions with Polymer Consequences 



Notes: thicker arrows » Relatively more contribution 

thinner arrows > Relatively small contribution 


Fig. 4 Possible mechanisms that can occur on a polymer surface exposed to a reactive gas 
discharge process 

surfaces in plasma discharges. The atomic oxygen and ozone pathways are shown 
by the reactions below: 

Direct alkoxy formation: R» + O — > RO» 

R» + O3 — > RO • -T O2 

Alkoxy formation from peroxy and oxygen atom : ROO • + O — » RO *+02 

Much is still unknown about the complex reaction mechanisms that occur on the 
polymer surface during plasma treatment. Figure 4 presents possible reaction 
pathways that can occur at the plasma-polymer interface during plasma surface 
treatments of polymers and bring about various outcomes. Many plasma species 
including ions, photons, electrons, atoms, and free radicals can cause chain scission 
or hydrogen or side-group ablation, which results in the formation of surface 
radicals. However, high-energy ions have a greater tendency for chain scission 
and subsequent degradation. The plasma-activated surface macromolecules can 
covalently bond with surrounding species to create new functional groups, cross- 
link with other surface macromolecules, or further react to degrade into volatile 
etch products or LMWOM. However, the lack of selectivity during a plasma 
treatment makes it difficult to optimize specific reaction pathways. 

Surface degradation of a polymer can depend on the polymer’s vulnerability to 
the particular plasma environment, which is related to the polymer chemical 
structure. In addition, the extent of degradation is determined by the concentration 
and energy levels of plasma species, such as oxygen, and the wavelengths and 
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intensities of VUV/UV photons emitted from the plasma. For example, the presence 
of oxygen in the structure of an organic polymer has been shown to enhance the 
polymer’s susceptibility to plasma etching (Gilliam and Yu 2008; Weikart and 
Yasuda 2000). On the other hand, aromatic rings in a polymer (both in the backbone 
and in pendant groups) provide some resistance to surface etching (Gilliam and Yu 
2008; Momose et al. 1992). 

Some organic polymers are sensitive to VUV/UV radiation in plasmas, mainly 
due to their ability to absorb photons in the VUV/UV wavelength range 
(Wertheimer et al. 2002). However, the absorption spectrum for a polymer depends 
on its chemical structure, which causes different photochemical effects on the 
various polymers. Polymethylmethacrylate (PMMA) is a polymer that easily 
undergoes oxidative degradation in a plasma environment, while polystyrene 
(PS) is highly stable toward degradation (France and Short 1998; Moss 
et al. 1986). Silicon-containing polymers are particularly resistant to 
photodegradation and oxidative degradation, yet degrade very easily in fluorine- 
containing plasmas, because of the formation of stable and volatile Si-F compounds 
(Brewis and Mathieson 1999). 

A treated polymer surface can contain mobile functional groups, in which 
dynamic changes can occur driven by interfacial tension or other surface forces. 
In the 1930s, Langmuir first pointed out that the surface properties of a solid are 
determined by the surface configuration (orientation of the surface functional 
groups) rather than the chemical configuration of the bulk molecules (Langmuir 
1938). In addition, a polymer with mobile surface functionalities can undergo 
surface configuration changes with changing contacting media (Brewis and 
Mathieson 1999; Gilliam and Yu 2008; Weikart and Yasuda 2000). Surface con- 
figuration changes are driven by the thermodynamic requirement to minimize 
interfacial tension, whereby the interface changes to establish new equilibrium 
with a new set of conditions. 

Hydrophobic recovery and loss of wettability can occur in plasma-treated poly- 
mers that are stored in ambient air for extended periods of time (Yasuda 2005; 
Gilliam and Yu 2008; Weikart and Yasuda 2000; Guimond and Wertheimer 2004). 
Hydrophobic recovery is an indication of polymer surface instability in which the 
hydrophilicity decreases with time stored in ambient air due to surface configura- 
tion changes. Weikart and Yasuda (Weikart and Yasuda 2000) demonstrated that, 
in some cases, long-term hydrophobic recovery can be reversed by immersing the 
treated sample in water for 24 h. This indicates that over time, the hydrophilic 
surface moieties created from plasma treatment rearranged away from the polymer 
surface. Upon changing the surrounding conditions from air to water, the hydro- 
philic moieties reoriented toward the polymer surface, thus making the surface 
wettable again. 

Surface Tension and Wettability Testing 

A variety of techniques have been utilized to characterize surface configuration, 
wettability, and morphology changes, including atomic force microscopy (AFM), 
scanning electron microscopy (SEM), and various surface tension and wettability 
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Fig. 5 The Wilhelmy force loop obtained from RF plasma treatment of LDPE with two immer- 
sion cycles shows significant overshooting, indicating that LMWOM was created on the polymer 
surface during treatment 

tests. Surface tension techniques can be used to assess the quality of the surface 
changes that take place from plastic surface exposure to a reactive gas process, 
including the presence of a layer of LMWOM, which impacts morphology. Contact 
angle is a static surface tension method of quantifying the surface wettability and 
surface energy changes that take place as a result of plasma exposure. Dynamic 
surface tension techniques, such as the Wilhelmy plate method, can evaluate 
dynamic wettability by observing the hysteresis, which is affected by roughness, 
chemical heterogeneity, surface deformation, surface configuration changes, and 
adsorption and desorption. 

The Wilhelmy balance method is one of the simplest and most useful techniques 
available for analyzing dynamic wettability and surface stability, because of its 
sensitivity to the surface characteristics of the polymer (Yasuda 2005; Gilliam and 
Yu 2008; Weikart and Yasuda 2000). The technique involves immersing the 
substrate in the form of a plate into water and measuring force exerted on the 
plate surface with a tensiometer as the plate is immersed over time to a preset depth. 
Retracting the plate and repeating the immersion cycle can reveal dynamic surface 
changes and the presence of a weak boundary layer or LMWOM. Force loops can 
be generated on the immersion/retraction cycles that can be used to calculate the 
dynamic contact angles. During immersion, LMWOM that may be present on the 
sample surface is removed in water, revealing the underneath, intact polymer 
surface that is usually more hydrophobic. The exposure of a more hydrophobic 
surface is indicated by “overshooting” of the force loops in subsequent immersion/ 
retraction cycles. 

Figure 5 shows the Wilhelmy force loops obtained from a sheet of low-density 
polyethylene (LDPE) that was treated with RF plasma of Ar + 0 2 mixture. The 
force loop measured in the second immersion cycle shows significant overshooting 
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Untreated LD PE 



depth (mm) 


Fig. 6 The Wilhelmy force loop of untreated LDPE shows a stable, hydrophobic surface with no 
intrinsic hysteresis (the second and third immersion lines trace the first immersion line) 

from the first immersion cycle. This result clearly shows the presence of LMWOM 
on a plasma-treated LDPE surface, which was removed by water in the first 
immersion to expose a more hydrophobic layer in the second immersion. 

The Wilhelmy method can also reveal mobile surface functionalities that can 
undergo rearrangement when driven by interfacial forces, or a surface configuration 
change. The Wilhelmy force loop for a polymer with hydrophilic functional side 
groups, such as nylon-6, would show an apparent increase in hydrophilicity from 
the first advancing cycle to the next. This phenomenon is a demonstration of 
intrinsic hysteresis, which is caused by surface configuration changes of the hydro- 
philic moieties near the polymer surface. During the first immersion, the hydro- 
philic surface moieties rearranged to bend toward the water-polymer interface, thus 
making the surface more wettable during the second advancing cycle. On the other 
hand, a polymer that has a dynamically stable surface, such as low-density poly- 
ethylene (LDPE), exhibits no intrinsic hysteresis in the Wilhelmy force loop, as 
shown by Fig. 6. Due to the mobility of polymer chains at the surface, polymers that 
have been surface treated without a coating may undergo hydrophobic recovery 
after treatment. Hydrophobic recovery should be considered when implementing a 
new surface treatment in order to quantify the lifetime of the effects of surface 
treatment. 

Low-pressure plasma treatment has been shown to induce the formation of 
LMWOM on a variety of polymer surfaces, in addition to changing the surface 
morphology and increasing surface roughness (Weikart and Yasuda 2000). Atmo- 
spheric pressure plasma treatment can also impact surface morphology and rough- 
ness. One study comparing the effects of air corona treatment to nitrogen 
atmospheric pressure glow discharge showed that air corona produced much higher 
quantity of LMWOM in the form of small nodules or droplets (Guimond and 
Wertheimer 2004). Another study found that corona treatment produced 
LMWOM, while flame treatment did not (Strobel et al. 2003). 
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Coatings for Plastics 

Coatings enable the expansion of plastics into new applications by imparting 
protection necessary for performance under harsh conditions of weather, exposure, 
wear and abrasion, chemicals, and other. Advances in coatings for plastics in the 
twenty -first century have been driven in part by the design freedom offered by 
plastics and greater environmental performance, such as lightweight materials for 
transportation. Coated plastics are used in a multitude of applications spanning such 
industries as automotives, electronics, packaging, construction, furniture and appli- 
ances, fabrics, toys, sporting goods, and others. This section provides a brief 
summary of various coatings and finishings used on plastics, including paints and 
functional coatings, metallization, and others. For more in-depth information on 
coatings for plastics, the reader is directed to available literature (Wicks et al. 2007; 
Tracton 2006a; Ryntz and Yaneff 2003). 


Paints and Functional Coatings 

Paint and functional coatings discussed in this section are those organic or hybrid 
coatings that use wet process technology, in which the coating is applied in a liquid 
form before curing. The functions imparted onto the plastics from such coatings 
include scratch and abrasion resistance, UV protection, surface quality and appear- 
ance, chemical resistance, antimicrobial properties, and other specially targeted 
properties. Numerous types of paints and functional coatings are available on the 
market for plastics with a wide range of resins, formulation, and processing options, 
the choice of which should be selected based on the given application and substrate. 

The types of coating systems can be classified as thermoplastic or thermoset. 
Thermoplastic systems involve only a physical change during the cure process, in 
which the film hardens over time due to loss of solvent. Thermoset systems, on the 
other hand, undergo chemical reaction during curing, which is initiated by thermal 
energy, radiation, or oxidation. The resins must contain reactive functional groups 
to participate in reaction and cross-linking during the curing process, such as 
hydroxyl, carboxyl, amino, epoxy, and isocyanate. The system may contain a single 
component (or resin) or two components and a catalyst. 

The variety of resins available for coatings includes acrylic, polyester, urethane, 
epoxy, siloxane-based, alkyd, cellulosic, and polyester, among others, a few of which 
are briefly discussed here. A wide variety of properties can be achieved depending on 
the choice of resin. Acrylic polymers are those that are typically formed by chain 
growth polymerization of acrylic or methacrylic acid monomers. Acrylic resins can 
be used in thermoplastic coatings or thermoset coatings with a functional group, such 
as a hydroxyl group. Coatings formulated with acrylic resins are typically used for 
applications requiring photooxidative durability, resistance to hydrolysis, hardness, 
chemical resistance, and others (Wicks et al. 2007; Nordstrom 2003). 

Polyesters, polyurethanes, and epoxies are polymerized by step-growth poly- 
merization, which involves reaction of two different monomers. Polyesters used for 
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coatings are generally low MW, amorphous, branched, and are cross-linked during 
curing (Wicks et al. 2007). They are formed by carboxyl and hydroxyl reaction to 
form the recurring ester group -COO-. Coatings made with polyester resins offer 
greater flexibility and impact resistance than those prepared with acrylic resins, as 
well as good impact performance and scratch resistance. Polyester formulations can 
be prepared as solvent-bome, waterborne, high solids, or solvent-free (Huber and 
Stoye 2006). In addition, polyesters can be formulated in two-component systems. 
Polyurethanes are also polymerized by step-growth polymerization, involving an 
isocyanate and alcohol to make the urethane group R-NHCOOR/. Polyurethanes 
can impart flexibility, toughness, and chemical resistance to the substrate surface. 
Available polyurethanes include aromatic and aliphatic, which are more expensive, 
but offer better durability and less susceptibility to turn yellow upon exposure to 
sunlight. As a result, aromatic urethanes are typically only used in primers or other 
undercoats (Nordstrom 2003). 

Silicone hard coatings are those made of polysiloxane and offer superior scratch 
and abrasion resistance, resistance to chemical attack, and durability under harsh 
conditions. As a result, silicone hard coatings are used in numerous demanding 
applications, such as auto and train windows, headlamps, safety glasses, face 
shields, and many more (Bernheim 2006). Many plastic substrates require 
pretreatment, annealing, or a primer in order to achieve adhesion to a silicone 
hard coating. 

Most coatings contain other components, including pigments and other func- 
tional additives to achieve the desired properties, that are formulated in a solvent. 
Reactive monomers are sometimes added as part of the carrier liquid. The use of 
solvents in a coating formulation enables control of viscosity and flow properties, as 
well as leveling and film formation of the coating. Greater environmental perfor- 
mance has been driving the developments in formulations with lower solvent (high 
solids), waterborne formulations, and solvent-free coatings. In order to reduce the 
amount of solvent used, a conventional approach is to lower the molecular weight 
of the resin (Nordstrom 2003). Radiation curing allows for high solids in the coating 
formulations. Waterborne formulations require water-soluble polymers or dis- 
persed polymers using surfactant or surface-modified particles. Powder coatings 
require no solvent, but entail high temperatures, which limit the use for plastic 
substrates. 

Paints and functional coatings are typically applied by flow coating the solution 
onto the substrate, dip coating the substrate into solution, or spray coating. Some- 
times a “flash-off’ period is necessary before the curing step to remove some of the 
solvent. Thermal curing involves the use of thermal energy to initiate reaction in an 
oven at some prescribed length of time. Curing by radiation, in contrast, can be 
carried out under room temperature with a fast curing time. Radiation curing 
requires significantly less energy consumption, reduces emissions, and increases 
productivity compared to thermal curing. Formulations for radiation curing contain 
a chemical functional group that is activated by radiation, such as methacrylate and 
epoxy. The most common sources for radiation curing include electron beam and 

UV (Koleske 2006). 
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Primer Coatings for Adhesion 

Primers are sometimes applied prior to coating with a material that is particularly 
problematic for adhesion to the plastic. Poor adhesion can result from incompati- 
bility between the coating chemicals and polymer, poor chemical bonding, lack of 
mechanical interlocking, stresses induced at the interface, and a considerable 
difference in thermal expansion properties. In addition to improving adhesion, 
primers can provide increased resistance to wear, scratch, and abrasion or protec- 
tion from chemical attack by a solvent that is present in a subsequent coating (Izzo 
2000). Other functionalities can be added to primers if desired in designing a 
coating system. Although primers are technically coatings, the thickness is usually 
low, on the order of a couple hundred nanometers, and the primary purpose is to 
treat a surface prior to adhesion to a coating, an adhesive, or another surface. 

Primers typically contain chemical functional groups that react and bind with the 
substrate surface, as well as functional groups with high attraction to the subsequent 
coating, adhesive, or surface. Primer solutions can be water-based or solvent-based 
(sometimes called “oil-based” or “alkyd-based”). Water-based primers have less 
environmental impact with respect to waste and effluent handling and are therefore 
preferred for manufacturing. Common types of primers include polyurethanes, 
acrylics, epoxies, and polyesters. Primer coatings can consist of one component, 
which does not require a hardener or activator, or two components, which need to 
be mixed with a hardener. Primers may also contain additives for functional 
purposes, such as UV absorbers to prevent photodegradation of the underlying 
polymer. 

Primers involve dissolving a reactive chemical species in a solvent or a mixture 
of solvents. The solution is applied to the surface by spray, flow, or dip application, 
followed by a flash-off period to allow the solvent to evaporate. Depending on the 
type of chemical in the primer, the method of curing can be thermal or radiation, 
such as UV. Silane and isocyanate primers must react with moisture in the ambient 
air prior to application of an adhesive (Henkel 2011). A wide variety of primer 
solutions are commercially available and the choice of resins, additives, and 
processing methods should be considered with respect to the type of polymer, 
coating, and final application. 


Metalized Coatings 

Metallization of plastics has been used for over a century, with the first commercial 
application recorded in 1905 (Liepins 2006). Applications for metalized plastics 
span a wide variety of industries, including automotives, building and appliances, 
electronics, cosmetics, packaging, toys and sporting goods, and more. Metallization 
can be applied to plastics to increase the gloss, reflectivity, surface conductivity, 
and the abrasion resistance. Metallization of plastics has enabled the use of plastics 
in applications traditionally dominated by metals, such as in some packaging and 
electronics components. Metalized plastics offer lower weight, better corrosion 
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resistance in most cases, and lower cost than metals. Furthermore, the metallization 
process can be controlled to tailor the electrical conductivity of the finished part. In 
electronics applications, metalized plastics can be used for attenuation of EMI/RF 
signals. For aircraft windshields, a nanolayer of transparent gold is used for 
defogging and deicing. Metallization can provide decoration for plastics in pack- 
aging and other application. 

The most common techniques of metalizing plastics include vacuum metalliza- 
tion, flame and arc spraying, and plating. Vacuum metallization is a type of physical 
vapor deposition (PVD) process that is performed under vacuum as a batch process. 
The plastic part is placed in a vacuum chamber, usually on a rotating substrate 
holder to achieve a uniform coating. Metal in the form of powder or slug is placed in 
a tray on resistance heaters. The resistance heaters evaporate the metal, releasing a 
stream of atoms directed away from the heaters in a line of sight. The evaporated 
metal atoms condense on the surface, which initially form drops that eventually 
coalesce to form a uniform coating. Common metals for vacuum deposition include 
aluminum, copper, chromium, gold, silver, and nickel. Vacuum metallization can 
produce a uniform coating free of pinholes (Athey 2006). 

Flame and plasma arc spraying are liquid metal processes that require a heat 
source to melt the metal. Flame spraying involves a flame that melts a metal wire or 
powder and propels the molten metal to the substrate with compressed air. Arc 
spray uses metal wires to create an arc that melts the metal in the wires before 
propelling the molten metal to the substrate with compressed air. Plasma spray 
involves a plasma jet the melts metal powder and propels the molten metal to the 
substrate surface. Drawbacks of liquid metal spray processes include losses of 
25 %, relatively poor adhesion, and a low-density porous coating (Athey 2006). 

Conventional electroplating involves placement of a conductive substrate into a 
solution of metal ions and an electric field to drive the metal ions to the substrate 
surface. Because most plastics are inherently nonconductive, the technique of 
electroless plating is typically applied initially to make the surface conductive. 
The basic process steps for electroless plating typically involve (1) acid etching, 
typically chromic and sulfuric acid; (2) activation with precious metals, such as tin 
and palladium, to catalyze growth of the metal layer; and (3) electroless deposition 
of a metal film from a solution (Hart 1996). An electroless plating solution typically 
contains the salt of the metal to be deposited, a reducing agent, ligands, buffers, and 
stabilizers that enhance autocatalysis (Vaskelis 2006). 

Other less common processes for metalizing plastics include sputtering and 
lamination of a metal film. Considerations that should be addressed for a metalli- 
zation process of plastic include adhesion, the coefficient of thermal expansion, and 
corrosion of the metal. Surface treatments, such as acid etching, typically can 
enhance adhesion to the metal layer. A high disparity between the coefficient of 
thermal expansion of the substrate and the metal layer can result in delamination of 
the metal coating when exposed to thermal cycles. An initial thick copper layer can 
be applied to help offset this effect. Furthermore, a thick layer of nickel or 
microporous chromium can reduce susceptibility of the metal layer to corrosion 

(Hart 1996). 
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Fig. 7 Image of PECVD equipment for PC glazing courtesy of SABIC Innovative Plastics and 
ULVAC Technologies, Inc 

Other Coatings for Plastics 

Other types of surface coatings are used, including printing for decorative purposes, 
in-mold decoration (IMD), coextruded films, and plasma-enhanced chemical vapor 
deposition (PECVD). Some solutions involve multilayer treatments and coatings, 
such as the PECVD equipment shown in Fig. 7 that is used to deposit a protective 
layer on top of a wet coating on polycarbonate (PC) used for automotive glazing 
(SABIC and ULVAC). 

The major printing processes include flexography, gravure, screen printing, 
letterpress, pad printing, lithographic printing, and ink jet printing. The reader is 
referred to several sources for more comprehensive information on other coating 
processes (Yasuda 2005; Wolf 2010; Proell; Tracton 2006b). 


Research and Development Trends 

As mentioned in the Introduction, environmental considerations are shaping the 
trends in coatings and surface treatments for plastics. Wet chemical treatments 
typically produce large amounts of hazardous waste and are being replaced with 
dry processes. Regulations that ban certain chemicals are forcing companies to 
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change chemicals and processes, such as the European Union ban on hexavalent 
chromium. Radiation curing is growing, although still a small fraction of the coatings 
market in 2010 (IHS Chemical Report 2011). Powder coatings which are solvent-free 
have been adapted for use on plastics with no VOCs or hazardous by-products (Stay 
2012). Coating and surface treatment technologies are used to enable the use of 
plastics to replace glass and metal parts to reduce the weight in transportation 
vehicles and improve fuel economy and emissions. For example, SABIC has devel- 
oped polycarbonate glazing products using a combination of wet coatings and plasma 
coatings to replace glass and reduce vehicle weight (SABIC and ULVAC). 

The use of plastics in medical applications continues to grow and is expected to 
reach 4.4 billion pounds globally by 2015 (Schlechter 2010). Research on surface 
enhancements and techniques has also followed the trends in medical plastics to 
enable greater application of plastics in implants, tissue engineering, drug delivery, 
diagnostics, and other medical and life sciences applications. For biomedical poly- 
mers used in implants and tissue engineering scaffolds, cell adhesion, growth, and 
proliferation are critical for the successful application. Plasma treatment of biomed- 
ical polymers has been shown to significantly influence the interactions between the 
polymer surface and cells (Jacobs et al. 2012). Plasma treatment of polymers is also 
being investigated for antibacterial applications, cellular-based therapies, load- 
bearing implants, immobilization of biomaterials, biosensors, and others (Althaus 
et al. 2012; Garrido et al. 2010; Ogino et al. 2011; Fandgraf et al. 2009). 


Summary 

The bulk properties of a plastic alone cannot determine the performance of a plastic 
part in a given application. The surface properties are oftentimes critical to the 
success of the application. Surface properties can be enhanced and tailored using 
various surface modification techniques, as well as coatings and finishes. 

Surface modification methods include physical treatments, liquid chemical modi- 
fications, and reactive gas surface treatments. Physical treatments may bring about 
surface damage and liquid chemical treatments produce significant chemical waste. 
Reactive gas treatments include corona, flame, and plasma processes. 
Fow-temperature plasma treatments offer an effective and versatile surface modifica- 
tion method for various polymeric materials by generating new surface functionalities 
and thus introducing the new surface properties required for many applications. 
Plasma processes applied to polymers have the merits of being economically efficient, 
dry, and environmentally benign in tailoring surface characteristics while maintaining 
their desirable bulk properties. However, plasma treatment of polymers could result in 
significant surface damage through plasma degradation reactions, which depend on 
the polymer sensitivity to plasma, the reactive chemicals and gases used, and the 
processing conditions. Dynamic surface tension techniques, such as the Wilhelmy 
balance, are effective tools for characterizing treated plastic surfaces. 

Coatings for plastics can be used for protection of the plastic in the final application 
or for imparting targeted properties. A wide variety of paints and coatings are available, 
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and the choice of coating depends on the substrate, desired properties, and environment 
intended for the part. Metalization can provide decorative functions, conductivity, and 
increase abrasion resistance of plastics. Various techniques have been developed and 
widely used for plastics, including vacuum metallization, flame and plasma arc spray, 
and electroplating (via electroless plating). Other methods for imparting functional 
layers onto plastics include printing, IMD, coextrusion, and PECVD. 
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Abstract 

This chapter covers two major classes of polymer foams, the conventional foams 
formed by foaming agents and syntactic foams. The first part presents the basics of 
polymer foaming through the use of blowing agents including a brief introduction 
on blowing agents and the common foaming methods used by both laboratories 
and industries. Current technologies used for specialty foam fabrication are 
included in the discussion. The multiple roles played by nanoparticles during 
foam formation and the effects on the foam properties are addressed in detail for 
nanocomposite foams. The various factors affecting the formation of microcellular 
foams and the properties of microcellular foams are then discussed. Two detailed 
examples for high-performance polymer foams are highlighted in the article. An 
exceptional section is devoted to syntactic foams, covering both the processing and 
the mechanical behaviors. It starts with general preparation of syntactic foam in a 
laboratory-based environment. The discussion highlights the typical mechanical 
behavior as well as the change in mechanical behavior observed when the content 
of microspheres is changed. Details on how the content of microspheres affects the 
mechanical and fracture properties of syntactic foams are presented. Besides 
looking at various content of microspheres, the existence of various toughening 
mechanisms in syntactic foams and the kind of toughening strategies can be used 
to improve the toughness of syntactic foams are also included in the section. 
Finally, some of the issues concerning polymeric foams and the latest develop- 
ments in the field including future trends are addressed. 


Introduction 

It is almost unavoidable to come across some sort of polymer foam in daily life in 
the modern world. The excellent characteristics such as superior acoustic absorp- 
tion, strength-to-weight ratio, cost-effectiveness, and ease of being processed in 
various forms make polymeric foams an attractive material to be found virtually 
everywhere. High-density polymer foams are found to replace traditional mate- 
rials in transportation and building constructions for weight reduction, while 
low-density foams are commonly used in shock absorption and rigid packaging. 
The flexible and soft polymer foams provide comfort when used for furniture and 
bedding. The low heat transfer and sound transmission inherited from the porous 
structures make them optimal thermal and sound insulators. New insights and 
innovations continue to drive the field of polymer foams forward and are expected 
to witness increasing attentions (Fig. 1). 
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Fig. 1 Photograph of products made from polymer foams. Top left : packaging and disposal 
drinking cups. Top right : running shoes. Bottom : protective cycling helmet 

Various industries are on a constant search for these lighter structures. One of the 
major factors that drive the effort to reduce the weight of structures is the increase in 
fuel prices. In particular, the transportation industries, i.e., automotive, marine, and 
aerospace industry, spend a significant amount of money to fund their R&D 
activities with the aim to reduce the weight of cars, boats, and aircraft, respectively. 
The progresses made on polymer nanocomposite laid a solid foundation for 
nanocomposite foams fabrication. These foams take advantage of the size compat- 
ibility of the nanofillers to achieve microscopic reinforcements where conventional 
fillers failed to do so. Novel porous materials are fabricated by incorporating 
functional nanofillers and the applications are extended to fields of biomedical, 
tissue engineering, electronics and areas require high temperature resistance. 

The technologies for polymer foam fabrication are constantly steered by envi- 
ronmental considerations and cost-effectiveness. Recent research is focused on 
using more environmentally friendly chemicals for foaming and less stringent 
conditions such as lower temperature or pressure. Microcellular foams attracted 
much attention since the invention in the 1980s. Common gas releasing agents such 
as CFCs were replaced by inert gases such as supercritical C0 2 and N 2 which 
almost cause zero environmental impacts. 
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Syntactic foams represent a special kind of composite materials comprising of a 
matrix and a hollow filler. The filler is often also referred to as hollow spheres with 
diameter ranging from a couple of nanometers to millimeters and can be made of 
various types of materials, including polymers, glass, metals, and carbon. Glass 
microspheres are most frequently used due to their mechanical strength, smooth- 
ness, regularity of the surface, good wetting characteristics, and low viscosity of the 
resulting foam. The extremely low density of the hollow spheres effectively 
reduced the density without severe mechanical performance deteriorations. The 
inclusion of spherical particles also eliminates the problem posed by the anisotropic 
bubbles in conventional foams and acted as a good moisture barrier due to the 
highly closed-cell nature. 


Classification of Polymer Foams 

The nature of the polymer resin being used categorized the foams into thermoset 
and thermoplastic foams, which are then further divided into rigid and flexible 
foams. A rigid foam is defined as one in which the polymer exists in the crystalline 
state or, if amorphous, is below its T g . Following from this, a flexible cellular 
polymer is a system in which the matrix polymer is above its T g . The foam cell size 
classified foams into macrocellular (>100 pm), microcellular (1-100 pm), 
ultramicrocellular (0.1-1 pm), and nanocellular (0.1-100 nm). It has to mention 
here that such a definition is necessary because according to IUPAC recommenda- 
tion, materials with pores in the nanometer scale are also termed as microporous 
(McCusker et al. 2001). Another way of classifying polymer foam is through the 
connectivity of the cells. Closed-cell foams consisted of cells which are isolated 
from each other by solid cell struts, a feature that helps to improve the insulation 
and dimensional stability and prevents moisture absorption. Normally the closed- 
cell foams have higher compressive strength and are generally denser comparing to 
opened cell foam. Open-cell foams contain cells that are connected to each other 
and formed an interconnected network. Individual cells in opened cell foams are 
interconnecting which allows the cells to be filled with whatever they are 
surrounded with. However, the opened cell foams are generally more inferior in 
mechanical performances. 


Blowing Agents 

Introducing gas into a polymer system is a crucial part of polymer foam fabrication 
process since gaseous phase defined the voids for such porous structures. During 
foaming, gas can be physically incorporated into the polymer matrix or released 
through chemical reactions. In either case, the gas-forming compound is termed as 
blowing agents (BAs). BAs are traditionally classified into two broad classes, 
chemical blowing agents (CBAs) and physical blowing agents (PBAs); in recent 
years, inert gas and gases in the supercritical state are also used. 
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Chemical Blowing Agents 

CBAs are organic or inorganic compounds or mixtures of compounds that liberate 
gas as a result of chemical reaction such as thermal decomposition or through 
interactions with other components within the formulation. 

Chemical blowing agents are generally being characterized by: 

1 . The gas number: the volume of gases being liberated by 1 g of the blowing agent 
per unit time ( 1 min) at the maximum gas liberation temperature 

2. The decomposition temperature 

3. The temperature range of the maximum rate of decomposition 

4. The rate and kinetics of the gas liberation 

5. The pressure of the foaming system developed by gas 

The abovementioned characteristics provide an approximate data, and adjust- 
ments have to be made for real foam development. 

Organic CBAs are sometimes preferred than their inorganic counterparts due to 
the better compatibility and ease in dispersion. However, the residual components 
from incomplete decomposition may act as plasticizers and reduce the foam 
performances. Table 1 lists some commonly used CBAs for polymer foaming, the 
decomposition temperature, gas released, and the estimated gas yield. Activators 
can be used to lower the decomposition temperature and rate. 


Physical Blowing Agents 

PBAs refer to compounds which change the physical state upon heating or pressure 
change and do not affect the chemical or physical properties of the host matrix. 
Common PBAs include volatile organic compounds such as low boiling hydrocar- 
bons and alcohols and compressed gases or gases in the supercritical state such as 
supercritical carbon dioxide. Highly porous solids such as activated carbon and 
silicates are often used as absorbent to facilitate the handling of volatile liquids. 


Table 1 List of commonly used CBAs for polymer foaming 


CBA 

Decomposition 
temperature (°C) 

Gas released 

o 

Gas yield (cm /g) 

2 / 2'-Azobis(isobutyronitrile) 

95-104 

n 2 

-137 

l,l'-Azobis 
( 1 -cyclohexylcy anide) 

110-115 

n 2 

-85 

Azodicarbonamide 

200-230 

n 2 , co 2 , nh 3 

-120 

Sodium bicarbonate 

150-230 

co 2 , h 2 o 

-165 

5 -Pheny ltetrazole 

240-250 

n 2 

-200 

p-Toluenesulfonylhydrazine 

110-140 

n 2 , h 2 o 

-120 

2,4,6-Trihydrazino- 1 ,3 ,5 -triazine 

245-285 

n 2 , nh 3 

-185 
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Chlorofluorocarbons (CFCs) are the most widely used PBA due to their 
incombustibility, low toxicity, and well-defined gas release temperature and rate. 
Despite the advantage, CFCs are phased out due to their negative effects in ozone 
depletion. The process of replacing CFCs speeded up after the signing of “Montreal 
Protocol on Substance That Deplete the Ozone Layer” in September 1987. 
Hydrogen-containing CFCs with shorter atmospheric lifetimes are considered as a 
potential replacement. Although they contribute much less to stratospheric ozone 
depletion, they are viewed only as temporary replacements. Intensive researches are 
ongoing to find an environmental friendly candidate. 

Supercritical fluids such as carbon dioxide and nitrogen attracted much atten- 
tion in recent years as BA for polymer foam processing for the strong advantages 
exhibited. Usually the polymer matrixes are saturated with the fluid under pressure 
and later subject to further pressure increase or heating for bubble nucleation. Such 
process allows the production of polymer foam with cell size diameter smaller than 

Q ^ 

100 pm and population density larger than 10 cells per cm which is almost 
unachievable by the addition of conventional BAs. Using supercritical fluid also 
provides a “clean” foaming process without the use of harmful organic solvent and 
eliminates the hassle of residual from unreacted BAs from the final products which 
may deteriorate the final foam performances. However, such foaming method is yet 
to be employed by the industries due to the high pressure and stringent equipment 
required. 


General Consideration on BA Selection 

There are several general guidelines to follow while selecting a suitable blowing agent: 

1. The decomposition or reaction temperature should be close to the melting 
temperature and the hardening temperature of the polymer. 

2. Chemical compatibility enhanced the dispersion of BA in the matrix. 

3. Gas must be liberated within a narrow temperature range. 

4. Highly exothermic reaction should be avoided to prevent the polymer matrix 
from thermal damage. 

5. The liberated gas and residual component should not have negative effect on the 
polymerization process of the polymer matrix. 

6. The gas released should be readily dispersed and dissolved in the polymer melt. 

7. Cost-effectiveness. 

8. The final cell structure required. 

Foaming Processing 

Since the invention of polystyrene (PS) foam in the 1930s, foaming technologies 
progressed and matured. Nowadays, it is possible to produce foams of nearly every 
polymer resin through one or more of the following processes: 
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Injection Molding Process 

Polymer matrix and blowing agents (BA) are processed conventionally in the screw 
injection molding equipment equipped with gas leakage preventive measures such 
as nozzle shutoff valve or additional external control gate. Gases released from BA 
through thermal decomposition are dissolved in the polymer melt and injected 
directly into an enclosed mold cavity to produce foam products as the pressurized 
gases expand when subjected to atmospheric pressure. BA decomposition should 
occur during plasticizing process and remain dissolved in the polymer melt until the 
gas-melt mixture is injected to the enclosed mild. 

This process often produces foams with a sandwich structure consisting of a 
continuous foam skin and cellular core and is commonly used to produce thermo- 
plastic polymer foams such as high-density polystyrene (PS) foams, both rigid and 
flexible, poly (vinyl chloride) (PVC) foams, acrylonitrile butadiene styrene (ABS) 
structural foams, and thermoset foams such as polyurethane foams. 


Reaction Injection Molding (RIM) 

Reaction injection molding systems combine two or more liquid components that 
chemically react in a closed mold to form polymer foam, taking the intricate shape 
of the mold. Figure 2 shows a simple schematic illustration of the RIM process. The 
components are precisely controlled at stoichiometric ratio before combined at the 


Fig. 2 Schematic diagram of 
reaction injection molding 
process 
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mixing head by high velocity impingement mixing. The process is usually carried 
out at low temperature involving comportments of low viscosity which greatly 
reduce the pressure required to drive the components into the mold. RIM is widely 
used in industries for the production of large, complex parts such as bumpers for 
vehicles, panels for electrical equipment, enclosures for medical devices, and 
housings for computer and telecommunications equipment. Recently applications 
are extended to microcellular foam fabrications. 


Expandable Polymer Pellets 

The polymer pellets are first impregnated with BA during the polymerization process. 
These pellets are then subject to heat for free expansion or confined expansion. For 
example, freely expanded PS pellets are commonly used in packaging and as cushion 
fillings. For confided expansion, the expandable pellets filled up the entire 
predesigned mold and fused together forming foam products with well-controlled 
density, shape, and sizes. Disposable foam cups are an excellent example of polymer 
foams produced by confined expansion from expandable PS pellets. 


Extrusion Foaming Process 


STYROFOAM™ is perhaps the most popular type of polymer foams known to 
everyone. However, it should not be mistaken as the white disposable foam cups 
made from expansion process explained in the previous paragraph. STYRO- 
FOAM™ is a registered trademark for a line of extruded polystyrene foam products 
made exclusively by The Dow Chemical Company for thermal insulation and craft 
applications (Company DC. It is not a cup). 

The typical extruders as shown in Fig. 3 used for bulk polymer products are 
commonly used with necessary modifications which mix BA into the polymer melt 
homogeneously with applied pressure. The sudden drop in pressure led to rapid cell 
growth as soon as the melt is being extruded out from the die’s orifice. 

Continuous extrusion is preferred from the economical point of view due to the 
higher throughput and versatility in the properties and shapes of the products 
obtained. Besides the traditionally used polymer resins for foam fabrication, trends 
are moving toward the use of nonconventional foam materials such as biodegrad- 
able polyhydroxy alkanoates (Liao et al. 2012). 

In general, an ideal foaming system consists of a polymer with a gelation time that 
coincides with the time required for rapid gas liberation; hence the solidifying 
polymer will be able to “trap” the gas bubbles inside the matrix. Prior to foaming, 
it is necessary to characterize the BA and to obtain the viscosity profile in order to 
establish a balanced system for proper foam fabrication. An example is given in Fig. 4 
which illustrates that 220 °C is the “ideal” foaming condition in which gas liberation 
and polymer cross-linking occurred within the same time frame. In contrast, when the 
polymerization process took too long a time, the rising foam may collapse before the 
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Fig. 3 Typical extrusion machines used for polymer processing 
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Fig. 4 Diagram showing the relationship between gas liberation of azodicarbonamide and 
viscosity change of resorcinol-based phthalonitrile at 190 °C and 220 °C 
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polymer matrix failed to gain enough gel strength to form a stable foam system. 
Similarly, if polymerization occurred before gas liberation, the resulting foam may 
have a density much higher than expected. Cell growth was inhibited when the gas 
generated was unable to overcome the rapidly increasing gel strength. 


Nanocomposite Foams 

There are many emerging applications involving the dispersion of nanoparticles into a 
polymer matrix with the aim to enhance the properties of the composite material. One 
of the main reasons why nanoparticles are able to improve the properties of polymer 
resins is their large surface-to-volume ratio. The large ratio increases the number of 
particle-matrix interactions, thus increasing the effects on the overall material 
properties even at rather low filler loadings. The nanoscaled fillers are especially 
beneficial for foam property enhancements primarily because the foam cell walls are 
normally within the submicron regime which conventional fillers are incompatible in 
terms of size. Improvements in thermal, electrical, and mechanical properties could 
be achieved by synergistically combining the properties of the matrix and the fillers 
without altering the desired density or the foam morphology. 

Traditionally three types of nanofillers of distinct geometries as being illustrated 
in Fig. 5 are used. OD nanofillers are being characterized by having all the three 
dimensions in the nanometer scale. Typical examples include spherical silica 
particles, nanocrystals, and metal particles. Typical examples of ID nanofiller are 
nanotubes and nanofibers which feature two dimensions in nanoscale. The last type 
of nanofillers, 2D, has a lateral dimension in the range of several hundreds of nano- 
to micrometers and nano sc ale thickness. 

Conventional nanofillers, such as organoclay, are one of the most popular 
nanofillers being used for nanocomposite fabrication. However, these fillers cannot 
be used as cell nucleating agents in high-processing-temperature polymers due to 
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Fig. 5 List of nanofillers, the geometries, and surface area to volume ratio 
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thermal degradation of the organic modifiers. For such applications, nonconventional 
organic modifiers or inorganic nanofillers should be used. Graphene has demonstrated 
to be an excellent candidate for its thermal stability and has shown its potential in 
improving the mechanical, thermal, and electrical properties. To produce single 
graphene layers for mass manufacturing of nanocomposite foams still poses 
challenges in both the fabrication method and the economic aspects. Recent devel- 
opment at Michigan State University on the production of robust graphene sheets of 
one to five layers thick and diameters ranging from less than 1 pm to over 100 pm at 
cost-competitive prices could be an effective solution. 


Processing of Nanocomposite Foams 

In general, two separate steps are involved in the processing of nanocomposite 
foams: the synthesis of nanocomposite and subsequent foam formation through 
different foaming techniques as explained in the previous paragraph. It is well 
understood that particles, especially in the nanorange, tend to agglomerate due to the 
dominant intermolecular van der Waals interactions between them. Possible incom- 
patibility between the nanofillers and the polymer matrix makes it even more 
challenging to achieve uniform dispersion. Hence, surface modifications are usually 
carried out to promote interactions between the nanofiller and the matrix in order to 
overcome the strongly bonded nanofiller aggregates. Numerous studies and research 
were carried out to break up the nanofiller agglomerates for homogeneous 
nanocomposite preparation and were well documented in various academic and 
industrial publications and patents. A detailed survey on nanocomposite preparation 
is beyond the scope of this chapter. The focus of this part of the chapter is placed on the 
effects of nanofillers on the foaming process and foam properties. Besides the perfor- 
mance enhancement exhibited in nanocomposite, nanofillers may serve as heteroge- 
neous nucleation centers facilitating the formation of bubbles, improving the cell 
density, homogenizing the cell size, and altering the rheological properties of the 
foaming polymer affecting bubble stability, foam morphology and the foam density. 

Nucleating Effect 

Adding nucleation fillers to improve the cell density, which is defined as the number of 
bubbles per cm 3 , is a common practice in foam manufacturing processes. It is generally 
understood that nanofillers with low surface energy will reduce the free energy (DF) (see 
Eq. 1). Reduction of free energy is required for bubble initiation in liquid by lowering the 
surface tension at the liquid-solid interface (Klempner and Frisch 1991): 

AF = y ■ A (1) 

where y is the surface tension and A is the total interfacial area. 

Figure 6 shows how the cell size in polystyrene (PS) foam was reduced by a 
great extend and the cell density was increased by at least two orders of magnitude 
comparing to the pure foam by adding a small amount of nanofillers. 
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Fig. 6 Cell morphologies of (a) PS foams, (b) PS/1 % CNFs, and (c) PS/0.1 % CNTs (Lee 
et al. 2005) (PS polystyrene, CNFs carbon nanofibers, CNTs carbon nanotubes) 


A fine dispersion of nanofillers will greatly enhance the nucleation efficiency, 
resulting in increased cell density and reduced cell size. This can be easily under- 
stood as more gas was being consumed by bubble nucleation at the nanoparticle 
sites. Simultaneously, less gas would be available for bubble growth, hence reduced 
the final cell size. The classical steady-state nucleation theory is often used to 
qualitatively describe the number of nucleation sites (see Eq. 2): 

A No = CJ a exp (~ AG "% bT ) (2) 

where A G crit is the critical nucleation formation energy and is described as per 
Eq. 3. k B is the Boltzmann factor, and T is the absolute temperature. C 0 is the 
number of gas molecules dissolved per unit volume of the primary phase; f 0 is a 
kinetic pre-exponential factor. 
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A G crit = u ™ > (3) 

/ 3A P 2 

AP is the pressure difference inside and outside the nucleating bubble. 
However, it has to be pointed out that the data obtained by the theory shows a 
great deal of discrepancy from actual experiments because the nucleation formation 
energy is dependent on the critical nucleus size, which is an inaccessible parameter 
during actual foaming process. Lee et al looked into heterogeneous nucleation 
based on the classical steady-state nucleation theory to qualitatively describe the 
number of nucleation sites (TV,-) generated by nanoparticles (Spitael et al. 2004; 
Zettlemoyer 1969; Colton and Suh 1987a, b) by introducing/}, a frequency factor of 
gas molecules joining the nucleus, and Ci to account for the concentration of the 
heterogeneous nucleation sites (see Eq. 4). 

Ni = Cifiexp(- AG %/ kBT ) (4) 

where the energy required to form a nucleus is considered proportional to the 
energy required in a homogeneous system by a factor dependent on the contact 
angle between the gas and polymer and particle surface: 

AGS = lfc 7 J (^). s (» ) 

S(0) = (2 + cosO ) (1 — cosO ) 2 

A general selection guideline for nucleating agents was 
and Leung (McClurg 2004; Leung et al. 2008): 

• The nucleation agents should be able to lower the surface energy barrier needed 
for bubble initiation relative to homogeneous nucleation and unintentional 
heterogeneous nucleation caused by contaminants in the polymer matrix. 

• Ideal nucleating agents should have uniform sizes, surface geometries and 
surface properties and should be easily dispersible. 

• A rugged surface as illustrated by Lig. 7 that contains many conical crevices of 
small semi-conical angles (P) is preferred. 


( 5 ) 

( 6 ) 

proposed by McClurg 


Qualitatively, nucleating agent with small contact angle and high surface cur- 
vature causes more effective critical energy reduction and results in higher effi- 
ciency (Lletcher 1958). However, high nucleation efficiency will be achieved not 
only by choosing the right geometry and surface but, more importantly, through a 
good dispersion process (Colton and Suh 1987a, b; Lee et al. 2005). Carbon 
nanofibers are more effective nucleating agents compared to carbon nanotubes 
and nanoclay due to their increased homogeneous distribution and their favorable 
surface and geometrical characteristics (Lee et al. 2005). 


138 


X. Hu et al. 


Fig. 7 Surface geometry of 
nucleating agents with rugged 
surface, bindicate the semi- 
conical angle (Adapted and 
redrawn from Leung 
et al. ( 2008 )) 



Bubble Stabilization 

By nature, liquid foams before solidification are thermodynamically unstable. 
Movements of the liquid cell through capillary actions and gravity further promote 
the collapse of the foams due to cell thinning. Bubble stabilization is generally 
achieved in two ways. At first the melt viscosity increases or stabilizes by the 
adsorbed particles on the foam cell surface. When the viscosity or melt strength of 
the liquid cell wall has increased, the force required to overcome pressure differ- 
ence increases. This effect combats excessive liquid cell movement and hence 
slows down the cell thinning process and stabilizes the cell structure. There have 
been several recent examples of foams being stabilized by particles adsorbed at 
air/liquid interface forming a rigid shell that protects the bubbles against coales- 
cence (Dickinson et al. 2004; Hunter et al. 2008; Gonzenbach et al. 2006). 


Foam Properties 
Thermal Properties 

Well-dispersed nanofillers generally improve the thermal properties of polymer 
foams. Several mechanisms have been proposed to explain the phenomenon: (i) the 
barrier effect delayed the escape of volatile decomposition products during degra- 
dation; (ii) the nanofillers created a tortuous path for air, delaying the thermo- 
oxidative degradation of the material; and (iii) the thermally conductive nanofillers 
could ease the heat dissipation within the matrix. Figure 8 shows that the addition of 
0.1 wt% of functionalized graphite sheets and carbon nanotubes shifted the onset of 
degradation to higher temperature and improved the char yield by about 50 % 
compared to the control silicone foam. 

Compression Properties 

The compressive properties of foam are highly dependent on their apparent density; 
hence, the compressive parameters are usually normalized to exclude the effect of 
density variations. Verdejo et al. demonstrated that the addition of CNTs and 
functionalized graphene sheets (FGS) into flexible silicone foams caused drastic 
changes in the compressive behavior (Verdejo et al. 2008) (see Fig. 9). The 
combination of density change and the reinforcement effect of the nanofillers 
increased the normalized Young’s modulus by over 200 % through the addition 
of only 0.25 wt% of FGS. Similar behavior was observed for the case of rigid 
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Fig. 8 Weight loss as a function of temperature obtained by thermogravimetric analysis ( TGA ) 
(Verdejo et al. 2008) 

foams; see Fig. 10 which shows the relationship of compressive parameter of rigid 
polyurethane foams reinforced by multiwall carbon nanotubes (MWNT) and car- 
bon nanofiber (CNF), respectively. 

It should be pointed out that not all nanofillers lead to property improvements. 
For the case of a rigid foam system of PU, hydrogen bonds within the structure 
network play a prominent role in the mechanical properties. The addition of organic 
fillers, such as organoclay, may interfere with the formation of hydrogen bonds, 
hindering the structural formation and causing property deterioration. Hu and 
co-workers studied the effects of nanofiller content on the compression properties 
of rigid phthalonitrile foams. Results, as shown in Fig. 11, indicate that the specific 
compressive stress improvement is dependent on the type of filler incorporated. 
Both MWNT and expanded graphite seem to be able to improve the compressive 
stress. However, excessive amount of filler resulted in negative results caused by 
aggregation of the nanofiller creating a point of failure initiation. On the contrary, 
fumed silica caused the foam property to drop regardless of the loading content 
caused by poor interfacial interactions between the filler and the matrix. 


Microcellular Foams 

While the conventional foaming techniques produced foams with cell size in the 
order of 100 or more micrometers, the idea of microcellular foams emerged in the 
early 1980s as a means to reduce the cost of mass-produced polymer items 
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Fig. 9 Compressive stress-strain behavior of silicone foam nanocomposites (Verdejo et al. 2008) 

(Martini- Vvedensky et al. 1984). Microcellular foams are typically rigid structures 
consisting of closed-cell or open-cell sizes ranging from a few to tens of microme- 

Q ^ 

ters and a cell density greater than 10 cells/cm . The environmental friendly 
products are usually produced without the use of BAs such as CHCs and are widely 
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Fig. 10 Normalized compressive modulus and strength of rigid polyurethane foam (Dolomanova 
etal. 2011) 


used in the food packaging industries. The automotive industries start to replace 
parts with the low shrinkage, weight-reducing microcellular foams; this high 
strength-to-weight ratio material even lands itself in the aeronautical and transpor- 
tation industries. Recent progress made on open-cell microcellular foams widen the 
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□ FS 


Filler C onte nt (wt% ) 


□ Pd'e Foam 


7 21 


■ liflWVNT 


10.92 7.73 


Fig. 1 1 Specific compression stress as a function of nanofiller content at foam density of 

Q 

0.12 g/cm (FS fumed silica, MWNT multiwalled carbon nanotubes, GH expanded graphite) 

potential applications to tissue engineering. The unique optical performances 
exhibited by a sheet developed by microcellular foaming technology triggered 
considerable interest in using these materials as reflector and diffusion sheet in 
liquid crystal displays by tuning the cell morphologies (Lee et al. 2011). 


Processing of Microcellular Foams 

Solid-state batch foaming process (see Fig. 12) is a straightforward foaming process 
commonly used for microcellular foam preparation. The process is carried out in 
two main steps. The free volume of a semisolid or solid polymer is first saturated 
with inert foaming gas, such as N 2 or C0 2 , or a physical blowing agent inside a high 
pressure vessel. According to Henry’s law, the amount of gas dissolved in the polymer is 
directly proportional to the applied pressure; hence pressure is usually applied to 
improve the relatively low solubility of the gases. Besides pressure, sufficient time 
must be given to the system to attain an equilibrium concentration that is consistent with 
the solubility of gas in the polymer and the gas pressure. The fully saturated system is 
then driven to the supersaturated state by depressurization or increasing the temperature. 
Both strategies will reduce the solubility of the gas resulting in a thermodynamic 
unstable system which provokes a phase separation as the driving force for gas 
nucleation and growth. One consequence of dissolving gas in the polymer is the 
reduction of the polymer’s glass transition temperature due to plasticization effect. If 
heating is employed, the temperature of the gas-saturated polymer only needs to be 
raised to the glass transition temperature of the gas-polymer system to nucleate bubbles. 

The resultant microcellular structure and properties are affected by the proper- 
ties such as the presence of nucleation sites, polymer matrix crystallinity, polymer 
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Fig. 12 Schematic of the batch process to make solid-state microcellular foams 


melt strength, and gas solubility. To further complicate the process, these properties 
are heavily dependent on the processing conditions such as temperature, pressure, 
and gas saturation time. 

Well-dispersed nanoparticles, such as fumed silica, clay, carbon nanotubes, zinc 
stearate, and titanium oxide, can promote bubble initiation by acting as heteroge- 
neous nucleation centers. The nucleation efficiency can be predicted from the 
classical nucleation theory as described earlier. 

Amorphous polymers are preferred over the highly crystalline structures since 
gas solubility in the crystal structure is substantially lesser than that in the amor- 
phous region. The solubility of gas in a semicrystalline polymer matrix decreases as 
the crystallinity increases. As a result, the void fraction and cell size will be 
reduced. Liao et al used long-chain polypropylene to show the effects of spherulite 
on cell density at different stage of foaming. When first formed, the spherulite 
caused structural heterogeneity in the polymer matrix and increased the cell density 
by heterogeneous bubble nucleation. The growing spherulite then increased the 
polymer melt viscosity and lowered the possibility of cell coalescence. The cell 
density further increased through homogeneous nucleation when the larger spher- 
ulite excluded the bubble-forming gas out of the crystalline regions. 

Figure 13 shows the change in morphology under different saturation pressure 
by keeping the temperature constant. Increasing the pressure will force more gas 
into the polymer matrix, hence increasing the number of nuclei available for bubble 
initiation; as a result, cell size decreases and cell density increases. 

Foaming temperature is an important parameter in microcellular foaming pro- 
cess because nearly all the physical properties, such as viscosity, surface tension, 
gas solubility and diffusivity, and so forth, are temperature sensitive. While trying 
to increase the diffusion rate of gas into the polymer matrix by increasing the 
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Fig. 1 3 Porous PS foamed at 80 °C; pressure at (a) 180, (b) 230, (c) 280, (d) 330, and (e) 380 bar; 
and depressurization time <20 s (Tsivintzelis et al. 2007) 


temperature, the solubility of the gas is reduced; it is important to establish a proper 
reaction balance. Foaming at temperature near the melting point of the polymer 
matrix should be avoided. As the foaming temperature approaches the melting 
point, the polymer chain stiffness and viscosity decrease when the chains gain 
mobility. Such system loses the ability to prevent the gas from diffusing out of the 
polymer; as a result cell nucleation and growth was suppressed. 

The solid-state batch process was widely used to generate microcellular foams 
from a number of amorphous and semicrystalline polymers, such as polystyrene (PS), 
polyethylene (PE), high-density polyethylene (HDPE), polyvinylchloride (PVC), 
polycarbonate (PC), acrylonitrile butadiene styrene (ABS), polyethylene terephthal- 
ate (PET), crystallizable polyethylene terephthalate (CPET), polyetherimide (PEI), 
polysiloxane, and polylactic acid (PL A), just to name a few. Although being termed 
“solid-state” batch processes, this method can be applied to injection or extrusion 
molding for molten polymer. Both strategies suddenly reduce the solubility of the gas, 
driving the gas out of the polymer matrix and into nucleated bubbles. However, the 
low gas diffusivity requires an extended period of time to achieve saturation in a 
highly pressurized vessel; this cost-ineffectiveness became the major hindrance for 
industrial scale production. 


Foam Properties 

The microcellular foams represent a class of novel materials which differs from the 
conventional foams which fall in the low-density region (relative density less than 
0.1) or the structural foams (relative density greater than 0.5). When first being 
introduced, it was hypothesized that, by introducing sufficient amount of voids of 
size smaller than the size of the critical flaw existed in the system would effectively 
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reduce the polymer density while maintaining the mechanical properties. However, 
there was no report on the critical flaw size or quantitative data to support the 
hypothesis. In fact, studies on various microcellular foams showed that mechanical 
properties decreased as the density was lowered. 

The modulus of microcellular foams can be reasonably estimated by the 
Gibson-Ashby cubic cell model (Gibson and Ashby 1999), which predicts that 
the relative tensile modulus equals the square of the relative density. Kumar 
(Kumar et al. 1994) show that the tensile strength of microcellular foams decreases 
in proportion to the foam density and can be approximated quite well by the rule of 
mixtures. Thus a 50 % relative density foam can be expected to have 50 % of the 
strength of the solid polymer. 

It was suggested that the microvoids in the microcellular foam will resist the 
crack propagation process, hence improving the energy absorption ability. How- 
ever, this observation is not always true as impact tests are sensitive to the polymer 
matrix and the test conditions. The Charpy notched impact strength of microcellular 
PC matrix shows that brittle breaking behavior can be increased by reducing the cell 
size to less than 1 pm or increasing the polymer toughness. On the contrary, 
foaming leads to a drop in the impact strength for microcellular PC breaks tough 
under the same test conditions (Bledzki et al. 2010). The studies carried out by 
Juntunen et al using Gardner impact tester showed that the impact resistance of 
microcellular PC decreases linearly with relative density over the densities ranging 

Q 

from 0.6 to 1.0 g/ cm (Juntunen et al. 2000). PVC with 25 % density reduction 
reduced the normalized mean failure energy by approximately 37 % comparing to 
the unfoamed PVC. 

High-Performance Polymeric Foams 

The growing interest in replacing traditional metal foams with polymeric foams 
generated a great drive in the development of high-performance polymeric foams 
which are rigid, thermally stable polymeric foams for high-temperature (HT) 
applications. This new material takes the advantage of the thermal stability and 
chemical resistance of polymer while maintaining the low weight. A foam material 
with such combination of properties has potential applications in space technology; 
as the resulting nanocomposite foam would have ranges of stiffness and resilience 
that are outside the limits of pure polymer foams, be flame resistant, demonstrate 
electrical and thermal conductivity and yet be both light weight and cost-effective 
space stable materials. 


Polyimide Foams 

Polyimide demonstrates outstanding advantages of being high-temperature resis- 
tant and oxidation resistant and is one of the most widely chosen candidates 
for high-performance polymer foam fabrication to be used in areas requiring 
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Fig. 14 Mold concept for TEEK polyimide foaming (Adapted and redrawn from Nelson and 
Wilkie (2001)) 


Table 2 Thermal properties for TEEK polyimide foams (Nelson and Wilkie 2001) 


Property 

Test 

method 

TEEK- 

HH 

TEEK- 

HL 

TEEK- 

L8 

TEEK- 

LH 

TEEK- 

LL 

TEEK- 

CL 

Density (G/cm 3 ) 

ASTM 

D-3574 

(A) 

0.08 

0.32 

0.128 

0.08 

0.32 

0.32 

Thermal 
stability, temp. 

°C 

10 % wt 
loss 

518 

267 

522 

520 

516 

528 

50 % wt 
loss 

524 

522 

525 

524 

524 

535 

100 % wt 
loss 

580 

578 

630 

627 

561 

630 

Glass transition 
Tg, temp. °C 

DSC 

237 

237 

283 

278 

281 

321 


flame-retardant materials and fire protection, thermal and cryogenic insulation, 
gaskets and seals, vibration damping pads, spacers in adhesives and sealants, 
extenders, and flow/leveling aids. 

NASA Langley Research Center (LARC) developed polyimide foam from a 
large number of monomers and monomer blends. The specific densities of these 

Q 

foams can range from 0.008 to 0.32 g/cm . Figure 14 shows a simple illustration of 
the foaming concept. The foam precursor powder was placed in the foaming mold 
and subject to 140 °C for 60 min using heat plates on the top and bottom. The mold 
was then rapidly transferred to a set at 300 °C and held for 60 min before cooled to 
room temperature. Table 2 summarized the thermal stability of the as-fabricated PI 
foam in nitrogen at 10, 50, and 100 wt%, respectively. The T g obtained by 
differential scanning calorimeters (DSC) was one of the highest among polymer 
foams known till today. 
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Phthalonitrile Foams 


Phthalonitrile (PN)-based resin was proven to be an extremely thermally resistant 
polymer with service temperature more than 300 °C after proper post-curing 
treatments. Keller from the United States Naval Research Laboratory has inten- 
sively synthesized and patented a series of phthalonitrile-based monomers over the 
past few decades, with bisphenol (Laskoski et al. 2005), aromatic ether (Keller 
1994), and phosphine oxide (Laskoski et al. 2007) as spacer linkages incorporated 
between the terminal phthalonitrile units. Hu and Liu developed foams based on PN 
polymer targeted for high-temperature applications. The one-step foaming process 
as shown in Fig. 15 involved proper synchronization of gas decomposition from BA 
and polymerization. The resulting foam density can be precisely controlled through 

Q 

proper processing control with the lowest obtainable density of 0.04 g/ cm . 
A typical PN foam produced by Hu and Liu is shown in Fig. 16. 

Table 3 summarizes the thermal oxidative parameters of PN foam. Char yields of the 

Q 

0.2 g/cm foams were found to be 82.3 and 68.5 wt% in N 2 and air, respectively. The 
values were comparable to its void-free samples and unattainable by most thermoset 

o 

foams. Even at a density as low as 0.08 g/cm , the char yield at 800 °C in N 2 was 
maintained at 63.5 %. The high thermal stability was attributed by the high cross-linking 
density and the thermally stable macrocyclic stmctures formed during cross-linking. 

Figure 17 shows the percentage of compression stress retention of PN-based 
foams after thermal aging at 280 °C for 100 h. Foams with density larger than 

Q 

0.12 g/cm achieved 95 % property retention. The results proved the potential of 
such foams to be used for HT applications for extended period of time. 
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Fig. 15 Schematic of the one-step foaming process of PN foam 
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Fig. 16 Photograph of PN foam of density 0.15 g/cm 3 


Table 3 Thermal oxidative parameters of PN foams obtained by TGA 


Density 

Onset degradation °C 
(5 wt% loss) 

Char yield at 600 °C 

(wt%) 

Char yield at 800 °C 

(wt%) 

n 2 

Air 

n 2 

Air 

N 2 

Air 

0.08 

496 

483 

78.7 

72.3 

63.5 

13.5 

0.12 

478 

465 

77.2 

70.5 

63.8 

13.6 

0.15 

490 

481 

79.6 

73.6 

66.2 

14.6 

0.20 

511 

500 

82.3 

77.7 

68.5 

17.8 


Syntactic Foam 

For structural composite applications, often sandwich composite constructions are 
being used. Sandwich composites are made up of thin but stiff fibrous skin layers 
and a lightweight core material. The core material is normally made of a 
low-strength material, but due to its larger thickness, it provides the sandwich 
composite with high bending stiffness with overall low density. Typical core 
materials include materials like polyvinylchloride, polyethylene, polystyrene, and 
balsa wood. However, for the past few years, there is an increasing trend in the use 
of a special type of foam material, called syntactic foam. The concept of a syntactic 
foam is shown in Fig. 18. 

The term syntactic, which originates from the Greek word “syntaktikos” (= orderly 
disposed system), indicates a constructed foam, in the sense that the material is 





4 Polymer Foam Technology 


149 



* - a - * 1 - ■ 


( A A A A A I A A * A A 4 £ A A A 

*• U w U w » ■ v ■ 1 4 j . i v U *- U 

D ensity (g/cm 5 ) 

Fig. 17 s y , s 50 ( fully filled symbol ) as a function of foam density and s 50 retention ( hollow symbol ) 
for foams subjected to thermal aging at 280 °C for 100 h in air 



Binder 





Hollow Filler 



Syntactic Foam 


Fig. 18 Concept of syntactic foam 


manufactured by a specific mixing procedure of fillers and binders with appropriate 
volume fractions. The hollow filler is randomly dispersed in the matrix, creating a 
homogeneous material exhibiting isotropic macroscopic behavior. Since the spheres 
have continuous shells, the final material can be considered as a closed-cell foam. 
Typical photograph and SEM image of syntactic foam are shown in Fig. 19. 

The advantages of syntactic foam over the conventional foams are as follows: 


The mechanical properties of syntactic foam, in particular the compression 
properties, are several orders of magnitude higher than the properties of the 
lighter (traditional) foams. 

No blowing agents are required to create the porosity. 

The density of the syntactic foam before curing is fairly the same as the density after 
curing which makes the synthesis of these materials fairly simple (see Fig. 20). 




150 


X. Hu et al. 



Fig. 19 Typical photograph and SEM image of syntactic foam 


Fig. 20 Density variation 
with microsphere content for 
syntactic foam containing 
various types of 
microspheres: (■) K15 glass, 
(A) K46 glass, and (•) 
phenolic microspheres 




• Due to its closed-cell structure, syntactic foams absorb only minimal amounts of 
liquid which makes syntactic foam an interesting material for the marine and 
offshore applications. 

Although the use of syntactic foam has only increased sharply in the twenty-first 
century, it has been around for nearly half a century. Syntactic foams originate from 
the early 1960s (Shutov 1991). It was initially used as a buoyancy material for 
marine applications. Current applications for syntactic foam include buoyancy 
modules for marine riser tensioners, boat hulls, deep-sea exploration, autonomous 
underwater vehicles (AUV), parts of helicopters and airplanes, soccer balls, etc. 


Processing of Syntactic Foam 

This paragraph describes how, in general, a syntactic foam can be processed in a 
laboratory-based environment. The first step is to mix the base resin and the 
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hardener according to a predefined stoichiometric ratio. Mixing can be done in a 
glass beaker or disposable cup and stirring can be performed using a glass rod. The 
obtained mixture has to be stirred by hand for a couple of minutes until the filaments 
visible with the naked eye, due to the contact of the hardener and base resin 
component, are no longer visible. Stirring should be performed cautiously to 
avoid the formation of air bubbles. 

Once the base resin and hardener have been stirred thoroughly, the microspheres 
can be added to the mixture. The amount of microspheres to be added depends 
on the preferred micro structure of the syntactic foam. It is important to 
understand that the microspheres should be added to the mixture in small portions 
to avoid resin necking. Resin necking occurs when a group of microspheres is 
trapped inside the resin, affecting the mechanical and fracture properties of 
syntactic foam. 

The viscosity of the resulting uncured syntactic foam depends on the amount of 
microspheres added. For microsphere volume fractions of 30 % and below, the 
viscosity of the uncured syntactic foam is low enough to be poured into a pretreated 
mold. For mixtures having more than 30-40 volume percentage of microspheres, 
the viscosity of syntactic foam gets too high and the mixture is not able to flow into 
the mold. The mixture has to be scooped into the mold and requires some pressure 
to fill up the mold. 

The main challenge in processing syntactic foam lies in the choice of the process 
parameters, i.e., mixing temperature and mixing time and the addition sequence of 
the components (base resin, hardener, hollow microspheres). It is important that the 
viscosity of the mixture should rise rapidly once placed into the mold to avoid 
lamination of the mixture. 


Mechanical Behavior of Syntactic Foam Versus Filler Volume 
Fraction 

It is well known that the mechanical behavior of syntactic foam is far superior 
compared to conventional foams (see Table 4). It is obvious that the density and the 
properties of these foams will vary with the quantity and type of microspheres. 
Wouterson et al have done extensive research on how the content of microspheres 
affects the mechanical and fracture properties of syntactic foams (Wouterson 


Table 4 Typical properties of various foam materials 


Type of foam 

Compression 
strength (MPa) 

Tensile strength 
(MPa) 

Tensile modulus 
(MPa) 

Density 

(kg/m 3 ) 

PVC 

1.2 

2.2 

80 

80 

PS 

0.5 

0.85 

3.44 

35 

Glass-epoxy syntactic 
foam (30 % vol K46 
glass spheres) 

76 

26 

3200 

900 
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et al. 2004, 2005). Besides looking at various content of microspheres, they also 
studied the existence of various toughening mechanisms in syntactic foams as well 
as the kind of toughening strategies that can be used to improve the toughness of 
this kind of foams (Wouterson et al. 2007a, b). 

The work presented here highlights the typical mechanical behavior of a syn- 
tactic foam as well as highlights the change in mechanical behavior observed when 
the content of microspheres is changed. The work as presented focuses on three 
types of microspheres, namely, K15 and K46 hollow glass microspheres from 3M 
and BJO-093 phenolic hollow microspheres from Asia Pacific Microspheres. The 
details of these different microspheres are given in Table 5. 

Specific Compression Properties 

As was already highlighted in Table 4, syntactic foams exhibit phenomenal com- 
pression strength that have been reported extensively by various researchers (Gupta 
et al. 2001; Bunn and Mottram 1993; Kim and Plubrai 2004; Rizzi et al. 2000). To 
understand the performance, it is important to understand what happens inside the 
syntactic foam during compression loading. Examples of typical compression 
stress-strain curves of syntactic foam are shown in Fig. 21. 


Table 5 Typical properties of various types of hollow microspheres 


Micro sphere 

Q 

Density (g/cm ) 

Static pressure (MPa) 

Mean diameter 
(pm) 

Wall thickness 
(pm) 

BJO-093 

0.25 

3.44 

71.5 

1.84 

K15 

0.15 

2.07 

70 

0.70 

K46 

0.46 

41.37 

43.6 

1.37 



Engncer n (nnlnrl) 


Fig. 21 Typical compression stress-strain curve for syntactic foam 
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Fig. 22 SEM image of 
syntactic foam that has been 
loaded up to region II; the 
black arrows indicate the 
direction of applied load 



It is clear that all curves are made up of three different sections. The first region 
(I) is characterized by an almost linear-elastic behavior of the syntactic foam. The 
region ends when the material starts to yield and reaches its compressive yield 
strength. Upon yielding, the microspheres become crushed under the compression 
load and severe damage occurs. Yielding and inelastic damage occur during the test 
as the sides of the specimen barrel outward under compression loading. 

The second region (II) of the flatwise compression curves in Fig. 21 is charac- 
terized by relatively horizontal plateaus. The horizontal plateau is attributed to the 
implosion of the hollow microspheres under the increasing compression load (see 
Fig. 22). The elliptical voids indicate that the microspheres have been crushed 
under compression loading and the remaining pore has been deformed to an 
elliptical shape. Intact microspheres and sphere-shaped voids can also be observed 
indicating that the syntactic foam has not reached it maximum compression strain 
but is still in the horizontal plateau of the load vs. displacement curve. It is also clear 
from Fig. 18 that syntactic foams with higher fractions of hollow microspheres 
show a larger horizontal plateau and thus a larger strain which is in-line with the 
above observation of microsphere crushing leading to compression strain. 

The third region (III) is characterized by a steep increase in the load-displacement 
curve. The steep increase is caused by a large number of microspheres being cmshed 
and compacted, and the maximum density is being reached. The point, where region 
III starts, is considered to be the point of failure for the syntactic foam as this is the 
point where most of the load-bearing microspheres have been crushed. 

Although the results presented in Fig. 21 are given as absolute values, it should 
be understood that in order to compare the different types of foam, the values 
should be represented as specific values to compensate for the change in density. 
The overall results of the specific compression strength and modulus for various 
contents of microspheres are given in Fig. 23a, b. From Fig. 23a, it can be observed 
that the specific compressive yield strength, a yc /p, decreases with increasing filler 
content for phenolic and K15 glass microspheres. The microspheres act as voids 
and weaken the structure. Nevertheless, an upward trend in a yc /p is observed for 
K46 microspheres. The upward trend is attributed to a relatively minor decrease in 
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Fig. 23 Specific compressive properties of syntactic foam containing various microspheres 
(a) Specific compressive yield strength and (b) specific compressive modulus 


the compressive yield strength compared to the relatively larger decrease in density. 
From the difference in trends between the K15 and K46 glass microspheres, it is 
induced that a yc /p is influenced by the wall thickness of microspheres. The results 
for the specific compressive modulus, E c /p, are shown in Fig. 23b. Again K46 
microspheres perform better as shown. The performance of K46 microspheres is 
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also attributed to their higher wall thickness-to-radius ratio. For all microspheres 
investigated, it appears a densification process takes place at low filler content (~10 
vol%). The densification is caused by a compaction process of the matrix polymer 
in the presence of voids in curing. 

As the content of microspheres increases and, consequently, the matrix volume 
decreases, the microsphere takes up more load under compression and, as a result, 
the specific compressive moduli increase. K15 glass and phenolic microspheres 
show a rather similar behavior in E c /p. Both show a minimum around 20 vol%, after 
which the E c /p slightly increases. This increase is higher for K15 microspheres. 
The specific compressive stiffness levels off as the increase in stiffness is 
counterbalanced by the increases in density of the composite. 

Specific Tensile Properties 

Besides exhibiting high compression strength, Table 4 also shows clearly that the 
tensile properties of syntactic foams are significant. Syntactic foam behaves like a 
linear-elastic material up to failure when loaded in tension. The material experi- 
ences catastrophic failure across a plane perpendicular to the tensile axis. 
Luxmoore and Owen (1980) concluded that a crack will initiate from an oversized 
void, which basically acts as a stress concentration, when a composite is subjected 
to tensile loading. Similar findings were observed by Wouterson et al (2004, 2005). 
On the fracture surfaces of the tensile specimens, voids could be identified. The 
presence of voids in syntactic foam, prepared in a lab-based environment, is 
difficult to prevent. The number of voids can be minimized by avoiding vigorous 
stirring as well as by applying a pressure onto the mixture during curing. 

The outcome of the tensile testing of various compositions of syntactic foam is 
highlighted in Fig. 24. The trends in the tensile strength, a t , with increasing micro- 
sphere content are rather similar for the three types of microspheres, except for 
50 vol% phenolic microspheres. The results indicate that the failure under tension 
is matrix-dominated or dominated by an external factor (i.e., voids). Figure 24a shows 
that a t /p increases upon inclusion of a small amount of microspheres when compared 
to neat epoxy resin. However, beyond 10 vol%, a decreasing trend in a t /p is observed 
for all types of microspheres. The decreasing trend in a t /p with increasing filler 
content indicates that the relative reduction in strength is larger than the relative 
reduction in density. Indeed, a decrease in a t with increasing filler content is observed. 
The comparable trends and values suggest that a t /p is independent of the microsphere 
type and size but only varies with the micro sphere volume fraction. 

Luxmoore and Owen (1980) also suggested that the failure of the foam is 
attributed to the failure of the resin matrix. The nonlinearity in a t /p is caused by 
the reduction in the area of the epoxy matrix in a cross-sectional area. Introduction 
of hollow microspheres reduces the epoxy volume fraction and increases the 
inhomogeneity content, consequently reducing the tensile strength of the composite 
as a result of poor interfacial strength between the matrix and the filler. The authors 
assume that the matrix serves as the load-bearing phase in the composite, whereas 
the hollow microspheres only provide lightweight and minimal strengthening 
effect. The reduction in load-bearing volume outweighs the increase in stiff 
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Fig. 24 Specific tensile properties of syntactic foam containing various microspheres, (a) Spe- 
cific tensile strength and (b) Specific Young’s modulus 

microsphere shells. Around 40 vol% of filler content, a minimum is observed for all 
the microspheres. The authors suggest that 40 vol% of microspheres is to be the 
maximum amount of filler which can be fully wetted by the epoxy matrix in this 
system. The change in properties and behavior of syntactic foam around 40 vol% 
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was also observed by Bunn and Mottram (1993). Nevertheless, clear advantages 
regarding other mechanical properties arise upon introduction of hollow micro- 
sphere as shown in the discussion that follows. 

It has been reported that the Young’s modulus, E t , generally decreases with 
increasing content of hollow microspheres (El-Hadek and Tippur 2002). Bardella 
and Genna (2001) are some of the few who have reported an increasing trend in E t 

Q 

for syntactic foams containing K37 (p = 0.37 g/cnr) microspheres. The results 
presented in Fig. 22b to Fig. 24b shows similarity to those in Bardella and Genna 
(2001) in the sense that the trend for the specific Young’s modulus, E t /p, with 
increasing filler content depends on the type of microspheres used. K46 micro- 
spheres show a rather linear increase in E t /p, whereas K15 shows a constant trend. 
Based on the data presented, it is known that the constant trend in E t /p is caused by a 
proportionate decrease in the Young’s modulus and the density. For hollow phe- 
nolic microspheres, a decrease in E t with increasing filler content is observed. The 
value obtained for neat epoxy resin is too low considering the trends in E t for 
syntactic foams containing the three different types of microspheres. The lower 
value for neat epoxy resin could be caused by the presence of voids. The differences 
between the results for K46 and K15 microspheres can be attributed to their size and 
wall thickness. It has been shown by several researchers that, in general, for 
microspheres of the same material composition, improved mechanical properties 
are obtained for the microspheres with a higher density. Higher density for micro- 
spheres of the same material is often associated with a larger thickness-to-radius 
ratio, t/R (see Table 5). Syntactic foam containing microspheres of the same material 
having a larger t/R often exhibits improved mechanical properties (Fine et al. 2003). 

The difference between the phenolic and K15 glass microspheres explains that 
apart from microsphere size, the material composition of the microsphere is of 
equal importance for the mechanical properties. Phenol formaldehyde has an E t of 
about 6.8 GPa whereas soda lime glass has an E t of about 77 GPa. The difference in 
E t is believed to be an important factor to the difference in the trends of hollow glass 
and phenolic microspheres. This relationship between the material of the matrix 
and the inclusion was studied by Pawlak and Galeski (2002). It was found that the 
higher the Young’s modulus of the inclusion compared to the epoxy used, the 
higher the stresses developed in the material. Further they observed that the position 
of the maximum stress at the spherical inclusion changed from the pole for hard 
inclusion to the equator for soft inclusion. 

Figure 25 shows a typical SEM fractograph of a tensile specimen; the micro- 
graph confirms the role of the binder in the failure of syntactic foam under tensile 
loading. The micrograph shows a rough surface of the epoxy resin after brittle 
fracture. Numerous step structures can be identified in Fig. 25 which indicates 
the plastic yielding of the epoxy resin. Besides plastic yielding, intact microspheres 
are observed on the fracture surface. The fact that the crack has grown over the 
interface between the matrix and the microsphere indicates the presence of 
debonding. Debonding of hard inclusions was also reported by Pawlak and Galeski 
(2002). The debonding is caused by the complex stress state around the 
particle-matrix interface which will not be further discussed here. 
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Fig. 25 Fracture surface of 
syntactic foam perpendicular 
to the direction of applied 
tensile loading 



Specific Flexure Properties 

Figure 26a, b show typical flexure stress-strain curves for syntactic foams 
containing K46 glass and phenolic microspheres, respectively. The flexure behavior 
of K15 microspheres is not shown as it is similar to the flexure behavior of B JO-093 
microspheres. For all specimens, it is observed that the strain is reduced with 
increasing filler content. Especially, the syntactic foams containing glass K46 
microspheres show a larger reduction in failure strain. The reduction in strain 
suggests that the failure of the syntactic foam under flexure loading occurs at the 
side loaded in tension. The larger reduction in strain for glass K46 microspheres is 
attributed to the higher strength of the K46 microspheres (see Table 5). Fewer 
microspheres will fracture under the applied load which prevents stress relief in the 
material. On the contrary B JO-093 and K15 microspheres fracture more easily, 
relieving the stress in the material and thus showing a larger strain. 

Upon fracture, plastic deformation is observed for most compositions of syntac- 
tic foam with up to 30 vol% of microspheres. The amount of plastic deformation 
decreases with increasing filler content since the plastic deformation is attributed to 
the behavior of the epoxy binder. Syntactic foams containing hollow phenolic 
microspheres show larger plastic deformation compared to K46 and K15 micro- 
spheres. The latter might be attributed to ductile deformation of hollow phenolic 
microspheres as shown in Fig. 27, an observation that has not been seen for hollow 
glass microspheres. The plastic deformation of syntactic foam containing 
hollow phenolic micro spheres can be clearly seen on the fracture surface of 
specimens due to stress whitening. The stress whitening cannot be distinguished 
on syntactic foam containing hollow glass microspheres since the resulting 
syntactic foam is white in color. 

During three-point flexure loading, the specimen is subjected to compressive 
stresses on the top part of the specimen and to tensile stresses at the lower part of the 
specimen. From the tension and compression data presented in the previous para- 
graphs, it is expected that syntactic foam subjected to flexure loading will fail at the 
side that is loaded in tension. Indeed, failure at the lower specimen side under 
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Fig. 26 Flexure stress strain curves of syntactic foam containing various amounts of (a) K46 and 
(b) phenolic microspheres. The arrow indicates the direction of increasing filler content 


tensile stresses is observed during experimental work. All specimens fail at the 
center of the support span. No indentation of the indenter is observed. The compa- 
rable failure mode under tensile loading and three-point bending explains why there 
is some resemblance between Fig. 24 and Fig. 28. The only difference observed is 
the behavior of syntactic foams with 0-10 vol% of microspheres. 

For the specific flexural strength, a f /p (see Fig. 28a), some differences between 
the different compositions are observed. The differences are mainly caused by the 
difference in density between the different compositions as the values and trends 


160 


X. Hu et al. 


Fig. 27 A deformed hollow 
phenolic microsphere 



are rather similar for the maximum flexural strengths. All types of micro spheres 
show a decreasing trend in the maximum specific flexural strength with increasing 
filler content. Similar to the tensile test results, the specific strength approaches a 
minimum around 40-50 vol% of filler content. 

The trends and values in the specific flexural modulus, E f /p (see Fig. 28b), are 
consistent with the results presented in Fig. 24b. The similarity is again attributed to 
the tensile failure mode of syntactic foam under three -point bending. The K46 
microspheres show an increase in Ef/p with increasing filler content. K15 micro- 
spheres show a constant value for E f /p whereas phenolic microspheres show a 
decrease with increasing filler content. The constant value in Ef/p from K15 
microspheres is attributed to a similar relative decrease in the flexure modulus 
and density. 

SEM fractographs of the specimens tested under three-point bending (see 
Fig. 29) reveal the different loading modes at the upper and lower parts of the 
cross-sectional area of the specimen. Figure 29a shows the fracture surface of the 
lower part of the cross section, which is loaded in tension. The fractograph is 
consistent with that in Fig. 25. The surface is characterized by a rough surface. In 
between the tension and compression sides of the specimen, an area with relative 
smooth features is observed. The transition between the tensile surface and the 
compressive surface is shown in the right side of Fig. 29b. The compressive surface 
shows step structures behind the microspheres. The step structures diminish with 
increasing filler content and are less pronounced for phenolic microspheres. Besides 
the step structures, debonded microspheres can be identified. Debonded micro- 
spheres indicate a poor interface between the matrix and the filler. 

Specific Fracture Properties 

Besides the mechanical properties, fracture properties were also investigated for 
various contents of microspheres. Figure 30 shows the specific fracture toughness, 
K Ic /p, for various compositions of syntactic foam. The most distinctive features are 
the significant increase in K Ic /p for filler content up to 30 vol% for all types 
of microspheres and the decrease in K Ic /p beyond 30 vol% of filler content. 
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Fig. 28 Specific flexural properties of syntactic foam containing various microspheres, 
(a) Specific flexure strength and (b) specific flexure modulus 


The maximum in the trend for the fracture toughness has been reported for other 
solid particulate composites (Lee and Yee 2000; Rothon 1995; Kawaguchi and 
Pearson 2003). The increase in K Ic /p for 0-30 vol% filler content suggests the 
presence of a toughening mechanism which increases the fracture energy compared 
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Fig. 29 SEM micrograph of the fracture surface of syntactic foam containing 10 vol% K15 
microspheres under (a) tension and (b) compression. The black arrows denote the direction of 
crack propagation 


Fig. 30 Specific fracture 
toughness for various 
compositions of 
syntactic foam 



to neat epoxy resin. For K Ic , the hollow glass microspheres outperform the hollow 
phenolic microspheres with K46 microspheres resulting in the highest value for K Ic / 
p, suggesting that the wall thickness and density of the microspheres affect the 
fracture toughness of the syntactic foam. However, the hollow phenolic micro- 
spheres outperform the hollow glass microspheres for the specific energy release 
rate. This is mainly caused by the lower value of the Young’s modulus of the 
BJO-093 samples. 

Figure 3 1 shows the fracture surface of pristine epoxy resin after being subjected to 
three-point bending. The featureless fracture surface of epoxy resin is indicative of the 
well-known brittle deformation of epoxy resin. Uniformly oriented white lines on the 
fracture surface suggest minor shear deformation of the epoxy resin prior to fracture. 

From Fig. 32, it is clear that the morphology of the fracture surface changes 
dramatically if hollow spheres are added to the epoxy resin. The lines observed 
behind the hollow spheres are referred to as step structures. 

The characteristic tail structure which is created by the mismatch between two 
planes of crack propagation is often regarded to be the evidence for the action of the 
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Fig. 31 Fracture surface of 
neat epoxy resin 



Fig. 32 Step structures 
observed behind hollow 
micro spheres 



XI 


crack front bowing mechanism. Lange (1970) first proposed the crack front bowing 
mechanism. The mechanism proclaims the resistance of rigid particles to crack 
propagation. Because of the impenetrability of the particles, the primary crack has 
to bend between particles. There will be a point when the crack front breaks away 
from the rigid particle. At this point, the arms of the secondary crack fronts will 
come together and form a characteristic step structure as both secondary crack 
fronts propagate at a different crack plane. These characteristic step structures are 
also called “tails” or “lances.” Many studies on the fracture toughness of particulate 
composites consider the crack bowing as the main toughening mechanism. The 
bowed secondary crack front has more elastic energy stored than the straight 
unbowed crack front. Therefore, more energy is needed for a crack to propagate. 

Besides the presence of step structures, debonding of microspheres is observed 
on the fracture surface of syntactic foam. Debonding can be identified by the 
absence of the microsphere or by a gap in between the microsphere and the matrix. 
According to Lee and Yee (2001), the debonding is typical for microspheres with a 
size larger than 10 pm. The presence of debonding of microspheres from the polymer 
matrix suggests the weak interfacial adhesion at the microsphere-matrix interface. 
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Fig. 33 SEM micrograph of the fractured surface of SENB specimens of syntactic foam 
containing hollow phenolic microspheres: (a) 10 vol%, (b) 50 vol% 


Several researchers have suggested that the debonding of glass beads is one of 
the major toughening mechanisms or is a mechanism, which triggers matrix plastic 
deformation in glass bead-reinforced polymers. According to Evans (1972), 
debonding of glass beads was the most prominent mode of micro-cracking in 
glass bead filled epoxies as it exhibited the ability to reduce the crack driving 
force resulting in toughening behavior. Debonding involves energy dissipation and 
will thus impede the crack growth process. 

Figure 33 shows the fracture surface of syntactic foam containing low and high 
volume fractions of microspheres, respectively. Clearly, Fig. 33a shows that step 
structures prevail for the microstructures containing low volume fractions of 
microspheres, whereas debonding of microspheres is observed to be the dominant 
mechanism for syntactic foams containing large quantities of microspheres (see 
Fig. 33b). The maximum value in K Ic /p suggests a transition in dominating tough- 
ening mechanisms in syntactic foam. 

According to Fee and Yee (2000), the increase in microsphere content will 
decrease the interparticle separation between microspheres. The increase of micro- 
sphere content beyond the complete wetting ability of epoxy also introduces 
intersphere sliding. Higher volume fractions of microspheres allow more micro- 
spheres to debond from the matrix. Debonding is accompanied by premature 
cracks. If the direction of these cracks is parallel to the crack growth direction, 
the subcritic al cracks act as precursors and facilitate crack propagation. 

From the results presented, it can be concluded that the specific properties of 
syntactic foam depend on the types and volume fractions of microspheres utilized in 
the syntactic foam. Increase in microsphere density (K46 vs. K15) and the 
thickness-to-radius ratio led to an increase in specific tensile stiffness. The results 
for the tensile and flexural tests were comparable due to the fact that both types of 
tests exhibited the same failure mode. Both tests elicited a decreasing trend in 
specific strength with increasing filler content. A distinct trend in compressive 
behavior was noted in contrast to the tensile failure. The compression tests revealed 
the excellent compressive properties of syntactic foam and in particular the superior 
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performance of K46 microspheres, giving rise to higher compressive yield strengths 
and moduli compared to K15 and phenolic microspheres. 

From the fracture toughness tests, it can be concluded that all types of studied 
microspheres show a similar trend in the specific fracture toughness with increasing 
filler content. For lower filler content, an increase in the specific fracture toughness 
was observed. The increase reached a maximum after which a decrease in the 
specific fracture toughness was seen. The change in behavior was attributed to a 
change in the dominant toughening mechanisms from filler stiffening, crack front 
bowing, to excessive debonding. The usefulness of a combination of desired 
properties for syntactic foam such as lightweight high stiffness, high compression, 
and high toughness was clearly demonstrated. 

Summary and Future Research Directions 

Since the development of the first polymeric foam, the understanding of various 
aspects of foams, such as resin chemistry, structure development, morphology 
characterization, mechanical and thermal properties, etc., has increased dramati- 
cally due to the advancement in instrumental techniques. With the technical know- 
how and in-depth understanding of the chemistry, new materials were developed to 
serve high-end applications under stringent conditions where traditional polymeric 
foams failed to perform. 

Despite the various advantages, environmental concerns challenged the foam 
industry on the whole in recent years. The large quantity of foamed products such as 
disposable cups, food trays and containers, flooring materials, insulation boards, 
sound dampening, beddings, furniture parts, transportation, etc., raised alarming 
issues such as disposal, recyclability, flame inability, and degradability. 

The intensive usage of chlorofluorocarbon as blowing agents caused the deple- 
tion of earth’s ozone layer. Much effort was carried out to investigate the opportu- 
nity of employing alternative blowing agents with low environmental impacts after 
Montreal Protocol in 1987. This has been further accelerated by the issuance of 
numerous regulations and codes by various government agencies. HCFC-141b and 
n-pentane are the most commonly used physical blowing agents at present; how- 
ever the environmental impact and flammability still pose challenges for further 
researches. 

Challenges arising from production, quality control, and cost-effectiveness 
hindered the pace of moving polymer foams into commercial products. The mass 
production of polymer nanocomposites and nanocomposite foams depends on 
reliable and affordable synthetic and processing methods. First of all, there must 
be robust and cost-effective techniques to prepare nanocomposites with required 
mechanical properties in large quantity. For foam products, various desirable cell 
morphologies must be attainable through the successful control of the type and 
amount of nanoparticles added for nucleation and growth of bubbles. 

Open-cell polymer foams are one of the most commonly used scaffolds for tissue 
engineering. Efforts are made to generate highly interconnected cells of compatible 
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size and optimum mechanical properties for such applications. Nanoparticles may 
be helpful in generating open-cell foams under external fields such as ultrasonic 
fields or alter the foaming matrix rheology to promote cell breakage. At the same 
time, the high surface area and rich surface chemistry make nanoparticles poten- 
tially useful as carriers for desired biofunctionalities (e.g., adhesion sites and 
signaling molecules). This may trigger the need for mass production technology 
for tissue engineering scaffolds. 

The idea of creating nanoscaled cells in polymeric materials is in concept 
an extension of microcellular foams. This idea although generates excitement 
but is poorly explored due to the limited processing techniques. The unique 
structures are expected to offer properties that are superior to those of existing 
materials, such as much higher strength- to -weight ratios. It is also hypothesized 
that the thermal insulation can be significantly improved if the void size is 
smaller than the mean free path for molecular collisions. However, no studies 
have been carried out to justify the claim. Nevertheless, recent progress made 
on nanofoams demonstrated the potentials of this unique material to be used for 
any applications where foamed polymers are currently used, with possibly 
improved performance. These novel materials are expected to find a wide range 
of applications in construction, packaging, motor vehicle, microelectronics, and 
household products. 
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Abstract 

Among the manufacturing methods, bulk metal forming processes are recog- 
nized as classical methods with unique advantages for related industries. Despite 
many emerging technologies, these advantages have made them a hot topic in 
industrial applications. In this chapter, various types of bulk metal forming 
processes are described. The main processes including rolling, extrusion, and 
forging with their sub -categories are covered. Particular attention is given to the 
metallurgical aspects of these processes such as the effect of initial blank 
properties and interfacial friction on the in-process material behavior. Different 
approximation methods for modeling of these forming processes are explained 
and compared. The main challenges in finite element simulation of the bulk 
metal forming processes are also introduced and discussed. This chapter may 
serve as a good reference for forming process selection and identification for 
researchers, engineers, and students. 

Introduction: An Overview on Bulk Metal Forming Processes 
Introduction 

The broad topic of bulk metal forming includes many processes both as classical 
and modified categories. These classical bulk metal forming processes are still in 
great demand in many industry sectors. It does not seem that this trend will slow 
down as new technologies have been developed over recent years. These technol- 
ogies are continuously modified for new demands, such as higher precision (near- 
net shape) forming, micro-/nano-forming for micro -components, and bulk-sheet 
metal forming processes for complex workpieces. Thus, it is important to have a 
broad knowledge of the classical theories of metal forming which was mainly an 
overview of metal forming processes and their fundamentals, some of the modifi- 
cation of these processes in the current work. Specifically, effects of material 
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Table 1 Classification of the metal forming processes based on deformation type 


Category 

Characteristic 

Examples 

Bulk metal 
forming 

Semifinished (bulky) initial workpiece 

Forging, extrusion, 
rolling 

Small surface-to-volume ratio 

Large amount of deformation 

Sheet metal 
forming 

Large surface-to-volume ratio of the initial 
workpiece 

Deep drawing, bending 

Small change in the thickness during the 
process 


Table 2 Classification of metal forming processes based on deformation temperatures (Mielnik 

1991) 


Category 

Characteristic 

Remark 

Hot metal 

0.7T M a < deformation 

No work hardening, dynamic recovery, 

forming 

temperature < 0.8T M 

and recrystallization 

Warm metal 
forming 

0.3T M < deformation 
temperature < 0.5T M 

Partial strain/participation hardening 

Cold metal 
forming 

deformation 
temperature < 0.3T M 

Work hardening 


a T M = melting temperature 


behavior on the forming processes which are of paramount importance are 
discussed. The classical modeling techniques to describe the forming processes 
will also be addressed, with a focus on the challenges in this area. 


Classification of Metal Forming Processes 

Based on the type of the workpiece and deformation mode, the metal forming 
processes are generally categorized into two subgroups of bulk metal forming and 
sheet metal forming, as indicated in Table 1. 

Besides the above categories, metal forming processes can be classified based on 
the forming temperature as shown in Table 2. Indeed, to increase the formability of 
hard-to-deform metals, the process temperature is occasionally increased. 

It is noteworthy that majority of the bulk metal forming processes are performed 
in compression mode rather than tension or bending. This is different from sheet 
metal forming in which the workpiece is most commonly subjected to tensile 
stresses. As a result, the typical problems encountered in sheet metal forming 
such as anisotropy of the workpiece and material fracture caused by tensile stresses 
are commonly avoided in bulk metal forming. 

Rolling 

Before conducting a forming process, there is a need for primary processes to 
provide the bulky workpieces with defined configuration, dimensions, and 
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Fig. 1 Typical arrangement 
of rolling mills for slab and 
thick plates: (a) 2-high single 
pass, (b) 2-high reversing 
mills, (c) 4-high 




properties. In fact, rolling of a raw material into a slab or plate is considered as one 
of the most important primary processes (Avitzur 1968). It is important to note that 
in sheet or strip rolling, where the thickness is relatively low, the deformation 
occurs in plane-strain condition (no change in the width of the strip during the 
process). However, in bulk rolling of thick plates, deformation occurs in all three 
directions. 

Rolling is also used for forming of more complicated parts such as rounds or 
squares or complex sections such as L, U, T, or H. These processes are mostly done 
in multi-pass rolling by using the grooved rolls or universal mill. 

Rolling is a mechanical process whereby the metal is deformed by passing 
through the gap between rotating rolls. Depending on the process and the metal 
type, there might be only two or more number of rolls to be used for the process, as 
shown in Fig. 1. 

The rolls in contact with the deforming metal are called the work rolls , 
while the others are called backup rolls (Mielnik 1991). As will be explained 
in later sections, there are definite advantages of using small diameter work 
rolls. However, small rolls could be deflected easily by forming pressures. 
To resolve the issue, a four-mill rolling process is used for prevention of the vertical 
deflection. 

Side/edge or so-called universal mills with H or U cross section are used for 
precise directional control of the workpiece and obtaining more accurate width 
dimensions. 

Based on the amount of the metal to be rolled, reduction amount, final part 
shape, and desired mechanical properties, the rolling procedure and sequences (pass 
schedule) can be planned. 

In comparison with other bulk metal forming processes, rolling is a relatively 
uniform or homogenous process, since the plastic deformation proceeds 
continuously/in-continuously and plastic zone is penetrated to the center line of 
the workpiece. However, during the process, as shown in Fig. 2, in practice, a sort of 
backward metal flow occurs at the rolling entry zone (bending of the grid line 
outwards) and a forward metal flow at the rolling exit (bending back the grid lines to 
the straight position), which pushes the metal during the process (Mielnik 1991). 
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Fig. 2 Distortion of vertical 
lines in the central vertical 
section during rolling in case 
of (a) homogenous 
deformation and (b) 
inhomogeneous deformation 





Fig. 3 Nonuniform deformation in plate rolling (a) skin rolling, (b) normal (proper) rolling, 
(c) central alligatoring 


Under this circumstance, a rise in redundant deformation occurs, which in return 
decreases the relative process efficiency. 

In the case of large workpieces and small thickness reductions, there is a 
probability of having more deformation at the surface, while the center of the 
specimen remains undeformed, causing a nonuniform deformation through the 
thickness. Application of proper pass schedule (reduction in thickness) could 
minimize the nonuniformity in such cases. Nonuniform micro structure can also 
lead to alligatoring as shown in Fig. 3c. Besides, there are special type of rolling 
processes that are summarized in Table 3. 

Forging 

Forging is usually defined as plastic deformation of a bulk metal to a predetermined 
shape by compressive forces using a hammer or press. 

Forging in conventional way is classified as (i) open-die and (ii) closed-die forging. 
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Open-die forging is the term that is used for processes in which there is no lateral 
constraint for the material movement in the die set. These processes are practically 
used under the circumstances below (Altan et al. 2004): 

1 . As a preforming process for closed-die forging 

2. For simple desired shapes 

3. To justify the operation cost and time in small number of forgings 

Open-Die Forging 

If the open-die forging is used as the final forming process, a post-forging operation 
such as machining is required to obtain the desired shape. However, in case the 
open-die forging is used as a preforming operation, it is effective to optimize the 
forging sequence for near-net shape forming. 

Based on the operation used, the open-die forging processes can be categorized 
as indicated in Table 4. 

Since all the above mentioned operations are fundamentally connected to upset- 
ting or compression, the deformation mode in compression is discussed here in 
detail. In upsetting of a bulk piece, the main issues could be the barreling (Fig. 4-a) 
and the buckling of the workpiece (Fig. 4-b). The barreling is caused by inhomo- 
geneous deformation. When the initial height-to-diameter ratio of the workpiece is 
larger than 2.5, the lateral buckling appears (Fig. 4). 

The behavior of the inhomogeneous deformation in the forging can be analyzed 
by various methods such as (i) grid pattern distortion (Altan et al. 1983a), 
(ii) hardness distribution, and (iii) micro structural observation on the cross section. 

The cross section of the workpiece after upsetting deformation is schematically 
illustrated in Fig. 5. Due to the interfacial friction forces at the both end surfaces of 
workpiece, the material at the center of the workpiece has relatively more freedom 
to flow laterally, causing a barreling. The amount of this barreling depends directly 
on the friction. To reduce the amount of barreling and the required forming load, 
usually application of a proper lubrication during the process is advised. 

The so-called dead metal zones in the micro structure appear near the end 
surfaces of workpiece when the high friction exists at both end surfaces during 
upsetting. 

Closed-Die Forging 

The closed-die forging is a process in which the metal has the restriction in flowing 
laterally during the process. It usually involves filling a cavity between the punch 
(upper part of the die) and the die (lower part of the die). Due to that, the load- 
displacement curve for the closed-die forging cycle is usually divided into three 
main stages, as typically is shown in Fig. 6: 

1 . Upsetting which corresponds to lateral and vertical flow of the material. 

2. Die filling, in which the die cavities start to be filled and the flash begins to form. 

3. End stage in which the dies are completely filled. Due to restriction of the metal 
flow, the pressure inside the die increases rapidly. 
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Table 4 Different types of open-die forging process according to the operation (Mielnik 1991) 
Process Definition Schematic Applications 


Cogging 


Compression 
of an ingot 
between 
narrow dies 


Forged billets 
from cast 
products and 
elongation of 
bar stock 


Upsetting 


Compression 
between two 
flat dies 


After Forming 

r ' 




Disks 


Heading 

(upset 

forging) 


Localized 
upsetting of 
the workpiece 
between two 
flat dies 


After Forming 

N 


i 


Flanged 
shafts, 
finished 
forging, shafts 
with head 


0 continued ) 



180 


E. Ghassemali et al. 


Table 4 (continued) 


Process 


Definition 


Swaging Radial 

compression 

between 

longitudinal, 

semicircular 

dies 


Punching Indentation 

with mating 
dies 


Piercing 


Punch 
indentation 
into the 
workpiece 


Schematic 




Applications 

Bar 

elongation, 
stopped shafts 


Preform 

forgings 


Preforming, 

cavity 

forming 



0 continued ) 
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Table 4 (continued) 


Process 

Definition 

Schematic 

Applications 

Extrusion 

Extrusion of a 

i 1 

Boss 

forging 

bulk metal 




into a die 

BB 


Bending 

Bending of a 

1 111 A 


Bend forged 


bulk between 


parts 


mating dies 

Kg 




Initial Final Initial Final 


Fig. 4 Typical deformation of workpiece in the upsetting process: (a) barreling, (b) buckling 

As shown in Fig. 6, normally the dies are not completely closed in the closed-die 
forging processes. The purpose of the flash land is to control the back pressure to 
obtain a desired fill-up of metal in the cavity. The design of the flash land is critical 
though to have a good control over the process. 

It is important to reduce the material cost, in conventional forging processes. 
Therefore, the die design and the initial material (stock) volume must be optimized 
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Fig. 5 Deformation pattern 
in upset forging of cylindrical 


billet 




Dead Metal Zone 



Fig. 6 Typical load-stroke behavior during closed-die forging process 

to minimize the material wastage. However, technically the stock is usually pro- 
vided with a bit of an excess metal (Mielnik 1991). There are some special closed- 
die forging processes that are summarized in Table 5. 

Extrusion 

Extrusion is basically categorized as a compression forming process, since the main 
forming stress comes from the compressive stresses which are applied from the 
punch, as shown in Fig. 7. 

Extrusion is usually either performed at room temperature (cold extrusion) to 
obtain the final part with close dimensional accuracy or at elevated temperatures 
(hot extrusion) for extreme conditions such as high punch pressure and high degree 
of deformation. 
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Table 5 Special types of closed-die forging process 


Process 


Definition 


Schematic 


Applications 


Coining/ 

hobbing 


A closed-die 
forging 
process for 
imprinting on 
a restrained 
workpiece 


Metal Flow Direction 


V 


f 

t 


Metallic coins, 
decorative 
items, 
medallions, 
sizing of 
automobile or 
aircraft engine 
components 


Nosing 


Closing the 
open end of a 
shell or 
tubular 

component by 
axial pressing 
with a shaped 
die 


H 


Metai Flow 
Direction 


Bottom Die 


Gas pressure 
containers 


Fig. 7 Schematic of the 
extrusion process 


V 


a 
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3 Extruded b C 




Fig. 8 Different types of extrusion processes: (a) direct extrusion (forward extrusion), (b) indirect 
extrusion, (c) backward can extrusion 


The extrusion products are typically axisymmetric solid bars or hollow pipe/ 
tube, and parts with anomalous shape of cross sections. 

Extrusion Processes 

Direct Extrusion (Forward Extrusion) 

The flow of the material is in the same direction of the punch movement. Thus, the 
punch pushes the material to be extruded through the die orifice. The cross- 
sectional shape of the extruded product is dictated by the geometry of the interior 
shape of the die. 

Indirect Extrusion 

In indirect extrusion process, the die set is designed in a way that the material could 
only flow in the opposite direction of the punch movement, through the hole at the 
punch center. 

Backward Can Extrusion 

In this type of extrusion, the material flow is also in the opposite direction of the 
punch movement, but only through the gap between the punch and die walls. 
Lateral flow is constrained by the die walls, to distinguish this method by the 
heading process. This technique is used for forming a can by piercing from a 
bulk billet. 

The schematic of the abovementioned types of extrusion process is shown in 

Fig. 8. 

There is a slight difference between the forming-load behaviors of the direct and 
indirect extrusion processes. As shown in Fig. 9, the forming pressure in backward 
extrusion is slightly lower. This is because in backward extrusion, there is no load 
necessary to overcome the container friction. 

The extrusion force is influenced by the following process variables: 

Extrusion ratio , R = A 0 /A f , where A 0 and A f are cross-sectional areas of the 
workpiece before and after extrusion, respectively. Since the average strain 
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Fig. 9 Difference between 
load-stroke behaviors of 
direct and indirect extrusion 
processes 



increases by increasing the reduction ratio, i.e., average strain, the amount of 
deformation stress goes up. 

Die geometry (die angle, a in Fig. 7). The material flow during the process highly 
depends on the die angle, which is reflected on the forming load. In forward 
extrusion, increasing the die entrance angle leads to an increase in the volume of 
the metal undergoing shear deformation, which in return increases the deformation 
load. On the other hand, increasing the die entrance angle decreases the die-metal 
interface and results in the decrease in the die friction load. As a consequence, for a 
given reduction and friction, there is an optimum die angle that minimizes the 
extrusion load. 

Lubrication. It has been proven that efficient lubrication can lower the extrusion 
loads. 

Defects in Extrusion Products 

One of the most common defects in forward extrusion process is center-burst or 
chevron cracks, occasionally called cupping. This defect appears on the longitudi- 
nal cross section of the extruded part. The chevron defect is mainly caused by 
nonhomogenous deformation which requires an abrupt acceleration of the metal in 
the extrusion die. Thus, it can be prevented by (i) increasing the extrusion reduction 
ratio (Ro/Rf), (ii) decreasing the die entrance angle, (iii) decreasing the friction, and 
(iv) application of interval annealing (Tschaetsch 2005). 

Improvement in ductility of the material at the center of the billet is also 
important to prevent this defect. 

In addition to chevron cracking, dead metal zone formation is another defect that 
may result in nonuniformity in the final part properties. This defect is caused by 
selection of a large die entrance angle. 

At the critical die entrance angle, the internal shear deformation causes the 
formation of the dead metal zone next to the die surface. This zone does not 
participate in the flow process but instead adheres to the die surface, considered 
as a material wastage. In some cases, dead metal zone could leak to the die orifice, 
causing a nonuniform micro structure. Increasing the die entrance angle to the 


186 


E. Ghassemali et al. 



Initial 




Forming 





Fig. 10 Evolution of the grains shape during typical cold forming processes 


second critical angle causes the dead metal zone to move backwards, forming metal 
chips. This is interesting because as the rod proceeds through the die, the outer 
surface is shaved off and the core moves through the die without plastic deforma- 
tion, making the process similar to the blanking process. This is the main reason for 
decreasing the relative forming pressure with an increase in die entrance angle in 
the shaving region. 

In case the external force applies as pulling forces on the metal, the process is 
called drawing. The deformation principles are very similar to extrusion processes. 
This process is mainly used for wire or tube drawing. 


Cold and Hot Bulk Forming Processes 

According to forming temperature, the bulk metal forming processes are divided 
into two main groups of cold and elevated temperature forming processes. The 
elevated temperature bulk forming itself is somehow can be sub-categorized into 
warm and hot forming processes. The direct effect of temperature is on the plastic 
deformation of the metal. 

Cold Metal Forming 

Cold bulk forming is defined as forming of a bulk metal at room temperature 
without initial or interstage heating. The plastic deformation of the metal is 
developed by the slip on multiple slip planes in each grain. This develops a network 
of dislocations in the grains and a series of dislocation pile-ups at the grain 
boundaries, inclusions, and internal defects. These obstacles inhibit the dislocation 
movement inside the metal, and thus increase the flow stress and hardness by work 
hardening. This, nevertheless, decreases the ductility in certain amount. The plastic 
deformation changes the grain shape along the deformation route as shown in 
Fig. 10 for the cold rolling process. 

Moreover, due to possible inhomogeneous deformation in some conditions, the 
final part properties could be nonuniform. To adjust the mechanical properties of 
the cold-formed parts, annealing is advised. Figure 1 1 shows the typical change in 
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Recovery Grain growth 
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Fig. 11 The effects of annealing on the material properties 


Fig. 1 2 Limitations in cold 
metal forming processes 


Working Limits in cold bulk forming 



the mechanical properties and the micro structure of the metal due to the annealing. 
In some cases, recovery annealing is conducted to eliminate the residual stress in 
the cold-formed parts. 

In general, due to the low temperature, the main work limits in cold forming 
processes could be categorized as Fig. 12. 

Due to these limitations, and also due to geometry complexity of the formed 
part, the cold bulk forming processes need to be designed by highly experienced 
process designers with broad knowledge about the in-process material behavior, 
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Table 6 Typical alloys 
that can be cold-formed 


Metal type 

Example 

Steels 

Low and medium carbon steels (<0.45 %C) 


Low alloy steels a 


Stainless steels 

Aluminum 

Almost all alloy types 

Copper 

Almost all alloy types 

Nickel 

Inconel alloy 

Titanium 

Pure titanium, p-Ti alloys 


a Spheroidize annealing might be required to improve the formability 


tooling properties, lubrication, process costs, and FE simulation for design optimi- 
zation. Typical metal alloys that can be cold-formed are listed in Table 6. 


Elevated Temperature Metal Forming 

Although metals exhibit good ductility at room temperature, some alloys such as 
medium to high carbon steels, different grades of stainless steel, titanium alloys, 
and magnesium alloys cannot be easily formed in bulk due to low formability 
or/and higher flow stress. For these alloys, elevated temperature forming is suitable. 

In warm forming processes, the metal is heated to temperatures below the recrys- 
tallization temperature, for instance, up to around 750-850 °C for different types of 
steels. An exception could be austenitic stainless steels that are warm-formed at around 
250-450 °C. Such a warm temperature can ease the movement of the dislocations 
within the micro structure, increasing the formability of the metal. Furthermore, since 
the temperature is yet below the recrystallization temperature, the grain growth and 
phase transformations can be controlled through the process. The work hardening 
appears in warm forming process, but its extent is smaller than that in the cold forming 
process. As an example of warm forming, magnesium alloys such as Mg-3%Al-l%Zn 
(AZ31) and Mg-6%Al-l%Zn (AZ61) can be formed at 250-450 °C. 

In the forming of hard- to -machine alloys such as high carbon steels and 
titanium- and nickel-based superalloys, the hot forming process is used at temper- 
atures above the recrystallization temperature. The hot forming is also used to form 
large size components. 

The isothermal hot forming processes can provide a near-net shape product of 
titanium- and nickel-based superalloys (to reduce the material cost), with a well- 
controlled micro structure and properties. 

In isothermal forming processes, the dies are heated up to the same temperatures 
as the workpiece (e.g., more than 1,000 °C for nickel-based superalloys), 
preventing the temperature drop in the workpiece, which in return provides better 
formability. Moreover, to maintain the superplastic material behavior, the strain 
rate must be kept low. This prolongs the forming process. However, since the 
process is isothermal, there is no die chilling required. 

In hot forming processes, since the temperature is higher than the recrystalliza- 
tion temperature of the workpiece, dynamic recovery and recrystallization occur in 
the microstructure during the process (Fig. 13). 
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Fig. 13 Schematic of the 
micro structure evolution in 
hot rolling process 
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Fig. 14 The effect of 
dynamic recrystallization on 
the metal flow 



The effect of the dynamic recrystallization on the flow stress of the metal during 
the process is shown below. The elevated temperature affects the metal yielding 
behavior. Moreover, as can be seen, high temperature eliminates the work harden- 
ing effects (shown by the straight-line portion of the high-temperature curve in 

Fig. 14). 

Typical metals that are usually hot-formed are listed in Table 7. As discussed, 
based on the amount of the deformation, other easy-to-form alloys might also be 
considered to be formed at higher temperatures. 


Material Deformation Analysis in Bulk Metal Forming 
Introduction 

In bulk metal forming processes, the materials usually experience a significant 
amount of plastic deformation. As the bulk forming processes are marching towards 
near-net- shape manufacturing, the deformation mode, material flow, and product 
geometry are increasingly becoming complex beyond the levels to be addressed by 
simple analytical solutions. It is widely accepted that such complexity is 
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Table 7 Typical alloys that are usually hot-formed 


Metal type 

Example 

Steels 

High carbon steels (>0.45 %C) 


Low and high alloy steels 


Stainless steels (ferritic, austenitic, martensitic) 

Super alloys 

Almost all alloy types, e.g., iron-based, nickel-based, cobalt-based 

Copper alloys 

Pure copper 


Brass 


Bronze 

Aluminum alloys 

A3000 series 


A2000 series 


A6000 series 


A4000 series 

Titanium alloys 

Ti-6Al-7 V 


Ti-4Al-4Mn 


Ti-5Al-2Cr-~lFe 


contributed from the nonlinearity of the geometry, material, and boundary condi- 
tions in the process. Therefore, the main objectives of material deformation analysis 
in metal forming operations are: 

To establish the kinematic relationships between the undeformed and deformed 
parts. Most prominently, to predict the metal flow during the forming process 
(geometry) 

To establish the forming limits, i.e., to determine the possibility of performing the 
forming operation without causing any surface or internal cracks or folds 
(material) 

To estimate the load required to carry out the forming operation, which provides 
necessary information for tool design and equipment selection (boundary 
condition) 

Therefore, approximation methods, either analytical or numerical, are required for 
analysis of bulk metal forming processes. 

There are numerous approximation methods available to analyze the metal 
forming processes. However, none of these methods are robust enough to cover 
all the processes with great precision. This is partially because some necessary 
assumptions have been made in order to facilitate the development of a mathemat- 
ical approach. Moreover, their accuracy also depends on the quality of the inputs, 
which are essentially (i) flow stress data and (ii) friction coefficient. These two 
quantities are usually determined experimentally and difficult to obtain accurately. 
Thus, any uncertainty in them may impose additional error to the result of the 
analysis. In this section, various models to mathematically describe the material 
stress-strain curve and friction coefficient will be presented in future sections. 
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The choice of suitable test for obtaining the flow stress data is covered in this 
chapter, and friction conditions and their implication to the bulk forming are to be 
discussed in sections ahead. 


Approximation Methods for Deformation Analysis 

The approximation methods have changed significantly over the last 30 years since 
Altan et al. (1983b) first attempted to summarize all the common methods used and 
list them in terms of applications and limitations. Some of Altan’ s predictions on 
their future applicability are quite insightful. Authors here would like to reorganize 
the chart in order to reflect the progresses people have made over the last three 
decades in metal forming (Table 8). 

From the above chart, it is not difficult to point out that despite the long 
computational time, the finite element, finite volume, and finite difference take the 
lead in terms of accuracy and availability of the result outputs (field variables). Their 
advantages go further when complex shape parts are considered, where other 
methods cannot compete. Furthermore, long computational time problem has also 
been alleviated over the last three decades by the advances of the computer science, 
which makes large-scale computing possible and affordable (within minutes or 
hours). Therefore, these “finite” methods dominate the current metal forming sim- 
ulation practice, among which it is also widely believed that finite element method is 
superior compared to the other two, since it can accurately follow material inter- 
faces. This is widely used in engineering problems when deformation of complex 
isolated objects is modeled and material flow needs to be accurately traced. There- 
fore, most of the modem simulation software for metal forming analysis employ 
finite element method as the backbone algorithm, and in the section “Tool 
Manufacturing and Material Selection for Bulk Metal Forming,” we will focus on 
this method and its application to the bulk metal forming process simulation. 


Deformation Modes in Bulk Metal Forming and Their 
Characterization 


The deformation mode of the process refers to the dominating stress state in the 
workpiece that the process uniquely possesses. As mentioned in the Introduction 
section, bulk metal forming is dominated by compressive stress state. Figure 15 
illustrates the applied force and primary stress state of the four typical bulk forming 
processes. 

The aim of such deformation mode study is to (i) simplify the model into 2D or 
even ID case so that the analytical solution can be obtained and (ii) identify the 
suitable material workability test for its formability study and characterization. 

From Fig. 15, it can be concluded that rolling experiences a biaxial compressive 
stress state (plane strain) when operating without front or back tension. Forging and 
upsetting in closed die has a uniaxial compressive stress state, and extrusion shows 
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Die 

Extrusion 





Drawing 


Fig. 15 Illustration on the applied force and primary stress state in (a) rolling, (b) forging, 
(c) extrusion, and (d) drawing 


a triaxial compressive stress state. In the wire drawing, the deformation zone 
indicates a uniaxial tension-biaxial compression stress state. The common defor- 
mation modes in the bulk forming process are summarized as in Fig. 16. 

Although the names of uniaxial and axisymmetric are quite self-explanatory, the 
biaxial stress state plane stress and plane strain are not and they need special 
attention. Plane stress occurs when the stress states lie in the plane of the workpiece 
(no stress in third dimension). This typically occurs when one dimension of the 
blank is very small compared to the other two and the workpiece is loaded by a 
force lying in the plane of symmetry of the body. Thin wall pressure vessel is a good 
example for this type of analysis. 

Plain strain occurs when the strain in one of the three principal directions is zero, 
as in an extremely long member subjected to lateral loading or a thick plate with a 
notch loaded in tension. A common plane-strain condition in the bulk metal 
forming is the rolling of a wide sheet. It is worth noting that although there is no 
strain in the width direction, there is stress acting in that way. 
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Fig. 16 Typical deformation 
modes in the bulk metal 
forming 



Uniaxial (ID) 



Biaxial - plane strain (2D) 




Biaxial - plane stress (2D) 


Axisymmetrio (2D) 


In most cases of bulk forming, the stress state is complex. Therefore, a triaxial 
stress term is needed to evaluate the material performance during forming process. 
The most common one is defined by the von Mises criteria (von Mises stress = a v ), 
or effective stress a. 



Another important stress term is the mean or hydrostatic stress o m , as explained 
by Eq. 2 


CTm = ^ (°1 +°2 + ° 3 ) 



Hence, a general workability parameter can be formulated using those two stress 
terms as Eq. 3 (Venugopal Rao et al. 2003). 

P = ^ (3) 

a 

This parameter can be used to evaluate the workability of the material in bulk 
forming processes as well as in testing methods, including tension, compression, 
and torsion tests. 

Schematics of the material testing methods are shown in Fig. 17, with their 
deformation characters listed in Table 9. In Fig. 18, we link the bulk forming 
processes with those workability tests in terms of their deformation modes. 

It is clearly stated that for wire drawing, it is recommended to use tensile test to 
evaluate the material formability in this process, while the compression test works 
well when extrusion, rolling, and forging are the dominating processes. Torsion test 
is mainly designed for measuring flow stress under hot-working conditions, as the 
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Fig. 17 Illustration of 
common material formability 
tests 





Torsion 




Table 9 Material formability tests with their deformation characters 


Test 

Principle stress 

Effective stress 

Mean stress 

p 

Tension 


°x 

a x /3 

1.0 

Torsion 

<7 Z ^ 

o x 

0 

0 

Compression 

°z 0 


-o x /3 

-1.0 


Extrusion 


Rolling 


Forging 


Ef , strainatfracture 


Wire 



i 



Fig. 18 The influence of 
process stress state on the 
workability and strain at 
fracture 
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DIMENSION, MATERIAL AND QUANTITY 


QUANTITY 

D (mm) 

H (mm) 

MATERIAL 

LUBRICATION 

1 

6.0 

9.0 

SS 304L 

Enclosing Teflon 


1600 


1400 


o' 1200 

o. 



</> 1000 
in 

o 


800 


(/) 

£ 

o 


600 


400 


200 



0 


0 




■ SS304L 


True Strain 


Fig. 19 Compression test of ss304L for its stress-strain curve 


deformation is similar to the torsion test where excessive material reorientation 
occurs at large strains. These three tests are the main workability tests for bulk 
metal forming, and their test procedures have been standardized in ASTM-E8 (for 
tension), E9 (for compression), and A938 (for torsion) (ASTM 2009). An example 
of the compression test result of stainless steel 304 L is presented in Fig. 19. It is 
worth noting that we use the term “true strain” to represent the amount of defor- 
mation experienced by the material. Normally, we measure the material extension 
or contraction by “engineering strain,” which is the ratio of total deformation to the 
initial dimension of the material. It is expressed as Eq. 4 



where 1 is the material final dimension and L is the material initial dimension. 
However, to truly reflect the amount the deformation that material experiences, its 
incremental form has to be adopted and it is in the logarithmic form due to 
integration. It is often called logarithmic strain or true strain and in the form of 


r* 





fj 





It is widely considered that the compression test is the most important and 
commonly used test for characterizing the material stress-strain curve and 


5 Bulk Metal Forming Processes in Manufacturing 


197 


formability in bulk metal forming processes. Details of various material flow stress 
models based on the compression test data will be covered in sections ahead. 


Load Prediction for Bulk Metal Forming 

One of the main questions that a bulk metal forming study has to answer is the load 
prediction. Such exercise provides necessary information on tool design and equip- 
ment selection. Nowadays most of such studies are carried out using finite element 
simulation software, e.g., DEFORM, ABAQUS, PAMSTAMP, FORGE, etc., due 
to complex workpiece geometry involved. However, a few simple analytical 
solutions, derived from slab analysis, still provide useful information and good 
approximation on the load during bulk forming process. Some of the most popular 
equations are provided here as a guideline for quick load estimation (Kalpakjian 

1997). 

Force in Open-Die Forging 

The deformation mode of the open-die forging process is uniaxial compression. If 
the material is strain hardening with a stress-strain curve following power law (see 
section on “Tool Manufacturing and Material Selection for Bulk Metal Forming” 
for details), 


Y = Ke n (6) 

The expression for force at any stage during deformation becomes 

F = YA (7) 

The Y and A are instant values of flow stress and cross-sectional area at the given 
time. If a solid cylindrical workpiece is considered, the instant average pressure on 
the workpiece can be expressed as 


y ( 1 + f) <8 » 

where p is the friction coefficient, r is the diameter of workpiece, and h is the height 
at that stage (Fig. 20). The forging force is thus 

F = Pav(OT 2 ) (9) 


Force and Power in Rolling 

For rolling without front or back tension, the contact length curves (red highlighted 
in Fig. 21), F, multiplied by the strip width, w, are the contact area under pressure 
on the strip. A simple way of calculating the roll force is to multiply the contact area 

with average flow stress Y : 
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Fig. 20 Illustration of an 
open-die forging process 


Top die 



Bottom die 


l 


Fig. 21 Schematic of the 
rolling process 



F = LwY 


( 10 ) 


The average flow stress, if based on power law a = Ke 11 , can be given as 


K 


Y 


’6l 


s n de 


o 


ei 


Ke n { 

n + 1 


( 11 ) 


Moreover, the dimension L as the projected area is approximated using Eq. 12. 


L = VR A h 


( 12 ) 


where R is the roll radius and Ah is the difference between the original and final 
thickness of the strip (draft) (Fig. 21). 

As the friction in the rolling process is comparatively small, the friction force is 
ignored here for first-order approximation. The torque per roll then can be obtained 
from Eq. 13: 


T 


FL 

T 


(13) 


and the power required per roll is 
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Fig. 22 Variables in the 
extrusion process 



Fig. 23 Variables in drawing 
round rod wire 



Wire or rod 
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where w = 27cN w = 2 jtN and N is the revolutions per minute of the roll 



Pressure in the Extrusion 

The extrusion ratio is defined as 


R = (15) 

Af 

where A 0 is the billet cross-sectional area and A f is the area of the extruded product 
(Fig. 22). Then the extrusion pressure at the ram can be given as 



where Y is the average flow stress. If friction at the die-billet interface (but not the 
container wall) is considered, for small die angles a, the pressure is given as 


P = Y 1 + 


tan tan a 
F 


(R M 


cos cos a 


i) 



Drawing Force in the Wire/Rod Drawing 

The drawing of the wire/rod is similar to extrusion except that instead of the 
pressure (p) on the A 0 , there is a drawing stress a d on the exit A f (Fig. 23). Such 
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drawing stress for the simplest case of ideal deformation (no friction or redundant 
work) is 

= Yln (a) (1 8) 


where Y again is the average flow stress, and the drawing force F then is 



YAfln 



Similarly, if friction is considered, the drawing stress becomes 




— ( tan tan a 

Y 1+ 


v 






This section briefly lists the available approximation methods and their applica- 
tion on the die force prediction. Material deformation modes and their test methods 
during bulk metal forming are also included in this section. This is to provide 
an overview to the bulk metal deformation study methodology. More details on 
the respective methods can be found in the references listed at the end of the 
section. 


Modeling and Simulation of Bulk Metal Forming Processes 
Introduction 


Most of the numerical simulations of the bulk metal forming processes nowadays 
are carried out using finite element method. It has been extensively applied to the 
metal forming industry to enable a more scientific approach to forming process 
development and optimization. In this section, a particular interest was paid on the 
critical challenges in finite element simulation with the aim to help the handbook 
users to identify the key issues in this area. Such issues include the material friction 
behaviors, mechanical properties, micro structure evolution, and fracture prediction. 


Friction in the Finite Element Simulation 

In metal forming process, workpiece deformation is brought about during contact 
between a tool and a workpiece. This inevitably results in friction if there is any 
tangential force at the contacting surfaces. Numerous analytical models have been 
proposed to describe the frictional behaviors between the workpiece and die; below 
are the some of the most popular ones used in the finite element simulation 
software. Their areas of application are also presented. 
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Table 10 Typical friction 
coefficient values using 
constant shear friction 
model 


Bulk forming process 

Shear friction coefficient 

Cold forming (carbide dies) 

0.08 

Cold forming (steel dies) 

0.12 

Warm forming 

0.25 

Hot forging (lubricated) 

0.3 

Hot forging (dry) 

0.7 


Coulomb Friction Model 

Coulomb friction model is used when contact occurs between two elastically 
deforming objects (could include an elastic -plastic object, if it is deforming elasti- 
cally), or an elastic object and a rigid object. Generally, it is used to model sheet metal 
forming processes. The frictional force in the Coulomb law model is defined by 

f = HP (21) 

where f is the frictional stress, p is the interface pressure between two bodies, and p 
is the friction coefficient. However, the use of coulomb friction model gives 
occasion of an overestimation of the friction stresses at the tool-workpiece inter- 
face, as the normal pressure is often considerably greater than the yield stress of the 
material. Consequently, the friction stress becomes greater than the yield stress of 
the material in pure shear. 

Shear Friction Model 

The constant shear friction is used mostly for bulk metal forming simulations. The 
frictional force in the constant shear model is defined by 

f = mk (22) 

where f is the frictional stress, k is the shear yield stress (of the workpiece), and m is 
the friction factor. This states that the friction is a function of the yield stress of the 
deforming body. A general benchmark of friction coefficients for different bulk 
metal forming processes are listed in Table 10. It is important to note that the 
lubricant used on the tooling may play a large role in the value of friction stress. The 
friction will in turn affect the metal flow at contact surfaces. 

Hybrid Friction Model 

A hybrid friction model (combination of the coulomb and the constant shear 
models) is often used when rolling or elastoplastic deformation (for springback) 
is considered. The general function is as Eq. 23. 


x = pp (pp < mk) 
t = mk (pp > mk) 


( 23 ) 
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Fig. 24 Illustration of hybrid 
coulomb and constant shear 
friction model 



This model describes that at low normal pressure, the friction is proportional to 
the normal pressure whereas at high normal pressure, the friction is proportional to 
the workpiece shear stress. The frictional behavior is illustrated in Fig. 24. 

General Friction Model (by Wanheim and Bay) 

The general friction model proposed by Bay and Wanheim (1976) is a very popular 
friction model which also takes into consideration the different friction behaviors 
under low/high normal pressure. 


x = l'cxk (24) 

where x and k represent the friction stress and shear yield stress of the material, 
respectively, f is the friction factor and a is the real contact area ratio. This equation 
can also be treated as a combination of coulomb and constant shear models as at 
high normal pressure, both f and a stay constant, which implies a constant m value: 

^ = foe = m (25) 

k 

At low normal pressure, the real contact area ratio a exhibits a linear relationship 
with normal pressure p. With constant f and k, x has a linear relationship with p, 
which turns into a typical coulomb friction model. An illustration of the general 
friction model showing such a combined nature can be found in Fig. 25. 

Advanced Friction Models 

There are numerous advanced friction models to accurately capture the interaction 
between the workpiece and die under varying processing conditions. These models 
take into consideration the influence of the time, interface pressure, interface 
temperature, and surface stretch of the deforming workpiece or even a combination 
of these. Normally, the empirical models for the pressure, strain rate, and sliding- 
velocity-dependent friction coefficients can be obtained, as listed below. 
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Fig. 25 Illustration of the 
general friction model 
function 
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Pressure-dependent friction coefficient. 

m = m 0 (l — e _otp ) or/and p = p 0 (l — e _ap ) (26) 

where p is the normal pressure and a is a constant with typical values ranging from 
0.012 to 0.06. 

Strain-rate -dependent friction coefficient 

m = mo(l + ae) or/and p = p 0 (l + ote) (27) 

where e is the effective strain rate and a is a constant with typical values ranging 
from 0.0012 to 0.0045. 

Sliding-velocity-dependent friction coefficient. 



m 0 


o 


a 


or / and p 





where o s is the sliding velocity and a is a constant with typical values ranging from 
0.0016 to 0.014. 

These models provide additional options for users to accurately capture the 
friction behaviors during bulk metal forming process if such a process exhibits a 
strong sensitivity to friction. However, most conventional processes are not 
extremely sensitive to friction, and the typical values listed above may be adequate 
for initial process design and load prediction. 


Material Mechanical Properties Representation 

The material mechanical properties here are mainly the materiaTs flow stress 
during the forming processes. It is a fundamental parameter to determine toque 
and power of metal forming equipment and is defined as a stress that results in the 
material flow in a one-dimensional stress state. Initially we focus on the models that 
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Fig. 26 Stress-strain curve 
of a typical ductile metal 
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describe the material flow stress curves. Subsequently, multiple-dimensional 
stresses are introduced, with their anisotropy (orientation-dependence) considered 
in terms of yield function. 

Elastic Region 

A classic ductile metal stress-strain curve resembles that in Fig. 26, in which the 
two distinct material behaviors, elastic and plastic regions, are shown. For elastic 
region (till yield stress), Hooke’s law applies to relate the stress a with strain e, 

a = Ee (29) 

where E is the modulus of elasticity, or Young’s modulus. As it is a linear 
relationship, the modulus can be determined from the slope of the engineering 
stress-strain curve in the elastic region. 

Power Law 

In the plastic region, work hardening is observed for plastic materials. Work 
hardening is the strengthening of a metal by plastic deformation. This strengthening 
occurs because of dislocation movements and dislocation pile-up within the crystal 
structure of the material. Perhaps the most common mathematical description of 
such work hardening phenomenon is the power law, which is an empirical 
stress-strain relationship obtained by fitting an exponential curve to the experimen- 
tal data points of the flow stress curve. 

a = Ke 11 + c>o (30) 

where K is the strength coefficient and n is the strain-hardening exponent coeffi- 
cient, while a 0 can be seen as the initial yield value. Figure 27 provides an example 
of the power law fitting of Inconel 718 alloy flow stress data. 
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INCONEL 718, Annealed 

1800 i 



Strain 

Fig. 27 Flow stress of INCONEL 718 (annealed) 


Linear Hardening 

Another popular (but to a lesser extent accurate) work hardening rule is the linear 
hardening law. It assumes that the flow stress is proportional to the strain in the 
plastic region with hardening coefficient of H, which in principle is smaller than the 
value of the material Young’s modulus. Such rough approximation is especially 
helpful when limited material data are available (i.e., only yield stress, UTS, and 
elongation are provided). The linear hardening rule is presented as 

a = c>o + He (31) 

where both a 0 and H are dependent on temperature and the dominating atom 
content (in steel, the dominating atom content is carbon percentage). Figure 28 
gives an illustration of the linear hardening flow stress curve. 

In this law, if the linear hardening coefficient H is set to 0, the material behavior 
becomes elastic-perfect plastic, which is a special case of the linear hardening law. 

Johnson-Cook Flow Stress Model 

JC flow stress model is perhaps the most famous empirical flow stress model that 
takes strain rate and temperature effects into consideration. Such rate-dependent 
inelastic behavior of solids is called viscoplasticity in continuum mechanics theory. 
The general form of a JC flow stress law looks like 


206 


E. Ghassemali et al. 


Fig. 28 Illustration of the 
linear hardening law 



a 


(A + Be") (1 + Cln(e*)) (1 


T 


*\m 


(32) 


where e is the equivalent plastic strain, s is the normalized plastic strain rate, T is 
the normalized equivalent temperature, and A, B, C, n, and m are material constants. 
The normalized strain rate and temperature in the equation above are defined as 



and T* 


(T - Tp) 
(T m - T 0 ) 



where Sq is the effective plastic strain rate of the quasi-static reference test used to 
determine the yield and hardening parameters A, B, and n. T 0 is a reference 
temperature, and T m is a reference melting temperature. For conditions where 
T* < 0, t is assumed that m = 1. 

It is also worth noting that in JC flow stress rule, if the influences of the strain 
rate (second term in the equation) and temperature (third term in the equation) are 
ignored, the equation becomes a classic power law mentioned before. 


Microstructure-Based Flow Stress Model 

This flow rule or the so-called Taylor equation (Taylor 1934) is relatively new to the 
industry. However, it is very popular among the academia in solid mechanics 
research field, which provides a good link between the microscopic and macro- 
scopic levels by explaining the strain-hardening phenomenon using the dislocation 
pile-up theory. A general form of the equation is 


a = a 0 + aGb^ (34) 

where a is a dimensionless coefficient, G is the shear modulus, b is the Burgers 
vector, and p is the dislocation density. The strain hardening occurs when the 
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dislocations are piling-up during plastic deformation, by which their density 
increases and results in increment of material flow stress. The relationship between 
plastic deformation and dislocation density needs to be provided alongside this 
Equation. 


Poisson's Ratio 

The equations above are all for one-dimensional stress state. If multiaxial stress 
state is considered, the material may behave differently from that of the 
one-dimensional. In the elastic region, for instance, the material may show different 
Young’s modulus if uniaxially loaded in different directions. This is so-called 
anisotropy in elasticity. Most of the metals exhibit such anisotropy in elastic region 
but not in a large amount. More often, when a piece of metal is tensile loaded in one 
dimension, the other two dimensions will shrink accordingly to accommodate such 
shape change. Such phenomenon is called Poisson’s effect, with ratio of transverse 
to axial strain named as Poisson’s ratio. Most materials have Poisson’s ratio values 
ranging between 0.0 and 0.5. Majority of steels and rigid polymers when used 
within their elastic limits (before yield) exhibit Poisson ratio values of about 0.3, 
increasing to 0.5 for plastic deformation. Rubber has a Poisson ratio of nearly 0.5. 


Yield Function: Von Mises 

Yield function is a function describing the material yield point when triaxial stress 
is presented. For uniaxial loading, the yield point can be easily identified in the 
stress-strain curve as the transition point between linear elastic and plastic regions. 
When triaxial stress state is considered, if the material is isotropic, the yield 
condition should be based on the von Mises yield stress value a y . When the von 
Mises stresses or equivalent tensile stress a v > a y , then the material plastically 
deforms. If a v < = a v , then the material only deforms elastically. 



where are the principle stresses (normal stresses in the directions without 

any shear stresses) and 


/ < c> y , elastic deformation 

( a v > a y , plastic deformation 

It is worth noting that in this case, if uniaxial stress is considered, oi 7 ^ 0, a 2 = 
a 3 = 0, therefore the von Mises criterion simply reduces to cq = a v , which is the 
yield point of uniaxial loading. 
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Yield Function: Hill's 

The quadratic Hill’s yield criterion has the form 

F(c xx - cf yy ) 1 2 + G(c yy - c zz ) 2 + H(c xx - c zz ) 2 + 2Lc yz + 2M c zx + 2Nc xy = 1 

(36) 

Here F, G, H, L, M, and N are constants that are needed to be determined 
experimentally. The quadratic Hill yield criterion depends only on the deviatoric 
stresses and is volumetric stress independent. It predicts the same yield stress in 
tension and compression. It is especially useful when the material has strong 
texture, for instance, for rolled plates or hot-extruded billets. In materials science, 
texture is the distribution of crystallographic orientations of a poly crystalline 
sample. A workpiece in which these orientations are fully random is said to have 
no texture. If the crystallographic orientations are not random, but have some 
preferred orientation, then the sample has a weak, moderate, or strong texture 
depending on the number of crystallites sharing the sample orientation. The mate- 
rial properties show a strong anisotropy because of the texture, and the phenomenon 
of anisotropic yield can be accounted for using Hill’s yield criterion. The disad- 
vantage associated with the criterion is that there are quite a number of coefficients 
(six in the quadratic form) to be determined (hence multiple tests are required) 
before it can be applied to finite element simulation. 

Lankford Coefficient 

Earlier we briefly discussed the anisotropy in elasticity and anisotropy in yield. The 
anisotropy in plasticity is in no way simpler than that of elasticity or yield. A 
commonly used plastic anisotropy indicator is the Lankford coefficient (also called 
Lankford value or R- value). This scalar quantity is used extensively as an indicator 
of the formability of recrystallized low-carbon steel sheets. Its definition follows: 

If x and y are the coordinate directions in the plane of rolling and z is the 
thickness direction, then the Lankford coefficient (R-value) is given by 

R = % (37) 

£z 

where e£ y is the plastic strain in-plane and ef is the plastic strain through the 
thickness. In practice, the R -value is usually measured at 20 % elongation in a 
tensile test. 

For sheet metals, the R-values are usually determined for three different direc- 
tions of loading in-plane (0°, 45°, and 90° to the rolling direction) and the normal 
R-value is taken to be the average 

1 

R = - 
4 



(38) 
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Fig. 29 Optical micrograph 
of AA6061 aluminum alloy 
showing polycrystalline 
structure 



The planar anisotropy coefficient or planar R-value is a measure of the variation 
of R with angle from the rolling direction. This quantity is defined as 

R p = - (Ro — 2 R 45 + R90) (39) 

It has been widely recognized that anisotropy is closely linked with the material 
micro structure. The evolution of material micro structure during bulk forming 
process may greatly influence the end product’s structural integrity. Hence, under- 
standing the micro structural behavior, before and during the forming process, is one 
of the main focuses. 


Simulation of Microstructure Evolution During Bulk Metal Forming 
Processes 

The finite element simulation of micro structure evolution is a very hot topic in 
research communities. Divided opinions exist in many areas, even on the defini- 
tions of some fundamental mechanisms. Therefore, in this section, only the most 
popular definitions and generalized equation forms are provided. 

To start, an introduction on the metal microstructure has to be provided. Metal 
alloys are unusually polycrystalline solids, which consist of many crystallites that 
are small, often microscopic crystals that are held together through highly defective 
boundaries. Metallurgists often refer to these crystallites as grains (grain size 
~30 pm). Figure 29 provides an example of the grain structure in the steel. 

They are normally of different orientations and separated by grain boundaries. 
Grain boundaries are interfaces where crystals of different orientations meet. Grains 
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in the metal change shapes and orientations during forming process, which in turn 
influence the material mechanical properties. Modeling such changes may not be 
easy as there are thousands of grains in the workpiece and capture of individual 
behavior becomes computationally impossible. Therefore, such changes are always 
modeled using statistical methods. To represent the grain structure before defor- 
mation, methods like cellular automata (Wolfram 1983) and voronoi tessellation 
(Voronoi 1908) are commonly employed. A value of average grain size is of course 
another option (although very rough). 

For the grain structure evolution during deformation, numerous phenomenolog- 
ical models have been developed in this area, and controversies exist on the 
definitions of various recrystallization mechanisms. However, the computational 
algorithms behind them are similar: in each time step, local temperature, strain, 
strain rate, and evolution history, the mechanism of evolution is determined, and 
then the corresponding grain variables are computed and updated. In the condition 
that all the phenomena can be divided into the following three microstructural 
evolution groups, then in each group the corresponding mathematical function can 
be used to describe such evolution. 

Dynamic re crystallization ( DRX ) occurs during deformation and when the strain 
exceeds the critical strain. The driving force is dislocations annihilation. 

Static recrystallization occurs after deformation and when the strain is less than 
the critical strain. The driving force for static recrystallization is dislocations 
annihilation. The recrystallization begins in a nuclei-free environment. 

Grain growth occurs before recrystallization begins or after recrystallization is 
completed. The driving force is the reduction of grain boundary energy. 


Dynamic Recrystallization 

The dynamic recrystallization is a function of strain, strain rate, temperature, and 
initial grain size, which change in time. It is very difficult to model dynamic 
recrystallization concurrently during forming as this has the possibility of creating 
numerical instability. Instead, the dynamic recrystallization is computed in the 
group immediately after the deformation stops. The average temperature and the 
strain rate of the deformation period are used as inputs of the Equations. 


Activation Criteria 

The onset of DRX usually occurs at a critical stain e 




(40) 


where e p denotes the stain corresponding to the flow stress maximum 


8p = aid^e^e^ 1 /^ + ci 


(41) 


in which d 0 is the initial grain size, R is the gas constant, T is the temperature in 
Kelvin, and Q is activation energy. 
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Kinetics 

The Avrami equation (Avrami 1939) is used to describe the relation between the 
dynamically recrystallized fraction X and the effective strain. 



1 — exp 



a 10^p 


hd 


£ 0.5 


where e 0 .5 denotes the strain for 50 % recrystallization: 





Grain Size 

The recrystallized grain size is expressed as a function of initial grain size, strain, 
strain rate, and temperature 

d rex = a 8 d^ 8 e n 8 e m 8 e (Q8/RT) + c 8 (44) 

(if d rex > d 0 then d rex = 0 ) 


Static Recrystallization 

When deformation stops, the strain rate and critical strain are used to determine 
whether static recrystallization should be activated. The static recrystallization is 
terminated when this element starts to deform again. 

Activation Criteria 

When strain rate is less than e sr , static recrystallization occurs after deformation. 

e sr = Aexp(bi - b 2 d 0 - Q 2 /T) (45) 


Kinetics 

The model for recrystallization kinetics is based on the modified Avrami equation. 


X 


srex 


1 — exp 


P 


t 

to. 5 


h, 


where f 0 .5 is an empirical time constant for 50 % recrystallization 



to. 5 = a 3 d h 8 e n 3 e m 3 e (Q8/RT) 


(47) 
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Grain Size 

The recrystallized grain size is expressed as a function of initial grain size, strain, 
strain rate, and temperature 

drex = a6do h6 8 n6 8 m6 e^ Q6 ^ RT ^ + C6 (48) 

(if d rex > do then d rex = 0) 


Grain Growth 

Grain growth takes place before recrystallization starts or after recrystallization 
finishes. 

The kinetics is described by equation 




+ acjtexp 




Retained Strain 

When recrystallization of a certain type is incomplete, the retained strain available 
for following another type of recrystallization can be described by a uniform 
softening method: 



Temperature Limit 

The temperature limit is the lower boundary of all grain evolution mechanisms. 
Below this temperature, no grain evolution occurs. 

Average Grain Size 

The mixture law is employed to calculate the recrystallized grain size for incom- 
plete recrystallization: 


d = X rex d re x + ( 1 - X re x)d 0 (5 1) 

Based on the abovementioned equations, the evolution of the micro structure 
during bulk forming process can be estimated. 


Fracture Prediction in Bulk Metal Forming 

Perhaps one of the most important questions that mechanical engineers would like 
to ask is when the materials will fracture/damage during the forming process. The 
answer to this question depends on the geometry of the workpiece, the boundary 
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condition, and the material properties. As the first two factors have already been 
taken into consideration during simulation using finite element method, the focus 
here will be placed on the materials. There are many numerical models available 
which intend to provide the material damage criteria under different loading 
conditions. They may consider damage as a progressive process with initiation 
and evolution at different stages of loading. Indeed, in most of the metals, the 
ductile damage dominates, which is a process due to nucleation, growth, and 
coalescence of voids in ductile metals. However, in our case, the damage is treated 
as an instantaneous event with a single-value indicator to determine that particular 
damage for ease of application. Below listed are some of the most commonly used 
ones in the bulk forming areas: 

Maximum Principle Stress/Ultimate Tensile Strength 

Perhaps the easiest criterion that one can immediately think of is the comparison 
between the current stress (a) state and the maximal principle stress or UTS (g uts ). 
The critical value is given by the ratio between them as 

a a 

a = — < a c or a = < a c (52) 

Gi guts 


Cockcroft and Latham 

This is the most commonly used fracture criterion with bulk deformation (Cockcrof 
and Latham 1968), which states 


’6f 


o*de < C 


(53) 


0 


where o is the maximum tensile stress in the work piece, e f is the strain at fracture, 
and C is the C&L constant. This method has been used successfully to predict 
fracture in edge cracking in rolling and free-surface cracking in upset forging under 
conditions of cold working. 


Rice and Tracy 

This model is defined as a function of mean stress and effective stress, a is the 
model coefficient. 



u 



ao m 

e ° 


de < C 



Brozzo 

Brozzo model is defined as a function of principal stress and mean stress: 



i/ 




de < C 



The disadvantage associated with above criteria is that all these methods predict 
the damage based on a certain critical value, which can only be determined 
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experimentally. Moreover, the damage point in the experiment is not easy to 
identify as it is a progressive process. To make the situation even worse, the critical 
value varies from material to material, and sometimes different setups may also 
contribute to such deviation. Therefore, these fracture/damage values can only be 
used as a rough guideline for the process design. 

In this section, we had a brief overview of the challenges in utilization 
and application of FE simulation to design and optimize the forming pro- 
cesses. Issues such as the material friction behaviors, mechanical properties, 
microstructural evolution, and fracture prediction were covered. Surely the 
challenges in the simulation are far greater than those presented here. 
However, this provides a flavor to the readers of this handbook on the 
complexity the bulk metal forming processes have in terms of modeling 
and simulation. 


Lubrication in Bulk Metal Forming 
Introduction 

Friction in bulk metal forming is defined as the resistant force against relative 
movement of the die and workpiece. This can affect the metal flow, surface quality, 
and stresses on dies. In most cases, friction is undesirable, except in some rare cases 
such as rolling, which could not proceed without friction. 

The characteristics of frictional condition in bulk metal forming are as follows: 

• Sliding under high pressure, larger than yield stress of the workpiece 

• Plastic deformation in the workpiece 

• Surface expansion of the workpiece 

• High temperature of the workpiece (hot forming) 

Although there have been a lot of reports on the subject of friction in metal 
forming processes, its mechanism is still not fully understood. It is preferred to 
reduce the friction in bulk metal deformation by using proper types of lubricants to 
lower the frictional resistance between the die and the workpiece and to present 
wear as well as galling on the tools. 

Based on the form of the contact between the tool and the workpiece, the various 
friction and lubrication conditions can be summarized as in Table 11. 

The relative friction coefficient of various friction regimes is typically shown in 
Fig. 30. This so-called Stribeck curve is useful for the determination of the optimum 
lubricant based on the process parameters. 

Various types of friction regimes at the interface have been shown schematically 
in Fig. 31. Boundary lubricants have significant effects especially in microscale 
forming processes, since they can trap the lubricants in this scale and reduce the 
forming load. 
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Table 1 1 Different friction regimes (Mang et al. 2011) 


Friction 

regime 

Definition 

Remarks 

Relative 

friction 

level 

Solid 

friction 

(dry) 

When there is no separating 
layer (except oxide layers) 
between two solids in direct 
contact in metal forming 

Desirable in only rare situations 
such as hot rolling of plates and 
slabs 

High 

Boundary 

friction 

There is a molecular layer of 
chemical substances covering 
the contacting surfaces. The 
lubricant layer is created from 
surface-active substances and 
their chemical reaction products 

Practical when thick long-lasting 
lubricant films are technically 
impossible to achieve in a 
variety of geometrical and 
thermal conditions 

Medium 

Fluid film 
friction 

When a thick layer of a 
hydrodynamically formed 
lubricant is present between 
contacting partners 

Only works when the interfacial 
normal pressure, temperature, 
and relative speed of die and the 
workpiece are low 

Medium 

Mixed 

friction 

Combination of the fluid and 
boundary frictions 

Using the appropriate lubricants 
containing special organics, the 
machine elements experience 
mixed friction when starting and 
stopping their operation 

Low 


Fig. 30 Typical Stribeck 
curve for evaluation of liquid 
lubricants 


Dry 




Lubricant Selection 


The selection of the lubricant type depends on many factors, hence is empirical, 
with a very little analysis-based information. For instance, the lubricant choice in 
cold forging depends on the process parameters such as normal pressure and surface 
expansions. In that sense, upsetting of small specimens does not need the same 
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Fluid lubrication 


Real contact 



Area of dynamic Area of 

fl u i d 1 ubri cat ion boundary 

lubrication 


Mixed 



Real c o utaet L ub ri c ant (Mi cro pool ) 



<=> <0 o o 


Boundary lubrication 


Fig. 31 Schematic of differences between mixed and boundary lubrication 


Table 12 Typical lubricants being used for different forming processes 


Type of lubricant 

Typical lubricant 

Liquid lubricant for cold working 

Mineral, synthetic, and vegetable oil; wax 

Solid lubricant for cold/hot working 

Graphite; MoS 2 ; BN; metallic soap, glass 

Chemical conversion coating for cold working 

Zinc phosphate + metallic soap 

Aluminum fluoride + metallic soap 



Fig. 32 Schematic of disadvantages of using improper lubricants 

lubricant efficiency as conventional backward extrusion. Scaling down the bulk 
forming process to microscale sometimes brings up the size effect on the friction 
behavior, which makes the matter more complicated. 

There are many types of lubricants in use in the industry. The most common 
types may be categorized as listed in Table 12. 

Poor selection or poor application of the lubricant type may cause partial direct 
contact with high pressure between tool and workpiece (Fig. 32). This arises 
microscale adhesion which leads to galling or seizure or scratch on the final 
part’s surface. 
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Table 13 Different methods for applying lubricants on the surface (Mang et al. 2011) 


Application 

method 

Definition 

Remarks 

Dripping 

Dripping the liquid lubricant on the 
blank during the process 

Cheap and simple, but difficult to 
control the proper amount of desired 
lubricant 

Roll coating 

Lubricant is applied on the blank 
moving between two rollers with a 
certain pressure 

Precise control of the amount of 
lubricant, but applicable only for 
rolling processes 

Physical/ 

chemical 

coating 

Physical/chemical or electrical 
deposition of a layer of solid lubricant 
on the blank 

No lubricant wastage, but relatively 
more expensive 

Spraying 

Spraying a controlled amount of liquid 
lubricant on the blank during the 
process 

Minimal lubricant wastage, but it 
does not work for high viscous 
lubricants 


Application of Lubricants 

The most commonly used lubricant application methods can be summarized in 
Table 13. 


Post-Metal Forming Considerations in Lubricant Selection 

With increasing demand from industries, it is vital to protect the environment from 
polluting chemical lubricants during or after metal forming. To do so, some 
procedures for using biodegradable lubricants have been proposed especially for 
Euro-zone countries. As an alternative, technologies towards reusing of the waste 
lubricants have been developing. 

Furthermore, lubricants remained on the part surface may affect the subsequent 
processes such as welding and painting. Thus, the lubricants must be easily remov- 
able from the formed parts after the process, without environmentally hazardous 
effects. 


Methods for Evaluation of Lubricants 

To have the optimum lubricant selection, it is essential to evaluate the lubricants. 
There are many methods that have been proposed for lubricant evaluation. 
These methods have been designed based on the nature of the forming process 
(e.g., ring compression test is the best for evaluation of the upset forging process, 
since both methods involve compression stresses on the workpiece, while 
for more severe deformation conditions in extrusion, the double-cup backward 
extrusion test is a proper choice). Here only two of the most common tests are 
presented. 
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Initial Ring 




High Friction 



l . 



Low Friction 




Fig. 33 (a) Schematic of the friction effects on the final inner radius in the ring compression test; 
(b) determination of the shear friction factor using ring compression test and finite element 
simulation 


In general, for evaluation of the friction in hot metal forming, it is advised to 
consider the die chilling effect, contact time, and forming speed. 

In ring compression test, a flat ring shape specimen is compressed to a defined 
reduction. The change in internal and external diameters of the ring depends 
significantly on the friction at the interface. Thus, changing the friction factor 
would change the behavior of the internal radius dimension during 
compression test. 

To obtain the friction value for a specified lubricant, the change in the internal 
diameter of the pin in the experiment is compared with that obtained from finite 
element simulation (i.e., FE simulation of the ring compression test in similar 
dimensions using different friction factors). One typical example is plotted in 

Fig. 33. 

The outcome could be considered as universal as changes in material properties 
(i.e., strain hardening) have no significant effect on the result. 

For more severe deformation conditions (e.g., extrusion), the double-cup back- 
ward extrusion test may provide a more accurate evaluation for lubricant behavior. 
As shown in Fig. 34, this test consists of a forward and backward extrusion 
processes. The ratio of the cup heights (H1/H2) is very sensitive to the frictional 
behavior. With increasing friction, the cup heights ratio increases. Again here the 
FE simulation helps in finding the exact value of the friction coefficient. 

It is important to note that the upper punch is moving while the lower punch is 
stationary during the test. This leads to a relative velocity between the upper punch 
and the container, while there is no relative movement between the container and 
the lower punch. Consequently, the material flow behavior is different in upper and 
lower portions, which explains the reason behind the difference in the upper and 
lower cup heights. This test is relatively more dependent on the material properties. 
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Fig. 34 Schematic of the 
double-cup backward 
extrusion test 



Moving 

Punch 


Stationary 

Punch 



D 



As mentioned, many factors may affect the value of the friction factor. Thus, it is 
important to know the region where the friction test has been done. 


Tool Manufacturing and Material Selection for Bulk Metal 
Forming 

Introduction 

Since a considerable portion of the bulk forming process cost is from die 
manufacturing, the tool (die and punch) must be manufactured by modern 
manufacturing methods using appropriate material to provide a reasonable tool 
lifetime at an affordable cost. 

For a given forming process, the type of the tool material depends mainly on the 
maximum tool stress and the temperature of forming. Dies need to be changed 
before their lifetime expires. The lifetime of a die is affected by wear, plastic 
deformation, galling/seizure, corrosion, and fatigue cracking, etc. Many of these 
defects occur due to high temperature, high stress, and severe friction. Thus, to 
prevent severe deterioration of dies in bulk metal forming, special materials must be 
used and proper tool design employed. Using a proper lubricant can also help 
increase the die lifetime. 


Tool Material Selection 

In general, the material properties that determine tool material selection for the 
metal forming process can be summarized as listed in Table 14. 

In cold bulk metal forming, tool material selection depends mainly on the stress 
levels (type of deformation). For instance, in forward extrusion, the punch needs to 
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Table 14 Tool material selection criteria 


Material 

properties 

Definition 

Remarks 

Hardenability 

The depth to which a metal can be 
hardened (it is not related to the 
hardness value) 

Higher alloying elements increases 
the hardenability 

Wear 

resistance 

A gradual change in tool dimensions 
or shape due to corrosion, 
dissolution, or abrasion 

A high hot hardness value is required 
for wear resistance in hot forming - 
Mo and W alloys improve the wear 
resistance 

Yield 

strength 

Resistance to plastic deformation 
measured by yield strength 

Higher hardness leads to higher 
strength, but lower toughness 

Toughness 

Ability to absorb the forming energy 
without cracking - combination of 
strength and ductility 

Higher hardness lowers the impact 
strength; thus medium-alloy steels are 
the best in this case 

Resistance to 
heat cracking 

Caused by nonuniform thermal 
expansion at the surface and center of 
the tool 

It is critical for hot metal forming 
process to have a die material with 
high thermal conductivity; Mo 
alloying element increases this 
resistance 


have high compressive strength, whereas in backward extrusion, it needs to have 
high wear resistance as well. Normally, in conventional cold bulk forming, based 
on the deformation type and tool stress level in the process, the cold working die 
steels (such as D2, D3), or high speed steels (such as M2), are used for die material. 

Tool materials that are used for hot bulk metal forming processes can be 
summarized as listed in Table 15. 

Besides the material selection, other parameters such as tool design and the 
workpiece properties may affect the tool lifetime. For instance, it is advised to 
prevent sharp corners in tool making. This prevents stress accumulation on the 
corners, reducing the possibility of cracking. The proper heat treatment to the tool is 
of importance for increasing the tool life. The hard coatings of tools by PVD, CVD, 
and plasma nitriding are the useful methods to improve tool life by reducing the 
galling/seizure and wear. 


Manufacturing Techniques 

After computer-aided design of the tool geometry, the bulk metal forming tools are 
usually fabricated by machining processes. The main die manufacturing process 
may be divided into die design, rough machining, heat treatment, finish machining, 
manual finishing (polishing), or benching and hard coating (if necessary). 

High Speed and Hard Machining 

In conventional die-making techniques, the die is hardened after rough machining. 
This could cause distortion. To resolve the issue, the hard machining method has 
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Table 15 Typical tool material selection for hot metal forming processes (Altan et al. 1983a) 


Designation 

Alloy type 

Main 

Characteristics 

Application 

6G, 6F2, 6F3 
(ASM) 

Low alloy steel 

Good toughness 

Not good for higher forming 
temperatures of 500 °C 

Shock resistant 

Reasonable wear 
resistant 

6F5, 6F7 (ASM) 

Low alloy steel - 
higher amount of 
nickel (2-4 %) 

Good 

hardenability 

Could be used for more 
severe applications than the 
first group 

Good toughness 

6F4 (ASM) 

Low alloy steel - 
higher amount of 
molybdenum 

Age hardening 
capability 

Good for warm forming up 
to around 600 °C 

H10, HI 1, H12, 
H13, H14, H19 
(AISI) 

Chromium-based 
steel alloy 

High resistance 
to softening 

Good for hot metal forming 
in higher temperatures than 

600 °C 

High resistant to 
heat cracking and 
wear 

H21, H22, H23, 
H24, H25, H26 
(AISI) 

Tungsten-based steel 
alloy 

High resistance 
to softening 

Good for hot metal forming 
at severe forming load and 
speed 

Adequate 

toughness 


been replaced nowadays. Instead, the hardened metal (45-62 HRC) is machined, 
preventing the possible distortion, and provides better surface finish. 

Electro-Discharge Machining (EDM) 

EDM is a process that consists of a current-passing-through electrode which pro- 
vides a voltage potential between itself and the workpiece. Decreasing the gap 
between the electrode and the workpiece creates the spark that is required for 
vaporizing the workpiece surface. The removed material is flushed away by the 
EDM fluid. The hardness of the metal does not influence the efficiency of the 
process. Due to its good accuracy and its relative higher required process time, 
EDM functions on much smaller scales than conventional machining. 

The only disadvantage of this method may be the time needed for electrode 
design. Moreover, the sparking process consumes the electrode, limiting the repeat- 
ability of the process. 

The surface of tool sometimes needs to be polished after EDM for removing the 
surface damages caused by EDM and to improve surface roughness. 

Based on the operation, this process is divided into two types of sink and 
wire EDMs. 

In the sink EDM, the internal cavities are made by a copper of graphite electrode. 
The die cavity gets its shape from the electrode (Fig. 35). 

The wire EDM (Fig. 36) is similar to the sink EDM in case of functioning. The 
primary difference is that the electrode is a wire ranging in diameter from 0.05 to 
0.3 mm. 
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Fig. 35 Schematic of the 
sink EDM process 



Fig. 36 Schematic of the 
wire EDM process 



Wire Electrode 



Blank 


Wire Feed Direction 





In the case of superalloy dies, since the hardness value is high, occasionally it is 
better to cast these dies and subsequently obtain the final shape by EDM. 


New Technologies in Bulk Metal Forming Processes 
Press Technologies 

The application of servo-presses goes back to the 1950s where they were used for 
the cutting processes. With the development of transistor controllers, their power 
capacity was improved for forming technologies during 1990s. Prior to that, 
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Table 16 Typical conventional presses according to the performance (Altan et al. 1983a) 



Feature 

Typical presses 

Stroke- 

controlled 

presses 

The rotational movement of the motor is 
mechanically stored in the flywheel, and by starting 
the forming it is converted into the linear slide 
movement 

Crank press, knuckle- 
joint press, linkage 
press 

Energy- 

controlled 

presses 

The rotational movement of the flywheel is changed 
to the linear motion with a screw, and the slide stops 
when the energy stored in the flywheel is consumed 
completely 

Screw press, hammer 

Force- 

controlled 

presses 

The pressure of the working oil is raised by the 
motor, and the press velocity and position is 
controlled by the oil pressure 

Hydraulic press 


a 



b 


Power Supply 


Capacitor 


Servo Motor 


Balance Tank 


Main Gear 



Brake 


Drive Shaft 


Fig. 37 (a, b) Comparison of the conventional mechanical press and mechanical servo-presses 


conventional mechanical presses were broadly used for metal forming. These 
presses are categorized in three categories as summarized in Table 16. 

All the above mentioned features can be driven by a servomotor without 
requiring a flywheel and clutch. In other words, the mechanical servo-presses 
offer a combination of the flexibility of hydraulic presses with speed, accuracy, 
and reliability of mechanical presses. In Fig. 37, the operation mechanism of both 
types of presses is shown. 

Normally the major part of the overall forming energy is required during punch 
acceleration. In conventional mechanical presses, a portion of this energy is stored 
in flywheels, while in hydraulic presses, this energy is wasted. In servo-presses, 
however, this energy is stored in electronic capacitors, with relatively higher 
efficiency. 
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Table 17 Comparison of the different types of presses 



Conventional mechanical 
press 

Conventional hydraulic 
press 

Mechanical servo- 
press 

Speed control 

Low 

Medium 

High 

Flexibility 

Low 

Good 

High 

Accuracy 

Medium 

Medium 

High 

Energy 

consumption 

High 

Medium 

Low 

Noise 

High 

High 

Low 

Maintenance 

cost 

High 

High 

Low 


By precise control of the punch speed, the hitting velocity of the punch to the 
specimen can be controlled. This can help in decreasing the noise due to punch- 
workpiece impact. Furthermore, it improves the die lifetime and results in higher 
efficiency of the presses. 

Moreover, with a programmed punching velocity throughout the process, heat 
generation due to high strain rate can be controlled which provides better die 
lifetime with consistent product quality. Furthermore, the control over punch 
velocity can increase the forming process speed, in case a progressive forming 
process is desired. Table 17 compares different presses. 


Bulk-Sheet Metal Forming Processes 

With the growing competitive industrial vibe, it is important to develop into cost- 
effective production processes. Especially for some automotive components, it is 
suggested to incorporate bulk metal forming processes into sheet metals to produce 
high-quality sheet metal components commercially. 

Sheet-bulk metal forming (SBMF) processes are defined as sheet metal forming 
where the flow occurs in three dimensions similar to bulk metal forming. The main 
characteristic of these processes is that the final product has the dimensions of a 
magnitude similar to the sheet thickness, projecting out of the plane of the sheet. 
Based on this characteristic, only some special bulk forming processes can be 
applied on sheet metals as summarized in Table 18. 


Micro-Bulk-Sheet Metal Forming 

Production of very small parts is a trend in many technical areas such as electronics 
and medical industries (Chinesta et al. 2007). In general, metal forming is well 
suited for efficient production of micro-parts. Near-net shape and excellent 
mechanical properties of the final product, together with the mass production 
capability and lower manufacturing cost, have made this route an interesting 
alternative. A few years after the introduction of microforming processes by Geiger 
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Table 18 Different types of SBMF processes 


SBMF Process 

Application 

Remarks 

Upsetting/ 

ironing/flow 

forming 

Sheet thinning, 
sizing 

Combined upset/forging/drawing of sheets can be used 
for production of features such as nails, screws, and 
flanges 

Forging 

Sheet thinning 
and thickening 

Usually done in closed-die manner to have 
deformation flow normal to the sheet surface. 
Notebook cases or cell phone shields can be named as 
product examples 

Orbital forming 

Sheet thinning 
and thickening 

Relatively smoother surfaces, lower forming load, and 
less noise compared to forging and extrusion, but 
longer process time. Bevel gears and hollow ring gear 
parts are some product examples 

Coining/ 

embossing 

Sheet thickening 

Efficient process for production of small features at the 
sheet surfaces 


et al. (1996), many studies have been conducted towards the development of micro- 
parts manufacturing (Engel and Eckstein 2002; Okazaki et al. 2004; Ghassemali 
et al. 2011). 

Despite the relatively wide range of research in this field, microforming pro- 
cesses have not been adopted extensively for mass production in the industry. This 
is related to issues such as handling of micro-parts and even removal of the formed 
parts without damage which require further process and manufacturing system 
design and development. It was stated by Engel et al. (2007) that handling of 
parts is less difficult in sheet-bulk metal forming processes, since the parts usually 
remain connected to the strip. This is a big advantage for the sheet metal forming 
processes as they are scaled down to the micron level. 

Hirota (2007) suggested the use of sheet metal for production of micro-billets. 
However, a disc specimen with a predefined diameter was used as the initial raw 
material for the process in his study. A counterpunch was used under the formed pin 
to the push up after the pin forming process. However, as the pin diameter decreases 
in size and the surface area-to-volume ratio increases, such a pin removal method 
will become a challenge due to the greater risk for damage caused to the pin. 

Based on the concept by Hirota, a progressive micro-bulk- sheet pin forming 
system was designed and developed by Ghassemali et al. (2012). The system has 
the following advantages: (i) it can circumvent the handling issues of small billets 
needed for extruding pins of very small diameter; (ii) such a process uses a strip as 
the workpiece and is more productive as a progressive process can be implemented; 
and (iii) instead of ejecting the formed pin, the system uses a blanking process to 
remove the formed pin from the strip material, eliminating the possibility of 
buckling damage if a counterpunch is used as an ejector. Figure 38 shows the 
schematic of the process. 

As can be seen in this figure, the process setup consists of two stages: (i) pin 
forming by forward extrusion and (ii) blanking. In the first step, the strip is 
deformed by a punch of a defined diameter and specified displacement. As a result, 


226 


E. Ghassemali et al. 



anKingnjnc 
Di a meter 


Punch Diameter 


Pin Diameter 


Stage I = Extrusion 


Punch 


Guider 


i 


Fig. 38 Schematic of the progressive microforming process 
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a 






Fig. 39 Micro-pins produced by progressive microforming process: (a) before and (b) after the 
blanking process, (c) FESEM micrograph of the final micro-pins 


a portion of the material is forward extruded into the die cavity. The strip is inserted 
via guides on the setup. To ensure the precise positioning, guide holes are created 
on the edge of the strip and guide pins are used along the different stages of the die 
setup. The spring-loaded blank holder was used in this lab-scale study to distinguish 
the forming load from the blank-holder load. In industrial applications, the blank 
holder is usually attached to the press. 

After extrusion in stage I, the strip with the attached extruded pin is ejected by 
springs and can be moved to the next stage. 

It is noteworthy that the forming process only occurs in stage I of the process. In 
other words, stage II is only used for detaching the pins from the strip. Thus, the 
formed micro-pin at stage I is still attached to the strip which makes it easier for 
handling and transferring to the next stage either for blanking or for subsequent 
measurements. No counterpunch is used in this process, and the ejection by the 
springs on the strip only leads to the withdrawal of the formed pin from the forming 
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Punch Displacement, mm 


Fig. 40 Load- stroke curve for the 0.8 mm pin produced by 3.2 mm punch 


die. The feasibility of this process for production of micro-pins of diameters 
between 100 pm to macro-scale has been proven (Ghassemali et al. 2012, 2013a). 
It is important to note that due to its axisymmetric geometry, the progressive 
microforming process has a good capability of producing other symmetrical 
micro-parts such as hollow pins, stepped pins, or cups. 

After stage I, no significant damage in terms of pin fracture and buckling was 
observed in the experiments, as can be seen from Fig. 39. This was indicative that 
the ejection process of the formed micro-pin by the ejecting springs on the strips 
does not cause any damage on the formed pin in this process, although there were 
galling effects on the pins’ surface. Developing hard-coating techniques seems 
essential to be able to coat the very small die orifice (Ghassemali et al. 2013a). 

The three stages in the process can be seen also in the load-stroke curves. 
Figure 40 shows a typical load-stroke curve for the progressive microforming 
process. Almost a similar behavior was observed in the punch reaction of other 
processes, in which the first stage was contributed to the elasticity and the last two 
stages in the curve was related to the plasticity behavior of material in the process 
(Jiang and Chen 2011). 

In the first stage, loads are relatively low, which corresponds to the elastic 
forming initiation and indentation process. The portion of this stage is relatively 
small. At stage II, upsetting is the main phenomenon mainly due to the less force 
required for this phenomenon compared to that of the extrusion. Till this stage, the 
material mainly flows towards the outside of the punching area rather than moving 
towards the die cavity. After reaching stage III, the load increases rapidly. In this 
stage, the required extrusion force is less than the upsetting, and therefore, the 
extrusion becomes the dominant phenomenon occurring in the process. Therefore, 
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the metal will mostly flow towards the die cavity rather than moving outwards, after 
this stage. The unloading portion of the curve presents the elastic deflection of the 
material and punch essentially due to the force in stage III. This shape of the load- 
stroke curve is similar to what happens in the common impression-die forging 
process (Altan et al. 1983b) as reported also for bending process (Jiang and Chen 
2011 ). 

Based on this phenomenological study, this process has been optimized using 
upper bound theory (Ghassemali et al. 2013b). Using the developed model, it is 
possible to predict the material behavior during the process, with the least material 
wastage and competitive production rate. 
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Abstract 

In this chapter, a review of the materials involved in the metal-forming processes 
and some of the processing required before forming on the materials are pro- 
vided. At first the details of the materials formability definition and applications 
in different forming processes are discussed. Formability is one of the most 
important characteristics of the engineering alloys. Subsequently, information 
on the materials used for metal-forming tools and dies and their selection criteria 
are provided. These tool and die materials are categorized based on the process 
details and their limitation is further discussed. A brief look at the lubricants 
used in metal forming covers a subsequent topic in the current work and focuses 
on the effectiveness, characterization (friction reduction), types, and general 
applications as well as additives used in the lubricants. Lastly, a concise sum- 
mary of the raw material preparation for forming processes is covered, with main 
focus on casting and heat treatment. These are the main preprocessing routes for 
preparing the preform in the industry. This chapter serves as a quick reference of 
forming process material selection for researchers, engineers, and students in the 
mechanical and materials engineering field. 


Introduction 

The transformation of raw ingot to required complex geometry, under large 
plastic deformation, using sophisticated tooling, is known as metal forming. 
For the last century, metal forming has been one of the most widely employed 
processes, to mass manufacture engineered products, with little or no scrap. 
The technology know-how is one of the oldest and very mature. Metal-forming 
process can be classified into bulk and sheet metal forming. The bulk metal forming 
includes forging, rolling, extrusion, and wire-drawing processes, while shearing, 
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Copper Weapons Gold Ornaments 




Gold Coins 


Fig. 1 Antiques of third millennium BC made from copper and gold metals 


punching, blanking, bending, and deep drawing techniques comprise sheet metal- 
forming processes. 

The history of metalworking process date backs to the era of third millennium 
BC when copper and gold metallurgy was invented in the Middle East, primarily for 
making weapons, ornamentation, and coins as shown in Fig. 1. They were manually 
swaged using an iron hammer to produce a variety of products. 

Around second millennium BC, processes such as smelting , which helps to 
purify the impurities in metals from its ore, were explored and understood. This 
important discovery motivated a combination of pure metals to form new alloys 
which satisfied the quest for strength. This was also very evident during the Bronze 
Age in 4000 BC, when copper and tin were effectively used. Mainly, nonferrous 
metals and its alloys were explored until the Iron Age. This delay was due to the 
lack of understanding on high-temperature metalworking and on achieving iron 
purification. 

Similarly, the processes also changed systematically from crude hammering to 
more mechanized forging and rolling. During the Industrial Revolution, at the 
end of the eighteenth century, witnessed major development in various types of 
metalworking techniques and materials with special properties and applications, 
due to the demands of the manufacturing industry. The need to forge large steels 
led to the discovery of higher tonnage metalworking equipments such as mechan- 
ical (screw type) and hydraulic presses. High-speed tandem rolling mills were 
also used to mass produce strengthened steels. Due to the increasing need of 
high-strength materials in the automobile industry, steels, such as low carbon and 
advance high strength, largely benefited because of its ductility and high 
working temperature. The demands of the aerospace industry turned 
attention towards materials with high strength to weight ratios, and focus on 
forming aluminum- and magnesium-based alloys was considered relevant. 
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The formability (addressed in the section “Formability in Sheet Metal Forming”) 
of materials is an important criteria during material selection without 
compromising mechanical properties. 

The need for higher- quality products was driving the search for different kinds of 
materials. With new production processes and technologies, the quality of the 
materials, thereby the products, has improved significantly. With the advent of 
technological advancements, different metal forming processes have been keenly 
studied and are still being explored to improve the productivity. Moreover, the 
process gives a distinct advantage in terms of operational cost, over joining and 
cutting, while manufacturing near-net shape components. Research in new mate- 
rials, pertinent to bulk forming techniques, always has a huge potential, considering 
the future end products, which aims at higher quality. The materials used in forming 
tools (addressed in section “Die and Mold Materials”) are being designed to 
withstand adverse environments, which is a challenge. Moreover from the eco- 
nomic constraints, this plays a crucial role and influences the cost of the product. 
The material catastrophe also interrupts the production engineering and machine 
downtime. 

Properties such as high compressive strength, surface hardness, ductility, wear 
resistance, and reliability are desired in materials used for forming and have to be 
altered to meet specialized applications. Lubricants in metal forming (section 
“Lubricants for Metal Forming”) and raw materials (section “Manufacturing of 
Raw Materials for Forming”) and state of the art in bulk metal-forming process 
have developed considerably. The development of new heat treatment and surface 
coating processes to meet the needs of the forming tool and the end product has to 
be customized. 


Formability of Engineering Alloys 

Most of the engineering alloys during a uniaxial tensile test, after some 
degree of stretching, tend to become unstable, due to the onset of necking. 
With further stretching, there is neck growth in the specimen, and finally a 
fracture is formed in the neck, so that the specimen breaks into two pieces. 
When compression testing was discussed, it was stated that for ductile metals, 
larger strains can in general be obtained in this test than in tensile testing, 
because neck formation is avoided in compression. But also in the compression 
test, there are limitations on how much the specimen can be deformed. This is 
because various cracking phenomena may occur, especially when low-ductility 
materials are tested. 

In metal forming, it is a common problem that the material of the workpiece 
breaks down during forming, in a manner like that for materials subjected to 
technological tests. In this chapter, different phenomena that cause material failure 
during metal-forming operations will be discussed, with special emphasis on failure 
due to necking, tensile stress cracking , and shear cracking. 
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Formability and Workability 

Formability and workability of a material is a measure to which it can be deformed 
in a specific manufacturing process without any surface or subsurface defects or 
breakage. Traditionally, formability has more to do with the sheet metal forming, 
and workability is associated with the bulk forming processes such as wire drawing, 
forging, extrusion, etc. Formability of a material is also related to materials ductility 
defined as the maximum tensile elongation (on percentage basis) at fracture. 
Different phenomena that cause material failure are necking, tensile cracking, and 
shear cracking. Formability of a material also depends up on manufacturing process 
parameters such as temperature, loading strain rate, friction between the die and 
workpiece, reduction ratio, etc. Some of the factors influencing the formability are 
explained below: 

(a) Yield Strength: Materials with low flow stress are easy to deform and are said 
to be malleable. The higher the yield strength of the material, the higher the 
forces and energy are required to accomplish the deformation. 

(b) Material Work-Hardening: The flow stress of a range of engineering alloys 
can be expressed as o — Ks'\ where K is the strength coefficient in MPa and n is 
the strain-hardening exponent. Higher values of n imply the necessity of higher 
forces with increasing deformation, which can lead to tool wear and tear. Low 
values of strain-hardening may lead to localized necking during the deforma- 
tion, which results in nonuniform deformation. 

(c) Modulus of Elasticity: Having a high elastic modulus leads to the increased 
elastic recovery after the deformation forces are removed. For precise dimen- 
sions of the part, the spring back should be compensated. 

(d) Operating Temperature: When the forming temperature is greater than 
0.3 T m (where T m is the melting point of the material in K), it is said to be 
hot working, and it is easier to form the material due to lower stresses. 
Sometimes failures occur at very low strains during hot working due to hot 
shortness , which is due to the presence of liquid phase at the grain boundaries. 

(e) Hydrostatic Stress: High hydrostatic pressure suppresses the void growth in 
engineering alloys, thereby retarding the fracture. Formability of materials with 
limited ductility can be improved if the exit conditions from the die are under 
hydrostatic pressure. Edge cracks in rolling and central bursts in extrusion are 
likely due to the presence of tensile stresses at the exit section rolling press or 
extrusion die, respectively. 

(f) Grain Size: At low temperatures, it is well known that the strength of engi- 
neering alloys is inversely proportional to grain size: smaller-grain-size mate- 
rials have larger grain-boundary area giving rise to the higher resistance to 
plastic deformation by dislocation flow. If coarse-grain material is deformed, 
the plastic deformation leads to “orange peel” type of surface due to the 
different texture and directional orientation of the micro structure in various 
grains. 



236 


S. Idapalapati et al. 



Fig. 2 Surface cracks in (a) steel, (b) aluminium alloy and (c) metal matrix composite 


(g) Chemical Composition: Slag inclusions in the castings, traces of foreign 
elements, and nonmetallic compounds in the raw material change the chemical 
composition and degrade the further formability by secondary processes such 
as forging, extrusion, etc. by becoming the sources for crack initiation. Also, 
the change in chemical composition alters the surface finish. 

Detectable surface or undetectable subsurface defects (e.g., central burst cracks 
in extruded components) appear in the formed component whenever the workabil- 
ity or formability of a material is exceeded. By using either fail-safe design 
principles ox failure analysis diagrams (FADS) for a given load-bearing situation, 
a decision on the acceptance of the component is made during quality check. 
Further, some of the internal cracks in the load-bearing structures grow during 
service, and hence their continuous monitoring using nondestructive methods such 
as radiography, ultrasonic testing, magnetic particle testing, or eddy current testing 
is recommended. Some cases of plastic deformation where the limit of ability to 
form has been exceeded until the material has lost its integrity are shown in Figs. 2 
and 3. 


Stress Cracking Under Uniaxial Tension and Compression 

Ductility of an engineering alloy is the ability to deform plastically without fracture 
and expressed as a measure of the strain at fracture in a uniaxial tension test. 
However, the percentage elongation in a tensile test is often dominated by the 
uniform deformation. The end of uniform elongation coincides with the onset of 
plastic instability accompanied by void nucleation, their growth and coalescence, 
and final fracture by formation of shear lips on the outer surface. Because of the 
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Fig. 3 Shear stress cracks formed in aluminium alloys under (a) cold and (b) hot compression 


nonuniform deformation, Cockcroft and Latham (1968) suggested a criterion based 
on both stresses and strains by arguing that the plastic work must be an important 
factor. The total amount of plastic work done per unit volume at the fracture point 
can be formed from 


Jo* o'de = C ( 1 ) 

It appears that the elongation value is too complex to be regarded as a funda- 
mental property of a material, and it seems reasonable to assume that any criterion 
of fracture will be based on some combination of stress and strain rather than on 
either of these quantities separately. When the accumulated damage as given by the 
integral in Eq. 1 reaches a critical value, C, failure will occur. Most of the 
commercial finite element programs are able to predict the damage parameter 
through the calculation of stress and strain fields, thereby the energy-based damage 
parameters. 

Figure 2 illustrates predominantly three different kinds of surface defects gen- 
erated due to tensile stress cracking in the worked steel, aluminum, and a metal 
matrix composite samples: barrelling during uniaxial compression loading causes 
circumferential hoop tensile stresses which may lead to the cracking. Figure 2a 
depicts a low-ductility steel sample with several circumferential cracks across the 
periphery due to high compressive strain of 95 % in hot compression. An aluminum 
alloy sample in Fig. 2b was compressed to 1 .5 strain and a defect led to the cracking 
at its location on the circumference; elsewhere it has good surface finish. Figure 2c 
shows severe transverse surface cracking during the extrusion of a metal matrix 
composite due to fracture when the stress reached a critical value. 

Another predominant failure in the billet compression of engineering alloys is 
the formation of shear cracks due to strain-induced material softening. Figure 3 
shows such shear failures in the compression of aluminum alloys AA 7108 pro- 
duced through extrusion process. The presence of friction between the die and 
workpieces produces nonhomogeneous deformation and barrelling of the sample 
leading to a deformation state in which a shear cross forms. The largest 
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Fig. 4 Forming Limits as 
defined in the upset forging 
relating the hoop strain to 
axial compressive strain 



Compressive strain, e 2 


deformations will appear in the shear cross. When the cylinder has been com- 
pressed down to a certain stage, the initial strain-hardening part of the flow curve 
has been exceeded for the material in the shear layer. With continued compression, 
the shear layer will then experience strain-induced softening. Because of strain 
softening, this layer will then easily become overstrained, and a shear fracture will 
form here. It is also to be noted that barrelling during upset forging causes hoop 
tensile stresses that may lead to cracking. Kuhn (1978) plotted the hoop strain (eO at 
fracture as a function of the applied uniaxial compressive strain (e 2 ) as shown in 
Fig. 4 for 1045 steel. By means of loading punch lubrication with the sample and 
varying the height to diameter ratio of the test specimens, a number of points are 
obtained. The data points fit to a linear line as 

6l/ = C-( l/2)e 2/ (2) 

where C is the value of e \f for plane strain. This line parallels £i = —0.5 s 2 for 
homogeneous compression. 

In cylinder compression, the shear cross appears three-dimensional, with a 
complex shape. Even though the shear fracture is initiated somewhere along the 
plane of the shear cross, propagation of the fracture during further growth most 
commonly follows a plane path, so that a plane fracture surface is obtained. Upon 
cracking, a bit of the material may be detached from the rest of the cylinder. 


Formability in Sheet Metal Forming 

Sheet metal forming is an important manufacturing process in automotive, ship- 
ping, and aerospace industries. Knowledge of the formability of sheet metal is 
critical to the success of sheet metal stamping process. The ability to form sheet 
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metals into desired shapes is often limited by the occurrence of material instability 
leading to localized necking. For a stretched sheet metal, the necking could be 
either diffused or localized. Diffuse necking of sheet metal involves contraction in 
both the lateral and width directions. In sheet specimens, localized necking occurs 
after diffuse necking. During local necking, the specimen thins without further 
width contraction. 

The forming limit of a sheet metal is defined to be the state at which a localized 
thinning of sheet initiates when it is formed into a product shape in a stamping 
process. Formability of sheet metals is at present characterized by the forming limit 
diagram (FLD) introduced in the 1960s by Keeler and Backofen (1964). The 
forming limit is conventionally described as a curve in a plot of major principal 
strain vs. minor principal strain. It must cover as much as possible the strain domain 
which occurs in industrial sheet metal-forming processes. The curves are 
established by experiments that provide pair of the values of the limiting major 
and minor principal strains obtained for various loading patterns (equi-biaxial, 
biaxial, plane strain, and uniaxial). Hecker (1975) developed an experimental 
method to determine the forming limit curves. A widely used technique is to print 
or etch a grid of small circles of diameters on the sheet before deformation. The 
principal strains can be found by measuring the major and minor diameters after 
straining. These values at the neck or fracture give the failure condition, while the 
strains away from the failure indicate safe condition. As the experimental measure- 
ment of these strains is time consuming and an expensive process, it would be 
useful if the forming limit strains can be predicted using theoretical models. 

Researchers employed two main strategies to predict forming limits: firstly, a 
group of models that represent strain instabilities as a bifurcated state in an initial 
homogeneous material and, secondly, models where the strain instability appears in 
the deformation process due to an imperfection already present in the material. 

Forming Limit Diagram 

Hill (1952) developed the theory for localized necking in sheet metals assuming 
that localization band develops along the zero extension direction in a sheet metal. 
This analysis predicted that localized necking would not occur in a sheet subjected 
to positive biaxial stretching, for which no zero extension direction exists. There- 
fore, Hills criterion is only applicable to left-hand region of FLD. 

Swift (1952) developed a diffuse necking theory for biaxially stretched sheets 
introducing the concept of the maximum in-plane force condition in the necking 
and localization prediction. According to him, neck initiates when the total differ- 
entials of force become zero or negative at the same instant of deformation. 
However, the Swift diffusive necking criterion is often too conservative and it 
underestimates the experimentally observed forming limit strains. In order to 
improve the above model, Hora et al. (1996) considered the experimentally 
observed fact that the onset of necking depends significantly on the strain ratio 
and proposed a model called Modified Maximum Force Criterion (MMFC). Aretz 
(2004) addressed an important singularity in Horn’s MMFC model. It was observed 
that MMFC fails if the yield locus exhibits straight line segments often observed in 
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commercial alloys like Al which are properly described by non-quadratic yield 
functions. Brunet and Morestin (2001) coupled Horn’s model with Gurson’s (1977) 
porous plasticity model and obtained good predictions for forming limits. 

Marciniak and Kuczynski (1967) considered sheet metals subjected to in-plane 
biaxial loading and proposed a model taking into account that sheet metals are 
nonhomogeneous from both geometric and structural points of view. The RHS of 
the FLD for a material was determined by introducing a preexisting material 
imperfection that lies perpendicular to the major stress axis to explain the devel- 
opment of localized necking during biaxial loading. The imperfection assumed in 
M-K model need not be perpendicular to the major axis; here the above model was 
extended to the LHS region of FLDs by Hutchinson and Neale (1978) by consid- 
ering imperfection at an angle to the major stress axis. The angle selected is the one 
that minimizes the limit strain. 

Storen and Rice (1975) proposed an alternative concept for localized necking, 
caused by the vertex developed on the subsequent yield surface. Recognizing that 
the shape of the yield surface could cause localization near the vertex, this model 
predicted a bifurcation corresponding to localized necking under biaxial tension. 
This method can predict the localized necking over the entire range of forming limit 
curve (FLC). However, it underestimates the limit for localized necking at the LHS 
of the FLC. Zhu et al. (2001) considered the moment equilibrium in addition to the 
force equilibrium adopted by Storen and Rice (1975) for the prediction of FLC over 
the entire region (LHS& RHS). They also found that the discontinuity of shear 
stress inside and outside of the localized band is zero. The modified method 
achieves a good prediction on the LHS of FLCs. Since this method depends on 
deformation theory of plasticity, it is valid under the proportional loading condition. 

Friedman and Pan (2000) investigated the influence of different yield functions 
on FLD predictions using the M-K model. Their results indicate the significance of 
the yield function used in the FLD analysis. Further, they introduced a parameter 
(angle) to characterize the influence of the yield locus’ shape on FLD. Yao and Cao 
(2002) predicted FLCs using M-K model taking into account the effects of 
pre-strains and kinematic hardening. The exponent of the yield function used in 
this work was assumed to decrease with increasing pre-strain. 

One major drawback of FLC models presented above till now is its dependence 
on strain path, as the straining path of the FLC in sheet metal stamping is not known 
with any certainty. However, both experimental and numerical results have indi- 
cated that FLDs are very sensitive to strain path changes (Ghosh and Laukonis 
1976; Graf and Hosford 1993). There is no single curve in strain space that 
represents the forming limit. Therefore, finding a single path-independent curve 
to characterize forming limits is of considerable interest. Knowing the drawback of 
the conventional FLDs, Arrieux et al. (1982) represented formability based on the 
state of stress rather than the state of strain. They constructed a Forming Limit 
Stress Diagram (FLSD) by plotting the calculated principal stresses at necking. 

Stoughton (2000) extended the original idea of Arrieux et al. (1982) and showed 
that the forming limit for both proportional and nonproportional loading can be 
explained from a single criterion which is based on the state of stress rather than the 
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Fig. 5 (a) Forming limit curve in strain space after pre-strain (Graf and Hosford 1993). 
(b) Transformed forming limit curve in stress space (Stoughton 2000) 

state of strain by mapping the strain values from different strain paths into a stress 
space assuming plane stress condition. But it is quite difficult to measure the state of 
stress experimentally as compared to the state of strain. Using FEM it is possible to 
estimate the state of stress with good accuracy, but a criterion has to be used to 
construct the Forming Limit Stress Curve (FLSC) in stress space. To validate this, 
experimental FLSC is possible only with mapping of experimental FLC in strain 
space to stress space (Fig. 5). 

Stoughton (2001) studied the influence of material model on the stress-based 
forming limit criterion. Stoughton and Zhu (2004) reviewed the theoretical strain- 
based models of Swift, Hill, and Storen and Rice and their relevance to the stress- 
based FLD using plane stress conditions. Smith et al. (2003) studied the influence of 
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eFLC gFLC xsflc 

Fig. 6 Transformation of FLC to extended SFLC (Simha et al. 2007) 

transverse normal stress on strain space forming limit by assuming that the stress 
space forming limit is relatively insensitive to transverse normal stress. This model 
predicts an increase in formability in strain space that varies nonlinearly with an 
increasing magnitude of compressive transverse normal stress ratio. Smith and 
Matin (2004) extended their previous model by assuming that the strain ratio is 
defined to be constant even with the influence of transverse normal stress, instead 
assuming that the stress space forming limit is relatively insensitive to transverse 
normal stress. This proposed model is independent of the type of yield function and 
it is fundamentally different than their original model. Further, the proposed model 
is much simpler than the original model. Stoughton and Yoon (2004) extended the 
original idea of Arrieux (1995) and proposed the concept of anisotropic forming 
limit curve. They proposed that the forming limit is no longer defined by a curve but 
requires the definition of a surface in strain or stress space, and therefore, it is no 
longer appropriate to view these limits with a convenience of two-dimensional 
diagrams. A solution to the challenge of assessing formability for a planar aniso- 
tropic material is proposed by rescaling the stresses by a factor; here the scaled 
stresses have the same relationship to a single forming limit curve in a 2D plot in 
stress space, as the actual stresses have to the true anisotropic forming limit in 3D 
space. 

Simha et al. (2007) proposed an Extended Stress-Based Limit Curve (XSFLC). 
The stress-based limit curve is then transformed into equivalent stress and mean 
stress space to obtain XSFLC. However, both FLD and stress-based forming limit 
diagram (SFLD) are measured and derived, respectively, for plane stress loading 
conditions. In some metal-forming processes, such as hydroforming and stretch 
flange forming, the onset of necking occurs under loading conditions that are not 
plane stress (Fig. 6). 

Hagbart et al. (Alsos et al. 2008) proposed a new analytical criterion for 
predicting FLD and SFLD known as BWH criterion by combining Hills localized 
necking theory for LHS of FLD and Bressan Williams (1983) shear instability 
criterion for RHS of FLD. 
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Bai and Wierzbicki (2008) proposed a new concept of “cumulative forming 
severity” for predicting neck formation in sheets under nonproportional loading. 
Fahrettin and Lee 2004; (Ozturk and Lee 2004) investigated the forming limits of 
sheet metal using ductile fracture criteria by performing the finite element simula- 
tion of an out-of-plane formability test. The predictions for LHS of the FLD are 
good in agreement with experimental results. However, the predictions were not 
successful for RHS of FLD. They have concluded that ductile fracture criteria 
should not be used directly to calculate forming limits. 

Due to advancements in Continuum Damage Mechanics (CDM), Chow and his 
co-workers (Chow and Yu 1997; Chow et al. 2002, 2003; Chow and Jie 2004) 
developed a unified damage approach for predicting forming limit diagrams. This 
theory is extended to predict FLD on damage coupled kinematic-isotropic harden- 
ing model under nonproportional loading. Later, the damage theory is coupled with 
the modified vertex theory to deduce a generalized localized necking criterion. 
Based on this necking criterion, the forming limit strains of sheet metals with 
material damage consideration are computed. 


Die and Mold Materials 

In this section, an overview of the die and mold materials used in the metalworking 
industry is presented. Initially, comprehensive lists of ferrous and nonferrous 
materials are provided. Subsequently, the application of these materials based on 
the forming processes is covered, and the guidelines for the die and mold material 
selection for different metal-forming applications are provided. 


Classification and Properties of Tool and Die Steels 

Steel used for die and mold applications covers a wide range of ferrous alloys. The 
most commonly used forms are wrought tool steels. Other steels used for metal- 
working applications include powder metallurgy (P/M) steels, medium-carbon 
alloy steels, and maraging steels (Davis 1995, 1990). 

Wrought Tool Steel 

For wrought tool steels, a list of the principal types of tool steels with their 
properties, processing, and service characteristics is provided in Table 1. 

High-speed steels are tool materials developed largely for high-speed cutting 
tool applications. There are two classifications of high-speed steels: molybdenum 
high-speed steels, also called group M (see Table 1), and tungsten high-speed steels, 
called group T. Groups M and T are equivalent in performance. The main advan- 
tage of group M is the lower initial cost. This is due to the lower atomic weight of 
molybdenum, which is about half that of Tungsten. Based on the weight percentage, 
only about half as much molybdenum as tungsten is required to provide the same 
atom ratio. Compositionally, group M contains molybdenum, tungsten, chromium, 


Table 1 General properties, processing, and service characteristics of tool steels (Cockroft and Latham 1968) 


244 


S. Idapalapati et al. 


Resistance to cracking a 

Molybdenum high-speed steels 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

in 

Tungsten high-speed steel 

On 

On 

in 

in 

Amount of distortion a 

co 

co 

co 

co 

co 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

• 

• 

o3 

C 

2 

o 

o3 

s 

m 

m 

m 

co 

in 

in 

m 

m 

in 

in 

in 

in 

m 

m 

in 

in 

in 


m 

in 

in 

in 

in 

Working hardness (HRC) 

63-65 

63-65 

63-66 

63-66 

63-66 

63-66 

63-65 

63-65 

63-65 

63-65 

63-65 

66-70 

66-70 

66-70 

66-70 

66-70 

66-70 

66-70 

66-70 

63-65 

63-66 

63-65 

63-65 

Toughness 21 

co 

co 

co 

co 

CO 

CO 

(N 



<N 










CO 

CO 

<N 


Wear resistance 21 

r- 

r- 

OC 

oc 

OC 

r- 

r- 

OC 

OC 

r- 

r- 

OC 

OC 

OC 

OC 

OC 

OC 

OC 

OC 

r- 

OC 

r- 

r- 

AISI designation 




M4 

M7 

M10 

M30 

M33 

M34 

M35 

M36 

M41 

M42 

M43 

M44 

M46 

M47 

M48 

M62 

H 

T2 

T4 

T5 


6 Materials in Metal Forming 


245 



IT) IT) f) If) If) 


o o o o o 

On G\ O'. 0\ O'. 0\ 



m m m mm ^ ^ m m m m m m m m m m r- r- m 


m m m mm r- r- m m mmmmmm m mmmm^H 


m 

m 

oo 


m 

^1" 



m 

m 

m 


m 


m 

^i" 

oo 

m 

m 

oo 


<N 


<N 

m 

<N 

o 

VO 

i 

vo 

i 

\o 

1 


vo 

i 

1 


m 

i 

m 

i 

m 

i 

m 

i 

m 

i 

m 

i 


m 

i 

m 

i 

i 

m 

i 

^t 

i 

m 

i 


\o 

1 


\o 

I 

\o 

i 

\o 

1 

VO 

1 

vo 

1 

m 

m 



i 

<N 



oo 

oo 

o 

o 

o 


o 

\o 

oo 

o 

m 

o 


m 


r- 

oo 



oo 

\o 

\o 

\o 



\o 


m 

m 

m 






m 

m 


m 

m 




m 

m 

m 

m 

m 




246 


S. Idapalapati et al. 



<D 

O 

c 

o3 


C/5 

5 

0) o o o o 

K -H T-H ^ 





£ 

<C in in r- in in in in in 




On Ov Ov Ov Ov cn On Ov Ov 


^H^H^Hin T) h ^ T) H 


o 

in h h m ho 



C/5 

C/5 

<D 

c 

o3 



t- 

\o 

<N 


^1" 


m 

vo 


<N 

<N 

m 



OO 

o 

o 

vo 

^t 

vn 

i 

m 

i 

VO 

I 

vo 

i 

vo 

i 

vo 

i 

VO 

i 

vo 

1 


vo 

1 

vo 

1 

vo 

i 

vo 

i 


vn 

i 

vo 

1 

vo 

1 

vn 

i 

VO 

i 

OO 

o 

vn 

OO 

OO 

OO 

OO 

OO 


r- 

r- 

OO 

OO 


o 

o 

o 

o 

OO 



m 

in 

in 

vn 

vn 

vn 


vn 

vn 

vn 

vn 


vn 

vn 

vn 

vn 

vn 





C/5 

C/5 

<U 

c 


bD 

O 

H 


oo co <N 


<N 


& 


(D 

O 

C 

o3 


C/5 

5 

<D 

Vh 

o3 

<D 


^■^■mococococc^ 


C 

o 


o3 

C 

bD 

T3 


00 

< 


OO On 

< < < 


(N cn in o 

Q Q Q Q Q 


CO CO CO CO 



^ n in 


i-H CN vo i> 

o o o o 


OO OO OO OO OO 



Tt- (S (S M CO 


— rxiin^or- 

oo oo oo oo oo 



6 Materials in Metal Forming 


247 


o 

r- o\ o\ o\ o\ ^ wo wo wo 


CO rH H rH H n m rH H O O O 


o wo 


o wo co wo wo 






<N 

<N 




^1" 



o 



m 

m 

m 

\o 

I 

\o 

I 


\o 

i 

i 

\o 

i 

\o 

i 

vo 

1 

*n 

1 

m 

i 


VO 

i 

VO 

i 

vo 

i 

*n 

*n 


oc 

oc 

oc 

oc 

oc 

o 

vo 


oc 

oc 

oc 




*n 

*n 

*n 

*n 

*n 

m 

m 


in 

in 

in 


'-O 0\ 0\ ON ON ON oo oo 


r- r- r- 
I I I 

CO CO CO 




248 


S. Idapalapati et al. 


vanadium, cobalt, and carbon as principal alloying elements, while group T con- 
tains tungsten, chromium, vanadium, cobalt, and carbon. 

Hot-work steels are the tool materials developed to withstand combinations of 
heat, pressure, and abrasion associated with punching, shearing, or forming metals 
at high temperatures. They are in group H and have medium carbon contents 
(0.35-0.45 wt%) and chromium, tungsten, molybdenum, and vanadium contents 
of 6-25 wt%. They can be divided into three subgroups: chromium hot- work steels 
(type H10 to HI 9), tungsten hot- work steels (type H21 to H26), and molybdenum 
hot-work steels (types H42 and H43). The chromium hot- work steels are the most 
commonly used ones for their good resistance to heat softening. The tungsten 
hot- work steels are better in heat resistance but worse in toughness. Molybdenum 
hot-work steels are similar to that of tungsten hot-work steels, but only with lower 
initial cost. 

Cold-work steels are restricted in forming applications that do not involve 
prolonged or repeated heating above 205-260 °C. There are three categories of 
cold-work steels: air-hardening steels (group A); high-carbon, high-chromium 
steels (group D); and oil-hardening steel (group O). Group A contains manganese, 
chromium, and molybdenum alloying elements to enable the steels to achieve full 
hardness in sections up to about 100 mm in diameter upon air cooling from the 
austenitizing temperature. Typical applications for these steels include shear knifes 
and forming and coining dies. The inherent dimensional stability of these steels 
makes them suitable for gages and precision measuring tools. Group D steels have 
from 1.5 to 2.35 wt% C and 12 wt% Cr and commonly 1 wt% Mo in their 
compositions. Typical applications for group D includes long-run dies for blanking, 
forming, thread rolling, and deep drawing. Group O steels have high carbon 
contents, plus sufficient other alloying elements in which full hardness can be 
obtained in small-to-moderate sections upon oil quenching from the austenitizing 
temperature. The most important service -related property of group O steels is high 
resistance to wear at normal temperatures, a result of high carbon content. On the 
other hand, group O steels have low resistance to softening at elevated tempera- 
tures. Group O steels are extensively used in dies and punches for blanking, 
trimming, drawing, flanging, and forming. 

In shock-resisting steels (Group S), the principal alloying elements are 
manganese, silicon, chromium, tungsten, and molybdenum in various combina- 
tions. The carbon content is about 0.5 wt% for all group S steels, which 
provides a combination of high strength, high toughness, and low-to-medium 
wear resistance. Group S steels are used primarily for chisels, rivet sets, punches, 
driver bits, and other applications requiring high toughness and resistance to shock 
loading. 

The low-alloy special-purpose steels (group L) contain small amounts of chro- 
mium, vanadium, nickel, and molybdenum. They are generally used for machine 
parts, such as arbors, cams, chucks, and collets, and their special applications 
require a combination of good strength, good toughness, and low price. 
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Table 2 List of the principal types of P/M tool steels and their compositions (Cockroft and 
Latham 1968) 



Constituent elements, wt% 

Hardness 

Trade name 

C 

Cr 

W 

Mo 

V 

Co 

S 

Others 

HRC 


Cold-work tool steels 


CPM 9V 

1.78 

5.25 


1.3 

9 


0.03 


53-55 

CPM 10V 

2.45 

5.25 


1.3 

9.75 


0.07 


60-62 

CPM 15V 

3.4 

5 


1.3 

14.5 




62-64 

CPM 440V 
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0.5 
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8 


1.5 
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59-63 

Vanadis 10 

2.9 

8 


1.5 

9.8 



1.0 Sn 
0.5 Mn 

60-62 


Hot-work tool steels 


CPM H13 

0.4 

5 


1.3 

1.05 




42-48 

CPM H19 

0.4 

4.25 

4.25 

0.4 

2.1 

4.25 



44 52 

CPM H19V 

0.8 

4.25 

4.25 

0.4 

4 

4.25 



44 56 


Mold steels, also called group P steels, contain chromium and nickel as principal 
alloying elements. Group P steels are used almost exclusively in low-temperature 
die-casting dies and in molds for the injection or compression molding of plastics. 

Water-hardening steels, also called group W steels, contain carbon as the 
principal alloying element. Small amounts of chromium are added to most of 
group W steels to increase their hardenability and wear resistance. Group W steels 
are very shallow hardening and usually have a hard case over a tough and resilient 
core. They are suitable for cold heading, striking, coining, and embossing tools. 

P/M Tool Steels 

Recently, powder metallurgy (P/M) becomes a major process for manufacturing 
high-performance tool steels products. The P/M process was used primarily for the 
production of advanced high-speed tool steels and now also being used in the 
manufacture of improved cold- work and hot- work tool steels. Table 2 provides a 
list of the principal types of P/M tool steels and their compositions. 

A number of improved, high-vanadium P/M tool steels designed for high-wear 
and cold- work applications are commercially available, which are listed in Table 2. 
The more uniform micro structure the P/M cold-work steels have, the better tough- 
ness they possess. This is crucial in cold-work tooling as it allows higher hardness 
to be achieved with associated improvements in yield strength and wear resistance. 
Furthermore, substantial improvements in wear resistance can be realized by using 
higher vanadium contents in P/M cold- work tool steels than in conventional cold- 
work tool steels. 

The no-segregation nature of P/M tool steels makes them very attractive for 
hot- work tool and die applications, because a common cause of premature failure of 
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large die-casting dies is thermal fatigue attributed to segregation. Powder metal- 
lurgy processing is an alternative method for producing segregation-free hot- work 
tool steels of both standard and improved compositions, and it offers near-net shape 
capability. The compositions of the three P/M hot-work tool steels now commer- 
cially available are given in Table 2. 

Medium-Carbon Low-Alloy Steel 

The medium-carbon low-alloy steels commonly used for metalworking applica- 
tions include AISI 4130, the higher- strength 4140, and the deeper hardening, 
higher- strength 4340 and 6150. These steels are not normally used as dies or 
other such demanding applications because of their low hardness values, but they 
are frequently used for auxiliary tooling, such as containers for hot-extrusion dies or 
holder blocks for molds for processing of plastics. Chemical compositions for these 
steels are given in Table 3. 

AISI 4130 is a water-hardening alloy steel of low-to-intermediate hardenability. 
It retains good tensile, fatigue, and impact properties up to about 370 °C. AISI 4140 
is similar in composition to 4130 except for a higher carbon content. It is used in 
applications requiring a combination of moderate hardenability and good strength 
and toughness, but in which service conditions are only moderately severe. AISI 
4340 is considered the standard to which other ultra- high- strength steels are com- 
pared. It combines deep hardenability with high ductility, toughness, and strength. 
It also has high fatigue and creep resistance. It is often used where severe service 
conditions exist and where high strength in heavy sections is required. It also 
exhibits good retention of strength. AISI 6150 is a tough, shock-resisting, shal- 
low-hardening chromium-vanadium steel with high fatigue and impact resistance in 
the heat-treated condition. 

Maraging Steels 

Maraging steels comprise a special class of high-strength steels. These steels 
differ from conventional steels in that instead of being hardened by a metallur- 
gical reaction involving carbon, they are strengthened by the precipitation of 
intermetallic compounds at temperatures of about 480 °C. Commercial maraging 
steels are designed to provide specific level of yield strength from 1,030 to 2,420 
MPa. These steels typically have very high nickel, cobalt, and molybdenum 
contents and very low carbon contents. Such characters make the maraging steels 
relatively soft after annealing. During age hardening, there are only very slight 
dimensional changes. Therefore, fairly intricate shapes can be machined in the 
soft condition and then hardened with a minimum of distortion and good 
weldability and fracture toughness. This makes maraging steels unique in many 
demanding applications, including aircraft and aerospace components and 
tooling components such as die-casting dies, plastic molds of intricate design, 
and casings for cold-extrusion tools. Table 4 lists the chemical compositions of 
the more common grades of maraging steels. 
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Table 4 Nominal compositions of commercial maraging steels (Cockroft and Latham 1968) 


Composition in wt%, all grades contain no more than 0.03 % C 


Grade 

Ni 

Mo 

Co 

Ti 

Al 

Nb 

Standard grades 

18Ni (200) 

18 

3.3 

8.5 

0.2 

0.1 


18Ni (250) 

18 

5 

8.5 

0.4 

0.1 


18Ni (300) 

18 

5 

9 

0.7 

0.1 


18Ni (350) 

18 

4.2 

12.5 

1.6 

0.1 


18Ni (Cast) 

17 

4.6 

10 

0.3 

0.1 


12-5-3(180) 

12 

3 


0.2 

0.3 



Cobalt-free and low-cobalt-bearing grades 


Cobalt-free 1 8Ni (200) 

18.5 

3 


0.7 

0.1 


Cobalt-free 18Ni (250) 

18.5 

3 


1.4 

0.1 


Low-cobalt 18Ni (250) 

18.5 

2.6 

2 

1.2 

0.1 

0.1 

Cobalt-free 18Ni (300) 

18.5 

4 


1.85 

0.1 



Nonferrous Tool and Die Materials 

This section describes the nonferrous materials used for metalworking and plastic- 
forming applications, including cemented carbides (which are the most commonly 
used nonferrous tool and die material), steel-bonded carbides, ceramics, graphite, 
diamond, plastics, and nonferrous alloys. 

Cemented Carbides 

Cemented carbides are employed in metal-forming applications because of their 
combination of high compressive strength, good abrasion resistance, high elastic 
modulus, good impact and shock resistance, and ability to take and retain excellent 
surface finish. Typical applications in this category include drawing dies, hot and 
cold rolling of strips and bards, cold heading dies, forward and back extrusion 
punches, swaging hammers and mandrels, and can-body punches and dies. Table 5 
lists nominal composition and properties of representative WC grades and their 
applications. 

Steel-Bonded Carbides 

Steel-bonded carbides are powder metallurgy materials that are intermediate in 
wear resistance between tool steels and cemented carbides based on WC-Co. They 
consist of 25-45 vol.% TiC homogeneously dispersed in a steel matrix. They have 
the following advantages over the conventional tool steels: (1) machinable in the 
annealed condition with conventional cutting tools, (2) hardenable with conven- 
tional equipment without decarburization and without experiencing an undue 
change in size, and (3) wear well after hardening on tough applications, giving 
performance equivalent or superior to that of conventional cemented WC. Common 
grades of steel-bonded carbides include C, CM, CM-25, CHW-45, CHW-25, SK, 
CS-40, PK, and MS-5A. 
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Table 5 Nominal composition and properties of representative cemented carbides and their 
applications (Cockroft and Latham 1968) 


Binder content 
in wt% 

Grain 

size 

Hardness, 

HRA 

Typical application 

20-30 

Medium 

85 

Heavy blanking punches and dies, cold heading 
dies 

11-25 

Medium 
to coarse 

84 

Heading dies (severe impact), hot forming dies, 
swaging dies 

11-15 

Medium 

88 

Back extrusion punches, hot forming punches 

10-12 

Fine to 
medium 

89 

Back extrusion punches, blanking punches and 
dies for high shear strength steel 

6 

Fine 

92 

Powder compacting dies, Sendzimir rolls, strip 
flattening rolls, wire flattening rolls 

10-12 

Fine to 
medium 

90 

Extrusion dies (low impact), light blanking dies 

12-16 

Medium 

88 

Extrusion dies (medium impact), blanking dies, 
slitters 

10 Co with TiC 
and TaC 

Medium 

91 

Deep drawing dies (non-galling), tube sizing 
mandrels 


Ceramics 

Similar to cemented carbides, ceramics are used in metalworking applications due 
to their properties: combination of temperature resistance, corrosion resistance, 
hardness, chemical inertness, and wear resistance. Ceramics offer unique advan- 
tages for metalworking applications as they provide very high stiffness-to-weight 
ratios over a broad temperature range. The high hardness of structural ceramics can 
be used in applications where mechanical abrasion is expected. The ability to 
maintain mechanical strength and dimensional tolerances at high temperatures 
makes them also suitable for high-temperature use (i.e., for isothermal forging 
dies). Typical applications for ceramics include bearings and bushings, close- 
tolerance fittings, extrusion and forming dies, spindles, metalworking rolls, 
can-making tools, wire-drawing machine components, and coordinate-measuring 
machine structures. Commonly used monolithic ceramic materials and their prop- 
erties are given in Table 6. 

Graphite, Diamond, and Plastics 

Graphite is the most commonly used material for constructing the die assembly in 
hot presses for applying both thermal and mechanical energy to affect the densifi- 
cation of the metal and ceramic powders. Graphite is easily machined, is relatively 
inexpensive, has moderate room-temperature strength properties that increase with 
temperature, and has good creep resistance up to 2,500 °C. Another important 
metalworking application for graphite is its use as a mold material during gravity 
casting. 

Both natural and synthetic (poly crystalline and single -crystal) diamonds are 
widely used in the wire industry, especially for cold drawing of small-diameter 
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Table 6 Representative properties of ceramics used for metalworking applications (Cockroft and 
Latham 1968) 


Material 

Bulk 

density, 

g/cm 3 

Flexure 

strength 

(MPa) 

Fracture 
toughness 
(MPa a Jm) 

Elastic 

modulus 

(GPa) 

Hardness 

(GPa) 

90 % A1 2 0 3 

3.6 

338 

3-4 

276 

10 

94 % A1 2 0 3 

3.7 

352 

3-4 

296 

12 

96 % A1 2 0 3 

3.72 

358 

3-4 

303 

11 

99.5 % A1 2 0 3 

3.89 

379 

3-4 

372 

14 

99.9 % A1 2 0 3 

3.96 

552 

3-4 

386 

15 

Sintered SiC 

3.1 

550 

4 

400 

29 

Reaction- 
bonded SiC 

3.1 

462 

3-4 

393 

25 

Si 3 N 4 

3.31 

906 

6 

311 

15 

ZTA 

4. 1-4.3 

600-700 

5-8 

330-360 

15-16 

Mg-PSZ 

5. 7-5. 8 

600-700 

11-14 

210 

12 

Y-TZP 

6.1 

900-1200 

8-9 

210 

12 

A1 2 0 3 - SiC w 

3. 7-3. 9 

600-700 

5-8 

330-380 

15-16 

Si 3 N 4 - SiC w 

3. 2-3. 3 

800-1000 

6-8 

330-380 

15-16 


round wire. Both natural and synthetic diamonds can be used to produce nonfer- 
rous, ferrous, and high-temperature alloy wires. 

Plastics are used extensively for dies and molds, primarily for forming 
aluminum alloys, low-carbon steels, and stainless steels. The most widely used 
materials are polyesters, epoxies, and polyurethanes. Plastics are generally used for 
small-to-moderate production runs of simple or moderately shaped parts. 

Nonferrous Alloys 

Beryllium-copper alloys that nominally contain 0. 4-2.0 wt% of Be and small 
amount (<2.0 wt%) of nickel and/or cobalt are widely used as mold material for 
injection molding of plastics. They have also been used for plunger tips during die 
casting of aluminum alloys. 

Aluminum bronzes are used primarily for dies and mold to form materials when 
scratching, scoring, or galling cannot be tolerated. These materials are available as 
sand castings, centrifugal castings, and extruded shapes. 

Aluminum-based alloy of high-strength wrought 7000 series containing nomi- 
nally 6.0 wt% Zn, 2.4 wt% Mg, and 1 .6 wt% Cu is also used as molds for forming 
plastics. 

Zinc-based alloys are used extensively for punches, dies, and molds to form, 
draw, blank, and trim steel and aluminum alloy, plastics, and other materials. 
Applications are in the automotive and aircraft industries for producing prototypes 
and limited quantities of large parts. Zinc alloy provides a low-cost, fast method of 
making punches, dies, and molds having a dense, smooth working surface. 
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Table 7 Typical tool steels for backward extrusion of cups (Cockroft and Latham 1968) 



Total quantity of parts to be extruded 

Material to be extruded 

5,000 pcs 

50,000 pcs 

Punch materials 

Aluminum alloys 

A2 

A2, D2, M4 

Carbon steels <0.4 wt% C 

A2 

D2, M2, M4 

Carburizing grades of alloy steel 

A2 

M2, M4 

Die material 

Aluminum alloys 

W1 

W1 

Carbon steels <0.4 wt% C 

01, A2 

A2 

Carburizing grades of alloy steel 

01, A2 

A2 


KO material 


Aluminum alloys 

A2 

D2 

Carbon steels <0.4 wt% C 

A2 

A2, D2 


High-temperature alloys used for hot-die and isothermal forging applications 
include wrought and cast nickel-based superalloys and molybdenum-based alloys 
such as TZM. 


Selection of Material for Bulk Metal-Forming Processes 

In the following two subsections, we will focus on the selection of materials for 
different metalworking processes from the material lists provided in the previous 
two sections. Some of the most common examples are provided here as rough 
guidelines in sight of that such selection is highly dependent on the process details 
and workpiece geometry (Davis 1995; Semiantin 1988). 

Selection of Material for Extrusion 

For cold extrusion, typical tool steels for backward extrusion and forward extrusion 
of cups are listed in Tables 7 and 8. 

For hot extrusion, in which the process is carried out at elevated temperatures, 
the selection of tool steels is different from that of the cold extrusion. Typical tool 
materials for hot-extrusion forming are listed in Table 9. 

Selection of Material for Rolling 

In shear forming and flow forming, the materials selected for mandrels, based on 
quantity of production, are gray iron (10-100 pcs), alloy cast iron (100-250 pcs), 
4150/52100 steel (250-750 pcs), and tool steels, such as 06, A2, D2, and D4 
(750 pcs above). For the roller, the five most widely used materials in ascending 
order of wearability and cost are W2, 06, D2, D4, and cemented carbide. 

Material selection for rolling mill rollers is highly dependent on the application. 
The four primary classes of material for metalworking rollers are cast irons, cast 
steels, forged steels, and cemented carbides. 
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Table 8 Typical tool steels for forward extrusion of cups (Cockroft and Latham 1968) 



Total quantity of parts to be extruded 

Material to be extruded 

5,000 pcs 

50,000 pcs 

Punch materials 

Aluminum alloys 

A2 

D2, M4 

Carbon steels 1010 

A2 

M2, M4 

Carbon steels 1020 and 1040 

A2 

M2 


Die material 


Aluminum alloys 

W1 

A2, D2 

Carbon steels <0.4 wt% C 

A2 

A2 


Table 9 Typical materials and hardness for tools used in hot extrusion (Cockroft and Latham 

1968) 



Material to be hot extruded 

Tooling application 

Aluminum and 
magnesium 

Copper and brass 

Steel 

Dies for shapes and 
tubing 

Hll, H12, H13 

Hll, H12, H13, H14, 
H19, H21 

H13, cast H21 insert 

Dummy blocks, 
bolsters, and die rings 

Hll, H12, H13 

Hll, H12, H13, H14, 
H19, Inconel 718 

Hll, H12, H13, H14, 
H19, H21, Inconel 718 

Mandrels 

Hll, H13 

Hll, H13 

Hll, H13 

Mandrel tips and 
inserts 

Tl, M2 

Inconel 718 

Hll, H12, H13, H14, 
H19, H21 

Liners 

Hll, H13 

A-286, V-57 

Hll, H12, H13 

Rams 

Hll, H13 

Hll, H12, H13 

Hll, H12, H13 

Containers 

4140,4150, 

4340 

4140, 4150, 4340 

H13 


For thread rolling, the materials for the dies are quite limited. Table 10 lists 
recommended thread-rolling die materials and hardness in service. 

Selection of Material for Forging (Cold/Hot/lsothermal) 

The recommended materials for cold forging dies include tungsten carbide and M2 
and M4 high-speed tool steels. Die inserts are usually fabricated from D2, M2, 
and M4. For cold heading, shallow-hardening tool steels such as W1 or W2 
are extensively applied for punches, open dies, and the solid dies made without 
inserts. Inserts are commonly made from higher-alloy tool steels, such as D2 or M2, 
or from tungsten carbide having a relatively high percentage of cobalt. Shock- 
resistant tool steel such as SI is also used for the cold heading of tools. 
For cold ironing and nosing, D3, M3 class 2, and W2 tool steels are commonly 
used for die and punch. 

There are several variables to be considered when selecting materials for 
hot-forging process: load, temperature, number of parts, and workpiece tolerance, 
just to name a few. Their influence on the material selection is still empirical and 



Materials in Metal Forming 



Table 10 Recommended thread-rolling die materials (Cockroft and Latham 1968) 



Recommended hardness, HRC 

Die material 

Flat die 

Circular die 


For threading aluminum, copper or soft steel blanks 


A2 

57-60 

56-58 

D2 

60-62 

58-60 

M2 

58-60 

58-60 


For threading ferritic steel (hardness >95 HRB) or austenitic stainless steel blanks 


A2 

57-59 

56-58 

D2 

59-61 

58-60 

M2 

59-61 

58-60 


Table 11 Nominal compositions of the proprietary low-alloy tool steels (Cockroft and Latham 

1968) 


Steel designation 

Composition in wt% 

C 

Mn 

Si 

Cr 

Ni 

Mo 

V 

6G 

0.55 

0.8 

0.25 

1 


0.45 

0.1 

6F2 

0.55 

0.75 

0.25 

1 

1 

0.3 

0.1 

6F3 

0.55 

0.6 

0.85 

1 

1.8 

0.75 

0.1 

6F4 

0.2 

0.7 

0.25 


3 

3.35 


6F5 

0.55 

1 

1 

0.5 

2.7 

0.5 

0.1 

6F6 

0.5 


1.5 

1.5 


0.2 


6F7 

0.4 

0.35 


1.5 

4.25 

0.75 


6H1 

0.55 



4 


0.45 

0.85 

6H2 

0.55 

0.4 

1.1 

5 

1.5 

1.5 

1 


qualitative. For this work, we only provide some of the proprietary low-alloy tool 
steels used in hot forging (Table 11) together with die steel properties ratings 
(Table 12) and subsequently by matching the die steels with the process details to 
facilitate hot-forging die material selection (Table 13). 

There is another class of forging worth mentioning before we leave this section: 
isothermal and hot-die forging. These are special categories of forging processes in 
which the die temperature are significantly higher than those used in conventional 
hot-forging process. The die materials for isothermal and hot-die forging include 
(1) superalloys, such as Waspaloy, Udimet 700, Astroloy, Inconel 718, and 
Unitemp AF2-1DA, and (2) molybdenum alloys, such as TZM, TZC, and MHC. 

Selection of Material for Drawing 

In here by drawing process we refer to the drawing of wire, rod, and tubing. In these 
processes, the cross-sectional area and/or the shape of a wire, rod, or tube is reduced 
by pulling it through a die. The recommended materials for wire drawing dies are in 
Table 14 and the recommended tool materials for drawing bars, tubing, and 
complex shapes are in Table 15. 
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Table 12 Die steel rating (Cockroft and Latham 1968) 


Increasing resistance 

A 

Resistance to 

Wear 

Thermal softening 

Catastrophic fracture 
(toughness) 


H26 

H26 

6 F2, 6 F3, 6 F5, 6 F7 

H23 

H24 


H24, A2 

H23 

H12 

H19, H14 

H19 

Hll 

H21 

H21 

H10, H13, 6 F4 

H10, H12, H13 

H14 

A9, 6H2, 6G 

A9, 6H2, A8 

H10 

H14 

HI 1, 6 F4 

6F4 

6H1, A8, H19, A2 

6H1, 6 F5, 
6H7, 6 F3 

Hll, H12, H13, A8 

H21, H23, A6 

6 F2, 6G 

A9 

H24 


6 F3, 6 F5, 6 F7, 6H1, 
6 F2, 6G 

H26 


Table 13 Die steels for forging various alloys (Cockroft and Latham 1968) 


Material to be 
forged 

Hammer forging, die steel/hardness, 
HRC 

Press forging die steel/ 
hardness, HRC 

Carbon steel 

6G, 6 F2/37-46 (die blocks), 6 F3, 
HI 2/40-48 (inserts) 

6 F3, HI 2/40-46 (die blocks), 
HI 2/42-46 (inserts) 

Alloy and 
stainless steel 

6 F3, 6 F2/37-46, HI 1, H12, H13/40-47 
(die blocks), Hll, H12, H13, 
H26/40-47 (inserts) 

Hll, H12, H13/47-55 (die 
blocks and inserts) 

Aluminum alloys 

6G, 6 F2/32-40 (die blocks), Hll, 
HI 2/44 — 4 8 (inserts) 

6G, 6 F2, 6 F3/37^14, 
HI 2/47-50 (die blocks), 
HI 2/46-48 (inserts) 

Titanium alloys 

6G, 6 F2/37-40 (die blocks), Hll, 
HI 2/44-5 2 (inserts) 

6G, 6 F2/37-40 (die blocks), 
Hll, HI 2/47-5 5 (inserts) 

Heat-resistant 
alloys, nickel- 
based alloys 

Hll, H12, HI 3/47-50 (die blocks and 
inserts) 

Hll, H13, H26/50-56, Inconel 
713C, Rene 41 (die blocks and 
inserts) 


Selection of Material for Coining and Cold Heading 

Coining is a cold closed-die forging process for embossing the contours of a die 
onto a prepared metal blank. In coining, the surface of the workpiece copies the 
surface detail in the dies with dimensional accuracy that is seldom obtained by any 
other process. That also posts great restriction on the selection for the die materials. 
Tables 16 and 17 list the typical materials for die used to coin small emblems and 
coin preformed cup to final size. 
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Table 14 Recommended materials for wire-drawing dies (Cockroft and Latham 1968) 


Metal to be drawn 

Round 
wire size 
(mm) 

Recommended die 
material for round 
wire 

Recommended die 
material for special 
shapes 

Carbon and alloy steels 

<0.15 

Diamond 

CPV 10 V, M2, 
cemented tungsten 
carbide 

>0.15 

Cemented tungsten 
carbide 

Stainless steels, titanium, 
tungsten, molybdenum, and 
nickel alloys 

<0.81 

Diamond 

CPV 10 V, M2, 
cemented tungsten 
carbide 

>0.81 

Cemented tungsten 
carbide 

Copper 

<1.6 

Diamond 

CPV 10 V, D2, 
cemented tungsten 
carbide 

>1.6 

Cemented tungsten 
carbide 

Copper alloys and aluminum 
alloys 

<0.81 

Diamond 

CPV 10 V, D2, 
cemented tungsten 
carbide 

>0.81 

Cemented tungsten 
carbide 

Magnesium alloys 

<0.57 

Diamond 


>0.57 

Cemented tungsten 
carbide 


Table 15 Recommended materials for drawing bars, tubing and complex shapes (Cockroft and 
Latham 1968) 


Metal to be drawn 

Common commercial 
sizes 

Maximum 
commercial 
size: die and 
mandrels 

Complex 
shapes: die 
and 

mandrels 

Bar and tube 
dies 

Tube 

mandrels 

Carbon and alloy steels, 
stainless steels, titanium, 
tungsten, molybdenum, and 
nickel alloys 

Tungsten 
carbide 
diamond or 
carbide 

W1 or 
carbide 

D2 or CPM 
10 V 

CPM 10 V 
or carbide 

D2 or 
carbide 

D2, M2, or 
CPM 10 V 

CPM 10 V 
or carbide 

Copper, aluminum, and 
magnesium alloys 

W1 or 
carbide 

W1 or 
carbide 

D2 or CPM 
10 V 

01, CPM 
10 V, or 
carbide 


Table 16 Typical materials for dies used to coin small emblems (Cockroft and Latham 1968) 


Tool material for striking a total quantity of 


Type of tool 

1000 

10,000 

100,000 

Machined dies for use on drop hammers 

W1 

W1 

01, A2 

Machined dies for use on presses 

01 

01, A2 

01, A2 

Hubbed dies for use on drop hammers 

W1 

W1 

W1 

Hubbed dies for use on presses 

01 

01, A2 

A2, D2 
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Table 17 Typical tool steels for coining a preformed cup to final size on a press (Cockroft and 
Latham 1968) 


Die material for total quantity of 


Metal to be coined 

1,000 

10,000 

100,000 

Aluminum and copper alloys 

W1 

W1 

D2 

Low-carbon steel 

W1 

01 

D2 

Stainless steel, heat-resisting alloys, and alloy steels 

01 

A2 

D2 


Table 18 Typical tool steels for coining a preformed cup to final size on a press (Cockroft and 
Latham 1968) 


Tool 

Material 

Cutoff quill (die) 

M4 or cemented carbide insert 

Cutoff knife 

M4 or cemented carbide insert 

Upset, cone, or spring punch 

W 1 , SI, Ml, or cemented carbide insert 

Cone-punch knockout pin 

M2 or CPM 10 V 

Backing plug 

01 

Finish-punch case 

H13 

Finish-punch insert 

Ml, M2, CPM 10 V, or cemented carbide 

Die case 

H13 

Die insert 

Ml, D2, M2, CPM 10 V, or cemented carbide 

Die knockout pin 

M2 or CPM 10 V 


Cold heading consists of forcing metal to flow fold into dies to form thicker 
sections and more intricate shapes. The materials commonly used in cold heading 
machines are listed in Table 18. 


Selection of Material for Sheet Metal-Forming Processes 
Selection of Material for Shearing 

Shearing is the method for cutting plates and flat sheets, coiled sheets, strips, and 
bars. The blade material is among the most important that needs to be selected from 
the tool materials list to enable robust processing. Table 19 presents the 
recommended blade materials for cold straight-blade shearing of flat metals. 
Table 20 presents the tool steels for shearing the nonmetallic materials. Table 21 
presents the blade materials for cold rotary shearing of flat materials. 

Selection of Material for Blanking and Piercing 

Blanking and piercing dies are used to blank, pierce, and shape metallic and 
nonmetallic sheet and plate in a stamping press. The primary measure of the die 
materials performance in blanking and piercing is the number of acceptable parts 
produced before failure occurs. Therefore, the selection of the tool materials for 
blanking and piercing is based on the part geometry and production quantity. 
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Table 19 Recommended materials for wire-drawing dies (Cockroft and Latham 1968) 


Metal to be sheared 

6 mm or less 

6-13 mm 

13 mm or 
over 

Carbon and low-alloy steels up to 
0.35 wt% C 

D2, A2, CPM 10 V 

A2, A9 

S2, S5, 
S6, S7 

Carbon and low-alloy steels, 0.35 
wt% C and over 

D2, A2, CPM 10 V 

A5, S5, S7 

S2, S5, 
S6, S7 

Stainless steels and heat-resisting 
alloys 

D2, A2, CPM 10 V 

A2, A9, S2 

S2, S5, 
S6, S7 

High- silicon electrical steels 

D2, T15, CPM 10 V, 
cemented carbide inserts 

S2, S5, S7 

S2, S5, 
S6, S7 

Copper and aluminum alloys 

D2, A2 

A2 

S2, S5, 
S6, S7 

Titanium alloys 

D2 




Table 20 Recommended tool steels for cold shearing of nonmetallic materials (Cockroft and 
Latham 1968) 


Material 

Material thickness 

Shear blade steel 

Blade hardness, HRC 

Paper 

All thicknesses 

A2 

56-60 



D2 

58-62 



D6 

58-62 

Plastic 

All thicknesses 

A2 

56-60 



D2 

58-62 



02 

56-60 



M2 

58-64 


Table 21 Recommended blade materials for cold rotary shearing of flat metals (Cockroft and 
Latham 1968) 


Material 

Material thickness 

Shear blade 
steel 

Blade 

hardness, HRC 

Carbon, alloy, and 
stainless steels 

D2, CPM 10 V 

D2, A2, A9 

A9, S5, S6, S7 

High- silicon electrical 
steels 

D2, M2, CPM 10 V, cemented 
carbide insert 

D2 


Copper and aluminum 
alloys 

A2, D2, CPM 10 V 

A2, D2 

A9, S5, S6, S7 

Titanium alloys 

D2, A2, CPM 10 V 




Table 22 lists typical punch and die materials for blanking 1.3 mm sheet based on 
the part geometry A and B (Fig. 7). Table 23 provides typical punch and die 
materials for shaving 1.3 mm sheet. Table 24 shows typical materials for perforator 
punches. 
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Table 22 Typical punch and die materials for blanking 1.3 mm sheet (Cockroft and Latham 
1968) 


Tool material for production quantity of 


Work material 

1,000 

10,000 

100,000 

1,000,000 

10,000,000 

Part A or similar with OD ^75 mm 

Aluminum, copper, and 
magnesium alloys 

Zn, 01, A2 

01, A2 

01, A2 

D2, CPM 
10 V 

Carbide 

Carbon and alloy steel, up 
to 0.7 wt% C, and ferritic 
stainless steel 

01, A2 

01, A2 

01, A2 

D2, CPM 
10 V 

Carbide 

Stainless steel, austenitic, 
all temper 

01, A2 

01, A2 

A2, D2 

D4, CPM 
10 V 

Carbide 

Spring steel, hardened, 
52 HRC max 

A2 

A2, D2 

D2 

D4, CPM 
10 V 

Carbide 

Electrical sheet, transformer 
grade, 0.64 mm 

A2 

A2, D2 

A2, D2 

D4, CPM 
10 V 

Carbide 

Paper, gasket, and similar 
soft materials 

W1 

W1 

Wl, A2 

Wl, A2 

D2, CPM 
10 V 

Plastic sheet, not reinforced 

01 

01 

01, A2 

D2, CPM 
10 V 

Carbide 

Plastic sheet, reinforced 

01, A2 

A2 

A2 

D2, CPM 
10 V 

Carbide 


Part A or similar with OD ^305 mm 


Aluminum, copper, and 
magnesium alloys 

Zn, 4140 

4140, A2 

A2 

A2, D2, 

CPM 
10 V 

Carbide 

Carbon and alloy steel, up 
to 0.7 wt% C, and ferritic 
stainless steel 

4140, A2 

4140, A2 

A2 

A2, D2, 

CPM 
10 V 

Carbide 

Stainless steel, austenitic, 
all temper 

A2 

A2, D2 

D2 

D2, D4, 

CPM 
10 V 

Carbide 

Spring steel, hardened, 
52 HRC max 

A2 

A2, D2 

D2 

D2, D4, 

CPM 
10 V 

Carbide 

Electrical sheet, transformer 
grade, 0.64 mm 

A2 

A2, D2 

A2, D2 

D2, D4, 

CPM 
10 V 

Carbide 

Paper, gasket, and similar 
soft materials 

4140 

4140 

A2 

A2 

D2, CPM 
10 V 

Plastic sheet, not reinforced 

4140 

4140, A2 

A2 

D2, CPM 
10 V 

Carbide 

Plastic sheet, reinforced 

A2 

A2 

D2 

D2, CPM 
10 V 

Carbide 


Part B or similar with OD ^75 mm 


Aluminum, copper, and 

01, A2 

01, A2 

A2 

A2, D2, 

Carbide 

magnesium alloys 




CPM 






10 V 



(< continued ) 


6 Materials in Metal Forming 


263 


Table 22 (continued) 



Tool material for production quantity of 

Work material 

1,000 

10,000 

100,000 

1,000,000 

Carbon and alloy steel, up 
to 0.7 wt% C, and ferritic 
stainless steel 

01, A2 

01, A2 

A2 

A2, D2, 

CPM 
10 V 

Stainless steel, austenitic, 
all temper 

A2 

A2 

D2 

D2, D4, 

CPM 
10 V 

Spring steel, hardened, 
52 HRC max 

A2 

A2 

D2 

D2, D4, 

CPM 
10 V 

Electrical sheet, transformer 
grade, 0.64 mm 

A2 

A2 

A2, D2 

D2, D4, 

CPM 
10 V 

Paper, gasket, and similar 
soft materials 

W1 

W1 

A2 

Wl, A2 

Plastic sheet, not reinforced 

01 

01 

A2 

A2, D2, 

CPM 
10 V 

Plastic sheet, reinforced 

01 

01 

D2 

D2, CPM 
10 V 

Part B or similar with OD ~ 

305 mm 




Aluminum, copper, and 
magnesium alloys 

A2 

A2 

A2, D2 

A2, D2, 

CPM 
10 V 

Carbon and alloy steel, up 
to 0.7 wt% C, and ferritic 
stainless steel 

A2 

A2 

A2, D2 

A2, D2, 

CPM 
10 V 

Stainless steel, austenitic, 
up to quarter hard 

A2 

A2 

A2, D2 

D2, D4, 

CPM 
10 V 

Stainless steel, austenitic, 
over quarter hard 

A2 

D2 

D2 

D2, D4, 

CPM 
10 V 

Spring steel, hardened, 
52 HRC max 

A2 

A2, D2 

D2 

D2, D4, 

CPM 
10 V 

Electrical sheet, transformer 
grade, 0.64 mm 

A2 

A2, D2 

D2 

D2, D4, 

CPM 
10 V 

Paper, gasket, and similar 
soft materials 

W1 

W1 

W1 

WCA2 

Plastic sheet, not reinforced 

A2 

A2 

A2 

A2, D2, 

CPM 
10 V 

Plastic sheet, reinforced 

A2 

A2 

D2 

D2, CPM 
10 V 


10,000,000 

Carbide 

Carbide 

Carbide 

Carbide 

D2, CPM 
10 V 

Carbide 

Carbide 

Carbide 

Carbide 

Carbide 

Carbide 

Carbide 

Carbide 

D2, CPM 
10 V 

Carbide 

Carbide 
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Fig. 7 Typical parts of 
varying severity that are 
commonly produced by 
blanking and piercing 


OD 

< > 



Part A 


OD 

< > 



Part B 


Table 23 Typical punch and die materials for shaving 1 .3 mm sheet (Cockroft and Latham 1968) 


Tool material for production quantity of 


Work material 

1,000 

10,000 

100,000 

1,000,000 

Aluminum, copper, and magnesium alloys 

01 

A2 

A2 

D4, CPM10V 

Carbon and alloy steel up to 0.3 wt% C and 
ferritic stainless steel 

A2 

A2 

D2 

D4, CPM10V 

Carbon and alloy steel 0.3 wt%-0.7 wt% C 

A2 

D2 

D2 

D4, CPM10V 

Stainless steel, austenitic, up to quarter hard 

A2 

D2 

D4 

D4, CPM10V 

Stainless steel, austenitic, over quarter hard and 
spring steel hardened to 52 HRC max 

A2 

D2 

D4 

M2, 

CPM10V 


Table 24 Typical materials for perforator punches (Cockroft and Latham 1968) 



Tool material for production 
quantity of 

Work material 

10,000 

100,000 

1,000,000 


Punch diameters up to 6.4 mm 


Aluminum, brass, carbon steel, paper, and plastics 

M2 

M2, CPM 
10 V 

M2, CPM 
10 V 

Spring steel, stainless steel, electrical sheet, and 
reinforced plastics 

M2 

M2, CPM 
10 V 

M2, CPM 
10 V 


Punch diameters over 6.4 mm 


Aluminum, brass, carbon steel, paper, and plastics 

W1 

W1 

D2, CPM 
10 V 

Spring steel, stainless steel, electrical sheet, and 
reinforced plastics 

M2 

M2, CPM 
10 V 

M2, CPM 
10 V 
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Fig. 8 An example press 
formed parts with mild 
severity of draw 



Selection of Material for Press Forming 

Press forming is a process in which the sheet metal is made to conform to the 
contours of a die and punch, largely by bending or moderate stretching, or both, by 
means of mechanical or hydraulic presses. In this case, the tool material is selected 
based on the standard part geometry shown in Fig. 8. Tables 25 and 26 show typical 
lower-die materials for forming the standard part in Fig. 8. 

Selection of Material for Deep Drawing 

Deep drawing of metal sheet is applied to form containers by a process in which a 
flat blank is constrained while the central portion of the sheet is pressed into a die 
opening to draw the metal into the desired shape without folding of the comers. The 
process is designed to form circular shapes, such as cooking pans, box shapes, or 
shell-like containers. Typical punch and die materials for various operations in deep 
drawing are listed in Tables 27, 28, and 29. 


Lubricants for Metal Forming 

Friction and wear are the main challenges of metal-forming processes. 
Metal-forming operations are diverse and can range from a heavy duty sheet 
metal stamping that weighs 1,100 t to fine microforming in electronics industries 
that ranges in milligrams. Accordingly, the lubricants are applied to keep the 
surface of the forming metal and work piece separated by a film of lubricant. 
This reduces the friction force due to shearing of the lubricant and thereby 
reduces the energy wasted and enhances the surface finish of the product. 
Additional challenges for lubrication exist in cooling the increased temperature 
and providing corrosion resistance. This applies to all metal-forming processes 
such as wire drawing, hydrostatic, extrusion, and deep drawing, specifically the 
rolling process. 
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Type 300 stainless, None None Yes Mild and Mild and Mild and 4140 steel A2, D2 D2 

to quarter hard Ml 40 steel 4140 steel 

Low-carbon steel Best ±0.1 Yes Mild and Mild and Mild and 4140 steel A2, D2, nitrided D2, nitrided D2 

M-140 steel 4140 steel D2 

High-strength Best ±0.1 No Mild and Mild and 4140, mild steel chromium plated D2, nitrided D2 
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Type 300 stainless, None None Yes Epoxy-metal, Epoxy-glass, Epoxy or polyester- A2 D2 

to quarter hard Polyester-metal, polyester-glass, glass, alloy cast 

zinc alloy zinc alloy iron 

Low-carbon steel Best ±0.1 Yes Zinc alloy Epoxy-glass, Alloy cast iron D2, Nitrided 

polyester-glass, nitrided A2 


6 Materials in Metal Forming 


269 


T3 

0> 

•tn <N 

£ Q 


T3 

rv 'G 


<N .X 

Q £ 


(N 

Q 


T3 

T3 


•Th <N 

£ Q 


^3 

cj 

T3 


X <N 

£ Q 


T3 

T3 


O 




T3 
< D 
T3 


<N X 

Q c 


<N 


’G 

.tn <N 

£ Q 


^3 

<D 

’G 

.tn <N 

£ Q 


T3 

O 

T3 


o 


< 


o 


< 


o 


< 


o 


< 


< 


bXj 

I 

<D 


c/5 

o 

CU 


o 


o 


N 


o 


N 


o 


N 


N 


o 


o 


o 


o 


N 


N 


N 


N 


N 


o 

Z 


C/5 

C/5 

C/5 


<D 

<L> 

CJ 

o 




Z 


-H 


-H 


-H 


HH 


-H 


C/5 

<D 

PQ 


<u 

C 

o 

Z 


C/5 

<D 

PQ 


C/5 

<D 

PQ 


"T3 

O 

O 

o 


xs 


bD 

C 

<D 


■ 

2 


CJ 

o 

o 


C/5 4Q 


8 fc 

CO 'S 

c/5 C3 

r s. s- 

3 h' 2 


C/5 fX 


>< U 

° s 

<u ^ 

Ph CT 

H 2 


o3 

<D 

QQ 


C/5 

o 


c3 

0 

1 

£ 

o 



270 


S. Idapalapati et al. 


Table 27 Typical materials for punches and blankholders (Cockroft and Latham 1968) 


Die component 

Tool material for production quantity of 

10,000 

100,000 

1,000,000 

For round steel cups with diameter -75 mm 

Punch 

Carburized 4140; W1 

Wl; carburized SI 

A2; D2 

Blankholder 

Wl; 01 

Wl; 01 

Wl; 01 

For square steel cups with edge -75 mm 

Punch 

Carburized 4140; Wl 

Wl; carburized SI 

A2; D2 

Blankholder 

Wl; 01 

Wl; 01 

Wl; 01 

For round steel cups with diameter -300 mm 

Punch 

Alloy cast iron 

01 

A2; D2 

Blankholder 

Alloy cast iron 

Alloy cast iron 

01; A2 


For square steel cups with edge -450 mm 


Punch 

Carburized 4140 

Wl; 01 

Nitrided A2; D2 

Blankholder 

Alloy cast iron 

Wl; 01 

01; A2 


Table 28 Typical tool steels for punches and dies to iron soft steel sheet at various reductions 
(Cockroft and Latham 1968) 



Tool material for production quantity of 

Ironing reduction, % 

1,000 

10,000 

100,000 

1,000,000 


Ironing punches 


Up to 25 

Wl 

01 

A2 

A2; carburized SI 

25-35 

Wl 

A2 

A2; carburized S 1 

D2 

35-50 

A2 

A2; carburized S 1 

D2 

D2 

Over 50 

D2 

D2 

D2 

D2 


Ironing dies 


Up to 25 

Wl 

01 

01 

D2 

25-35 

Wl 

01 

D2 

D2 

35-50 

01 

D2 

D2 

D2 

Over 50 

D2 

D2 

D2 

D2 


Table 29 Typical punch and die material for the reverse redrawing of steels (Cockroft and 
Latham 1968) 



Tool material for production quantity of 

Die component 

1,000 

10,000 

100,000 

1,000,000 


Small thick- wall cups 


Die and pressure ring 

01 

01 

A2 

D2 

Punch 

4140, 6150 

01, A2 

D2 

D3 


Medium and large thin-wall cups 


Die and pressure ring 

1018,4140 

4140, 01 

A2 

D2 

Punch 

Wl 

A2 

D2 

D2, D3 
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The provision of an effective lubrication is yet challenging to some specific 
processes such as cold forging and hot extrusion of steel until innovative lubrication 
methods such as phosphate-stearate lubricant coatings and molten glass film lubri- 
cants were devised. Additional contribution was from development of predictive 
subroutines of lubrication and friction. 

Corrosion protection can assume a good deal of importance, particularly after 
the metal-forming application has been completed. Completed parts may need to be 
kept in storage for up to 6 months. This means that a fluid may require a corrosion 
inhibitor to prevent corrosion under a variety of storage conditions. 


Friction Characterization 


A number of sophisticated models exist for frictional mechanics; nonetheless it can 
be expressed as a simplest mechanical concept based on an average behavior 
throughout the deformation zone. One useful concept involves the friction factor 
“m” which is the ratio of friction shear stress x and the workpiece shear strength x Q , 
under the prevailing conditions of strain rate and temperature. As cold welding of 
the workpiece with the forming surface takes place, p exceeds one. However 
sticking does not occur over the entire surface and p may range towards 0.5. 
Another practical factor is the friction coefficient “p” which is the ratio between 
shear stress and tool pressure “P.” In practical examples of metal forming, the 
average value of P ranges between two and five times of the blank’s shear strength. 
The friction coefficient is good to be in boundary lubrication range which varies 
between 0.1 and 0.15 range. Since x cannot exceed x Q , the p may reach a maximum 
of 0.5, and at other extreme under thick film lubrication condition, it may be as low 
as 0.01. 


One of the best methods to characterize is ring compression test for forging and 
upsetting (Male and Cockroft 1964), which is a ring of the workpiece material 
with outside diameter and inside diameter in the range of 6:3:1. The ring is 
compressed and the variations in the outside and inside diameter are measured 
from which p or m is quantified. For very low friction, both diameters expand, 
while for high friction, the inner diameter decreases. 

Maclellan (1952) and Wisterich (1955) described a split die wire and sheet 
drawing method to quantify axial and binder force to quantify shear stress and 
die pressure. 

Another method involves reliable mathematical model of the process that back 
calculates based on process geometry and driving force. This method needs 
accurate measure of flow stress at pertinent strain rate and temperature is 
important. Generally, under lack of data, plain material under room temperature 
at convenient strain rate of the actual workpiece by using published strain rate 
sensitivity exponent and temperature factors is used in simulation (Baker and 
Wright 1992). 
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Lubrication 

Under lubricated condition, the lubricant layer may or may not be able to fully 
separate the two surfaces. The regime in which the lubricant film is thick beyond the 
surface roughness is called the hydrodynamic lubrication (HL) regime, the region in 
which the entire load is carried by the interacting surface asperities is called 
boundary lubrication (BL) regime, and in between regimes is called the mixed 
lubrication (ML) regime (Fig. 9). 

A Stribeck diagram can demarcate the above regimes as a function of 
L-parameter that is defined by 


L _ rjv_ 

pR a 

where r\ is the dynamic viscosity of the lubricant, v is the velocity in the contact,/? is 
the nominal contact pressure, and R a is the roughness. Figure 10 shows a general- 
ized Stribeck diagram for a metal-forming application. 

The EHL regime is important for an effective cold-forming process such as deep 
drawing process, because it influences the appearance of the final product and the 
overall energy consumed due to friction (Wilson et al. 1995). For example, a deep 
drawing process typically operates in the BL regime or in the upper part of the ML 



Fig. 9 Different types of lubrication (i) Boundary Lubrication (ii) Mixed lubrication (iii) Elasto 
hydrodynamic lubrication, {a, b) subrate (c) boundary layer ( d ) lubricant 



Fig. 10 Generalized 
Stribeck diagram for metal 
forming operation 


Stribeck Factor (L) 
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regime (Ter Haar 1996). Plastic deformation of the sheet influences friction due to 
localized yield at the contacts. This applies true in the other way of sheet metal 
forming where controlling friction helps influencing the plastic deformation and 
thereby formability of the blank mainly (Seah and Lee 1988) and gives much 
benefit mainly due to its large area to thickness ratio without giving rise to galling 
failure of the tools. 


Forming Lubricants 


Lubricant in the global sense apart from reducing friction and wear should 
dissipate heat, protect surfaces, conduct electricity, keep out foreign particles, 
and remove wear particles. From friction point of view, the forming lubricants are 
to operate at mixed or (Elasto) hydrodynamic lubrication (EHL) by typical 
mineral (petroleum) or synthetic origin that are used as the base fluids along 
with viscosity regulating additives. Additional additives to form boundary layer 
help friction and wear control. However, the thermal stability of the boundary 
layer depends on the lubricant and the sheet surface reactivity. The boundary layer 
behavior depends on the mechanism of formation such as physical adsorption, 
chemical adsorption, and chemical reaction. Physical adsorption has clustered, 
long-chained hydrocarbons with a polar head. Examples include long-chained 
alcohols and fatty acids. Such systems have polar group that forms a high 
viscosity hydrocarbon layers. Fatty acids generally adhere to unreactive surfaces 
such as chromium-coated surfaces. In the presence of reactive surfaces, the 
chemisorption occurs which combines physical adsorption with chemical reaction 
with the surfaces to form a metal soap. For example, stearic acid (C 17 H 35 COOH) 
will chemically react with an iron surface to form C 17 H 3 5 COOFe and thereby 
adheres to the iron substrate. Thus, friction modifiers are additives that form such 
chemisorption-based boundary layers and can resist high temperature and are used 
to reduce wear and friction under moderate loads. Typical additives are fatty 
acids, long-chained fatty amides, and esters. Chemical reaction of lubricant with 
the substrate and blank involves complex compounds based on P, S, B, and Cl 
which are used to form metal salts, with high-temperature stability. The latter 
makes these additives suited for wear protection at severe loading conditions were 
extreme pressure (EP-additives) causes high contact temperatures. Especially B 
and Cl are suspected to harm the environment. For example, in sheet metal 
forming, compatibility with the mill and with the coating process is important. 
Cold-rolled steel needs corrosive protection during transport and storage, and 
hence preservation agent is used on the surface. Secondly, after forming, a 
corrosion protection agent is used. For both processes, the sheet surface is cleaned 
thoroughly and/or chemical reaction layers are avoided since it demands addi- 
tional unfriendly solvents to be used. Thus, adsorption is a preferred mechanism 
for BL formation. 

In the complexity of the lubricant, both in terms of costs (selection, purchase, 
application, and cleaning) and in terms of eco-toxicity of the waste, the choice of 
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lubricant becomes important in shop floors. Thus emerge new competing technol- 
ogies such as dry permanent lubricants and pre-coated sheet. 

Lubricating oils consist of base oil and additives which determine their perfor- 
mance characteristics. The base oil is responsible for the typical properties of the 
lubricant. The additives, however, determine its actual performance by influencing 
the base oil’s oxidation stability, anticorrosion properties, wear protection, emer- 
gency lubrication properties, wetting behavior, emulsibility, stick-slip behavior, 
and viscosity-temperature behavior. 

The advantages of lubricating oil as compared to grease are improved heat 
dissipation from the friction point and its excellent penetrating and wetting prop- 
erties. The main disadvantage is that a complex design is required to keep the oil at 
the friction point and prevent the danger of leakage. 


Additives for Lubricants 

Lubricant additives are chemical substances that may be either active or inert to 
evoke new properties or performance features to the formulated oil. They reduce 
deleterious effect such as oxidation or impart new performance such as viscosity 
index, pour point, and wear protection. Chemically active reacts with metal surface 
to form coating or reactive species can react with oxygen to counteract reactive 
species to sustain the lubricant’s activity. The important functionalities of the 
additives are controlling precipitation, oxidation, corrosion, viscosity, pour point 
to meet application needs. 


Testing Standards for Lubricants 

Tribo characteristics of mineral oils depend on the intended process parameters 
(temperature, load, speed, etc.) and should be tested for the following test methods 
(Table 30). 

Solid Lubricants 

Solid lubricants are to protect surfaces and provide lubrication which ranges in 
synthetic, metallic, and mineral powers, such as Teflon (PTFE), copper, graphite, 
and molybdenum disulfide (MoS 2 ). These operate under boundary lubrication 
regime and are more favored when liquid or cohesive lubrication is not favored 
(Table 31). 

Effect of Coating on Metal-Forming Dies 

The application of functional coating on metal-forming surface such as tools and 
dies are to decrease the coefficient of friction there by improving the die life, 
surface finish and reducing the power consumption for the metal forming operation. 
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Table 30 Tribo-technical testing standards for lubricating oils 


Parameters 

Test 

Description 

ISO-VG 

DIN 

51-519 

ISO-VG is the abbreviation of ISO viscosity grade, 
characterizing the classification of fluid lubricants in terms of 
viscosity 

Density 

DIN 

51-517 

The amount of lubricant required for certain friction point is 
generally indicated by volume 

Viscosity 

DIN 

51-561 

Indicates the resistance of a fluid substance to flowing 

Viscosity index 

(VI) 

DIN ISO 
2909 

Nondimensional figure characterizing the change of viscosity 
as a function of the temperature 

Viscosity- 

temperature 

relation 

DIN 

51-563 

Dependence of oil change in viscosity on the change in 
temperature 

Flash point 

DIN ISO 

2592 

Lowest temperature which an oil gives off sufficient vapor to 
form an ignitable mixture under standard conditions 

Pour point 

DIN ISO 
3016 

Lowest temperature at which an oil is still flowing when 
cooled down under standardized conditions 

Air entrainment 

DIN 

51-381 

TUV Impinger test. Air entrainment causes excessive wear 


Table 31 Friction reduction by solid lubricants (Ref: Robert Bruce) 


Solid lubricant 

Friction coefficient 

Max continuous temperature (°C) 

PTFE 

0.05-0.1 

250 

MoS2 

0. 1-0.2 

300 

WS2 

0. 1-0.6 

350 

Graphite 

0. 1-0.5 

500 


In general, harder coating has lower friction; however, studies of Hayward, 
Samandi, and Parker (Hayward) showed that the CrN-coated samples on draw 
beads used in deep draw forming had the lowest hardness and lowest friction. 

Coating if improperly adhered may be prone to powdering and flaking (Kato 
1993; Wenzloff et al. 1993; Mei and Morris 1993). Under high tensile strain 
coatings crack to accommodate the strain in the system (Wenzloff et al. 1993), 
and if the coating is not engineered to the substrate, the frictional force will cause 
the coating to crack and sometimes flake (Mei and Morris 1993). Such failures 
happen in car body panels such as Zn-Fe coating and could be used with the right 
design of draw beads in the process. 


Manufacturing of Raw Materials for Forming 

In this section, manufacturing of raw materials for forming and preforming pro- 
cesses is discussed. The raw materials for metal-forming processes are produced by 
metal casting. When the correct chemical composition of targeted alloy is obtained 
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in the melting furnace, the casting process takes place. Continuous casting for steels 
and ingot casting for other alloys are the most common method for producing raw 
material for metal-forming processes. Although the manufacturing path from the 
melt to the finished shape is direct route, it involves great difficulties. This is due to 
the need for simultaneous control of a number of processing steps including 
melting, alloying, molding, pouring, solidification, finishing, and so on. Each of 
these production steps needs to be correctly done to avoid failure in the acceptance 
of the final product. After production of the raw materials through casting, it 
requires to be homogenized in a high-temperature treatment. 


Casting Defects that Affect the Forming Processes 

Controlling the casting defects in the raw material for metal-forming process is of 
high importance as the casting defects have the direct effects on formability of 
materials. There is always the erroneous assumption about plastic deformation to 
eliminate the casting defects, but usually the strains involved in most working 
operations are too low to affect any significant welding of the faults and defects. 
Folding-in of a solid film from the surface of the liquid will form a defect which is 
called bi-film. Bi-films are in general merely pushed around and growing worse 
before getting better in deformation, whereas the segregation causes inhomoge- 
neous material properties and defects in metal-forming processes. 

During the working of cast material, for instance, by rolling, it is expected 
that the defects are elongated in the direction of working. The elongation of the 
defect results in its rotation, aligning the defect to the rolling direction. This is 
clearly seen in the elongation of graphite nodules during the rolling and forging of 
ductile iron. 

In addition, elongation of the workpiece increases the area of its defect. The 
newly extended surface in the bi-film will be oxidized by air entrapped in the defect. 
The entrapped air continues to oxides the new surfaces within the defect during 
deformation. Microscopic pores may be formed between the oxide and the metal 
crystal as a result of deformation of the defect, which means generation of more 
defects. During rolling, the continued oxidation of the expanding surface hinders 
the welding of the interface that is newly created and, by this means, assists the 
defects to grow as a crack, possibly to many times its initial size. 


Segregation During Metal Casting 

Segregation may be defined as any departure from uniform distribution of the 
chemical elements in the alloy. Segregation in cast metals can cause defects in 
metal-forming process. In this section different segregations in metal casting are 
briefly reviewed. Because of the way in which the solutes in alloys partition 
between the solid and the liquid during freezing, some level of segregation of the 
alloying element is present in all castings. 
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Fig. 11 Micro-segregation 
occurs during solidification at 
dendrite front 


/ 

/ 



The variation in composition that occurs on a microscopic scale between den- 
dritic arms of the solidified casting is known as micro-segregation. This can usually 
be significantly reduced by a homogenizing heat treatment for only minutes or 
hours as the diffusion distance required which is usually in the range 10-100 pm to 
redistribute the alloying elements is sufficiently small. On the other hand, macro- 
segregation cannot be removed. It occurs over distances ranging from 1 cm to 1 m 
and so cannot be removed by diffusion at high temperatures even in long homog- 
enizing times (Ramana Rao 2003; Ammen 2000). 


Micro-segregation 

During solidification of the melt, as the dendrite grows into the melt and the 
secondary arms spread from the main dendrite stem, the solute is rejected, effec- 
tively being pushed aside to concentrate in the tiny region enclosed by the second- 
ary dendrite arms. Since this region is smaller than the diffusion distance, it can be 
fairly considered as uniform in composition. Figure 11 schematically shows this 
segregation. The interior of the dendrite has an initial composition close to kCO, 
while, towards the end of freezing, the center of the residual inter-dendritic liquid 
has a composition corresponding to the peak of the final transient. This graduation 
of composition from the inside to the outside of the dendrite is commonly described 
as “coring” because, on etching a polished section of such dendrites, the progressive 
change in composition appear as onion-like layers around a central core. 

Dendritic Segregation 

Figure 12 shows how micro- segregation, the sideways displacement of solute as the 
dendrite advances, can lead to a form of macro-segregation. As freezing occurs in 
the dendrite, the general flow of liquid that is necessary to feed solidification 
shrinkage in the depths of the pasty zone carries the progressively concentrating 
segregate towards the roots of the dendrites (Ammen 2000; Campbell 2011). 

In the case of a freely floating dendrite in the center of the ingot that may 
eventually form an equiaxed grain, there will be some flow of concentrated melt 
towards the center of the dendrite if in fact any solidification is occurring at all. This 
may occur if the liquid is somewhat undercooled. However, this effect will be small 
and individual for each equiaxed grain. Thus, the buildup of long-range segregation 
in this situation will be negligible. 
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Fig. 12 Normal dendritic 
segregation (usually 
misleadingly called inverse 
segregation arising as a result 
of the combined action of 
solute rejection and shrinkage 
during solidification in a 
temperature gradient 


2 

o 



Pasty zone 


Liquid 



solute 


Movement 
of feed 
liquid 


Casting 
center line 



For the case of dendritic growth against the wall of the mold, however, the 
temperature gradient ensures that all the flow is in the direction towards the wall, 
concentrating the segregation in this location. Thus, the presence of a temperature 
gradient is necessary for significant buildup of segregation. 

Gravity Segregation 

In the case of ingot casting, the segregated liquid in the dendrite mesh moves under 
the influence of gravity. This melt has a density different from that of the bulk 
liquid. Thus, the lighter liquid floats, and the heavier sinks. 

In the case of steel, as the residual melt travels towards the roots of the dendrites 
to feed the solidification contraction, the density will tend to rise as a result of 
falling temperature. Simultaneously, its density will tend to decrease as a result of 
concentrating in light elements such as carbon, sulfur, and phosphorus. The com- 
positional effects outweigh the temperature effect in this case, so that the residual 
liquid will tend to rise. Because of its low melting point, the liquid will tend to 
dissolve dendrites in its path as solutes from the stream diffuse into and reduce the 
melting point of dendrites. Thus, as the stream progresses, it reinforces its channels, 
as a flooding river carve obstructions from its path. This slicing action causes the 
side of the channel that contains the flow to be straighter and its opposite side to be 
somewhat ragged. Figure 13 describes these phenomena schematically (Ammen 
2000; Campbell 2011). 


Surface Cleaning 

Cleaning the surface of cast metal before the forming process is essential for 
removal of the oxide layers to minimize the defects and increase the formability 
of metals. A treatment for the cleaning of castings of all types is sand blasting, in 
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Fig. 13 Development of segregation in a steel ingot during solidification 

which particles of sand are entrained in a powerful blast of air directed at the 
casting. The process is now almost universally unused as a result of the concerns 
about the generation of harmful silica dust. 

Treatment nowadays includes shot blasting in which steel or iron shot is used, 
centrifuged to high speed by spinning wheels in the so-called airless blasting. 
Stainless steel shot is, naturally, significantly more expensive than conventional 
carbon steel shot but has the advantage for aluminum castings that they remain 
bright and free from rust stains. Other treatments include grit blasting in which the 
grit is commonly a silica-free hard mineral such as alumina. Other blast media 
include glass beads, plastic particles, walnut shells, etc. 

After a blasting treatment, the surface of the casting obtains some degree of 
uniformity of appearance. For sand casting, however, not all of the sand particles 
are removed; a significant percentage of adhering particles are pounded into the 
surface and may become even more attached by the plastic flow of metal around 
and over them during blasting. This residual adhering sand may affect the rate of 
wear of machine cutting tools if extensive machining of surface is required. 
Sometimes caustic etching can help in removing these residual sands. However, 
such chemical treatments are to be avoided for environmental reasons. 


Heat Treatment 

In this section the necessary heat treatment process before metal forming is briefly 
discussed. The heat treatment of castings is one of costly preforming processes. 
Heat treatment is a vast subject that cannot be covered here. However, in what 
relates to metal forming, it is worth mentioning that castings are required to be 
homogenized at high temperature. 
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The initial thermal operation applied to ingots prior to hot working is referred to as 
“ingot preheating” or “homogenizing” and has one or more purposes depending on 
the alloy, product, and fabricating process involved. One of the principal objectives is 
to improve workability. The micro structure of most alloys in the as-cast condition is 
quite heterogeneous. This is tme for alloys that form solid solutions under equilib- 
rium conditions and even for relatively dilute alloys. The cast microstmcture is a 
cored dendritic structure with solute content increasing progressively from center to 
edge with an inter-dendritic distribution of second-phase particles or eutectic which 
was described in the previous section (Campbell 2011). 

Because of the relatively low ductility of the intergranular and inter-dendritic 
network of these second-phase particles, as-cast structures generally have inferior 
workability. The thermal treatments used to homogenize case structure for 
improved workability were developed mainly by empirical methods correlated 
with optical metallographic examination, to determine the time and temperature 
required to minimize coring and dissolve particles of the second phase. More 
recently, methods have become available to quantitatively determine the degree 
of micro-segregation present in cast structure and the rates of solution and 
homogenization. 

In rapid solidification, as a far-from -equilibrium condition exists, maximum 
micro-segregation across small dendritic walls resulted. However, in typical com- 
mercial ingots, large cells are more segregated than fine cells, and because of longer 
diffusion distances required, large cells are more difficult to homogenize. For 
instance, electron microscopy analyses of unidirectional solidified casting of an 
Al-2.5 %Mg alloy indicate that the degree of micro-segregation is greater in the 
coarser, more slowly solidified structure and that the approach to uniform solute 
distribution during heating at 425 °C is faster for the finer structure, as shown 
in Fig. 14 (Campbell 2011). 


Fig. 14 Effect of cell size on 
required time for uniform 
solute distribution during 
heating 
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The solution of a new phase, rejected inter-dendritically during solidification and 
influenced by the homogenizing treatment, is only one step towards providing 
maximum workability. Since most of the solute is in solid solution after this 
heating, further softening and improvement in workability may be obtained by 
slow cooling, to re -precipitate the solute in an inter- dendritic distribution of fairly 
large particles. Hence, we see that in addition to the constitutional equalization, 
normally considered to be the principal aim of the homogenization process, a 
number of other diffusion-related processes may be desirable or unavoidable, 
such as: 


• Grain coarsening or recrystallization 

• Precipitation of super saturated elements 

• Dissolution of unstable phases or precipitates 

• Coarsening of stable intermetallic particulates 

• Surface oxidation 

• Hydrogen degassing 

• Pore generation and agglomeration 

• Localized melting 

For quantification of the homogenization process, diffusion analysis of the 
materials systems can be done which is not the subject of this work. 


Modified High-Temperature Heat Treatment 

There is strong motivation to abandon or significantly modify the high-temperature 
solution treatment, and one of the most effective ways to reduce these excessive 
treatment times is using 10 °C rise rule especially in lower melting point alloys 
(Dossett and Boyer 2006). The reaction rate will be doubled by increasing the 
temperature by 10 °C. For instance, the treatment of an alloy at 180 °C for 4 h is 
more or less exactly equivalent to a treatment at 190 °C for 2 h or 200 °C for 

1 h. Although there is a limit to how far this translation of the treatment can be 
pushed to higher temperature and shorter times, for most practical purposes on the 
shop floor, it is often surprisingly effective over many conditions. 

The use of solution treatment temperature as high as possible, taking advantage 
of the general rule that an increase of 10 °C will double the rate of reaction, is of 
course limited by the temperature at which the alloy melts. 

However, well before the casting becomes too hot to melt entirely, it can suffer 
from overheating, a phenomenon known as “incipient melting.” Up heating some 
low-melting-point constituents of the alloy, which might constitute only 1 or 

2 vol.% of the alloy, can melt. On subsequent cooling, these constituents should 
resolidify so as to result in no net effect. But the reality is that this phenomenon 
compromises the mechanical properties of the material and particularly ductility, 
which would be significantly impaired by overheating (Totten 2007). 
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Summary 

In this chapter, authors have presented information and, in some cases, guidelines 
for the selection of materials used in the metal-forming processes. The details of 
preform, tool, and die materials, lubricants in forming and casting, and heat 
treatment processes before metal forming are presented. Preform materials are 
characterized by their formability, while tools and dies are provided materials are 
categorized based on the process details and limitations. The lubricants are briefly 
covered with a focus on their effectiveness and general applications. In the end, a 
concise summary of the raw material preparation for forming processes is provided, 
with focus on casting and heat treatment processes. This chapter is a quick reference 
to material selection and preparation for forming processes for researchers, engi- 
neers and students in the metal forming area are provided. 
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Abstract 

Roll forming is cost-effective compared to other sheet metal forming processes 
for uniform profiles. The process has during the last 10 years developed into 
forming of profiles with varying cross sections and is thereby becoming more 
flexible. The motion of the rolls can now be controlled with respect to many axes 
enabling a large variation in the profiles along the formed sheet, the so-called 3D 
roll forming or flexible roll forming technology. The roll forming process has 
also advantages compared to conventional forming for high-strength materials. 
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Furthermore, computer tools supporting the design of the process have also been 
developed during the last 10 years. This is quite important when designing the 
forming of complex profiles. The chapter describes the roll forming process, 
particularly from the designer’s perspective. It gives the basic understanding of 
the process and how it is designed. Furthermore, modem computer design and 
simulation tools are discussed. 


Introduction 

Roll forming is a metal forming process, which is spread throughout the world. Roll 
forming products are used in numerous applications, for example, buildings, auto- 
motives, airplanes, furniture, and domestic appliances. It is a highly productive 
process and the use of the process increases every year, Halmos (Halmos 2006). 
The benefit of the process in comparison to other metal forming processes is that 
auxiliary operations such as punching, welding, clenching, etc. can be included, and 
thus, it is possible to produce profiles directly ready to use. 

Roll forming can be defined as a continuous bending or forming operation by a 
number of consecutive rolls. Recently, it has also become possible to manufacture 
profiles with variable cross sections by varying the motions of the rolls during the 
process. Thus, it is not anymore limited to straight profiles with parallel edges and 
uniform cross sections. The roll forming process is sometimes called cold roll 
forming as it is most commonly performed at room temperature and there is very 
limited heating due to the process itself. However, the process has been combined 
with preheating at bend lines in order to improve ductility before rolling starts 
(Lindgren et al. 2009). 

All materials that can be bent can also be roll formed. Thus most metals and 
alloys like aluminum, various steels, copper, etc. The material can be pre-painted or 
pre-coated. The normal production speed that the profile can be produced varies 
between 10 and 30 m/min, depending on the tolerance of the cross section, the 
material, and how fast the machine can be fed with raw material or how fast the 
finished product can be removed from the runout table. The thickness of the 
material that can be roll formed is between 0.1 and 10 mm, but one can find 
application where thicker material has been used. 

The current section provides the basic understanding of the roll forming process 
and its design. 


The Roll Forming Process 

The bending occurs in several steps from an undeformed strip to a finished profile; 
the basic concept of the process is illustrated in Fig. 1. The forming is geometrical 
complex due to the forming does not only occur during the contact with the forming 
tools, but also between each forming stand (Bhattacharyya et al. 1984). Due to the 
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Fig. 1 The forming occurs in several steps from undeformed strip to a finished profile. The tools 
forming and driven the strip through the machine (Courtesy to data M Sheet Metal Solutions 
GmbH for the picture) 



Length of the 



Length of the bending zone 


Fig. 2 The material in the flange will travel a longer distance than the material in the bending 
zone between the forming steps. That difference will cause a strain in the flange 


latter, this will cause the material in the flange to travel a longer distance than the 
material in the bending zone, Fig. 2. This creates longitudinal strain in the flange 
(Bhattacharyya and Smith 1984), which should not be plastic in order to avoid 
large, longitudinal residual stresses in the finished profile. 
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Fig. 3 Different defects that the finished profile can have. From left , oil canning, flare, and 
wavy edge 

The roll forming process is very robust provided it is set up correctly. However, 
if the process is not adequately designed, then the profile can have defects, such as 
bow, twist, oil canning, flare, and wavy edges. Bow and twist are unwanted bending 
along the axis of the profile and rotation of the cross section, respectively. Oil 
canning, left part of Fig. 3, is a local buckling of surfaces of the profile. Flare, 
middle part of Fig. 3, denotes the local deformation at the end of the profile and 
wavy edges is shown in the right part of Fig. 3. Many of the defects are caused by 
too few forming steps, which give plastic deformations in other parts than the 
bending zone. These plastic deformations give residual stresses in the profile 
causing unwanted deformations. 

When tools for a new product are created, the tool designer starts with the 
wanted profile and unfold it in several forming steps until the profile is a flat 
sheet. To unfold the profile, the tool designer uses experience and makes a 
so-called flower pattern (see Fig. 4). The number of forming steps that is used to 
create the flower depends on the required geometry of the profile, the tolerances, the 
finish of the surface, and the material properties. If the tools are made for an already 
existing machine, the number of forming steps must fit this machine. It is often 
important to keep the number of forming step low as the cost increases linearly with 
the number of steps. On the other hand, more steps will usually improve the quality 
of the produced profile. 
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Flower design 


Tool design 



Simulation 



■ mm 







Tool manufacturing 



Fig. 4 The tool design process, from flower design to tool manufacturing 


Today one can find computer-aided engineering (CAE) systems (COPRA RF, 
data M Sheet Metal Solutions GmbH) that can support the tool designer when 
creating the flower and the tools. The designer can then use the system to estimate 
the strain and stress in the profile to see if the flower and number of forming steps 
are appropriate. These CAE systems use advanced geometric representations for the 
calculations (see section on “Finite Element Simulation of the Roll Forming 
Process” for some more details). One advantage of this type of analysis is that the 
simulation time is very short. After determining the number of steps and thereby the 
flower pattern, then the detailed design of the tools starts. At this stage the designer 
must know how the roll forming machine is configured. The latter is clarified in the 
next section. 

After designing the tools, it is possible to use finite element simulation (COPRA 
FEA RF, data M Sheet Metal Solutions GmbH) to verify the design before 
manufacturing the tools. This step may lead to redesign of the roll forming process 
if quality problems are found in the simulations. This is an iterative process, and 
after converging to a solution, then the drawings of the tools are finalized. Finite 
element simulations were not used in the design of the roll forming process until 
less than 10 years ago. Today one can find numerous examples where finite element 
models have successfully used to develop new profiles (Roll Forming, Creating 
real-world situations with dynamic simulation software). Finite element simula- 
tions are standard tools for larger companies. The design process is described step 
by step in the next section. 
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The Design of the Roll Forming Process 

The design process starts with a wanted profile that is to be produced. The process 
to determine the number of forming steps, the forming strategy, and the shape of the 
tools is the focus below. Halmos (Halmos 2006) also discusses the roll forming 
machine itself and secondary operation, etc. If the profile is to be manufactured on 
an existing roll forming machine, then the first step is to evaluate whether this is 
possible at all. The machine configuration must be known in order to assess this. 
The following issues must be considered: 

• The diameter of the shaft. It must be known in order to estimate if the machine is 
rigid enough for the required forces and torques. 

• The vertical distance between the upper and lower shafts and the horizontal 
distance between the forming stands of the machine. They determine whether 
the profile fits between the top and bottom shafts and if the tools have space 
enough between the stands. 

• If the machine has driven the top and bottom shafts, then it is necessary to choose 
gear ratios between these shafts. This is related to the driven point as discussed in 
the next section. 

• If the machine has vertical stands, then it can be used to form blind corners that 
else cannot can be reached by tools on the top and bottom shafts. 

• Whether the machine has a straightening stand that can be used to, for example, 
prevent twist of an unsymmetrical profile. 

• Maximum number of forming steps available in the machine. 

The tool designer goes first through the following steps: 

• Forming strategy 

• Calibration method and strip width calculation 

• Flower pattern 

• Simulation and machine configuration 

• Tool design 

Thereafter, shapes of the tools are known. The design may then be verified using 
finite element simulations. The results from the simulations can support the 
designer in deciding whether the designed roll forming sequence is appropriate or 
if further changes need be made. 


Forming Strategy 

Figure 5 shows two examples of flower patterns based on different forming 
strategies. They illustrate typical issues that have to be resolved during the design 
of the forming process and flower pattern. The advantage of producing the profile to 
the left in Fig. 5 is that the corners can be reached with top and bottom tools even 
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Fig. 5 Z-profiles, or flower 
patterns, that are roll formed 
with two different strategies 




when the legs are bent more than 90° to compensate for springback. The disadvan- 
tage is that the driven point on the profile will be either at the lower or at the upper 
corner when going from one forming step to the next. The diameter of the tools at 
this contact point will change when it shifts location. If the tools rotate with same 
speeds, then the tangent velocity of the tools at this contact is different. This can be 
prevented if the speed of the shaft can be adjusted pass by pass or the vertical 
distance between the tools can be changed pass by pass. Else the tools will scratch 
the surface on the formed material. 

The benefit to producing the z-profile to the right in Fig. 5 is that the driven point 
is on the flat area between the legs. It is the same driven point pass by pass. The gear 
ratio should then be 1 : 1 for the top and the bottom tools. The drawback is if the legs 
are bent more than 90° to compensate for springback, then the bottom and top tools 
cannot reach the corners, but side rolls or accessory rolls are needed to produce the 
wanted radius at the corners. The forming strategy to the right also requires tools 
with larger diameters to form the profile and thus needs more space between the 
shafts of the machine. 


Calibration Method and Strip Width Calculation 

Three different ways to form a 90° profile are shown in Fig. 6. One approach is to use 
tools with the same radius, left variant. Another way is to bend the same arc length 
with decreasing radii in every forming step, middle part in Fig. 6, until the wanted 
angle is achieved. The constant arc length is the most common method as it is easier 
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Constant 

radius 



Constant arc 
length 



Variable radius 
and arc length 







Fig. 6 Three different calibration methods to bend a profile to 90°. The constant arc length 
method is the most common calibration method 


for the sheet to move inward during the process. The constant radius method is used 
when it is necessary to overbend the profile in order to compensate for springback. It 
is then simple to determine how to distribute the overbending during several steps. 

Strip width calculation is needed in order to determine the required width of the 
initial sheet. The profile is then divided into straight and bent regions. The required 
width for the straight parts is easy to calculate. Estimating the requirements for the 
bent regions requires knowledge of the location of the neutral layer in the thickness 
direction. The amount of materials corresponds to the length of the neutral layer in the 
bent region. The neutral layer is where the material is neither compressed nor stretched 
(see Fig. 7). This is in the middle of the thickness of the sheet in the case of pure 
bending, as shown in the left part of the figure. However, if there is a bending moment 
and a stretching force as shown in right part of Fig. 7, then it is shifted downward. The 
radius to the neutral layer and the bent angle then give the length of the material in the 
bent comer. There exist different models for estimating this length. One example is the 
German standard DIN 6935 (COPRA RF User guide, data M Sheet Metal Solutions 
GmbH). Usually the radius to the neutral layer is located 0.3-0. 4 times the sheet 
thickness plus the inner radius of the bent comer. Thus, the most difficult part in 
estimating the width is the contribution from the bent regions. The total required width 
of the strip is then the sum from all straight and bent region contributions. 


Flower Design and Simulation 

The flower design process is illustrated for the C-profile in Fig. 8. The profile is 
unfolded in several steps to a flat sheet. There are no simple guidelines for this 
process. The basic idea is to begin creating the bent regions in the middle of the 
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Fig. 7 Location of the 
neutral layer when pure 
bending, top, or stretch 
bending, bottom, is used 


Neutral layer 

in pure 




Neutral layer 



i 



Fig. 8 To the top, the flower 
of an unfold C-profile. To the 
bottom, the 3D model of the 
flower. It can be seen that 
forming of the outer flange 
starts quite early 



sheet first and move outward in the forming process. CAE tools, like COPRA RF 
(COPRA RF, data M Sheet Metal Solutions GmbH), PROFIF (PROFIL, UBECO 
GmbH) using 3D models of the geometry can be used to estimate strains, etc.. They 
are used in this stage of the design process, whereas more time-consuming finite 
element simulations are used later in the design verification phase. 

The right part of Fig. 8 is a 3D model of the flower created by a CAE tool 
(COPRA RF, data M Sheet Metal Solutions GmbH). It shows how the edge moves 
up and inward during the forming process. This deformation needs larger forces 
when the outer flange of the C-profile develops. Therefore, this edge should be 
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Bottom roll 


Fig. 9 The side rolls forming the blind corner. Right part is a magnification 



Fig. 10 To the top, the 
profile is folded with a bend 
angle in the station so the 
inner bent corner can be 
reached with tools from the 
top and bottom shaft. This 
gives then in the next step the 
profile to the bottom where 
side rolls must be used to 
bend the so-called blind 
corners (see Fig. 9) 



formed in later stations. However, this is not possible for the specific C-profile. It is 
necessary to start with the forming of the flange in the early stages, as the flower 
shows, and finalize it by use of side rolls, or the so-called accessory rolls (see Fig. 9) 
in the final stations. This is clarified in the next section where details of the design 
process are given. 

Figure 10 shows a well-designed case where the tools from the top shaft exactly 
reach the lower comer of the C-profile. Thereafter, it is a blind corner, and side rolls 
are needed in the subsequent forming steps. 
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Fig. 1 1 Calculation of the longitudinal strain in the flange of the profile. Every bar corresponds to 
the strain in one forming step, pink for the left side and blue for the right side 


An overbending is required in order to account for springback, particularly for 
high-strength steels. If this compensation has not been distributed over the 
forming step, then a final step is required for this purpose. In this case, side 
rolls are needed in the current case to obtain the final geometry. Often also a 
calibration step, where the tools have exactly the shape of the wanted section, and 
straightening mills in the end of the forming are used. This is to compensate for 
any problems with, for example, twisting or longitudinal bow of the produced 
profile. 

When the flower is finished, specialized CAE software (COPRA RF, data M 
Sheet Metal Solutions GmbH) is used to calculate the deformation length of the 
flange (Bhattacharyya et al. 1984), and thereby the longitudinal strain in the flange 
can be estimated (Bhattacharyya and Smith 1984) for every forming step (see 
Fig. 11). This model requires that a flower is chosen, roll diameters are determined, 
and the distance between the forming steps and the material properties is known for 
the profile. The figure shows the flower, and the bar diagram shows the estimated 
longitudinal strains in the outer part of the flange. This strain should be lower than 
the strain for plastic yielding. If not, then quality problem like wavy edge, longi- 
tudinal bow of the profile, etc. can occur. If this is found to be the case, then the 
flower need be changed, and maybe more forming steps have to be added. In this 
current case, the bar diagram shows that the longitudinal strain is low in the three 
first forming steps. It increases when the outer flange is formed and reaches the 
yield strain denoted by the horizontal line. The strain is however not much higher, 
so it is likely that the chosen flower is sufficiently good. Sometimes it is necessary 
to iterate a couple of times between the calculations and the flower design and 
maybe also add more forming steps. 
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Machine Configuration and Design of Tools 

The machine configuration must be determined or known before the tools can be 
designed: 

• Vertical distance between the shafts must be set for each forming stand. It 
depends on the height of the final formed profile. 

• Horizontal distance between the roll forming stands. 

• Diameter of bottom and top tools. 

• Gear ratios between the upper and lower shafts. It depends on the speed 
difference between the top and bottom tool at the driven point and on the vertical 
distance between the shafts and roll diameters. 

• Shaft length. 

• Number of forming stations. 

• Vertical stands and a straightening stand. 

There are some general and practical issues to consider when designing the 
tools. The tools must naturally be possible to produce in a turning mill and mount on 
the shafts and be easy to line up. It should also be easy to check the roll gap. The 
flower pattern gives the ideal output profile after each step, but there will also be 
some springback. It must be ensured that the front edge of the profile does hit the 
roll at a location enabling the sheet to be guided smoothly into the contact region. 

Figure 12 shows the first forming station for the C-profile. The space between the 
top and bottom tools is the same as the sheet thickness on their interior and also for 
the outermost part of the tools. The latter region is not in contact with the sheet, but 
this makes it easy to measure the roll gap when setting up the tools. The shape of the 
upper and lower tools is designed so the movement in sideways is prevented. 




Fig. 12 To the left , the top and bottom tools for station 1. To the right , a magnification of the area 
with a small bend angle that guides the sheet so it moves inward smoothly 
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The tools are locking the sheet (see magnification in the right part of Fig. 12). These 
tools will together with the outer part of the bottom tools force the sheet to move 
inward at the same time the top tool bents the comer. The profile of the top has an 
upward angle, relief angle, which is 0.5°-l° greater than the bottom tool. This 
ensures that the driven point is on the web on the sheet and not on the flange. The 
rolls will have the same diameter at the driven point and same angular velocity. 
Thus, they will have the same tangential speed at the web but not at the flanges. The 
distance between the top and bottom tools outside the flanges, the area where the 
tools force the sheet inward, is about 20 % of the sheet thickness, making it 
impossible for the sheet to run into this area. The above design is adapted to the 
next two stations. The bend angle increases in every station. The total width of the 
tools in stations 1-3 should be the same as this makes it easier to line up the tools 
when they are installed. 

Station four is the station where the first forming of the inner comer is done. It is 
also the station where the over bending of the outer corner occurs. In this case this 
corner is impossible to reach with a top tool as it is a so-called blind corner (see 

Fig. 13). 

The upper tool consists of two rolls. The inner roll makes it impossible for the 
flange to move sideways. The top tools press down the top of the flange, and this 
will create a force on the bent comer, and it will be formed to a smaller radius. This 
bending must also account for springback, and thus, some overbending is required. 
The drawback of this approach is that it is not possible to independently control the 
inner radius of the comer. The inner top tool has also a chamfer as it is possible that 
the flange starts to move inward before it has left the current forming station. The 
flange will glide against this part of the tool if this chamfer is skipped. The tools are 
also designed with a relief angle in this forming region for all stations. The roll gap 
is the same as the sheet thickness in the web where the driving point is located. This 
design is the basis for all subsequent stations until it is not possible to access the 
inner corner anymore with the top tool. It will become a blind corner thereafter. It is 
best to design the flower pattern so the angle at the inner corner is as close as 
possible to this limit angle the last time the top tool reaches the comer (see Fig. 14). 



Fig. 13 Tools in stations 4 and 6. Magnification shows the forming of the inner and outer blind 
corner of the C-profile. Both the top and bottom tools consist of two rolls 
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Fig. 14 The last station before the inner comer will be a blind comer. The inner top tool only 
covers the radius 



Fig. 15 The side rolls forming the blind comer. In this case, the overbending step 


The design of the later stations requires side rolls or accessory rolls in order to 
give the inner corner a specific radius (see Fig. 15). Most roll forming machines do 
not have tool holders for side rolls or accessory rolls as it is difficult to know exactly 
where they should be located. This type of rolls depends on the specific profile. The 
tool holders are part of the design process. Tool holders for the outer side rolls are 
simple, whereas it may be a problem to design them for the inner side rolls due to 
lack of space. It is also beneficial if the side rolls are somewhat adjustable in vertical 
and horizontal directions. 




7 Roll Forming 


299 



Fig. 16 The figure shows how the tools are split into pieces, and it shows the roll attribute and the 
numbering system to keep track of the tools 


The final station must create a profile with overbending, Fig. 15, in order to 
compensate for springback. It is better to design the tools with too much than too 
little overbending as a too large angle can be easily compensated afterward. 

The designed tools will have additional features, roll attributes, than their profile. 
They need holes for the shafts, holes for bearings in side rolls, retain rings, etc. 
Figure 16 shows a drawing generated by COPRA RF (COPRA RF, data M Sheet 
Metal Solutions GmbH). Finally assembly and manufacturing drawings are made. 
However, it is becoming more common to use finite element simulation to verify 
the design before manufacturing the tools. This is described in the next section. 


Finite Element Simulation of the Roll Forming Process 


The CAE system used in the design process described above describes the geometry 
of the sheet between two forming steps with various shape functions, for example, a 
model for tubes by Kiuchi (Kiuchi and Koudabashi 1984). The boundary conditions 
are given by the tools at the forming stations. These boundary conditions and 
minimization of strain energy, assuming linear elastic behavior, make it possible 
to quickly evaluate the shape between the stations. The strain is thus possible to 
evaluate from the shape functions. However, the elastoplastic behavior of the 
material is not accounted for but the calculation is very fast. Finite element 
modeling, where the real material behavior is accounted for, can be used to simulate 
the rolling process. These simulations give relations between forming forces and 



300 


M. Lindgren et al. 


Fig. 17 To the top , the figure 
shows only half of the 
U-channel because the 
symmetry and the rosette 
strain gage are bonded on the 
top surface close to the edge 
of one flange. To the bottom , 
the roll load and torque were 
measured during the forming 


Bending zone 




deformations, and also springback can be predicted. The creation of the finite 
element model is quick if systems like (COPRA FEA RF, data M Sheet Metal 
Solutions GmbH) are used. However, the finite element simulation requires com- 
puter time, and evaluation of the result will also require some work. But use of finite 
element modeling can decrease the cost and time for the complete development 
process. There is a considerable amount of research papers about sheet metal 
forming but much less about roll forming and simulations, for example, Lindgren 
(2009a), McClure and Li (1995), and Roure Fernandez et al. (2009). 

Lindgren (2008) investigated the forming of a U-profile (see Fig. 17) and 
validated it by comparing the result with the corresponding experiments (see 
Fig. 18). The longitudinal strain, forming loads, springback, and the torque were 
measured during the forming of the U-profile. The profile was made of high- 
strength steel, HyTens® 1200. Four forming stations were used where the bend 
angles were 20, 40, 60, and 80°. The experiment was performed in a standard roll 
forming machine (see right part of Fig. 17). 
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Longitudinal Strain vs Displacement Tor Hy Tens 1200 



Fig. 18 The experimental and computed longitudinal strain is compared when four forming 
stations are used. The bend angles are 20°, 40°, 60°, and 80° 
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Fig. 19 Simulation of a profile with holes and investigation of forming forces 


Finite element simulation of the C-profile is described next in order to illustrate 
the use of finite element modeling. The software used for the simulations is COPRA 
FEA RF (COPRA FEA RF, data M Sheet Metal Solutions GmbH). It is possible to 
use the models with or without friction, with precut material or from coil. It is also 
possible to simulate cutting of the profiles and stamping holes, etc. (see Fig. 19). 
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It is recommended to start with a simple model with few elements. Even this first 
model can reveal weaknesses in the tool design. It goes also faster to find mistakes 
in the tool design when a smaller model is used. The model can be refined when the 
design is known to work. It is good to have a restart facility on the software when 
simulating several forming stations. Then it is possible to simulate one station and 
evaluate the results with respect to the tool design before continuing to the next 
station. It is also possible to start the simulation at a specific forming step, but then 
the strain history from earlier steps is not taken into account. Furthermore, the 
initial geometry will then have to be based on the ideal geometry that can be 
obtained from the CAE system. 

The first finite element model of the C-profile has no friction, and the sheet 
does not have any holes. The final cutting of the profile into shorter lengths 
is also included. An elastic-plastic material model with isotropic hardening 
is used. The von Mises yield surface and the associated flow rule are used 
(Marciniak et al. 2002). The hardening is described by the Ludwik-Hollomon 
equation 


o = o y +Ke n p 

where a y is the yield strength, K is the strength index, e p is the plastic strain, and n is 
the strain hardening exponent. The virgin yield limit of the material is 450 MPa, 
the ultimate yield strength is 490 MPa, and the A80 value is 22 %. The sheet is 
modeled using 5980 eight-node solid brick elements with one layer through the 
thickness. 

The simulation result shows that profile will flare out after cutting (see Fig. 20). 
This is a common problem in roll forming, and the flare can be reduced by 
increasing the number of forming steps. One of the reasons why flare occurs is 
that the residual strains on the outer and inner surface of the sheet are different. The 
cause in the current case is that probably the inner flange has been stretched too 
much in some of the forming steps. The computed longitudinal strain at the surface 
of the outer flange is shown in Fig. 21 for all forming stations. It is largest in 
forming steps 2 and 9. 

The springback of the profile is also obtained in the simulation. Evaluation 
shows that the overbending of the inner bent corner is somewhat too large. The 
tool designer can decrease the overbending in forming step 11 in COPRA RF 
(COPRA RF, data M Sheet Metal Solutions GmbH) and restart the simulation 
model after station 10 to quickly evaluate how much the overbending should be 
reduced. It is possible to iterate between the simulations and redesign of the tools 
and maybe increase the number of forming steps until a satisfactory solution is 
obtained. Then it may be worthwhile to use more advanced models, including 
friction and rotating tools, and more elements, including stamped holes, etc., in 
order to verify the design. However, quite often simple models will provide 
sufficient accurate results for design verification. 
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Fig. 20 To the top , the cut 
profile after simulation. To 
the bottom , the longitudinal 
strain in the outer flange is 
higher than in other parts of 
the profile 




Flexible Roll Forming 

The use of the roll forming process increases every year due to the possibility to 
form complicated products in different metals in combination with high productiv- 
ity. The disadvantage with roll forming up until about 10 years ago was that only 
sections with a constant cross section were possible to produce. Nowadays the “3D 
roll forming” or flexible roll forming technology is used to produce panels with a 
variable cross section (see Fig. 22). The key technology is the possibility to control 
the movements of the tools. The method is very flexible which means that profiles 
can be produced with the same setup of roll forming tools. To change the profile, the 
operator of the machine only downloads a new program to the control system of the 
machine that describes the movement of the tools. 

Experimental investigation of the flexible roll forming process can be found in, 
for example, Lindgren (2009b). This paper describes how to produce hat-profiles 
with variable depth and width with a new tooling concept. Different profiles with 
variable cross section, left part of Fig. 22, were produced. Giileceken et al. (2007) 
used COPRA RF (COPRA RF, data M Sheet Metal Solutions GmbH) coupled with 
the finite element module MSC.Marc to simulate the flexible roll forming process. 
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Fig. 21 The longitudinal strain at the surface of the outer flange is presented. The forming steps 
2 and 9 have higher strain than the other steps 



Fig. 22 Flexible roll forming, profiles with variable cross section. To the left described in 
(Lindgren 2009b) and to the right described in (Sedlmaier et al. 201 1) 


They simulated the forming of a U-profile. The simulation time was only 3-4 CPU 
hours due to relatively coarse mesh in the model. The result showed that the design 
roll forming process gave a final geometry close to the wanted profile. Abee 
et al. (2008) studied the characteristic process properties and the process limits of 
the flexible roll forming process. They focused on the wrinkling phenomena in the 
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Fig. 23 Finite element simulation of a 3D roll formed U-profile. Tension and compression stress 
acts in transition zone 
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transition zone. This is the zone where the cross section of the formed part changes 
(see Fig. 23). Based on the simulations, they proposed a blank holder system that 
improved the quality of the profile with less wrinkling. Sedlmaier et al. (2011) 
described a development of a flexible roll forming machine used to produce 
hat-profiles (see the right part of Fig. 22). The experiment showed that the use of 
a blank holder system could prevent wrinkling. The drawback was that the blank 
holder system was static and it scratches the sheet. The use of a so-called self- 
driving blank holder system removes this problem. This is a system where the 
holder follows the sheet between the forming stands and thereby surface defects are 
avoided. 


Summary 

Roll forming is an efficient production method that can be used for a large variety of 
metals with typical thicknesses from about 0.2-6 mm. The process has also been 
successfully applied to high-strength steels as the associated larger springback can 
be well controlled when designing the roll forming line. Furthermore, the develop- 
ment of flexible roll forming, section on “Flexible Roll Forming,” makes it possible 
to form intricate shapes and thereby expand the use of roll forming considerably, 
e.g., complex parts for the automotive industry. Advanced CAE tools, sections on 
“Flower Design and Simulation” and “Finite Element Simulation of the Roll 
Forming Process,” that can support the design of the process have matured and 
are very valuable when the formed shapes become more complex. 
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Abstract 

Casting is a cost-effective route to make extremely complex geometry near net 
shape components. To reach the full potential in the cost/complex perfor- 
mance, it is essential to understand the whole manufacturing route. This 
chapter on metal casting covers the fundamental of the metallurgical aspect 
of melting, and casting of both ferrous and nonferrous materials is covered. 
This includes furnaces and melting practices. The shaping of casting is 
described in more detail including mold filling and solidification. An overview 
of pressure die casting, die casting, and disposable mold casting including both 
sand and ceramic molds is covered. Important constraints such as process alloy 
capability and ability to produce complex and sound geometries are covered as 
well as the first basic steps towards a sound design are covered. In the 
description of the route to sound castings, Campbell’s Ten Commandments 
are followed and discussed in detail. Last but not least, the metal alloys and 
their micro structure and properties are treated covering both ferrous and 
nonferrous alloys. 
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Introduction 

Casting is one of the oldest methods to manufacture objects with cast copper compo- 
nents dating back to 3000 BC having been found in archaeological surveys. Copper 
was followed by bronzes (copper-tin alloys). Cast iron was developed in China around 
600 BC. Material classes like cast iron have evolved from early use in cast statues into 
a cost-effective advanced material that to a large extent has enabled the modem 
industry and transportation (Jarfors et al. 2010). Cost-effective cast aluminum alloys 
and magnesium alloys are all twentieth-century innovations. Technology develop- 
ments enabling casting of reactive materials such as titanium alloys and highly 
engineered single crystal nickel-super alloy provide a full palate of choices in com- 
ponent manufacturing. Together with the extraordinary design freedom offered by 
casting in addition to the full palate of materials and processes, casting has grown to a 
dominant process for components in most manufacturing sectors, laying a foundation 
for high-value manufacturing, in a robust and sustainable manner (Jarfors et al. 2010). 


General Process Description 

Casting has been used for a long time, and as such a great number of variants have 
been developed. The casting manufacturing route consists of seven critical events 
all critical to cost, process yield, and productivity. These seven events are: 

• Cast component design 

• Mold/die layout and manufacturing 

• Melting of the metal and metallurgical conditioning of the melt 

• Filling of the die cavity 

• Solidification of the melt in the die/mold 

• Shakeout/part removal 

• Fettling and part post-casting treatments 


Cast Component Design 

The foundation of a successful component made through the casting manufacturing 
route is based on some simple concepts. By making a sound design, a highly cost- 
effective component will not only provide value to the end use but also enable high 
yields in the manufacturing process and will thus also be resource efficient well 
suited in a sustainable manufacturing environment. 

Mold/Die Manufacturing 

The task of the die/mold is to shape the melt and to remove the heat so that the melt 
solidifies to the desired shape. Important to the choice of casting process is the 
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function of the mold/die as different types of molds/dies will create different 
constraints on the component design and the ability to handle different sizes of 
series. There are methods capable from single piece manufacturing up to millions of 
parts per month. Understanding these constraints is thus a part of making the right 
choice of process. The main discriminator for the choice of mold/die material is 
however determined by the melting point of the metal to be cast. High-temperature 
melting alloys such as steel and cast irons are commonly cast using sand or ceramic 
molds. Low -temperature melting alloys such as zinc, aluminum, and magnesium 
alloys are cast in metallic molds. Copper and brass can be cast in pressure die 
casting processes using permanent molds, but these alloys represent an upper limit 
for the use of metal molds. It should be noted that un-pressurized die casting of steel 
is used but only for moderately complicated shapes and limited sizes. The die/mold 
can be classified into the following three main principles (see also Table 1): 


Type of mold/die 
Permanent die 
Disposable mold 
Model 

Permanent model 
Disposable model 
Filling principle 
Gravitation 
Low pressure 
High pressure 


Principle 1 

The die is made from a solid material, most commonly hot-work steel such as HI 3. 
There are a number of grades supplied by the main tool steel makers in the world such 
as Hitachi, Uddeholm Tooling, and Daido. A physical model for the die cavity is not 
necessary as the die cavity is machined out by mechanical milling and electro- 


Table 1 Systematics of mold and dies (Jarfors et al. 2010) 


Method 

Type of mold 

Mold 

material 

Model type 

Filling 

principle 

1 

Permanent with parting line 

Metal 

CAD 

drawing 

High pressure 

Graphite 

Low pressure 

Gravitation 

2 

Disposable with parting line 

Sand 

Permanent 

Gravitation 

Plastic 

Wood 

Gypsum 

Plastic 

Ceramic 

Metal 

3 

Disposable without parting 
line 

Sand 

Disposable 

Gravitation 

Plastic 

Wax 

Low pressure 

Ceramic 

Plastic foam 

Centrifugal 
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discharge machining (EDM) followed by grinding and polishing. From a design 
perspective, a parting line in the tool is necessary for part extraction from the die 
cavity, and undercuts should in general be avoided. Slider cores or salt-based cores in 
some cases are possible to use, but add cost to the manufacturing. The degree of 
automation is high and a sizeable series is necessary for a profitable solution. 

Principle 2 

A disposable mold is commonly made from sand or ceramics, even though plastic 
molds exist, using a permanent model. As the model needs to be extracted from the 
mold cavity during mold making, a parting line is necessary. After model extrac- 
tion, the two mold halves are put together and the mold is ready for use. The use of 
cores to create inner features and undercuts is common. The degree of automation 
varies, but is possible through mold-making equipment from suppliers such as 
DISA and Hunter. Flexibility is high, and depending on series size, different 
types of models can be used. The simplest ones for shorter series are made from 
lacquered wood, and the most durable would be made from metal. 

Principle 3 

The use of a disposable mold and a disposable model increases the design freedom 
as there is no parting line needed. The mold is shaped around the model. The model 
is removed and the mold is subsequently sintered. The exception here is lost foam 
processing where the plastic foam material is evaporated during mold filling. 


Melting of the Metal and Metallurgical Conditioning of the Melt 


The melt is commonly made either from remelting a ready-made alloy or from 
scrap-based processing. Foundries also recycle runner and gating systems used to 
fill the mold/die in-house. In the melting furnace, the main melt conditioning is 
made to achieve the right alloy composition. Furthermore, special treatments, such 
as inoculation to control the micro structure of the part produced, are commonly 
used to control grain size in aluminum alloys and graphite shape in cast irons. Other 
treatments include strontium treatments to modify silicon in aluminum alloys for 
mechanical properties control. The main issue in cast components is porosity. In the 
process of making a sound component, having a high-quality melt is essential. 


Casting/Pouring 

In the casting/pouring step, the die cavity is filled. The rate of filling is very 
important in all casting processes, and minimal turbulence is the main aim. In 
gravity-assisted processes, this is achieved by the sprue, runner, and gating design 
in combination with the shop floor operator skill. In a pressure-assisted process, this 
is decided by the runner and gating system together with the computer-controlled 
injection system. 
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Solidification 

Solidification is primarily determined by the part design and as such also all defects, 
but there is a strong relation between what is possible and the choice of process. 
The main issue is porosity formation and stress buildup/cracking together with part 
distortion and lack of tolerances. 

Shakeout/Part Removal 

Once the part is solidified, it is to be removed from the mold/die cavity. This step 
changes the cooling rate and restrictions in shrinkage and affects the tolerances and 
residual stresses. The timing of shakeout or ejection/part removal is an important 
process control. 

Fettling and Part Post-Casting Treatments 

Subsequent to part extraction fettling is started with the removal of the runner and 
gating system. It should here be noted that for high-pressure die casting, it is 
common that the largest possible gate is to be used. Sometimes it is the fettling 
process that limits this as there is a practical force limitation in the de-gating step. In 
this single operation, the flash can also be removed. Following this machining, 
surface coating and heat treatment can be done. 

The differences in these steps and equipment setup generate different capabili- 
ties of the processes to cast different materials. A comparison between different 
processes and their alloy capability is shown in Table 2. 


Melting of the Metal and Metallurgical Conditioning of the Melt 
Melting of Aluminum Alloys 

Melting of aluminum alloys is normally based on remelting of pre-alloyed ingots 
from a material supplier. Foundry practice also involves in-house recycling of runner 
and gate materials and rejected parts. To maintain a good melt quality, a ratio of 70 % 
virgin material to 30 % in-house recycled stock is recommended (Brown 2010). 

Melting is often made in a central furnace, and melt is transported in transfer 
ladles to holding/dosing furnaces at the casting stations. During the transport, melt 
treatment is made commonly involving degassing, grain refinement, and strontium 
modification treatment where applicable. To melt aluminum there are four main 
types of furnaces: 

• Medium-frequency coreless induction furnaces 

• Reverberatory furnaces 

• Shaft furnaces 

• Crucible furnaces 


Table 2 Process alloy capability map adapted from Jarfors et al. (2010) 
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Zinc alloys 

Aluminum alloys 

Magnesium alloys 

Titanium alloys 

Material class 
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Cast steels 

Heavy alloys 

Light alloys 


X indicates an excellent prematch between material and process and Y represents acceptable capability 
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Fig. 1 Coreless induction 
furnace 


Lining 



Induction 

coil 


Medium-frequency coreless furnaces (Fig. 1) are an electric furnace that 
typically caters for melting 500 kg up to 2,000 kg per charge with up to 
2,000 kg/h as production capacity. Best economic practice is to leave a heel melt 
at the bottom of the furnaces and immediately start with the next batch. Induction 
heating stirs the melt ensuring a thermally and chemically homogeneous melt. The 
drawback is entrainment of oxides and slag into the melt, and transfer ladle- 
cleaning flux treatment is recommended (Brown 2010). 

Reverberatory furnaces (Fig. 2) are oil or gas fired and operated more or less 
continuously with capacities of 200 kg/h up to 1,300 kg/h. An advantage of 
reverberatory furnaces is that the use of bulky scrap becomes possible. A drawback 
is that the heating flame comes in direct contact with the melt generating 4-8 % 
melting losses (Brown 2010). 

The shaft furnace (Fig. 3) was developed for high melting capacity, typically 
between 1,000 and 3,000 kg/h. The shaft furnace is also well suited to manage high 
rates of foundry returns and melting small losses, typically in the order of 1 % 
(Brown 2010). 

Crucible furnaces (Fig. 4) are common in the smaller foundry as they are 
robust, available in a large number of sizes, and are relatively inexpensive 
(Brown 2010). These may operate in three different modes: 

• Tilting mode, where the melt is poured over a lip (Fig. 4) 

• Fixed lift out mode, where the cmcible is removed from the furnace before 
pouring 

• Fixed bale-out mode, where the melt is ladled out from the furnace and poured 
into the mold 
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Fig. 2 Reverberatory 
furnace 


Flue 




Do o r 


Fig. 3 Shaft furnace (Courtesy of Stotek A/S) 


Melting of Magnesium Alloys 


Magnesium is a reactive metal that cannot be melted in refractory materials, as it 
attacks refractory lining. Iron has only a slight solubility in Mg making Fe nearly 
inert to Mg and making mild steel a suitable crucible material. Ladling should be 
avoided and pumping systems are used to transfer melt to holding stations and the 
mold/die. 
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Fig. 4 Crucible furnace 


Electric 

heater 



Magnesium alloys are melted under protective atmosphere, and the use of 
cleaning fluxes is recommended (Brown 2010). Previously a dry air +0.5 % SF 6 
gas mixture was used, but SF 6 is one of the worst greenhouse gases. Today, instead 

T’A/t 

of SF 6 , solutions such as HFC-134a (AM-cover ) and dilute S0 2 are used with 
more solutions that are under way based on BF 3 and S0 2 F 2 (Alternatives to SF6 for 
magnesium melt protection 2006). 


Melting of Copper Base Alloys 


Melting of copper that is a less reactive metal is simpler. Copper and copper base 
alloys do however dissolve significant amounts of hydrogen and oxygen that 
together form water, resulting in porosity. Coupling this to the in general short 
solidification intervals of the copper base alloys, large cavities are found if the 
process is not maintained within a narrow process window. This is true for all 
copper alloys and bronzes except for those containing aluminum. In aluminum- 
containing bronzes aluminum will form very thick and tough oxide films that 
generate slag defects and harm fluidity of the melt (Brown 2010). 

To successfully melt copper, it is essential that the melting takes short time and 
that cover fluxing degassing and deoxidation is made properly. Copper alloys are 
commonly molten in the same type of furnaces as aluminum, but the type of flux 
used is matched with furnace and alloy used. The general melting principle is to 
keep oxygen content as high as possible for as long time as possible, to minimize 
hydrogen pickup. Once degassing and deoxidation is made, casting should be made 
immediately as this opens up for hydrogen pickup. Hydrogen pickup is fast as H 2 is 
a small molecule. This is of particular importance when casting high-conductivity 

o 

copper (Brown 2010; Fredriksson and Akerlind 2006). 

Brasses that contain zinc are also a special case for melting. Zinc has high vapor 
pressure preventing hydrogen pickup in the same way as oxygen in aluminum-free 
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bronzes. The high vapor pressure requires low casting temperatures and no 
overheating as zinc will be lost with worker zinc poisoning as potential risk when 
inhaling zinc vapor (Brown 2010). 


Melting of Zinc-Based Alloys 

Zinc alloys are members of the low-temperature melting alloy family. Zinc alloys 
are commonly melted in mild steel crucibles with no need for protective gas. It 
should be noted that overheating and excellent ventilation are required due to the 
high vapor pressure of zinc to maintain a good working environment (Brown 2010). 


Melting of Cast Irons 

Melting of cast iron is most commonly based on scrap, and alloys are usually made 
at the foundry. The main manufacturing routes are either by cupola furnaces or by 
an electric furnace. For these two classes, there are several types of furnaces for 
melting cast irons (Brown 2000): 

• Cupolas furnaces 

• Cold blast cupola furnaces 

• Hot blast cupolas 

• Cokeless cupolas 

• Electric furnace 

• Electric arc furnace 

• Channel furnace 

• Coreless induction furnaces 

Cupola furnaces (Fig. 5) are the classic route to produce cast irons generating a 
base melt. For a good result, a steady operation is necessary and large variations in 
production volume are difficult to manage. Superheating the iron is also difficult. 
Cupola furnaces are thus commonly used together with an electric furnace for 
conditioning and superheating before transferring to the casting station. The cold 
blast cupola uses ambient temperature air, in the combustion of the coke, resulting 
in end temperatures in the range of 1,350 °C up to 1,450 °C. Higher temperatures 
are possible but require the addition of pure oxygen. The lining life of the cold blast 
cupola is short requiring daily relining. The cold cupola is thus most commonly 
operated in pairs allowing one to be relined, while the other produces melt for casting. 
The hot blast cupola offers a route to longer lining life, up to several weeks, by 
preheating the blast gas above 400 °C. This reduces the coke consumption and allows 
for either long-life refractories to be used or for a lining-free solution to be used. The 
end temperature is for the hot blast cupola around 1,500 °C (Brown 2000). 

One issue with cupola furnaces is the hot fume emission containing grit, dust 
particles, and S0 2 gas. Scrubber solutions are available to reduce this problem. 
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Fig. 5 The cupola furnace 
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Fig. 6 Electric arc furnace Multi layered lining 

Another way to deal with this is to use a “cokeless cupola” that runs on gas or oil 
instead. The charge is supported by a bed of refractory spheres. Elimination of coke 
removes sulfur contamination of the melt. Sulfur is harmful to the making of ductile 
iron. High-quality foundry coke has low sulfur content, and in areas where it is hard 
to come by or its price is high, the “cokeless” cupola is a real alternative. The 
“cokeless” cupola is also more forgiving to impurities such as lead and zinc and 
allows use of wider range of scrap, contaminated by oil and water. The end 
temperature in a “cokeless cupola” is typically just below 1,400 °C (Brown 2000). 

Electric furnaces such as electric arc melting (Fig. 6) can be used for cast iron 
production, but the long tradition of using a cupola for primary melting makes this 
the preferred solution. As conditioning is difficult, using the cupola duplexing with 
an induction furnace is the most common combination (Brown 2000). 
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Fig. 7 Channel furnace 
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Table 3 Graphitization 
potential of common 
alloying elements (Brown 
2000) 


Graphitizes 

Carbide stabilizers 

C 

+3.0 

Cr 

-1.20 

P 

+1.0 

Mo 

-0.35 

A1 

+0.5 

Mn 

-0.25 

Ni 

+0.3 

V 

— 1.0 to 3.0 

Cu 

+0.3 




In the channel furnace (Fig. 7), an inductor is placed under a channel in the 
furnace allowing the melt to be both inductively heated and pumped through the 
furnace. This allows for both thermal and chemical homogenization of the melt 
making it excellent for melt conditioning before casting. It will however require 
continuous operation, just as for melting of aluminum a coreless induction furnace 
can also be used for the holding and conditioning before casting. 

In the production of grey cast iron, inoculation and control of the graphite 
morphology is extremely important. The first requirement is that the graphite poten- 
tial of the melt is high enough to avoid white solidification resulting in the formation 
of carbide (Fe 3 C) instead of graphite (C). Phase selection has both cooling rate and 
alloying sensitivity that each foundry must understand in their production. Table 3 
shows the graphitization potential of various common alloying elements. 

In addition to having a sufficiently high graphitization potential, inoculation to 
generate the correct graphite morphology is necessary. Flake graphite is the stan- 
dard morphology for graphite. Other forms of graphite involve the spherical 
graphite known as ductile iron and an intermediate shape the so-called compacted 
graphite iron. The compacted graphite iron has lately been getting more attention 
due to its high thermal conductivity in addition to good strength and ductility. It is 
however still inferior to the ductile iron in terms of strength and ductility, but for 
engine applications where both strength and ductility are equally important as 
thermal conductivity, it is breaking new ground. The control of the morphology is 
significantly more difficult than that of the ductile iron. 
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Fig. 8 Sandwich ladle 
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To spheroidize graphite the graphitization potential must be high in addition to 
having low sulfur content, achieved by a magnesium treatment. The magnesium 
treatment is made above the boiling temperature of magnesium, resulting in violent 
boiling where most of the magnesium is lost. The minimum requirement is that 
0.04 % Mg remains dissolved in the melt. The reaction products (oxides and sulfides) 
create dross that must be removed before further treatment. Just before casting or 
during casting, an inoculant is added to spheroidize the graphite. This is made as near 
in time to pouring as possible to avoid fading, making the inoculant ineffective. Due 
to the violent reaction and then strive to minimize magnesium loss, as well as 
controlling fume emissions, six key strategies have been developed (Brown 2000): 


Sandwich ladle 
Tundish cover 
Plunger 
Converter 
In-stream treatment 
In-mold reaction 


Sandwich ladle: The treatment material is placed at the bottom of the ladle with 
a metal cover that delays the reaction until it has molten and allows contact between 
the melt and the treatment additions (Fig. 8). This is most commonly used for 
treatment alloys containing less than 10 % Mg as the reaction is less violent with 
diluted additions (Brown 2000). 

Tundish cover: The principle is similar to the sandwich ladle process, but a 
special cover is developed that eliminates the glare from the reaction and more or 
less eliminates fume generation (Fig. 9). Due to this the tundish cover treatment is 
today very common (Brown 2000). 

Plunger: The alloy addition is placed in a plunger that is submerged into the 
melt (Fig. 10). The ladle is covered to reduce fume emissions. The plunger is made 
from a refractory material, but as Mg attacks fireclay, the wear of the plunger is 
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Fig. 9 Tundish cover 
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significant. Plungers may also be replaced with a cored wire treatment station where 
a wire is fed into the melt through the tundish cover (Brown 2000). The wire 
feeding assures submersion, and as the plunger is no longer needed, it is a more 
robust solution that has a high degree of automation. 

Converter treatment: George Fisher has developed a special treatment cham- 
ber where the magnesium addition is placed in a special compartment inside the 
converter (Fig. 11). The converter is rotated allowing for contact between the melt 
and the magnesium addition (Brown 2000). 

In-stream treatment: During pouring the inoculant can be added in the stream 
when pouring over lip (Fig. 12). This is possible for automated pouring as melt flow rate 
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Fig. 1 1 Converter treatment Treatment alloy Tiltable converter 



Fig. 1 2 In-stream treatment 
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and inoculant feed must be matched to guarantee a suitable treatment. Allowing for this 
also permits for a homogeneous mix as the melt will mix during filling (Brown 2000). 

In-mold treatment: In-mold treatment requires modification of the gating 
system with reaction chambers to allow some degree of mixing. The idea behind 
this is to enable minimum additions and to avoid fading. The drawback is that the 
homogeneity may be compromised as the first melt entering the reaction chamber 
will take some of inoculants with it as it leaves the chamber resulting in reduced 
additions. This results in a less robust process and is as such more suited for large 
series manufacturing where the conditions are similar and the process can be fine- 
tuned. The point of addition may be in the weir bush or in the sprue basin (Fig. 13). 

To make compacted graphite iron, the process is similar to that of ductile iron. 
One way to achieve this is to add a denodularization agent together with the 
traditional treatment for ductile iron described above. The typical agents are 
based on (Brown 2000): 
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Fig. 13 In-mold treatment 
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• Magnesium and titanium additions 

• Magnesium additions 

• Nitrogen additions 

• Cerium additions 

For ductile iron the nodularity required is to be as close to 100 % as possible. 
This is usually guaranteed at a residual magnesium level of 0.04 %. The compacted 
graphite will occur in the range of 0.005 % Mg up to 0.01 % Mg, making the 
process window very narrow. 


Melting of Steels 

The quality of a steel casting is greatly affected by effects from the melting and 
making practice. The key effects are: 

• Deoxidation practice 

• Sulfide morphology control 

• Inclusion management 

• Degassing practice 


Deoxidation practice is critical where dissolved carbon and oxygen form gaseous 

o 

carbon monoxide generating porosity (Fredriksson and Akerlind 2006). Deoxidation 
is not only important to manage porosity but also critical to produce a material with 
the right composition and properties. Alloying elements, e.g., manganese and silicon, 
also react with oxygen and create slag. Deoxidation is made by the addition of 
ferrosilicon, calcium silicide, or aluminum. Calcium silicide treatment is common, 
but the reaction products may harm fatigue performance. Ferrosilicon and calcium 
silicide are commonly used during the alloy fabrication process, but for final condi- 
tioning deoxidation by aluminum additions is today the dominant procedure. 
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Fig. 14 Minimum residual 
aluminum content in cast 
steel post-deoxidation 
treatment to generate type III 
sulfides (Freely after Brown 
( 2010 )) 
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Sulfide morphology control: Sulfides in steel castings come in three main 
morphologies: type I, globular particles; type II, chains or films; and type III, 
angular particles. The morphology that is most detrimental to steel toughness, 
which is especially important in high-strength steels, is type II. High residual 
aluminum content, post deoxidation, generates type I, while high residual alumi- 
num content creates type III. The strategy is most commonly to aim to create type II 
sulfides. This results in that, for each and every carbon content, there is a 
required aluminum addition to generate the necessary minimum residual aluminum 
content (Fig. 14). 

Inclusion management is made by controlling reoxidation. The use of calcium 
silicide treatment with final conditioning through aluminum additions reduces 
reoxidation, especially together with argon rinsing. The use of correct type of firebrick 
with respect to aluminum and manganese content in combination with ceramic foam 
filtering is the current state of the art in producing near particle-free steel. 

Gas porosity in steel castings is the result from at least one of the following 

o 

gases (Fredriksson and Akerlind 2006): 


• Hydrogen 

• Nitrogen 

• Carbon monoxide 


Hydrogen can be minimized by keeping oxygen partial pressure as high as 
possible until just shortly before the pouring when the temperature should be at a 
minimum and all alloying are done. This is the reason for deoxidation being one of 
the last actions made in the steelmaking process. The final aluminum treatment is 
made in the transfer ladle/pouring ladle. For aluminum-treated steels, 13 ppm 
hydrogen is a maximum limit to avoid porosity, while for silicon treatment only 
the maximum allowable amount of hydrogen is 9 ppm (Brown 2000). Carbon 
monoxide should be managed by an appropriate deoxidation procedure, but it is 
important that there is an excess of deoxidants to manage the reoxidation during 
filling which otherwise may result in pinholes. Nitrogen is more difficult but is best 
in managing raw materials as some nitrogen comes in through ferrochromium 
addition as well as from the breakdown of some sand mold binders. 
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Fig. 15 AOD converter 
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The equipment to melt steel is of three types: 

• Electric arc furnace 

• Induction melting 

• AOD converter 

Electric arc furnaces (Fig. 6) use three-phase AC current, with three electrodes 
in the furnace, made from graphite and are consumed in the melting process. In most 
steel foundries, the melting process is set up to allow for sulfur removal with a lining 
consisting of dolomite or magnesite with a tar binder. The melting process is 
commonly made using an oxidizing slag first to allow phosphorous removal. If sulfur 
content is high in the process, the slag is exchanged to a reducing slag cover for sulfur 
removal. Independent of a single stage oxidizing slag melting or a two-step slag 
practice, last step is deoxidation. Actual practice varies between foundries and needs 
adaptation to the scrap quality and steel grade produced (Brown 2000). 

Coreless induction furnace (Fig. 1) used for steel is of similar design as that 
used for aluminum, but induction frequency is adapted for better performance. 
Smaller furnaces operate at 1,000 Hz, while furnaces with charge capacities above 
3,000 kg typically operate at 500 Hz. A neutral lining is used, mainly consisting of 
alumina and magnesia. The use of a coreless induction furnace does not allow for 
sulfur and phosphorous removal during melting, and melt condition is relying 
solely on high melting rate combined with alloying and treatments (Brown 2000). 

AOD converter stands for argon-oxygen-decarburization converter (Fig. 15). 
Oxygen diluted with argon is injected into the AOD converter. The use of an AOD 
converter allows for sulfur and phosphorous removal as well as allows the foundry 
to generate steel of high quality with accurate chemistry. Its use adds cost and is 
thus not common (Brown 2000). 
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Metal Casting Processes 

As previously mentioned casting as a manufacturing process has been used 
for a long time, and a direct consequence of this is that there exists a huge 
variety of processes. Today there are uncountable processes, and each foundry 
has added its own adaptation of the process used. The aim of the current 
section is to provide an overview of the most typical and important processes 
as well as to provide some kind of systematics for process classifications. 
Earlier in Table 1 the starting point for this was given where the molds were 
divided into: 


• Permanent molds 

• Disposable molds 


This will also be the starting point for the division of this section. 


Permanent Mold Processing 

Common for all permanent mold processes are that the mold is used many times and 
that no physical model is used in the mold-making process. Permanent mold use can 
be further categorized according to the mode of mold filling: 

• Gravity -driven filling 

• Low-pressure-driven filling 

• High-pressure filling 

Gravity die casting (Fig. 16) is the simplest form of die casting where the 
die is made from hot- work steel such as HI 3. Filling can be made directly 
into the cavity, but for good quality castings, it is necessary to include 
a sprue, runner, and gating system. Slow filling speeds and low pressures 
enable the use of metal and sand cores. Risers are also commonly used to 
allow feeding. The process cycle for gravity die casting consists of the following 
stages: 

• Die preparation where the coated die surface is cleaned by air blowing. Auto- 
mation is possible. Disposable cores are also sometimes added into the die cavity 

(Fig. 17). 

• Die clamping involves closing the die and locking it to avoid leakage and flash 
formation. 

• Filling is achieved by ladling melt from a holding or dosing furnace or by 
tilting the furnace or pumping melt of a controlled volume/mass into the 
down sprue. 

• Solidification and cooling allow the part to solidify and attain the shape as well 
as to cool with thermal shrinkage and dimensional tolerances as controls. 
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Fig. 16 Gravity die casting (Courtesy CustomPartsNet.com) (CustomPartsNet 2013) 
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Fig. 1 7 Gravity die casting with a disposable core (Courtesy of Azom.com) (Azom.com A to Z of 
Materials 2013) 


• Die opening, opening of the two halves by release of the clamping mechanism. 

• Part removal can be either manual or automated. 

• De-gating and flash removal can be made manually or automated. Automation 
often involves the use of a blanking press resulting in a fine cut removing all flash. 
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Fig. 18 Tilt casting (a) after pouring and (b) after tilting and filling 

Gravity die casting is a relatively slow process, but capable of producing 
complex- shaped components. Die temperatures are high and to protect the die 
material, a die dressing is used. This dressing can be a simple soot coating to 
more advanced ceramic coatings, often 0. 1-0.3 mm thick. The wide range of 
coatings available can be divided into two main classes: 

• Insulating coatings: contain blends of insulating minerals such as talc, mica, 
kieselguhr, titanium dioxide, alumina, etc. 

• Lubricating coatings: based on colloidal graphite or boron nitride to aid release 
of the casting. 

Tilt casting (Fig. 18a, b) is a variation of the gravity die casting process. In tilt 
casting the process stages are identical to the gravity die casting process with one 
distinct exception, the filling stage. In tilt casting, melt is poured into a basin, 
without entering the die cavity. The complete closed die set is rotated at a controlled 
rate together with the basing, generating a slow and smooth filling of the die cavity. 

Low-pressure die casting (Fig. 19) is similar to gravity die casting, but here the 
gas pressure forces the melt into the die cavity from below the die. Filling is 
quiescent and sound castings are produced. The steps in low-pressure die casting 
are (Street 1977): 

• Die preparation with cleaning coated surfaces by air blowing, often 
semiautomated. Here disposable salt cores can be added. 

• Die clamping involves closing and locking the die. The die is often a four- to 
six-part die allowing for undercuts. 

• Filling is achieved by injecting air or nitrogen gas into a gas-tight crucible 
furnace, located below the die. The gas pressure acts on the melt surface forcing 
melt up the submerged stalk into the die cavity. 

• Solidification and cooling require more control than in other casting processes. 
The cooling sequence can be controlled by time or by thermocouples in the die. 
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Fig. 19 Low-pressure die casting (Courtesy Azom.com) 

The cooling media may change from air cooling to air-water mist mixture to 
ensure that solidification progresses from the outer portion of the casting towards 
the melt entry point. 

• Die opening starts with release of the furnace gas pressure being reduced just 
before die opening to allow melt in the stalk flow back into the crucible. 

• Part removal that can be manual or automated. 

• De-gating and flash removal can be made manually or automated. 

The pressure used is however moderate, and it is common that in addition to the 
upper and lower die halves, side walls are used as sliders. Wheels are a common 
product produced in low-pressure die casting. The upper and lower die halves 
are here related to the model design, while the side walls can be shared between 
many upper and lower pairs as these are only related to the wheel dimensions. Very 
cost-effective die solutions are thus possible despite the use of sliders. 

High-pressure die casting comes in two different forms, e.g., cold-chamber die 
casting (Fig. 20) and hot-chamber die casting (Fig. 21). The basic process sequence 
for these two processes is similar, but alloy capability is different. The overall 
process consists of the following process steps (Street 1977): 

• Die preparation starts with die lubrication/release agent being applied onto the 
die surface for good surface finish, die filling, and part release. The application of 
the die spray is sometimes followed by air blowing to remove water from the die 
surface. 

• Die clamping involves closing the die and locking to avoid leakage and flash 
formation. 
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Fig. 20 High -pressure die casting (cold-chamber setup) (Courtesy CustomPartsNet.com) 
(CustomPartsNet 2013) 
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Fig. 21 High -pressure die casting (hot-chamber setup) (Courtesy CustomPartsNet.com) 
(CustomPartsNet 2013) 
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Fig. 22 The steps in squeeze casting, (a) pouring, (b) pressing, and (c) ejection (Azom.com A to 
Z of Materials 2013) 

• Filling the melt into the die is achieved by pressure, but shot chamber geometry 
is different in cold- and hot-chamber die casting. 

• In cold-chamber die casting, melt is ladled from a holding/dosing furnace and 
poured into the shot chamber through a pouring hole. If the first phase filling 
sequence is ill managed, gas entrainment in the shot chamber will occur. The 
pressure transfer from the plunger into the die cavity is effective and gives good 
quality components. In vacuum-assisted filling, the die cavity is evacuated at the 
point where the plunge seals the pouring hole. 

• In hot-chamber die casting, the plunger is submerged into the melt in the holding 
furnace. The sleeve surrounding the plunger has a small inlet ensuring that the 
sleeve leading to the gooseneck is filled. The plunger moves downwards the melt 
is forced through the gooseneck and the nozzle into the die cavity. 

• Solidification and cooling are chosen such that the best possible tolerance is 
attained without the part fracturing from thermal stress. 

• Die opening, opening of the two halves by release of the clamping mechanism. 
The part follows the moving side. 

• Part removal, the ejector pins eject the part and the part is picked up using a 
manual or automated procedure. 

• De-gating and flash removal can be made manually or automated. Automation 
often involves the use of a blanking press resulting in a fin cur removing all flash. 

High-pressure die casting is today a process dominating the manufacturing of 
light and zinc alloys due to the high-productive and large series capability. Dimen- 
sional tolerances are also good. The main drawback is that there is a need for a 
runner and gating system that often exceeds the critical 30 % of the total mass. This 
exceeds the critical rule of thumb for foundry in-house material recycling. 

Squeeze casting (Fig. 22) is a near net shape process as there is no need for a 
runner and gating system. In this process the following steps are made: 

• Die preparation starts by application of a die release agent followed by air 
blowing to remove residual water from the release agent. A horizontal die 
arrangement makes this critical to achieve good surface finish. 
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Filling the die cavity is made by ladling directly from a dosing furnace or by 
pumping. The part is formed immediately as the die closes as a net shape process. 
A limitation of this process is the accuracy in press direction. This is limited by 
dosing accuracy. A direct consequence of this is that at least one surface is required 
to undergo a limited machining to meet tolerance requirements. 

Solidification and cooling take place under high-pressure removing shrinkage 
and gas porosity. A minimum pressure of 50 MPa will generate pore-free 
castings. Similar pressure is also found in high-pressure die casting, but as the 
gate freezes, the die cavity is cut off from the plunger providing the pressure 
generating porosity. Sometimes a so-called squeeze pin is added to the 
high-pressure die casting machine acting more directly onto the part. In this 
way semi-squeeze casting performance can be achieved. 

Die opening works as in low-pressure die casting, but here the part is preferably 
ejected from the bottom. 

Part removal that can be manual or automated. 


There is no immediate need for degassing in squeeze casting as die pressure is 
enough to remove porosity. It should also be noted that it is possible to cast wrought 
material in this process making it highly competitive for high-performance com- 
ponents for LED and for structural components made in 6061, 6063, and 6082 types 
of wrought alloys shaped by casting (Su et al. 2010). 

Table 4 presents a comparative benchmarking of different permanent die 
processes. 

The die set in high-pressure die casting is complex. A cold-chamber high- 
pressure die casting die set (Fig. 23a) consists of two halves. The part containing 
the ejector mechanism (left half) is referred to as the moving half as it is the half that 
moves to open and close the die. The part sticks onto the moving half following this 
as the die is opened. The opening movement also provides the ejection of the part as 
the motion creates a force on the ejector plate (Fig. 23b), extending the ejector pins 
into the die cavity. This normally does not present any problems as the plunger 
injecting the metal aid in the ejection of the biscuit on the fixed side. All die parts 
that come into contact with the melt, cavity core, cavity insert, and ejector pins are 
normally made from hot- work steel such as HI 3. HI 3 is also used in the runner and 
gate area as well as for injection sleeve material. Support plates, ejector boxes, and 
plates are made from an easily machined material such as mild steel. Sometimes 
angle pins are used to lock slider core (not shown in the figure). Slider cores are die 
element that are to create holes and undercuts by a separate action than opening and 
closing alone as additional hydraulics can be attached at the side of the die. These 
pins require higher strengths and are normally made from Cr-V steel. 

The anatomy of the hot-chamber die casting die (Fig. 24a) is similar to that of the 
cold-chamber die casting die (Fig. 23a). In hot-chamber die casting as in 
low-pressure die casting, the injection pressure is released before die opening. To 
make the part stick to the moving half, a so-called spreader is used creating a thin- 
walled cone instead of a biscuit (Fig. 24b). Ejection then requires additional ejector 
pins in direct proximity to the spreader to avoid distortion. 


Table 4 Benchmarking between permanent die processes, after information at references (CustomPartsNet 2013) 
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Fig. 23 (a) Cold-chamber die casting die. (b) Cold-chamber die casting die exploded view 
(Courtesy CustomPartsNet.com) (CustomPartsNet 2013) 


The dominating choices, primarily due to cost efficiency, are mild steel for the 
mold base and H13 for the part getting in contact with the melt. Depending on part 
size and melt temperature, thermal load may differ from part to part and from 
foundry to foundry. If the standard solution HI 3 does not work, it is important to 
understand how to choose differently from common alternatives (Table 5). To make 
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Fig. 24 (a) Hot-chamber die casting die. (b) Hot-chamber die casting die exploded view 
(Courtesy CustomPartsNet.com) (CustomPartsNet 2013) 


this choice, it is important to understand the influence of the alloying elements in 
tool steels. 

Carbon (C) is the main contributor to hardness. In most cases hardness is 
beneficial, but for larger castings or cold-chamber die casting of brass, toughness 









Table 5 Collation of the most common tool steels (Street 1977) 
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may become a limiting factor and lower hardness and greater toughness are better. 
A possibility is then to change from HI 3 to nonstandardized tool steel such as QRO 
80 supreme with a lower hardness and increased toughness. 

Silicon (Si) improves ductility and increases fatigue life but is added in similar 
amounts and is close to optimum in these steels. The fact that QRO 80 supreme has 
a lower Si content is well compensated for by the carbon content. 

Manganese (Mn) improves ductility and hardenability of the material and helps 
generating a more even hardness in thicker sections. 

Chromium (Cr) is an element that prevents high temperature and thermal 
cycling softening extending tool life when wear resistance is a limiting factor. 

Molybdenum (Mo) reduces the strength drop that most materials suffer at 
higher temperature. In QRO 80 supreme this is added to a greater amount than 
for the HI 3 material, is to compensate for the lower initial hardness of QRO 
80 supreme, caused by the low carbon content. 

Tungsten (W) has a similar influence as molybdenum. This is utilized in H12 
compared to HI 3. H 12 is also slightly lower in carbon reducing the initial hardness 
but compensating this by the tungsten addition. 

Vanadium (V) forms vanadium carbonitrides that improve wear resistance. 

Cobalt (Co) is used for high-temperature strength and is a special adaptation to 
cater for extreme strength at high temperature, in heavy duty application. 


Disposable Mold Casting Mold Processing 

The classification of disposable molds is complex but can be classified as shown in 
Fig. 25. This table also contains processes where either a permanent model or a 
disposable model is used as previously indicated in Table 1. 

Sand molding processes have many features in common and only the degree of 
automation and binder system differs between processes with the exception of shell 
molding and lost foam processes, treated separately. 

Sand molding has been around as a manufacturing process for long time due to 
its versatility and consists of the following steps: 

• Pattern making 

• Mold making 

• Mold assembly 

• Pouring 

• Cooling 

• Shakeout 

• De-gating, fettling, and post-casting processing 

Pattern making takes shrinkage and machining allowances into consideration. 
Sometimes runner and gating systems are included in the pattern, and sometimes 
these are put in place as part of the mold-making and assembly processes. The 
choice of pattern and pattern material is process and series size dependent. The 
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Fig. 25 Disposable mold processes 


same goes for choice of pattern material. There are four typical types of patterns 
used in sand molding (Fig. 26a-d). 

The solid pattern (Fig. 26a) is not so common but is used for simple-shaped 
castings and short series manufacturing due to low cost. Sprue, runner, and gate are 
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Fig. 26 (a) Example of a solid pattern, (b) Split pattern, (c) Example of a match plate pattern, 
(d) Example of a plate mounted cope and drag pattern (Courtesy CustomPartsNet.com) 
(CustomPartsNet 2013) 


added separately. The two mold halves cope and drag must be made in sequence 
making mold making slow and short series manufacturing the only option. 

The split pattern (Fig. 26b) is more common for slightly more complicated 
patterns. It is still a low-cost alternative, mostly used for small series production 
or very large components. Sprue runner, gates, as well as risers and feeders must be 
added in the mold-making and assembly processing. The cope and drag can be 
made separately, but matching of the two halves during mold assembly is difficult 
with risk of parting line flaws. 

Figure 26c shows a match plate pattern allowing for accurate positioning of 
the mold cavities in the cope and the drag, common in automated mold making. 
It does require sequential mold making limiting the model making rate, but moderate 
to large series production is feasible. In a match plate, it is also possible to make 
runners and gates, as well as the sprue for a pressurized system. For non-pressurized 
system the spme must be made separately as it requires an undercut. 

Separating the cope and the drag patterns but putting them on separate plates 
(Fig. 26d) enables high production rate with good accuracy of the cavity placement 
as well as high degree of automation. As for the match plate pattern, runners and 
gates can be added as well as a gate for a pressurized system. 
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Mold making is a complex process where series size strongly affects the mode 
of operation. There are in principle two main classes of sand systems used to make 
molds (Brown 2010, 2000): 


• Green sand 

• Chemically bonded sand 


For green sand mold, the most common sand used is silica sand as it is the most 
cost effective. Other types of sand include olivine, chromite, and zircon sand 
representing increased refractoriness as well as improved thermal stability in 
order (Brown 2010, 2000). 

A common mistake, in the foundry industry, is to mix chromite and silica that 
will lead to the risk of fayalite formation that significantly reduces the refractory 
properties of the sand. This is a practice that comes from that silica is not stable 
enough for many castings and “hard facing sand” is chosen to be used next to the 
casting. Combining silica with olivine and zircon is fine; use of chromite will only 
lead to serious casting defects in the long run. Problems will not appear immedi- 
ately after chromite introduction, but there is a slow systematic buildup reducing 
the allowable process window and will harm process yield significantly. DO NOT 

DO THIS ! 

The binder in a green sand mix is water and clay. Clay is either a sodium-based 
bentonite or a calcium-based bentonite or a mixture of both. Calcium bentonites 
produce hard molds but are prone to cracking and sensitive to the moisture content. 
Sodium alleviates this difficulty. Coal dust is sometimes added to help stripping the 
mold but increases the risk of misruns and creation of lustrous carbon defects in the 
casting. In steel castings starch and dextrin are used to increase strength. This type 
of binder is normally referred to as cereal binders (Brown 2010, 2000). 

Chemically bonded sand is a science by itself as there are many variants. The 
main groups are (Brown 2010, 2000): 


Self -hardening systems (no-bake or cold setting binders) 
Trigger hardened systems - hardening 
Heat-triggered systems (hotbox) 

Gas-triggered systems (S0 2 process) 

Sodium silicate-bonded sands 


Sand molds are also coated with the purpose to control physical interaction 
with the molten metal. Metal penetration is the most important interaction as it 
increases severity of melt-mold chemical reactions. Coating is also made to 
control chemical effects where sand burn-on affects casting surface finish. 
As the metal is cast into a chemically bonded mold, the binder breaks down 
under formation of carbonaceous compounds. These form lustrous carbon that in 
excessive amounts causes poor surface finish and filling-related defects, such as 
laps. Coating also reduces mold wear during filling. Wear debris is detrimental to 
component fatigue properties. 
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Fig. 27 Sand molding (Courtesy CustomPartsNet.com) 


Cores are smaller and shaped in a metal die and subsequently cured. Cores are 
commonly made in a cold-box or a hotbox system or using a sodium silicate -based 
process. A cross section through a sand mold is shown in Fig. 27. 

Mold assembly requires guides to align the two mold halves during assembly 
to eliminate displacement of the cope and drag. Before mold-closing cores are 
placed into the mold. These have to be carefully added, and in the case of long cores 
or core -producing blind holes, so-called chaplets are used to give support. Chaplets 
are made from the same material as the casting or from a material that dissolves 
during casting and/or heat treatment. The two halves need to be clamped together, 
and sometimes weights are used to keep the mold together just as locking force 
is required in die casting. In gravity casting the metallostatic pressure inside 
the cavity asserts force on the mold that will separate the cope and drag without 
counteraction. During assembly vents are made to allow off gassing from cores 
in particular. 

Pouring is controlled to allow gas present in the mold cavity to be vented. 
The total filling time is primarily controlled by the sprue, runner, and gate design. 
The degree of control is different between a pressurized and un-pressurized system, 
discussed more in detail under the section on gate design. As time control is 
not absolute, operator skill is important in manual pouring. Commonly the melt 
is poured into a weir bush to ensure calm filling to minimize gas entrainment. 
End of filling is visually verified as melt is seen filling an open riser opposite 
the sprue. This rider also functions as the main venting port as the gas in the mold 
is displaced. 
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Fig. 28 Shell molding (Courtesy CustomPartsNet.com) (CustomPartsNet 2013) 


Cooling is to allow for the metal to solidify. This can be estimated by 
Chvorinov’s law t = C(V/A) where C is a constant and V is the casting volume 
and A is the area of the cooling surface. Depending on the material being cast, 
different recommendations of cooling time may be required (Jarfors et al. 2010). 

Shakeout takes place after required cooling time. The mold is broken into pieces 
allowing for sand to be separated and recycled. Sand recycling can achieve recov- 
ery efficiencies from 80 % up to 95 %. Sand loss is due to burn-on and due to sand 
particle disintegration during recycling. The sand mold requires a certain amount of 
permeability to function properly, and the amount of fine particles is restricted as it 
prevents a good permeability. In this process core sand from internal features is 
removed (Brown 2000). 

De-gating, fettling, and post-casting processing are the last step where runner 
and gating systems are removed. Gates and risers can for most materials be knocked 
off using a hammer. In steel casting however, cutting them off is more common. 
After de-gating it is common to shot blast the castings with the sole purpose to 
remove the last burnt on sand. The time to de-gate and fettle is proportional to the 
casting envelope (Brown 2000). Gates and risers are retuned for melting and the 
casting is further processed with machining, heat treatment, and so on depending on 
the material and component requirements. 

Shell molding (Fig. 28) was invented to reduce binder usage and to improve 
casting tolerance and consists of the following steps: 
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• Pattern making uses a cope and drag pattern made from iron or steel, even 
though aluminum and graphite sometimes are used for prototyping and short 
series production. As the pattern is reusable and made in two halves, a well- 
defined parting line and a sufficient draft are necessary. 

• Mold making utilizes a mold material based on a standard sand base with a heat- 
triggered resin binder. The pattern is first coated with a stripping or release agent 
and subsequently heated (175 °C up to 370 °C) and brought into contact with the 
sand. As heat travels through the sand, a layer of partially cured skin of sand 
forms around the pattern. Final curing is subsequently done separately in an 
oven. Once the two halves are cured, it is stripped from the pattern. 

• Mold assembly is made by clamping the two halves together and put into a flask 
filled with a backing material, casting sand without binder. In the assembly cores 
are also added if necessary. 

• Pouring needs special care as runner and gating systems are simple. 

• Cooling is as for sand casting controlled by time. 

• Shakeout is also made in a similar manner as sand casting. The main benefit is 
though that the mold contains a large amount of backing material making sand 
reclamation easier. 

• De-gating, fettling, and post-casting processing again are as for sand casting 
processes. 

Lost foam casting (Fig. 29) is well suited for manufacturing of large tool 
components and complicated shapes. Lost foam casting does not require a parting 
line as the pattern is disposable and left in the mold. As the melt enters the mold 
cavity, the pattern evaporates and the metal fills the mold. The lost foam casting 
process consists of the following steps: 

• Pattern making starts with the manufacturing of the pattern in expandable 
polystyrene or copolymer foams. Simple shapes can be made using a polymer 
injection molding process allowing large series production. An alternative is to 
make patterns by cutting the shape out from a block of polystyrene foam 
allowing single piece production or prototyping. If more complicated shapes 
are needed or very large parts are to be made, they can be created from smaller 
parts that are glued together. As the pattern is to be left in the mold, there is no 
need to consider undercuts and the design freedom is great. Gating and venting 
are of greater importance than for normal casting process as the foam is 
evaporated and must be well vented. The foam is typically 96 % air and the 
additional venting requirements come from evaporation of the 4 % solids. The 
final stage in pattern preparation is to coat the pattern in a refractory paint. A 
variant is the full mold casting where green sand is used instead and there is no 
need for the coating. 

• Mold making consists of putting the pattern in a flask which is subsequently 
filled with dry sand compacted by vibration resulting in a fluidized bed of sand. 

• Pouring needs special care as the pattern must evaporate. Initially casting of 
irons was impossible as lustrous carbon formed in excess during the filling 
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Fig. 29 Lost foam casting (a) polymer foam pattern, (b) post coating, and (c) in flask just before 
pouring 


process. Now with proper venting and the right choice of foam and pouring 
conditions, iron casting is today a commodity. 

• Cooling is as for sand casting controlled by time. 

• Shakeout is also made in a similar manner as sand casting. As the sand used is 
dry, sand recycling is relatively simple. 

• De-gating, fettling, and post-casting processing again are as for sand casting 
processes. 

Gypsum casting uses plaster of Paris as mold material being easily shaped and 
having refractory properties suitable for casting nonferrous alloys. To optimize the 
performance, typically 20-30 % of additives are used to provide the mold with 
sufficient resilience against cracking, controlling both strength and thermal expan- 
sion. The process steps for gypsum casting are the following: 
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Pattern Tree Shell-Making Investment Casting Casting 


Runner-. 
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gating 
system -I 

Wax patterns - 



Ceramic shell 


Ladle 


Hollow 

ceramic 

shell 


Flask 


Finished 

casting 


Ceramic slurry 


Fig. 30 Investment casting or lost-wax casting (Courtesy CustomPartsNet.com) 


• Pattern making is here the preparation of a mold to cast the plaster mold. 
The die used to do this is commonly a metal mold as plaster is slightly 
abrasive. For shorter series and prototyping rubber, molds may be used. 
These molds can allow for smaller undercuts and advanced features as they 
are compliant. Otherwise a draft is necessary as the casting process requires 
parting. 

• Mold making is the assembly of the two gypsum halves. Prior to assembly the 
mold halves are baked in the oven from 125 °C up to 260 °C for sufficient 
strength. 

• Mold assembly is made by clamping the two halves together also involving a 
preheating. 

• Pouring needs special care as the runner and gating systems are very simple. 

• Cooling is as slower than usual and prevents large volume casting. 

• Shakeout is made by crushing the plaster mold. The plaster used cannot be 
reclaimed. 

• De-gating, fettling, and post-casting processing standard for nonferrous castings. 

Gypsum casting is a process well suited for prototyping and short series casting 
as mold cost is low and the process is flexible. 

Investment casting (Fig. 30) was previously lost-wax casting, an old process 
used to produce jewelry and high-quality artifacts. Today the process is used for 
manufacturing of components with the most extreme requirements, such as turbine 
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blade for jet engines and steam turbines. The process has a unique capability in 
making complicated shapes with superior surface finish but is complicated and 
slow. The process steps are: 

• Pattern making starts by casting a wax pattern in a metal mold. Commonly a 
single wax part is cast or a portion of the part is cast. The individual pieces are 
subsequently assembled into a model including runner and gating as well as 
filtering. For directionally solidified materials and single crystal materials, the 
seeding is also included as a geometric feature or as a metallic seed. 

• Mold making consists of three steps. The first step consists of dipping the pattern 
into ceramic slurry that will create a fine surface layer being the foundation of 
the very fin surface reproduction and surface finish capability of the process. The 
slurry cover is not sufficient to build up a mold, and coarser grained sand is 
added to the mold by either dipping the newly coated pattern into a fluidized bed 
of sand or inserting the part in a chamber exposing it to a gentle rain of sand. This 
process is repeated dipping and subsequently sanding until a sufficiently thick 
layer has formed. The second step is a drying process that reaches up to 800 °C in 
which the mold is given partial hardening, but foremost the wax melts and is 
removed from the mold cavity. The third step is to burn the mold at 1,200 °C for 
final hardening. This also effectively removes any potential contamination from 
carbonaceous species. 

• Mold assembly is not required as all shapes are incorporated at the pattern 
assembly, and mold making is made by clamping the two halves together and 
put into a flask that is subsequently filled with a backing material, casting sand 
without binder. In the assembly cores are also added if necessary. 

• Pouring needs special care as the runner and gating systems are often simple. 

• Cooling is as for sand casting controlled by time. An exception here is directionally 
solidified materials and single crystal components. Here either the part or the 
furnace is moving allowing solidification to take place in a very controlled manner. 
For maximum efficiency, especially design heating bodies are used to ensure an 
appropriate temperature distribution during solidification and cooling. 

• Shakeout is also made by crushing the molds. 

• De-gating, fettling, and post-casting processing again follows standard procedures. 

Shaw and Unicast processing: Precision casting is not a single type of casting 
process but rather a family of processes where the Shaw and the Unicast processes 
are the two most prominent processes. Sometimes investment casting is also 
counted to this group of processes. The Unicast process is similar to sand molding 
as it uses a standard pattern. Instead of the traditional molding sand, ceramic slurry 
is poured onto of the pattern. This results in a highly refractory mold with the ability 
of fine shape replication and excellent tolerances. The Shaw process is similar, but 
here extremely fine-grained silica particles are mixed with liquid ethyl silicate and a 
gelling agent is added (Clegg 1991). 

Cosworth process (Fig. 31) uses an electromagnetic pump to pump aluminum 
melt up into a mold. The submerged nozzle ensures a clean melt, and the 
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Fig. 31 Cosworth process Rotatable 



pump 


counter- gravity filling principle aids the calm filling as in low-pressure die casting. 
To increase productivity the mold is rotated, and by appropriate design, the casting 
is fed in the gravity direction. The mold is made as in traditional sand casting but 
using zircon sand for thermal expansion and a no-bake (S0 2 ) cold setting process. 

Table 6 presents a comparative benchmarking of the disposable mold casting 
processes. 

Mold Filling and Solidification 

Mold filling and solidification in the mold and during filling are a complex 
multiphase flow with heat transport and phase change and conservation of species 
(Jarfors and Ervasti 2000). In this handbook simple rules for starter estimates are 
given to illustrate and understand fundamental physics of the processes. The 
process to fill a mold requires sufficient melt fluidity. Fluidity is a measure of 
how far a melt can flow without stopping and is a combination of process condi- 
tions, alloy chemistry, and mold dimensions. The key parameters that affect fluidity 

o 

are (Fredriksson and Akerlind 2006): 

• Melt temperature 

• Melt viscosity 

• Melt flow rate 

• Chemical composition and mode of solidification (columnar or equiaxed as well 
as the size of the solidification interval) 

• Thermal diffusivity 

• Fatent heat 

• Surface tension and work of wetting 
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For gravity -driven processes, melt is poured into a cup, or weir bush, filling up 
the tapered sprue and transported through the runner and the gate into the cavity 
consisting of the part, feeders, and risers. The purpose of the gating system is to 
generate a quiescent flow with a minimum of temperature loss. A successful casting 
can be achieved following Campbell’s Ten Commandments (Campbell 2002): 

• Clean metal 

• No meniscus damage 

• No liquid metal front stop damage 


Table 6 Benchmarking between disposable mold casting processes, after information at refer- 
ences (CustomPartsNet 2013; Clegg 1991) 


Property name 

Sand casting 

Shell mold 
casting 

Investment 

casting 

Gypsum casting 

Shapes 

Thin-walled and 
complex shape 

Thin- walled and 
complex shape 

Thin-walled and 
complex shape 

Complex 
shapes with 
capability of 
minor undercuts 

Part size (kg) 

0.02-450,000 

0.01-100 

0.0006-230 


Materials 

Alloy steel, 
carbon steel, cast 
iron, stainless 
steel, aluminum, 
copper, 
magnesium, 
nickel lead, tin, 
titanium, zinc 

Alloy steel, 
carbon steel, 
cast iron, 
stainless steel, 
aluminum, 
copper, nickel 

Metals, alloy 
steel, carbon 
steel, stainless 
steel, aluminum, 
copper, nickel 
cast iron, lead, 
magnesium, tin, 
titanium, zinc 

Aluminum, 
copper base 
alloys, 
magnesium, 

Surface finish, 
Ra (pm) 

8-15 

1.2-8 

1.2-4 

1.3-4 

Tolerance 

(mm) 

±0.7 

±0.3 

±0.1 

±0.1 

Wall 

thickness 

(mm) 

3—. . . 

1.5-50 

1.6-20 


Quantity (pcs) 

1-1,000,000 

100-1,000,000 

1-1,000,000 

Short series 

Lead time 

Days 

Weeks 

Weeks 

Days 

Advantages 

Can produce 
very large parts, 
can form 
complex shapes, 
many material 
options, low 
tooling and 
equipment cost, 
scrap can be 
recycled, short 
lead time 
possible 

Can form 
complex shapes 
and fine details, 
very good 
surface finish, 
high production 
rate, low labor 
cost, low tooling 
cost, little scrap 
generated 

Can form 
complex shapes 
and fine details, 
many material 
options, high- 
strength parts, 
very good 
surface finish and 
accuracy, little 
need for 
secondary 
machining 

Inexpensive, 
reusable, more 
accurate than 
steel molds, fast 
to produce, and 
easy to change 

Undercuts 
possible using 
rubber molds 


(< continued ) 
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Table 6 (continued) 


Disadvantages 

Poor material 
strength, high 
porosity 
possible, poor 
surface finish 
and tolerance, 
secondary 
machining often 
required, low 
production rate, 
high labor cost 

High equipment 
cost 

Time- 
consuming 
process, high 
labor cost, high 
tooling cost, 
long lead time 
possible 

Slow and 
limited to 
nonferrous 
material with 
low melting 
temperature 

Applications 

Engine blocks 
and manifolds, 
machine bases, 
gears, pulleys 

Cylinder heads, 
connecting rods 

Turbine blades, 
armament parts, 
pipe fittings, 
lock parts, hand 
tools, jewelry 

Prototyping 
short series 
manufacturing 


• No bubble damage 

• Reduce core blow damage 

• No shrinkage damage 

• Avoid convection damage 

• No unplanned segregation damage 

• Controlled residual stress 

• No machining damaged 

Clean metal is primarily managed during melting and melt treatment, but 
filtering and mold cleanliness help to avoid unnecessary gas pickup and inclusion 
generation/entrainment (Campbell 2002). 

Meniscus damage is caused by front breakup, which is governed by a balance 
between inertial forces (filling speed and viscosity) and surface forces (surface 
tension), characterized by the dimensionless Weber number 

prv 2 
We = - — 

r 


Here p is melt density. 
v is melt flow speed (m/s). 
y is melt surface tension (N/m). 

r is the duct hydraulic radius (half the cross-sectional area/wetted perimeter) (m). 
Smooth surfaces during filling are obtained for 0.2 < We < 0.8, while dramatic 
turbulent breakup occurs when We > 100. A more conservative measure for 
the breakup is that of a wavy surface flow but not resulting in folding. It has been 
shown that We < 2 is a practical maximum value in the transitional regime 
for the surface resulting in an approximate critical flow speed of the melt (Totten 
et al. 2003). 
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^critical 



Once the melt front is broken up, it will entrain gas, creating oxide double films. 
Oxides will aid gas precipitation as it nucleates both porosity and cracking seriously 
harming component performance and process yield as a result from in-process 
crack formation, especially for aluminum. For iron and steel, the films may be 
oxides, but there is also risk for nitride and lustrous carbon formation both having 
similar effects as the oxide in aluminum. Meniscus damage is entirely controlled by 
filling. 

No liquid metal front stop damage is coupled to the formation of lap defects 
may appear lap defect if not fulfilled. This is caused by large flat surfaces and by 
sudden dimensional changes. There is to some extent a design component to the 
control of this defect, but melt temperature and the possibility to cast large flat 
surfaces on inclined plane make the control of this factor primarily a filling -related 
matter (Campbell 2002). 

No bubble damage is related to entrainment of gas into the melt is all about melt 
turbulence and inappropriate taper of the sprue and flawed dimensioning of the 
runner and gating system. Entrainment comes partly at the filling front but may also 
occur at sharp comers as a consequence of aspiration. A measure of the degree of 
turbulence is the dimensionless Reynolds number that describes the balance 
between inertial forces (density and flow speed) and viscous forces (viscosity) 



The transition from quiescent laminar flow to turbulent flow occurs for a fully 
developed duct flow in the range of 2,300 < Re < 4,000. Resulting in an upper and 
lower bound estimate for a critical filling speed, 


1, 150 — < ^critical < 2,000 — 

pr pr 

In gravity-driven processes, the We-number stability will be dominant. In high- 
pressure die casting, the ambition to maintain a full integrity front is not practical, 
and the gate speed selection is more related to the Reynolds number stability 
criterion. Independently the no bubble damage requirement is essentially a filling 
controlled requirement. 

Reduced core blow damage is achieved by controlling gas emission from sand 
cores and is all about venting during filling. Venting is partly related to the design of 
the venting system, but requires a filling pattern that allows for venting and does not 
block venting path (Campbell 2002). Similar defects may occur in pressure die 
casting and even in coreless processing. In thin- walled casting inserts segments and 
ejector pins serve as vents. At these locations release agent may build up, resulting 
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Fig. 32 A core blow-like 
defect in pressure die casting 
of AZ91D revealed by 
pickling 


in gas release during filling (Jarfors et al. 2003). Figure 32 is an example of core 
blow-like defects caused by residual release agents in a thin- walled AZ91D mag- 
nesium casting. Again this process is largely affected by the filling process. 

No shrinkage damage is related to feeding. To avoid shrinkage defect, it is 
essential that all feeding is made in the direction of gravity as uphill feeding is 
impossible. Casting design and mold lay out dominated this defect and affect filling 
but are not affected by filling. It does require a systematic approach to manage 
shrinkage damage (Campbell 2002). 

Avoid convection damage is a difficulty for larger casting and is a thermal 
siphoning effect, if the gating system of a large casting is such that the thermal 
gradients act with gravity. Sometimes the best solution is to roll over the casting 
after pouring, but requires a complete rethinking about the risering and feeding of a 
casting. This requirement changes the way gating can be made but is not signifi- 
cantly affected by filing itself (Campbell 2002). 

No unplanned segregation damage is again related to large castings and 
directional casting that is slow and uncommon in most casting processes. It is not 
affected by filling but sensitive to casting layout and solidification conditions 
(Campbell 2002). 

Controlled residual stress is control of residual stress in a part. Residual 
stress is generated by restricting part contraction during solidification and 
cooling. This is determined by the part design and by runner and gating, locking 
motion during contraction. This is not strongly affected by the filling conditions 
but puts restrictions on runner and gating layout as well as feeders and risers 
(Campbell 2002). 

No post-machining damaged is related to the way a casting is solidifying 
and as such related to the melting process and the melt treatment made. 
Commonly there is a cast surface layer that is sound in terms of porosity, but 
1-2 mm below the surface porosity may be found. This is not affected by filling 
but can be managed by design as well as by the melting and melt treatment 
proactive (Campbell 2002). 
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Fluid Dynamics of Mold Filling 

To get the first estimate of the dimensions of the runner and gating system for 
casting, it is essential to have some simple means of calculation. There are two 
principles that lay as a foundation for the computations (Jarfors et al. 2010; 

o 

Fredriksson and Akerlind 2006): 

• The continuity principle 

• Conservation of energy 

The continuity principle used here is under the assumption that the melt is 
incompressible. The melt volume entering the runner and gating system does not 
change and thermal contraction is disregarded from. The volumetric flow at any 
point is thus always constant. This may be expressed as (Jarfors et al. 2010; 

o 

Fredriksson and Akerlind 2006) 


AjVj — C i 

Here A t is the cross-sectional area of a plane perpendicular to the flow direction 
in the casting. 

v t is the flow speed across that cross section (m/s). 

Conservation of energy is given by the Bernoulli’s equation. In its simplest 
form, it states that the sum of potential energy, kinetic energy, and pressure work is 

o 

constant. This can be expressed as (Jarfors et al. 2010; Fredriksson and Akerlind 
2006) 


Pi + pghi + = c 2 

Here, p f : pressure in point i (Pa) 
g: gravitational constant (m/s ) 
hj\ height at point i (m) 


Heat Transfer and Solidification in a Sand Mold 

Solidification is governed by heat transfer, and even though it is commonly 
relatively fast compared to the complete casting cycle, it determines part quality 
and properties to great extent. Heat transport in casting takes place in three different 

o 

ways (Jarfors et al. 2010; Fredriksson and Akerlind 2006): 

• Conduction 

• Radiation 

• Convection 
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Fig. 33 Simplified 
temperature profile in a dry 
chemically bonded sand mold 



re 



Conduction is heat transport through energy transport by electrons and phonons. 
Radiation is heat transported by electromagnetic radiation with a rapid 
exchange of heat with the surroundings. The higher the temperature, the more 
significant the radiation becomes. 

Convection is heat transported by mass movements, often driven both by the 
variation in density with temperature and by the variation of surface tension with 
temperature. Forced transport of mass occurs during mold filling during which the 
heat transfer can be very high. 

Heat conduction is governed by Fourier’s second law under transient conditions. 

o 

This is expressed as (Fredriksson and Akerlind 2006): 

d T _ ( d 2 T d 2 T [fl T \ 

<)t <X \ dx 2 dy 2 dz 2 ) 

Here T: is temperature (K) 
a: thermal diffusivity (k/pCp) (nr/s) 
k: thermal conductivity (J/ms) 
p: density (kg/m3) 

Cp: specific heat (J/kg) 
x, y, z: space coordinates (m) 

Not many problems can be solved by conduction alone, and more complex 
problems require a numerical approach, using computational software such as 
MagmaSoft, NovaFlow & Solid, ProCast, and AnyCasting, all purposely developed 
software for simulation of casting. One problem that allows closed form solution is 
sand casting using dry chemically bonded sand with the temperature profile in 
Fig. 33. Heat transport is limited by the sand transport properties and can serve as an 
order of magnitude estimation for the time to solidification and estimation of the 
earliest point in time for shakeout. 
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To estimate the time for solidification for this case, a number of assumptions are 
made. These assumptions are (Jarfors et al. 2010): 

• The mold surface (x = 0) temperature is identical to that of the melt, e.g., the 
melt liquidus temperature, and remains constant during solidification. 

• The melt is poured at no superheat and that mold filling takes place with zero 
heat loss. 

• The melt thermal conductivity is much greater than that of the mold. 

• The mold dimension is much greater than the heat-affected region. This allows 
for a semi-infinite approach to the solution. 

Under these conditions it is possible to solve the problem through thermal 
conduction in the mold only. The temperature distribution in the mold is then 
described by: 


Here 






u 



z (2n—\) 

(2 n — 1 )(n — 1) 


o 


A f and B f are constants and can be determined by the boundary conditions BC1 
and BC2: 




BC1 : y = 0,T(0,t) =Ti 
BC2 : y = — oo, T(— oo, t) = T 0 

This results in that the temperature distribution in the mold is 

THv,,) = r, + ( r, - w y=) 

Here, Tf: mold temperature (K) 

Tp the interface temperature, here identical to the melt liquidus temperature 

T m (K) 

T 0 : ambient temperature (K) 
a/, mold thermal diffusivity (m /s) 

All heat transported through the mold interface (y = 0) is the heat generated by 
solidification of the melt in the mold. The amount of heat (Q) that is transported 
away from the interface into the mold is expressed as 
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f --**($) 

dt \dy ) J=0 

Here A: the cross-sectional area over which the heat is conducted (m ) 
k f : mold thermal conductivity (J/mKs) 

Taking the derivative of the expression of the mold temperature T f and inserting 
at y = 0 results in 



Integrating this expression will result in an expression for the total heat 
transported across the interface: 


Q _ A 2k f (Ti-T 0 )Vt 

V™f 

For the melt to completely solidify, this must then be identical to the amount of 
heat released during solidification of the volume, which is expressed as 


Q = V p m AH m 

Q 

Here V : metal volume solidifying (m ) 

Q 

p m \ metal density (kg/m ) 

A H m : latent heat of fusion for the metal (J/kg) 

The total time for solidification is then possible to express as 



PmAHm^tajV 

2 kfpi-To)) 




This is the Chvorinov’s law. This has a fairly general application just keeping in 
mind that A may not always be the complete surface of the casting but rather the 
area over which heat is transported. This rule can be phenomenologically expanded 
to incorporate superheat under the assumption that the amount of superheat AT is 
significantly smaller than the liquidus temperature. This can thus be expressed as 



p m (AH m + C P AT)^Waj\ 2 (V\ 2 (V\ 2 

2k f (Tj — To) ) \a) \a) 


If the casting has a relatively even wall thickness, the term (V/A) can be replaced 
by half the wall thickness. It should here be noted that the relation between time and 
dimension is parabolic which is typical for the thermal diffusion problems (Jarfors 
et al. 2010; Fredriksson and Akerlind 2006). 
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Heat Transfer and Solidification in High-Pressure Die Casting 

The thermal events in a sand casting are slow and will be the time-limiting stem and 
not the mold filling. In pressure die casting, the situation is the opposite. Filling and 
solidification take only a minor part of the casting cycle time but are on the other 
hand a very fast and dynamic process. In this case it is essential to get a good 
measure of temperature loss during filling. For alloys like magnesium, it is also not 
uncommon that solidification actually starts during filling. The problem will be 
divided into two halves as a first rough estimate of the removal of superheat and the 
second half including solidification. The heat generation rate by reduction of the 
superheat melt is described as 




SdTl 

2 dt 


The heat released by the reduction of the superheat must be transported across 
the melt/die interface. The mechanism for this is primarily a combination of 
conduction and convection for light metals and bet described by a heat transfer 
coefficient. The transport across thermal/die interface is described as 


{,- = *■ - r ») 

It should here be noted that this is valid under the assumption that there is no 
gradient in cross section perpendicular to the flow direction and parallel to the heat 
conduction direction. This may be validated by Biot’s number as the absence of 
gradients requires Biot’s number to be significantly lower than one, i.e., 



This temperature profile is shown in Fig. 34 

The amount of heat released by the reduction in superheat and the heat 
transported across the die/melt interface must be equal in order to conserve energy. 
This balance is thus written as 






0 


The first result is thus 





dpCp | n f T Tp \ 
2 h\ \T m — To) 


This is the time to remove the superheat. 
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Fig. 34 Temperature profile 
in a metal die 


Temperature 



The second part of the derivation is also made under the assumption that there 
are no thermal gradients in the cross section and as such has the same limitation of 
validity as the first section. The heat transfer coefficient during filling is higher than 
during solidification as solid contraction during filling will result in a poorer contact 
between the die and the casting. For a worst case estimate, the heat transfer 
coefficient may be assumed to be of the same order of magnitude. The heat released 
due to isothermal solidification is expressed as 



p(-AH) 


8_df_ 

2 dt 


The heat generated by solidification must also pass across the die melt interface. 
This is expressed as 


f = Mr - t 0 ) 

The amount of heat released by the solidification and the heat transported across 
the die/melt interface must be equal in order to conserve energy. This balance is 
thus written as 


p {-AH) 8 - d 4- = h 2 {T 


2 dt 


T 


o 



p(-AH 


S 

2 


f 


h 


df = h 2 (T - T 


o 


dt 


o 


o 


p(-AH) S 
h 2 (T - To) 2 


f 



This is the second result showing the time to solidify to a certain fraction solid /. 
The total time available is thus 
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P(-Atf) 8 

T 0 J h 2 (T-T 0 ) 2 T 


Assuming that the wall temperature is constant, a semiempirical relation for the 
heat transfer coefficient can be used for metals: 


where 



^ Sv 
Re = — Pr = 

v k 

It should also here be noted that as for sand casting, it is possible to express the 
time for solidification (f = 1 ) in a similar manner as for sand casting by replacing 
half the wall thickness with {VI A). A being the cooling surface and V being the 
volume of the casting, 



p(-AH) m 
h 2 (T - T 0 ) \Aj 

There are also some rule of thumb guidelines available where DCRF (Die 
Casting Research Foundation) recommends for aluminum that (Street 1977) 



0.12 F Mclt 


T^Liquidus \ 


V T 


Melt 


Td ie 


^Casting 

^Casting 


and for zinc alloys 



0.057 ( Tms " 


T^Liquidus \ 


V T 


Melt 


T 


Die 


^Casting 

^Casting 


North American Die Casting Association recommends a general rule as 



34.25 C Melt 




T 


Melt 


7^Liquidus+ 1 1 5 \ 


T 


d 


min 


Die 


Gating Design for Gravity Drive Casting and in Particular Sand 
Casting 

There are many ways to gate a sand casting. There are two main schools for the 
design of the down sprue, runner, and gate: 

• Pressurized system 

• Un-pressurized system 
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Fig. 35 A pressurized runner 
and gating system 


Conical cup 



in cope Gates in 

drag 


Fig. 36 An un-pressurized 
runner and gating system 


Weir bush 



Sprue Runner Gates in 

basin located cope 


in drag 


Pressurized systems (Fig. 35) are designed so that the flow constriction that 
limits the flow is at the gates as the area relation is A runner > A gate . The choke should 
allow the runner and gate to fill and allow back flow of entrained gas to leave 
through the conical pouring cup. These late chokes have a tendency to cause 
acceleration of the melt at the gate resulting in increased Weber and Reynolds 
numbers of the flow causing turbulence. 

Un-pressurized systems (Fig. 36) rely on that the down sprue will fill 
quickly and in combination with the weir bush inhibit gas entrainment. In the 
expanding runner and gates A mnner < A gate , the flow calms down supporting a 
quiescent flow surface resulting in an improved flow situation. It is however 
extremely important that the down sprue has a taper matching the contraction 
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Fig. 37 Geometry for angled 
gating design 



of the melt jet exiting the weir bush. Using Bernoulli’s equation, the speed of the 
free falling jet is 



Using this and the equation of continuity, the area at two heights are related as 


A\V\ — A2V2 


This results in that 



The height h is counted from the top of the weir bush. 

To minimize aspiration pressure loss and generation of intrinsic cavitation, it is 
common to use an angle at the gate, especially if there are multiple gates along the 
runner. After each gate the runner continues with a decreased cross section 
(Fig. 37). Using the equation of continuity, the ideal entrance angle can be calcu- 
lated as 


v\A 


1 


V2A2 + V3A3 


The ideal angle is given by the resultant vector speed as function of v 2 and as 


tan (cp) 


V3 

vi 





V 2 tan ( cp ) 


This then results in 



A\V\ A2 
A3V2 A3 


Using continuity this can be further refined to 


A\ (A 2 + A 3 ) A 2 


tan {(p) 


A1A2A3 A3 
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Fig. 38 The four standard types of gates used in high-pressure die casting, (a) blind runner, 
(b) fan runner, (c) offset runner, and (d) T-runner 


Gating Design for Pressure-Driven Casting and in Particular 
High-Pressure Die Casting 

In high-pressure die casting, the runner and gating systems are very important. 
Filling is normally very quick and most commonly governed by the ability to fill the 
mold. Entrainment of gas is common and the flow is such a turbulent. To work 
around this problem, vacuum-assisted high-pressure die casting is used. The usage 
of vacuum-assisted high-pressure die casting is growing as the demand of being 
able to do full heat treatment of die cast components is growing. 

The type of gating systems for a single cavity die consists of four different types 
(Fig. 38) (Street 1977): 

• The blind runner 

• The T -runner 

• Fan runner 

• Offset runner 

In addition to the choice of runner system, the lip gate between the runner and 
the part plays an important role as it controls the entry pressure drop and partially 
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Fig. 39 Friction loss factors 
and geometry (After 
Svensson (2001)) 



a - 0,6 cc-0.8 


also the tendency towards initial spray formation. In general the gate area tends to 
be as large as possible to reduce the gate speed of the melt, making fan runners and 
T-runners the most common types of gates. 

The maximum allowable time for filling can be calculated based on the equa- 
tions above. The gate area is then calculated as 

^ _ ^Casting 

^gate ^max 

The runner area is given by its geometry where corners and area reductions 
caused friction losses in the flow. The friction factor is typically in the range 0.4 < 
alpha < 0.6. The friction loss factor can easily be calculated based on the following 
geometrical effects, (Fig. 39). The friction loss for a system with a contraction and a 
90 degree bend is alpha = 0. 8*0.6 = 0.48. The runner cross section area is then 
calculated as: 


a _ ^Casting 

^runner 

O'Z^gate %iax 

For the T runner, it is important to use the lip length for flow balancing into the 
part. The fan gate is typically designed using a parabolic change of depth with a 
linear width expansion to accommodate for the melt jet contraction. The appropri- 
ate design of the fan runner is shown in Fig. 40. The calculation of the correct 
cross-sectional geometry is given by the following relationships (Street 1977): 


D 1 = (Al/Wl) D 2 = (A2/W2) D 3 = (A3/W3) 
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Fig. 40 Fan gate design illustration (Freely after Street (1977)) 


Al = I +a J a J^A £- I + A g3 f^l I + 4, 


4 


2 


4 


1 1 3 

Wl = Wg - - (Wg - Wr)W 1 = Wg--(Wg- Wr)W 1 = Wg--(Wg- Wr) 

I I 

In general there are some guidelines on dimensions of the gates. For aluminum it 
is recommended that (Street 1977): 


• Gate speed is in the range 25-35 m/s. 

• End of fill temperature is crucial to avoid cold shuts; hence filling should ideally 
be over well before the liquidus temperature is reached. 

• Gate depth is typically 1 .25-3.00 mm with a minimum requirement of 1 .5 mm if 
intensification is used. 


For zinc alloys, it is slightly different with the following recommendations 

(Street 1977): 

• Gate speed should be in the range 35—45 m/s. 

• For high surface finish fill times of 20 ms is preferred, while 40 ms is sufficient for 
functional parts. It should here be noted that very little systematic research has been 
made on the effect of processing conditions on the surface finish, but recent finding 
suggests that die surface temperature differences are detrimental to part finish. 

• Typical gate depths of 0.15 up to 0.5 mm are typically used. 


Feeding 


Feeding is the protection against shrinkage damage during solidification. The 
function of a feeder is to provide molten metal to the solidifying component and 
compensate for the solidification shrinkage, p, of the casting and the feeder itself. 
The volume of the feeder, V Fe eder> is calculated as 


F Feeder — P 


V 


casting 


(e~P) 
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Fig. 41 Recommended 
feeder spacing (a) close to the 
end of a casting taking edge 
chill effect into account, 

(b) inter-feeder spacing 



b 






2d 

2d 



8 is the feeder efficiency that for most insulated cylindrical feeders is approximately 
0.6 (Svensson 2001). Each feeder has a limited feeding range depending on part 
dimensions. Figure 41 illustrates typical feeder spacing guidelines. 

Post-Casting Processing 

Post-casting processing involves all normal metallurgical treatment such as heat 
treatment machining and coating. Machining and coating processes are treated in 
other sections of this book, and the effects of heat treatment are discussed more in 
detail in the sections on cast materials and their properties. 

It should be noted that as porosity is a common defect and weld repair is a 
common way to solve this problem. Special electrodes for weld repair of castings 
have been developed. However the change in solidification and cooling conditions 
makes the micro structure different and as such it will have a different set of 
properties. It should here be noted that an appropriate component design alleviates 
this problem and is the key element to come to a low cost, high process yield, and 
high-quality reliable components. 


Casting Materials 
Cast Iron 

Cast irons are iron-carbon alloys with approximately 2-4 % C and 0.5-3 % Si with 
a defined eutectic reaction during solidification. Low price, ability to tailor mechan- 
ical and physical mechanical properties, castability, etc., make cast iron a very 
attractive metal in most countries of the world. The mechanical and physical 
properties may be tailored by the alloy composition, inoculation, and treatment, 
heat treatment, or cooling conditions, making them suitable for a wide range of 
applications (see Fig. 42). The unique properties like damping and good tribology 
for some cast irons enable further penetrations in components where damping of 
sound and vibration is important. 
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Fig. 42 The world’s oldest cast iron bridge (1779) built to show technological strength of the 
Coalbrookdale ironmaster in England (Courtesy A. E. W. Jarfors) 


Table 7 Different types of cast irons and corresponding graphite morphologies 


Class of cast iron 

Morphology of graphite phase 

Grey iron 

Lamellar graphite eutectic 

Nodular cast iron 

Round graphite spheres 

Compacted (vermicular) cast iron 

Short and compacted graphite flakes 

White iron 

Eutectic cementite cast iron 

Malleable cast iron 

Rosettes 


There are five different types of cast iron (see Table 7), depending on the 
morphology of the graphite which is a result of melt treatment, solidification, and 
heat treatment. Cast iron is a group of materials with many different microstruc- 
tures, classified according to Fig. 43a, and hence a wide variety of properties. The 
micro structure can be further developed by changing the chemical composition 
through alloying. 

A effective production of cast iron castings starts at the melting shop where the 
whole production should be selected carefully. According to (Totten et al. 2003) all 
starts with “the type of melting furnace, charge materials, treatment of the melt, 
inoculation of the melt, pouring system, mold and core practice, and quality 
system.” Regarding the furnaces, cupola (normally for grey iron) and induction 
furnaces (mostly for compact and ductile cast irons) are considered the key methods 
of melting cast irons. For grey cast iron melt, the charge material for preparing grey 
cast iron melts is composed of foundry scrap or used grey iron products (e.g., 50-60 
%), steel scrap (20-30 %), pig iron (15 %), and alloying elements that are utilized; 
the charge might vary among the foundries around the world. In order to promote 
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Fig. 43 The different types of cast iron are illustrated in (a) classification of graphite morphology, 
ISO-945. The different types of cast iron are illustrated in (b) Fe-C phase diagram, showing the 
composition range of cast irons (Courtesy Granta Design, UK) 

the solidification of a eutectic with lamellar graphite, the sulfur content has to be 
maintained relatively high. In induction furnace, the normal charge materials are 
steel scrap and foundry returns, with carbon and ferrosilicon added to match the 
desired composition (Totten et al. 2003). 

The micro structural, mechanical, and physical properties of cast irons are to a 
large extent governed with the graphite morphology and the phase transformations 
taking place in the solid state. After solidification, the cast iron structure mainly 
consists of graphite and/or carbides dispersed in an austenitic matrix (see Fig. 43b). 
The decomposition of austenite into different phases and structures is very much 
affected by graphite morphology, cooling conditions, and alloying elements. There- 
fore, a great number of matrix constituents can be obtained in the different cast 
irons. Some characteristics concerning the solid-state transformations in the differ- 
ent types of cast irons will be embedded in relation to the cast iron type discussed. 
Nevertheless, one should bear in mind that castings often have a large variation in 
wall thickness which will lead to a situation where the cooling conditions will vary 
during both solidification and solid-state transformations in different parts of the 
casting. As a consequence, the micro structure will not be uniform throughout the 
casting, which will therefore have a variation in mechanical properties. 


Grey Cast Iron 

In this kind of alloy, the carbon is mainly precipitated as lamellar graphite in grey 
iron (see Fig. 44), and the graphite/austenite eutectic grows in a spherical way 
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Fig. 44 Grey iron (Dioszegi 
2004) 



which in turn is depending on cooling conditions and the number of growing cells 
(Totten et al. 2003). Inoculation of graphite is facilitated by inoculant additions 
such as FeSiCa alloys where approximately 0.2-0. 5 % is added in the pouring 
stream or in the ladle, which is the common procedure. As inoculant fades, their 
lifetime is very limited and their potential to nucleate graphite decreases with time. 
This means, in order to have a good effect, the inoculation should be done as late as 
possible (Totten et al. 2003). Grey cast iron castings may sometimes end up in 
“white eutectic,” with unwanted high hardness, which could be due to section 
thickness of the casting. In thin sections, associated with high cooling rates, grey 
solidification becomes suppressed and that the white eutectic, consisting of cement- 
ite and austenite, is formed (Totten et al. 2003). Furthermore, bad inoculation, low 
silicon and carbon contents, and high contents of cementite stabilizers such as Cr 
and Mo may pose a risk for white solidification. 

The properties of grey cast iron are controlled to a great extent by the chemical 
composition and, as mentioned earlier, wall thicknesses. The chemical composition 
can be expressed in carbon equivalent, CE (Svensson 2001): 



%C + 





For cast irons the CE concept is used to understand how alloying elements will 
affect the heat treatment, mechanical properties, and casting behavior. A couple of 
formulas exist on how to determine the CE in cast irons. Besides the chemical 
composition, the solid-state transformations taking place during the solidification 
are of major concerns. If the austenite is allowed to transform according to 
equilibrium conditions, it should result in formation of ferrite and graphite. Grey 
irons, for most practical purposes, exhibit mainly pearlitic matrixes and conse- 
quently transformation process under nonequilibrium conditions. Factors that pro- 
mote formation of free ferrite are a slow cooling rate through the eutectoid 
transformation range, a fine graphitic structure, and a high content of silicon (Totten 
et al. 2003; Janowak and Gundlach 1982). Elements normally available in cast iron 
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Fig. 45 Relation between tensile strength, cooling rate, and carbon equivalent in grey irons 
(Reproduced from ASM Specialty Handbook (1996)) 


such as tin, antimony, copper, manganese, nickel, and chromium act as pearlite 
promoters. As (Totten et al. 2003) points out, besides ensuring a fully pearlitic 
structure in heavy sections, one should bear in mind that most of these pearlitic 
promoting elements also increase chill tendency during solidification, which there- 
fore could lead to chill in thin sections. 

Once a fully pearlitic structure has been obtained, the mechanical properties can 
be further improved by adding alloying elements, which have a refining effect of the 
pearlite. Such elements include vanadium, molybdenum, and chromium, which all 
have a very strong pearlite refining effect, and copper and nickel, which have a 
somewhat weaker effect. Additionally, an increased cooling rate also results in a 
refinement of the pearlite (Totten et al. 2003; Svensson 2001). 

Additionally, the mechanical properties are improved when decreasing the 
carbon equivalent and increasing the cooling of solidification (Fig. 45). In order 
to further control and tailor the properties, cast iron can be heat treated in different 
ways. The most common is stress relaxation heat treatment, but cast iron can be 
hardened, annealed, and soft annealed. Lamellar grey cast iron has very good 
damping capacity. This property is used in many components where damping of 
sound and vibration is important (Svensson 2001). 
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Flake Length (mm) 

Fig. 46 The effect of graphite flake length on tensile strength in grey irons (Freely after Karsay 
(1971)) 


Another measure of concern is the graphite lamellae that act as stress risers, 
which is the main reason for the generally low ductility of grey irons. Therefore, 
there is a rather strong influence between flake length and mechanical properties, 
which is shown in Fig. 46 (Karsay 1971). 


Ductile Cast Iron 

A cast iron type with a relatively short history, developed around 1950, came to 
differ in graphite morphology, as compared to grey cast iron. The graphite, precip- 
itated as spheres or nodules (see Fig. 47), develops without sharp edges, as in 
lamellar cast iron; due to the spherical or nodular shape of graphite, this cast iron is 
called nodular cast iron. To obtain the nodularity, magnesium is normally used to 
change the growth mechanisms. Magnesium controls the sulfur and oxygen level in 
the melt. Mixing with other deoxidants and changing the level of the addition can 
control the shape of graphite (Totten et al. 2003; Svensson 2001). 

In comparison with grey iron, the nodular cast iron has a good castability, 
however not as good as grey cast iron, and if inoculation is performed correctly, 
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Fig. 47 Graphite nodules in 
ductile cast iron (Dioszegi 
2004) 



thin sections can be cast without white solidification taking place. Due to the 
solidification mechanisms, with the diffusion of carbon through the austenite 
layer, ductile iron has a higher tendency for white solidification than grey cast 
iron that also promotes that ductile irons solidify at higher supercooling as com- 
pared to grey cast irons, where solidification mainly is controlled by carbon 
diffusion in the liquid (Totten et al. 2003; Svensson 2001). 

The spherical morphology of the graphite remarkably enhances the mechanical 
properties, with an increased ductility, which is why it is also called ductile cast 
iron, overall tensile strength, and Young’s modulus on the expense of heat conduc- 
tivity and damping capacity that are lowered. Ductile iron is often an alternative to 
rolled, forged, and cast steel in components (Totten et al. 2003; Svensson 2001). 

Increasing the pearlite content or reduction in wall thickness, a higher tensile 
strength (see Fig. 48) and hardness are obtained, but the ductility will be decreased. 
Besides wall thickness and roundness of graphite, manganese and copper content are 
controlling to a certain extent the structure of the ductile iron matrix, as normally they 
promote pearlite formation, which in turn governs the strength. The cooling rate 
during solidification and solid-state transformations and the number of graphite 
nodules are other important factors for the mechanical properties (Totten et al. 2003). 


Compacted Graphite Cast Iron 

An intermediate form of graphite growth defines compacted graphite, sometimes also 
called vermicular graphite (Fig. 49), which gives a structure and properties between 
that of lamellar and nodular cast irons (Fig. 50). The morphology of the compact 
graphite can be controlled, and depending on the shape of the graphite, a wide variety 
of properties may be obtained. Compacted graphite irons usually contain some 
fractions of nodular graphite which is normally controlled by the cooling rate, 
amount of inoculation, and level of Mg or Ce contents. Being able to offer relatively 
high strength with appreciable ductility and good thermal conductivity, compacted 
graphite irons are penetrated in a number of engineering applications, particularly 
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Fig. 48 The effect of section size on the tensile and yield strengths of a cast pearlitic ductile iron 
(Reproduced from ASM Specialty Handbook (1996)) 



when demands on thermal cycling, such as ingot molds, cylinder heads, engine 
blocks, and braking components, are required (Totten et al. 2003; Jarfors 
et al. 2003; Svensson 2001). 

As the majority of cast irons, the matrix can vary from mostly ferritic to fully 
pearlitic. But if demands on high-strength qualities are to be fulfilled, as most 
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% pearlite 

Fig. 50 Relation between pearlite content and tensile strength in compacted iron (Svensson 1980) 


applications seem to require, a mostly pearlitic matrix in compact cast irons 
is required. In order to obtain a pearlitic matrix, copper and tin are usually 
added in considerably higher levels than normally applied in grey irons. 
Pearlite refining elements can also be used in the same way as for grey iron 
(Totten et al. 2003). 


White Cast Iron 

Being hard, with massive amounts of cementite, this free of graphite cast iron type, 
white cast iron, is normally used for corrosion-resistant and abrasion-resistant 
applications such as pump wheels and machinery for crushing and grinding. 
A fractured surface of this material appears white, hence the name. 

Alloyed with chromium to promote white solidification and nickel, molybde- 
num, and copper to avoid pearlite in the structure, the white cast iron structure can 
be controlled. A typical range of composition might be 2-3.6 % C, max 2.0 Mn, 
max 2.0 Si, 2-5 % Ni, 1.4-25 % Cr, 1-3 % Mo, and max 1.2 % Cu (ASM Specialty 
Handbook 1996). 
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Malleable Cast Iron 

In similarity with grey and nodular cast iron, malleable cast iron also contains 
graphite. The graphite in malleable cast iron, compared to other cast irons, on the 
contrary is obtained by a heat treatment procedure of white cast iron, where the 
metastable cementite formed at solidification decomposes to graphite. The irregular 
nodules or balls after such a post-solidification treatment result anyhow in better 
ductility compared to grey and white iron. By controlling the cooling of the 
solidification, ferritic malleable, pearlitic malleable, and martensitic malleable 
cast irons can be obtained, hence offering a wide range of mechanical and physical 
properties (Svensson 2001). 


Defect Formations in Cast Irons 

Porosity is the most common defect in cast irons. The main causes of this type of 
defect are: 

• Shrinkage-related defects 

• Gas solubility- and pressure-related defects 

• Porosity formation by external pressure 

Shrinkage-Related Defect 

The volume changes in cast irons start with changes in the volume of the liquid 
during solidification. Liquid shrinkage is normally compensated by the feeders or 
by the ingate system, during filling and before the ingates solidify. The primary 
austenite shrinks during solidification, and often rather large porosities, due to this 
early shrinkage, are found in the casting (Fig. 51). 

Austenite and graphite are precipitated at the same time during the eutectic 
reaction. The graphite structure is rather open with layers of carbon hexagon rings, 
and the density of graphite is around one third of the melt and austenite. The 
graphite precipitation can compensate for the volume loss caused after the austenite 
has formed. The nature of grey iron and some types of compact graphite irons 
secures furthermore internal feeding due to the fact that the melt is in contact with 
the melt during solidification of the eutectic, which ensures less porosity levels. 

In ductile irons, instead, the graphite precipitation starts directly in the melt, 
which compensates some of the shrinkage occurring at the primary austenite 
precipitation or during austenite precipitation resulting from divorced eutectic 
growth. When an austenite shell surrounds the nodules, the compensation is more 
difficult. The outward movements get stronger on the mold and, if the mold is too 
weak, the total volume increases and loss of material occurs. As a rigid network 
has been formed, the volume of the melt due to segregation can be of larger density, 
and micro shrinkage can occur between the austenite-covered nodules (Fig. 52; 
Svensson 2001). 
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Fig. 51 Interdendritic 
shrinkage porosity (Dioszegi 
2004) 



Fig. 52 Micro shrinkage in 
ductile iron 



Gas Solubility- and Gas Pressure-Related Defects 

Gas solubility- and gas pressure -related defects: As most metals, cast irons 
dissolve gases, which result in porosity formation during solidification. The liquid 
metal has an internal strength, which has to be exceeded to form porosity. The 
nucleation in a pure melt is thus very difficult, but in cast irons there are normally 
slags and oxide and carbon films, which assist and can be sites for nucleation of gas 
pores (Campbell 2002). 

As the majority of melts, the cast iron melt is exposed to air, and by this exposure, 
the melt will be able to dissolve nitrogen. The fact that air contains water that in turn 
reacts with the melt will lead to an enhanced level of hydrogen in the cast iron melt. 
The hydrogen solubility at one atmosphere hydrogen gas pressure is shown in Fig. 53 
for some iron-based materials. The reason for porosity formation is that the solubility 
of the gas is much lower in the solid state than in the liquid, as declared. Casting in 
green sand molds, with more and less water- saturated in it, increases the dissolution of 
hydrogen inside the melt further. The oxygen content in the liquid will influence the 
possible hydrogen pressure at the surface and the possible concentration of hydrogen 
(Svensson 1980; Svensson and Fredriksson 1980). 
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Fig. 53 Hydrogen solubility 
in some grey irons and steels 
(Azom.com A to Z of 
Materials 2013) 
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Porosity Formation by External Gas Pressure 

Shrinking of metals at solidification can be compensated by gravity feeding. The 
feeder is then placed at a higher level than the area to be fed and is in contact with 
the atmosphere, in order to have a transport of liquid from the feeder to the 
shrinking area. It is worth to have in mind that the direct contact may be broken 
between the atmosphere and interior if a thin metal shell is formed at the surface of 
the feeder. If at the same time the melt has contact with the atmosphere at a lower 
point than the shrinking area, the external pressure can transport the metal upwards. 
In the example shown in Fig. 54, the sand core in the ductile iron casting has been 
heated, and the external gas pressure has transported the melt to the shrinking areas, 
including the feeder. The result is that the feeder is completely free from porosities, 
while the casting contains an excessive amount of them (Svensson 2001). 


Aluminum Alloys 

A1 that contains a maximum of 1 wt% of both Fe and Si is generally called pure Al. 
Pure Al is a very soft and ductile material; why it is used is quite limited. Alloying 
pure Al with other elements, the properties can be tailored remarkably and the 
application fields enhanced. Al alloys are classified into two groups: wrought and 
cast alloys. Furthermore, each of these groups is then divided into classes according 
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Fig. 54 Porosity formed in 
ductile iron by external 
pressure (Svensson 1980) 



to the composition. A1 cast alloys, foundry alloys that have Si as the major alloying 
element, are the most important commercial alloys due to their superior casting 
characteristics such as low melting points and excellent fluidity; rapid solidifica- 
tion; hydrogen solubility, which nowadays can be controlled; corrosion resistance; 
good weldability and machinability; and easy recyclability. 

In order to fulfil the varied demands on A1 alloy products, a great number of 
alloys have been developed over the years. Designation systems and alloy nomen- 
clature for aluminum casting alloy are not internationally standardized; there exist a 
number of national systems; Fig. 55 illustrates the European nomenclature system. 
Besides the major alloying element Si in foundry alloys, EN AC 4XXXX series, 
considerable amounts of Cu, Mg, Fe, and Mn are also found in A1 castings. 

During the solidification of alloys within EN AC 4XXXX series, where the Si 
content is below 12.7 % (see Fig. 56), A1 dendrites (almost pure Al) solidify first. 
The spaces between these dendrites are then filled with Al-Si eutectic, intermetal- 
lics, and defects based upon alloying and processing conditions. The distance 
between these dendrites is identified as the secondary dendrite arm spacing 
(SDAS), which is related to local solidification time (see Fig. 57). At levels 
above 12.7 % of Si, the solidification sequences start with precipitation of primary 
Si particles. Increasing the solidification rate, the dendrite cells become smaller, 
subsequently leading to a refinement of the eutectic structure (Seifeddine 2006). 


The Relationship Between Process, Microstructure, and Mechanical 
Properties 

Generally speaking, the microstructure and mechanical properties can be tailored by 
controlling the following three key sets of variables (Seifeddine 2006; Sjolander 2011): 
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Fig. 55 Classification of A1 casting alloys according to (a) EN 1780 and (b) EN 1706. 
Reproduced by permission of DIN Deutsches Institut fur Normung e.V. The definitive version 
for the implementation of this standard is the edition bearing the most recent date of issue, 
obtainable from Beuth Verlag GmbH, BurggrafenstraBe 6, 10787 Berlin 



Fig. 56 Illustration of the Al-Si phase diagram (Courtesy Granta Design, UK) 


• Melt treatment and processing variables, which include molten metal treatment, 
mold filling behavior, molding materials and their parameters, local solidifica- 
tion time, etc. 

• System/metallurgical variables, which include chemical compositions such as H, 
level of modification, grain refinement, etc. 

• Post-solidification variables, which include solution heat treatment, quenching, 
ageing, hot isostatic pressing, etc. 
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Fig. 57 Illustration of 
the SDAS 



A1 castings can be manufactured by almost all casting processes such as highl- 
and low-pressure die-, gravity die-, and sand casting processes including lost foam. 
Different manufacturing processes offer different casting qualities and soundness 
due to the mold filling behavior and solidification conditions. By solidification 
condition it is meant the local solidification time, cooling rate, heat transfer 
coefficient, and the heat diffusivity in the mold material. In this sense, permanent 
mold castings will exhibit a short SDAS in comparison to sand casting processes. 
While high-pressure die casting processes are associated with turbulent flow, air 
entrapment and oxide films which are adversely influencing the mechanical prop- 
erties, the processes of low pressure, permanent mold gravity die, and sand casting 
enable a smooth and controlled mold. 

Generally, the mechanical properties of Al-Si are governed to a large extent by 
the amount and fineness of the eutectic in the micro structure. Alloys in which the Si 
particles (eutectic or primary) are small, round, and evenly distributed are usually 
ductile. Alloys in which the Si particles are faceted and acicular are usually much 
less ductile due to brittle nature of the large Si plates but exhibit slightly higher 
strength. This comparison yields only if the materials are processed under similar 
conditions. The distance between the secondary dendrite arms, SDAS, which 
defines the coarseness of micro structure in Al-Si based alloys, depends on the 
solidification conditions including type of process applied and the geometry, 
specially the thickness, of the castings. A short distance between these dendrite 
arms are generally accompanied with finer eutectic structure and intermetallic 
compounds which altogether realize higher quality and sounder castings (see 
Fig. 58). The coarseness of micro structure and type of phases and defects are 
fundamental to the material behavior (Seifeddine 2006). 

Besides the cooling rate dependency and processing conditions, the microstruc- 
ture and mechanical properties are widely dependent on the actual composition 
including level of H, metal treatment such as degassing, and formation of defects at 
mold filling and solidification. 

Increasing the Si content, the hardness, ultimate tensile strength, and yield strength 
will be enhanced on the expense of lowering the elongation. The characteristics of 
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Fig. 58 The influence of SDAS on the mechanical properties of an Al-7%Si-0.4%Mg alloy 
(Seifeddine 2006) 

Al-Si can be altered furthermore by the addition of alloying elements such as Mg and 
Cu. Besides the effect of natural hardening, these elements allow a thermal treatment 
of castings. It can be looked at the Mg content in a variety ways, Mg in A1 alloys 
makes up in some alloys as the principal alloying element, but in other alloys it can be 
regarded as an impurity. The solubility of Mg in A1 reaches a maximum of 17.4 wt% 
at 450°. The Mg content in A1 is limited since its strong tendency to react with other 
elements forming inclusions. In Al-Si foundry alloys, and as heat treatment is applied, 
Mg gives Al-Si-based foundry alloys a very strong response to mechanical perfor- 
mance, if added in proper amounts, i.e., 0. 3-0.7 wt%. 

In A1 alloys, Cu is found partially soluble in an A1 solid solution, with a 
maximum equilibrium solubility of 5.65 wt%, or forming intermetallics such as 
A1 2 Cu which will be solidified in two forms after the main Al-Si eutectic reaction: one 
massive or blocky and the other as fine eutectic form. The main purpose by adding Cu 
(up to 3-4 %) to Al-Si alloys is to enhance their strength. The higher the content of Cu, 
the higher level of hardness will then be achieved. Increasing the Cu content beyond 
6 % the castability appears to improve, but the specific gravity of the alloy and 
porosity formation increase considerably. The strength and ductility of these alloys 
depend on whether Cu is present in solid solution as evenly distributed particles or as 
intermetallics in a variety of complexities. As for Cu being dissolved in the A1 matrix 
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in solid solution and/or precipitates in the matrix, the alloys will have the largest 
increase in strength and retain significant ductility. On the other hand, if Cu is present 
as intermetallics on a microstructural levels, any appreciable increase in strength will 
not additionally be realized, but a loss of ductility (Seifeddine et al. 2013). 

Fe in Al alloys has to be controlled carefully as it can cause a number of 
limitations. No one disagrees and literature is in agreement that this element is 
considered as the most deleterious element to the corrosion resistance and mechan- 
ical performance due to precipitation of brittle Fe-rich intermetallics with a variety 
of shapes and complexities (see Figs. 59 and 60). The morphology and size of the 
Fe-bearing phases in Al castings depend on the alloy composition especially the Fe 
level, melt treatment, casting conditions, and cooling rate; and in Al-Si foundry 
alloys, they may appear as small or large platelets called P-phase with a stoichi- 
ometry of Al 5 FeSi. 

Due to the sharp edges of theses platelets, a severe stress concentration is likely 
introduced to the alloy’s matrix. Due to this fact, these phases are recognized as the 
most detrimental to the casting’s mechanical properties, especially the ductility. That 
is why efforts should be dedicated to developing means of controlling their precipi- 
tation, growth, and morphology. In addition, the morphology of these platelets might 
block the interdendritic feeding channels during the solidification, of course if they are 
precipitated as primary or during the eutectic reaction depending on the Fe level in the 
melt, leading to an increase level of porosity. In order to enhance the overall properties 
such as tensile, fatigue, and corrosion properties, the morphology of the P-phase in a 
platelet shape can be altered to a more compact, less harmful intermetallic compounds 
such as Al 15 (Fe, Mn) 3 Si 2 when adding Mn (see Fig. 4). Other modifiers that are 
frequently used for that purpose are also Cr, Co, Sr, Be, and Ca. When alloying with 
Mn and Cr, caution has to be taken in order to avoid the formation of hard complex 
solid component sludge, intermetallic inclusion. These intermetallic compounds are 
hard and can adversely affect the overall properties of the casting and can be avoided 
if the melt is produced properly (Seifeddine 2006). Sometimes, Fe is added intention- 
ally, during the melting operation, in order improve the resistance to hot tearing and to 
decrease the tendency for die sticking or soldering (Seifeddine 2006). 

Mg, Cu, and Fe, if they are present in Al-Si alloys, promote the formation of 
different kinds of coexisting intermetallics. In the occurrence of Mg, the jc - phase 
Al 8 FeMg 3 Si 6 can form as Chinese script and/or as globules. Mg may also form 
complex intermetallics with Cu, precipitating as Cu 2 Mg 8 Si 6 Al 5 . In Al-Si alloys, 
when Cu is present, the Al-Al 5 FeSi-Si eutectic forms as thin needles. Other phases 
that Fe forms when Cu is present is the needlelike Al 7 FeCu 2 phase (Seifeddine 
2006; Sjolander 2011; Seifeddine et al. 2013; Seifeddine and Svensson 2010). 

All Al alloys contain in fact different amounts of impurities like Ni, Cr, and 
Zn. By impurities it is meant any species of foreign elements which have a 
detrimental influence on the castability, fluidity, mechanical and physical proper- 
ties, corrosion resistance, etc. Elements that dissolve in the Al may have a minimal 
influence on alloy properties. Under some conditions, certain elements, considered 
impurities, might have appreciable effects on the alloy properties, due to formations 
of intermetallic compounds. 
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Fig. 59 (a) Micrograph showing the morphology of Fe-rich needle, (b) Micrograph illustrating 
the Fe containing Chinese script due to Mn additions 



Strain (%) 

Fig. 60 Illustration of the influence of Fe on the stress-strain behavior of Al-9%Si-3%Cu alloy 
with a variety of SDAS (Seifeddine and Svensson 2010) 

Further control and improvement/refinement of the micro structure and mechan- 
ical properties of A1 castings can be achieved by grain refiners and modifier agents. 
It is worth to bear in mind is that the SDAS is not related to the grain size. 

Grain refinement is a chemical process in which nucleating agents are added to 
the melt to promote the formation of fine, uniform, and equiaxed grain structure 
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Fig. 61 (a) Schematic illustration of the role of grain refinement where (a) is an unrefined and 
(b) grain refined (Courtesy Ame Dahle) 


(Fig. 61). This process of adding grain refiner to the melt is also known as 
inoculation. The grain size of the alloy has inverse relationship with the number 
of nuclei present in the liquid; the greater number of nuclei will allow more grains 
to form and hence smaller grain size. The most commonly used grain refiners are 
Al-Ti, Al-Ti-B, and Al-B master alloys (Jarfors 1992). 

Some of the benefits of exhibiting finer grains in Al-Si castings are: 

• Improved feeding 

• Improved castability 

• Reduced and evenly distribution of shrinkage porosity 

• Better distribution of intermetallics 

All these benefits that grain refiner adds to Al-Si castings will promote improve- 
ments of the mechanical properties; it is therefore worth to bear in mind that in 
Al-Si alloys, the mechanical performance is not solely enhanced by grain-size 
reductions. In order to realize higher ductility, the micro structure of Al-Si alloys, 
and mainly the Si particles, has to be modified. Besides its brittleness, the mor- 
phology of Si acting as internal stress raisers results in early fracture of casting as it 
is exposed to load. Modification is the process in which the morphology of the Si 
particles is changed from brittle acicular shape into fine fibrous form (see Fig. 62) 
(Zamani et al. 2013). Among available modifiers, Na and Sr are mostly utilized. 

The heat treatment process that is commonly applied on Al-Si-based cast 
components consists of solution heat treatment, quenching followed by nature 
and/or artificial ageing. 

The soluble phases that have been formed during solidification, such as Mg- and 
Cu-rich phases, can be redissolved into the Al matrix. Solution heat treatment has 
also the characteristic of changing Si crystal morphology, from needle to spherical 
shape promoting the coarsening of these particles and homogenizing the distribu- 
tion of alloying elements. This process is initiated because of the interfacial 
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Fig. 62 The role of modification where (a) is unmodified and (b) is modified structure of Al-7% 
Si-Mg alloy (Zamani et al. 2013) 


instability between two different phases, driving to reduce the total interfacial 
energy. The spheroidization that is achieved compensates to the micro stresses 
created by the formation of precipitates, thereby affecting the ductility. The time 
and temperature, at which the solution treatment is conducted, depend upon the 
alloy composition and the melting point of the intermetallic phases formed. 

To avoid grain boundary melting, the restrictive solution temperature for Al-Si 
with Cu containing alloys should be limited to 525 °C for 1 wt% Cu and 495 °C for 
more than 2 wt% Cu. This treatment does not produce optimum mechanical 
properties for these alloys because it neither maximizes the dissolution of Cu-rich 
phases nor is able to sufficiently change the Si morphology. Modification with Sr is 
also recommended as it increases the melting point by 5 °C raising the eutectic 
temperature of Cu-rich phases. AlSiMg alloys permit higher solution temperatures 
when compared to AlSiCuMg alloys. Due to turbulent flow of melt when high- 
pressure die casting is employed, components may contain a large number of air 
entrapments that may expand and blister at elevated temperatures, making the 
application of heat treatment inappropriate. 

The second step of major importance is the quenching which is defined as 
cooling of the alloy from the high to room temperature, after solution treatment. 
This cooling, if done at a faster rate, ensures better mechanical properties while 
retaining the supersaturated solute and quenched in vacancies at maximum level in 
the matrix. The improvement of the strength properties such as hardness and tensile 
strength is obtained during the ageing process. 

The final step, ageing, allows the control of precipitate formations. Ageing takes 
place at room temperature (natural ageing) or at an elevated temperature in the 
range of 150-210 °C (artificial ageing). The objective of ageing is to obtain a 
uniform distribution of small precipitates, which give a high strength. The high 
level of supersaturation and the high vacancy concentration after quenching cause 
rapid formation of Guinier-Preston (GP) zones, which are clusters that contain a 
high fraction of solute atoms. These clusters are very small and finely dispersed in 
the matrix, because diffusion is limited at room temperature. The clusters are 
coherent with the matrix, but elastic stresses are induced around the clusters due 
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to the difference in size between the solute and the solvent atoms. Coherent 
precipitates are formed because they have a small interface energy which gives a 
smaller critical radius that makes them precipitate relatively easy. The GP zones 
and the stress field around them hinder the dislocation motion, resulting in an 
increase in strength. 

Alloys containing Mg harden fast at room temperature (see Fig. 63a). An 
increase in hardness is seen after about 1 h and a hardness plateau is reached after 
about 100 h, where no further increase in hardness occurs. The hardness that can be 
reached depends on the composition of the alloy; a higher Mg concentration (<0.7 
wt%) should normally result in a higher hardness. Al-Si-Cu alloys harden slowly at 
room temperature. If a small concentration of Mg is added, the alloy responds 
quicker to natural ageing (see Fig. 63b) (Moller et al. 2007; Reif et al. 1997). 

Artificial ageing, similarly to natural ageing, involves precipitation but at an 
elevated temperature, normally in the range 150-210 °C, at different times 
depending on the strength level required. At these temperatures atoms can move 
over larger distances, and the precipitates formed during artificial ageing are 
normally much larger in size than GP zones. 

The initial events that take place within the components, exposed to ageing after 
quenching, start with the GP zones formations followed, with time, by metastable 
precipitates that are either coherent or semi-coherent with the matrix. The metasta- 
ble precipitates, when reaching a critical size, may nucleate on the GP zones, 
homogeneously in the matrix or heterogeneously on dislocations or other lattice 
defects. The metastable precipitates grow upon further ageing by diffusion of atoms 
from the supersaturated solid solution to the precipitates. As the supersaturation 
decreases, the precipitates continue to grow in accordance with Ostwald ripening. 
The process is driven by a reduction in surface energy, meaning that the larger 
precipitates coarsen as the smaller ones dissolve. As the precipitates grow, with 
time, they become non-coherent with a direct negative impact on strength. Loss in 
coherency is then the last stage in the precipitation sequence. The precipitation 
procedure does not necessarily have to follow the abovementioned sequence, it can 
also start at a certain stage, i.e., an intermediate stage, depending on the thermal 
history of the component or tested material (natural ageing, artificial ageing tem- 
perature, heating rate, etc.) (Sjolander 2011). 


Defect Formation in Aluminum Alloys 

Defects in castings, in the form of gas and shrinkage porosity, cold fills, dross, oxide 
films, inclusions, entrapped air bubbles and/or unwanted phases such as the dele- 
terious Fe-rich particles, Al 5 FeSi, sludge, etc., will all act as initiation sites of 
cracks and as propagation paths. The presence of defects in Al castings promotes a 
wide scattering and lowering of the strength and ductility, precluding any pre- 
dictions of the tensile behavior. 
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Fig. 63 Natural ageing curves for (a) a SSM HPDC A356 alloy (Moller et al. 2007) and (b) an 
Al-9Si-3.5Cu and an Al-9Si-3.5Cu-0.5Mg alloy (Reif et al. 1997) 
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The formation of porosity is a result of many elements such as the initial gas 
content in the melt, grain nucleation, processing conditions like cooling rate, 
thermal gradient, solidification time and external pressure, the chemical composi- 
tion of the alloy and the inclusion content, etc. Mostly, the formation of porosity is 
devoted to the cooperative effect of dissolved hydrogen, nucleation sites, and 
shrinkage associated with the solidification process. 

In general, the level of porosity will govern the mechanical properties; a high 
level and a certain porosity morphology will promote a risk of occurrence of a 
premature failure, especially for parts with thin sections. It is generally stated that 
the static tensile properties, such as ultimate tensile strength, yield strength, and 
elongation to fracture, are all decreased with an increased degree of porosity. On 
contrary, according to other related publications, the reduction in tensile properties 
has almost no correlation with the average of volume fraction of porosity. In fact, 
the decrease in tensile properties is attributed to the length and/or the area fraction 
of the defects in the fracture surface. Regarding the fatigue strength, it is dependent 
on the combination of pores and metallurgical defects such as alumina skins, cold 
fills, and dross. 

The surfaces of A1 melts comprise oxide films which will enter the bulk liquid if 
the surface happens to fold or by droplets forming and falling back into the melt. As 
(Campbell 2002) has proved, these entrained folded oxide films that remain in the 
melt become the largest defects in the final cast component, leading to leakage path 
defects and structural weakness in castings requiring strength, ductility, and fatigue 
resistance. These folded films can be unfurled, opening, growing, and swelling due 
to the hydrogen in solution, shrinkage during solidification, Fe in the melt, and large 
grain size. Worth to remember, this unfurling procedure will be suppressed if the 
casting is solidified quickly, while castings with longer solidification times will 
exhibit higher levels of swelled gas as well as shrinkage porosity and also larger 
amounts and coarse Fe-rich particles which are supposed to nucleate and grow on 
the outer, wetted surfaces of the oxide films. But the oxide films are not only evil, if 
they continue to stay on the surface of the melt; they will offer a valuable protection 
layer from catastrophic oxidation (Campbell 2002). 


Magnesium Alloys 

Being the lightest of all stmctural metals, with a room temperature density of 1 .74 

o 

g/cm , Mg forms the basis for commercial alloys which have found successful use in a 
wide variety of applications including automotive, aerospace, and telecom industries 
(Jarfors et al. 2009). Comprising about 2.7 % of the earth’s crust and 0.13 % in the 
seawater of the world’s oceans justify the saying that Mg is relatively abundant. 

Magnesium crystallizes in HCP, hexagonal close-packed, structure and is com- 
paratively weak in pure form. Therefore, in order to improve the metal performance, 
Mg is usually strengthened by the use of suitable alloying elements. Al, which is 
relatively inexpensive, is the most widely used alloying element with a strong 
strengthening effect on Mg, on the expense of ductility (see Fig. 64 and Table 8). 
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Fig. 64 Illustration of the mechanical properties variation when alloying with Al and varying the 
cooling conditions (Cao and Wessen 2004) 


Table 8 The influence of alloying elements on the mechanical properties (Kainer) 


Property 

AZ91 

AM60 

AM50 

AM20 

AS41 

AS21 

AE42 

Ultimate tensile strength, 
MPa 

240 

225 

210 

190 

215 

175 

230 

Yields strength (0.2 % 
offset), MPa 

160 

130 

125 

90 

140 

110 

145 

Elongation, % 

3 

8 

10 

12 

6 

9 

19 

Young’s modulus, GPa 

45 

45 

45 

45 

45 

45 

45 

Hardness, HBS 

70 

65 

60 

45 

60 

55 

60 

Impact strength, J 

6 

17 

18 

18 

4 

5 

5 


It is the reason why the most commercial Mg alloys are based on the Mg-Al 
system with small additions of other elements such as Zn, Mn, Si, and rare earth, to 
reach the following setup demands on properties (see Table 8): 

• Al and A: improved castability, strength, and corrosion resistance but decreased 
ductility 

• Zn and Z: minor improvement in strength and corrosion-resistant alloys 

• Mn and M: improved corrosion resistance (controls the Fe content) 

• Si and S: improved creep strength (forms intermetallic constituents which 
stabilize grain boundaries) 

• Rare earth E: improved creep strength (similar to Si) 

• Impurities Cu, Ni, and Fe: diminishes the corrosion resistance 

The nomenclature system of Mg alloys is somewhat different from the other 
metal alloys systems and here follows an example based on the ASTM norm. 
Consider the three alloys AZ91A, AZ91B, and AZ91C in these designations: 
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Fig. 65 The influence of cooling rate on the eutectic of Mg alloys containing 10 % Al: (a) is 
illustrating a slow cooling rate compared to (b) which is solidified under higher cooling conditions 
(Cao and Wessen 2004) 

• A represents aluminum, the alloying element specified in the greatest amount. 

• Z represents zinc, the alloying element specified in the second greatest amount. 

• 9 indicates that the rounded mean aluminum percentage lies between 8.6 and 9.4. 

• 1 signifies that the rounded mean of the zinc lies between 0.6 and 1.4. 

• A as the final letter in the first example indicates that this is the first alloy whose 
composition qualified assignment of the designation AZ91. The final serial 
letters B and C in the second and third examples signify alloys subsequently 
developed whose specified compositions differ slightly from the first and from 
one another but do not differ sufficiently to effect a change in the basic desig- 
nation. Sometimes the last letters are D which corresponds to high purity and E 
to high corrosion resistance. 

Due to the crystal structure of Mg, solid forming processes are infrequently used; 
instead, liquid forming technologies, mainly high-pressure die casting, have exten- 
sively been employed for producing components. The excellent castability of this 
metal enables high rate production of complex and thin-walled castings combined 
with low costs. Other processes which have gained major considerations and 
development lately are the semisolid forming processes and technologies. Besides, 
Mg alloys have the potential to be fully recycled; melt residues and scrap can be 
directly reinserted into the process (Jarfors et al. 2009). However, depending on 
wall thickness and selection of process and type of Mg alloys, a wide range of 
properties could be offered (see Figs. 64 and 65). 

The major driving force for the increased application of die-cast magnesium 
components comes from the automotive industry, where lightweight solutions for 
structural components are in demand to reduce the vehicle weight and improve the 
engine efficiency. Nevertheless, Mg alloy sand castings are also used in some 
aerospace applications because they offer a clear weight advantage over other 
materials. A considerable amount of research and development on these alloys 
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Fig. 66 Illustration of a hot 
tears in AM50 (Cao 
et al. 2005) 
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has resulted in some spectacular improvements in general properties compared with 
the earlier AZ types (Jarfors et al. 2009). 

The properties of Mg alloys could be further tailored by grain size control but 
compared to Al and Mg components still only have a small market share so far. The 
main reason is that Mg alloys have critical drawbacks such as the high sensitivity to 
defect formation during the casting process and the high turbulence associated with 
high-pressure die casting process during cavity filling ensuring that entrapment of 
air is unavoidable. Using vacuum-assisted die casting may decrease the gas entrap- 
ment to some extent. Besides gas porosity, due to the large solidification range of 
most Mg alloys, it is not surprising that shrinkage porosity is also likely to form as 
hot tearing (see Fig. 66). Another drawback is due sensitivity to the formation of 
segregation band defect. These segregation bands always contain a higher fraction 
of eutectic and normally follow the outer contour of the casting sometimes also with 
certain fraction of porosity. 

By keeping certain elements controlled and at minimum, such as Fe, Cu, and Ni, 
the corrosion resistance of Mg alloys is considerably improved and in some cases 
being superior to Al die-cast alloys. Nevertheless, galvanic corrosion can not only be 
controlled by governing levels of impurities but also careful design such as suitable 
surface coating as well as suitable design of the assembly of the Mg components with 
other metals. Another limitation for an increased penetration of Mg is the low creep 
resistance at elevated temperature, and the currently mostly used alloys such as AZ91, 
AM50, and AM60 start to creep already at temperature above 100 °C which is mainly 
due to grain and phase sliding. However, development of alloys such as AS21, AE42, 
Mg-Zn-Al-Ca, etc., for improved creep resistance has been successful but castability 
and cost have put hinders for large-scale commercial use (Jarfors et al. 2009). 
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Copper Alloys 

Copper is a metal with thousands of years of history; apparently, back to dates 
around 8700 BC, a copper item was found in what is now Iraq. Signs of copper 
smelting have also been registered to dates 5000 BC. High conductivity of elec- 
tricity and heat, good corrosion resistance, relatively good tribological properties, 
and the ease with which it can be formed into complex shapes make the copper so 
valuable. Being difficult to cast, almost 80 % of the copper is used in the form of 
pure copper, and prone to defect formations as porosity and internal cavities and 
surface cracking, pure copper is alloyed with other elements allowing this metal to 
reach strength superior to many other metals such light alloys and some steels. 

Q 

Copper alloys have generally higher density, 8.9 g/ cm at room temperature, than 
steel which is sometimes can be considered as a disadvantage. Compared to light 
alloys such as aluminum and magnesium, copper alloys are generally better in 
terms of fatigue and creep properties. Although some copper alloys exhibit rela- 
tively high yield strength (see Table 9), their specific strength is typically less than 
that of aluminum or magnesium alloys (ASM Handbook 1992; Copper Develop- 
ment Association 1991). 

A variety of cast copper alloys are available and here is a rough breakdown: 


Bronzes and red metals. 

- Bronzes and red metals are based upon alloying with Sn (see Fig. 67), 
followed by additions of Al, Ni, and Pb but also some Fe and Mn. The 
bronzes are subdivided into tin bronzes where Sn is around 8-14 %, 
tin-lead bronzes where the Pb is somewhat higher than Sn ranging from 
10 % to 20 %, and red metals. Red metals are characterized by being alloyed 
with Sn and Pb, levels between 1 % and 6 %, respectively, and with additions 
of 2-8 % Zn. These alloys are relatively strong but exhibit low ductility. 
Typical uses are as valves, pipe fittings, and pumps. They are also frequently 
used for bearings where loads and speeds are moderate (ASM Handbook 
1992; Copper Development Association 1991). 

- Aluminum bronzes may be alloyed with up to 14 % Al, but increasing the 
Ni level in these alloys between 4 % and 7 %, the alloys are named nickel- 
aluminum bronzes. Alloying with 10-14 % Mn, the alloys are recognized 
as manganese-aluminum bronzes. Aluminum bronzes offer high 
mechanical properties (see Table 9) and superior corrosion resistance. 
The strength obtained with these alloys is comparable to some steels 
exhibiting relatively good wear properties. These alloys are found in 
components such as propellers, pump housings, wear turbines, etc. 
(ASM Handbook 1992; Copper Development Association 1991). Being 
the major alloying element, 10-30 % Ni, these alloys are denoted as 
copper-nickel alloys and utilized in heat exchangers, pump parts, bearings 
due to excellent sliding and wear properties, etc. Typical mechanical 
properties are presented in Table 9 (ASM Handbook 1992; Copper Devel- 
opment Association 1991). 
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Cu wt % Sn Sn 


Fig. 67 Illustration of Cu-Sn phase diagram (Courtesy Granta Design, UK) 


• Copper alloys based upon the alloying element Zn are denoted brasses (see Fig. 68) . 
Brasses can be further alloyed with Pb, called red brasses, and with Al, Fe, Mn, and 
Ni in order to enhance the mechanical performance. Alloying with Pd is mainly to 
facilitate chip cutting during machining. As the Zn content increases, the melting 
point, density, modulus, and electrical and thermal conductivities decrease, but on 
the other hand, the expansion coefficient, the strength, and hardness increase (ASM 
Handbook 1992; Copper Development Association 1991). 

• High copper alloys, normally pure, are characterized by the high conductivity of 
heat and electricity, a good corrosion resistance, and the ease with which it can 
be formed into complex shapes. However, the strength is quite low and the alloys 
have hereby alloyed for strength to be improved but this is at the expense of 
conductivity (ASM Handbook 1992; Copper Development Association 1991). 

• Copper alloys may also be alloyed with one or more of the substances such Cr, 
Co, Be, Zr, and Si and thus obtain high strength with a maintained high 
conductivity and serve in applications such as welding jaws, heat sinks, and 
molds. The copper-beryllium alloys are known for their high strength and 
stiffness and non-sparking qualities extending the application fields of copper 
alloys further (ASM Handbook 1992; Copper Development Association 1991). 

Copper-based components can be produced by a variety of casting processes 
such as sand, shell, and investment processes as well as permanent mold casting 
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Fig. 68 Illustration of Cu-Zn phase diagram (Courtesy Granta Design, UK) 


processes such as gravity- and pressure die casting. Copper castings are completely 
recyclable and, if segregated from ferrous material, they have a high value and this 
is also true of scrapped obsolete components. As nearly all metal alloys, some 
porosity are found in pressure die castings, and as continuous castings solidify 
directionally under well controlled conditions, feeding is excellent and no porosity 
is encountered. Sand castings on the contrary need to be designed differently 
assuring directional solidification and hence ensuring good feeding arrangements 
(ASM Handbook 1992; Copper Development Association 1991). 

The melting of copper based alloys is associated with dissolving both oxygen 
and hydrogen. During the solidification process of copper alloys, the oxygen and 
hydrogen will be able to form water vapor causing porosity in the component. 
Besides, copper alloys containing A1 might form oxide skins which can act as 
cracks within the casting causing, for instance, leakage (Campbell 2002). In order 
to ensure sound castings, it is therefore recommended to apply special melting and 
metal treatment techniques when casting copper alloys. 

As for the micro structural and mechanical properties, cooling conditions will 
also influence the fineness of microstructure and overall properties. Castings solid- 
ifying under relatively high cooling rates exhibit finer micro structures and hence 
greater performance. 

Copper and its alloys can be classified according to unified numbering system, 
UNS, as follows: the letter C, for copper, followed by a 5-digit number. Only the 
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first digit means anything: Cl**** designates almost pure copper; the C2, C3, and 
C4 series are brasses with increasing Zn content; the C5s are bronzes based on Cu 
and Sn; the C6s are other bronzes containing A1 instead of Sn; and the C7s are 
copper-nickel alloys. There exist also a number of other designation systems 
developed by the International Organization for Standardization, ISO, and the 
Copper Development Association, CDA. 


Zinc Alloys 

If you are searching for a combination of design attributes such as stiffness and 
strength, larger than what A1 and Mg alloys can offer, then Zn alloys can be an 
option. Zn alloys are also unique in terms of producing parts with high impact 
strength and ductility as well as are utilized in castings where acceptable heat 
conductivity properties along with good machinability and corrosion resistance 

o 

are desirable; but the density is somewhat high, nearly 7.14 g/cnr at room temper- 
ature (ASM Handbook 1992; Porter 1991). 

Being one of the biggest users of Zn alloys in components such as lamp 
housings, handles, and safety features such as seat-belt buckle and chairs, the 
automotive industry is also employing Zn in applications such as carburetor bodies, 
windshield wiper part, parts for hydraulic brakes, etc. Given the good conductivity 
and electromagnetic shielding, these alloys are also found in electronic and telecom 
industries. Like other metals, in order to achieve the desired strength or quality, Zn 
must be alloyed with other elements; the most commonly used ones are Al, Cu, and 
Mg (see Table 10). Cd, Pb, and Sn may be present in small quantities causing 
swelling of cast parts, crack formations or distortions, and surface defects (blisters), 
if not controlled, leading to reduction in overall performance. Besides the negative 
impact on the mechanical properties of components, these impurities increase the 
proneness of alloys to intragranular corrosion (ASM Handbook 1992; Porter 1991). 

Cast Zn components are mainly produced by high-pressure die casting, hot 
chamber, due to the excellent castability offered by these alloys and for high 
production rate and cost savings. Nevertheless, sand and gravity die casting pro- 
cesses are also producing Zn-based components (Goodwin 1998). 

The major alloying element in Zn die casting alloys is Al which is for conventional 
alloys nearly 3/ 4—4.5 % (see Table 10). Besides increasing the fluidity and improving 
the castability, Al in Zn provides strengthening and reduction of grain size and 
diminishes the affinity of alloys to react with the steel dies and equipment. Alloys 
with Al levels below 3.4 % require higher casting temperature that results in short- 
ening the life of tools and dies and also resulting in lower mechanical properties and 
dimensional stability. The maximum impact strength of Zn is achieved at 3.4 % Al, 
but it is reduced if the level of Al is above 4.3 %. At Al levels of 5 %, a brittle Zn-Al 
eutectic is formed (see Fig. 69), but the material exhibits maximum fluidity. Alloys 
with higher Al levels offer an increase in mechanical properties (see Table 11) and 
have been replacing other metals such as aluminum and cast irons in components such 
as pumps, impellers, vehicle parts, etc. (ASM Handbook 1992; Porter 1991). 
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Table 10 Chemical compositions of some typical cast Zn alloys (ASM Handbook 1992; Porter 
1991) 



Alloy 

Element 

2 

3 

5 

7 

Al 

3.5-4. 3 

3. 5-4. 3 

3. 5-4. 3 

3. 5-4. 3 

Cu 

2.5-3. 5 

Max 0.25 

0.5-1.25 

Max 0.25 

Mg 

0.02-0.06 

0.02-0.06 

0.03-0.08 

0.005 

Fe max 

0.10 

0.10 

0.10 

0.075 

Pb max 

0.005 

0.005 

0.005 

0.0030 

Cd max 

0.004 

0.004 

0.004 

0.0020 

Sn 

0.003 

0.003 

0.003 

0.0010 

Ni 

— 

— 

— 

0.005-0.02 




r 

Wt % Zn 


Zinc die- casting aifoys 



0 20 40 60 70 80 9 0 95 100 



0 20 40 60 SO 100 


Al 


Atomic % Zn 


Zn 


Fig. 69 Illustration of the Al-Zn phase diagram (Courtesy Granta Design, UK) 


Besides Al, Zn-Al alloys are also alloyed with Cu to minimize, or at least 
neutralize, the influence of impurities and increase the resistance to intragranular 
corrosion. It is advised that Cu should be kept around 1.25 % in order to meet the 
demands upon dimensional stability. However in some countries Cu levels of 3 % 
are found, leading to a better castability, higher strength, and corrosion resistance. 
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Table 11 The variety of mechanical properties achieved with the different pressure die cast 
alloys at room temperature (Porter 1991; Goodwin 1998) 


Alloy 


Properties 

2 

3 

5 

7 

ZA8 a 

ZA12 a 

ZA27 a 

Ultimate tensile strength, 
MPa 

358 

283 

328 

283 

365-386 

392-414 

407-441 

Yield strength (0.2 offset), 
MPa 


220 

230 


283-296 

310-330 

365 

Young’s modulus, GPa 





85.5 

83 

75 

Elongation (on 5 1 mm), % 

8 

10 

7 

12 

6-10 

4-7 

2-3.5 

Hardness (Brinell 500 kg) 

100 

82 

91 

80 

100-106 

95-105 

116-122 


Separately cast test bars 


Mg at around 0.03-0.06 % is normally found in Zn alloys for primarily similar 
purposes as Cu. Mg at these levels improves the intragranular corrosion resistance 
and provides additional strength. At higher levels, Mg tends to impair fluidity, 
enhances hot tearing susceptibility, and lowers the ductility (ASM Handbook 1992; 
Porter 1991). 

A generic term to describe the families of the pressure die cast Zn-alloys has 
been developed by “the New Jersey Zinc Co” and further adopted by “the Imperial 
Smelting Corporation Ltd” which are ZAMAK and MAZAK, respectively (Porter 
1991). In both cases, the initial letters of the elements in their composition are 
differing only in the letter K, which instead for Cu it is designated from the German 
Kupfer. In Europe, the die-casting industry uses alloy ingots manufactured 
according to the EN 1774-1998 standard (Goodwin 1998). The variety of alloys 
developed is presented in Table 10, all containing A1 levels below 5 %. The 
nomenclatures of alloys developed to withstand higher strength up to 440 MPa in 
ultimate tensile strength and up to 400 MPa in yield strength (see Table 11) are 
commercially known as alloys ZA8, ZA12, and ZA27 where the figures correspond 
to the A1 levels found in the alloys. The ZA-based alloys exhibit superior or similar 
hardness to that of aluminum, brass, or bronze. 

Comparable to many metals alloys, the mechanical properties of Zn alloys are 
influenced by the geometry and wall thickness of the component. A reduction in wall 
thicknesses will result in an increase in the strength on the expense of ductility. Lower 
casting temperatures and gate velocities are preferable even though the latter have a 
little effect on the tensile properties. But due to interaction with other casting 
parameters, it is desirable if the gate velocity is relatively low (ASM Handbook 1992). 


Steel 

Does not “steel” instantly bring your mind to railroads, automotive, oilrigs, tankers, 
and skyscrapers, laundry equipment, chemical-processing equipment, jet-engine 
parts, surgical tools, furnace and boiler components, etc. (Askeland et al. 2011)? 
The main separating point between steels and cast irons is the carbon level which is 
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for steels below 2.1% carbon. Besides ferrite, which is pure iron at room temper- 
ature, the combination and arrangements of micro structure constituents, including 
heat treatment, allow steel alloys to offer a wide combination of properties to meet 
setup demands on the particular applications. With this said, it is no wonder 
why nothing else may at the same time offer excellent design attributes such as 
strength, ease of formability, and being so inexpensive. Some of the most 
important micro constituents present the micro structure (ASM Handbook 1992; 
Askeland et al. 2011): 

• Cementite, Fe 3 C, that contains 6.67 wt% C, may exhibit a HV up to 1,100 and 
appears as lamellae embedded in the pearlite in the microstructure. 

• Pearlite has a total of 0.8 % C, consisting of ferrite and Fe 3 C in a lamellar 
structure. The hardness of the pearlite may be as high as 400 HV. 

• By transformation of austenite at a large undercooling, bainite is obtained 
consisting of a rounded and finer cementite, compared to pearlite, in a ferritic 
matrix. The HV in this case can be up to 650. 

• A hardness of 900 HV can be achieved by quenching of the austenite forming 
martensite, diffusionless process that results in a metastable iron phase. 

• Tempered martensite, a mixture of very fine and nearly round cementite in 
ferrite, forms when martensite is reheated following its formation. 


Types of Steel Alloys 

The different steel types can be grouped in the following categories: carbon steel, 
also called plain carbon steel, and alloyed steels. Carbon steels are steels where the 
main alloying constituent is carbon. Nevertheless, carbon steel contains also small 
quantities (no minimum content is specified) of, for instance, Cr, Co, Mo, Ni, Ti, 
etc., but a maximum of 1.65 % Mn and 0.6 % Si is allowed (ASM Handbook 1992). 

Carbon Steels 

Carbon steels can be classified according to their carbon content into three broad 
groups (see Fig. 70): 

1. Low-carbon steels are alloyed with (approximate values) up to 0.30 % C, Mn 
(0.50-1.00 %), Si (0.25-0.80 %), and S and P up to 0.05 %. The automotive and 
the railroad industries are typical areas of applications in castings. These mate- 
rials are characterized by being relatively strong, being ductile, having no useful 
hardening by heat treatment except by normalizing, being weldable, their 
ductile -brittle transition temperature that is just below room temperature, etc. 
(ASM Handbook 1992). 

2. If a great balance between ductility and strength along with appreciable wear 
resistance are desired, medium-carbon steels exhibiting 0.30-0.70 % C can be 
recommended. This kind of steels represents more than half of all major parts of 
steel castings used typically in sectors such as railroad automotive, machinery 
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Fig. 70 Illustration of the phase diagram of steels (Courtesy Granta Design, UK) 


and tools, equipment for rolling mills, mining and construction equipment, etc. 
Medium-carbon steels are characterized by being very strong, heat treatable to 
produce a wide range of properties in quenched and tempered conditions, but 
difficult to weld and can become brittle below room temperature (ASM Hand- 
book 1992). 

3. High- and ultrahigh carbon steels with 0.70-2.0 % C and alloyed with elements 
such as Mn (0.50-1.0 %), Si (0.30-0.80 %), S, and P. Application fields that 
require relatively high strength levels and wear resistance are typical users of 
these steels. One of the disadvantages with kind of steels is the limited 
weldability and ductility; these alloys cannot be welded and tend to be brittle 
if the structure is not carefully controlled (ASM Handbook 1992). 

The strength of carbon steels is remarkably improved as the C level is increased 
(increasing cementite fraction) which is done on the expense of ductility as well as 
reduced weldability (see Table 12). Additional strength improvements are achieved 
by applying a proper heat treatment. Generally, additions of other alloying elements 
than C will influence the micro structure and accordingly the size of the phase areas 
in the phase diagram; for instance, alloying with Ni might lead to stable austenite at 
room temperature. 
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Table 12 The influence of increasing the carbon level on the mechanical properties of steel 
(Askeland et al. 2011) 



Alloy % C 

Mechanical properties a 

0.1 

0.3 

0.5 

0.7 

0.9 

Ultimate tensile strength, MPa 

350-400 

450-500 

650-700 

800-850 

950-1,000 

Yield strength (0.2 offset), MPa 

250 

320 

420 

500 

500 

Elongation, % 

40 

30 

25 

17 

12 

Hardness (Brinell) 

120 

160 

200 

240 

280 


a The mechanical properties can vary, and lower values could be obtained based upon alloying and 
processing conditions. The properties presented are corresponding to a normalized steel bars and 
are approximates 


Table 13 The influence of some alloying elements found in steel on the micro structure and 
properties 


Alloying 

element 

Effect on properties 

Si 

Improved castability and heat resistance 

Mn 

Improves hardness and strength at the expense of ductility 

Cr 

Improves hardenability, strength, both at room and elevated temperature , and 
ductility. At levels above 12 %, it improves corrosion resistance 

Ni 

Favors grain refinement, increased hardenability, toughness and overall 
mechanical properties as well as corrosion resistance 

Mo 

Increases hardenability as well as reducing the risk for tempering 
embrittlement 


Alloy Steel 

In order to obtain properties that are not found in plain carbon steels such as 
hardenability (the ability of the steel to form martensite during heat treatment), 
machinability, strength through heat treatments, corrosion and wear resistance, high 
strength at elevated temperature, etc., alloy steels have been developed by steels 
with one or more elements (see Table 13). Alloy steel is subdivided into two groups 
(Svensson 1980): 


1 . Cast low-alloy steels that are in addition to C may contain alloying elements such 
as Mn, Si, Ni, Cu, Cr, etc. with a specified minimum limit but as a total alloying 
elements no more than 8 %. Al, Ti, and Zr might also be found in order to control 
the oxygen content in the alloys. A large number of cast low alloy steels are 
developed in order to meet end-use requirements such as strength, corrosion, wear 
and heat resistance, etc. Carbon-manganese cast steels for high-strength grades 
(see Table 3), manganese-molybdenum cast steels for high yield strength at high 
temperature, and manganese-nickel-chromium-molybdenum cast steels for high 
hardenability requirements are some of the alloys found in this category. 

2. Cast high-alloy steels are characterized by their high strength approximately 
1,200 MPa in yield strength and around 1,500 MPa in tensile strength, corrosion 
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Table 14 The effect of heat treatment on some low-alloy steels (ASM Handbook 1992) 



Alloy 


Low alloy 

SAE4130 

SAE 4130 

SAE8630 

SAE8630 

Mechanical properties 

steel a 

N and T b 

Q and T 

NandT 

Q and T 

Ultimate tensile 
strength, MPa 

550-1,700 

650-720 

930-1,050 

750-830 

920-1,010 

Yield strength (0.2 
offset), MPa 

400-1,400 

475-525 

680-750 

530-590 

830-914 

Elongation, % 

3-38 

17-20 

13-16 

20-25 

13-16 


a The mechanical properties are estimates; lower values could be obtained based upon alloying and 
processing conditions. The wide ranges presented declare the properties that can be obtained by 
proper alloying and post-solidification treatments 
Abbreviations are N normalizing, T temper, and Q quenching 


resistance in aqueous media near room temperature and to serve in applications 
exposed to hot gases and liquids at elevated and high temperatures, etc. These 
alloys are normally alloyed with similar elements as for the low-alloy steels but 
with higher concentrations of mainly Cr and Ni, with levels up to 30 % and 60 %, 
respectively. Among the high-alloy steels for corrosion resistance are the ones 
normally referred to as cast stainless steels, with Cr levels above 12 % Cr. These 
steels are normally found in applications as materials in chemical processing and 
power generation equipment as well as in areas where high service temperature 
requirements, around 650 °C, are requested. Stainless steels, whether cast or 
wrought, are normally classified based upon their alloying elements or micro- 
structures and post-solidification treatment and can be denoted as martensitic , 
austenitic , ferritic , austenitic-ferritic , or duplex (ASM Handbook 1992; 
Askeland et al. 2011). 


The mechanical properties of these alloys may also be changed by suitable 
alloying (see Tables 12 and 13) and further heat treatment (Table 14). There are 
exceptions; however, cast high-alloy steels with more than 20-30 % Cr + Ni, in 
comparison with plain carbon and low alloy steels, do not exhibit similar phase 
changes during heating or cooling between room temperature and the melting point. 
Due to this fact, they are therefore considered non-hardenable, and their properties 
are only controlled by their chemical composition rather than heat treatment. With 
this in mind, each grade of high-alloy cast steel has to be given a special consid- 
eration with regard to casting design, foundry practice, and subsequent post- 
solidification treatment (ASM Handbook 1992). 

An attractive engineering material can also exhibit some disadvantages. The 
pouring temperature of steel is higher than for most other casting metals, ~ 1,650 
°C, and at these temperatures, steel readily oxidizes, so molten metal must be 
isolated from air. Besides oxidation, these high temperatures lead to increased 
wear-related damages on furnace and ladle, higher demands on mold material, 
and increased affinity to form defects. Molten steel has also relatively poor fluidity; 
castings require extensive feeding because of shrinkage and heat treatment in order 
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improve machinability, possess lower damping quality than cast irons, and are 
normally associated with relatively higher costs. Besides, oxygen and hydrogen 
are likely to be present in steels degrading the mechanical properties. Oxygen, for 
instance, reduces the impact energy, brittleness, etc., and hydrogen, like oxygen, 
also leads to the formation of porosity, both gas and shrinkage related as well as 
blowholes. Additionally, being in moist form at higher temperature, hydrogen also 
acts as a decarburizing agent (Campbell 2002). 

As the casting process, normally sand casting or mold by refractory materials 
generally gives rise to the formation of an inhomogeneous material micro- and 
macrostructures due to the cooling conditions, variety of geometrical complexity, 
and hence a variety of mechanical properties. Additionally, cast steel components 
can be heat treated for homogenization and for tailoring performance. The heat 
treatments are done to improve properties such as strength, increase/decrease 
hardness, dissolve carbides or other unwanted structures, and among the common 
heat treatment done on cast steel components are (ASM Handbook 1992; Askeland 
et al. 2011): 

• Normalizing including heating to 55-85 °C above the A 3 (a transition temper- 
ature when austenite transforms to ferrite) or A cm (transition from austenite to 
austenite + cementite) temperature in order to produce a uniform and desirable 
grain size distribution and help also in dissolving carbides and carbide network. 
This process normally leads to improved strength, impact toughness, and fatigue 
resistance. 

• Annealing comprising heating to 15—40 °C above A 3 or A { (the eutectoid 
temperature) temperature in order to improve machinability or plastic deforma- 
tion process. 

• Hardening with heating to the austenite region and subsequent quenching in 
order to obtain a martensitic structure. 

• Tempering is usually performed after hardening, to reduce some of the excess 
hardness, and is done by heating the metal to a much lower temperature 
than was used for hardening. The material toughness and ductility are then 
improved. 

• Spheroidizing, requiring several hours at about 30 °C below the A { in order to 
change the morphology of Fe 3 C into large spherical particles, results in 
increased softness and ductility making castings easily machined and deformed. 

The American Iron and Steel Institute (AISI) and the Society of Automotive 
Engineers (SAE) classify the carbon and low-alloy steels with a 4-digit code. The 
first number is the major alloying element and the second number designates the 
second alloying element. The last two numbers approximate the amount of carbon 
expressed in hundredths of a percent, for instance: 1,080 steel would be plain 
carbon steel with 0.80 % carbon and 4,340 steel would be Mo-Cr alloy with 
0.40 % carbon. High-alloy cast steels are most often classified using the designation 
system of the High Alloy Product Group of the Steel Founders’ Society of America 
(ASM Handbook 1992; Askeland et al. 2011). 
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Cast Component Design 

Casting design has very few and extremely simple guidelines but their application 
requires ingenuity and creative thinking. These simple rules can be broken down 
into two main classes and subclasses: 

• Design for manufacturing yield. 

• Use the same thickness everywhere. 

• Avoid hot spots in components and tools. 

• Do no restrict motion. 

• Design for low cost. 

• Near net shape design to reduce machining and assembly needs. 

• Design for cheap tools and molds. 


Design for Manufacturing Yield 

Use the Same Thickness Everywhere Rule 

The simplest rule of casting design is to think uniform thickness everywhere. 
Uneven solidification results in concentration of heat and thus also shrinkage 
porosity. This is particularly important in comers and should be applied in all 
sections and in corners. Example of this is in Fig. 71. 

If a deviation from the same thickness rule is necessary then a careful transition 
is essential and should be made following the guidelines shown in Fig. 72. Doing 
this will spread the heat concentration dispersing larger porosity into smaller micro 
porosity that is manageable. 


Avoid Hot spots in Component and Tools Rule 

Hot spots are generated in corners and in junctions and are in many cases in part a 
direct consequence of a deviation from the same thickness everywhere rule. The 
main goal is to disperse the area where the last solidified material ends up. If this 
area is too small and localized, then large shrinkage porosity forms. Avoiding heat 
concentrations in the component will thus spread the final shrinkage and a finer 
porosity may form instead or even be more or less avoided. This is best managed 
by, for instance, staggering multi- wall connections and applying ample radius in all 
areas where material is concentrated (Fig. 73). 


Do Not Restrict Motion Rule 

During cooling the material contracts and the mold and die will commonly restrict 
the motion of the part. In the case of a restricted motion, this will either result in 
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Fig. 71 Ideal application of 
same thickness everywhere in 
a comer 


Poor 





Better 


Fig. 72 Transition rule from 
a thick to thin section 


R=D/3 for d>2D/3 






L) 

d 


Use slope and blend for for d<2D/3 



residual stresses or cracking. In high-pressure die casting, this is partly controlled 
by the cooling time, where an early die opening reduces stress, but increases 
warpage and may as such create tolerance problems. In low pressure die casting 
processes and gravity-driven processes, this can also to some extent be used but the 
effect is not as dramatic as for high-pressure die-casting due to the higher cooling 
rates in the latter. In all cases it is important that the appropriate allowance for 
thermal shrinkage is made (Table 15). 

The shrinkage allowance will give the right dimensions of the part but will not 
help part stripping or ejection where problems with drag mark are common in high- 
pressure die casting of mold fracture in sand molding during pattern stripping. 
This is managed by using draft angle on surfaces perpendicular to the parting line 
(Table 16). 
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Fig. 73 Avoiding hot spots 
in the casting 


Poor 



Better 



Table 15 Shrinkage 
allowance - pattern 
(Svensson 2001) 


Casting alloy 

% 

Possible % span 

Cast iron 

1 

0.5-1. 3 

Steel 

2 

1. 5-2.5 

Aluminum alloys 

1.2 

0.8-1. 5 


Table 16 Draft angle 
requirements (Svensson 
2001 ) 


Section height (mm) 

Draft angle (°) 

5 

6 

10 

3 

25 

1 

65 and above 

0 

Internal surfaces require double draft 


Apart from the draft angles, there are additional design features that should 
be considered. All angles generate a motion restriction as illustrated in Fig. 74. 
It should here be noted that even with appropriate draft angle relative motion 
between different sections will generate stress. A typical such example are thin- 
flange LED cooling devices. The standard extruded design is a poor casting design 
(Fig. 75). Rearranging the fins to a better design allows the shrinkage to take place 
in the radial direction as experience by the casting. This is also shown in Fig. 75 as a 
better design. This requires that the designer understands the process physics and is 
able to couple part in-process behavior to design. Critical here is to initially design 
for the casting process and to involve a foundry early in the design process for 
maximum profitability. 
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Fig. 74 Illustration of 
designs alleviating shrinkage 
problems 



Better 







Fig. 75 Example of flange 
geometry on a round base 
catering for reduced griping 
in a metal die, enabling 
vesting of long cooling fins 
(i top view) 



Design for Low Cost 

Near Net Shape Design to Reduce Machining and Assembly Needs 

Casting is very flexible in terms of design freedom and capable to allow for 
design with built-in multi-functionality in the part. For this there are no general 
rules but requires creativity by removing or reducing need for secondary 
processing. This can be cast-in threading, magnets, or other features that may 
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Fig. 76 Example of functionalization of a component with precast holes ( top part ) in a compo- 
nent with complex features and thin walls (0.35 mm). Standard design is shown as the lower part 
(Courtesy of COMPtech AB, Sweden) 


Poor 



Design requires cores 


Better 

Fig. 77 Example on how to 
redesign to avoid use of cores 

be useful in your particular application. In Fig. 76 below is an example on how to 
significantly reduce secondary machining by precasting holes. This is also a good 
example on how to use the latest techniques such as rheo-casting to cast thin walls 
down to 0.35 mm. 



Design for Cheap Tools and Molds 

Cores can be made from sand, salt, or metal. The use of cores adds to the 
complexity and cost to the manufacturing process. A simple example of a redesign 
to reduce the use of cores is shown in Fig. 77. It is important to rethink function so 
that cores can be avoided. This is also one of the main reasons to adapt design early 
to the manufacturing route to make profitable parts. 
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Summary 

Castings are produced by a manufacturing method that together with alloy selection 
and melt treatment inherently affect the component’s overall micro structural prop- 
erties including defects. Whether ferrous or nonferrous materials are employed for a 
current application, design of components including runner, riser and gating, 
metallurgy, and process relations are existing, which lead to components containing 
a wide variety of micro structures including defect size, morphology, and distribu- 
tion and hence a variety of properties over the entire geometry. The wall thickness 
influences the coarseness of the micro structure, and the component will have 
properties depending on the local metallurgical and thermal histories. Moreover, 
it can build up residual stresses during cooling which in turn affects the deformation 
behavior of cast components. With this in mind, it is obvious that in order to meet 
setup demands on performance generally requires compromises to strike a balance 
between several conflicting objectives. Often, mechanical simulations of the load 
situations are based on the assumption that the cast product has constant material 
properties throughout the entire casting. Reality differs and therefore designers are 
encouraged to bear in mind that properties that originate from standardized test bar 
might probably lead to uneconomic over-dimensioning, improper results, and 
conclusions in most cases. 
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Abstract 

Incremental sheet metal forming (ISMF) has demonstrated its great potential to 
form complex three-dimensional parts without using component-specific tools 
against the conventional stamping operation. Forming components without 
component-specific tooling in ISMF provides a competitive alternative for 
economically and effectively fabricating low-volume functional sheet metal 
products; hence, it offers a valid manufacturing process to match the need of 
mass customization, which is regarded as the future of manufacturing. In ISMF 
process, sheet is clamped in a fixture/frame with an opening window on a 
programmable machine, and a hemispherical/spherical ended tool is 
programmed to move in a predefined path giving shape to the clamped sheet 
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by progressively deforming a small region in incremental steps. Although 
formability in incremental forming is higher than that of conventional forming, 
the capability to form components with desired accuracy and surface finish 
without fracture becomes an important requirement for commercializing the 
ISMF processes. This chapter presents various configurations developed to 
incrementally form the sheet metal components, experimental as well as numer- 
ical methods for estimating forming limits, procedures for enhancing the accu- 
racy, and methodologies for tool path generation. 


Introduction 

Evolution of manufacturing can be broadly divided into manual manufacturing, 
mechanization, hard automation (mass production), soft automation, and integrated 
manufacturing. Metal forming processes also have gone through a similar evolution 
process. Incremental forming is in practice from the day human began processing 
the metals using manual hammering for making tools and ornaments. Manual 
manufacturing is carried out in an integrated fashion with many limitations. For 
example, a blacksmith would have knowledge/information to design, fabricate, 
deliver, repair/service, and recycle say a plowing tool. Quality of the tool/ornament 
depends on the skill of technician and repeatability is not guaranteed. In addition, 
productivity is low. The industrial revolution and development of electrical 
and hydraulic machines lead to mass production of forming components using 
dies, resulting in traditional art of incremental processes disappear from center 
stage except in few cases (Groche et al. 2007). Incremental forming is characterized 
by small overlapping regions of deformation in desired sequence to form 
the intended geometry using simple tools. Hence, smaller deformation loads are 
sufficient at any instance. However, in the recent past, industry has witnessed a 
renewed interest in incremental forming (bulk as well as sheet metal) processes due 
to change in consumer psychology, emergence of individualism which led the 
market towards mass customization, change in market dynamics, reducing product 
prices and increasing product features and product variety forcing industry 
to reduce costs in order to stay competitive, and the possibility of developing 
numerical control machines with complex kinematics. The conventional production 
methods used in mass production are no longer able to fulfill the challenging 
demand of flexibility and agility at competitive prices. In incremental forming 
the shape of product is defined by kinematics of the tools instead of part-specific 
dies and punches. Tooling costs and subsequent investment in tool storage, space 
required to store dedicated tools and dies, and cost associated with maintenance 
of tools are low in incremental forming. Based on the surface area to volume 
ratio of material being deformed, incremental forming processes can be broadly 
classified into incremental bulk metal forming (IBMF) and incremental sheet metal 
forming (ISMF). It is well known that in IBMF, surface area to volume ratio is 
smaller than that of ISMF. This chapter will discuss the developments related to 
ISMF only. 
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Incremental Sheet Metal Forming and Configurations 

Sheet metal forming covers a variety of processes wherein shearing, bending, 
stretching, and drawing or their combinations are used to produce parts for a 
wide variety of applications. Most of the conventional sheet metal forming opera- 
tions require component-specific and costly tooling, and their design and fabrica- 
tion add to the lead time. Hence, these forming processes are suitable for mass 
production to offset for the tool costs. Societal changes impacted manufacturing 
including other engineering fields. Earlier industrialized economies were on mass 
production; however, a combination of advances in technology and information is 
making it increasingly possible to manufacture the customized products. In the 
competitive world market, customers are demanding for more flexible and personal 
design in quality, performance, services, and aesthetics of the products with 
approximately same cost (if not lesser) and quality (if not higher). To be compet- 
itive in the global economy and to satisfy customer demand without much lead 
time, majority of manufacturers are using computer-aided tools such as computer- 
aided design (CAD), computer-aided engineering (CAE), computer-aided process 
planning (CAPP), computer-aided manufacturing (CAM), computer-aided inspec- 
tion (CAI), etc., leading to integrated manufacturing without any physical bound- 
aries between different functional departments of an organization. Development of 
neutral data exchange standards led to proper integration of CAX tools. To be 
competitive in mass customization era, one shall be able to convert the ideas 
quickly into products using flexible and rapid prototyping and manufacturing as 
well as computer-aided technologies. In addition, some traditional technologies 
also may have to be used. For mass customization (manufacturing system shall be 
agile to consumers with customized products at mass production prices with 
equivalent or higher quality and lesser lead time), manufacturing processes have 
to be flexible with minimum change over time and tooling costs. Manufacturers 
worldwide now attempt to grow by competing on product differentiation as much as 
on price. This trend leads to short product life cycles, low volume of a chosen 
product model, and profitability that is as much dependent on the speed at which 
new models and products are introduced as on the control of direct cost. This trend 
in manufacturing sector led to the development of flexible manufacturing technol- 
ogies including layered manufacturing and particularly in the area of machining to 
changeover between different products with the help of fully automated systems. 
However, developments in the area of flexible forming (Allwood and Utsunomiya 
2006 ) have not kept pace with machining mainly because of the requirement of 
component-specific tooling in many of the operations. 

Conventional sheet metal forming operations require component- specific and 
costly tooling, and their design and fabrication add to the lead time. Incremental 
forming is one of the technologies that have emerged as an alternative to some of 
the conventional sheet metal forming processes for mass customization. Incremen- 
tal sheet metal forming (ISMF) is commonly regarded as a die-less sheet metal 
forming process which can form complex three-dimensional parts using relatively 
simple tools. It is receiving attention from the engineering community due to its 
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Fig. 1 Single -point 
incremental forming (SPIF) 



flexibility and low cost. This unique combination enables the rapid prototyping of 
functional sheet metal parts before mass production. In addition, it offers a valid 
manufacturing process to match the need of mass customization, which is regarded 
as the future of manufacturing (Wulf 2007). In ISMF process, sheet is clamped in a 
fixture/frame with an opening window on a commercial computer numerical 
control (CNC) machine, and a hemispherical/spherical ended tool is programmed 
to move in a predefined path giving shape to the clamped sheet by progressively 
deforming a small region in incremental steps. As the forming tool moves and 
deforms a small portion at a time, the overall time to produce one component is 
comparatively more, but for small batch sizes and prototyping, ISMF demonstrated 
its potential of being cost competitive. Existing experimental configurations for 
incremental sheet metal forming can be broadly classified into two categories: with 
(full/partial) and without die/pattem (Jeswiet 2001) support. 

Negative die-less incremental forming, also known as single -point incremental 
forming (SPIF), is the earliest form of incremental forming. Figure 1 shows the block 
diagram of SPIF with a spherical tool. In SPIF, tool generally comes in contact with 
the sheet close to the clamped boundary at a programmed location and moves down 
by an amount equivalent to chosen incremental depth or a fraction of that depending 
on the type of tool path used. Figure 2a, b depicts the contour and spiral tool paths. In 
both the cases, tool moves along the programmed path peripherally. 

In contour tool path, tool moves down by an amount equivalent to incremental 
depth at the starting of each contour, whereas in spiral tool path, tool gradually 
moves down and completes the downward movement equal to incremental depth by 
the time tool completes 360° movement along the spiral, and the same can be 
clearly seen in Fig. 2a, b. Note that in contour tool path, tool disengages at the end 
of each contour, moves to the starting point of next contour, and plunges down 
equivalent to incremental depth from its original location at the end of earlier 
contour whereas in case of spiral tool path, tool disengages only after completely 
forming the component. Figure 2c, d shows the components produced using contour 
as well as spiral paths, and a distinct mark connecting all the start/end points of each 
contour can be noticed in case of contour tool path. Bending between the clamped 
region and component periphery can be clearly seen from these figures and is one of 
the reasons of inaccuracy. All wood et al. (2005) have designed and built a dedicated 
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Fig. 2 Contour and helical tool paths 


Fig. 3 Block diagram of 
SPIF developed at Cambridge 
University (Allwood 
et al. 2005) 



Forming tool 
Sheet fixture 


SPIF machine at Cambridge University and its block diagram is shown in Fig. 3. 
They have designed a tool-mounting system that allows free rotation of tool 
passively and for ease of mounting and un-mounting of tools. Provision for 
measurement of load(s) is provided by mounting the fixture on load cells. 

Positive die-less incremental forming, also referred to as two-point incremental 
forming (TPIF), is another variant of ISMF and is known to be first attempted by 
Matsubara (2001). In this process (Fig. 4), clamped sheet can move up and down. 
One can clearly see from Fig. 4 that the sheet metal is restrained at two locations 
(other than the clamping), i.e., at one location by forming tool and the second 
location is the static support. This static support can be a partial die or a full die. 
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Fig. 4 Two-point incremental forming (TP IF) configurations: (a) partial support and (b) full 
support 



Forming tool 


Pattern support 
Sheet fixture 



Fixture 


Fig. 5 Configuration of commercial machine developed by Amino Corporation (http://www. 
aminonac . c a/product_e_diele s s . asp ) 

Full die can be either a negative one as shown in Fig. 4b or a positive die. Most of 
the above configurations are realized by mounting the required tools on stand-alone 
NC machines. Amino Corporation of Japan has developed a commercial machine 
for incremental sheet metal forming with pattern support and is shown in Fig. 5. 
Note that only movements in two directions are provided to tool and the third 
movement necessary to form the component is provided to the fixture, and in 
addition, the fixture can move up and down. 

Recently, a variant of TPIF (Fig. 6) in which, instead of partial or full die, 
another independently controlled forming tool is used which provides further 
flexibility to the process as features on both the sides of initial plane of the sheet 
can be formed. This variant is named as double-sided incremental forming (DSIF). 
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Fig. 6 Double- sided 
incremental forming (DSIF) 




Here, the deforming and supporting roles of each tool will keep changing depending 
on the geometry being formed at the instant and the geometry that has to be formed 
later. This configuration enhances the complexity of the components that can be 
formed and reduces many of the limitations associated with incremental forming. 

Modifications of the process setup have been made by various research groups, 
for example, to form doubly curved surface, Yoon and Yang (2001) have used a 
movable punch. Their setup (Fig. 7) consists of a movable punch and a supporting 
tool that has four hemispherical -headed cylindrical pins arranged in a grid. The 
sheet is placed between punch and supporting tool without clamping; hence, the 
deformation in this arrangement is mainly due to bending. The downward move- 
ment of punch bends the sheet, and the movement of sheet on the support tool 
changes the deformation location. Each bending operation produces a spherical 
shape and their combination gives doubly curved surface. The sequence of opera- 
tions in this process is shown in Fig. 7c. Required curvature (. R ), during the 
deformation, is controlled by selecting the downward movement (A z) of the 
punch from the initial plane of the sheet and grid size (2a) and is given by: 



2 * Az 



Meier et al. (2011) have used two robots (Fig. 8) to control different tools on 
either side of sheet and termed the process as duplex incremental forming (DPIF). 
They used position control for the forming tool and combination of force and 
position control for the support tool to maintain continuous contact. Many people 
(International Patent 1999; Jurisevic et al. 2003; Emmens 2006) have tried water jet 
as a forming tool along with die/pattem support. Note that water jet system is force 
controlled, whereas the NC tool systems generally used for incremental forming are 
displacement controlled. 

The first patent on incremental sheet metal forming is by Leszak (1967) way 
back in 1967. In this invention, a turning machine with a backing plate and a pair of 
clamping rings (to hold the sheet) mounted on the machine turntable and a roller 
tool mounted on carriage as shown in Fig. 9 is used. The sheet is rotated on 
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Fig. 7 Setup proposed by Yoon and Yang (2001) to form doubly curved surface 
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Fig. 8 Configuration of duplex incremental forming 


turntable, while the roller moving on the carriage applies pressure on the sheet. This 
methodology can only be used to produce axisymmetric components. More 
recently, Emmens et al. (2010) reviewed the available literature, especially patents, 
available on different variants of ISF. Starting from the inception the chronological 
developments in ISF have been summarized by them. It can be seen from their 
review that all patents are on different forms of SPIF and TPIF with or without 
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Fig. 9 Schematic showing the apparatus used by Leszak (1967) 


partial or full dies (four variants as shown in Figs. 1, 4, and 6). All such kinds of ISF 
process variants are able to form simple to complex geometries but having features 
only on one side of the initial plane of sheet metal workpiece. 

First comprehensive review article on ISMF has appeared in Annals of CIRP by 
Jeswiet et al. (2005a). Based on the work presented in literature, they provided many 
useful observations and guidelines for incremental forming, as summarized below: 

• Formability in SPIF increases with decrease in tool size as well as incremental 
step-down. 

• Anisotropy has an influence on formability, greater formability being achieved 
with smaller diameter tools in the transverse direction. 

• Formability decreases with sheet thickness. 

• Large incremental step-down increases the roughness. 

• Increase in the incremental step-down and tool size increases forming forces. 

• There is a limitation on the maximum draw angle that can be formed in one pass; 
hence, multiple pass methodologies are preferred for forming large angle 
components. 

• Spiral tool path is preferred over contour one, but the tool path generation is 
difficult. 
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Fig. 10 Sine law for 
thickness prediction in 
incremental forming 



Most of the attempts on all variants of ISMF are experimental in nature. 
Experimental observations/measurements of both single- and two-point incremen- 
tal forming have shown that using spiral tool path for simple and reasonably 
complex geometries with small to moderate wall angles, deformation is close to 
plane strain, i.e., strain in one direction is zero. Material does not deform signifi- 
cantly in the peripheral direction, i.e., strain is negligible in the peripheral direction 
(tool movement direction). Assuming that the strain in the peripheral direction is 
principal one, the thickness strain (reduction in thickness during the deformation) 
has to be equal to the meridional strain (increase in length) with opposite sign to 
satisfy the volume constancy. Based on the above discussion, for a constant wall 
angle cone shown in Fig. 10, the relationship between the wall thickness after 
deformation and the original sheet thickness in terms of wall angle can be written as 

tf = to sin (90 — a) (2) 

The above relation is very well known as sine law in ISMF. In case of continuously 
varying geometries, thickness can be obtained at any location by using the local wall 
angle. However, thickness measurement of a wide variety of components formed 
using single- and two-point incremental forming has shown that there is a consider- 
able difference between the measured values and those predicted by sine law. Note 
that the sine-law expression for predicting the thickness is derived under the ideal 
deformation conditions as shown in Fig. 10 where there is no radial displacement of 
any through- thickness section of the material. Thickness calculated using the sine law 
(Eq. 2) can only serve as a rough and an average approximate estimate. 

To understand the above mentioned conclusions and insight into various aspect 
of ISMF, details of each aspect to a reasonable extent are presented in this chapter. 


Formability and Thinning 

Literature (Jeswiet et al. 2005a) indicates that formability in incremental sheet 
metal forming is much higher than the one reported in deep drawing. Most of the 
studies related to formability in ISMF are experimental in nature. Capability to 
form components with desired accuracy and surface finish without fracture 
becomes an important requirement for commercializing the ISMF processes. Accu- 
rate prediction of formability helps in assisting better design of part geometry and 
corresponding tool path. It has been well accepted that conventional forming limits 
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Fig. 11 


Schematic representation of forming limit curve in SPIF against conventional forming 


are not suitable for incremental forming even when path dependency of strains are 
considered. The forming limit of a sheet metal in conventional forming is defined to 
be the state at which a localized thinning of sheet initiates when it is formed into a 
product shape in a stamping process. Formability of sheet metals in conventional 
forming is at present characterized by the Forming Limit Diagram (FLD) intro- 
duced in 1960s by Keeler and Backhofen (1964). The forming limit is convention- 
ally described as plot of major principal strain vs. minor principal strain. It must 
cover as much as possible the strain domain which occurs in industrial sheet metal 
forming processes. The curves are established by experiments that provide pair of 
the values of limiting major and minor principal strains obtained for various loading 
patterns (equi-biaxial, biaxial, plane strain and uniaxial). A schematic representa- 
tion comparing the forming limit curves of conventional forming (stamping/deep 
drawing) with that of single-point incremental forming is shown in Fig. 11a. It can 
be clearly seen that the forming limit curve for incremental forming is a straight line 
with a negative slope in the positive region of minor strain whereas the conven- 
tional one is present in both the regions of minor strain. Deformed grid obtained 
during incremental forming shown in Fig. 1 lb, c indicates that the deformation that 
occurs varies from plane strain to biaxial stretching. 

Iseki and Kumon (1994) have proposed an incremental sheet metal stretch test 
(Fig. 12a) to estimate the forming limits in incremental forming using a rolling ball 
and formed a groove shown in Fig. 12b. It can be seen that both plane strain and 
biaxial stretching states are captured during the experiment. Shim and Park (2001) 
experimentally investigated the formability (of fully annealed Al-1050 sheets) in 
incremental forming (IF) using a tool having freely rotating ball tip similar to that 
used by Iseki and Kumon (1994) and generated strain-based forming limit curves 
(FLC) for incremental forming by measuring the major and minor strains using a 
deformation grid. Different tool path strategies are used to study various strain 
paths and their suitability to generate forming limit curves for incremental forming. 
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Fig. 12 Groove test to obtain formability in incremental forming (Iseki and Kumon 1994): 
(a) schematic and (b) formed groove 



e : Minor strain 


Fig. 13 Groove test: (a) tool path and (b) forming limit curve ( FLC ) 


They recommended an alternate imposition of vertical and horizontal displace- 
ments to form a deep straight groove (similar to that formed by Iseki and Kumon 
(1994)) at the central portion of a square specimen clamped peripherally. 
Recommended methodology is depicted pictorially in Fig. 13. It is realized by 
them also that biaxial stretching occurs near the start and end of groove and plane 
stretching occurs along a straight path and tendency of cracking is reported to be 
greater at the ends due to biaxial stretching. They formed components having 
different shapes starting from triangular opening to octagonal opening and then a 
circular opening and observed that deformation is close to equi-biaxial at the 
corners and plane strain in the straight regions. Failure conditions of the 
abovementioned geometries were in good agreement with the forming limit curve 
generated by them using the groove test. 
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Fig. 14 Variation of formability with tool diameter and incremental depth during groove test 



Fig. 15 Tool path to achieve biaxial stretching at the center of two perpendicular straight grooves 

Kim and Park (2002) have used the groove test mentioned above to study the 
effect of different process parameters on the formability of fully annealed Al-1050 
sheets by conducting experiments, and their trends are presented in Fig. 14. It is also 
reported that the use of freely rotating ball tool enhances the formability as the 
friction along the tool work interface reduces. Formability decreases with increase 
in tool diameter and incremental depth. In addition, they carried out finite element 
analysis using PAMSTAMP - explicit to understand the effect of process param- 
eters on the deformation. 

Filice et al. (2002) have proposed experimental tests aiming at achieving differ- 
ent straining conditions that occur in incremental forming to study the formability. 
They formed a truncated pyramid to obtain the conditions of plane strain deforma- 
tion and proposed a geometry with two straight grooves perpendicular to each other 
(tool path shown in Fig. 15) to achieve almost pure biaxial stretching condition at 
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the center of the geometry. In addition, they proposed to form truncated cone 
geometry with a spiral tool path explained earlier to achieve the conditions between 
plane strain and biaxial states. In all the cases, they formed the components using 
hemispherical ended tool (not a freely rotating ball tool) till failure and generated 
the forming limit curves for incremental forming. Later, Fratini et al. (2004) have 
carried out experimental study to understand the effect of material properties on the 
formability in SPIF for commonly used sheet materials, namely, copper, brass, 
high-strength steel, deep drawing quality steel, AA1050-O, and AA6114-T4, in 
sheet metal industry. They measured strain-hardening constant (K), strain- 
hardening exponent (n), normal anisotropy index (R), ultimate tensile strength, 
and percentage elongation (%PE) of all the materials by conducting tensile tests. 
Truncated cones and pyramids of varying wall angles were formed till failure, to 
study the influence of abovementioned process parameters on formability of incre- 
mental forming and perform statistical analysis. Based on the analysis, they con- 
cluded that strain-hardening exponent, strain-hardening constant, and normal 
anisotropy index have high, medium, and low influence respectively. Ultimate 
tensile strength has negligible influence, whereas percentage elongation has 
medium influence. Among the interactive terms, only interaction between strain- 
hardening exponent with strain-hardening constant and percentage elongation are 
significant, whereas the others are insignificant. In addition, they reported the 
influence of same process parameters on conventional forming limit diagram also. 

Ham and Jeswiet (2006) used two fractional factorial designs of experiments to 
study the effect of process variables on formability of AA3003. First set of design 
(wall angle, vertical step size/incremental depth, base/component opening diame- 
ter, component depth, feed rate, and spindle speed) indicates that feed rate, spindle 
rotation speed, step size, and forming angle decide whether a part can be success- 
fully formed or not. It was also reported that faster spindle rotation speed improves 
formability. Second set of experiments (vertical step size, sheet thickness, and tool 
diameter) reveals that vertical step size (they used 0.05, 0.127, and 0.25 mm) has 
little effect on the maximum forming angle. Material thickness, tool size, and the 
interaction between material thickness and tool size have a considerable influence 
on maximum forming angle. Later they (Ham and Jeswiet 2007) used Box-Behnken 
design of experiments to study the forming limits in SPIF. They considered five 
parameters, namely, material type (three different aluminum alloys), thickness, 
formed shape (cone, pyramid, and dome), tool size, and incremental step size, at 
three levels for the experimentation. Material type has significant effect; the one 
having lower ultimate tensile strength will have greater formability. Shape also has 
some influence on the formability as the type of deformation is dependent on 
geometry. Allwood et al. (2007) conducted experiments using Al 5251-H22 sheets 
to explain the reasons behind higher forming limits observed in incremental 
forming and shown that the lines joining corresponding points on the upper and 
lower surfaces of sheet formed by SPIF remain almost normal to the surface in 
meridional plane (Fig. 16), indicating that the deformation in this plane is predom- 
inantly pure bending and stretching. Whereas measurements indicated that there is 
a relative movement between corresponding points parallel to the tool movement 
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Fig. 16 Deformation 
behavior in meridional plane 
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Fig. 17 Force measurement and force trend before failure 


direction, suggesting that shear occurs in this direction. Above observations 
indicate that an appreciable amount of through-thickness shear occurs during 
incremental forming in the direction parallel to tool motion. Due to the presence 
of through-thickness shear, the tensile stresses responsible for fracture get reduced; 
hence, formability is higher in incremental forming. 

Ambrogio et al. (2005a) made an attempt to predict the sheet thinning in SPIF 
using finite element analysis (implicit) and compared with that of experimental 
values and concluded that the predictions using FEA as well as measured experi- 
mental values are less than that of sine-law thickness prediction over most of the 
deformed region. In addition, experimental values are lower than the values 
predicted using FEA. Ambrogio et al. (2006) used a force-based strategy that can 
be used online to predict formability/failure. They measured the vertical component 
(thrust force on the tool) of the forming force by mounting the incremental forming 
fixture on top of a dynamometer similar (Chintan 2008) to that as shown in Fig. 17a. 
They used the force gradient (monotonically decreasing curves, Fig. 17b) as a 
criterion for failure. Later, Szekeres et al. (2007) observed the similar force trends 
while forming cone but not for pyramid shape; hence, they indicated that tool force 
measurement may not be very reliable. Many researchers have conducted experi- 
mental study on the forming forces by forming a cone using SPIF (Jeswiet 
et al. 2005b; Jeswiet and Szekeres 2005; Duflou et al. 2005, 2007a, b, 2008a; Filice 
et al. 2006; Ambrogio et al. 2007a; Aerens et al. 2009; Henrard et al. 2011). It is 
reported that the forming forces increase with tool diameter, wall angle, incremen- 
tal step size, and sheet thickness as shown in Fig. 18. Increasing direction of the 
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Fig. 18 Variation of vertical component of force with (a) tool diameter, (b) wall angle, 
(c) incremental depth, and (d) sheet thickness 


parameter is shown using arrow. Note that force in the tool axial direction reaches 
peak value and then attains steady state in all cases. 

To reduce the number of experiments required to test the forming limits, Hussain 
and Gao (2007) considered an axisymmetric component with varying wall angles 
along the depth direction. The cross section of the component parallel to the 
incremental depth direction is chosen as circular profile, and the same is shown 
schematically in Fig. 19. Components are formed till the failure depth and quanti- 
fied the formability as wall angle at that height. Later conical components with 
failure angle obtained using varying angle components mentioned above are 
formed, and it is concluded that formable wall angles obtained by forming conical 
cups are more accurate but time-consuming. This work is extended (Hussain 
et al. 2007a) by forming components with different cross-sectional profiles, namely, 
elliptical, exponential, and parabolic, and found that the forming limits are different 
for different profiles (lowest for circular and highest for exponential). Based on the 
above observation, they concluded that the forming limits depend on history of 
forming in ISMF. 

Hamilton and Jeswiet (2010) studied and analyzed the effect of forming at high 
feed rate and tool rotational speed in SPIF. They concluded that tool rotational 
speed does not have significant effect during SPIF and forming at high feed 
(5,080-8,890 mm/min) produced similar thickness distribution as of low feed. 
Thus, higher feed rate (less forming time) can be used to reduce the forming 
time, making ISMF process suitable for industrial applications. During the study 
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Fig. 19 Funnel geometry to 
evaluate formability 



of the micro structure, it was observed that the change in grain size after forming is 
dependent on the step size, spindle rotation speed, and feed rate. Cao et al. (2008) 
presented a comprehensive review along with the advances and challenges in 
incremental forming including formability. They used Oyane criterion (Oyane 
1980) along with FEA as well as simple force equilibrium analysis to predict the 
forming limits. The Oyane model includes the effect of hydrostatic stress history on 
occurrence of the ductile fracture as given below: 


/ = — (ci + ds (3) 

Jo v ° J 

Here constants c x and c 2 have to be obtained experimentally. They conducted the 
necessary experiments and obtained the values of c x and c 2 for AA5052-O material. 
The FLC experiments for the SPIF were carried out on a CNC machining center 
using straight groove geometry and tool path similar to the one shown in Fig. 13. 
Malhotra et al. (2012a) analyzed the fracture behavior in SPIF by using a fracture 
model in FEA to predict forming forces, thinning, and fracture. The material model 
was calibrated using the forming force history for a cone shape. The model was 
validated using a funnel shape (Hussain and Gao 2007; Hussain et al. 2007a) 
(varying wall angles as depth increases) by comparing experimental forming 
forces, thinning, and fracture depths with predicted values from FEA. It was 
concluded that fracture in SPIF is controlled by both local bending and shear. 
Local stretching and bending of sheet around the hemispherical tool causes higher 
plastic strain on the outer side of the sheet resulting in increased damage on the 
outer side as compared to the inner side. Greater shear in SPIF only partially 
explains the increased formability as compared to conventional forming. The 
local nature of deformation in SPIF is the root cause of increased formability as 
compared to conventional forming. 

Bhattacharya et al. (2011) conducted experiments to study the effect of incre- 
mental sheet metal forming process variables on maximum formable angle. 
Box-Behnken method is used to design the experiments for formability study. 
From the experimental results it was concluded that during incremental forming, 
the formability decreases with increase in tool diameter. The formable angle first 
increases and then decreases with incremental depth. The variation in the formable 
angle is not significant in the range of incremental depths that are used in their work 
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Fig. 20 (a) Schematic showing the consideration of overlap in thickness calculation, (b) Com- 
parison of predictions of formable angle using triaxiality criterion with the experimental results 


to produce good surface finish. To predict thickness during the SPIF, a simple 
model is proposed considering the overlap in deformation (Fig. 20a) and thus 
estimated the deformed sheet thickness more accurately. They used the proposed 
thickness prediction model along with a simple analytical model based on the force 
equilibrium to obtain the stress components at any section during SPIF. Ratio of 
mean stress value to the yield stress value for all the experiments conducted for 
formability study at the maximum formable angle is calculated by them using the 
force equilibrium analysis, and its value is found to be very close to unity at all 
conditions, hence, used the same as the failure criterion. Maximum formable angles 
predicted using a triaxiality criterion mentioned above for plane stretching condi- 
tions are in very good agreement with the experimental results (Fig. 20b). Duflou 
et al. (2007b, 2008a) have demonstrated that the use of local heating (using NdYAG 
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Fig. 21 Tool configuration 
used to form truncated 
pyramid in positive forming 




Fixture 



Central support 


Guide columns 


laser) ahead of tool reduces the forming forces and enhances the formability of the 
material. They used TiA16V4 sheets of 0.6 mm thickness. Cone with 56° wall angle 
and 30 mm depth is successfully formed with local heating, whereas the maximum 
angle that could be formed without heating using the same parameters is only 32°. 

Park and Kim (2003) studied the formability of annealed aluminum sheet by 
forming different shapes using both positive and negative incremental forming. 
While forming a truncated pyramid with negative single-point incremental forming, 
cracks occurred at the corners due to biaxial deformation, but no cracks appeared 
during positive forming (in positive forming a suitable jig support is used - Fig. 21). 
Strain measurements with the help of a grid indicated that in negative forming both 
plane strain and biaxial stretching are present whereas in positive forming only 
plain strain stretching is present. Further extending their work to more complex 
geometries using positive forming, they concluded that the formability in positive 
incremental forming is better as the deformation occurs under plane strain condi- 
tions in addition to providing the capability to form sharp comers. Designing of 
support jig is a challenging task and depends on the components geometry. One can 
make use of pattern support to replace the jig necessary during positive incremental 
forming. 

Recently, Jackson and Allwood (2009) have conducted experimental study to 
explain the deformation mechanism in SPIF (Fig. 22a) and TPIF (Fig. 22b, with full 
pattern support) to that of forming the same geometry using conventional pressing 
operation and to evaluate the validity of sine law and in turn to relate the thickness 
measurements to the deformation mechanics. For the abovementioned purpose, 
they used copper plate of 3 mm thickness, cut into two halves. Cut section passes 
through the center of the component. Faces of each half are machined flat and 
1.5 mm x 1.5 mm grid pattern was marked. Later these plates were brazed together 
and used for forming conical components by SPIF, TPIF, and pressing. After 
forming the components are heated to separate the two halves. Then the formed 
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Fig. 22 Incremental forming configurations used for experimental study and grid for measuring 


geometry, thickness, and strains are measured. One half of the component with grid 
pattern is shown in Fig. 22c. Note that the measurements of abovementioned 
quantities can be carried out before joining the sheets together and after separating 
them once the component is formed. Hence, evolution of strain at any instance 
cannot be measured, but the final strains representing the final geometry with 
respect to initial geometry can be measured. Strain values were calculated by 
measuring the relative movement of the points of intersection of the grid before 
and after the deformation. Local coordinate system shown in Fig. 22c is used for 
strain measurement, where m, 9, and t are meridional, tool movement, and thick- 
ness directions, respectively. After analyzing the measured thickness, geometry, 
and strains, they concluded that in both SPIF and TPIF, deformation is a combina- 
tion of stretching as well as shear and the same increases with the successive tool 
laps in the meridional direction with the greatest strain component being shear in 
the tool movement direction and shear occurs perpendicular to the tool movement 
direction in both SPIF and TPIF but is more significant in SPIF. Deformation 
mechanism in SPIF and TPIF is significantly different from the idealized mecha- 
nism of shear spinning on which sine-law thickness is proposed. One can easily 
realize the same in SPIF as there is no support similar to spinning but the support is 
present in TPIF. Critical analysis of measurements indicated that grid lines in the 
thickness direction more or less remain axial in ideal mechanism, but not after 
TPIF. In addition, circumferential shear is present in TPIF which is absent in the 
ideal mechanism. Above observations clearly indicate that there is a radial move- 
ment of through-thickness sections during deformation; hence, there is a significant 
difference in measured and ideal thickness values in both the processes. 

Single-pass SPIF can successfully form wall angles up to 70° for various 
aluminum as well as steel sheets of 1 mm initial thickness (Jeswiet et al. 2005a) 
using suitable process parameters such as incremental depth, tool diameter, 
and forming speed. However, forming a wall angle close to 90° using SPIF is 
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Fig. 23 Multistage forming strategy using TPIF to increase formable wall angle 

challenging. There have been a few attempts to increase the maximum formable 
angle by using multi-pass single-/two-point incremental forming (Hirt et al. 2004; 
Skjoedt et al. 2008; Duflou et al. 2008b). Hirt et al. (2004) proposed and 
implemented a multi-pass forming strategy with partial support die (Fig. 23) to 
form components with steep walls which cannot be formed using single-pass 
incremental forming. The stages involved in their strategy are: 

1. First, form the shallow angle component with final desired component height 

(Fig. 23b). 

2. Next, the wall angle is increased in steps of 3-5° by moving the forming tool 

alternatively upwards and downwards as shown in Fig. 23c, d. 

Using the above strategy, first, they made a pyramidal component with 45° wall 
angle in the stage 1 and then formed up to 81° wall angle. Thickness measured by 
them revealed that the thickness at any location along the wall is more than the 
thickness predicted by sine law for 81°. This can be attributed to the availability of 
more material as the component is formed to the final desired height in stage 1 and 
upward movement of tool in later stages. In addition, they performed finite element 
analysis along with damage criterion of Gurson-Tvergaard-Needleman to predict 
the damage evolution during incremental forming process and predicted that the 
damage increases with increase in tool diameter as well as incremental depth. 

Skjoedt et al. (2008) proposed a five- stage strategy as shown in Fig. 24 to form a 
cylindrical cup with a wall angle of 90° with height/radius ratio equal to one but the 
components fractured in either the fourth stage or fifth stage. They used two 
approaches, namely, Down-Down-Down-Up (DDDU, Fig. 24a) and Down-Up- 
Down-Down (DUDD, Fig. 24b). They demonstrated that the thickness variation 
is dependent not only on tool path but also on its direction (downwards or upwards) 
in each stage. In one of the above strategies, i.e., DUDD, component failed in the 
fourth stage, while in the other one, i.e., DDDU, it was successful up to the fourth 
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Fig. 24 Multi-pass strategies used by Skjoedt et al. (2008) 


stage, but the component formed did not confine to the designed shape. In addition, 
they used the profiles for all the stages with the required component depth similar to 
that used by Hirt et al. (2004). Note that the component height is constrained by the 
jig used in TPIF (Hirt et al. 2004) but the material shifts down during down pass and 
results in more depth than the required component in SPIF. 

Duflou et al. (2008b) stated that the only way to achieve large wall angles was to 
aim for material redistribution by shifting material from other zones in the blank to 
inclined wall regions. They deformed the region of the workpiece area that was 
originally unaffected in single-pass SPIF tool paths (Fig. 25) to form vertical walls 
without leading to failure. Their tool paths always moved from the periphery 
towards the center of the sheet and all the stages are to a depth of required 
component. Hence, the stepped features (Abhishek 2009) have resulted using the 
tool path used by (Duflou et al. 2008b). Multistage incremental forming consists of 
a number of intermediate stages to form the desired geometry. To propose inter- 
mediate stages, it is very important to predict the shape that actually forms after 
each intermediate stage to select the profile for next stage as well as the direction of 
tool movement (i.e., in-to-out or out-to-in). It is very well known that when the tool 
is moved from out-to-in during any stage, the material which is present ahead of it 
moves down like a rigid body. The stepped feature formed at the bottom of the final 
component (Skjoedt et al. 2008; Duflou et al. 2008b) during multistage SPIF is a 
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Fig. 25 


(a) Five stages of tool path to form vertical walls, (b) Stepped features (Abhishek 2009) 


result of accumulated rigid body translation during the deformation of the interme- 
diate shapes. Abhishek (2009) and Malhotra et al. (2011a) have proposed a meth- 
odology using a combination of out-to-in (OI) and in-to-out (IO) tool paths (Fig. 26) 
in one pass and by selectively deforming certain regions in an intermediate pass. 
Here, “OI tool path” refers to the tool moving from the outer periphery of the sheet 
towards the center of the sheet while moving down in the z-direction. The “IO tool 
path” refers to the tool moving from the center of the sheet towards the outer 
periphery while moving up or staying at a constant depth in the z- direction. They 
(Abhishek 2009; Malhotra et al. 2011a) used a seven-stage strategy to successfully 
form a cylindrical component with height to radius ratio equivalent to one. For each 
tool path the first number denotes the stage number, the number after the dash 
denotes the order of execution, and the arrow shows the direction of tool path. Note 
that some regions are not deformed in stages 4, 5, and 7. Abhishek (2009) has 
extended the methodology to make hemispherical and ellipsoidal components. Note 
that stepped features are avoided in the works of Abhishek (2009) and Malhotra 
et al. (2011a). Very recently, rigid body movement during in-to-out and out-to-in 
tool paths is modeled analytically and validated using finite element simulation for 
different materials (Xu et al. 2012). 

Kim and Yang (2000) compared two double-pass strategies for SPIF. One was 
based on linear blending, i.e., the intermediate shape was calculated to be at a height 
of 0.5 times the final height. In the other approach, they calculated the intermediate 
shape so that highly deformed regions in the final shape were subjected to lesser 
deformation in the intermediate shapes. Their methodologies yielded better thick- 
ness distributions than single-pass forming. They concluded that the double-pass 
forming method results in improved formability as well as higher mechanical 
strength of the formed component. Young and Jeswiet (2004) experimentally 
studied the effect of single-pass and double-pass forming strategies on the thickness 
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Fig. 26 Successful multistage strategy to form vertical wall without formation of stepped 
features: (a) tool path, (b) profile comparison, and (c) sectional view of component 


distribution in forming a 70° wall angle cone. They concluded that double-pass 
strategy causes marked thinning at the flange near the backing plate. In addition, 
they concluded that the sine law will not predict the thicknesses correctly when 
multiple passes are used for forming angles exceeding 40°. For better understanding 
of formability in terms of wall angle for different materials and process variables, a 
Table 1 is provided below: 


Accuracy and Surface Finish 

Accuracy of the component during incremental sheet metal forming is affected by 
bending of sheet between the clamped boundary and component opening, contin- 
uous local springback that occurs as soon as tool moves away from a point, and 
global springback that occurs after the tool retraction and unclamping of the 
component. Comparison between the ideal geometry and the possible final geom- 
etry during SPIF due to the abovementioned effects is schematically represented in 
Fig. 27. One can reduce the deviation between the ideal and formed geometries due 
to bending that occurs close to clamped boundary in SPIF by providing a suitable 
support/b ackup or can be eliminated by using a pattern support. 
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Table 1 Maximum formed wall angle for various parameters 


S. No. 

Material 

Sheet 
thickness 
t (mm) 

Tool 
diameter 
D (mm) 

Incremental 
depth 
Az (mm) 

Formed angle (degrees) 

1 

Al 3003-0 

2.1 

— 

— 

78.1 (Jeswiet 
et al. 2005a) 

2 


1.3 

— 

— 

72.1 (Jeswiet 
et al. 2005a) 

3 


1.21 

— 

— 

71 (Jeswiet 
et al. 2005a) 

4 


0.93 

— 

— 

67 (Jeswiet 
et al. 2005a) 

5 

Al 5754-0 

1.02 

— 

— 

62 (Jeswiet 
et al. 2005a) 

6 

Al 5182-0 

0.93 

— 

— 

63 (Jeswiet 
et al. 2005a) 

7 

AA 3003-0 

2.1 

4.76 

1.3 

77 (Ham and Jeswiet 
2006) 

8 


1.3 

12.7 


75 (Szekeres 
et al. 2007) 

9 


1.2 

10 

0.5 

73 (Ambrogio 
et al. 2007a) 

10 

AA 1050-0 

1.0 

18.0 

0.3 

70 (Ambrogio 
et al. 2006) 

11 


1 

12 

1 

70 (Ambrogio 
et al. 2007a) 

12 

Aluminum 

0.91 

8.0 

0.15 

70 (Hussain 
et al. 2007a) 
(exponential profile) 

13 

Steel-DDQ 

1.0 

— 

— 

70 (Jeswiet 
et al. 2005a) 

14 

Steel-DC04 

1 

12 

1 

70 (Ambrogio 
et al. 2007a) 

15 

LY 12 M 

0.91 

8.0 

0.15 

66.0 (Hussain 
et al. 2007b) 


Hirt et al. (2004) proposed an iterative method to improve geometrical accuracy. 
First, the formed component was measured using coordinate measuring machine 
(CMM) and compared with intended geometry. Deviation vector between mea- 
sured and target points is generated and the same is used to modify the tool path. 
Ambrogio et al. (2004) have studied the effect of tool diameter and incremental 
depth on the geometrical accuracy in SPIF by conducting series of experiments and 
concluded that the accuracy is better while using smaller diameter tools and lower 
incremental depth. Note that the use of lower incremental depth increases the 
forming time. In addition, they also stated that elastic springback and bending in 
absence of a backing plate/die is the major cause for geometrical inaccuracies. 
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Fig. 27 Schematic representation of ideal and formed profiles 


Importance of a proper tool path for obtaining desirable geometrical accuracies in 
the formed component is emphasized, and a modified tool path is proposed to 
enhance the accuracy by forming a component with higher wall angle than the 
desired angle up to some depth and then forming the remaining component with the 
desired angle. Conical components formed using the above strategy enhanced 
the accuracy by reducing the bending near component opening (Fig. 28). Note 
that the modified tool path with larger angle than the desired starts deforming the 
sheet at somewhat distance away from the clamping region than the ideal tool path. 
Ambrogio et al. (2007b) studied the influence of process parameters (tool diameter, 
step size, sheet thickness) on springback and pillow effect (Fig. 29) in SPIF by 
forming a truncated pyramid with square base. Pillow effect is compensated by 
overbending. This in turn increases the bending at the opening region although it 
reduces the deviation in the wall region. 

Tanaka et al. (2007) have used a backup tool that comes in contact with the work 
piece intermittently (following sinusoidal path) during SPIF. Finite element anal- 
ysis has demonstrated that this strategy reduces the residual stress gradients and in 
turn enhances the component accuracy. Duflou et al. (2007b, 2008a) made an 
attempt to improve the geometrical accuracy by applying local heating principle. 
Improvement in geometrical accuracy was realized by achieving more localized 
forming effect (due to local heating) and reduced residual stress levels which 
reduces the springback effects (Fig. 30). 

Allwood et al. (2009) suggested a closed-loop control strategy (Fig. 31) that uses 
spatial impulse responses to enhance the product accuracy in single-point incre- 
mental forming with feedback provided by a stereovision camera. They used a 
Weibull distribution curve to impulse responses from a set of experiments for a 
conical component and then used the same to form similar conical components 
with =b 0.2 mm accuracy prior to unclamping the component. They measured and 
reported that the formed component deviate around 2-3 mm from that of intended, 
without using the feedback. Their strategy requires an iterative process in which a 
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Fig. 28 Schematic representation of tool path strategy adopted by Ambrogio et al. (2004) to 
reduce bending in SPIF 


Fig. 29 Schematic 
illustrating bending and 
pillow effect observed 
during SPIF 




Fig. 30 Schematic to show 
the effect of local heating on 
accuracy 
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Fig. 31 Block diagram of closed-loop strategy proposed by All wood et al. (2009) 


few trials were first conducted for a component, and based on the spatial impulses 
obtained, tool path compensation was done to obtain a better component. 

Behera et al. (2013) proposed a compensated tool path generation methodology 
for SPIF to improve the accuracy by using multivariate adaptive regression splines 
(MARS) as an error prediction tool. The MARS generates continuous error 
response surfaces for individual features and feature combinations. Two types of 
features (planar and ruled) and two feature interactions (combinations of planar 
features and combinations of ruled features) were studied with parameters and 
algorithms to generate response surfaces. The method has two stages: (i) training 
stage and (ii) MARS model generation stage. In training stage formed components 
are scanned to generate point cloud and are compared with available point cloud of 
stereolithography (STL) model to generate accuracy reports and used the same to 
build MARS engine (Fig. 32). Feature-assisted single-point incremental forming 
(FSPIF) module detects the features and MARS engine adjusts these features and 
generates adjusted STL models in second stage (Fig. 33). The components formed 
from these adjusted models are again compared with original STL model to 
generate reports for validation. The validation results show average deviation of 
less than 0.3 mm, and the maximum deviation for horizontal nonplanar feature is of 
0.72-0.99 mm. Although the methodology reasonably enhances the part accuracy, 
the process of implementation of MARS system is difficult and limited. Also, for 
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Fig. 32 Flowchart illustrating the generation and use of strategy proposed by Behera et al. (2013) 
to improve accuracy in SPIF - generation stage 



Fig. 33 Flowchart illustrating the generation and use of strategy proposed by Behera et al. (2013) 
to improve accuracy in SPIF - usage stage 
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parts with wall angles close to maximum formable angle, the compensation of the 
STL file results in a compensated geometry having zones with wall angle greater 
than maximum formable angle. 

Twist is observed in incremental forming of components and its quantity is 
dependent on the type of configuration and other process parameters involved. 
Twist in incrementally formed sheet metal components is first reported by 
Matsubara (2001) while forming cone as well as pyramid shapes using two-point 
incremental forming (TPIF). Note that in TPIF, tool moves from inside (starting 
from fixed/support tool) to outside. Due to this reason, already formed region of the 
part is compelled to tilt/rotate about the fixed tool. Here, fixed tool acts like a pivot. 
Matsubara (2001) reported twist as high as 30° in TPIF when the tool path direction 
is kept same during each step. To reduce this twist, tool path direction is reversed 
between consecutive contours. Alternative tool movement directions are used by 
many researchers to reduce the twist in incremental forming (Jeswiet et al. 2005a). 
Although twist accumulation can be reduced by alternating the tool path direction, 
it also results in the deterioration of surface quality at the location of contour 
transition. In addition, spiral tool path (Jeswiet et al. 2005a) is better for product 
quality and to enhance formability. Very recently, Duflou et al. (2010) and Vanhove 
et al. (2010) have studied the twist in SPIF of pyramid and conical shapes using 
unidirectional tool path. Note that in SPIF, tool moves from outside to inside and no 
fixed tool or any support is present as in TPIF. Twist quantified by them (Duflou 
et al. 2010; Vanhove et al. 2010) is in terms of angle by drawing appropriate lines 
(radial lines in case of cone) prior to forming and measuring its deviation after 
forming. They classified the twist in two categories, namely, conventional (that 
occurs at low wall angles) and reverse (close to formability limits). They concluded 
that at lower wall angles, tangential force on the deforming sheet results in twist in 
the tool path direction and is named as conventional twist. Asymmetric strain 
distribution along the meridional direction at higher wall angles for pyramidal 
structures was observed and the twist started reducing and even reversal is reported. 
They also concluded that the geometrical features like ribs/corners have significant 
influence on the twist. However, they reported that the twist is independent of tool 
diameter, rotation speed of the tool, and tool feed rate. They concluded that the twist 
along tool movement direction increases with wall angle up to some value and then 
the trend reverses with further increase in wall angle. Asghar et al. (2012) have 
carried out experimental and numerical analysis to study the effect of process 
variables on twist in incremental forming and concluded that the twist increases 
with increase in incremental depth and decrease in tool diameter and sheet thick- 
ness. Feed rate effect is insignificant. Numerical predictions are in good qualitative 
agreement with experimental results. 

Surface finish of the formed component is of equal importance as that of 
accuracy. Hagan and Jeswiet (2004) carried out experimental study by forming 
conical parts with contour tool paths to measure surface roughness in SPIF. 
Incremental depths varying from 0.051 to 1.3 mm are used to form a conical 
component of 45° wall angle using 12.7 mm tool diameter with a feed rate of 
25 mm/s. Peak to valley heights measured between 5 and 25 p correspond to 
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incremental depths between 0.051 and 1.3 mm. In addition, spindle speeds varying 
from 0 to 2,600 rpm were used. They concluded that it does not have significant 
effect, but a minimum surface roughness was observed around 1,500 rpm. Ham 
et al. (2009) carried out experiments to study the effect of incremental depth and 
tool diameter on the surface roughness of components formed using contour tool 
paths and observed that usage of smaller tools results in more distinctive cusps. As 
the tool size increases, the increased contact area between the tool and work for the 
same incremental depth results in greater overlap of the tool with the previously 
formed material; hence, the surface cusps become indistinguishable with increase 
in tool diameter. Singh (2009) and Bhattacharya et al. (2011) have studied the 
influence of wall angle (20, 40, and 60°), tool diameter (4, 6, and 8 mm), and 
incremental depth (0.2, 0.6, and 1 mm) on surface finish by forming conical 
components with spiral tool path keeping sheet thickness, feed rate, and other 
parameters constant. Design of experiments was carried out using full factorial 
design and the best surface finish (in terms of R a ) obtained by them in the range of 
parameters used is 0.3 p. Empirical equation that relates surface roughness (Ra) to 
process variables is reproduced below: 

R a = 8.41 - 0.069a - 2.14d + 9.13Az + 0.0035ad + 0.0191aAz 

- 0.417dAz + 0.00005.7a 2 + 0.153d 2 - 4.66Az 2 (4) 

They concluded that surface roughness decreases with increase in tool diameter 
for all incremental depths and it happens due to the increase in overlap between the 
neighboring tool paths with increase in tool diameter. In addition, up to certain wall 
angle, surface roughness value initially increased with increase in incremental 
depth and then decreased. With further increase in angle, surface roughness 
increased with increase in incremental depth. It was observed that the undeformed 
region between successive tool paths is more at lesser wall angles and higher 
incremental depth. Note that the overlap increases with increase in wall angle 
whereas contact area of the tool reduces for a given incremental depth. The average 
surface roughness (R a ) achieved by SPIF at different process parameters is sum- 
marized in Table 2. 


Double-Sided Incremental Forming 

Double-sided incremental forming (DSIF) uses one tool each on either side of the 
sheet. As stated earlier, the deforming and supporting roles of each tool will keep 
changing depending on the geometry. DSIF configuration enhances the complexity 
of the components that can be formed and reduces many of the limitations associ- 
ated with incremental forming. Cao et al. (2008) mounted two tools on a single rigid 
C-frame (Fig. 34) to demonstrate DSIF to form features on both sides of initial 
plane of sheet. They introduced squeeze factor as ratio between tool gap and initial 
sheet thickness and reported that decrease in squeeze factor improves the 
dimensional accuracy of the formed component (relative error of 46.6-28.4 %, 
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Table 2 Average surface roughness at different process variables in SPIF 


S. No. 

Material 

Wall angle 
(degrees) 

Tool diameter 
(mm) 

Incremental 
depth (mm) 

Surface roughness 
(R a ) (microns) 

1 

Al 3003 

45 

12.7 

0.051 

0.5 (Hagan and 
Jeswiet 2004) 

2 


45 

12.7 

1.3 

5.0 (Hagan and 
Jeswiet 2004) 

3 

AA 

60 

4 

1.0 

2.99 (Singh 2009) 

4 

5052-0 

60 

8 

0.2 

0.30 (Singh 2009) 

5 


40 

4 

1.0 

3.91 (Singh 2009) 

6 


40 

8 

0.2 
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Fig. 34 Double-sided incremental forming using a C-clamp to mount both the tools (Cao 
et al. 2008) 

for squeeze of CM-0 % sheet thickness). Their group (Malhotra et al. 2011b) further 
studied the effect of squeezing on geometrical accuracy for conical component with 
fillet (65° wall angle, depth 36 mm) using two independently moving tools on either 
side of the sheet. Note that the squeeze factor definition here is the ratio between 
tool gap at any instant and the expected thickness using sine law at that location. 
The contact condition of the bottom tool (support tool) is improved with decrease in 
squeeze factor, but the effect of squeeze factor on part accuracy is not consistent. 

Malhotra et al. (2012b) have proposed another DSIF strategy (accumulative 
DSIF) using in-to-out (10) tool path strategy (Malhotra et al. 2011a) with displace- 
ment controlled forming and support tools. Using ADSIF, they formed cones of 40° 
and 50° angles and reported maximum shape deviation of 1.15 mm. Forming time, 
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Fig. 35 Block diagram showing the interaction of different modules to generate the tool path for 
roboforming (Meier et al. 2012) 

using ADSIF, increases drastically to achieve the desired depth and geometry of the 
component as it uses only in-to-out tool paths. Meier et al. (2011) used two robots to 
control different tools on either side of sheet and termed the process as duplex 
incremental forming (Fig. 8). They used position control for the forming tool and 
combination of force and position control for the support tool to maintain contin- 
uous contact. They used a vision-based measurement technique to estimate devia- 
tions from ideal profile and modified the tool path to reduce the deviation. They 
made use of FEA predictions also to modify the tool path and reported significant 
improvement of accuracy in the wall region. But, the improvement in accuracy in 
component opening region and bottom regions is marginal (deviation in the range 
of 0. 5-1.0 mm). Due to the superimposed pressure from support tool, 12.5 % 
increase in formability has been reported. Meier et al. (2012) presented an inte- 
grated CAx (CAD, CAM, CAE) process chain (Fig. 35) for the robot-assisted 
incremental forming process, called as roboforming, to quickly realized the path 
planning and simultaneously raise the geometrical accuracy using different com- 
pensation methods. Commercial CAM system with additional features has been 
developed for two synchronized tool paths according to different forming strate- 
gies. A simulation model uses this tool path for animation of robot movements and 
to ensure the experimental safety. Forming results are forecasted using the tool path 
in an established FEM model, which are fed back to the CAM program. After the 
comparison with the target geometry, the geometrical deviations were used to 
adjust the tool paths. They reported reduction of profile deviation from 1.08 to 
0.2 mm, but FEA consumes about 10 h for simulation; thus, the real-time compen- 
sation is difficult to implement. Recently, DSIF machine has been designed and 
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developed at Indian Institute of Technology Kanpur and many studies are in 
progress (Srivastsava 2010; Koganti 2011; Asghar 2012; Shibin 2012; Lingam 
2012). Very recently, Northwestern University, Evanston, has also developed a 
DSIF machine. 


Tool Path Planning 

In incremental forming, tool path plays a significant role on the forming limits, 
component accuracy, surface quality, thickness variation, and forming time. Many 
attempts have been made to study various tool path strategies (contour, spiral, 
radial, and multiple passes - in-to-out and/or out-to-in) and their effect on forming 
limits, thickness distribution, accuracy, and surface quality in various variants of 
incremental forming (SPIF, TPIF, DSIF), and the same has been presented in the 
earlier sections. Hence, certain aspects are not discussed in detail in this section. As 
presented earlier, there are two types of tool path used for ISMF process , namely, 
contour and spiral (Fig. 2a, b). Most of those tool paths have been generated using 
surface milling modules of commercial CAM packages developed for machining. 
Deformation is biaxial at the starting and end points of each contour in contour tool 
path and is near to plane strain in between; hence, the tendency for fracture at the 
start and end points of each contour is higher. Contour tool path leaves stretch 
marks at the start points of each contour (Fig. 2c). To avoid equi-biaxial stretching 
and the tool marks at the end points of each contour, Filice et al. (2002) have 
suggested the use of a spiral tool path (Fig. 2b). In SPIF, tool (either contour or 
spiral) moves from out-to-in and represented as conventional strategy in Fig. 36a. 
Bambach et al. (2005) have proposed a conical strategy in addition to conventional 
strategy. In conical strategy, tool movement starts at the center and opens up with 
increasing depth. They studied two in-plane tool movement strategies (contour, 
Fig. 36b, and radial, Fig. 36c) coupled with two z-movement strategies, conven- 
tional and conical strategies. Considering the time required for producing the part 
and the uniformity of sheet thickness throughout the part, they concluded that the 
conventional/contour strategy is better (2 min production time with comparable 
thickness distribution) as compared to radial strategies (9 min for cone/radial and 
40 min for conventional radial trajectory). Time required to form the component 
reported by them can be easily understood with the schematic representations 
presented in Fig. 36. 

Kopac and Kampus (2005) studied the effect of various in-plane tool movement 
strategies for axisymmetric components. They examined four procedures in terms 
of tool movement, namely, (A) from exterior to interior, (B) from interior to 
exterior, (C) first in the center then from exterior to interior, and (D) first in the 
center then from interior to exterior, and reported that the maximum depth of 
forming can be achieved with case D, i.e., first in the center then from interior to 
exterior. Ambrogio et al. (2005b) developed a tool path modification strategy by 
integrating an on-line measuring system and tool path to be followed for forming 
the remaining portion of the component under consideration. For achieving the 
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Fig. 36 Different tool path strategies used by Bambach et al. (2005): (a) conventional and 
conical, (b) contour tool path, (c) radial tool path, (d) conventional-contour, and (e) conical- 
contour strategies 

same, they utilized the combination of online measuring techniques, numerical 
simulation (Deform3D), and optimization techniques to modify the tool path. Their 
path-correcting methodology makes use of the measurement of the previous refer- 
ence point on a given section of the spiral and its comparison to the desired position 
at the final and current step. Three-dimensional spiral tool paths for forming 
asymmetric components (Skjoedt et al. 2007) are generated by first generating 
contour tool paths using the milling module available in Pro/ENGINEER and 
then interpolating between the individual contours to produce a single 3D spiral. 
They used constant incremental depth for generating the contour tool path. 
Stereolithography (STL) files are used to develop an iterative tool path generation 
methodology (contour tool path was generated using a combination of forming, 
scanning, and reforming) (Verbert et al. 2007). It is well known that STL format has 
inherent chordal errors and iterative nature increases time. Attanasio et al. (2006, 
2008) have investigated two types of tool paths, one with a constant incremental 
depth (Az) and the other with a constant scallop height (h) (Fig. 37). It was reported 
that the surface quality of the formed component improved by decreasing the 
values of incremental depth (Az). A shortcoming of their approach is that 
when the value of Az is arbitrarily reduced to a very small value, the forming 
time increases, especially when larger components are to be formed. This kind of 
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Fig. 37 Schematic of tool path with (a) constant incremental depth and (b) constant scallop 
height 


approach does give the designer the freedom to form a component quickly with 
acceptable surface finish. 

Malhotra (2008), Malhotra et al. (2010) developed and implemented a platform- 
independent methodology for generation of contour and spiral tool paths for an 
arbitrary component formable by SPIF. The methodology takes neutral part format 
STEP AP203/AP214 of CAD model as input. Adaptive slicing techniques used in 
layered manufacturing (Pandey et al. 2003) and 3D spiral tool path generation 
methodology for surface milling of freeform shapes with constant scallop height 
(Lee 2003) have been modified and used for generating tool paths. Tool path 
methodology developed by them for single-point incremental forming addresses 
the trade-off that exists between geometric accuracy, surface finish, and forming 
time. Steps involved in their tool path generation methodology are: 


1 . 

2 . 

3. 


4. 

5. 

6 . 


Generation of contour tool path with a constant incremental depth. 

Calculation of volumetric errors and scallop heights to form components with 
good accuracy and surface finish. 

Usage of adaptive slicing criterion to evaluate the need for new contour 
insertion. 

(a) Repeat steps 2-4 for the inserted contour and the one above when the new 
contour is inserted. 

(b) Otherwise move down to the next pair of contours and perform steps 3 and 
4 till all the contours are finished. 

Perform deletion of slices to get the final contours. Here, extra slices are deleted, 
which will not adversely affect the accuracy by evaluating the volumetric error 
percentage between slice n and slice n + 1 . 

Generation of final spiral path using the finalized contours. 

Apply the tool radius compensation (Fig. 38) using local geometrical conditions 
and generate the tool path for forming the components. 
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Fig. 38 Schematic showing 
the importance of tool radius 
compensation 




b 




Fig. 39 Comparison of ideal and measured profiles: (a) with and (b) without tool radius 
compensation 


The adaptive slicing methodology along with volumetric error criterion 
enhances the conformity between the formed and the desired components and the 
same can be seen from Fig. 39. 

Azaouri and Lebaal (2012) proposed a tool path optimization strategy for SPIF 
for finding the shortest tool path (less forming time) which distributes the material 
as evenly as possible throughout the part. As a consequence of volume conser- 
vation, the part with the lowest reduction in sheet thickness would have a 
homogeneous distribution of thickness. FEA, in combination with response 
surface method and sequential quadratic programming algorithm, was used for 
determining the optimal forming strategy. The authors reported a better thickness 
distribution (minimum thickness: 0.91-0.96 mm with optimization and 0.79 mm 
without optimization, for an initial sheet thickness of 1.5 mm) and less forming 
forces. However, in order to achieve the shortest path, the accuracy and surface 
finish of the component get deteriorated. Many (Cao et al. 2008; Hirt et al. 2004; 
Skjoedt et al. 2008, 2010; Duflou et al. 2008b; Abhishek 2009; Malhotra 
et al. 2011a; Xu et al. 2012; Kim and Yang 2000; Young and Jeswiet 2004) 
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have proposed tool paths for other variants of incremental forming, and 
their details are presented in earlier sections dealing with formability, thickness 
distribution, and accuracy. 


Summary 

Incremental sheet metal forming is gaining importance in automobile (Governale 
et al. 2007), aerospace (Jeswiet et al. 2005a), and biomedical industries (Ambrogio 
et al. 2005c) and even for processing recycling panels (Jackson et al. 2007; Takano 
et al. 2008) and producing dies/molds quickly using complex sheet metal surfaces 
produced by incremental forming at very less expense (Allwood et al. 2006). 
Accuracy of the component during incremental sheet metal forming is affected 
by bending of sheet between the clamped boundary and component opening, 
continuous local springback that occurs as soon as tool moves away from a point, 
and global springback that occurs after the tool retraction and unclamping of the 
component. Predicting springback using numerical analysis and compensating for 
the same during tool path generation is a time-consuming process due to the nature 
of process. Development of DSIF enhanced the product complexity to form parts 
with double curvature on both sides of the initial sheet plane without any additional 
setups with better accuracy than other incremental forming configurations. 
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Abstract 

The combined sheet and bulk forming process is a key technology for the near- 
net- shape forming of lightweight metal components with thin and multi wall 
thickness. In this forming process, bulk forming process, i.e., a compressive 
forming process, is applied to the sheet or plate workpiece in combination with 
the conventional sheet forming (stamping) process to minimize the material 
waste and manufacturing cost. “Spin forming” (flow forming, shear forming) 
as well as “combined stamping and forging” are key technologies in this type of 
the process. 

The flow forming process can reduce the wall thickness of rotating cylindrical 
hollow workpiece into an intended value incrementally, and shear forming can 
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form various kinds of conical/semispherical/cup- shaped hollow components 
from flat disk-shaped workpiece of sheet metal. The advantages of spin forming 
are low cost, simple tooling, easy lubrication during forming, high flexibility in 
forming, and applicability to cold forming of high-strength and difficult-to-form 
materials. The optimization of the forming process, forming parameters, tooling 
design, and initial workpiece design is required for successful spin forming. 

The combined stamping and forging process includes the local compression 
of workpiece in its thickness direction for reducing the wall thickness locally in 
combination with some forming mode of conventional stamping of sheet metal. 
In this process, various kinds of thin-walled 2D/3D components with multi 
thicknesses, ribs, pins, and webs can be formed in near net shape from sheet 
workpiece of uniform thickness. The optimizations in forming sequence in 
multistep forming and tooling design are essential for successful forming 
practice. 

Introduction 

The combined sheet and bulk forming process has a high potential for the near-net- 
shape forming of lightweight metal components with thin and multiwall thickness. 
In this forming process, bulk forming, i.e., a compressive forming process, is 
applied to sheet or plate workpiece, in combination with the conventional sheet 
forming (stamping) process to minimize the material waste and manufacturing cost. 
The current work focuses mainly on “spin forming” and “combined stamping and 
forging process” which are key technologies in the combined sheet and bulk 
forming process. 

The spin forming process includes the flow forming and shear forming of 
axisymmetric hollow components. This process has high potential for near-net- 
shape forming of high-strength and lightweight hollow components with thin wall 
of multi thicknesses. In the flow forming process, the wall thickness of rotating 
tubular (cylindrical) workpiece is reduced incrementally into intended value 
between the roller(s) and mandrel. The internal diameter of workpiece remains 
constant by a mandrel which is installed inside workpiece. In the shear forming, 
various kinds of conical, semispherical, and cup-shaped hollow components can be 
formed from flat disk-shaped sheet by using forming roller(s) and mandrel. It is 
important to optimize major process parameters and the initial workpiece (perform) 
for successful spin forming. 

The combined stamping and forging process utilizes some of the bulk forming 
modes, i.e., partial compression of workpiece (sheet or plate) in its thickness 
direction (such as upsetting, coining, pin/boss/rib forming, ironing, etc.) for reduc- 
ing the wall thickness locally, together with some of the forming modes of con- 
ventional stamping of sheet metal (such as deep drawing, bending, expanding, 
piecing, etc.). In this process, various kinds of thin- walled 2D/3D components 
with multi thicknesses, ribs, pins, and webs can be formed in near net shape from 
sheet metal workpiece with uniform thickness. The technical issues in this process 
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are high forming pressure in local compression (upsetting) of sheet workpiece and 
potential defects caused by lateral material flow. Therefore, the optimization in 
forming sequence in multistep forming and tooling design is essential for successful 
forming. 

In this chapter, the basic process knowledge are presented to cover the working 
principles, deformation characteristics, important process parameters, forming pro- 
cess and tooling design, forming machine, case studies for application, etc. 


Spin Forming of Cup and Tubular Components 

Working Principle and Typical Forming Modes 

Working Principle (ASM INTERNATIONAL 1988; Leifeld Metal Spinning 
AG 2012; WF Maschinenbau und Blechformtechnik GmbH 2012; Nihon 
Spindle Mfg. Co., Ltd. 2012; Hayama 1981) 

The spin forming process includes flow forming and shear forming as shown in Fig. 1. 

In flow forming, the axisymmetric tubular workpiece mounted on mandrel is 
rotationally driven together with mandrel and the wall thickness of workpiece is 
incrementally reduced by the axial feed of rollers as shown in Fig. 2. The rollers 
contact with workpiece only in small areas and most of outside of workpiece is 
free surface, i.e., not constrained by roller. Therefore, the process parameters 
need to be optimized to maximize axial metal flow or axial strain (s z ) and 
minimize circumferential (tangential) metal flow or circumferential strain (e#) 
for successful forming. Under optimum forming conditions, the internal diameter 
of workpiece remains unchanged during forming and will be almost equal to 
mandrel diameter. The multi-pass forming can be used for achieving higher 
reduction in wall thickness. 



a Forward b Backward Shear forming of 

Flow forming of cylindrical tube conical shape 




Workpiece 


Mandrel 


Fig. 1 Schematic diagram of spin forming 
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Fig. 2 Tube elongation/wall thinning in flow forming 


Two techniques are used in flow forming, i.e., forward and backward methods. 
In the forward flow forming, the rollers are fed axially toward the free end of 
workpiece, and metal flows in the same direction as roller feed. The advantages of 
forward flow forming as compared with backward flow forming are smaller distor- 
tion of formed portion and closer control of formed length. In backward flow 
forming, rollers are fed axially toward the fixed end of workpiece and metal 
flows in the opposite direction as roller feed. Simpler shape of workpiece can be 
used without an internal flange for clamping and smaller axial travel of roller is 
required in backward flow forming. 

In shear forming, conical and dome shapes (cup) can be formed from flat 
workpiece by feeding of roller in the direction of half angle of the cone. The 
workpiece is clamped at the end of mandrel and rotationally driven together with 
the mandrel. The outside diameter of workpiece is kept almost constant during the 
shear forming, and the wall thickness of cup is basically determined according to 
the sine rule as shown in Fig. 3. Therefore, a smaller cone angle results in a greater 
reduction in wall thickness of the cup. Therefore, a special multi-pass forming 
process for preforming of workpiece is required to obtain smaller reduction in wall 
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Roller 

Workpiece 


t = t 0 sin a 

Initial workpiece thickness 

t- Wall thickness after forming 
a = Conehalf angle 


Fig. 3 Wall thickness in shear forming 


thickness in the forming of conical cup with smaller cone angle, as will be shown 
later in more detail. 

The advantages of spin forming are as follows: 

(a) High-precision forming (near-net- shape forming) of axisymmetric components 
with small wall thickness 

- This leads to high material yield and reduction in material waste. 

(b) Simplicity and low-cost tooling 

- Round-shaped roller tooling with simple geometry. 

(c) Low forming load and small forming machine because of incremental forming 
principle. 

(d) Easy lubrication during forming 

- Easy supply of lubricants close to tool-workpiece contact area. 

(e) High potential for forming high-strength and difficult-to-form materials 

- This results from the working principle of incremental compressive forming 
and easy lubrication. 

(f) High flexibility in forming modes and product size (CNC) 

- Higher flexibility in setting roller pass and final workpiece dimensions. 

(g) High potential for application to high-mix low-volume manufacturing because 
of low tooling cost and high flexibility in forming. 

Spin forming has the following disadvantages as compared with conventional 
cold forging and stamping: 

(a) Applicable to only axisymmetric components. 

(b) Lower production rate as compared with cold forging and stamping. 

(c) Necessity for specialized forming machine 

- Limited numbers of commercial machine for general use and high price of 
machine. 

(d) Insufficient publication of engineering knowledge/data/know how for produc- 
tion by spin forming. 
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Typical Forming Modes 

(a) Tube elongating or wall thinning (Fig. 2) is a basic forming mode in flow 
forming. In this mode, the wall thickness of tubular workpiece is reduced, and 
workpiece is elongated in its axial direction. Either of two directions of 
forming, i.e., forward or backward, is selected depending on the product 
shape and machine specification (e.g., maximum roller travel). 

(b) In cup forming mode (Fig. 4), the cylindrical cup with short length can be 
formed directly from a disk-shaped workpiece in one pass or multi passes by 
feeding the roller in the axial direction. This mode is useful to make the preform 
for subsequent flow forming process. 

(c) In boss forming mode (Fig. 5), a boss shape can be formed at the central portion 
of disk-shaped workpiece by using radial feed of the roller. This mode is very 
unique and useful to form hollow boss in spin forming. 

(d) Splitting/V-groove forming mode (Fig. 6) can be applied to split the disk- 
shaped workpiece to form V-shaped groove at outside periphery of the work- 
piece. This mode can be used for forming thin flanges as shown later in section 
“Near-Net-Shape Components by Spin Forming (Case Studies)”. 

(e) Deep cup with thin wall can be formed from disk- shaped workpiece by com- 
bining cup forming mode and wall-thinning mode as shown in Fig. 7. 

(f) Other shapes such as U-shaped groove (Fig. 8), T-shaped cross section (Fig. 9) and 
cup with flange (Fig. 10) can be formed by combining splitting and elongating modes. 

(g) Conical shapes of different cone angles can be formed by single pass and multi 
pass in shear forming as shown in Figs. 1 1 and 12. The cone angle of the formed 
product is a little larger than that of the mandrel because the diametric growth 
of workpiece appears during shear forming in a manner similar to flow forming. 
Shear forming by multi pass is an effective technique to form smaller cone 
angles without the breakage of workpiece during forming. 


Roller 


Mandrel 




C 


i\ 


\ 



Fig. 4 Cup forming 
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Sa (Axial depth of cut) 




Mandrel 



Roller 


Workpiece 





Basic mode (by SIMTech) 




Practical mode (by Leifeil Metal Spinning AG) 


Fig. 5 Boss forming 

Deformation Characteristics and Process Parameter in Flow Forming 
Process Parameters and Its Effect on Deformation 

The major process parameters for tube elongation/wall thinning in flow forming are 
approach angle (a) of the roller, wall thickness reduction (%), and feed rate 
(mm/roller) of the roller or workpiece as shown in Fig. 13. These parameters 
have significant effects on the deformation of workpiece, dimensional accuracy 
of the formed component, and forming load: 

(a) Approach angle of roller: 15-25° is recommended for usual cases. Larger 
approach angles cause the development of “bulging” deformation (shown in 
Fig. 13) at the front portion of contact area between the roller and workpiece. 
An excessive bulging sometimes causes the overlapping defect on the formed 
surface. Too small approach angles lead to an increase in the forming load and 
excess tangential metal how which causes larger diametric growth of formed 
product. The selection of comer radius of the roller is important in the shear 
forming of conical cup to obtain good surface finish of cup. 
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Fig. 6 Splitting/V- groove forming 


Fig. 7 Combination of cup 
forming and wall thinning 


Initial workpiece Formed cup 


Final component 
(after wall thinning) 



(b) Wall thickness reduction (%): The wall thickness reduction below 40 % is 
recommended per pass in usual case because too large wall thickness reduction 
causes wall breakage of workpiece, excessive growth of bulging, larger dia- 
metric growth of the workpiece, and larger forming load. Lower thickness 
reduction should be selected for forming high-strength or difficult-to-form 
materials. 

(c) Feed rate (mm/roller) of the roller or workpiece: This parameter affects strongly 
the deformation behavior of the workpiece. An effect of feed rate on the plastic 
strain destitution in workpiece is predicted by FE simulation as shown in Fig. 14 
(Wong et al. 2010). At the smallest feed rate of 0.2 mm/rev, deformation is 
mainly concentrated in the outside layer of the workpiece. With higher feed 
rates (0.4 and 0.8 mm/rev), much higher strain is developed across wall 
thickness. The hardness distribution (Fig. 15, Wong et al. 2010) across wall 
thickness of formed cup showed that higher hardness, i.e., larger work harden- 
ing caused by larger strain, was induced in the outer layer at the smallest feed 
rate of 0.2 mm/rev as compared with larger feed rates. 

(d) Rotational speed of workpiece is also important because it affects lubricating 
condition (e.g., galling or wear of roll, etc.) and temperature rise of workpiece 
during forming as well as the production rate in flow forming. 
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Fig. 8 Combination (A) of splitting and elongation 
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Fig. 9 Combination (B) of splitting and elongation 
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Initial Splitting Elongating Straightening 



Initial wokpiece and final shape 

Fig. 10 Combination (C) of splitting and elongation 


Fig. 1 1 Initial workpiece 
and conical shapes after shear 
forming (by single pass) 


i 


D a = 65mm, f 0 =3mm, a=15°, 
A6061 






(e) Number of roller is another factor influencing the productivity of forming. 
Forming speed (feeding speed of roller and rotational speed of workpiece) 
can be increased proportionally with the number of roller under a constant 
feed rate (mm/roller) of roller/workpiece as seen in Fig. 13. 


In the shear forming of cone-shaped products, the approach angle (a) of the 
roller, wall thickness reduction (%), and feed rate (mm/roller) of roller or 
workpiece are also major process parameters in analogy with flow forming men- 
tioned above. In this case, the optimization of forming process design and initial 
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D 0 = 65mm s f 0 =3mm P A6061 



1 st pass (of 15°) 2 nd pass (of 10°) 3 rd pass (of 5°) 

Fig. 12 Conical shapes after shear forming (by single and multi pass) 



Nose radius r 


Fig. 13 Process parameters 
in flow forming (tube 
elongation/wall thinning) 


1. Roller approach angle: a(°) 

2. Wall thickness: Start: fo(mm), Finished: t (mm) 

3. Axial feeding speed of roller (mm/min): V= Nwx fx i 

4. Rotational speed of workpiece (rpm): Nw 

5. Feed rate of roller/workpiece (mm/roller): f = (Vi) x (M/Nw) 
where, i = Number of roller 

6. Wall thickness reduction (%) : Rt = [(to -t) / to] x 100 


shape/dimensions of workpiece (blank) is also important for successful forming of 
cone-shaped product with a desired wall thickness, as shown in Fig. 16 (Hayama 
1981). In the case of (b) in Fig. 16, i.e., combination of conventional spinning and 
shear forming, the outside diameter of workpiece decreases during the conventional 
spinning (first pass). Therefore, the final wall thickness of cups after second pass 
(shear forming) can be larger as compared with the normal shear spinning shown in 
Fig. 16a. In the forming of cone-shaped cup shown in Fig. 16c, the modified 
workpiece with larger thickness at the outer portion is also effective to obtain larger 
and uniform thickness of the cup. 
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Material: Aluminum alloy A6061 
Internal dia. 30mm 
Start wall thickness: 3.0mm 
Wall thickness reduction: 20% 

Roller approach angle: 20° 

Number of roller: 2 

FE simulation: DEFORM-3D, Rigid-plastic, 

adopting ALE formulation 



Fig. 14 


Effective strain across wall thickness (predicted by FE simulation) (Wong et al. 2010) 



Material: Aluminum alloy A6061 
Internal dia. 30mm, 

Start wall thickness: 3.0mm 
Wall thickness reduction: 20% 
Roller approach angle: 20° 
Number of roller: 2 


Fig. 15 Hardness values measured across wall thickness of flow formed sample (Wong 
et al. 2010) 


The methods for workpiece centering and stable rotational driving of workpiece 
are to be taken into consideration in the design of workpiece/preform and forming 
tools in shear spinning. When some redundant deformations of workpiece such as 
thinning and expansion of center hole appear at the bottom (vertical wall) of cups 
during shear forming, these should be minimized by optimizing the workpiece/ 
perform design, forming tool, forming process design, and forming parameters. The 
hole at the center of workpiece is useful for the centering of workpiece in the initial 
stage of spin forming. 

However, the center hole of workpiece is dispensable if the initial centering of 
workpiece can be done by other means. 
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(a) (b) 


(c) 


Modified blank for shear forming 

of ellipsoidal cap 


Combination of conventional 
spinning and Shear forming 


Fig. 16 Examples of forming process for forming conical shaped cups (Hayama 1981) 


A superior lubricity and heat dissipation (cooling) are required for the lubricants 
used in spin forming to obtain excellent surface finish of product and to prevent 
galling on roller surface as well as tool wear. The emulsion-type lubricants (lubri- 
cants dispersed in water) with high performance are effective in the spin forming of 
aluminum alloys, alloy steels (including low alloy steels and stainless steels), and 
nickel-based alloy (such as INCONEL 718). Graphite lubricant was coated on 
AZ31 workpiece before the warm flow forming. The lubricant is to be supplied 
properly close to the contact area between the roller and the workpiece during spin 
forming by using lubricant nozzles. 

Diametric Growth of Workpiece in Flow Forming 

For near-net- shape forming of short cylindrical hollow cup (length/diameter < 3), 
the dimensional accuracy after flow forming is very important (Danno et al. 2011). 
In tube elongation/wall thinning process by flow forming, a diametric growth is 
induced, in which the internal diameter of flow-formed cups tends to be enlarged 
gradually along the workpiece axis from the start position to final position of 
forming. This phenomenon is presumed to be caused by the circumferential metal 
flow ( e 0 in Fig. 2) during flow forming. 

Some experimental data (Danno et al. 2011) of diametric growth are shown in 
Figs. 17, 18, and 19. The diameter growth of cups increases at higher thickness 
reduction and higher feed rate, but it can be reduced in multi pass of flow forming. 
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A6061 cup after 1 st pass 
1 roller, a=20°, c//'=50mnn, 
Feed rate=0.24 mm/roller 


A6061 cup after 1st pass, 2 roller, 
a=20°,d/=30mm, to= 3.0mm, 

Rt= 20%, Feed rate F(mm per roller) 



Distance from bottom (mm) 


mas 

— mao 
£ 

E m.ss 


30,30 
30.2 5 
= 30,20 


to 

% 

B 

CD 

■ mm 
— 


c 

c 


mis 

30.10 


- 30.05 

moo 


-Li 


0 


Thickness reduction Rt=20% 


■ F=0, 2mm 
A F-0. 4rtm 
* F=0. $mm 



1 


1 




A 




U. ■ 


-l 


a 



5 


10 


15 


20 


25 


nee from bottom (mm) 


Fig. 17 Effects of wall thickness reduction and feed rate on diametric growth in flow forming 
(wall thinning) (Danno et al. 2011) 
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Diametric growth after multi pass of flow forming (wall thinning) (Danno et al. 2011) 


The number of roller ( 1 ~ 3 roller) does not affect the diametric growth. The degree 
of diametric growth depends strongly on the material of workpiece, as shown in 
Fig. 19, for example. Magnesium alloy AZ31 (warm forming at 300 °C), stainless 
steel AISI304L, and nickel-based alloy of INCONEL7 1 8 showed larger diameter 
growth compared to aluminum alloy A6061, Cr-Mo alloys steel, and AISI1010 
steel. The thickness variation was much smaller than the degree of diametric growth 
in these cases, e.g., 0.025 ~ 0.065 mm and ~0.03 mm in the case of Figs. 17 and 18, 
respectively. Severe bulging and “bell mouthing” were observed at higher feed 
rates as shown in Fig. 20. 
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Fig. 19 Effect of workpiece material on diametric growth in flow forming (wall thinning) (Danno 
etal. 2011) 


2 roller, a=20°,c//=30.0mm, to= 3.0mm, Feed rate=0.8mm/roller, 

Wall thickness reduction Rt =20%, 

FE simulation: DEFORM-3D, Rigid-plastic, adopting ALE formulation 



Bell mouthing 


Fig. 20 Sever bulging and ‘bell mouthing’ of A6061 cup (Wong et al. 2010; Danno et al. 2011) 

Formability of Materials in Flow Forming 

Formability of materials in flow forming is shown in Table 1 based on the testing 
and evaluation conducted at SIMTech (Singapore Institute of Manufacturing Tech- 
nology, Singapore). Aluminum alloys (Al 100, A6061), carbon steels (0.1 ~ 0.5%C), 
and low alloy steels have excellent formability, and stainless steels (304L, 304), 
maraging steel (Fe-18%Ni-9%Co-5%Mo), and INCONEL 718 have good 
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Table 1 Formability of materials in flow forming (tested in SIMTech) 


Materials 

Formability 

Remarks 

Aluminum 

alloys 

A1100 

A 

-0 — 24H 

A6061 

A~B 

-0, — T4 

Steels 

(structural) 

Carbon 

steel 

A~B 

0.1-0. 5C 

Low alloy 
steels 

A-B 

Fe-Cr, Fe-Cr-Mo, Fe-Ni-Cr-Mo, Fe-Cr-Mo-V, 
etc 

Stainless 

steel 

B 

304L (Fe-9Ni-18Cr), 304 (Fe-8Ni-18Cr), 1% 
C-13%Cr (martensitic) 

Bearing 

steel 

B~C 

AISI 52100 (Fe-0.7C-13Cr) 

Steels (high 
strength) 

Margaing 

steel 

B 

Fe-18%Ni-9%Co-5%Mo (solution treated) 

Fe-42Ni 

alloy 

B~C 

Ni-Span-C 

Titanium alloy 

a+p alloy 

B~C 

Ti-6A1-4V 

Super alloy 

INCONEL 

B 

INCONEL 718, 625 


Formability: A Excellent, B Good, C Poor 



Stainless Steel 304L INCONEL 718 


Ti-GAI -4V 


Fig. 21 Typical cracks in flow forming (provided from SIMTech) 


formability. However, the formability of bearing steel (Fe-0.7C-13Cr), Fe-42%Ni 
alloy (Ni-Span-C), and Ti-6A1-4V alloy is good or poor. The formability of some 
high-strength steels, nickel-based alloys, and titanium alloy was limited mainly by 
cracking during flow forming as shown in Fig. 21. Magnesium alloy AZ31 can be 
flow formed at a warm temperature range (around 300 °C). 


Spin Forming Machine 

The roller arrangements in the spin forming machine with 1 ~ 3 roller, two types 
(horizontal and vertical) of spin forming machine, and typical commercial spin 
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Fig. 22 Roller arrangements 
in spin forming machine 



1-RoIIer 



2-Roller 



3-Roller 


Horizontal 


Vertical 


R ; Roller 
W: Work piece 
M : Mandrel 
C : Clamper 





Fig. 23 Two types of spin forming machine 


forming machines are shown in Figs. 22 and 23, and Table 2, respectively (Leifeld 
Metal Spinning AG; WF Maschinenbau und Blechformtechnik GmbH; Nihon 
Spindle Mfg. Co., Ltd.). In many cases in the spin forming machine, the mandrel 
is rotationally driven and the rollers are freely rotatable. Therefore, the workpiece is 
rotationally driven by clumping the workpiece on mandrel by using tailstock and 
the friction between mandrel and workpiece during forming. The roller rotates only 
by the friction between rollers and workpiece. 

The horizontal three-roller machine has an advantage of high stability in work- 
piece centering as well as high rigidity of roller supporting unit. This type of 
machine is suitable for flow forming of long tube and heavy-duty forming of 
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Table 2 Typical commercial spin forming machines 


Type of 
machine 

Number of 
roller 

Typical products 

Remarks 

Horizontal 

3 

Long tube 

High stability in workpiece centering 

Wheel disk 

Heavy duty forming 

1~2 

Cone and dome 
shapes 

High flexibility in size and shapes of 
product 

Tube end forming 

(necking, 

reducing) 

Simple machine 

Vertical 

3 

Aluminum wheel 

High stability in workpiece centering 

2,3 

Cup shaped 
components with 
small axial length 
(pulley, clutch 
housing) 

Small floor space of machine 

High productivity 

High flexibility in size and shapes of 
product 


thick wall components such as cone- shaped- wheel disk. Another type of horizontal 
machines with one roller or two rollers can be used for forming various kinds of 
components because of its higher flexibility in size and shape of the product. In the 
case of machines with two rollers, the roller driving system is simpler as compared 
with three-roller machine and there is no geometrical constraint in the minimum 
workpiece diameter. The vertical machines have advantages of smaller floor space 
and are used for forming various kinds of cup-shaped components with short axial 
lengths. The roller motion (axial and radial travel) is usually controlled by a CNC 
system in these machines. 

With regard to the material for forming tools (roller, mandrel), the cold work die 
steels (e.g., D2) and high-speed steel (e.g., M2) can be used after quenching and 
tempering. Heavy-duty metal forming oil or heavy-duty cutting fluid (water based) 
can be used for the lubrication in spin forming. 


Near-Net-Shape Components by Spin Forming (Case Studies) 

Spin forming can be applied to the near-net- shape forming as shown below and in 

Figs. 24, 25, 26, and 27. 

(a) Reducing wall thickness and elongating of the tubular components by flow 
forming. 

- Forming of hollow components (low alloy steels, nickel-based alloy, stain- 
less steel, aluminum alloy, etc.) with thin wall or varying wall thickness 

(Figs. 24 and 25) 

- Elongating of the hollow shaft with flange (Fig. 27c) 

- Forming of miniature tube of hard material (l%C-13%Cr martensite stain- 
less steel, Fig. 26b). 
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Fig. 24 Formed samples 
(flow forming of long tube of 
Cr-Mo-V steel by Leifeld 
Metal Spinning) 
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^ Stainless steel 304L 


050mm 



Fig. 25 Formed samples (spin forming, Provided from SIMTech) 

(b) Forming of thin-walled hollow component from disk-shaped workpiece (alu- 
minum, Fig. 25b) by using cup forming and flow forming. In this case, there is 
no center hole at the bottom of cup. 

(c) Forming of cone by shear forming, and hemispherical components by combi- 
nation of conventional spinning and shear forming from disk- shaped workpiece 
(Figs. 11, 12, 25a, and 26d). In the case of Fig. 26d, there is no hole at the center 
of workpiece. 
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Pre-form After forming 



Internal spine (Aluminum) Miniature tube 

(Martensitic stainless steel) 



High aspect component Hemispherical cup 

(Aluminum alloy) (Stainless steel, 080, t=2.5mm) 


Fig. 26 Formed samples (spin forming, Provided from SIMTech) 


(d) Forming of high aspect ratio component from disk-shaped workpiece with large 
thickness by splitting and flow forming (aluminum alloy, Fig. 25c). 

(e) Forming of V-belt pulley (performing by stamping and spin forming of poly-V 
grooves) (mild steel, Fig. 26b). 

(f) Cold forming of aluminum wheel from forged perform or warm forming of 
aluminum wheel from cast perform (Fig. 26a): This process is effective to 
reduce the wall thickness and weight of wheels. 

(g) Forming of internal spline teeth in flow forming by using mandrel with spline 
teeth profile (Fig. 25a). 
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Aluminum wheel 
(Leifeld Metal Spinning) 


Initial surface 
of pre-form 



Poly v-pulley (Low carbon steel) 



P re- form 

{Forged and machined) 




of shaft {Leifeld Metal Spinning) 


Fig. 27 Other applications of spin forming 


As seen in the pictures of the formed samples, the surface finish of the formed 
products are very smooth due to the effect of the rolling contact with roller or roll 
burnishing effect during forming. 

Some dimensional accuracy data of flow-formed cylindrical components are 
shown in Tables 3 and 4. In these data, the accuracy of the internal diameter 
includes the diametric growth of workpiece during flow forming. 


Combined Stamping and Forging Process for Thin-Walled 
Components 

Forming Principle and Typical Forming Mode 
Working Principle 

In the combined stamping and forging process, various kinds of thin-walled 2D/3D 
components with multi thicknesses and ribs/webs or pins are formed in near net 
shape (Merklein et al. 2011a, b, 2012; Yang et al. 2011). They are formed from 
sheet metal workpiece with uniform thickness by combining some bulk forming 
modes and stamping (sheet metal forming) modes. Major forming modes in this 
process are: 
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Table 3 Dimensional accuracies of flow-formed cylindrical components (Provided from 
SIMTech) 


Size range and respective tolerances of finish components 



A (2 roller, CNC, 
Smaller size machine) 

A/B 

B (2 roller, CNC, Heavy 
duty machine) 

Inner diameter 

3.5-30 mm 

30-50 mm 

50-200 mm 

Wall thickness (min) 

0.7 mm 

0.8 mm 

1-3 mm 

Maximum length 

80 mm 

80-300 mm 

300 mm 

Diameter 

±0.01~±0.2 mm 

±0.2 mm 

±0.2~±0.4 mm 

Wall thickness 

±0.01 mm 

±0.02 mm 

±0.05 mm 

Int./Ext. Surface 
finishing (Ra) 

0.1~0.8 pm 


Note: This table is for general references of the dimensional accuracy of the components formed 
by spin forming machine in SIMTech 

Dimensional accuracy is highly dependent on workpiece material and size 

Surface roughness depends on condition of forming tools, workpiece material and forming 

condition (e.g., axial feed rate, lubrication etc.) 


Table 4 Dimensional accuracies of flow-formed cylindrical components of INCONEL 718 with 
thin wall thickness (Provided from SIMTech) 


Dimensional accuracies of INCONEL 718 thin wall components (Wall thickness 1 mm or less, 
Tot I length 120 mm, start thickness 5 mm) 


Thickness 

±0.05 mm 

Roundness (ID) 

0.05 mm (feed rate: 0.2 mm/roller) 

0.08 mm (feed rate:0.1 mm/roller) 

Surface roughness (outer) 

Ra 0.13 pm (smaller feed <0.1 mm per roller), 

Ra 0.23 (larger feed: 0.2 mm per roller) 

Surface roughness (inner) 

Ra 1.1 pm 

Tapering of ID a 

tan 6 0.009 ~ 0.012 


Forming conditions: 

2 roller, Initial ID of workpiece = 50.0 mm, Total number of pass = 6, Rotational speed of 
workpiece = 120 ~ 250 rpm, Thickness reduction 20 ~ 29 % per pass, Feed rate 0. 1 ~ 0.2 mm/roller 
a Tapering (tan 6 ) is defined as shown in Fig. 18 


(a) Bulk forming modes (Fig. 28): 


Upsetting 

Thickness of workpiece is partially reduced by fiat punch. 

Forging 

Thickness of workpiece is partially reduced by die or punch with cavities. 

Coining/ 

embossing 

Small areas of workpiece is compressed in thickness direction. 

Ironing 

Wall thickness of hollow workpiece is slightly reduced between die and 
mandrel. 

Rib/pin extrusion 

Rib or pin is extruded by compressing workpiece in its thickness direction. 
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Upsetting- A 


d 




Forging -B 




Ironing 


Fig. 28 Bulk forming modes 


(b) Stamping/sheet metal forming modes (Fig. 29): 


Deep 

drawing 

Cup is formed from flat workpiece by drawing workpiece in radial direction into 
the die hole. 

Expanding 

Central region of workpiece is expanded (stretched) by punch, while outer 
portion of flat workpiece is kept deformed. 

Flanging 

Circular vertical wall is formed at the center of workpiece by pushing punch 
through hole with smaller diameter. 

Bending 

Workpiece is bent by punch and die. 

Punching 

Hole is formed by shearing with punch and die. 


These forming modes are combined properly to form the whole shape, change 
thickness locally, and form web, ribs, or pins on the workpiece. In the upsetting and 
die forging of bulk forming mode, the workpiece is compressed partially in its 
thickness direction to generate vertical and lateral metal flow. In many cases, the 
vertical metal flow is effective to form ribs/webs or pins on the workpiece. 
However, the lateral metal flow along the tool surface needs to be adequately 
controlled to prevent defects such as wrinkling/bucking at the thinner portion and 
folding/shear of ribs as shown in Fig. 30. 
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Fig. 29 Stamping/sheet metal forming modes 
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Fig. 30 Defects caused by lateral metal flow in bulk forming mode 
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Fig. 31 Mean forming 
pressure in compression of 
disk and plate (Thomsen 
et al. 1965) 



r f /h 


Axisymmetrical compression of disk 



°0 5 10 15 20 25 30 35 40 

'f/2h 


Plane strain compression of plate 


It is widely known that the forming pressure in the compression of disk or plate 
increases with the increasing radius or width to thickness ratio of the workpiece, as 
shown in Fig. 31 (Thomsen et al. 1965). This is caused by the friction between 
workpiece and tools. Therefore, the lubricating condition for workpiece and tools is 
significantly important to reduce the forming pressure and control the metal flow. 

The frictional condition also affects the lateral and vertical (extrusion) metal 
flow. That is, lower friction makes the lateral flow easier, and higher friction 
increases the extrusion of rib or pin in vertical direction. 

The advantages of combined stamping and forging process are: 

(a) Combining major benefits of conventional stamping and forging (bulk forming) 
into one process, i.e., compressing workpiece in its thickness direction by bulk 
forming mode and forming whole shape by stamping mode 

(b) Capability of near-net-shape forming of thin-walled components with multiple 
wall thicknesses and complex shape from sheet workpiece under lower forming 
load as compared with conventional cold forging 
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(c) Improvement of material formability due to bulk forming mode under com- 
pressive stress and optimization of wall thickness 

(d) Milder frictional condition between tool and workpiece as compared with 
conventional cold forging 

On the other hand, this process has the following disadvantages: 

(a) Useful forming mode and sequence for forming (e.g., decrease of forming 
pressure and better control of lateral metal flow) have not been developed 
enough for higher strength materials. 

(b) Design rule for forming tool and products have not been clearly established. 

(c) Press machine and tooling system specialized for combined stamping and 
forging process are still under development. 

(d) Unsatisfactory accumulation of technical know-how for wider 
commercialization. 

Forming Process and Tooling Design 

The important technical issues in combined stamping and forging process are high 
forming pressure and potential defects caused by lateral material flow in the bulk 
forming modes as mentioned above (Merklein et al. 2011a, b, 2012; Nakano 2001; 
Nakano et al. 2006). Therefore, the optimization in forming sequence in multistep 
forming and tooling design is essential to reduce the forming pressure and control 
the metal flow for successful forming. 

Figures 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, and 42 show potential forming 
modes and process designs for the combined stamping and forging process. 

1 . In the deep drawing from the workpiece with a uniform thickness as shown in the 
left in Fig. 32, the workpiece thickness is thickened at the flange portion by the 
circumferential shrinking deformation during the deep drawing. Therefore, the 
deep drawing of the cup from tapered blank (shown in the center and left in 
Fig. 32) is effective to improve the cup thickness distribution and formability in 
deep drawing. The tapered blank can be made by upsetting from an initial 
workpiece with uniform thickness. 

2. The combination of deep drawing and upsetting/extrusion (Fig. 33) is useful for 
forming cups with boss at its center (Merklein et al. 2012; Nakano 2001; Nakano 
et al. 2006). The follow or solid boss can be extruded by compressing workpiece 
and reduce its thickness around boss. 

3. Local upsetting/forging/pin extrusion (Fig. 34) can be used for forming chamfer, 
holes, and pins (Merklein et al. 2012; Nakano 2001). 

4. The cup with sharp bottom corner and thin wall can be formed by combination of 
upsetting, deep drawing, and ironing (Fig. 35) (Merklein et al. 2012; Nakano 
et al. 2006). In this method, the local thinning of workpiece at cup comer is 
prevented successfully. 
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From blank of 
uniform thickness 



From tapered (upset ) blank 


Fig. 32 Deep drawing of cup from blank of uniform thickness and tapered (upset) blank 
(provided from SIMTech) 




Fig. 33 Combination of 
deep drawing and upsetting/ 
extrusion (Merklein 
et al. 2012; Nakano 2001; 
Nakano et al. 2006) 


drawing + pin extrusion 



Blank 1 st step 2 nd step 

Deep drawing + local upsetting/ extrusion 



Fig. 34 Local upsetting/forging/pin extrusion (Merklein et al. 2012; Nakano 2001) 
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Fig. 35 Forming of cup having sharp bottom comer and thin wall (Merklein et al. 2012; Nakano 
et al. 2006) 



Fig. 36 Rib/Web forming by forging 


Fig. 37 Multi-step upsetting 
process (Merklein 
et al. 2011a) 

5. Rib/web can be formed in rectangular components by forging mode (Fig. 36). In 
this method, the rib and web is extruded vertically by compressing the workpiece 
in its thickness direction. 

6. Multistep upsetting process was proposed to move the material in lateral direc- 
tion and increase the thickness at the end portion of workpiece (Fig. 37) 
(Merklein et al. 2011a; Oyachi et al. 2011). 

7. Incremental forging process in multiforming stages (Figs. 38, 39, and 40): In this 
process, the workpiece is compressed incrementally to cause the metal flows 
from the central region toward the free end of the workpiece. A cushion 
mechanism is used to prevent the backward metal flow and buckling of the 
thin portion of workpiece which is compressed in the previous step. This process 
has a potential for forming multi thickness components with the rib and web 
under reduced forming load. However, some further R&D works are required to 
make clear the design rule for forming process and tooling as well as the forming 
limitation/capability of this process. 

8. Forming of cup-shaped and gear-teethed component was tried by deep drawing 
and extrusion/forging process (Fig. 41) (Merklein et al. 2011b). The thin mate- 
rial around gear teeth is removed by a subsequent trimming operation. 


1st step 
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Fig. 38 Incremental forging (basic concept A, provided from SIMTech) 


Important process parameters 

• Thickness reduction: t/t 0 

• Increment of forming: w-j/to, W2 / t 

• Cushion (counter) pressure: Pc/Ai 



Fig. 39 Important process parameters in incremental forging (basic concept A, provided from 
SIMTech) 

9. Coining/forging can be applied to form grooves at cup corner after deep 
drawing (Fig. 42). 

In these forming modes, the elastic deflection of forming tool components 
cannot be neglected because the workpiece thickness is small and forming pressure 
is high. The elastic deflection of forming tool components should be compensated 
by modifying the component dimensions (height, etc.) for high precision forming. 
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Product cross section 



Blank 




Fig. 40 Incremental forging process (basic concept B, provided from SIMTech) 



Fig. 41 Forming of gear component by deep drawing and extrusion/forging process (Merklein 
et al. 2011b) 


Fig. 42 Coining/forging at 
cup comer after deep drawing 
(SIMTech) 
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Therefore, FE simulation is essential for forming process design and optimization 
of process parameter as well as tool design. 

As for press machine, the conventional press machines for stamping and cold 
forging can be used for the combined stamping and forging process. However, 
higher rigidity of bolster/slide and press frame is recommended for stamping press 
because the forming pressure/load in the combined stamping and forging process is 
higher as compared with conventional stamping. Servo press (mechanical) is 
suitable to the combined stamping and forging process because of its high capabil- 
ity in high- accuracy setting of ram stroke. 

A superior lubricity is required for the lubricants used in combined stamping and 
forging process to prevent galling on tool surface and to reduce the friction. The 
chemical conversion coating such as zinc phosphate + metallic soap (for steels) and 
aluminum fluoride + metallic soap (for aluminum alloys) is preferable in severe 
frictional conditions. The emulsion-type lubricant (lubricant dispersed in water) 
with high performance is applicable to the forming of aluminum alloys and low 
carbon steels under moderate frictional condition. 


Future Development Trend 

The combined sheet and bulk forming process has high capability in near-net- shape 
forming of components with thin and multiwall thickness from lightweight and 
high-strength metals. This process is more important in the future to reduce the 
material waste and cost in the manufacturing of metal components. Further devel- 
opment work is required in the following areas in future: 

1 . Spin forming technology 

- Development of universal and less expensive forming machine for commer- 
cial use 

- Improvement in dimensional accuracies of formed components, e.g., 
decrease in diametric growth in flow forming and undesirable redundant 
deformation at the bottom (vertical wall) of cups in shear forming 

- Expansion of forming capability of process (e.g., forming of unaxi symmetric 
components) 

- Accumulation of technical know-how and design rules of process/tool for 
wider commercialization 

2. Combined stamping and forging process 

- Development of advanced forming mode and sequence for forming higher- 
strength material (e.g., decrease of forming pressure and better control of 
lateral metal flow) 

- Advanced design rule for forming tool and products 

- Investigation on better lubrication and optimization of frictional condition 
between tool surface and workpiece 
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- Development of multiaxis press machine and tooling system specialized for 
combined stamping and forging process 

- Accumulation of technical know-how for wider commercialization 


Summary 

The combined sheet and bulk forming process has a high potential for the near-net- 
shape forming of metal components with thin and multiwall thickness. In this 
forming process, the bulk forming process, i.e., compressive forming process, is 
applied to sheet or plate workpiece in combination with the conventional sheet 
forming (stamping) process to minimize the material waste and manufacturing cost. 
The spin forming (flow forming, shear forming) as well as the combined stamping 
and forging are focused in this chapter, because these are key technologies for the 
combined sheet and bulk forming process. The basic process knowledge of these 
two technologies is provided with as for the working principles, deformation 
characteristics, important process parameters, forming process and tooling design, 
forming machine, case studies of application, etc. 

The flow forming process can reduced the wall thickness of rotating cylindrical 
hollow workpiece into intended value incrementally, and shear forming can form 
various kinds of conical/semispherical/cup-shaped hollow components from flat 
disk-shaped workpiece of sheet metal. The advantages of spin forming are low-cost 
simple tooling, easy lubrication during forming, high flexibility in forming, and 
applicability to cold forming of high-strength and difficult-to-form materials. The 
optimization of forming process, forming parameters, tooling design, and initial 
workpiece design is required for successful spin forming. 

The combined stamping and forging process includes the local compression of 
workpiece in its thickness direction for reducing wall thickness locally and some 
forming mode of conventional stamping of sheet metal. In this process, various 
kinds of thin- walled 2D/3D components with multi thicknesses, ribs, pins, and webs 
can be formed in near net shape from sheet workpiece of uniform thickness. The 
optimizations in forming sequence in multistep forming, lubrication, and tooling 
design are essential for successful forming. 
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Abstract 

This chapter provides an overview of powder-forming methods for ceramics and 
metals. Powder forming is distinct from traditional melt-forming methods in that it 
involves forming a component from powder and densifying it without melting, via 
solid state sintering. Sintering typically occurs at around 80 % of the melting point. 
The primary benefits of powder forming are as follows: (a) reduced forming tem- 
perature (reduced energy cost), (b) capability for engineered porosity, (c) elimination 
of mold component reactions caused by melt forming, and (d) suitability for mass 
production of small metal components and ceramics of all shapes and sizes. Almost 
all ceramics are manufactured by powder forming. Most metals are formed by melt 
casting; however, powder metallurgy has grown into a large industry. This overview 
begins with a review of powder characterization and powder manufacturing tech- 
niques. Powder-forming techniques are then reviewed including the two main 
dry-forming methods (die pressing, cold isostatic pressing) and a range of wet- 
forming techniques including extrusion, plastic forming, slipcasting, tapecasting, 
powder injection molding, direct coagulation casting, gelcasting, and thixotropic 
casting. The overview then discusses powder densification techniques including 
pressureless sintering, self-propagating high-temperature synthesis, microwave 
sintering, two-step sintering, hot-pressing, hot isostatic pressing, spark plasma 
sintering, and sinter forging. Future trends discussed include additive manufacturing 
(powder 3D printing), functionally graded materials, and hydrostatic shock forming. 


Introduction 

Powder forming is predicated on the technically and commercially useful concept 
of sintering: densifying a powder into a solid article at a temperature well below its 
melting point. The primary benefits of powder forming are: 

• Reduced forming temperature and thus cost 

• Capability for engineered porosity 
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• Elimination of mold to component high-temperature interactions caused by melt 
forming 

• Suitability for mass production of small metal components and ceramics of all 
shapes and sizes 

Most metals are sintered at 70-80 % of their melting point, some refractory 
metals up to 90 %. High purity ceramic sintering temperatures sometimes approach 
90 % of melting point, but with the widespread use of sintering aids, liquid phase 
sintering, and traditional ceramics with their broad compositional range, ceramic 
sintering is usually closer to 70 % of the melting point. The powder-forming 
process contains four basic steps: 

1 . Powder preparation. 

2. Powder mixing, blending, and/or slurry preparation (a powder-liquid suspension 
is known as a slurry). 

3. Forming. 

4. Sintering. 

The origins of powder forming date back at least 20,000 years, as shown from 
the archaeological evidence of the first clay crockery. Today, apart from the glass 
industry, all ceramics are formed by a powder-forming approach. This is because 
the extremely high melting point of ceramics makes casting from a melt commer- 
cially unviable unless driven by a strong commercial imperative. The only signif- 
icant exception to this is fuse-cast refractories, which are used in glass-forming 
furnaces. In contrast, metal fabrication is predominantly via melt forming, with 
powder metallurgy (PM) forming a smaller subset of metal forming, practiced only 
when it brings commercial benefit. Ceramic and metal powder forming, while they 
share many common traits, also have significant differences, primarily because of 
two key differences: 

1 . Ceramic powders are hard and their products brittle, while metal powders are 
plastic and their products ductile. 

2. Ceramics are generally in their highest oxidation state and can be sintered in air, 
while metal sintering generally needs a vacuum or a protective atmosphere. 

Powder metallurgy (PM) is ideal for mass production of small metal components 
and porous metal components. Almost all PM products are just a few centimeters in 
size, although in a few rare cases PM products as large as 50 cm have been 
produced. PM has its origins in Egypt 3 ,000 years ago where a crude form of iron 
powder forming was practiced. Modern industrial PM mass production arose in the 
nineteenth century, making copper coins, medallions, ingots, and tungsten wires. 
Self-lubricating bearings were developed in the early twentieth century, and after 
World War II, the PM industry grew exponentially, primarily driven by the 
booming automotive industry. From 1960 to 1980, iron powder consumption 
increased by a factor of 10. In this era, iron and steel replaced copper as the 
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dominant PM metal. Today, iron and steel account for about 85 % of the PM 
industry, with copper at about 7 %. Stainless steel and aluminum are also significant 
PM metals. Automotive PM products now account for about 70 % of the PM 
market. In 1990, the average car had 10 kg of PM components; today, it has doubled 
to more than 20 kg. 

Many advancements in the PM industry have emerged in the last 50 years. 
In the 1960s, advances were primarily driven by the nuclear and aerospace 
industries. Full density PM products emerged in this era. In the 1970s, PM 
superalloy components emerged, for example, jet turbine blades, a technology 
which underpins the booming commercial jetliner industry. PM injection molding 
arose in the 1980s and grew dramatically in the 1990s. Now, PM products are found in 
a very wide range of applications, including household appliances, office equipment, 
electronic components, magnetic materials, medical devices, dental products, tools, 
industrial motors, hydraulics, and advanced aerospace components. 

An important word of warning here is that fine metal powders (high surface area, 
commonly through flaky morphology) are explosive and belong in the pyrotechnics 
industry. Coarse spheroidal metal powders are quite safe for PM. Most metal 
powder suppliers will specify the purpose of the powder, whether for powder 
metallurgy or pyrotechnics. It is wise to be aware of this issue when sourcing 
metal powders for PM research and development. 

Powder forming has been used to make ceramics for thousands of years. For 
millennia, ceramic manufacture was exclusively the domain of pottery, a powder 
wet-forming technology based on clay and aluminosilicates. Today, traditional 
ceramics is a huge industry primarily servicing the building industry with three 
main products: cement powder production, glass production, and clay ware prod- 
ucts. Traditional advanced ceramics comprise refractories, electronic ceramics, 
optical ceramics, and magnetic ceramics. Advanced structural ceramics comprise 
wear-resistant linings for tribological applications, biomaterials, ballistic armor, 
and automotive components. 

The final important branch of powder forming is the cermets (ceramic-metal 
composites). Beginning with PM-formed tungsten carbide cutting tools in the early 
twentieth century, titanium carbide and other carbides for machine tools and 
tribological use followed later. In the 1950s, siliconized silicon carbide (reaction- 
sintered silicon carbide) was developed in the UK, and with its superior hardness 
and reduced weight for comparable cost, it has significantly supplanted alumina in 
its traditional areas of tribology and ballistics in the last decade. The most recent 
powder-formed cermet innovation is the reaction- sintered boron carbide, developed 
in the USA in the 1990s. 


Today, while powder forming of ceramics and metals are huge industries, powder 
forming of polymers is rare. While polymer powders are a common feedstock, it is 
uncommon for polymer components to be fabricated by a “powder metallurgy” 
approach. It is necessary only in isolated cases, for example, manufacturing compo- 
nents of engineered porosity and forming polymers that can’t be injection molded, 
e.g., from UHMWPE. However, even UHMWPE is usually formed by machining 
from billets, but sometimes it is formed by a “powder metallurgy” approach. 
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Powder Characterization, Production, and Treatments 
Introduction 

One of the driving forces for high-quality PM product manufacturing is the charac- 
teristics and quality of the raw materials, i.e., the powder particles (PPs). Their 
characteristics and quality influence the powder-forming processes (compaction, 
sintering, and post- sintering operations) and, by consequence, the formed part qual- 
ity. This section (Powder Characterization, Production, and Treatments) describes the 
main topics of particle characterization, production, and appropriate treatments to 
improve the particle quality before subsequent powder-forming operations. 


Powder Characterization 
Physical Properties 

Also known as morphological properties, physical properties refer to the particle 
size, shape, and structural characteristics. Their assessment is based on specific 
methods/tools and analytical equipment as detailed below. 

Particle Size and Particle Size Distribution 

Poly granular powder mixtures are frequently used in industrial applications that are 
a combination of particles of different size (from a few nanometers up to maximum 
of a few hundred microns) for optimal self-packing (Fig. 1). 

Particle size means the one-dimensional value of the geometrical parameters 
describing the particle shape which is projected into areas, inscribed or 
circumscribed to a circle whose diameter represents the particle size (Table 1). 



Fig. i Particle arrangement 
in the loose state 
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Table 1 Relationship between particle shape and size descriptors for particle size (http://www. 
pharmacopeia.cn/v29240/usp29nf24s0_c776.html) 


Projections from particle shape 
(volume) to linear descriptors 

Volumes 

Cubes 

Spheres 


Ellipsoids 

Areas 

Circles 

Squares 

Ellipses 



Size descriptors 



Characteristic lengths 


Particle length - the longest distance from edge to 
edge of the particle oriented parallel to the ocular scale 

Particle width - the longest distance of the particle 
perpendicularly measured to the length 


Lengths 


Perimeters 


Diameters 



Characteristic diameters 

Feret diameter - the distance between two parallel 
lines, tangent to the contour of a particle randomly 
oriented 

Martin diameter - the distance between two points on 
a particle contour that divides the particle in two equal 
halves 

Projected area diameter - diameter of a circle having 
the same area as the projection area of particle 

Volume diameter - diameter of a sphere having the 
same volume of a particle 

Surface diameter - diameter of a sphere having the 
same surface as the particle 

Surface volume diameter - diameter of a sphere 
having the same surface to volume ratio as the particle 

Perimeter diameter - diameter of a circle having the 
same perimeter as the projection perimeter of particle 


Also, other types of diameter are used to determine particle size, including: 

• Stoke diameter - the free falling diameter of a particle in the laminar flow region 

• Drag diameter - diameter of a sphere having the same resistance to motion as a 
particle in a fluid of the same viscosity and the same speed 

• Equivalent light scattering diameter - diameter of a sphere giving the same 
intensity of light scattering as that of a particle, obtained by the light scattering 
method 
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laser diffraction 



Fig. 2 Schematic representation of the measurement methods for particle size and distribution 


• Sieve diameter - the diameter of the smallest grid in a sieve that the particle will 

pass through 

Powder mixtures can contain particles from a few nanometers up to a maximum 
of a few hundred microns that are divided into fractions. The quantity of particles 
by volume, weight, number, or surface area within each fraction represents the 
particle size distribution (usually Gaussian) which is simultaneously measured, in 
general, with the particle size, on the same equipment (e.g., laser diffractometer), 
under ISO conditions and terms (Fig. 2). It is important when considering a reported 
particle size distribution to note whether it is based on particle weight, volume, or 
number within each fraction. If it is a single component powder (as distinct from a 
blend of powders of different composition), volume and weight have the same 
meaning. If it is a mixture of powders of different composition, and therefore 
density, weight distribution can be misleading and therefore volume distribution 
is more meaningful. Particle number distribution, and by association surface area 
measurement, can be misleading as it gives an overrepresentation of the fine 
particles in terms of their actual volume contribution; however, it can be a good 
indication of potential sintering efficiency, since this is dominated by the fine 
particles. In practice, there are four types of particle size distributions according 
to the distribution mathematical functions: numerical, length, surface, and volu- 
metric distribution. Annex 1 to Annex 6 present a selection of ISO standards of the 
common measurement methods (Table 2). 

Particle Shape 

Microparticles and nanoparticles can be imaged with respect to particle shape, 
described by geometrical parameters. The main criterion for identifying different 
particle shapes is based on the ratio between the particle dimensions: length (1), 
width (w), and height (h). For each powder shape, there are specific shape descrip- 
tors (Table 3). 

The nanotechnology age cannot exist without high-tech microscopy. In the 
1600s, the first objects microscopically analyzed by Francesco Stelluti were bees 
(Stephan 2000). The next big leap forward was the invention of the electron 
microscope in the 1930s. Scanning electron microscopy (SEM) evolved into envi- 
ronmental SEM (ESEM) which does not require conductive coatings, allowing 
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Table 2 Representative methods for particle size measurement (Angelo and Subramanian 2008; 
www.nanotechnologyworld.co.uk) 


Particle 

Measurement 



size 

methods 

General remarks 

Standards 


Sieve analysis 


44-840 

mm 

Sieving using 

mechanical 

shaker 

Mechanical separation (dry conditions) 


5-44 mm 

Micro-mesh 

sieve 

Ultrasonic separation (dry/wet 
conditions) 

Image analysis 

0.1-100 

mm 

Optical 

microscopy 

(OM) 

Microscopic imaging by the means of 
specialized software 

ISO 13322 
(image analysis) 

0.01-10 

mm 

Scanning 

electron 

microscopy 

(SEM) 

0.001-50 

mm 

Transmission 

electron 

microscopy 

(TEM) 

1 nm-10 
mm 

Atomic force 

microscopy 

(AFM) 

Sedimentation analysis 

2-50 mm 

Sedimentation 

Gravitational separation of particles in 
liquid ambient 

ISO 13317 

(gravity 

sedimentation) 

Decantation 

Liquid pouring away from the 
precipitated PPs 

Pipette 

(Andreasen 

method) 

Samples removed from suspension, 
followed by drying and cooling for 
particles separations 

1-50 mm 

Gravitational 

Stokes diameter measurement by the 
means of settling velocity and 
concentration of the settling suspension 

0.05-50 

mm 

Turbidimetry 

Particle size measurement by the 
variation of the intensity of the light 
beam interacting the powders 
suspension 

ISO 13323 

(single particle 
light interacting) 

0.05-10 

mm 

Centrifugal 

Particle separation from a suspension 
by means of centrifugal acceleration 

(10 2 -10 6 g) 

ISO 13318 

(centrifugal 

sedimentation) 

3 nm-80 

mm 

Differential 
centrifugal 
(Vegad 2007) 


(< continued ) 


1 1 Powder Processing of Bulk Components in Manufacturing 495 


Table 2 (continued) 



Particle 

size 

Measurement 

methods 

General remarks 

Standards 

Elutriation analysis 




5-100 mm 

Elutriation 

Powder separation by the means of a 
rising liquid/gas column as a separation 
medium 

ISO 13317 

(gravity 

sedimentation) 

5-40 mm 

Roller air 

Powder separation by the means of a 



analyzer 

rising air column as a separation 




medium 



Permeability analysis 


0.2-50 

mm 

Fisher 

Sub-Sieve Sizer 

Particle diameter of surface area 
measurement by the means of the 
measurement of air speed through the 
powder mass 

ISO 10070 

2-100 mm 

Light scattering 
analysis 

Equivalent spherical diameter 
measurement of the PP sample as a 
diluted suspension 

ISO 13323 

(single particle 
light interaction) 

1-900 mm 

Light 

obscuration 

analysis 

0.15 

nm-6,000 

mm 

Laser diffraction 

Particle size measurement, available 
online, too, at the powder production 
plants (Dyroy et al. 2008) 

ISO 13320 (laser 

diffraction) 


ceramic particles and hydrated powders to be visualized without preparation and 
particle corrosion to be analyzed in real time. In parallel was the evolving trans- 
mission electron microscope (TEM) and its offspring scanning TEM (STEM) for 
improved resolution, high-resolution TEM (HR-TEM) and computed tomography 
TEM (CT-TEM). Since the 1980s, scanning probe microscopy (SPM) was a 
turning point, followed by the atomic force microscope (AFM), scanning tunneling 
microscope (STM), ballistic electron emission microscope (BEEM), chemical 
force microscope (CFM), feature-oriented scanning probe microscope (FOSPM), 
and scanning thermal microscopy (SThM). All these microscopy advancements 
have meant that microparticle and nanoparticle characterization has come of age 

(Fig. 3). 

Specific Surface Area (SSA) and Internal Structure of Powder Particles 

Powder mixtures often contain agglomerated and/or aggregated particles, espe- 
cially with nanoparticles and/or nanostructured particles, so particle size determi- 
nation is not so accurate. A more accurate morphological parameter used to 
quantify these powder agglomerates is specific surface area (SSA) which measures 
the total surface (including connected and terminating open pores) of a material per 

O O Q 

unit of mass [m /g], solid or bulk volume [nr /cm ], or cross-sectional area (Fig. 4). 

Due to various pore sizes (standardized by IUPAC) and shapes, a 
specific parameter distinguishes nanoparticles from coarser particles, namely, the 
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Table 3 Particle shape classification 


Particle classification versus shape 


By dimensions 

By descriptors 

One-dimensional (ID) 

1>>W 

l»h 


Acicular — needle-like 

s%s 

Irregular rod-like — lacking any symmetry 

Fibrous — regular or irregular tread-like shape 

Two-dimensional (2D) 
l@w>>h 


Dendritic branched, tree-like 

— \ 

Flake = irregular plate-like shape 

Lenticular lens-like shape 

Three-dimensional (3D) 
l@w@h 

o 

Spherical = global shaped 

o 

Irregular = lacking any symmetry 

/ "■) 

Angular = sharpen edged or roughly polyhedral 
shaped 

Granular = approximately equidimensional 
irregular shaped 


Porous 


volume- specific surface area (VSSA) that takes into consideration the SSA 
and the density of the powder material. Thus, nanoparticles are characterized by 
VSSA > 60 (nrr/cm ). The methods for porosity/S S A/VS S A determination are 
presented in Table 4. 

SSA is related to the porosity of the particles, both pore size and shape. 
According to the International Union of Pure and Applied Chemistry (IUPAC) 
standards, pores are divided in three main groups from the point of view of the size 
(Sing et al. 1985). 

• Micropores: <2 nm. Zeolites, activated carbon and metal organic frameworks 

• Mesopores: 2-50 nm. Mesoporous silica and activated carbon 

• Macropores: >50 nm. Sintered metals and ceramics 

Pore shapes are shown in Fig. 6 (Rouquerol et al. 1999). For industrial applica- 
tions, the usual technique to measure porosity is gas absorption (Fig. 5) (Lowell and 

Shields 1991). 

The standard SSA measurement method is BET absorption, developed by 
Brunauer, Emmett, and Teller (Brunauer et al. 1938), which is based on the 
following assumptions: 
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compact structure (SSA^ porous structure (SSA ii »SSA i ) 


Fig. 4 Relationship between the specific surface area and the internal structure of a particle: (a) 
particle shapes with constant mean size and various SSA: spherical (SAA^, irregular (SAA 2 ), 
dendritic (SAA 3 ), (b) compact and porous spherical particles (Redrawn from (Rouquerol et al. 1999)) 
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Table 4 Methods for porosity determination 


Porosity determination 

Remarks 

Gas adsorption 

Easy and frequently used technique for (0.4-50 nm) open pores, using 
BET method, Fig. 5 (redrawn from Lowell and Shields (1991)) 

Mercury porosimetry 

TEM 

Seldom used for interconnected pores (<5 nm) 

SEM 

Small-angle X-ray 
scattering 

Expensive technique, suitable for any open/closed pore size 

Small-angle neutron 
scattering 


• The gas molecules are infinitely absorbed by the pore surfaces in multilayers 
(Fig. 7). 

• Between layers there are no interactions. 

• The Langmuir theory (Langmuir 1916) can be applied to each adsorbed layer. 


Other Physical-Chemical Properties 

Transitioning from the micron-level to the nano-level, particles change their prop- 
erties due to the change in atomic arrangement, as shown in Fig. 8. 

The main consequence of this atomic change is a greatly increasing chemical 
reactivity, with different effects on the environment and on human health. 
Parameters have been defined for complex characterization of nanoparticles. 
Specifically: 


• The crystallinity. For particles in the 1-100 nm range, SSA changes may 
determine allotropic transitions for the same powder. 

• The redox potential, which can enhance or block electron transfer. 

• The photocatalysis activity which is enhanced by the nano -features of the particles. 

• The zeta potential which controls the colloidal stability especially when the 
nanoparticles are electrically charged. 


Technological Properties 

Automation of PM technology involves, among other steps, the mechanization of 
particle production/handling. Knowledge of particle technological properties 
allows the best control over their production, with benefits in subsequent processing 
steps. Specialized organizations in the field (ISO, MPIF, ASTM) provide standards 
for the technological property determination, as described below. 

Apparent Density 

The apparent density of a free-flowing mass of particles is calculated as follows: 



(i) 
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adsorptive gas 




2. migration into pores of adsorbent 


adsorbate (the gas adsorbed on the pore surface) 



M 


^ + -v v 


3. monolayer builds up of adsorbate 


Fig. 5 The stages of the adsorption process of gas into a porous material 

where: 

o 

p a - apparent density [g/cm ] 
m - powder mass [g] 

V - loose powder volume [cm 3 ] 

Hall/Carney funnels or Arnold meters are used for the apparent density deter- 
mination for micron- sized particles. For submicron powders, the same 
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Fig. 6 Pore shapes: 

(a) cylindrical, (b) conical, 

(c) spherical or ink bottle, 

(d) slits, (e) interstices 




rbate (the adsorbed gas on the pore surface) 


rbent (powder particle atoms) 


Fig. 7 BET absorption method considering multilayers of gas molecules 


grain boundary with 



nanometric crystalline grains with nanometric crystalline grains with 

atomic long-range order atomic short-range order 

Fig. 8 Comparison between: (a) microstructured materials and (b) nanostructured materials 


measurements develop on different standards, based on the mobility and aerody- 
namic characteristics of these powders, using complex equipment such as an 
aerosol mass spectrometer (AMS), aerosol differential mobility analyzer (DMA), 
nano-DMA, and tapered element oscillating microbalance (TEOM). The factors 
affecting the apparent density are the chemical composition, particle size and 
distribution, particle shape, and manufacturing process. 
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Tap Density 

The tap density of a mass of particles is calculated as follows: 

31 

cm 




where : 

o 

r t - tap density [g/cm ] 
m - powder mass [g] 

V - tapped powder volume [cm 3 ] 

Also, depending on the particle size, different standards provide specific mea- 
surement procedures. 

Flowability 

This characteristic represents the powder property to flow readily, uniformly, 
and rapidly into a die or mold cavity, and it is evaluated by the flow rate 
parameter: 



where : 

f = powder flow rate [s/g] 

t = required time by the powder mass to free-flow through the equipment [s] 
m = powder mass [g], usually 50 g is used for the standard determination 

The flow rate is influenced by the following factors: particle shape, size, distri- 
bution, humidity, and interparticle friction. 

Segregation Susceptibility 

Mixing and segregation (or demixing) phenomena have been researched from the 
point of view of the complex mechanisms and processes occurring. The most 
strongly influencing factors on the segregation susceptibility are particle size, 
shape, density, and chemical composition. 


Compressibility and Compactibility 

The compressibility , measured by green density, represents the ability of a mass of 
powder to decrease its volume under pressure. The green density is calculated by 
formula (4) and graphically represented by the compressibility curve (Fig. 9): 



( 4 ) 
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d 



Fig. 9 The determination of powder compressibility: (a) compressibility of CuNi water atomized 
particles (Courtesy of Dr. Adriana SORCOI, Technical University of Cluj-Napoca, Romania); 

(b) tool sets: 1 punch, 2 die with cylindrical or rectangular cavity, 3 green compact, 4 base support, 

(c) cylindrical cavity die, (d) rectangular cavity die 


where : 

o 

r g = powder green density [g/cm ] 
m = powder mass [g] 

V = green compact volume [cm 3 ] 

The following factors influence the powder compressibility: particle shape, size 
distribution, hardness, and lubricant addition to the powder mixture. The powder 
compactibility , evaluated by the green strength, represents the ability of that powder 
to be compressed into a compact of a specified strength, and it is calculated by the 
Rattler test/value as follows: 





1 1 Powder Processing of Bulk Components in Manufacturing 


503 


Table 5 Powder particle 
processing methods versus 
the aggregation phase 


Aggregation phase 

Processing methods 

Solid phase 

Coarse comminution 

Milling at room/cryogenic temperature 

Liquid phase 

Atomization 

Gaseous phase 

Gas phase reaction/precipitation 


A-B 

R = — x 100 [%] 



where : 

R = rattler value [%] 

A = green compact weight before the test [g] 

B = green compact weight after the test [g] 

Powder Production 
Introduction 

The quality of a PM product is strongly dependent on the quality of the powder 
particles. Partially affected by the 2008 global financial crisis, the PM sector has 
regained its place on the industrial market with a reduced number of players. 
A significant reason for this recovery was the powder recycling and recovery policy 
accompanied by appropriate legislation and educational assessments. 

Classification of Powder Processing Methods 

Methods to process powder particles can be divided into the following criteria: 

(a) The aggregation phase (Table 5) 

(b) The structure of the particles, tailored by the reaction kinetics during their 
processing (Fig. 10). Thus, microstructured metallic particles are obtained by 
equilibrium processes (continuous lines) and amorphous/nanostructured pow- 
ders by nonequilibrium processes (dot lines) (Froes et al. 1994, 1995). 

(c) Reactions during processing: mechanical, chemical, and electrochemical 
methods 

(d) The approaching route: the top-down and bottom-up methods for submicron 
powder production (Fig. 11) 

(e) Low-gravity (G) methods: conducted on the Moon (0.125 G) or Mars (0.369 G) 
providing more advantageous powder particle processing technologies than on 
Earth (1.0 G) for space applications 

(f) The production ranking: dividing the processing methods into primary and 
secondary routes (recycling, recovery) 

(g) The material type (Tables 6 and 7) 
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Fig. 10 Metal powder processing methods versus, particle structure. Micro structured metallic 
particles are obtained by equilibrium processes ( continuous lines 1, 2, 3) and amorphous/nano- 
structured powders by nonequilibrium processes ( dot lines 4 , 5, 6) (Froes et al. 1994; Froes 
et al. 1995) (Redrawn from (Suzuki et al. 1982)) 


T op -down 

(enfitiMd! ilz appro achmz . 



Fig. 1 1 Top -down and bottom-up approaches for submicron/nanoparticle processing 





1 1 Powder Processing of Bulk Components in Manufacturing 


505 


Table 6 Metal powder 
processing methods 


Selection criteria 

Synthesis process 

Mechanical methods 

Solid comminution 

Liquid comminution 

Chemical and electrolytic methods 

Metal oxide reduction 

Electrolysis 

Physical-chemical methods 

Gas phase precipitation 


Table 7 Ceramic powder processing methods 

Selection criteria 

Synthesis process 

Aggregation state (Ring 1996) 

Solid state 

Liquid state 

Gas state 

Other combined processes 

Methods (Shigeyuki and Rustum 2000) 

Mechanical 

Thermal decomposition 

Precipitation or hydrolysis 

Hydrothermal 

Melting and rapid quenching 


Powder Production Methods 
Metallic Powder Production 

Mechanical methods for metal powder production involve solid/liquid comminu- 
tion. In solid comminution, the initial solid particles keep the aggregation state 
during their processing by coarse comminution or milling. 

(a) Coarse comminution (cold stream) processing (Fig. 12) is recommended in the 
case of large brittle material feed (« 20 mm max.) that needs to be reduced to as 
fine as 30 mm by crushing/grinding. Similar processes develop in ball/vertical 
roller/hammer/vibration mills. 

(b) Milling. As an extension to comminution, submicron particles and 
nanoparticles are produced, with/without new structural phase synthesis, 
depending on the energy developed inside the mill by means of the milling 
media, which are usually balls (Fig. 13). 

Figure 14 shows mechanical milling (MM) with no mass transfer for homoge- 
nization. Figure 15 shows mechanical alloying (MA) with mass transfer phenom- 
ena. Reactive mechanical alloying (RM), cryomilling (CM), mechanical 
disordering (MD), and others are alternatives of the milling process. 

The particles obtained depend on the mechanical alloying processing parame- 
ters: milling time, tool material, milling ambient, milling temperature, milling 
speed, ball-to-powders ratio, milling atmosphere, and milling tools. 


506 


A. Ruys et al. 


0,7 MPa, 20°C : 0,1 MPa, 0"C 

I 



Fig. 12 Cold stream process: 1 initial granulated particles entrapped by the gas, 2 nozzle, 3 fine 
particles, 4 target, 5 detail of particle shape 



I 


(A+B)PPs 


(C)PPs 


Fig. 13 Milling process: 1 mill, 2 milling balls, 3 , 4 initial particles A and B, respectively (with 
initial morphology: size and shape), 5 main disk of the mill, 6 milled particles (particles - final 
A+B powders with final morphology: size, shape), 7 milled particles (new particles, C, processed 
from A+B) 


Liquid comminution or atomization consists of dispersing a molten metal stream 
into droplets by means of a fluid (liquid/gas) or solid support (Fig. 16). 

The droplets solidify into particles of various morphologies, which are con- 
trolled by the processing parameters, as shown in Fig. 17. 

Chemical and Electrolytic Methods Oxide reduction via a reducing atmosphere 
is a frequently used industrial process to manufacture fine (<150 mm) and low 

Q 

density (0.5-3 g/cm ) metallic powders (Hoganas, Ruthner, and Lurgi (www. 
hoganas.com; Ruthner 2012; Danninger et al. 1990) (Fig. 18). 

The carbonyl process provides better quality processed metal particles (chemical 

Q 

purity, apparent density 4. 1—4.5 g/cm , particle size 2.4-8 mm; Ni, Fe, Co, Cr, W, 
Mo) in comparison to oxide reduction (Fig. 19). 

The electrolysis method produces dendritic particles with higher purity and 
greater sinterability than the other processing routes (Fig. 20). 
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Fig. 14 Mechanically milled mixture of hydroxyapatite and TiH 2 powders (Courtesy of Univ. 
Prof. Dr. Oana GINGU and Assoc. Prof. Dr. Ileana PASCU, University of Craiova, Romania) 



Material systems 

MA process development 

early stage medium stage final stage 

1 

ductile ductile 

CCyO 

A B 

S' \ ^ cG 

^ ' ncor P orat ' on wor ^ hardening 

flattening conglomeration comminution 

in lamelar structured PPs in smaller PPs 

2 

ductile brittle 

ccy£j 

A B 

\ L 

f \ incorporation f— ■ *0* work hardening k -_J 

Q3D <6 } 

A - flattening dispersion of b comminution 

B - comminution comminuted pps j n smaller PPs 

into A matrix 

3 

brittle brittle 

A B 

<2P - rff..., 

Vi 

A - comminution _ . 

B - comminution + incorporation in A d A 


Fig. 15 The mechanical alloying process 
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Fig. 16 Atomization: 

(a) water/gas atomization 

(. 1 stream of molten material, 
2 nozzle, 3 gas inlet, 

4 solidified droplets which are 
the metal particles); 

(b) centrifugal atomization 

(. 1 stream of molten material, 
2 nozzle, 3 rotative disc, 

4 solidified droplets which are 
the metal particles) 



Fig. 17 (a) (Left) Gas 
atomized pure magnesium 
particle (Courtesy of Univ. 
Prof. Dr. Jose Manuel 
TORRALBA, IMDEA- 
Materials, Universidad 
Carlos III of Madrid, Spain), 
(b) {right) Water atomized 
Cu90Nil0 particles (Courtesy 
of Dr. Adriana SORCOI, 
Technical University of 
Cluj-Napoca, Romania) 
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Fig. 18 Molybdenum 
particles made by hydrogen 
reduction of the oxides 
(Courtesy of Univ. Prof. 

Dr. Herbert DANNINGER, 
Vienna University of 
Technology, Austria) 



WD 

mag 

HFW 


11.5 mm 

50GQ* 

29.3 urn 



spot pressure 
3.0 9 44*- 4 Pa 


Quanta 2C0 


Fig. 19 Iron particles by the 
carbonyl process (Courtesy of 
Univ. Prof. Dr. Herbert 
DANNINGER, Vienna 
University of Technology, 
Austria) 



Physical-Chemical Methods Gas phase precipitation generates submicron 
metallic powders from metal compounds with a transient gas phase toward solid 
powders. 

Ceramic Powder Production 

Advanced ceramic applications require specific micro structural characteristics: 
small defect size, homogeneous dispersion of the structural phases, uniform phase 
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Fig. 20 Electrolytic copper 
particles (Courtesy of Univ. 
Prof. Dr. Herbert 
DANNINGER, Vienna 
University of Technology, 
Austria) 



composition of the grains boundary, and ability to work at room temperature and 
high temperatures. Some of the processing methods for ceramic powders are listed 
in Table 7 above. 

• Mechanical methods (comminution/milling route). This approach is more effi- 
cient for ceramic powders than for metallic powders, due to the inherent 
brittleness of ceramics (Table 6 and Shigeyuki and Rustum 2000). This method 
is a pathway to grain size refinement of the ceramic end product, increasing the 
specific surface area by decreasing particle size, with structural changes occur- 
ring at room temperature. 

• Thermal decomposition method (chemical vapor deposition (CVD) route). Short 
reaction time and low decomposition temperature are the main advantages of the 
CVD method for producing high purity powders of thin coatings, with tailored 
properties related to the application using gas/vapor phase reactants. 

• Precipitation or hydrolysis method (solgel route). This method provides good 
control of powder chemical purity, porosity, crystal size, particle size distribu- 
tion, and other physical-chemical properties (Vegad 2007). 

• Hydrothermal method (hydrothermal (HT) synthesis route). This method, and its 
alternatives, aims to crystallize anhydrous ceramic powders directly from solu- 
tion, producing powders with outstanding properties: submicron grain size, 
narrow particle size distribution, weak agglomeration effect, crystalline or 
amorphous structure, and very high reactivity with benefits for sinterability. 

• Other combined methods (spray pyrolysis (SP) route). This method allows 
powder processing from microparticles to nanopowders with precise morphol- 
ogy (spherical, hollow, porous, fibrous) and precise chemical composition 
(ceramic and ceramic-based composites). 
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Fig. 21 Water atomized 
Cu90Nil0 powder, in 
hydrogen-reduced state 
(Courtesy of Dr. Adriana 
SORCOI, Technical 
University of Cluj-Napoca, 
Romania) 



Powder Treatments 

Subsequent to the powder production stage, for specific cases, powder treatments 

allow improvements in their properties. 

Mechanical Treatments 

The aims of these treatments are: 

• Powder cleaning to remove superficial solid or gaseous residues by a degassing 
process (cold/hot, static/dynamic methods). 

• Powder grinding for comminution. This can be carried out at the powder 
production facility for online correction of the particle size and distribution, 
during the processing stage. 

• Powder blending and mixing (in wet or dry conditions) to prepare homogeneous 
powder mixtures for the compaction stage. 

Heat Treatments 

Some powder properties can be improved by means of heat treatments. A fre- 
quently used process is annealing in hydrogen or other reducing atmosphere, which 

enables: 

• Improvement of the powder chemical purity (Fig. 21) 

• Reducing the work hardening effect on metal powders caused by the milling 
process 

• Morphological changes of the powder particles 

• Mechanical (strength, ductility) and structural (grain size, impurity segregation) 
beneficial changes resulting from optimal powder synthesis (Fig. 9) 
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Fig. 22 Criteria for producing ideal powders for use in powder-forming technology 


Concluding Comments: Powder Characterization, Production, 
and Treatments 

The above discussion on powder characterization, production, and treatments is 
intended to give guidance for PM technologists to design the optimal infrastructure 
for ideal powder research, development, and production facilities, optimized with 
respect to forming and sintering outcomes (Fig. 22). 


Powder Forming: Consolidation of Powder into Green Compact 
for Subsequent Sintering 

Dry Forming 

Die Pressing 

Die pressing is the predominant powder-forming process with metals. In contrast, 
wet forming is the predominant powder-forming process with ceramics. With 
metals, more pressing pressure produces more consolidation, whereas with 
ceramics, more pressure usually causes problems such as delamination and 
nonuniform firing shrinkage. This is why die pressing of ceramics involves only 
low to moderate pressures and is generally used only with thin simple shapes, such 
as tiles. 

In its simplest form, uniaxial die pressing involves relatively dry powders, 
containing a minimum amount of liquid and some binders. The powders are placed 
in a uniaxial hollow die with two opposing pistons or pressing plates (Fig. 9). 
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Table 8 Die-pressing 
pressures for PM forming 


Metal PM product 

Forming pressure (MPa) 

Porous filters 

40-70 

Cermets and carbides 

70-200 

Refractory metals 

70-200 

Porous bearings 

150-350 

High density Cu and A1 

250-275 

Steel machine components 

300-700 

High density steel 

700-1,700 


The die cross section is commonly a simple shape: circular, square, or rectangular. 
Pressures are typically in the 20-50 MPa range for ceramics and range widely for 
metals (Table 8, Fig. 9). Pressure is applied to the two opposing pistons to 
consolidate the powder into a green body. Die pressing metal powders is a simple 
process in most cases because the plastic deformation of the particles enables 
excellent powder flow, cold welding, and in some cases almost complete consoli- 
dation by cold welding. 

For ceramics, consolidation by uniaxial die pressing is much more problematic. 
Theoretical particle packing models predict that the highest packing density possi- 
ble for mono-sized non-deformable spheres is 74 %, in a hexagonal close-packed 
array. The reality is that ceramic powders ordinarily attain a green density of only 
about 50 % from die pressing. Higher green densities can be achieved by 
gap-grading (mixing coarse medium and fine powders in appropriate ratios), vibra- 
tory pressing, and the careful use of lubricating binders. However, in practice, these 
strategies are rarely used except for a few specialty advanced ceramics. 

The main advantages of uniaxial die pressing are no need for drying, simplicity, 
and ease of automation. With a fully automatic press, and multiple cavity molds, 
this process can produce hundreds of components per minute. 

Unlike metal powders which deform plastically under pressure, the main disad- 
vantage of uniaxial die-pressing ceramics is the nonuniform particle packing due to 
pressure gradients generated in the powder compact during pressing. The thicker 
the pressed part, and the more complex its shape, the more significant this problem 
becomes. Nonuniform particle packing translates into nonuniform firing shrinkage, 
which at best causes distortion, and at worst cracking. Cracking during sintering can 
involve obvious destructive macrocracks causing immediate rejection or invisible 
microcracks which may cause early catastrophic failure some time later during 
service life. 

The main application of uniaxial die pressing for ceramics is in fabricating 
ceramic tiles. The major application of uniaxial die-pressed conventional ceramic 
tiles is porcelain, terra cotta, stoneware, and heavy clay ware tiles for floor, wall, and 
roofing claddings, specialty curtain-walls for high-rise buildings, and die-pressed 
construction bricks. The major application of uniaxial die-pressed advanced 
ceramic tiles of alumina, silicon carbide, and boron carbide is ballistic armor and 
wear-resistant tiles for the mining industry. Industrial cermets such as tungsten 
carbide and titanium carbide are also commonly formed by uniaxial die pressing. 
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Ceramic tiles for the building industry have traditionally been no larger than 
about 300 x 300 mm in size. In recent years sizes up to 600 x 600 mm have 
become commonplace. In the 1990s, commercial tiles up to 1,000 x 1,000 mm in 
size came onto the market, from Germany (Buchtal), and China (Eagle Brand). The 
problems with increasing tile size are threefold: 

• Extremely large press capacities are required. Typically ceramic tiles are pressed 
at pressures in the 20-50 MPa range, both ceramics for the building industry and 
advanced ceramics such as alumina, silicon carbide, and boron carbide. Thus, a 
5,000-t press is needed for 1,000 x 1,000mm tiles, for example. Such a press 
costs many millions of dollars to purchase; is large, heavy, and very expensive to 
transport and install into a standard factory; and has a huge footprint on a 
production floor. 

• Difficulty in supporting the tile during sintering. The larger the tile, the larger the 
absolute firing shrinkage. For a 15 % firing shrinkage, this equates to 150 mm 
linear dimensional change in a 1,000 x 1,000 mm tile. The tile support in the 
kiln needs to allow significant movement during shrinkage without any chemical 
interaction taking place between support and tile at the 1,100 °C+ sintering 
temperatures. Various strategies are utilized such as ceramic balls, ceramic 
granules, and other proprietary substrate systems. 

• Transportation and handling. The larger the tile, the higher the cost and the 
higher the risk of breakage during shipping and installation. 

One of the most useful innovations in recent years in ceramic powder forming 
is vibratory pressing. This typically involves a pneumatic press operating on com- 
pressed air, which pulses the pneumatic pressure during pressing, thereby vibrating 
the consolidating powder during the pressing process. This has two key benefits: 

• Elimination of powder packing gradients. 

• Significant reduction in required pressing pressure. Instead of 20-50 MPa which 
is usual for die-pressed ceramics, pressing can be achieved at pressures as low as 
1-5 MPa. Thus, instead of a 5,000 t hydraulic press being required for 1,000 x 
1,000mm tiles, they could be pressed with a 100 t vibratory press - a very much 
smaller, cheaper, and more portable unit. 

Thus, vibratory pressing is most useful when pressing very large tiles or 
complex- shaped ceramic components. 

When manufacturing advanced ceramics for specialty applications with 
serious consequences for failure, such as ballistic armor and wear-resistant 
linings for the mining industry, it is critical to ensure that the component 
contains no microcracks, which are most commonly caused by pressing nonunifor- 
mities. X-ray radiographic examination, once primarily the domain of metal fatigue 
inspection, is a common QA procedure for inspecting die-pressed advanced ceramics. 
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Cold Isostatic Pressing (CIP) 

Cold isostatic pressing (CIP) involves uniform pressing from all directions. It is 
widely used for powder forming of ceramics. For example, three billion spark 
plug insulators are made every year by CIP. Metal powder CIP forming applications 
are much less common. Most powder metallurgy products are pressed by die 
pressing, and CIP is generally only used in the rare event that very complex 
shapes are required and powder injection molding and die casting are not available 
options. 

CIP is usually done by placing the powder in a thin- walled deformable “bag” or 
“mold” and immersing in an oil chamber which is subsequently pressurized. 
Thus, consolidation occurs by immersing the powder (held in an impervious thin- 
walled mold) in a pressurized fluid. Typically pressures in CIP mass production are 
in the order of 70 MPa, although pressures up to 400 MPa are not uncommon. 
Molds are usually thin walled and made of elastomers such as latex, silicone, 
neoprene, nitrile, or polyurethane. Another manifestation of CIP is post-CIP in 
which a component is pre-pressed by uniaxial die pressing, and then the green part 
is either coated in elastomer or placed in an elastomer bag (vacuum de-aired) and 
then CIPed. 

A CIP is basically a fluid-filled pressure vessel with a mechanism enabling rapid 
automated opening and closing, a high-pressure pump, and a system for gentle 
depressurization. The automated CIP has multiple racks or arms for supporting 
numerous molds in one production cycle. 

While uniaxial die pressing is a widely used and effective means of producing 
PM metals and ceramic tiles, fabrication by pressing of any ceramic shape 
more complex than a simple tile usually requires CIP. The main benefit of CIP is 
that the nonuniform particle packing due to pressure gradients generated in the 
powder compact during pressing is much less significant than for uniaxial die 
pressing, although this needs to be seen from a viewpoint of relativity. For a simple 
thin tile, uniaxial die pressing should give a uniform result. For a complex shape 
such as a spark plug insulator, CIP is the only pressing process that is viable. Of 
course, wet-forming methods are also viable for complex shapes, but CIP has the 
advantage of being long established and well suited to automation, with large 
throughput rates. Multiple powder-filled molds can be loaded into the CIP for 
each pressure cycle. 

For solid components, all that is required is a thin-walled elastomer mold. The 
elastomer mold wall needs to be thin so as to ensure maximum pressure transfer to 
the component. Moreover, the elastomer mold wall needs to be of uniform thick- 
ness so as to ensure uniform pressure transfer to the component. For hollow 
components, the elastomer mold provides the external wall dimensions and shape 
of the component, and a removable metal inner mandrel is used which provides the 
inside wall dimensions and shape of the component. The mandrel is usually made of 
hardened steel to ensure durability after multiple removals from ceramic powder 
components with their abrasive surfaces. 



516 


A. Ruys et al. 


Some of the key advantages of CIP for ceramics include: 

• Low mold cost which makes it ideal for complex parts with small production 
runs. For post-CIP, there is no CIP mold cost, simple elastomer bags, or coatings 
suffice. Of course, the uniaxial pre-pressing stage requires molds. 

• There is no size limitation. Therefore, it is ideal for very large parts. Large 
refractory components heavier than 1 t have been produced by CIP. 

• Unlike uniaxial die pressing, CIP produces uniform shrinkage in components 
with nonuniform cross sections, resulting in less cracking during sintering. 

• Short cycle times. This is mainly because no drying or binder burnout is required. 

Some of the key disadvantages of CIP for ceramics include: 

• Poor dimensional control. A common solution to this is to CIP oversized 
components, or blanks, followed by green machining. This requires the use of 
binders and carries the risk of cracks introduced from green machining. 

• It is suitable for relatively simple shapes only. For highly complex shapes, 
powder injection molding is a better choice. 

• Powders must have good flowability. This generally means ceramic powders 
must be spray dried, which adds an extra cost to the process. Mold vibration 
helps for mold filling, but if complex-shaped molds are involved, spray-dried 
powders are essential. 

Thus, CIP is an appropriate choice for mass production of advanced ceramics of 
complex shape or extremely large ceramic components. The number one product is 
spark plug insulators. Other examples of commercial ceramic products mass produced 
by CIP include powerline insulators, specialty refractory components, oxygen sensors, 
specialty wear parts for pumps, ball milling balls and grinding media, and tableware. 

CIP is not to be confused with hot isostatic pressing (HIP) which is a densifica- 
tion process (section “Hot Isostatic Pressing” discussed later). 


Wet Forming 

Introduction to Wet Forming 

Powder wet forming dates back thousands of years to the invention of pottery - 
plastic forming of clay ware. The potter’s wheel was invented in ancient Mesopo- 
tamia more than 4,000 years ago. Early pottery was fired in bonfires, and kilns were 
invented much later. In the last century, many novel powder wet-forming methods 
have been reported. The most widely used methods are extrusion, plastic forming, 
slipcasting, and injection molding. Extrusion and plastic forming are considered 
low-technology powder-forming methods and are almost exclusively used for 
clay ware. The potter’s wheel is the most common type of plastic forming commer- 
cially, and there are many automated methods that utilize the principle of the 
potter’s wheel, machining the rotating plastic clay body, for example, in making 
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tableware. Extrusion of plastic powder slurries is widely used to make clayware 
building bricks and pipes. Extrusion is also used to make advanced ceramics such as 
alumina furnace tubes, hollow insulators, and thermocouple sheaths, using a highly 
optimized water/powder/deflocculant/binder mix. 

One of the most important issues with powder wet forming is drying. Drying 
shrinkage can be more than 10 % in many cases, and even when large amounts of 
binder are used, this can cause cracking during drying, both visible cracks and 
invisible cracks which manifest only later, either after sintering or after some time 
in service. Preventing cracking requires attention to the following issues: 


• Add binders to the slurry (a powder-liquid suspension is known as a slurry). 

• Maximize solids loading to reduce shrinkage. 

• The finer the particles, the greater the drying shrinkage (and conversely the 
better the sintering). 

• A broader particle size distribution reduces shrinkage. 

• Slower drying times reduce cracking. 

• The larger the component, the larger the overall shrinkage, and therefore the 
greater the cracking risk. Thus, wet forming large ceramics is highly problematic. 


Several high-technology drying solutions have recently begun to be introduced: 
controlled humidity drying chambers, drying in ethylene glycol, and supercritical 
drying (SCD) using C0 2 . All three of these techniques can greatly reduce cracking 
during drying; however, they are slow and not widely used commercially. 

There are a number of high-technology wet-forming methods that warrant 
further discussion here, both from the viewpoint of their novelty and their potential 
to push the frontiers of powder forming into new possibilities. 


Slipcasting 

Slipcasting is a powder-based wet-forming method which has been used for a long 
time in the traditional ceramic industry for the manufacture of sanitary ware and 
tableware. This process has been also used for the manufacture of technical 
ceramics, e.g., alumina. This method involves a filtration process which separates 
powders from the solvent using a porous medium. In a typical example of this 
process, a suspension, usually a water-based suspension (slip), is poured into a 
porous plaster mold which, by its porosity, creates capillary forces and removes 
liquid from the suspension. A flow of solvent causes the particles to create a 
consolidated layer or a filter cake on top of the mold surface. The cast formation 
time is reduced if high solids loaded slurry is used. Although less concentrated 
ceramic slurries give better control of the thickness of the cast layer, the chance of 
segregation among particles increases, especially when composite slurries are used. 
Once the required layer thickness is obtained, the process is stopped, either by 
removing the excess slurry or by letting the casting cakes meet each other in the 
center of the piece (solid slipcasting). Generally, the casting shrinks in the mold 
during air-drying enabling easy removal from the mold for further drying or 
sintering (Adcock and McDowall 1957; Tiller and Tsai 1986; Lewis 2000). 
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The main advantage of slipcasting is that it is a low-cost process with the 
capability to produce complex geometries with high homogeneity. Furthermore, 
the mold material is relatively cheap. However, the durability of the molds is an 
issue which limits the application of this technology for mass production, requiring 
many molds and large plant area. 

Pressure slipcasting or pressure casting solves some of the problems with the 
slipcasting process. In this process, instead of using plaster molds, molds of polymeric 
materials are used which have a high porosity consisting of larger pores that do not 
give the same capillary forces but require an externally applied pressure to drive the 
filtration process. The higher the applied pressure (<40 bar), the faster is the casting 
cycle. In conventional slipcasting, the capillary forces correspond to a pressure of 
1-2 bar. Using pressure can help to dry the parts immediately and cast pieces can be 
demolded, and a new casting cycle can be started. The durability of plastic filters is 
much better than plaster and can help to achieve parts with better tolerances. 

Centrifugal slipcasting is a similar process in which ceramic particles in a 
suspension are forced toward a mold cavity which can be nonporous, and the top 
solvent is poured off after the casting. This process can improve the packing density 
of the filtered cake and improve the quality and reliability of the final product 
(Steinlage et al. 1996; Huisman et al. 1994). 

Tapecasting 

Tapecasting, as the name suggests, is used to form powder slurries into sheets that 
have a large surface area and a thin cross section. These tapes are widely used to 
produce electronic components and packages (Ho watt et al. 1947; Mistier 1990). 
Tapecasting arose in the 1940s as a method for fabricating thin layers of dielectric 
materials for capacitors and piezoelectrics. Unlike slipcasting, tapecasting slurries 
are usually nonaqueous. The slurry is fed through a device known as a “doctor 
blade” which is either stationary over a moving substrate or moves over a stationary 
substrate. The slurry is screeded by the blade so that it spreads out as a thin layer of 
slurry due to the relative movement of doctor blade substrate, then dries rapidly by 
evaporation. Thickness is controlled by adjusting the gap between the doctor blade 
and the substrate. Tapecasting involves the following stages: 

• Ball milling the slurry 

• De-airing the slurry 

• Filtering (to remove lumps) 

• Tapecasting using the doctor blade 

• Drying 

Some of the key parameters involved in optimizing tapecasting include: 

• Particle size and size distribution of the powder 

• Chemical composition of the solvent (suspending liquid medium) 

• Composition and concentration of the deflocculants, so as to maximize solids 
loading 
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• Composition and concentration of the binders, which enhance the green strength 
of the cast tape 

• Composition and concentration of the plasticizers, which enhance the flexibility 
of the cast tape 

• Solids loading 

• Doctor blade clearance 

• Relative velocity of doctor blade and substrate 

• Drying cycle 

There have been many patents filed on tapecasting since the original one in 1952 
(Ho watt 1952). Tapecast alumina is now used for many cutting-edge microelec- 
tronic and biomedical technologies, for example, fabricating advanced multi-array 
microelectrodes for specialized applications such as the bionic eye (Guenther 
et al. 2013). 

Powder Injection Molding 

Powder injection molding is best suited for the high-volume near-net- shape pro- 
duction of small metal or ceramic parts starting from powders. The use of metal or 
ceramic powders enables a wide variety of ferrous and nonferrous alloys in powder 
form to be used as well as many advanced ceramics in powder form. The material 
properties (e.g., strength, hardness, wear resistance, corrosion resistance, tempera- 
ture resistance) can be close to those of wrought metals or ceramics made by other 
processes. Since no melting of the functional material occurs in the PIM process 
(unlike metal casting processes), the total heat input of the process is much lower 
compared with casting processes. High-temperature alloys can be used without any 
adverse effect on tool life. Metals commonly used for PIM parts include low-alloy 
steels, high-speed steels, stainless steels, cobalt alloys, tungsten alloys, nickel 
alloys, and titanium alloys (Piotter et al. 2007, 2008; German et al. 1991; Blackburn 
and Wilson 2008). Ceramics such as alumina, zirconia, silicon carbide, silicon 
nitride, and aluminum nitride have been successfully demonstrated and commer- 
cially produced by the PIM process, a prominent example being ceramic turbo- 
charger rotors, now mass produced by PIM on a large scale. Composites of 
ceramics and metals can also be fabricated using this process (Ye et al. 2008). 

Powder injection molding parts are found in numerous industry sectors, includ- 
ing automotive, aerospace, oil and gas, consumer products, medical/dental, and 
telecommunications. 

Depending on the type of material, powder injection molding is sometimes 
called metal injection molding (MIM) or ceramic injection molding (CIM). As 
shown in Fig. 23, the process uses fine powders of ceramics or metals (typically 
<20 pm dia.) that are heated and mixed with thermoplastics polymer binders, 
waxes, or other organics to form a feedstock (Supati et al. 2000). Some additives 
are added to keep the dispersion of particles and plasticity of the component. After 
cooling, the mix is granulated into the form of pellets with a nominal size of 
1-5 mm and fed into an injection molding machine. The granulated mixture is 
heated to a temperature below 250 °C to form a flowable paste and injected into a 
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Fig. 23 Flowchart of powder 
injection molding process 
from powders to final product 
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closed mold to form the “green” compact. The equipment which is used for PIM is 
very similar to plastic injection molding. However, in PIM the material used for the 
mold and extruder is more critical because of the risk of erosion from the abrasive 
binder-powder mix. Unlike conventional powder metallurgy processes where the 
organic content is ~1 wt%, the binder content of the green PIM compact may be as 
high as 40 wt%. The green part is cooled and removed from the mold. Chemical 
and/or thermal methods are used to dissolve and remove the majority of the binder. 
The remainder is removed from the porous compact by heating at an intermediate 
temperature, then the temperature is elevated to the sintering temperature. The PIM 
process can be used for mass production of parts with much greater complexity than 
for press-and-sinter powder metallurgy parts, but smaller in size. 

Design rules for plastic injection molding are still applicable when the parts are 
to be manufactured by PIM. However, there are some exceptions or additions, such 
as the following (www.custompartnet.com): 

• Wall thickness limitation: Wall thickness has to be minimized and be kept 
uniform in order to facilitate binder removal and minimize warpage or distor- 
tion. In normal PIM processing, a thickness below 7 mm would help to make 
crack-free and uniform structures. However, very thin- walled sections need a 
support to avoid deformation from the thermal cycle. 

• Draft: Although using draft in mold design for plastic injection molding is 
essential, many PIM parts do not require any draft. The polymer binder used 
in the powder material releases more easily from the mold than most injection 
molded polymers. Also, PIM parts are ejected before they fully cool and shrink 
around the mold features because the metal powder in the mixture takes longer 
to cool. 

• Sintering support: During sintering, PIM parts must be properly supported or 
they may distort as they shrink. By designing parts with flat surfaces on the same 
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plane, standard flat support trays can be used. Otherwise, more expensive 
custom-made supports may be required. 


Direct Coagulation Casting 

Direct coagulation casting (DCC) is a near-net- shape fabrication method for mak- 
ing ceramic components from concentrated aqueous powder suspensions (Balzer 
and Gauckler 2003; Baader et al. 1996; Prabhakaran et al. 2009). In DCC, the 
powder suspensions are cast into a mold and they set through an in situ coagulation 
or destabilization of the suspensions by producing acid, base, or electrolyte from 
water soluble precursor molecules present in the suspension medium. Setting via 
generation of ammonia (base) and electrolyte like ammonium bicarbonate and 
ammonium carbonate from urea by urease catalyzed in situ hydrolysis is a common 
approach, well studied for most of the ceramic powder systems, such as alumina, 
zirconia, silicon carbide, and silicon nitride (Prabhakaran et al. 2009). By a pH shift 
near the isoelectric point, the ammonia can coagulate the slurry prepared in acidic 
pH. Ammonium bicarbonate and ammonium carbonate coagulate the powder 
suspensions by compressing the electrical double layer thereby promoting coagu- 
lation. In situ generation of acid from acid anhydrides, esters, lactones, hydroxyl 
aluminum acetate, and MgO has also been reported for destabilization of powder 
suspensions. 

Although the preparation of the slurry for this process is very similar to gelcasting, 
the mechanism for gelation is different. There are some limitations corresponding to 
this method, such as premature coagulation of the slurry, low green density, cracking 
of the parts via drying, or sintering. This process is still in the research stages and has 
not been commercially taken up for manufacturing of ceramics. 


Gelcasting 

The gelcasting process dates back to 1990s when Omatete and Janney from Oak 
Ridge National Laboratory, USA, came up with a new slurry-based manufacturing 
process for making ceramic components in which conventional wet processing 
techniques were combined with polymer chemistry (Janney and Omatete 1989; 
Omatete et al. 1997; Yang et al. 2011) (Fig. 24). 

In this technique, a ceramic powder and a monomer solution are mixed together 
to form a slurry. A high solids loading is possible with a minimum amount of 
polymer. The slurry should be highly loaded with the ceramic powder and maintain 
high flowability for the casting process. Therefore, a suitable slurry formulation is 
necessary. Mechanical milling is essential to achieve high solids loading. The slurry 
is degassed under a low pressure and poured into a nonporous mold (unlike 
slipcasting which uses a porous mold) after addition of a chemical initiator. Over 
time, a polymer gel network is formed in situ to the shape of the mold. On 
polymerization, all ceramic particles are immobilized because of the formation of 
a cross-linked gel network. After removal from the mold, the part is dried, the 
polymer is burned out, and the sample is sintered to a high density. Gelcast green 
bodies also have a high green strength, which allows them to be machined easily 
before sintering (Janney et al. 1998). 
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Fig. 24 Flowchart of powder injection molding process from powders to final product 


Gelcasting overcomes many limitations associated with other ceramic 
processing methods, such as injection molding (e.g., long debinding time, flaw 
generation, or distortion via binder removal) and slipcasting (e.g., slow casting 
rates, insufficient strength for green machining), and is suitable for the complex 
shape fabrication by offering short processing times, high yields, high green 
strength, and low-cost machining. Unlike the powder injection molding process 
in which a high amount of binder (15-20 wt%) is used and long debinding steps are 
required, the polymer content in the dried gelcast samples is 2-5 wt%. This amount 
of binder can be easily removed by pyrolysis without a significant effect on 
distortion or cracking of the parts (Yang et al. 2011; Janney et al. 1998). 

One of the advantages of the gelcasting process is the variety of materials usable 
for molds. Wood, plastic, wax, and metals such as aluminum and stainless steel 
have been used for gelcasting molds. However, some plastic and metallic materials 
tend to react or interfere with the gelation process which must be avoided. 
Demolding agents need to be used to ensure fast and easy part removal (Yang 
et al. 2011; Janney et al. 1998). 

Gelcasting has been used for many years for manufacturing of ceramic compo- 
nents. Gelcasting metal parts has also been successfully demonstrated. However, 
since metals can be more easily processed by casting from a melt, forming, or 
conventional powder metallurgy and machining, gelcasting of metals has not 
received as much attention as ceramics (Omatete et al. 1997; Yang et al. 2011; 
Janney et al. 1998; Huang et al. 2007; Stampfl et al. 2002). 

Gelcasting has been combined with other methods such as tapecasting, injection 
molding, centrifugal casting, and stereolithography (Stampfl et al. 2002). 
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Gel-tapecasting is considered the main manufacturing method for large-scale 
ceramic sheet production with thickness between 0.1 and 1.0 mm 
(Yu et al. 2004). By combining injection molding as a fully automated method 
and gelcasting, the advantages of the two processes can be merged to improve 
productivity and reliability of ceramic parts. The new process is called colloidal 
injection molding of ceramics (CIMC) (Huang et al. 2004). In this process, pressure 
is used to induce a gel reaction, which can avoid the effects of the temperature 
gradient on the gelcasting process. The efficiency and reliability of the product were 
greatly improved due to more homogeneous solidification which reduced the 
internal stresses from the colloidal-forming process and prevented the formation 
of microcracks in the green body. 

In another method, gelcasting is combined with centrifugal casting. In this 
method, the slurry is transferred to a centrifuge machine with an embedded mold, 
after addition of the initiator and catalyst. The centrifugal force not only can form 
the ceramic slurry into the mold cavity but also can eliminate air bubbles and 
accelerate the gelation process. Therefore, no degassing is required and the reli- 
ability of parts has been shown to be improved. Moreover, very fine features of the 
mold can be filled by slurry by the action of centrifugal force. However, the 
centrifugal force applied for this process is much lower than for conventional 
centrifugal slipcasting processes, as the role of the force is to shape the slurry and 
not to cause powder segregation (Maleksaeedi et al. 2010). 

In a novel concept, gelcasting is merged with 3D stereolithography, as a solid 
free form fabrication method. In this case, dense ceramics directly from a CAD file 
have become a reality (Zhou et al. 2010). The gel formula should be adapted for a 
photopolymerization process (Chartier et al. 2002). 

Some limitations with the gelcasting process are as follows (Omatete et al. 1997; 
Yang et al. 2011): 

• Toxicity of the gelcast materials 

• Inadequate strength of the parts after gelation and via demolding which can 
cause severe distortion or cracking if the part is not well supported 

• Difficulty to achieve high solids content of gelcast slurry for some specialty 
ceramics 

• High susceptibility for cracking during the drying stage 

• Not easily automated, preventing gelcasting being utilized in the industrializa- 
tion of high-performance ceramics 

Thixotropic Casting 

Thixotropic casting is a ceramic wet-forming process derivative of solid slipcasting 
that utilizes a casting slip that is fluid under vibration and plastic in the absence of 
vibration (Ruys and Sorrell 1999; Chesters 1973; Norton 1974). Thixotropic casting 
is the powder-forming method that gives the highest green densities of all known 
powder-forming methods. Green densities up to 82 % are achievable. Solids 
loadings of up to 78 vol.% have been reported for thixotropic casting (Legrand 
and Da Costa 1989). Most powder-forming methods give green densities in the 
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40-60 % range. Theoretical particle packing models state that the highest 
packing density possible for mono-sized spheres is 74 %, in a hexagonal close- 
packed array. The reality is that ceramic powders ordinarily attain a green 
density of only about 50 % from die pressing and slipcasting. Even lower from 
wet-forming processes involving large amounts of binder removal, such as injection 
molding. No known method other than thixotropic casting can give green densities 
above 70 %. 

Thixotropic casting is unsuitable for powders with particle sizes below one 
micron, for example, clays or submicron advanced ceramic powders. The high 
surface area of colloidal-based slurries limits the maximum solids loading attain- 
able (Sushumna et al. 1991). 

Secondly, thixotropic casting gives by far the lowest drying shrinkage of any 
known wet-forming method. Typically, it is 1 %, although it can be less than 0.5 %. 
This has proven very effective when manufacturing large ceramic components by 
thixotropic casting. A one-meter-sized ceramic component, such as a large refrac- 
tory or a large floor tile, requires a 5,000 t press for manufacture by conventional 
uniaxial pressing. No wet-forming method other than thixotropic casting can be 
used to make such a large ceramic as the drying shrinkages are in the order of 
100 mm or more. However, a 1-m-sized ceramic component can be manufactured 
crack-free by thixotropic casting, as 0.5 % drying shrinkage corresponds to just 
5 mm. 

Thixotropic casting is a simple process. A highly concentrated powder-liquid 
slurry is prepared, typically in the 70-80 vol.% solids range, and vibrated into a 
mold. The thixotropic casting process involves the following stages: 

• Powder and aqueous deflocculant solution are batched and mixed into a uniform 
stiff doughy consistency. 

• The molds are mounted on a vibrating table. 

• Vibration is switched on and the thixotropic casting slurry is slowly vibrated into 
the mold. 

• Vibration is switched off as soon as the mold is filled, so as to prevent 
segregation. 

• The component is left to air-dry in its mold, followed by oven drying and finally 
densification. 

The high solids loadings, and by association high green density and low drying 
shrinkage, are achieved by three means: 

• Firstly and most importantly, the slurry is so concentrated that it will only flow 
under vibration. A thixotropic fluid has a lower viscosity when subjected to 
shear. This is the principle of thixotropic casting. Therefore, a thixotropic 
casting slurry typically shows no fluid characteristics in the absence of vibration. 
Its consistency depends on the particle size of the slurry and varies from a stiff 
“paste” to a damp “mortar.” 


1 1 Powder Processing of Bulk Components in Manufacturing 


• The use of gap-grading to maximize the efficiency of particle packing. This can 
be achieved either by using a powder of broad particle size distribution or by 
blending coarse and fine powders in optimized ratios. 

• The use of chemical deflocculants at optimal concentrations to maximize 
the thixotropy of the slurry and to maximize the fluidity for a given solids 
loading. 

Thixotropic casting is primarily used in the refractories industry, where it is 
ideal because of the widespread use of “grog” (coarse particulates) in refractory 
mixtures, and the large size of many refractories, which necessitates the minimi- 
zation of drying shrinkage. It has also seen use in boutique applications such 
as specialty bioceramic components, ceramic composites, functionally graded 
materials, and very large ceramic tiles (Ehsani et al. 1995; Ruys et al. 1994; 
Kerdic et al. 1996; Ehsani et al. 1996; Ruys and Sorrell 1992, 1994). Thixotropic 
casting molds can be porous (e.g., plaster or timber) or nonporous (e.g., metal or 
plastic). 


Sintering 

Introduction 


Definition 

Sintering is a heat treatment process involving densifying a powder into a solid 
article, without melting. Densification occurs via mass transfer phenomena, at a 
temperature well below its melting point. The word “sinter” is a German word with 
its origins in the late eighteenth century, derivative of the English word “cinder.” 
Sintering a green compact improves its bulk properties due to the interparticle 
connections generated by mass transfer and solid state diffusion. The bonds 
between the former powder “particles” are identical to the atomic cohesion inside 
the metallic or ceramic particles. 


Sintering Parameters 

A sintering cycle comprises at least three stages: heating of the green compact 
(or loose powder) up to the sintering temperature (T s ), holding at this temperature, 
and cooling to room temperature (Fig. 25). 

Sintering time correlates linearly with diffusion and thus with the degree of 
sintering. Sintering kinetics increases exponentially with temperature. Diffusion 
rate approximately doubles for every 20 °C temperature rise; thus, by raising the 
sintering temperature by 20 °C, one can halve the sintering time, temperature- 
specific reactions notwithstanding. Sintering is also influenced by pressure. Fur- 
thermore, the sintering atmosphere may enhance or inhibit sintering effects. The 
sintering atmosphere may be neutral (vacuum, argon, nitrogen), reducing (hydro- 
gen, dissociated ammonia), or oxidizing (air, oxygen). Most metals require neutral 
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Fig. 25 Sintering cycle 



and ceramics oxidizing. Reducing is used for specific reactive sintering situations. 
The purity of the atmosphere may also influence chemical interactions with the 
sintered material. 

The sintering temperature (Ts) is an important parameter, generating all the 
phenomena inside the green compact during this heat treatment. Usually, Ts is 
about 70-80 % of T m (melting temperature). The threshold density equation relates 
the properties of the green components to those of the sintered material: 

V • p 2 = C\p 2 + C 2 m 2 + C^mp (7) 


where: 

p = threshold density 
m = green compact mass 
V = sintered part volume 

C l9 C 2 , C 3 = constants related to powder intrinsic properties, sintering temperature, 
and soak time 

Heating and cooling rates have a lower impact than T s . They influence the 
amount of transient liquid phase during liquid phase sintering and homogeneity 
of heating. Heating rate should be slower for thicker component cross sections, 
especially for ceramics, which are thermal insulators. Nonuniform temperature 
distribution in an unsintered component creates potentially destructive internal 
thermomechanical stresses. Also, if the component is outgassing, a fast heating 
rate can crack or distort the component at best and cause it to disintegrate at worst. 
In general, high thermal conductivity and/or thin cross section enables a fast heating 
rate (e.g., 1,000 °C/h). Low thermal conductivity and/or thick cross section requires 
a slow heating rate (e.g., 100 °C/h). 

Sintering Mechanism 

The aim of sintering is mainly to improve the mechanical strength of the green 
compact. This occurs by heat-activated interparticle mass transfer via surface and 
bulk transport mechanisms (Figs. 26 and 27). 
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EC: evaporation-condensation 
SD: surface diffusion 
VD: volume diffusion 
D: PP diameter 
d : neck width 

L : distance between the PP centers 


Surface transport mechanism (no shrinkage or densification) 



Bulk transport mechanism (with shrinkage or densification) 


PF : plastic flow 

GB: grain boundary diffusion 

VB: volume diffusion 

L| : initial distance between the PP centers 
L f : final distance between the PP centers 
i: initial position of PP 

f: final position of PP 



Fig. 26 Mass transport mechanism (Redrawn from German (2010), Djohari and Derby (2009), 
and Wang (2006)) 
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Fig. 27 Mass transport mechanism versus sintering temperature 


528 


A. Ruys et al. 


pore 



green compact 




pore 


pore 


sintered parts 


swelling due to . 


tensile stress from forming or non-uniform heating 
entrapped gas in PPs 

H 2 0 evaporation from PPs surface 
Kirkendall effect 
Ostwald ripening 
sintering atmosphere 
contaminated PPs 



shrinkage due to : 


* high quality of the sintering atmosphere 
(i.e. high vacuum, carbon-free) 

* fine PPs additions 



(RT) 


(TS) 


Fig. 28 Associated sintering phenomena (PPs = powder particles) 


Associated Sintering Phenomena 

Due to the diffusion process and specific to a given materials system, other 
phenomena may occur during sintering, for example (Fig. 28): 

• Dimensional changes: shrinkage, swelling 

• Porosity modification from open to closed and/or interconnected 

The key with sintering is to control the density/porosity of the product while 
minimizing the grain size. The coarser the grain size, the worse the mechanical 
properties of the sintered part. 

Liquid Phase Sintering (LPS) 

Regardless of the mutual solubility of the particles of a powder mixture, sintering 
can progress by solid state diffusion or liquid phase sintering. A permanent or 
transient liquid phase can initiate or enhance sintering in systems where it may not 
otherwise readily occur, e.g., SiC with Si liquid phase. The liquid phase can be due 
to a molten powder component or a eutectic phase. The liquid dissolves and 
rearranges the powder particles allowing their interconnectivity by means of con- 
tact flattening (Fig. 29). All these phenomena are related to the liquid-solid contact 
angle q (Fig. 30). 
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Fig. 29 Liquid phase sintering 


Fig. 30 Wettability in LPS 
process 
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Some examples of material systems sintered by liquid phase sintering include: 

• Metallic: Cu/Sn; Fe/Cu 

• Cermets: WC/Co 

• Ceramics: Si 3 N 4 with glassy liquid phase, SiC with Si liquid phase 

Usually, swelling phenomena occur, but this can be diminished by accurate 
control of the sintering parameters. Four types of sintering treatments are well 
recognized for PM material systems (Table 9). 

Sintering Equipment 

Industrial sintering furnaces must fulfill many requirements, including: 

• Accurate temperature control and heating/cooling rates 

• Atmosphere control: composition and/or flow 

• Efficient loading/unloading system 

• Safety systems 
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Table 9 Types of sintering treatments 


Sintering 

types 

Powders type/mutual solubility in solid state 

Sintering phase 

New 

structural 

phase 

synthesis 

Solid 

Liquid 

Yes 

No 

I 

Soluble 
Pure metals 
Solid solutions 
Chemical compounds 

X 



X 

II 

Insoluble 

Mechanical mixture 

X 

— 

— 

X 

III 

Partially soluble 
Mechanical mixture 

X 

— 

X 

— 

IV 

Soluble and/or insoluble 


X 


X 


green 
compact 
feeding 


T[°C] 


hot zone 


preheat zone 



delubrication 


oxide reduction 


cooling zone 


sintering 


heat treatment 


sintered 

parts 



evacuation 



Fig. 31 Sintering cycle in each zone of the furnace (evacuation refers to furnace unloading zone) 


• High productivity 

• Energy and cost efficiency 

The three key systems of a furnace are: 

• Heating system 

• Construction (insulation and external structure) 

• Loading/unloading system 

Figure 31 schematically presents a continuous furnace (tunnel kiln) with the 
thermal cycle superimposed. 
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Fig. 32 Conventional versus advanced sintering treatments 


Table 10 Specific features for metal (M) and ceramic (C) sintering regimes 


Sintering 

treatment 

Metal 

Versus 

Ceramic 

T emperature 

T (M) 

< 

T (C) 

Time 

t (M) 

<< 

kc) 

Atmosphere 

Inert, reduction 

Versus 

Air 

Concomitant 

phenomena 

Shrinkage (M ) 

Versus 

Shrinkage^) 

Swelling 


Applications 

Structural parts for room and 
high-temperature environment 

Versus 

High mechanical and corrosion 
strength to high temperatures 

Technologies 

Pressureless (with LPS) 
Pressure-assisted sintering 


Microstructured Versus Nanostructured Sintered Materials 

When sintering nanoparticles to produce nanostructured materials, advanced tech- 
niques need to be used in order to suppress grain growth, e.g., lowered sintering 
temperatures and shorter sintering times than for conventional sintering of the same 
materials (Fig. 32). The reduced temperature is commonly compensated by pres- 
sure assistance (see section “Pressure- Assisted Sintering”). 

Metal Versus Ceramic Sintering 

The intrinsic characteristics of metals and ceramic materials determine different 
sintering regimes (Table 10). 

Depending on the application, the thermal cycle of a conventional sintering 
cycle (as distinct from hot-pressing or other enhanced sintering methods) may 
present, along its stages, different combinations between the technological param- 
eter values (heating/cooling rate) with specific heat treatments. The following 
examples are presented. 
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Fig. 33 Conventional 
sintering cycle for PIM 
specimen (section “Powder 
Injection Molding”): 
heating with dewaxing stage 
(T cs = conventional sintering 
temperature) 




Fig. 34 SEM homogeneous micro structure of 80Ni-20Cu alloy showing few rounded pores and 
cuboid crystalline grains of a solid solution (Courtesy of Dr. Adriana SORCOI, Technical 
University of Cluj-Napoca, Romania) 


During the Heating Stage 

One holding stage for the green compact dewaxing, usually designed for metals and 
for some ceramic materials (Fig. 33). Homogeneous structure of 80Ni-20Cu alloy 
from water atomized powders conventionally sintered at 1,070 °C for 90 min in 
hydrogen is presented in Fig. 34. 

Multiple temperature holds associated with different heating rates, e.g., for 
reducing the sintering time by decreasing the PIM dewaxing time (Fig. 35). 

During the Sintering Stage 

Presintering and sintering stages in separated thermal cycles, to improve the 
mechanical properties of multicomponent products (e.g., composites with metallic 
binder: Figs. 36, 37, 38, 39). 
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T[°C] 



Fig. 35 Conventional sintering cycle: heating rates and dewaxing stages (T cs =conventional 
sintering temperature) 


Fig. 36 Conventional 
sintering combined cycle for 
ceramic-based products 



T[° 



Fig. 37 Conventional 
sintering combined cycle with 
post-sintering heat treatments 
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Fig. 38 SEM 

micro structural aspects of 
NiTiCu processed by 
conventional sintering, 
followed by low heat 
treatment and sizing. The 
homogeneous and dense 
structure is made of 
allotriomorphic crystalline 
grains, of high purity, with 
fine precipitates at the grains 
boundary and inside (courtesy 
of Dr. Mariana LUCACI, 
ICPE-CA, Bucharest, 
Romania) 



stages 

A new research in PM steel technology has developed a cost-saving route - a 
single-stage sintering process: solid state carburizing + gas sintering of the (Fe + 
graphite) green compacts (Fig. 40). The obtained subsurface micro structure pre- 
sents a homogeneous distribution of the pearlite (gray islands) in the white ferrite 
matrix, with grain size and rounded pores (Fig. 41c). The toughness of these PM 
steels is better than that of similar PM steels processed by gas sintering + solid 
carburizing (Figs. 39 and 41b) and only by gas sintering (Figs. 33 and 41a). 

During the Cooling Stage 

Sintering cycles can involve dual cooling rates, with or without external loading 
(see section “Hot-Pressing (HP)” or section “Hot Isostatic Pressing (HIP)”), 
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Fig. 40 Conventional sintering combined cycle with thermochemical treatment in one single 
stage 
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Fig. 41 Micro structural aspects (optical microscopy) of PM low-carbon steel micro structures 
processed by (a) gas sintering; (b) gas sintering + solid state carburizing in two stages; (c) solid 
state carburizing + gas sintering in one single stage (Courtesy of Prof. Mihail MANGRA, Ph. D., 
University of Craiova, Romania) 
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Fig. 42 Conventional 
sintering cycle with various 
cooling rates 
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Fig. 43 SHS and conventional sintering (CS) thermal cycles 


for example, to minimize the thermal residual stress generated during sintering of 
metallic matrix composites reinforced by ceramics (Fig. 42). 

Self-Propagating High-Temperature Synthesis (SHS) 

Also known as solid flame or solid flame combustion, SHS was patented in 1967 
(Merzhanov and Borovinskaya 1972) as a method to produce mainly ceramics and 
intermet allies by exothermic reactions developed in much shorter times (<150 s) 
and higher temperatures (2,000-3,500 °C) than for conventional sintering (Fig. 43). 

SHS can be used to sinter conventional materials and functionally 
graded materials (see section “Hydrostatic Shock Forming”), with porous 
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structure formation 



micro structures (grain size from 1 to 5 mm down to nano structured or amorphous). 
The main applications are refractory products (for high strength and corrosion 
resistance at high temperatures) and potentially specialty porous biomaterials. 
The advantages of SHS include: 

• Simple process 

• Low energy cost 

• High-quality products (i.e., chemical purity) 

• Process versatility: simultaneous formation and densification of the products 

• A large range of achievable micro structures as well as material type (solid 
solutions, composites, metastable phases) 

The basic principle of SHS consists of the development of an exothermic 
reaction (e.g., redox, oxidation, thermal decomposition) in a loose or green 
compact powder (A+B powder mixture), initiated by an external heat flux 
(Fig. 44, initial stage). The ignition temperature (600-900 °C) initiates the 
combustion reaction, which instantaneously increases the temperature to T SHS = 
2,000-3,500 °C (Fig. 43) for few seconds, enabling self-propagating synthesis of 
the entire part (Fig. 20, transient stage), resulting in the sintered material (Fig. 44, 
final stage). 

The SHS process may be applied to bulk products as well as to powder particles, 
thin films, liquids, and gases. The method can be conducted in a vacuum, open air, 
or an inert/reactive gas. The initial powder refinement is a critical parameter with 
high relevance for the ignition and propagation of the synthesis process. The finer 
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Fig. 45 Extru sion- acti v ated 
SHS (redrawn from www. 
dominique.vrel.free.fr) 
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Fig. 46 Microwave classification 


the particles, the higher the specific surface area (SSA), and consequently the 
reactivity is much increased. 

Recent SHS research has developed new SHS routes regarding the ignition 
method. For example, extrusion-activated SHS (EA-SHS) is based on severe 
extrusion of powder mixtures that takes place at relatively low pressure (P < 500 
MPa), high temperatures, or even room temperature (in the case of powders with 
increased reactivity by different methods (www.dominique.vrel.free.fr, Fig. 45). 

The advantage of this method is the capability to process high-textured materials 
at low temperatures, even approaching room temperature. 

Microwave Sintering (MWS) 

Microwaves lie between radio waves and infrared in the electromagnetic spectrum. 
Microwaves are better known for their use in heating food. However, microwave 
sintering of powder-formed materials is a well-developed technology that typically 
involves 2.45 GHz waves (wavelength =122 mm, Fig. 46). 
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Fig. 47 MWS versus conventional sintering thermal cycles 
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Fig. 48 MWS mechanism 


Microwave interactions with materials result in almost instantaneous transfor- 
mation of the electromagnetic field energy into thermal energy, enabling very fast 
and uniform heating, not constrained by thermal conductivity, which is most 
advantageous for ceramics with their low thermal conductivity, enabling lower 
temperatures and shorter holding times than for conventional sintering (Fig. 47). 

Ceramic and metallic materials (metals require ceramic insulators/susceptors) 
may be processed by conventional/non-plasma MWS (Agrawal 1998; Roy 
et al. 1999), the microwaves being differently adsorbed (Fig. 48), depending on 
the material as well as the volume of the processed parts. 

MWS is also advantageous for cermets made by powder forming. In this case, 
the ceramic component (as matrix or reinforcement) functions as susceptor, i.e., 
transfers the microwave energy into heat, thereby heating the metallic component 
and improving the densification process during sintering. For example, finer and 
denser microstructures are obtained for barium ferrites type M processed by MWS 
than by classic sintering (Fig. 49). 
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Fig. 49 Micro structural aspects (optical microscopy, XI 00) on Al/SiC composites processed by 
(a) conventional sintering at 650-700 °C/( 3-6 h), reaching density of 2.4-2. 7 g/cm 3 ; (b) MWS at 
300-550 °C/( 15-20 min), reaching density of 2. 0-2. 6 g/cm 3 . MWS utilized lab microwave 
furnace, 2.45 GHz, anode voltage 180-240 V, anodic current intensity 200-240 mA, inert 
atmosphere 

MWS produces volumetric heating and provides the following advantages over 
conventional sintering: 

• Finer grain sizes 

• Increased shrinkage rates 

• Rounded pores 

• High ductility and toughness 

• High productivity and reproducibility 

• Low production costs 

• Simple equipment maintenance 

MWS can be further enhanced by plasma-assisted MWS and by two-directional 
rapid MWS. Also, mono-mode and multimode MWS chambers can be used related 
to the material to be processed. The general description of MWS equipment is 
shown in Fig. 50. 

Two-Step Sintering (TSS) 

TSS is a pressureless sintering process for a green compact/mold in two stages 

(Fig. 51): 

• Stage 1, heating to a peak temperature (TO to reach an intermediate density 

• Stage 2, by rapid cooling to T 2 < T 1? for final densification without grain growth 

The purpose of TSS is to obtain highly dense ceramics (>98 % p t ) with nano 
grain size (25-300 nm) even after a long sintering cycle (Chen and Wang 2000). 
This technology is suitable for Y 2 O 3 , BaTi0 3 , Ni-Cu-Zn ferrites, ZnO, Zr0 2 , 
A1 2 0 3 , and SiC. It combines outstanding ceramic microstructures with low produc- 
tion costs, mostly requiring only a conventional furnace. The TSS mechanism 
comprises two main phenomena corresponding to each step. Step 1 is governed 
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Fig. 51 TSS versus conventional sintering (CS) thermal cycles 
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Fig. 52 TSS - 1st step mechanism 
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Fig. 53 TSS - 2nd step mechanism 


by the initiation of the diffusion process between the particles to reach a min. 
density of 75 % p t . The pores must be smaller than the grain size for Step 
2 densification (Fig. 52). 

Step 2 is controlled through the “kinetic window” allowing achievement of fully 
dense nano structured materials without grain growth by a very close control of the 
ratio between grain boundary diffusion and migration (Fig. 53). 

The TSS method is attractive since it provides nanostructured ceramics but 
requires only conventional furnaces. High accuracy with temperature and time 
control are essential to enable processing using step 2 below the kinetic window 

(Fig. 54). 
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Fig. 54 BSE-SEM of hydroxyapatite-based biocomposites reinforced by titanium, processed by 
TSS 617=800-1, 100 °C, T 2 = 700-1,000 °C; porosity >20 %). The sintering necks between 
ceramic/metallic and ceramic/ceramic particles create a porous high-wear-resistant scaffold for 
trabecular bone tissue engineering applications (courtesy of Assoc. Prof. Ileana PASCU and Prof. 
Oana GINGU, University of Craiova, Romania) 

Pressure-Assisted Sintering 

Pressure can enhance densification, enabling two main possibilities: 

• Densifying a composite which cannot densify by conventional pressureless 
sintering, e.g., fiber-reinforced ceramics 

• Producing full densification without grain growth, by hot-pressing at a lower 
temperature than for conventional pressureless sintering 

There are two main types of pressure-assisted sintering: hot-pressing (HP) and 
hot isostatic pressing (HIP). 

Hot-Pressing (HP) 

HP combines, in one single step, compaction and sintering (Fig. 55), enabling 
almost fully dense and fine micro structured ceramic-based material synthesis, 
which is especially useful for electronic applications. 

The HP densification mechanism consists of particle rearrangement and plastic 
flow at the particle contacts that develop in the solid/transient liquid phase (Fig. 56), 
by direct/in situ reactions between particles, through a normal/rapid regime 
(Fig. 57). Moreover, homogeneous or textured structures can be obtained, 
depending on the application demands. Thus, the key advantages of HP are: 

• High density 

• Small overall grain size 

• Textured structures 
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Fig. 55 HP versus conventional sintering (CS) thermal and compaction cycles 
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Fig. 57 HP routes: (a) rapid hot-pressing (v 2 >Vi, t 2 <t x ); (b) in situ hot-pressing 


• Enhanced mechanical properties 

• Some ceramic composites cannot be densified without pressure, e.g., fiber- 
reinforced ceramics 


Special adapted furnaces for inductive indirect heating or resistive direct heating 
are used for the HP process, with air/vacuum/argon atmospheres (Fig. 58). HP dies 
are usually graphite, which limits the HP pressures to typically 20 MPa and a 
maximum of about 40 MPa. 


Hot Isostatic Pressing (HIP) 

Originally known as gas pressure bonding in 1955, HIP is to HP what CIP (see 
section “Cold Isostatic Pressing (CIP)”) is to die pressing (see section “Die Press- 
ing”). HIP involves sintering at high temperature in a pressurized gas, either 
through sealed adiabatic heating of a gas-filled vessel or through the use of a 
powerful external gas pump. It enables more uniform distribution of the compaction 
pressure compared to HP (Fig. 59). Moreover, higher pressures are possible in HIP 
than HP since the pressurized gas used for HIP can easily be pumped to pressures up 
to 200 MPa, whereas HIP is rarely above 40 MPa and usually below 20 MPa. HIP 
benefits include pores closing up to near full densification of ceramics which will 
not densify by any other means (<95 %p t ) and equiaxed small grain size (1-5 mm). 
Mechanical properties are enhanced at room and high temperatures as well as 
improvement of surface finish and machinability by eliminating the subsurface 
porosity. Although HIP equipment is quite expensive, HIP is more amenable to 
mass production than HP. A large number of ceramic powder-formed components 
can be placed in a large HIP for a single HIP run, whereas HP is generally one 
component per cycle, with a cycle time of many hours. Thus, while HP is primarily 
a research technique, HIP can be both for research and mass production. 
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Fig. 58 HP equipment 
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Fig. 59 HIP technology (schematic) 


HIP is usually used for mass production densification of advanced ceramics 
when mechanical integrity is essential, e.g., millions of alumina prosthetic hip joint 
components undergo HIP every year. The benefits are seen in the fact that the 
failure rate of conventionally sintered alumina prosthetic hip joint components was 
up to 10 %, whereas for components that underwent HIP, it is about 0.004 %. Also, 
large PM metal components (2-3 m diameter and 4-5 m as height) can be densified, 
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Fig. 61 Pseudo-HIP methods (PPS=powder particles) 


e.g., from stainless steels and other high-performance alloys for the offshore and 
marine sectors. HIP is also used as posttreatment for metal castings to eliminate 
casting defects and porosity. HIP temperatures/pressures are much lower for metals 
than ceramics (Fig. 60). 

In order to avoid the huge capital outlay of purchasing a HIP, the pseudo-HIP 
route can be used. This is more of a research technique as it has the single 
component per cycle constraint of HP. Pseudo-HIP utilizes a solid environment 
instead of a pressurized gas. Thus, the heating operation is developed in an easily 
deformable and thermally stable powder material in two ways: (a) the degassed 
green compact of particles, poured into a steel can, is heated up at T [°C] and then 
the compaction pressure acts on the can by the means of a hydraulic press in low 
thermal conductivity powder, e.g., talcum (Fig. 61a and b) the semi-product is 
heated up by a coil and, simultaneously, the compaction pressure acts on the 
product emerged in a solid environment (Fig. 61b). 
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Fig. 62 SPS versus conventional sintering cycles 


Spark Plasma Sintering (SPS) 

SPS, also known as PECS (pulsed electric current sintering) or FAST (field-assisted 
sintering technology), is based on electrical spark discharge utilizing on-off DC 
pulse energizing (Tokita 1993). This discharge generates high localized tempera- 
tures (xlO 4 °C) between the particles, resulting in greatly enhanced sintering 
compared to conventional sintering (Fig. 62). 

SPS is recommended for fine ceramics, functional graded materials (FGMs), and 
electronic/nanophase/hard alloy s/di amond tool materials, characterized by high 
density (>99 %), controlled grain size, high dimensional, and structural accuracy. 
The SPS mechanism is based on the repeated application of the DC pulse voltage and 
current between the powder particles, the spark discharge point and joule heating 
point generate mass transport phenomena for sintering neck formation (Fig. 63). 

SPS equipment simultaneously allows the compaction by HP of a powder 
mixture and sintering in vacuum (Fig. 64). 

Recent research demonstrates SPS effectiveness with high-performance mag- 
nets with excellent densification (Fig. 65) and high- wear composites (Sima 
et al. 2011; Gingu et al. 2011) with excellent nanostructures (Fig. 66). 

Sinter Forging (SF) 

SF is a low-cost sintering method carried out under axial pressure without lateral 
confinement of the product. Thus, near-fully dense parts (90-93 % of theoretical 
density) processed by SF present an anisotropic shrinkage, textured micro structure 
(Fig. 67). Benefits include higher E and UTS and better fatigue behavior than for 
extruded products. 

The SF-textured micro structure is critical for applications such as: 

• Metallic alloys for connecting rods or synchronizer rings/cam stators as high 
strength parts used in transmissions for vehicles which transfer engine output to 
shaft 
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SPS mechanism 



• mass transport phenomena 
for sintering neck formation 

Fig. 63 SPS mechanism 


• Pure metals (Bi) for current leads 

• Ceramics (SiC, Zr0 2 ) or ceramic-based nanocomposites for electronic and 
advanced optical products 

The advantages of SF versus other similar methods (HP+extrusion; HP+forging) 
concern the micro structural features: minimized risk of grain cracking/growth, 
closed pores, and low porosity. The SF mechanism is based on stress-assisted 
diffusion during the sintering stage: dislocation movement for metallic grains and 
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Fig. 64 SPS equipment 




Fig. 65 BSE-SEM micrographs of Nd 16 Fe 78 B 6 +5 wt% Fe SPS magnets consolidated as disks 
with F= 10 mm and t = 2 mm at: (a) 50 MPa and (b) 80 MPa. T sintering = 630 °C; t sinter i ng = 
300 s. One can notice the difference in the morphologies of the two types of magnets and the fact 
that higher pressures are melting the powder boundaries and the resulting material is similar to a 
bulk material. Both magnets prepared at 50 and 80 MPa, respectively, are mainly consisting of 
primary Nd 2 Fe 14 B hard magnetic phase and secondary aFe soft magnetic phase (Courtesy of 
Dr. Nicoleta FUPU from the National Institute of Research and Development for Technical 
Physics, Iasi, Romania. First published in (Fupu et al. 2011)) 
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Fig. 66 BSE-SEM of 
hydroxyapatite-based 
biocomposites reinforced by 
titanium processed by SPS at 
1,100 °C for 20 min. Low 
porous (<10 %) structure, 
with high- wear rate of 1.5 x 

A O 

10 mm /Nm, for cortical 
bone tissue engineering 
(Courtesy of Prof. Oana 
GINGU and Assoc. Prof. 
Ileana PASCU, University of 
Craiova, Romania) 



Fig. 67 SF and conventional sintering cycles 



grain boundary sliding for ceramics. These mechanisms are responsible for the 
intercrystalline pores closing. As for the large inter-agglomerate pores, the plastic 
deformation of the product enables their closure (Fig. 68). 

The SF working regime includes: 

• Accurate control of pressure (40-120 MPa), temperature (T SF < 0,5 T m , where 
T m represents the melting point of the metallic component), and loading rate 
(5-100 N/s) for the material to be processed. 

• Processing in solid or liquid state. Liquid state is adequate for those material 
systems requiring some additives (e.g., A1 2 0 3 , Y 2 0 3 , CaO) that become a 
transient liquid phase for ceramic nanocomposites (e.g., SiC). 

SF equipment is schematically shown in Fig. 69. 
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Fig. 68 SF mechanism (PPs means powder particles) 


Recent SF research is focused on improving the quality and efficiency of the 
method, especially for ceramic materials, and developing new routes, for example, 
microwave SF. Flash sinter forging (F-SF) is a new sintering method that combines 
spark plasma sintering (SPS) with conventional SF (Francis and Raj 2012). The 
electric field, applied through the metal electrodes layered between the punches and 
the sample, generates a sudden large amount of defects inside the particles, improv- 
ing the diffusion kinetics. The F-SF advantages include: 

• Temperature decrease (5-15 %) 

• Pressure increase (5-10 times) for high densification without grain growth 
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Fig. 70 MW-SF equipment (Redrawn from Tyagi (2010)) 


The combination of microwave sintering and SF (MW-SF) provides a significant 
improvement in ceramic component density (up to 97 % of theoretical density) 
while the grain size is similar to that obtained by conventional SF (Tyagi 2010). The 
MW-SF mechanism, under external load conditions (Fig. 70), is enhanced by the 
almost instantaneous transformation of the electromagnetic field energy into heat. 
The thermal effect is more intense due to the ceramic susceptors and “weight 
punch” surrounding the ceramic sample. 


Future Trends 

Additive Manufacturing (AM) 

Additive manufacturing (AM) is defined by ASTM as the “process of joining 
materials to make objects directly from 3D model data, usually layer upon layer, 
as opposed to subtractive manufacturing methodologies, such as traditional 
machining” (ASTM Standard F2792 - 12a 1994). Additive manufacturing technol- 
ogies can be used anywhere throughout the product life cycle from preproduction 
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(i.e., rapid prototyping) to full scale production (also known as rapid manufactur- 
ing) and even for tooling applications or postproduction customization. Since its 
emergence 25 years ago, additive manufacturing has found applications in indus- 
tries ranging from architecture, aerospace, automation, defense, precision engineer- 
ing, oil and gas, dentistry, orthopaedics, and consumer products. Across all 
industries, additive manufacturing accounted for $1.3B in worldwide sales of 
materials, equipment, and services in 2010 and is poised to exceed $3B by 2016 
(Scott et al. 2012). 

The fastest growing application for AM parts is end use products or functional 
part production. Unlike rapid prototyping, AM is used as a step in the design or 
production process. In direct part production, AM creates a final product for sale or 
use. This application category has grown from 4 % of total AM revenues in 2003 to 
nearly 20 % in 2010 (Scott et al. 2012). This rise in direct part production has 
happened by increasing material quality from AM processes, reducing cost, and 
growing awareness of the potential of additive processes (Gu et al. 2012). 

From the processing point of view, all AM technologies can be divided into three 
categories: powder-based, liquid-based, and solid-based rapid manufacturing (Chua 
et al. 2003). However, using powders as starting materials for making 3D functional 
objects is the most common in the AM industry. Since the scope of this section is on 
powder processing of materials, some specific manifestations of powder AM will 
be examined in more detail. 

Powder Bed Additive Manufacturing 

Powder bed additive manufacturing is the most common and successful technol- 
ogy for manufacturing of metallic, ceramic, and polymeric components. In this 
technique, the powder is spread by a counter-rotating roller onto a build platform 
inside a build box. By means of a binding technology which can be laser, electron 
beam, or even a binder dispenser the cross section of the object is selectively fused 
or bonded on the powder bed. Subsequently, the built platform advances down- 
ward by one layer thickness, and a new layer of powder is spread and the process is 
continued layer by layer until full object is complete. A schematic of the process is 
shown in Fig. 71. One of the advantages of powder bed 3D printing is the 
supportive function of the powder bed for the complex object. Therefore, the 
need for support material is greatly reduced and the achievable complexity of 
the parts is improved. 

Common technologies based on powder bed additive manufacturing include 
inkjet 3D printing, selective laser melting, selective laser sintering, direct metal 
laser sintering, and electron beam melting. 

Powder Jet Additive Manufacturing 

As schematically shown in Fig. 72, in powder jet additive manufacturing, a high- 
powered laser is used to melt metal powder supplied coaxially to the focus of the 
laser beam through a deposition head. The laser beam typically travels through the 
center of the head and is focused to a small spot by one or more lenses. The head 
itself, or the table, will be moving to make one layer from the cross-sectional data of 
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Fig. 71 A schematic of powder bed additive manufacturing technologies using different binding 
phenomena 


the object. Metal powders are delivered and distributed around the circumference of 
the head either by gravity or by using a pressurized carrier gas. An inert gas is often 
used to shield the melt pool from atmospheric oxygen for better control of proper- 
ties and to promote layer-to-layer adhesion by providing better surface wetting. 
Laser engineered net shaping or laser aided additive manufacturing are some 
common names for this technology. 

The advantage of this method is the ability to make functionally graded materials 
through changing composition or rate of powder feeding via the process. Since no 
support material is used, this process is very limited in the complexity of the object. 
However, this method has been very successful for repair and remanufacturing 
purposes. 

Colloidal Additive Manufacturing 

Other powder-based additive manufacturing can be categorized under colloidal 
methods, in which ceramic or metallic powder are incorporated into a polymer or 
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Fig. 72 A schematic of 
powder jet additive 
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solvent and ink jetted or extruded through a nozzle and immobilized using 
polymerization or solidification mechanisms. Depending on the material and 
use, the solvent or carrier might need to be burnt off or removed afterwards, and 
parts must be fired for densification. Since particle friction is greatly reduced in 
liquid form solutions, the packing and density of the achievable parts can be 
enhanced and controlled. Methods such as 3D stereolithography, robocasting, 
and paste dispensing apparatus are some of the common methods for making 3D 
objects based on colloidal paste or slurries. Most of this technology is in the 
research stage. 


Functionally Graded Materials by Powder Processing 

Functionally gradient materials (FGMs) are a category of composite materials 
characterized by a compositionally graded interface between two component 
phases, most commonly ceramic/metal, although ceramic/ceramic, metal/metal, 
and metal/polymer FGMs are also fabricated. The concept originated in Japan in 
1984 as a hypothetical spaceplane skin for Mach 10 + with the capacity to withstand 
a surface temperature of 2,000 °C and a temperature gradient of 1,000 °C across a 
cross section of 10 mm or less, an application for which FGMs are the only practical 
solution (Niino 1990). While the thermal barrier role of FGMs is predominantly a 
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high-technology application, improvements in ceramic-metal and ceramic-ceramic 
bonding through the FGM concept have applications, from nuclear fusion reactors to 
biomaterials to wear-resistant tiles. This is because FGMs are an ideal solution to the 
problem of thermomechanical property mismatch at the interface in a ceramic/metal 
bond and also in a ceramic/ceramic bond. The main use of FGM films is in bonding 
dissimilar materials. They are also useful for minimizing thermomechanical 
mismatch for coatings, for example, a hydroxyapatite-coated alumina femoral 
component for a knee prosthesis. For hypersonic spaceplanes, an FGM skin is the 
only viable thermal barrier material. 

Most of the published FGM research has involved modelling the properties of 
hypothetical FGMs, not fabrication techniques. Reported fabrication methods can 
be split into two main categories: thin film/interfacial FGMs (gradient thickness in 
the micron range) and bulk FGMs (gradient thickness millimeters to centimeter 
thick). Bulk FGMs can be further split into two broad categories, namely, stepwise/ 
layered bulk FGMs and continuous bulk FGMs. FGM thin films are primarily made 
by plasma spraying or CVD (Kerdic et al. 1996). Reported fabrication methods for 
layered bulk FGMs include laser cladding, slipcasting, sedimentation, slurry dip- 
ping, centrifugal casting, co-sedimentation, and electrophoretic deposition (Kerdic 
et al. 1996; Chavara and Ruys 2007; Chavara et al. 2009a). 

Continuous bulk FGMs are the most advanced manifestation of FGMs. They can 
be made by reconstructive methods or constructive methods. Reported reconstruc- 
tive methods include sedimentation forming, centrifugal forming, slipcasting, and 
thixotropic casting. “Reconstructive methods” means that gradient formation is 
achieved by gravitational segregation of an originally homogenous two-component 
powder blend as a slurry. It is almost impossible to achieve a linear gradient via a 
reconstructive method, usually it is irregular, or sigoidal at best. The only reported 
constructive method for continuous bulk FGMs (constructively engineering the 
gradient) is impellor dry blending (IDB) (Chavara and Ruys 2007; Chavara 
et al. 2009a). IDB involves the following stages: 

• Preparing ceramic powder feedstock, comprising submicron powders granulated 
and sieved for appropriate flow rate. 

• Sourcing metal spheroidal powder feedstock for appropriate flow rate. 

• Flow rate optimization studies of the feedstock and, based on these, program- 
ming the computer-controlled feedgate to produce a linear gradient for the 
chosen feedstock pair. 

• Powders are loaded into their respective two feed hoppers of the computer- 
controlled feedgate. 

• Hoppers are vibrated as the computer-controlled feedgate switches on powder 
flow and feeds the powders into the blending impellor from initially 100 % metal 
to ultimately 100 % ceramic. 

• The powder blend exits the blending impellor and, guided by the exit port, settles 
like snowflakes onto the cylindrical mold with its rotating screed. 

• FGM powder blend is pressed and sintered, usually by hot-pressing. The 
submicron ceramic powders enable optimal sintering. 
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Hydrostatic Shock Forming 

High-rate forming is a long-established method for forming metals (Rinehart and 
Pearson 1963). Commonly, it involves placing metal sheet or plate against a mold, 
immersed in a liquid medium. An explosive charge is detonated in the medium, and 
the shock waves passing through the liquid press the sheet metal against the mold 
with high intensity, thereby producing a perfect shape replica in microseconds. It is 
not a widely used method as it is costly, hazardous, unsuitable for mass production, 
and therefore rarely commercially justifiable. Its main niche application is for 
one-off situations requiring high forming pressures, with no high-capacity press 
available, or press-tooling unviable. 

In the case of ceramic powders, however, there are powders which cannot be 
densified by conventional means. An even more extreme case is a ceramic-metal 
functionally graded material for which the melting point of the ceramic powder is 
1,000 °C or more higher than the metal powder. For example: 

• Aluminum (660 °C)-alumina (2,050 °C) 

• Copper (1,080 °C)-magnesia (2,800 °C) 

• Stainless steel (1,400 °C)-silicon carbide (2,700 °C) 

Densification by combustion synthesis was proposed as one possible solution to 
this problem (Ruys et al. 2001). However, this limits FGM densification to the rare 
situation in which both the metal and ceramic components are combustible, which 
is not the case for most commercially useful metal-ceramic and ceramic-ceramic 
combinations. Spark plasma sintering (SPS) is also an option, but even SPS cannot 
succeed with a 1,000 °C sintering temperature difference of metal and ceramic. 
Hot-pressing enables sintering to take place a few hundred degrees below that 
required for pressureless sintering. However, with the melting point differentials of 
typical metal-ceramic combinations commonly in excess of 500 °C, this is often 
not viable either. 

Hydrostatic shock forming (HSF) involves the use of a focussed explosive 
charge which enables localized pressures in excess of 30 GPa, in the order of a 
thousand times larger than the pressures for hot-pressing. Moreover, 30 GPa is 
higher than the microhardness of all ceramics (typically 5-15 GPa) except diamond 
and its polymorphs. Indeed, at pressures in the 30 GPa magnitude range, ceramics 
can flow hydrodynamic ally. In addition, there is the instantaneous adiabatic heating 
from the detonation, which further enhances the potential for densification of 
metal-ceramic or ceramic-ceramic FGMs with a large difference in melting points 
between the two components. 

The velocity of a detonating high-explosive charge can be anywhere from 3 to 
9 km/s and is typically in the order of 6 km/s, and this immense velocity generates 
an intense pressure wave. A focussed explosive charge transmits a shockwave 
through a conical water column into the FGM powder blend. Densification of 
metal-ceramic FGMs by hydrostatic shock forming (HSF) has been demonstrated 
for Cu-SiC and stainless steel-SiC FGMs (Ruys et al. 2001). Metal-ceramic 
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powder blends were prepared as pellets using impeller-dry-blending (Chavara and 
Ruys 2007; Chavara et al. 2009b) and placed into pre-machined mild steel dies with 
a water-filled conical shock-focussing chamber linking the explosive charge with 
the powder preform. The ceramic powder side of the FGM was on the side in 
contact with the conical shock-focussing chamber since the ceramic powder needed 
the shock wave much more than the metal powder. 

It is unlikely that HSF will see widespread adoption. However, for specialty 
applications, such as metal-ceramic FGMs for hypersonic spaceplane skins, it is 
sometimes the only option and therefore is of significance for the future of powder 
forming. 


Summary 

The powder-forming process contains four basic steps: 

• Powder preparation 

• Powder mixing, blending, and/or slurry preparation (a powder-liquid suspension 
is known as a slurry) 

• Forming 

• Sintering 

This chapter has provided a comprehensive overview of each of these steps with 
an equal focus on powder preparation, powder characterization, powder forming, 
and sintering. 


Annex 1. ISO Standards with Relevance for Particle 
Characterization 

Annex 1.1. ISO Technical Committee 206 (TC206) Fine Ceramics 
(selection) 


References 

Title 

ISO 14703:2008 

Fine ceramics (advanced ceramics, advanced technical ceramics) - Sample 
preparation for the determination of particle size distribution of ceramic 
powders 

ISO 18753:2004 

Fine ceramics (advanced ceramics, advanced technical ceramics) - 
Determination of absolute density of ceramic powders by pycnometer 

ISO 18754:2003 

Fine ceramics (advanced ceramics, advanced technical ceramics) - 
Determination of density and apparent porosity 

ISO 18757:2003 

Fine ceramics (advanced ceramics, advanced technical ceramics) - 
Determination of specific surface area of ceramic powders by gas adsorption 
using the BET method 


(< continued ) 
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References 

Title 

ISO 

23145-1:2007 

Fine ceramics (advanced ceramics, advanced technical ceramics) - 
Determination of bulk density of ceramic powders - Part 1: Tap density 

ISO 24235:2007 

Fine ceramics (advanced ceramics, advanced technical ceramics) - 
Determination of particle size distribution of ceramic powders by laser 
diffraction method 


Annex 1.2. ISO Technical Committee 229 (TC229) Nanotechnologies 
(selection) 


References 

Title 

ISO 9276-1:1998 

Representation of results of particle size analysis - Part 1 : Graphical 
representation 

ISO 9276-2:2001 

Representation of results of particle size analysis - Part 2: Calculation of 
average particle sizes/diameters and moments from particle size 
distributions 

ISO 9276-3:2008 

Representation of results of particle size analysis - Part 3: Adjustment of 
an experimental curve to a reference model 

ISO 9276-6:2008 

Representation of results of particle size analysis - Part 6: Descriptive 
and quantitative representation of particle shape and morphology 

ISO 9277:1995 

Determination of the specific surface area of solids by gas adsorption 
using the BET method 

ISO 13317-1:2001 

Determination of particle size distribution by gravitational liquid 
sedimentation methods - Part 1 : General principles and guidelines 

ISO 13317-2:2001 

Determination of particle size distribution by gravitational liquid 
sedimentation methods - Part 2: Fixed pipette method 

ISO 13317-3:2001 

Determination of particle size distribution by gravitational liquid 
sedimentation methods - Part 3 : X-ray gravitational technique 

ISO 13318-1:2001 

Determination of particle size distribution by centrifugal liquid 
sedimentation methods - Part 1 : General principles and guidelines 

ISO 13318-2:2007 

Determination of particle size distribution by centrifugal liquid 
sedimentation methods - Part 2: Photocentrifuge method 

ISO 13318-3:2004 

Determination of particle size distribution by centrifugal liquid 
sedimentation methods - Part 3 : Centrifugal X-ray method 

ISO 13319:2007 

Determination of particle size distributions - Electrical sensing zone 
method 

ISO 13320:2009 

Particle size analysis - Laser diffraction methods 

ISO 13321:1996 

Particle size analysis - Photon correlation spectroscopy 

ISO 13322-1:2004 

Particle size analysis - Image analysis methods - Part 1: Static image 
analysis methods 

ISO 13322-2:2006 

Particle size analysis - Image analysis methods - Part 2: Dynamic image 
analysis methods 

ISO/TS 13762:2001 

Particle size analysis - Small-angle X-ray scattering method 

ISO 14488:2007 

Particulate materials - Sampling and sample splitting for the 
determination of particulate properties 
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References 

Title 

ISO 14887:2000 

Sample preparation - Dispersing procedures for powders in liquids 

ISO 15900:2009 

Determination of particle size distribution - Differential electrical 
mobility analysis for aerosol particles 

ISO 15901-1:2005 

Pore size distribution and porosity of solid materials by mercury 
porosimetry and gas adsorption - Part 1 : Mercury porosimetry 

ISO 15901-2:2006 

Pore size distribution and porosity of solid materials by mercury 
porosimetry and gas adsorption - Part 2: Analysis of mesopores and 
macropores by gas adsorption 

ISO 15901-3:2007 

Pore size distribution and porosity of solid materials by mercury 
porosimetry and gas adsorption - Part 3: Analysis of micropores by gas 
adsorption 

ISO 20998-1:2006 

Measurement and characterization of particles by acoustic methods - 
Part 1: Concepts and procedures in ultrasonic attenuation spectroscopy 

ISO 21501-1:2009 

Determination of particle size distribution - Single particle light 
interaction methods - Part 1: Light scattering aerosol spectrometer 

ISO 21501-2:2007 

Determination of particle size distribution - Single particle light 
interaction methods - Part 2: Light scattering liquid-borne particle 
counter 

ISO 21501-3:2007 

Determination of particle size distribution - Single particle light 
interaction methods - Part 3 : Light extinction liquid-bome particle 
counter 

ISO 21501-4:2007 

Determination of particle size distribution - Single particle light 
interaction methods - Part 4: Light scattering airborne particle counter 
for clean spaces 

ISO 22412:2008 

Particle size analysis - Dynamic light scattering (DLS) 


Annex 2. Standards for Apparent Density Determination 


Type 

Number 

Title 

ISO 

ISO 3923-1:2008 

Metallic powders. Determination of apparent density. Part 1: 
Funnel method 


ISO 3923-2:1981 

Metallic powders. Determination of apparent density. Part 2: 
Scott volumeter method 


ISO 18459-1:2009 

Metallic powders. Determination of apparent density and flow 
rate at elevated temperature. Part 1: determination of apparent 
density at elevated temperature 

MPIF 

MPIF Standard 
04 2007 Edition 

Method for determination of apparent density of free-flowing 
metal powders using the Hall apparatus 


MPIF Standard 
28 2007 Edition 

Method for determination of apparent density of non-free- 
flowing metal powders using the Carney apparatus 


MPIF Standard 
[48 2007 Edition 

Method for determination of apparent density of metal 
powders using the Arnold meter 
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Type 

Number 

Title 

ASTM 

ASTM B703-10 

Standard test method for apparent density of metal powders 
and related compounds using the Arnold meter 


ASTM B2 12-09 

Standard test method for apparent density of free-flowing 
metal powders using the Hall flowmeter funnel 


ASTM B329-06 

Standard test method for apparent density of metal powders 
and compounds using the Scott volumeter 


ASTM B417-11 

Standard test method for apparent density of non free-flowing 
metal powders using the Camey funnel 


Annex 3. Standards for Tap Density Determination 


Type 

Number 

Title 

ISO 

ISO 3953:2011 

Metallic powders. Determination of tap density 

MPIF 

MPFI Standard 
46 2007 Edition 

Method for determination of tap density of metal powders 

ASTM 

ASTM B527-06 

Standard test method for determination of tap density of 
metallic powders and compounds 


Annex 4. Standards for Flow Rate Determination 


Type 

Number 

Title 

ISO 

ISO 4490:2008 

Determination of flow rate by means of a calibrated funnel 
(hall flow meter) 


ISO 18549-2:2009 

Metallic powders. Determination of apparent density and flow 
rate at elevated temperatures. Part 2: Determination of flow 
rate at elevated temperature 

MPIF 

MPIF Standard 
03 2007 Edition 

Method for determination of flow rate of free-flowing metal 
powders using the Hall apparatus 

ASTM 

ASTM B964-09 

Standard test methods for flow rate of metal powders using the 
Carney funnel 


Annex 5. Standards for Powder Compressibility Determination 


Type 

Number 

Title 

ISO 

ISO 

3927:2011 

Metallic powders, excluding powders for hard metals. 
Determination of compressibility in uniaxial compression 
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Type 

Number 

Title 

MPIF 

MPIF 

Standards 45 
2007 Edition 

Method for determination of compressibility of metal powders 

ASTM 

ASTM 

B331-10 

Standard test method for compressibility of metal powders in 
uniaxial compaction 


Annex 6. Standards for Powder Compressibility Determination 


Type 

Number 

Title 

ISO 

ISO 

3995:1985 

Determination of green strength by transverse rupture of rectangular 
compacts 

MPIF 

MPIF 

Standards 15 
2007 Edition 

Method for determination of green strength of unsintered 
compacted powder metallurgy materials 

ASTM 

ASTM 
B3 12-09 

Standard test method for green strength of specimen compacted 
from metal powder 
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Abstract 

Solid-state welding is a group of welding processes that produce sound joints at 
temperatures essentially below the melting point of the parent materials or 
without bulk melting of the parent materials. Solid-state welding processes 
have been widely applied in automobile, aircraft, and aerospace industries 
because of their enormous advantages associated with solid-state feature. The 
joints produced by solid-state processes are usually free of various solidification 
defects such as gas porosity, hot cracking, and nonmetallic inclusions, which 


J. Guo (S) 

Singapore Institute of Manufacturing Technology (SIMTech), Agency for Science, 
Technology and Research (A*STAR), Singapore 
e-mail: jfguo@ simtech.a-star.edu. sg 


© Springer-Verlag London 2015 

A.Y.C. Nee (ed.), Handbook of Manufacturing Engineering and Technology , 
DOI 10.1007/978-1-4471-4670-4 55 


569 


570 


J. Guo 


may otherwise be present during fusion welding processes. No filler metals, 
flux, or shielding gas is required during solid-state welding process. The 
metal being joined can have mechanical properties similar to or even better 
than that of their parent metals due to the absence of defects and heat-affected 
zone in most of these processes. In addition, solid-state welding processes 
are also very suitable for joining dissimilar materials as their chemical 
compatibility, thermal expansion, and conductivity are no longer important 
problems. Solid-state welding, alternatively called solid-state bonding, covers 
a wide spectrum of processes including cold welding, forge welding, ultrasonic 
welding, friction welding, friction stir welding, resistance welding, diffusion 
bonding, and explosion welding. In these processes, bonding is achieved through 
deformation and diffusion at certain pressures and temperatures by using 
mechanical, electrical, or thermal energy. Unlike various fusion welding 
processes which are well known, solid-state welding is usually not well 
acquainted by industrial engineers. This chapter aims to give the engineers and 
the graduates working in relevant industries some basic knowledge in the 
selection of welding processes. Emphasis will be given to friction stir welding 
as it is a relatively new solid-state joining process and has generated great 
interests from various industries. 


Introduction 


Solid-state welding (SSW) refers to a group of welding processes that produce 
sound joints at temperatures essentially below the melting point of the parent 
materials or without bulk melting of the parent materials. During SSW, bonding 
is achieved through deformation and diffusion at certain pressures and temperatures 
by using mechanical, electrical, or thermal energy. Bonding time may vary from 
less than one-tenth of a second in explosive welding to a few hours in diffusion 
bonding depending on the applied pressures and temperatures. 

The history of solid-state welding may date back to earlier than 1,000 B.C. when 
ancients joined metals by simply hammering them together which we call forge 
welding today. However, SSW was not widely accepted in industrial until the 
second half of the twentieth century. The digitalization of manufacturing during 
the still ongoing third industrial revolution makes it possible to precisely control 
various parameters during SSW processes and thus has significantly boomed their 
applications. 

The solid-state joints are usually free of various solidification defects such as gas 
porosity, hot cracking, and nonmetallic inclusions, which may otherwise be present 
during fusion welding processes. No filler metals, flux, or shielding gas is required 
during solid-state welding process. The metal being joined can have mechanical 
properties similar to that of their parent metals due to the absence of defects and 
heat-affected zone in most of these processes. In addition, SSW is also very suitable 
for joining dissimilar materials as their chemical compatibility, thermal expansion, 
and conductivity are no longer important problems. 
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SSW, alternatively called solid-state bonding, covers a wide spectrum of 
processes including cold welding, forge welding, ultrasonic welding, friction 
welding, friction stir welding, resistance welding, diffusion bonding, and 
explosion welding. Each of SSW processes is different from one another in 
terms of principles and applications. Therefore, each of the SSW processes will 
be discussed briefly below. Emphasis will be given to friction stir welding as it is 
a relatively new solid-state joining process and has generated great interests from 
various industries. 


Cold Welding 

Cold welding is a solid-state welding process that is conducted at room temperature 
with extremely high pressure applied on the pre-cleaned surfaces of workpiece 
through either dies or rolls. A substantial deformation of the welding parts caused 
by the applied pressure will first disrupt the oxide films on the faying surfaces and 
bring them into intimate contact at atomic level to finally form a strong and 
defectless bonding. Therefore, cold welding is typically used to join ductile mate- 
rials such as aluminum, copper alloys, and even low-carbon steels. At least one of 
the welding parts must be ductile when welding dissimilar materials. Because of the 
high pressure involved, cold welding is mainly applied to small parts such as wire, 
rods, and bars. Compared to electrical welding, cold welding is cleaner, easier, and 
also more cost-effective, particularly for joining large rod sections. Figure 1 shows 
one cold welding machine which is capable of joining aluminum rod with up to 
30 mm diameter, while Fig. 2 shows one example of metal joint produced by this 
cold welding machine. 


Ultrasonic Welding 

Ultrasonic welding is a solid-state welding process, during which a sound joint of 
two components is produced with the local application of high-frequency acoustic 
vibration to the faying surfaces held under pressure. The frequency of ultrasound 
may range from 10 to 75 kHz (Jellison et al. 1997). The oscillating energy creates 
shearing stress along the interface of the two components. This shearing stress 
combined with the normal pressure causes minute plastic deformation at the 
interface breaking up oxide films and thus leads to intimate contact of faying 
surfaces to finally produce a sound joint. The plastic deformation could raise the 
temperature in the weld zone to from one-third to one-half of the melting point of 
the metals being joined (Kalpakjian and Schmid 2006). At such low temperatures, 
the produced welds can usually retain the original properties of the parent materials. 
Therefore, ultrasonic welding can be used to join a wide variety of metallic (ductile) 
and nonmetallic materials including dissimilar metals and has been extensively 
used in electronic, aerospace, and instrument industries. However, this process may 
suffer when joining parts with moderate or thick thickness as the power of 
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Fig. 1 Cold welding 
equipment for joining metal 
rods and bars (Anonymous 
2012 ) 



Fig. 2 Titanium metal rod 
produced by cold welding 
process (Anonymous 2012) 



ultrasonic generator is limited. Besides, workpieces may risk of bonding to the 
anvil. Other concerns on fatigue of workpieces and equipment components as a 
result of vibration should also be considered. 


Friction Welding 

Friction welding is a solid-state process that uses heat generated from the friction 
between a moving workpiece and a stationary one to produce the weld. There are 
two major variations of friction welding process: direct drive welding and inertia 
drive welding. In direct drive friction welding, one of the workpieces driven by a 
motor is rotated continuously at a predetermined constant speed, while the other is 
restrained from rotation. Heat is generated as the faying surfaces rub together under 
pressure. The rotational driving force is discontinued at a preset time, and the 
rotating workpiece is stopped by either the application of a brake force or its own 
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Fig. 3 Basic steps in friction welding 


resistance to rotation. The forge force must be maintained or increased after rotation 
ceases to produce a sound weld. In inertia friction welding, one of the workpieces is 
connected to a flywheel, and the other is restrained from rotation. The stored energy 
in the flywheel decelerated from a preset speed causes the two faying surfaces rub 
together under pressure and produce a weld. Figure 3 shows the basic steps of a 
typical friction welding process. 

As a solid-state welding process, friction welding does not need any filler metals, 
flux, or shielding gas. Another advantage of friction welding is that the requirement 
of surface cleanliness is not as significant as other solid-state welding processes 
because the friction motion tends to disrupt and displace oxides films. With these 
advantages, friction welding has found widespread application for joining similar 
and dissimilar materials in automotive, aviation and aerospace, machinery, electri- 
cal, electronic, and chemistry industries. However, the application of conventional 
rotary friction welding is limited to join small components with axisymmetric 
feature at a relatively low productivity. A variation of this technique is linear 
friction welding which works on the same principle as conventional rotary friction 
welding except that one of the welding parts oscillating laterally instead of rotating. 
Linear friction welding offers the advantage of joining components with any shape 
compared to only circular ones in conventional rotary friction welding. However, 
this process may require extremely high forging force, and the parts to be joined 
need to possess high shear strength. Up to now, the world’s largest linear friction 
welding machine has a capacity of 100 tonnes in terms of forging force that can 
apply to the welded joint (MOOG 2012). Figure 4 provides two examples of joints 
produced by linear friction welding. 

A concern in friction welding is that the dissolution of hardening precipitates 
may cause mechanical property degradation, although this can be recovered by 
applying proper post-weld heat treatment. In addition, tight concentricity require- 
ments, when needed, may be difficult to meet, and the welding flash may require 
further finishing operation. 
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Fig. 4 Examples of joints 
made by linear friction 
welding (MOOG 2012) 



Friction Stir Welding 

Friction stir welding process, as illustrated in Fig. 5, involves a rotating tool that is 
inserted into the joint edges and moves forward along the joint line. The heat generated 
from the local friction between the tool and the workpiece causes the stirred material to 
soften without melting. A sound weld is produced in solid state as the material is 
transferred from the front to the back of the moving tool pin. FSW is a relatively new 
welding technique that has demonstrated its great potential in joining materials that are 
traditionally considered to be unweldable or difficult to weld. As a variation of friction 
welding, it provides the opportunity of welding components with relatively flat geom- 
etry at high productivity. Compared with fusion welding processes, the solid-state 
feature of FSW prevents many metallurgical problems such as porosity and formation 
of brittle phases and enables us to produce welds with a minimum distortion and 
shrinkage that result from the melting and solidification. In addition, a much finer 
microstmcture can be obtained owning to a dynamic recrystallization process, which 
will benefit the mechanical properties of the welds. Furthermore, FSW also eliminates 
the needs of shielding gas and filler metal because of the elimination of the liquid 
welding pool. With these interesting advantages, FSW has gained wide acceptance for 
welding aluminum alloys and is becoming an important technique in many industries. 
It can be applied not only for aluminum alloys, as it was firstly developed for, but also 
for other materials, including steels, titanium alloys, and nickel alloys. 


Process Parameters 

Various process parameters including the tool rotation speed, welding speed, tilt 
angle, and the vertical pressure on the tool can be changed to control FSW process. 
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Fig. 5 Schematic of friction i Downward Force 

stir welding process 
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Table 1 Main process parameters of FSW and their effects on the process 


Process parameter 

Effects on FSW process 

Tool rotation speed 

Peak temperature increases with increasing rotation speed 

Welding speed 

Peak temperature decreases with increasing welding speed 

Tilt angle 

Suitable tilt angle may help the tool hold the stirred material 
and transfer the material efficiently from the front to the back 

Vertical pressure or insertion 
depth of pin shoulder 

Too high pressure leads to overheating, excessive flash, and 
joint thinning, while too low pressure may cause insufficient 
heating and inner voids or surface groove 

Preheating 

May help high-melting-point materials soften and plasticize 

Cooling 

Could help low-melting-point materials avoid too much 
grain growth and dissolution of strengthening precipitates 


The resulted material flow and temperature profile during the process further 
determine the micro structural evolution and the subsequent mechanical properties 
of the welds. Thus, the major FSW parameters and their effects on the process are 
listed in Table 1. 


Tool Design 

As described previously, in FSW process, a nonconsumable rotating tool with a 
specially designed pin and shoulder is inserted into the abutting edges of sheets or 
plates to be joined and traversed along the line of joint. The heat generated from the 
friction between the tool and the workpiece will plasticize the materials around the 
tool, and the softened materials are transported from the front to the back of the tool. 
A sound joint is then produced as the welding tool moved along the joint line. The 
FSW tool is a critical component to the success of FSW process. The selection of 
tool materials and tool geometry is therefore briefly discussed below. 
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Currently, cost-effective and long-life tools made of tool steel alloys are avail- 
able for the FSW of soft materials such as aluminum and magnesium alloys. Steel 
tools have also been used for the joining of dissimilar materials in both lap and butt 
configurations through proper joint design to avoid direct contact of the tool with 
hard materials. For instance, Lee et al. (2009) welded Al-Mg alloy with low-carbon 
steel in lap joint configuration using tool steel as tool material without its excessive 
wear by placing the softer Al-Mg alloy on top of the steel plate and avoiding direct 
contact of the tool with the steel plate. In butt joint configuration, the harder 
workpiece is often placed on the advancing side, and the tool is slightly offset 
from the butt interface towards the softer workpiece (Chen 2009; DebRoy and 
Bhadeshia 2010). However, tool wear can be a severe issue when joining metal 
matrix composites or high-melting-temperature materials including steel, nickel, 
and titanium alloys because the FSW tool is subjected to extreme abrasive condi- 
tion, high stress, or high temperatures (Rai et al. 2011; Mishra and Ma 2005). 
Poly crystalline cubic boron nitride (pcBN) and W-based alloys are important candi- 
date materials for the FSW of these hard or high-melting-temperature materials due to 
their high strength, hardness, and wear resistance even at elevated temperatures. 
There is also an interest in Si3N4 as a prospective tool material because it had 
produced welds comparable with pcBN tools at a much lower cost (Rai 
et al. 2011). Further developments in FSW tool materials are required to address 
the problem of high tool cost with low tool life during welding of hard alloys. 

The tool geometry plays a critical role in heat generation and material flow, thus 
determining the uniformity of micro structure and properties as well as process 
loads. Important factors are shoulder diameter, shoulder surface angle, pin geom- 
etry including its shape and size, and the nature of tool surfaces. Generally, a 
concave shoulder and threaded cylindrical pins are used. With increasing experi- 
ence and improvement in understanding of material flow, the tool geometry has 
evolved significantly. Complex features have been added to alter material flow and 
mixing and reduce process loads. For example, various Whorl™-type probes, 
shown in Fig. 6 (Thomas et al. 2003), are designed to weld in excess of 6-mm-thick 
plate at higher speeds than conventional pin-type probes. Flared-Triflute™ tools 
illustrated in Fig. 7 are designed for lap joining to increase the width of the weld 
region to achieve a better bond (Thomas et al. 2003). Recently, computational tools, 
including finite element analysis (FEA), were used to visualize the material flow 
and calculate axial forces, which can be an effective method for tool design and 
optimization. However, FSW tools are currently designed empirically by trial and 
error. Work on the systematic design of tools using scientific principles is just 
beginning. A more detailed discussion on this subject can be found elsewhere in the 
literature (Rai et al. 2011; Mishra and Ma 2005). 


Microstructural Evolution 

The microstructural evolution is one of the most important issues since it further 
determines various properties of the materials. In the case of FSW, the 
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Fig. 6 


Basic variants of Whorl™-type probes (Thomas et al. 2003) 


micro structural aspect has been extensively documented in the literature. Gener- 
ally, three distinct regions shown in Fig. 8 can be identified in aluminum friction stir 
welds as nugget zone, thermomechanic ally affected zone (TMAZ), and heat- 
affected zone (HAZ). According to Lippold et al. (2003), the nugget zone is a 
recrystallized region associated with the passage of the tool. The TMAZ is a region 
unique to FSW in which the material has undergone some macroscopic deformation 
without grain refinement since the force and temperature are not high enough to 
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Fig. 7 Basic variants of the Flared-Triflute™-type probes (Thomas et al. 2003) 




Advancing 


Retreating 


2000 pm 


Fig. 8 A typical macrograph showing various micro structural regions in aluminum friction stir 
welds 


promote dynamic recrystallization, while the HAZ is a region without deformation 
but may undergo some metallurgical change. In fact, apart from the grain structure 
difference, metallurgical changes are happening simultaneously in all these three 
regions depending on the temperature profile and alloy composition. 

Effects of Process Parameters 

The thermal cycle experienced by the workpiece is the main factor that determines 
the micro structural evolution and the final property of friction stir welds. Any 
change in process parameters will undoubtedly lead to variation in the thermal 
cycle and hence in the micro structure and properties. 

For both heat-treatable and non-heat-treatable aluminum alloys, it was recorded 
that the grain size in the nugget zone increased with increasing rotation speed or 
decreasing welding speed (Hassan et al. 2003a, b; Hirata et al. 2007; Sato 
et al. 2007). Figure 9 gives a good example of micrographs showing the effects 
of rotation speed on grain structure in the FSW nugget of AA7010 Al joint. At 
180 rpm, the average grain size in the nugget was extremely fine, around 1.7 pm, 
compared with 6 pm in the welds made at 450 rpm. These results can be explained 
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Fig. 9 EBSD maps showing the grain structures in the nugget zones of FSW welds with different 
rotation speeds but a constant welding speed, (a) and (c) at the center, (b) and (d) at the bottom of 
the welds (Hassan et al. 2003a) 


by the thermomechanical cycle in the weld. As mentioned earlier in Table 1, the 
peak temperature during FSW could be controlled by varying the rotation speed and 
the welding speed, which in turn determines the resultant grain structure of a 
dynamic recrystallization process. In a different way, Benavides et al. (1999) 
obtained finer grain size of about 0.8 pm compared to the original of 10 pm through 
lowering the starting temperature from 30 °C to —30 °C, which effectively 
decreases the peak temperature from 330 °C to 140 °C. 

Precipitate Behavior During FSW 

FSW process exerts a thermal cycle on the workpiece in solid state. For many 
aluminum alloys strengthened by precipitation hardening mechanism, it can be 
predicted that FSW would cause a soften region within and around the weld nugget. 
As illustrated in Fig. 10, Nandan et al. (2008) used a schematic diagram to interpret 
this softening process. HVmin and HVmax represent the hardness in the solution- 
treated and precipitation-hardened states. They described that the precipitates may 
experience coarsening, dissolution, and reprecipitation during FSW depending on 
the localized temperature. The heat produced during FSW causes a drop in hardness 
from HVmax to HVmin, curve (a) in Fig. 10. Some reprecipitation may occur 
during the cooling part of the thermal cycle, resulting in a hardness value beyond 
HVmin, curve (b) in Fig. 10. The minimum hardness would thus appear at some- 
where in HAZ as a result of the competing effect of dissolution and reprecipitation. 
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Fig. 10 Schematic diagram 
showing dissolution and 
reprecipitation in 
age-hardenable aluminum 
alloys (Nandan et al. 2008) 


◄ Peak temperature 


▲ 


Hardness 



In fact, the above theory has been proven in several studies on microstructural 
evolution of FSW (Su et al. 2003; Sato et al. 1999). In the work of Su et al. (2003), 
the typical precipitate distribution in each weld region of an AA7050-T651 alloy 
was illustrated by transmission electron micrographs as shown in Fig. 11, which 
corresponds very well to the above interpretation. Similarly, a comparison between 
AA6063 friction stir welds and isothermal aging samples has been made by Sato 
et al. (1999) according to simulated thermal cycles, which also confirmed again the 
precipitation sequences in heat-treatable aluminum friction stir welds. 


Abnormal Grain Growth 

Abnormal grain growth sometimes known as secondary recrystallization usually 
takes place in recrystallized materials at high temperatures. This process may lead 
to the formation of large grains of even greater than several millimeters, which 
causes negative effects for mechanical property. Figure 12 shows two micrographs 
of the cross sections of AA7075 FSW joints in the as- welded and T6 post- weld 
heat-treated conditions. Its occurrence during post- weld heat treatment (PWHT) of 
friction stir welds has been reported in the literature (Hassan et al. 2003a; Krishnan 
2002; Attallah and Salem 2005). A good review of this phenomenon in FSW could 
also be found in the work of Sato et al. (2007). Hassan et al. (2003a) studied the 
stability of friction stir- welded AA7010 alloy during solution treatment. It was 
concluded that the abnormal grain growth might be promoted by the fine-grain 
structure in the nugget zone and the partial dissolution of second-phase particles 
during solution treatment. The welds made with higher heat input seem to be more 
stable since only a few large grains developed at the base of the weld on its 
advancing side. Attallah and Salem (2005) also revealed that the rotation speed 
and the welding speed can be used as a tool to control abnormal grain growth during 
subsequent heat treatment. It was found that the welding parameter determines 
the extent of abnormal grain growth and the welds made with lower rotation 
speed and higher welding speed have higher tendency to abnormal grain growth. 
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Fig. 11 TEM micrographs showing precipitate distribution in different FSW weld regions: 
(a) parent metal, (b) HAZ, (c) TMAZ I, (d) TMAZ II, (e) nugget zone (Su et al. 2003) 


Nevertheless, none of the above two investigations were able to completely avoid 
abnormal grain growth. Recently, Sato et al. (2007) conducted a more systematic 
research work with AA1100 alloy. Different rotation speeds and heat treatment 
temperatures were used in FSW and PWHT, respectively. The peak temperatures 
experienced during FSW were measured by placing alumel-chromel thermocou- 
ples at the bottom of the stirred zone. As shown in Fig. 13, the abnormal grain 
growth was only observed in conditions with PWHT temperature higher than the 
peak temperature experienced during FSW. For instance, the peak temperature 
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Fig. 12 Micrographs showing the cross section of AA7075 FSW joints in (a) as-welded, (b) T6 
post-weld heat-treated conditions 


Fig. 13 Effect of rotation 
speed on onset temperature of 
abnormal grain growth (Sato 
et al. 2007) 



Rotational speed (rpm) 


measured during FSW with 1,800 rpm was 755 K; PWHT were carried out under 
different temperatures of 573, 623, 643, 673, 773, and 873 K. Only the materials 
heat treated at 773 and 873 K, which are higher than 755 K, experienced abnormal 
grain growth. 
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Mechanical Properties 
Microhardness 

Hardness data were usually measured as an initial evaluation of mechanical prop- 
erties across the FSW weld zone. Although it was reported that the hardness 
responses to FSW are dependent on the materials’ composition, initial temper, 
and also the welding process parameters, these materials can generally be classified 
into heat-treatable and non-heat-treatable alloys. The commonly reported hardness 
plots for heat-treatable alloys usually have “W” shapes, which consist of a local 
maximum central plateau with two minimum on both sides in HAZ, but the overall 
hardness values in the weld zone are much lower than that in the base materials 
(Hassan et al. 2003b; Sato et al. 1999; Reynolds et al. 2005; Murr et al. 1998). For 
non-heat-treatable alloys, the hardness distribution seems to be controversial 
though the same principle was used to explain the different results. 

The hardness of heat-treatable alloys is dominated by precipitation behavior. As 
discussed in the previous part, the precipitate experiences a process of coarsening, 
dissolution, and reprecipitation during FSW depending on the localized tempera- 
ture. Thus, it is not surprising to have a “W”-shaped hardness profile in the weld of 
heat-treatable alloys, and an example is shown in Fig. 14. It is important to note that 
the welding process parameters have some influences on the hardness profile of the 
weld. Reynolds et al. (2005) did an elegant work to build the relationship between 
welding parameters and hardness distribution in AA7050 alloy. It was found that 
the average hardness in the weld nugget is closely related to welding process 
parameters. As shown in Fig. 15, the average hardness increases with increasing 
welding speed at a constant weld pitch (ratio of welding speed and rotation speed). 
Furthermore, higher hardness was achieved at higher rotation speed for a given 
welding speed. It can thus be concluded that the average nugget hardness level 
increases with increasing the heat input during welding (Reynolds et al. 2005). It 
was also observed that the hardness minimum on both sides increases with increas- 
ing weld power. Hassan et al. (2003b) investigated the effect of welding parameters 
on the hardness of the nugget zone. Similar results were obtained except that the 
hardness minimum did not change with welding parameters as in the nugget. 

On the other hand, the hardness of non-heat-treatable aluminum alloys is mainly 
dependent on grain size and dislocation density. It was generally reported that FSW 
did not cause too much changes in hardness of the weld for annealed alloys (Murr 
et al. 1998; Liu et al. 2003), while an obvious softened region could be observed in 
the weld of strain-hardened alloys (Liu et al. 2003; Flores et al. 1998). However, 
Saito et al. (2007) observed higher microhardness in the weld zone for both the 
rolled and annealed AA1050 alloy samples. Recently, Kwon et al. (2003) examined 
the average hardness across the weld of a cold-rolled AA1050 alloy after welding at 
different rotation speeds from 560 to 1,840 rpm. It was reported that the average 
hardness of the weld increased significantly with decreased tool rotation speed, as 
shown in Fig. 16. Fifty percent increase of the average hardness in the weld was 
observed by adjusting the rotation speed from 1,840 to 560 rpm (32 vs. 48 HV0.1). 
They suggested that the weld zone has much finer grain size and lower dislocation 
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Fig. 14 Hardness profiles in 
the welds made with the same 
weld pitch (ratio of travel and 
rotation speed) (Reynolds 
et al. 2005) 



Position relative to weld centerline, mm 


160 


Fig. 15 Effect of weld pitch 
and welding speed on the 
average hardness in FSW 
nugget (Reynolds et al. 2005) 
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Fig. 16 Average Vickers 
microhardness of the 
unprocessed base metal and 
friction stir- welded zone 
(Kwon et al. 2003) FZ friction 
stir-processed zone, UZ 
unprocessed zone 
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Table 2 Tensile properties of friction stir welds in as- welded condition 



Friction stir weld properties 

Base alloy 

UTS 

(MPa) 

YS 

(MPa) 

Elongation 

(%) 

Efficiency 
(%, UTS) 

Ref. 

AA1050-H24 

72-85 

38-51 

9.5-20.5 

62-74 

Liu et al. (2003) 

Cold-rolled 

AA1050 

125-202 

— 

— 

91-146 

Kwon et al. (2003) 

AA2219-T6 

225-336 

— 

5-7.3 

54-81 

Liu et al. (2006) 

Overaged 

AA2095 

188-211 

— 

— 

95-100 

Attallah and Salem 

(2005) 

AA5083-O 

303-344 

— 

— 

100 

Svensson 
et al. (2000) 

AA6061-T651 

186-217 

114-141 

8.5-11.8 

60-70 

Ren et al. (2007) 


211-243 

— 

8. 1-8.8 

69-79 

Liu and Ma (2008) 

AA6082-T6 

226-254 

— 

— 

75-84 

Svensson 
et al. (2000) 


210-270 

136-185 

10-11.6 

69-89 

Cavaliere 
et al. (2008) 

AA7 010- 
TV 651 

375-518 

— 

3.5-22 

68-94 

Hassan 
et al. (2003b) 

AA7050- 

T7451 

427 

303 

5.7 

77 

Pao et al. (2001) 


density than the base metal. Therefore, at lower rpm (560-1,350), the grain size 
contributes more to the hardness than dislocation, while for 1,840 rpm, the effect of 
the grain size cannot overcome the negative influence of dislocation decrease 
(Kwon et al. 2003). 

Tensile Properties 

Like the hardness, tensile properties are also determined by the microstructure of 
the materials. Compared with fusion welds, friction stir welds are generally char- 
acterized by finer grain microstructure and less reduction of mechanical properties. 
The tensile properties of some friction stir welds are summarized in Table 2. Most 
of the data focused on alloys in strain-hardened or aging-hardened conditions. It can 
be seen that the joint efficiency is generally higher than 60 %, but lower than 100 %. 
Higher efficiency could be achieved through adjusting the process parameters. It 
should be noted that extremely high joint efficiency of about 146 % has been 
obtained in FSW of a cold-rolled 1050 alloy (Kwon et al. 2003). 

As already mentioned, the thermal cycle experienced by the workpiece is the 
main factor that determines the micro structure and the final mechanical properties 
of friction stir welds. It can be controlled by varying the process parameters such as 
the tool rotation speed and the welding speed. The effect of process parameters on 
tensile properties of heat-treatable alloys has been investigated in several studies 
(Hassan et al. 2003b; Svensson et al. 2000; Ren et al. 2007; Liu and Ma 2008; 
Cavaliere et al. 2008; Pao et al. 2001). Hasson et al. (2003b) found that there is an 
optimum FSW condition that gives the best tensile properties for AA7010-T7651. 
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They explained that the tensile properties of the materials are dominated by the 
behavior of precipitates. Relatively higher heat input could lead to more dissolution 
of coarsened precipitate particles, and thus more solutes are available for natural 
aging process for 7xxx series alloys. However, too high heat input will deteriorate 
the weld properties due to reprecipitation occurring during the cooling process after 
welding, which would reduce the natural aging response of the welds. Similar 
trends of tensile properties for AA6082-T6 alloy were also observed in another 
study (Cavaliere et al. 2008). On the other hand, the effect of process parameters on 
tensile properties of non-heat-treatable alloys has also been reported in the literature. 
Kwon et al. (2003) examined the mechanical properties of a cold-rolled AA1050 
plate after FSW by using different tool rotation speeds. It was shown that the average 
tensile strength of the welds increased significantly with decreased tool rotation 
speed (from 123 to 202 MPa), which has a similar tendency to the average hardness 
results (from 32 to 46 HV0.1). They attributed this behavior to the much finer grain 
size in spite of lower dislocation density. Meanwhile, Liu et al. (2003) reported 
a similar study of FSW for AA1050-H24 plates by varying the welding speed from 
1.7 to 13.3 mm/s with constant rotation speed of 1,500 rpm. It was observed that the 
yield strength increased slightly with increased welding speed, and the ultimate 
strength however showed a peak value at 6.7 mm/s. Unfortunately, the authors did 
not give any explanation for this phenomenon. 

Post-weld heat treatment is sometimes applied to recover the mechanical prop- 
erties of friction stir welds. It can easily be understood that solution treatment 
followed by artificial aging could probably completely restore the base metal tensile 
strength (Krishnan 2002). However, attention needs to be paid to the occurrence of 
abnormal grain growth, which is believed to be very harmful to mechanical 
properties. As described previously, the mechanism of abnormal grain growth has 
been understood to some extent. Nevertheless, its effects on tensile properties were 
not reported in these studies probably because the mechanical properties of the 
joints have been degraded to unacceptable level (Hassan et al. 2003a; Krishnan 
2002; Attallah and Salem 2005). 


Inspection 

The integrity of friction stir welds is of great importance as it determines the 
mechanical performance of joints. Surface defects such as flashes and grooves 
can be readily identified visually. Figure 17 shows two examples of surface pores 
and groove caused by inappropriate process parameters. Discontinuous pores or 
continuous wormholes inside the welds can also be present usually on the advanc- 
ing side. Such defects may be detected by using nondestructive testing techniques 
such as X-ray and ultrasonic inspection, and some success has been reported in the 
literature (Chen et al. 2009; Guo et al. 2012; Maity et al. 2008). However, the 
equipment for NDT inspection is relatively expensive, and root defects such as 
insufficient penetration cannot be detected. Liquid penetrant inspection is com- 
monly used as an effective and low-cost method to eliminate friction stir welds with 
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Fig. 17 Surface defects in 
FSW joints caused by 
inappropriate process 
parameters (a) Surface pores 
(b) Groove 





insufficient root penetration. Nevertheless, the penetrant can be harmful for the 
operator’s health. Therefore, three-point bending test together with conventional 
metallographic examination is still the dominated method to evaluate the soundness 
of friction stir- welded joints. 


Diffusion Bonding 

Diffusion welding is a welding process that produces a solid-state bond between 
two components under a certain load at a temperature below the melting point of the 
materials to be joined. During this process, no macroscopic deformation or relative 
motion of the workpiece takes place, but a filler material may be added at the 
interface to facilitate the bonding process. Inert atmosphere or vacuum may be 
required to protect the workpiece. Alternatively, diffusion bonding is sometimes 
called as diffusion welding or hot pressure bonding/welding in different industries. 

In diffusion bonding, the complete atomic bonding of two solid surfaces can be 
described by a three-stage model, as shown in Fig. 18. The first stage comes after 
the contacting of the faying surfaces when the surface asperities deform due to 
localized yielding and creep producing a large fraction of interface. Grain bound- 
aries at the area of contact with voids are the main features of joint interface at the 
end of this stage (Fig. 18b). The second stage is dominated by the diffusion of atoms 
and the disappearance of voids. The interfacial grain boundaries come to a more 
equilibrium state, leaving some of the remaining voids inside the grains at the 
interface (Fig. 18c). The third stage is mainly a process of the elimination of 
remaining voids through atom diffusion inside the volume of grains. At the end 
this stage, no void can be found at the interface, or the voids are small enough, 
having no impact on the strength of interface (Fig. 18d). 


588 


J. Guo 




Fig. 18 Typical stages during diffusion bonding 


To obtain a successful diffusion bond, attentions have to be paid to surface 
preparation, bonding temperature, pressure, and time. Among them, temperature is 
the most influential factor because it controls not only the yielding and creep 
properties of the workpiece in the first stage but also the diffusion rate of atoms 
in the second and third stages. Bonding pressure and surface roughness are very 
important factors determining the fraction of contact area and the amount and size 
of voids formed in the first stage. Pre-removal of pressure or too rough surfaces can 
result in large voids that must be eliminated in the later stages of welding. Bonding 
time, on the other hand, is not an independent variable. The necessary time required 
to produce a sound joint depends on the applied temperature and pressure. 

Diffusion bonding possesses great advantages for joining dissimilar materials, 
metal matrix composites, and reactive materials as it avoids the formation of 
intermet allies or oxides. Therefore, diffusion bonding has been widely used in 
electronics, composites manufacturing, and aircraft or aerospace industries. How- 
ever, the dimensions of the parts that can be diffusion bonded are limited to the size 
of bonding furnace. Besides, this technique is usually a time-consuming process 
and thus has a relatively low productivity. 


Resistance Welding 

Resistance welding is a group of welding processes including resistance spot, 
projection, and seam welding that uses electrical resistance heating to weld two 
pieces of metals held under pressure. In principle, it involves a coordinated appli- 
cation of electric current and mechanical pressure of preselected values and 
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duration. Electrical heat is first developed to melt a confined volume of metal from 
the two workpieces. The current is then turned off, allowing the molten metal to 
solidify, forming an electron- sharing metallurgical bond. The current density and 
bonding pressure must be such that enough molten metal is raised to form a nugget, 
but not so high to squeeze out the molten metal from the weld zone. Based on the 
principle, it is more appropriate to classify resistance welding into fusion welding 
techniques, but it is more often discussed together with other solid-state welding 
processes because all these processes are conducted under a certain pressure. 

In resistance spot welding, the weldment of two metal pieces is produced 
directly at the electrode site. If the location of weldment is determined by a 
projection on the surface of a workpiece or by the intersection of parts, the term 
“resistance projection welding” is commonly used. If a series of overlapping 
weldment is produced with rotating wheel electrodes, resistance seam welding is 
the right term. A seam weld can be also produced by spot welding equipment, but 
the productivity will be much lower. 

Resistance welding techniques have been widely used in the assembly of sheet 
metal products such as automotive body-in-white assemblies, domestic appliances, 
furniture, building products, enclosures, and aircraft components because of their 
relatively high welding speeds and suitability for automation and inclusion in high- 
production assembly lines. Nonetheless, only relatively thin sheet metal can be 
welded by using resistance welding. Besides, lap joint design used in these pro- 
cesses tends to add weight and material cost compared to butt joints. In addition, the 
high power used in resistance welding requires high-quality line demand, especially 
for some single-phase welding machines. Moreover, the weld path for seam 
welding has to be straight or uniformly curved path since any abrupt in welding 
direction cannot be welded leak tight. 


Explosion Welding 


Explosion welding is a solid-state welding technique that produces a weld by a 
controlled detonation on the surface of a metal. It is considered as a room temper- 
ature process though a large amount heat is generated from the explosive collision. 
During explosion welding, the base component, usually supported with an anvil, 
remains stationary, while the prime component (flyer plate) is positioned with an 
angle or parallel to the base plate, as illustrated in Fig. 19. The explosive uniformly 
distributed on the prime component is then detonated from one edge of the prime 
surface and propagates to the other edge at very high velocity. A strong weld is then 
produced as a result of collision through plastic flow of the metal on the surfaces. 
The collision velocity is usually higher than the material sonic velocity. And the 
explosive must supply uniform detonation so as to achieve a collision velocity that 
is uniform from the start to finish of the weld. 

Explosion welding is well known for its capability of joining a wide variety of 
both similar and dissimilar combinations of metals that cannot be joined by 
any other techniques. It is a simple, portable, and cost-effective welding process. 
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Fig. 19 Schematic of 
explosion welding process: 
(a) constant interface 
clearance gap, (b) angular 
interface clearance gap 
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High-quality welding can be achieved quickly over large surface areas. The 
strength of the weld can be equal to or greater than the weaker one of the two 
metals being joined. Presently, explosion welding is mainly used in the cladding of 
base metals with thinner alloys, tubular transition joints, pipeline joining, and repair 
of cylindrical components in various industries. Nevertheless, explosion welding is 
not become widely used because of several limitations. First, materials with low 
ductility and low impact toughness cannot be welded by explosion welding. 
Second, the geometries that can be welded by explosion welding must be very 
simple, e.g., flat and cylindrical. Third, more restrictions on personal safety and 
industrial use of explosives are applied due to the noise and vibration issues (Findik 
2011; Banker and Reineke 1998). 


Summary and Outlook 


Various solid-state welding techniques have been concisely described in this 
chapter. It is evident that the application of these processes has significantly 
boomed in the last few decades because of their various advantages such as free 
of solidification defects; no filler metals, flux, or shielding gas required; and the 
capability of joining dissimilar materials over fusion welding. The digitalization of 
manufacturing during the still ongoing third industrial revolution will continue to 
facilitate their incorporation into the atomized production lines. At the same time, 
more efforts in research and development on SSW have to be made to further 
improve the welding equipment and process, especially with the emerging of new 
advanced materials. Specifically, friction (stir) welding, diffusion bonding, resis- 
tance welding, and ultrasonic welding will greatly expand their application fields in 
the coming decades. Cold welding, forge welding, and explosion welding will 
continue to play important roles in their specific areas, although some of them 
may suffer either economical or environmental issues. 
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Abstract 

This book chapter deals with the basics of arc welding processes, heat source 
used for arc welding, different types of arc polarities, effect of shielding gases, 
and welding power sources necessary for arc welding. The fundamentals of 
formation of arc and arc physics were discussed. Small versions of conventional 
TIG, MIG, and PAW techniques are developed for high quality precision arc 
welding with ultralow energy input. In this chapter, different types of high 
precision arc welding process will be discussed. This chapter also describes 
the high productivity arc welding processes like twin wire gas tungsten arc 
cladding, plasma cladding, and laser-arc hybrid welding processes. Results 
showed that a significant increase in deposition rate and high productivity can 
be achieved with the high productivity arc welding processes. 


Introduction to Basics of Arc Welding 


Welding process is one of the important processes in any manufacturing sector. 
This technique is used to join similar or dissimilar materials by the application of 
heat or pressure or combination of both heat and pressure. At present, several 
welding processes have been developed to weld most of the known engineering 
materials. The most important aspect is to provide a high structural integrity with 
high joint efficiency during welding. Welding can be performed manually, semi- 
automatically, and fully automatically. The major disadvantage of this process is 
disturbance of the base metal micro structure due to the heat input involved during 
welding. Residual stresses and distortion may occur by the unbalanced heat input 
and thus the quality of the welded joints highly dependent on the skill of the welders 
and operators (Jeffus 1997). 


Source of Energy 

Welding process can be classified into two types such as fusion welding and solid 
state welding. In fusion welding, the temperature involved in the process is more 
than the melting point of the materials to be joined. Bonding achieved at the 
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interface is due to the influence of the high mobility of the atoms in the molten state, 
and strong joint could be achieved during cooling. In solid state welding, the 
temperature involved is lower than the melting point of the materials, and in few 
processes, no heating is involved. The joint formed at the interface is mainly 
influenced by the plastic deformation or by molecular diffusion of atoms or 
combination of both. 

Three fundamental sources of energy are used in welding, viz., (i) electrical, 
(ii) mechanical, and (iii) chemical. Electrical energy in the form of an electric arc 
between electrodes, resistance heating by ER effect, induction heating, or high 
energy electron beam are used to melt base and filler metals during welding. Energy 
generated by friction, plastic deformation and impact energy are used as mechan- 
ical source for the welding. Whereas, energy released from the combustion of gases 
or reaction between chemicals is used as chemical sources of energy (Kou 2002; 
Messier 2004). 

The distinguishing feature of all fusion welding processes is the intensity of the 
heat source used to melt the material. The characteristics of each type of heat source 
employed in a welding process are determined by its intensity. This power-density 
spectrum shown in Fig. 1 gives an idea of the power densities available in the 
common joining processes. In general, 1,000 W/cm is necessary to melt most of 

f\ H o 

the metals, and at the same time, heat intensities of 10 or 10 W/cm will vaporize 
most metals within a few microseconds. At levels above these values, all of the 
solid that interacts with the heat source will be vaporized, and no fusion welding 
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Fig. 1 Range of heat intensities used of different welding processes 
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can occur. Thus, the heat sources for all fusion welding processes should have 
power densities between approximately 0.001 and 1 MW/cnrT. 

Normally, low-intensity processes can be recommended for welders to begin 
their training because the process is inherently slow and does not require any rapid 
response time in order to control the size of the weld puddle. Greater skill is needed 
to control the more rapid fluctuations in arc processes. The weld pool created by the 
high-heat-intensity processes, such as electron-beam and laser beam welding, is 
difficult to be manually controlled and must therefore be automated. The need for 
automation leads to increased capital cost of the equipment. For increasing the 
intensity of the power, the spot size of the source of energy is to be decreased. 
A decrease in the spot size will produce a squared increase in the heat intensity. 
However, small spot size, coupled with a decreased interaction time at higher power 
densities, complexes the problem of controlling the higher-heat-intensity process. 
A shorter interaction time means that the sensors and controllers necessary for 
automation must operate at higher frequencies. The smaller spot size means that the 
positioning of the heat source must be more precise. Thus, not only must the high- 
heat-intensity processes be automated because of an inherently high travel speed, 
but the fixturing requirements become greater and the control systems and sensors 
must have ever-higher frequency responses. These factors lead to increased costs, 
which is one reason that the very productive laser-beam and electron-beam welding 
processes have not found wider use (Welding brazing and soldering 1998). 


Arc Welding 

The heat source for the arc welding is an electric arc, and it is an important and more 
versatile process. In this process, high intensity of heat is generated by the arc 
between the electrodes which can melt any known material. It is useful to under- 
stand the fundamentals of electric arc to get a proper knowledge of arc welding. 

A welding arc converts electric energy into heat energy through the gaseous 
electric conductor. The gaseous electric conductor is known as plasma. The electric 
arc is usually developed between two electrodes, namely, anode and cathode. Due 
to the shape of the electrodes irrespective of the polarity, the welding arc always 
appears to be in bell shape. 

When current is passed through the electrode, due to the Joule effect, the 
electrode gets heated and the electrons get emitted from the hot electrode surface 
by the thermionic emission effect. Electrons emitted by thermionic emission 
accelerate due to the potential difference between electrodes and ionize the gases 
present between the electrodes. Electrons and ions move toward the respective 
attractive side when the current flows through the ionized gas. The resultant arc 
discharge is due to the combined effect of higher current and low voltage. During 
this arc discharge, due to the current flow, the ionization process is self-maintained 
due to the continuous production of plasma, and stable arc is formed between the 
electrodes. 
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Fig. 2 Schematic representation of relative arc voltage distribution versus relative arc length 
between electrodes 


Generally, the welding arc can be divided into five parts, viz., the cathode spot, 
the cathode drop zone, the arc column, the anode drop zone, and the anode spot as is 
depicted in Fig. 2. A sharp voltage drop is observed at the cathode and anode drop 
zones, whereas the voltage drop is gradual at the arc column. 


Cathode Spot 

A region at which electrons are emitted from the surface is called the cathode spot. 
From the spot, the electrons move toward the anode radially and axially. 

There are three different types of cathode spots, namely: 

(i) Mobile cathode spot 

(ii) Thermionic cathode spot 

(iii) Normal mode 

(i) Mobile Cathode Spot 

In materials like aluminum, magnesium, and copper, mobile cathode spot is 
observed. The presence of the adherent oxide layer retards the formation of the 
stable cathode spot, and emission occurs through the oxide layer wherever it is 
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weak. One or more cathode spots which appear at the cathode surface travel- 
ling at a high speed will usually leave behind a visible trace. The oxide film on 
the surface of the metal thus gets loosened by the movement of the mobile 
cathode spot. 

(ii) Thermionic Emission 

Thermionic emission occurs at the cathode tip when the current is flowing 
through the electrode. Increase in temperature of the electrode increases the 
electron emission from the cathode spot. 

(iii) Normal Mode 

In the normal mode, the cathode spot does not form any well-defined spot. 
In a low carbon-coated steel electrode, the cathode spot appears to envelop the 
entire molten tip of the electrode. 


The Cathode Drop Zone 

It is the gaseous region adjacent to the cathode in which a sharp drop in voltage 
occurs. It is usually brought about by an excess of positive ions at the cathode 
created by a positive space charge, which shields the cathode. However, in some 
forms of non-self-sustained electric current in a gas where there is intense electron 
emission from the cathode, a cathode drop is developed by the negative space 
charge (an excess of electrons); such a cathode drop limits emission and inhibits 
further increase of the space charge. The main processes that make possible the flow 
of electric current in a gas operate in the zone of cathode drop and its immediate 
vicinity. 


Arc Column 

It is the bright visible portion of the arc and has a high temperature and a 
low potential gradient. The arc column temperature depends upon the gases 
present in it and the amount of welding current flowing in the circuit. 
The arc column temperature which varies from 6,000 °C to more than 20,000 °C 
depends on the gases present in the arc column. At such a high temperature, 
almost all gases present in the arc column get split into atomic form and further 
split into electrons and ions. The cross section of the arc column is circular and 
contains two concentric zones. The inner zone is called the plasma which carries 
most of the current and has the highest temperature, whereas the outer zone is 
relatively cooler. 


Anode Drop Zone 

Anode drop is the electric connection between the arc column and the positive pole 
(anode), and a small temperature drop occurs from the arc column to the anode due 
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to fewer ions in this region. If the workpiece acts as the anode, the mean current 
density is quite low; the voltage drop is between 1 and 3 volts. The depth of the drop 

o 1 

zone is of the order of 10 “-10 mm. When the rod electrode acts as an anode, the 
anode spot occupies the lower hemisphere of the molten droplet at the tip of the 
electrode (Messier 2004; Welding brazing and soldering 1998). 


Effects of Polarity 

Three different types of arc polarities can be used for the welding arc generation, 
viz., (i) direct current electrode negative (DCEN), (ii) direct current electrode 
positive (DCEP), and (iii) alternating current (AC) where the polarity is changed 
every half cycle. 


Direct Current Electrode Negative (DCEN) 

In DCEN mode, electrons move from the electrode to the positive workpiece. 
During this mode, the heat distribution is more at the workpiece compared to the 
electrode. Lightweight electrons move faster than positively charged ions. As a 
result, the bulk (approximately two-thirds) of the heat is generated at the positive 
workpiece where the electrons give up their kinetic energy through collision with 
atoms in the positive electrode. The variation of the heat distribution is shown in 
Fig. 3. This mode results in deeper penetration and narrow welds. 
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Fig. 3 Schematic representation of effects of the various polarity modes possible and their effects 
on weld bead shape 
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Direct Current Electrode Positive (DCEP) 

In DCEP mode, electrons move toward the electrode and positive ions move toward 
the workpiece. Only one-third of heat is transferred to the workpiece in DCEP 
mode, and as a result, shallow welds are produced. This mode of operation is good 
for welding of thin sections and temperature -sensitive materials. This mode of 
operation is exclusively used for the removal of oxide layer in the aluminum and 
its alloys during welding. But DCEN mode is generally preferred for 
nonconsumable electrode processes than the DCEP mode to safeguard the 
electrode. 


Alternating Current (AC) 

The AC mode combines the characteristics of DCEN and DCEP modes. In this 
mode, oxide cleaning is achieved during positive cycle and reasonably good 
penetration is obtained during negative cycle. However, alternating current makes 
the arc unstable compared to DC modes. So, in AC welding, machines are generally 
equipped with high-frequency and high-voltage (HFHV) unit. 


Effect of Shielding Gases 

The welding arc is formed and sustained by the establishment of a conductive 
medium called the arc plasma. However, the formation and structure of the arc 
plasma is dependent on the properties of the shielding gases used for welding. To 
protect the molten weld metal from atmospheric contamination, inert gases like 
argon (Ar) and helium (He) and active gas carbon dioxide (C0 2 ) are used. Apart 
from these, nitrogen (N 2 ), oxygen (0 2 ), hydrogen (H 2 ), and their mixtures with the 
above gases are also used for achieving desired mechanical and weld metal 
properties. The ionization potential is the energy, expressed in electron volts, 
necessary to remove an electron from a gas atom making it an ion, or an electrically 
charged gas atom. The properties of the gases are listed in Table 1 (Welding brazing 
and soldering 1998). 

Arc starting and arc stability are greatly influenced by the ionization potentials of 
the components of shielding gases used in the welding process. Gases with a low 


Table 1 Properties of shielding gases (Welding brazing and soldering 1998) 


Gas (symbol) 

Density (g/1) 

Ionization potential (eV) 

Argon (Ar) 

1.784 

15.7 

Carbon dioxide (C0 2 ) 

1.978 

14.4 

Helium (He) 

0.178 

24.5 

Hydrogen (H 2 ) 

0.090 

13.5 

Nitrogen (N 2 ) 

12.5 

14.5 
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ionization potential, such as argon, can dissociate into ions easily. Helium, with its 
significantly higher ionization potential, produces a harder to start, less stable arc. 
Although other factors are involved in sustaining the plasma, the respective energy 
levels required to ionize these gases must be maintained; as a consequence, the arc 
voltage is directly influenced. For equivalent arc lengths and welding currents, the 
voltage obtained with helium is appreciably higher than argon. This translates into 
more available heat input to the base material with helium. 

It is well known that the 100 A helium arc can melt more metal than a 100 A 
argon arc. This is not only by the influence of the ionization potential values but 
also due to the conductivity of the shielding gases. The thermal conductivity of a 
gas is a measure of how well it is able to conduct heat. It influences the radial heat 
loss from the center to the periphery of the arc column as well as heat transfer 
between the plasma and the liquid metal. 

In monatomic gases such as helium and argon, the thermal conductivity is 
controlled by the mass diffusivity of the atoms, which, according to the kinetic 
theory of gases, is proportional to the inverse square root of the mass of the atom. 
Though the atomic weight of the argon is ten times greater than helium, its thermal 
conductivity is only 30 % of helium. So helium will conduct three times as much 
heat across the gaseous layer. A gas with a high thermal conductivity gives a more 
even distribution of heat to the work surface and produces a wider fusion area. So, 
helium gas arc melts more metal than argon arc. The gas mixtures such as 75 % Ar- 
25 % He and 50% Ar- 50 % He, are reported to increase the rate of heat transfer 
over the pure argon, as thermal conductivity values of the gas mixtures is interme- 
diate to the pure gases. The additions of diatomic gases like H 2 and 0 2 with the Ar 
enhance the thermal conductivity due to the dissociation of the gas in the plasma 
and recombination in the gas boundary layer (Eagar 1990). 


it 


V-I" Relationship for Welding Power Sources 


An arc is generated between electrode and workpiece by the supply of low 
voltage and high current from the power source. The power for the arc is 
delivered by different sources like transformer, electric generator, alternator 
driven by a diesel engine or an electric motor, solid-state diodes, silicon- 
controlled rectifiers (SCRs), thyristors, and inverters. Welding power source 
efficiency is determined by its static and dynamic output characteristics. Under 
steady-state conditions, static output characteristics are measured. Dynamic 
output characteristics are determined by measuring very short-duration like 
0.001 s transient variations in the output voltage and current in the arc. Transient 
effects in arc welding arise during (i) arc striking, (ii) change in arc length, (iii) 
the transfer of molten metal to the workpiece, and (iv) AC mode of welding. 
A set of output voltage-ampere (V-I) relationship curves is used to describe 
static characteristics. As for static V-I relationship, different types of power 
source are employed, (i) constant current (CC), (ii) constant voltage (CV), 
and (iii) combined CC and CV. 
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Fig. 4 Constant current V-I 
characteristic curves 



(in Amp) 


Constant-Current (CC) Output Power Sources 

The typical V-I output curve for a constant-current (CC) power source is shown in 
Fig. 4. Such sources are also called as drooping power sources because of down- 
ward slope or droop of the V-I curves. This mode of output is generally preferred 
for manual welding machines like manual metal arc welding (MMAW) or gas 
tungsten arc welding (GTAW). 

The change in current is small with the change in arc voltage; as a result, the 
electrode melting rate would remain fairly constant with a change in arc length 
which is advantageous for the consumable electrode welding process. Such power 
sources are preferred in welding processes using relatively thicker consumable 
electrodes which may sometimes get stubbed to the workpiece or with 
nonconsumable tungsten electrodes where touching of the electrode during arc 
strike may lead to damage of electrode if the changes in current are high. 


Constant-Voltage (CV) Output Power Sources 

The typical V-I curve for a constant-voltage (CV) power source is shown in Fig. 5. 
In CV sources, it does not have true constant voltage regardless of current, but it 
exhibits some downward slope as current increases due to the internal electric 
impedance in the welding circuit. This type of power sources is generally preferred 
for constantly fed continuous electrode processes, such as gas metal arc welding 
(GMAW) and submerged arc welding (SAW), to maintain constant arc length. 

With constant voltage power supply, the arc voltage is established by setting the 
output voltage on the source. The power source shall supply the necessary current to 
melt the electrode at the rate required to maintain the preset voltage or relative arc 
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Fig. 5 Constant voltage V-I 
characteristics curves 



(in Amp) 


length. The speed of the electrode drive is used to control the average welding 
current. The use of such power source in conjunction with a constant electrode wire 
feed results in a self-regulating or self-adjusting arc length system. Due to some 
internal or external fluctuation, if the change in welding current occurs, it will 
automatically increase or decrease the electrode melting rate to regain the desired 
arc length. 


Combined CC and CV Power Source 

A single power source is developed that can provide either constant-voltage or 
constant-current output. Such power supplies are advantageous in that they have 
utility for a variety of processes and are actually a combination of the straight CV or 
CC types. As shown in Fig. 6, the higher-voltage region of the V-I curve is 
essentially constant current, while below a certain threshold voltage, the curve 
switches to a constant voltage type. Such a power supply is useful for manual metal 
arc welding (MMAW) to assist in starting and avoid electrode sticking or stubbing 
in the weld pool should the welder use to short an arc length (Messier 2004; 
Welding brazing and soldering 1998; Nandkarni 2005). 


High Precision Arc Welding Processes 

The main feature of high precision arc welding process is tight operational toler- 
ance, in order to maintain the product functionality and quality of very small 
components during j oining . The characteristic features of conventional arc welding 
processes such as gas tungsten arc welding (TIG), metal inert gas welding (MIG), 
and plasma arc welding (PAW) are high heat input, larger heat-affected zone 
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Fig. 6 Combined V-I 
characteristics curve 



(HAZ), larger weld bead size, and low precision of welding. Applications such as 
bio-implants, special hermetic sealings, electronic components, mould and tool 
buildup and repair, precision electric assemblies, and many other applications 
where precision miniature joining is required need high precision welding tech- 
niques such as micro arc, micro laser, and electron beam welding. These high 
precision welding operations often need suitable visualization tools such as micro- 
scopes and micro manipulation tools such as fully or semi-robotic instruments to 
facilitate precision assembling, heating, and filler additions during operation. Min- 
iature versions of conventional TIG, MIG, and PAW techniques are developed for 
precision arc welding with ultralow energy input. Dies, tools, and components with 
close dimensional tolerances need ultralow energy input as excessive heat can 
cause distortion after the welding. Furthermore, with ultralow energy input, weld 
bead size can be controlled to close tolerances with the final component requiring 
minimal machining after welding. Special torches are developed, which can be 
operated manually or mechanically to perform heating and metal deposition at the 
joint location. Ultralow and highly concentrated heat input is employed to join the 
metallic parts with minimal HAZ without any damage to the surrounding material 
or components. Apart from joining complex assemblies, repair and buildup of 
expensive high alloy steel molds, which are damaged and worn out, can be carried 
out using these techniques. 

Tungsten inert gas welding (TIG), also known as gas tungsten arc welding 
(GTAW), is a nonconsumable arc welding technique, in which an electric arc 
between a nonconsumable tungsten electrode and workpiece is used as heating 
source. Filler is fed in the form of a wire into the weld pool, though autogenous TIG 
welding without fillers is also employed more frequently. A constant flow of 
shielding gas (Ar 2 or He) through the torch protects molten metal from oxidation 
during welding. Though micro TIG and conventional TIG welding processes work 
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Fig. 7 Schematic diagram showing differences between TIG and MIG welding 


on the same principle, there are minor differences in terms of amperage and filler 
wire size. In micro TIG welding, current in the range of 5-50 A and filler wire with 
0. 1-0.5 mm diameter are employed, in contrast to 50-300 A current and 1.5-3 mm 
diameter wire in conventional TIG welding. Furthermore, micro TIG uses special 
microscopic setup and for strict arc gap maintenance due to smaller currents that are 
used during welding. Torch manipulation can be both manual or with a robotic arm 
depending on the precision and application requirements. Typical micro TIG 
welding systems come with automated pre- and post-weld preparation tools along 
with the power supply and electronically controlled torch manipulation setup. 

In metal inert gas welding (MIG), also known as gas metal arc welding 
(GMAW), a consumable electrode is continuously fed into the arc as filler metal. 
A long filler wire spool allows continues welding without much interruption and is 
best suited for manual and robotic welding. Micro-MIG welding equipment also 
employs setups similar to the micro-TIG welding except filler wire acts as electrode 
compared to the inert tungsten electrode in TIG torch. Figure 7 shows the schematic 
diagram showing differences between TIG and MIG welding. 

Plasma arc welding (PAW) typically uses two types of torch designs, viz., 
transferred arc and non-transferred arc torches. In transferred arc PAW, arc is 
generated between one electrode inside the torch and the workpiece, whereas in 
non-transferred arc torch PAW, arc is generated between the two electrodes 
placed inside the torch. In both the cases, electrodes are nonconsumable. As 
non-transferred PAW provides flexibility to maintain a stable arc without com- 
plications such as strict arc-length maintenance, it is preferred for the micro PAW 
torch design. In the plasma non-transferred arc process, arc is maintained between 
the tungsten electrode and copper electrode which is also a part of the torch 
assembly. Superior energy concentration, excellent penetration, greater arc sta- 
bility, and good operational flexibility are the advantages of PAW over TIG 
welding. Micro PAW uses currents below 20 A, which makes it suitable for 
welding and repair of components where precision is of utmost importance. 
Machined parts can be assembled with minimal or no post-weld machining 
requirements due to high heat efficiency and ultralow heat input in micro-plasma 
welding. Very thin sheets and precision assemblies can be joined without any 
distortion and dimensional damage. Micro-plasma welding can be employed to 
weld materials such as 316 L and Ti-6Al-4 V without much microstructural 
damage (Tovar et al. 2011). 
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Fig. 8 Photograph showing effect of high-frequency pulsing on arc constriction: (a) 20,000 Hz 
pulse frequency and (b) conventional inverter DC at 85 Amp average current (Image courtesy: 
VBC Instrument Engineering Asia Pte Ltd) 

High-Frequency Pulsing 

One of the main requirements of micro arc welding is a high energy concentration 
heat source with low heat input. Welding jobs carried out with low heat input and 
high energy concentration heat source can provide benefits such as low heat- 
affected zone (HAZ), faster heating and cooling cycles, and minimal workpiece 
distortion. The gas tungsten constricted arc welding (GTCAW) was developed by 
modifying conventional GTAW. Arc wave form is modulated using very high- 
frequency (~20,000 Hz) pulsing which constricts the arc to get high energy 
concentration and low overall heat input for the same deposition rate compared to 
conventional GTAW. Magnetic field generated around the arc due to high- 
frequency modulated current wave form constricts the arc giving highly concen- 
trated energy to make precise welds. Titanium alloys, which are highly prone to 
oxidation during welding, can be welded outside the inert chamber without trailing 
gas shielding due to very high heating and cooling rates. Figure 8 shows the 
difference between the arc shapes between GTCAW and GTAW at 85 Amp 
average current. 


Micro Arc Welding for Repairs 

Apart from joining metallic parts with high precision and less HAZ in a regular 
production cycle, micro arc welding techniques are also frequently employed in 
repair welding of wom-out expensive tools, molds, structures, and machine com- 
ponents. Restoration of component dimensions to specific tolerance by repair 
welding not only saves cost of replacement but also saves the time required to 
procure or produce new components for replacement, which can substantially 
increase the cost of plant maintenance. Pre -preparation of weld area and filler 
selection play a major role in the quality of the final component during repair 
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Fig. 9 Photograph showing 
worn-out component repaired 
by micro TIG welding (Image 
courtesy: JMF Precision 
Welding, Dacula, GA 30019) 



welding. Corrosion debris, other nonmetallic materials present at the weld area, and 
other oils or greasy deposits need to be thoroughly cleaned. Chemical cleaning, 
heating in order to evaporate organic solvents, and mechanical cleaning are fre- 
quently employed as a part of pre-preparation process during repair welding. 
Aluminum alloys, in particular, are very sensitive to the presence of oxide deposits, 
organic material contamination, and moisture, as it can result in porosity in the 
deposited material. Most of the times, these components being repaired are not 
readily weldable and preheating is often required before welding. However, met- 
allurgical changes due to preheating also need to be kept in mind before deciding 
the preheating temperature. Filler selection is another major challenge in the repair 
welding as deposits made from fillers with exactly the same composition of the 
component might not yield mechanical and chemical properties similar to the base 
metal. Cast micro structure and micro-segregation in the deposited metal often gives 
inferior properties compared to the component being repaired. To weld AA6082 
aluminum alloys, filler materials with the same composition are never used; instead, 
AA4043 or A5356 fillers are used. Figure 9 shows the worn-out component 
repaired by micro TIG welding. 


Freeform Fabrication or 3D Micro Welding 

Growing technological needs demand development of advanced materials and 
components with specific high-end properties. However, there are limitations in 
the production of components that are made from intermetallic compounds, refrac- 
tory metals, and light metals due to high material costs and other production 
difficulties. As a result, several techniques, such as laser-engineered net shaping 
(LENS), metal injection molding (MIM), stereo-lithography (SLA), fused deposi- 
tion modeling (FDM) selective laser sintering (SLS), etc., also known as solid 
freedom fabrication or rapid prototyping, are developed to produce components in 
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Fig. 10 (a) Schematic diagram describing principle of 3 D-micro welding for freedom fabrication; 
(b) image of 3D freedom fabricated model built from Ti-Al-based intermetallic alloy 


near net shape from metals, ceramics, and polymers (Terakubo et al. 2007; Horii 
et al. 2008). Freedom fabrication process uses computer control from a computer- 
aided design and manufacturing (CAD/CAM) system to produce 3 -dimensional 
components without molds. In this method, material is deposited/printed according 
to a computer design to produce net-shaped objects. Material that is being deposited 
can be in powder or wire form, and a heat source is used to form a melt pool. These 
freedom fabrication techniques are better suited to produce functionally graded 
components with customized properties at various locations of a single component. 
All these methods are popularly known as additive manufacturing processes and 
more commonly called “3D printing.” 

A welding technique to produce near-net-shaped components as one of the 
methods for freedom fabrication has been developed by many researchers (Choi 
et al. 2001; Zhang et al. 2002; Ouyang et al. 2002). A combination of micro TIG 
welding and layered manufacturing method was developed as one of the freedom 
fabrication methods that uses welding arc as heat source (Horii et al. 2007). 

A fused micro metal bead is formed on a substrate from a thin metal filler wire 
using pulsed TIG micro arc. Subsequent metal beads will be deposited layer by 
layer using CAD/CAM system to build the required 3D object (Fig. 10a). Metal 
alloys, refractory metals, intermetallic s, etc., can be successfully deposited using 
TIG micro welding. A high level of dimensional accuracy can be obtained by 
controlling arc current, pulse frequency, shielding gas selection, and thickness of 
the filler wire. An example of 3D freedom-fabricated model built from Ti-Al-based 
intermetallic s alloy is shown in Fig. 10b. 


High Productivity Arc Welding Process 

Cladding of metallic parts by fusion techniques is widely used in the industry to 
improve surface properties of new components and to repair worn-out parts. In 
order to use material efficiently, the substrate material can be normally less 
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expensive low alloy steels; while the working surfaces, where sufficient wear 
and/or corrosion resistance is required, can be cladded with high alloyed material 
to withstand specific working conditions. This is a technically viable and econom- 
ically beneficial solution for many industrial applications. 

Fusion techniques are used to develop a strong metallurgical bonding between 
the coating and the substrate, in contrast to the less strong mechanical bonding in 
thermal spraying processes. Coatings developed by thermal spraying processes can 
display good oxidation and corrosion resistance with a limited wear and impact 
resistance. Due to the strong metallurgical bonding between coating and the 
substrate, coatings developed by fusion techniques can also perform well under 
wear and impact loading condition along with corrosion and oxidation. Both arc 
and beam welding techniques can be employed to accomplish fusion cladding. Arc 
welding processes such as manual metal arc (MMA), metal inert gas (MIG), 
submerged arc (SA), gas tungsten arc (GTA), and plasma arc welding (PAW) are 
widely used in various cladding applications (Miami 1991; Lucas 1994, 1997; 
Anderson 1985; Liburdi et al. 1989; Lowden et al. 1991; Lambert and Gilston 
1990; Alexandrov et al. 1993; Colussi et al. 1994; Koshy 1985; Anon 1997; Tusek 
1996). Among these, nonconsumable electrode arc processes, i.e., GTA and PAW, 
produce superior quality welds due to their inherent characteristics such as high arc 
stability, spatter free, precise control of welding variables, and inert gas shielding. 
For example, GTA cladding is widely used in precision built-up applications such 
as aircraft engine components where high quality is critical (Anderson 1985; 
Liburdi et al. 1989; Lowden et al. 1991). Although GTA and plasma cladding can 
produce excellent overlay with a variety of alloy materials, their deposition rates 
are low compared with consumable electrode arc processes such as MIG and 
SA. Therefore, modifications of nonconsumable electrode arc processes to increase 
their deposition rates are of continuous interest for many industrial applications 
where both high quality and high productivity are required. Two approaches will be 
commonly used. First, novel twin-wire TIG cladding techniques in which two wires 
are fed simultaneously into the weld pool. Secondly, plasma cladding, a high- 
frequency generated non-transferred pilot arc is initiated between a tungsten cath- 
ode and water-cooled copper anode. 


Twin-Wire Gas Tungsten Arc Cladding 

The selection of the twin wire in GTA cladding has several advantages when 
compared to single wire (Sun et al. 1999). (1) Arc energy can be utilized more 
efficiently by melting the two wires under the conical shape of the gas tungsten arc. 
(2) Two wires will consume more heat from the arc column when compared to 
single wire resulted in less heat input into the substrate metal, a smaller heat- 
affected zone (HAZ), less distortion, and high deposition rates. 

Figure 11 shows the schematic of GTA twin- wire cladding in trailing and 
leading feed mode (Sun et al. 1999). In twin-wire GTA cladding, two wires are 
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Welding Direction Welding Direction 

Trailing feed Leading feed 


Fig. 11 


Schematic illustration of GTA twin- wire cladding in trailing and leading feed mode 


Table 2 Welding conditions in single- and twin-wire GTA cladding 



Welding 
current (A) 

Voltage 

(V) 

Travel speed 
(mm/min) 

Oscillation 
amplitude (mm) 

Shielding gas flow 
rate (1/min) 

Single 

wire 

200 

14.5 

35 

15 

20 

250 

17 

35 

Each side 


300 

17 

35 

Twin 

wire 

200 

14.5 

35 

15 

20 

250 

17 

35 

Each side 

300 

17 

35 


simultaneously fed parallel to the weld pool by two separate wire feeders. In trailing 
feed, the wire is fed directly into the weld pool, while in leading feed, the wire is 
often fed to the arc before it reaches the weld pool. This results in differences in the 
melting efficiency. Leading feed melts more wire than trailing feed. To maximize 
the wire feed rate, the leading feed technique is preferred, because the arc has 
higher temperature than the weld pool. Sun et al. (1999) compared the deposition 
rates between the single- and twin- wire processes at different welding currents. 
Table 2 lists the major welding parameters. 

They observed that the weld bead thickness is in the range of 2.0-3. 0 mm and 
3. 0-4. 5 mm for single- and twin- wire samples, respectively. Weld bead volume in 
the twin- wire sample is about 50 % larger, which accounts for the wire feed rate 
difference (about 60 %). Detailed measurements showed that the HAZ size in the 
twin wire sample is about 35 % less than that of the single wire sample (Sun 

et al. 1999). 

Figure 12 compares the deposition rate between single- and twin- wire GTA 
cladding using different welding current levels. The increase in deposition rate is 
50 %, 90 %, and 70 % at welding current levels of 200 A, 250 A, and 300 A, 
respectively. This could represent a significant cost saving for large-scale applica- 
tions of the twin- wire technique. 
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Fig. 12 Comparison of 
deposition rates between 
single- and twin- wire GTA 
cladding 
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Table 3 Classification of PTA 



Micro-PTA 

Standard PTA 

High-power PTA 

Current 

range 

1-50 A 

50-300 A 

300-500 A 

Deposition 
rate (max.) 

1 kg/h 

7 kg/h 

1 5 kg/h 

Dilution 

5 % 

5 % 

<7 % 

Application 

areas 

Surfacing on thin substrates 
in precision components 

Hard-facing on various 
industrial applications 

Large area overlay 
with high 
productivity 

Thickness 

0.15-0.5 mm 

0.5-5 mm 

>5 mm 


Plasma Cladding 

PAW represents an advance over GTAW due to its greater energy concentration, 
deeper and narrower penetration achievement, and greater arc stability. Micro- 
plasma arc welding (MPAW) is a variation of the PAW process and uses intensities 
lower than 15 A. A micro-plasma-transferred arc (PTA) is a low heat input 
surfacing process. With micro-PTA, it is possible to produce coatings for turbine 
blades or narrow rims with current in the range of 10-30 A (Sun and Huang 1998). 
Table 3 shows the classification of PTA processes. 

Figure 13 shows a comparison of wire feed rate for TIG and plasma cladding at 
the same welding current level (250 A). It is clearly seen that plasma cladding has a 
much higher deposition rate than TIG cladding in both cold-wire and hot-wire 
modes. The difference between plasma and TIG cladding can be at least 1 .5 times in 
terms of wire feed rate. This is attributed to the highly concentrated arc energy in 
the plasma process. Therefore, it can be concluded that plasma cladding has a 
higher deposition rate relative to TIG cladding. 
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Fig. 13 Comparison 
between plasma and TIG 
cladding at various hot-wire 
heating currents (welding 
current: 250 A) 
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Laser-Arc Hybrid Welding (LAHW) 

Laser-arc hybrid welding (LAHW) process has multiple advantages such as deeper 
penetration welds with minimal heat input, distortion, and weld defects such as 
porosity (Abe and Hayashi 2002). Laser beam aids in deeper penetration and 
stabilization of the arc, whereas arc provides energy needed for the welding (Kah 
2012). Deeper weld penetration due to laser could aid in improved productivity by 
reducing the number of passes. Though laser-TIG hybrid welding was developed 
initially at laboratory scale, laser-MIG hybrid welding was the first process to be 
commercialized. The important process parameters that influence the welding 
process and weld quality are type of laser, laser power, arc power, laser-arc 
configuration, shielding gas, and filler. 

Less capital cost and ease of operation have made the arc welding process more 
popular in production as well as job scale industrial applications. However, arc 
welding techniques suffer from the lack of weld penetration, high heat input when 
deep penetration is needed, and arc instability in some cases. Most popularly used 
industrial and laboratory laser welding sources are C0 2 and Nd:YAG lasers. Nd: 
YAG lasers have special advantages such as less reflection from the shining 
workpieces due to short wavelength of the laser and high level of flexibility in 
choice of shielding gas mixtures compared to C0 2 lasers, where pure He or gas 
mixtures with high He content are needed. Weld metal microstructures and weld 
pool characteristics can be controlled by selecting appropriate shielding gas mix- 
tures which is one of the important advantages of Nd:YAG lasers. High-power laser 
sources for welding application are highly expensive resulting in higher invest- 
ments for the welding setup. Furthermore, laser welds often suffer from pinholes 
when used to weld low boiling point materials such as Mg alloys due to faster 
cooling rates. Apart from that, conventional laser welding process also suffers from 
cracks in thick sheet welding due to fast thermal cycles. 

LAHW has multiple advantages compared to stand-alone arc or laser welding 
processes. When laser and arc heat sources are used simultaneously during welding, 
it is reported that laser aids in stabilization of the arc by leading arc in to the hot 
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Fig. 14 (a) Typical LAHW torch configuration showing laser and arc torch; (b) Schematic of 
weld bead shape showing arc and laser zones 


keyhole created by the laser. Furthermore, metal ions from the plasma created by 
laser enter the arc column which can narrow the arc and lead the arc in to the 
keyhole (Kah 2012). Even a small 200 W laser can improve the arc stability and 
control in laser-pulsed MIG hybrid welding process by locking the cathode location 
(Shinn et al. 2005). Both consumable and nonconsumable arc welding torches 
(TIG, MIG, and PAW) can be used in LAHW process. Various laser and torch 
configuration are available to suit different weld conditions. Typical LAHW con- 
figuration is shown in Fig. 14a, where laser perpendicular to the workpiece and 
the arc torch is inclined between the laser and the workpiece. A schematic of the 
LAHW-weld bead shape is shown in Fig. 14b. As illustrated in the figure, the weld 
bead can be divided into two parts, viz., top wider portion of bead called as “arc 
zone” and the bottom narrow portion called as “laser zone.” 

The wider upper part suggests that the arc energy mainly acts on the top portion 
and laser energy acts through the keyhole till the root of the weld bead. The volume 
ratio of arc and laser zones can be varied by changing the arc and laser energy ratio 
supplied during welding. Higher arc versus laser power ratio increases the volume 
of arc zone and results in coarse grained pearlitic micro structure in steels where as 
lower ratios results in martensitic micro structures. LAHW process is more suitable 
for higher duty cycle welding processes such as ship building and pressure vessel 
manufacturing and less suited for small job works and complex assemblies. LAHW 
process gives higher power densities and high deposition rates with low heat input 
which can yield substantial energy savings. 


Summary 

Miniature versions of conventional TIG, MIG, and PAW techniques are developed 
for precession arc welding with ultralow energy input. Using ultralow energy input 
and computer-controlled special torches, special dies, tools, and components with 
close dimensional tolerances can be successfully welded. The gas tungsten 
constricted arc welding (GTCAW) was developed by modifying conventional 
GTAW in which arc wave form is modulated using very high frequency 
(~20,000 Hz) pulsing to constrict the arc. This can result in high energy 
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concentration and low overall heat input for the same deposition rate compared to 
conventional GTAW. Micro arc welding technique can be successfully employed 
for mold and tool repair and save cost of replacing the expensive machine parts. 
Freedom fabrication process uses computer control from a computer-aided design 
and manufacturing (CAD/CAM) system to produce 3 -dimensional components 
without molds. In this method, material is deposited/printed according to a com- 
puter design to produce net- shaped objects. 

It has been demonstrated that high productivity arc welding processes increase 
the deposition rates by more than 50 % when compared to conventional welding 
processes. For example, the twin- wire cladding process produces better quality 
cladding with less dilution, smaller HAZ, and less distortion than the single wire 
GTA process. Micro-PTA can produce not only the required deposit properties, but 
also clad beads with sizes suitable for various applications. Laser-arc hybrid 
welding (LAHW) process has multiple advantages such as deeper penetration 
welds with minimal heat input, distortion, and weld defects such as porosity. 
Both laser and arc power source are simultaneously applied at the welding area to 
derive advantages of both the processes in LAHW process. 
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Abstract 

Today, high-energy beams, such as laser and electron beams, have been increas- 
ingly adopted in the industry for welding, cladding, and additive manufacturing. 
Due to the high-energy density and small beam size, welding and additive 
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manufacturing using high-energy beams show the characteristics of deep pene- 
tration, low heat input, small heat-affected zone, low thermal distortion, good 
dimensional accuracy and integrity, as well as ease of automation. Compared 
with other types of heat sources (i.e., arc and plasma), high-energy beams are 
more suitable for the applications requiring a precise control of heat energy for 
difficult-to-process materials. Other than the abovementioned characteristics, 
high-energy beam welding process belongs to a contactless process which pro- 
vides more flexibility in new design and manufacturing. With these superior 
characteristics, high-energy beam welding process has been used in a wide range 
of industrial sectors (i.e., aerospace, automotive, oil and gas, shipyard, medical 
devices, etc.) and is expected to keep playing an important role in welding 
industry in the future. 


Introduction 

The purpose of this chapter is to introduce the fundamentals of high-energy beam 
(laser and electron beams) welding and additive manufacturing, as well as the 
industrial applications. In the first two sections of this chapter, fundamentals of 
laser welding and electron beam welding are introduced, respectively. The associ- 
ated parameters of the welding processes, such as welding system, processing 
parameters, and material properties of welding samples, are discussed. Further- 
more, general welding defects which can cause quality deterioration are mentioned 
as well. Following the welding sections, fundamental and industrial applications of 
laser cladding technology are presented in the third section of this chapter. In this 
section, laser cladding with wire and powder feeding are introduced. General 
process variables and the ranges of parameters applied in laser cladding are 
highlighted. Approaches of calculating and controlling the dilution rate in laser 
cladding are also mentioned in this section. The last part of this chapter focuses on 
the process of selective laser sintering. The fundamental of selective laser sintering 
is introduced. Sintering mechanisms are discussed in detail. Some industrial exam- 
ples of selective laser sintering are provided in this section as well. 


Laser Welding 

Laser welding is increasingly used in a wide range of industrial applications due to 
its outstanding characteristics such as high-energy density, high precision, high 
speed, high flexibility for welding three-dimensional components, as well as pro- 
ducing a weld with a high aspect ratio of weld depth to width. In this section, 
general information in laser welding will be provided including fundamentals of 
physical mechanisms occurring in laser welding and key parameters generally used 
and its advantages for industry. 
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Introduction 

In laser welding, a laser beam is used as a heat source. Compared with other fusion 
welding technologies, the energy density used in laser welding is similar to electron 
beam welding but higher than gas welding and arc welding, as shown in Fig. 1. In 
this section, the content will cover physical fundaments of laser welding, influenced 
parameters in laser welding, its advantages and disadvantages, and potential indus- 
trial applications. 


Physical Mechanisms in Laser Welding 

The principle of laser welding is using a laser beam as the heat source to cause 
heating (melting) and cooling (solidification) processes in welding materials. The 
interaction between a laser beam and a welding material is schematically illustrated 
in Fig. 2. Its profile of thermal cycle in welding materials depends on the amount of 
heat input and heating/cooling rates applied in laser welding. Metallurgical effects 
of welding materials will occur and affect welding quality due to phase trans- 
formations inducing from the heating and cooling processes. Figure 3 shows the 
relationship between thermal profiles and metallurgical effects in laser welding of 
structural steel. 

Laser welding can be divided into conduction mode welding and keyhole mode 
welding, as shown in Fig. 4. Generally, they can be easily distinguished according 
to their aspect ratio of weld depth to width or the peak temperature occurring in 
laser welding (Martukanitz 2005). In conduction mode welding, Fig. 4a, the energy 
density of laser beam is below 10 W/cnr and process temperatures in the melting 
pool are between its melting point and boiling point. Besides, its aspect ratio of 


Fig. 1 Comparison of energy 
density between different 
fusion welding technologies 
(high-energy beam welding 
includes electron beam 
welding and laser welding) 
(Kou 2002) 


Increasing 
damage to 
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Fig. 2 Illustration of the 
interaction between laser 
beam and welding material in 
laser welding (David and 
Debroy 1992) 
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Fig. 3 Relationship between thermal profiles and phase transformations in laser welding of 
structure steel (Bruggemann et al. 2000) 


weld depth to width usually is about 0.5. In the contrast, a keyhole will be formed 
during laser keyhole welding. It is because the energy density of laser beam (more 

f\ O 

than 10 W/cm ) used in keyhole mode welding is higher enough for vaporizing 
welding materials and therefore forms a keyhole and plasma, as shown in Fig. 4b. 
A higher aspect ratio of weld depth to width (higher than 0.5) will be obtained in 
keyhole mode welding because multi-reflections of laser beam caused on the 
keyhole walls can enhance energy absorption. The peak temperature occurring in 
this mode usually is higher than the boiling point of welding material. 

Regarding thermal profiles in welding materials, three different metallurgical 
zones generally are defined and named as fusion zone (FZ), heat-affected zone 
(HAZ), and base material (BM), as indicated in Fig. 2. During laser welding, the 
peak temperature of welding material happens in FZ and usually is higher than its 
melting point or even boiling point. A keyhole will be formed and welding material 
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Fig. 4 Type of laser welding: (a) conduction mode welding and (b) keyhole mode welding 

(BOC 2009) 


will be vaporized if the peak temperature of welding material is higher than its 
boiling point. Normally, temperatures in HAZ are between melting point of 
welding materials and room temperature which can induce phase transformations 
resulting in changes of its micro structure and mechanical properties. 


Influenced Parameters 

Parameters in laser welding can be classified in laser system parameters, welding 
material parameters, and processing parameters, as shown in Fig. 5. Each parameter 
plays a role in determining welding performance and welding quality. Among, laser 
system parameters are not able to be changed once a laser system has been chosen 
before laser welding. Based on material parameters used in developing laser 
welding process, processing parameters will be varied in a range to optimize the 
process. In the following sections, the role of each parameter in laser welding will 
be briefly introduced. 

Laser System Parameters 

Maximum output of laser power is an index to predict a laser system’s capability 
used in laser processes. A laser system with a higher maximum output of laser 
power provides a wide working window in terms of material type, thickness of 
material to weld, as well as a higher welding speed that can be used in laser 
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Fig. 5 Influenced parameters in laser welding 

welding. Except maximum laser power output, laser beam quality and its wave- 
length are other important technical parameters which determine where a laser 
system can be applied and what welding quality can be achieved. Generally 
speaking, a better laser beam quality provides a smaller laser beam focus diameter 
resulting in a higher energy density and a keyhole mode welding. In the aspect of 
laser beam wavelength, it also affects welding performance. The relationship 
between laser beam wavelength, laser energy reflectivity, and material type is 
shown in Fig. 6. 

Laser power delivery method can be classified into continuous wave (CW) and 
pulsed wave (PW) modes. In laser CW mode welding, laser power used is kept at 
the same value. Adversely, laser power can be varied with time during laser PW 
welding depending on the settings of type of pulsed wave, average laser power, 
peak power, pulse duration and frequency, etc. Laser PW welding usually used in 
applications required a precise controlling on heat input and heating/cooling rates. 
However, laser CW welding is more widely used in industry. Compared with PW 
welding, the advantages of laser CW welding are its low investing cost, capability 
of performing a higher-speed welding, and better weld appearance. 

Welding Material Properties 

In general, properties of welding material can be considered as factors to determine 
the settings of processing parameters. This part will be discussed in the next section. 
Before laser welding is carried out, type of welding material and its thickness are 
the first parameters to be checked before setting other processing parameters. For 
example, thickness of welding material will affect the setting of laser power, 
welding speed, or even the number of welding pass. Thicker materials may need 
a multi-pass welding or keyhole mode welding to achieve a penetration weld 
compared with welding of a thinner material, such as a foil, by using a conduction 
mode welding. 
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Fig. 6 Relationship between laser beam wavelength, laser energy absorption, and material type 

(Stollhof 2009) 

Regarding the type of welding material, its optical, physical, and mechanical 
properties have to be considered. Optical properties of welding material, such as 
laser light reflectivity, will influence the process stability and welding efficiency. 
Physical and chemical properties of welding material usually play roles in deter- 
mining laser welding performance and welding quality. With these reasons, thermal 
conductivity, density, diffusivity, and absorption of a laser beam are usually 
considered since they can affect heat distributions in welding materials. Besides, 
a material with a high thermal conductivity or a low absorption will decrease laser 
welding efficiency. At the same time, welding efficiency can be affected by 
material absorption of the laser beam. Absorption depends on material type and 
laser beam wavelength, as shown in Fig. 6. For example, A1 and Cu alloys have 
been recognized as lower absorptive materials compared with other metallic mate- 
rials when laser beam wavelength is beyond the infrared range. 

Processing Parameters 

In laser welding, processing parameters generally can be laser power, welding 
speed, focal point position of the laser beam, type of shielding gas and its setup, 
etc. Laser power and welding speed are two main processing parameters which can 
affect the amount of heat input delivered to welding materials. Welding perfor- 
mances, such as weld geometry, weld defects, and macro/micro segregations in 
welds, can be different according to the total amount of heat energy that has been 
absorbed and transferred in welding materials. The amount of heat input applied in 




624 


C. Hui-Chi et al. 


a 



Laser beam 



'via 


tical lens 


# 


\ J 


\Mr\ r l i nan ! orQ 

* * -(j r - - - - 


De -focused 


b Laser beam 


A 


# 


\ 


i 


'W 


tical lens 



tical lens 


I 


\ r 


Working piece 


Focused 


C 


Laser beam 


A 

f \ 


t 


t 


/ 


Optical lens 




'ismm&i 


V 




Working piece 


De-focused 


Focal plan 



Fig. 7 Illustration of different focal point positions in laser welding: (a) laser beam focuses above 
a welding material (defocused); (b) laser beam focuses on a welding material (focused); (c) laser 
beam focuses below a welding material (defocused) 


laser welding is changed by increasing laser power or decreasing welding speed 
when other parameters are kept. If the thickness of welding material is given, laser 
power will be increased with an increase of welding speed. 

Instead of varying laser power or welding speed, locating the focal point of laser 
beam at different positions is another way to change the amount of heat input 
applied to welding materials. It is because energy density and beam size of the laser 
beam irradiated on the top surface of welding materials will be different when the 
focal point of the laser beam is located at different positions (i.e., above, on, and 
below the top surface of welding materials), as illustrated in Fig. 7. 

The function of shielding gas used in laser welding is to protect melting pools 
from oxidation, suppress the plasma formed above the melting pool as well as avoid 
optical damage of lens from spatter during welding. Table 1 summarizes physical 
properties of different shielding gases commonly used in laser welding. According 
to the materials to be welded, choice of shielding gas, flow rate of shielding gas, 
diameter of nozzle, and setup of shielding gas device (Fig. 8) have to be optimized 
for achieving good welds. Currently, helium, argon, and nitrogen gases are fre- 
quently used in laser welding. 


Laser Welding Configurations 

Since laser welding is a contactless welding process, it is suitable for welding 
materials in diverse configurations, as illustrated in Fig. 9. Except for parameters 
mentioned in previous sections, individual parameters have to be paid attention 
when welding different configurations. In laser lap welding, a gap between the top 
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Table 1 Physical properties of different shielding gases (Lyttle 1993) 


Gas 

Chemical 

symbol 

Molecular 

weight 

(g/mol) 

Specific gravity with 
respect to air at 1 atm and 

0°C 

Density 

(g/L) 

Ionization 

potential 

(eV) 

Argon 

Ar 

39.95 

1.38 

1.75 

15.7 

Carbon 

dioxide 

C0 2 

44.01 

1.53 

1.98 

14.4 

Helium 

He 

4.00 

0.14 

0.18 

24.5 

Nitrogen 

n 2 

28.01 

0.97 

1.25 

14.5 

Oxygen 

0 2 

32.00 

1.11 

1.43 

13.2 


a Coaxial nozzle 

laser beam 


Laser 



b Ring nozzle 

laser beam 





laser beam 


Laser 



Welding gas 


30-50 


Weld 

bead 


Fig. 8 Setup of shielding gas device used in laser welding: (a) coaxial nozzle; (b) ring nozzle; 
(c) side tube (BOC 2009) 


and bottom plates will affect welding quality. For example, applying an appropriate 
gap between the top and bottom plates reduced the formation of porosity in the 
welding and therefore improved the weld quality in laser lap welding of Zn-coated 
steels (Amo et al. 1996; Graham et al. 1996; Li et al. 2007). 


Laser Welding Defects 

In laser welding, heating and cooling rates will cause variation of chemical com- 
positions, stress distributions, and phase transformations in FZ and HAZ resulting 
in the formation of defects in welds. Those defects can be cracking, porosity, loss of 
alloy elements, and micro segregation. The formation of cracking usually occurs 
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Fig. 9 Laser welding type: (a) butt welding; (b) fillet welding; (c) lap welding; (d) spot welding; 
(e) flange welding; (f) edge welding; (g) T-joint welding; (h) flare welding; (i) comer welding; 
(j) kissing welding (Steen and Mazumder 2010) 


when the range of liquidus temperatures of welding material is wide or micro 
segregation. When a material has a wide range of liquidus temperatures, solidifi- 
cation at different locations of a weld may happen at different time during laser 
welding. This phenomenon will induce uneven stress distributions across a weld 
and fractures resulting to initiations of failure under fatigue environments. 

Weld porosity is another common issue in laser welding, especially for welding 
of light metals, such as aluminum alloys and titanium alloys. Mechanisms to form 
weld porosity can be due to quick solidification rate in FZ and a rapid closure of the 
keyhole in laser welding, as illustrated in Fig. 10a, b, respectively. In Fig. 10a, the 
shielding gas used or air are unable to leave the keyhole before FZ is solidified. It is 
due to the nature of quick heating and cooling rates in laser welding. This type of 
porosity usually is founded in the middle of welds. Another reason which can cause 
porosity is due to an unstable melting pool during laser welding. The front wall of a 
keyhole easily falls, particularly for those materials that have low surface tension 
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a Laser beam 


Molten 



b Laser beam 



Fig. 10 Mechanisms to form weld porosity: (a) quick solidification rate; (b) rapid closure of the 
keyhole (Kawahito et al. 2006) 


and viscosity and therefore form a void in the tip of weld, as shown in Fig. 10b. 
Normally, weld porosity will induce stress concentrations leading to early failures 
of welds. 

In laser keyhole mode welding, molten materials may be brought out from the 
keyhole by vaporized forces, as captured in Fig. 11. General influences of spatters 
are degradations of corrosion resistance and unsuccessful assembly with other 
components after laser welding. An example of spatter formed on the top surface 
of a weld is shown in Fig. 12. Typical remedies for minimizing the formation of 
spatter are optimizing processing parameters (i.e., laser power, welding speed, type 
of shielding gas) and using filler materials. 


Electron Beam Welding 

Similar to laser welding, electron beam welding (EBW) has a high-energy density 
compared with other fusion welding technologies, as shown in Fig. 1. A general 
setup of EBW is illustrated in Fig. 13. In EBW, the kinetic energy will be 
transferred to heat energy when electrons strike the welding materials. With this 

o c 

mechanism, a high- vacuum environment (10 -10 atm) is normally required in 
EBW. It is because any air or gas can affect the electron flow during welding. 
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Fig. 11 Image of a melting behavior captured during laser welding (Katayama and Kawahito 

2009) 


Fig. 12 Formation of spatter 
on the top surface of 
Zn-coated steel weld 
(Milberg and Trautmann 

2009) 



A higher vacuum environment applied in welding, a deeper and narrower weld will 
be produced. Its weld geometry is similar with a weld produced from laser keyhole 
mode welding. Compared with other fusion welding technologies, not only a wide 
range of material thicknesses is able to be welded but also no shielding gas and filler 
material are required in EBW. Today, EBW has been used in the sectors of civil, 
aerospace, nuclear, and electric power plants. 

Influenced parameters in EBW are similar with those in laser welding. They also 
can be classified into system parameters, welding material parameters, and 
processing parameters. Except those mentioned (i.e., the amount of heat input, energy 
density, and welding material properties) in the section “Laser Welding,” the vacuum 
level of a welding chamber is another key parameter in determining its weld quality, 
especially for producing a higher penetration weld. Because of a higher vacuum level 
in the chamber used, a high depth-to- width weld ratio will be produced in EBW. 

Overall, EBW has been developed for decades and plays an important role in 
applications particularly requiring specific welding quality, such as in aeronautic 
and nuclear power industries. It can be foreseen that EBW will continuously be 
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Fig. 13 Schematic 
illustration of electron beam 
welding (Messier 1999) 



applied in these areas due to its advantage of producing a high aspect ratio of the 
weld geometry. 

Laser Cladding 
Introduction 

Laser cladding is a metal coating process in which laser is utilized as a heat source to 
generate a melt pool on the substrate surface and at the same time deposit the additive 
material in the form of powder or wire on to the substrate. As both a thin layer of the 
substrate and the additive material are fully melted, metallurgical bonding can be 
formed with very high bonding strength. Laser cladding can be widely used for 
improving corrosion, erosion, and wear resistance as well as for repair and modifi- 
cation of damaged/wom parts. Electrical arc (MIG and TIG), plasma arc, and laser 
are mostly used heat sources for cladding. Laser cladding shows several advantages 
over other traditional surface cladding technologies such as PTA and TIG. 


Process Principle of Laser Cladding 

Laser cladding can be classified as a two-stage process with pre-placed additive 
material and a one-stage process with in situ fed material in the form of powder or 
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Fig. 14 One- stage laser 
cladding with coaxial powder 
feeding 
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wire. In the one-stage process, the additive materials can be fed into the melt pool 
either laterally or coaxially to the laser beam. Cladding with coaxial powder feeding 
prevails over other methods, because this feeding method guarantees a multidir- 
ectional and more flexible process. The powders with a typical diameter of 20-150 
pm are fed through a coaxial powder nozzle by the carrier gas (e.g., Ar, He) into the 
melt pool. The gas also has the function of a shielding gas to prevent the clad layer 
from oxidation. The laser beam melts a thin layer of the work piece (in the 100 p 
range) and the fed powder simultaneously. Thus, the melted powder and the base 
material are metallurgically bonded with up to 100 % density. Figure 14 shows 
schematically the one-stage laser cladding process with coaxial powder feeding. 
Figure 15 shows schematically the one-stage laser cladding with lateral wire 
feeding (Mok et al. 2008). During laser cladding process, the heat input is limited 
to the local area, which results from the good focusability of the laser beam. As a 
result, a small heat-affected zone (HAZ) and low dilution in the base material can 
be achieved. Typical depth of the HAZ ranges from a few hundred microns to one 
millimeter. 

Dilution is defined as the amount of intermixing of the clad material with base 
material. Low dilution means there is very little of the base material mixed with the 
clad, which means that the properties of the clad layer is very close to the original 
additive material. This ensures that the full functions of the clad can be achieved in 
one single layer. Figure 16 shows schematically the evaluation of the cross section 
of the clad track (Bi et al. 2006). Dimensions of the clad layer, HAZ, and dilution 
zone can be measured in the cross section of the clad. The dilution r\ can be roughly 
calculated with Eq. 1 : 



Hp 

Hc+H d 



Normally the dilution of laser cladding can be controlled in the range of 5-10 %. 
Figure 17 shows examples of laser clad with low and high dilution. 
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Fig. 15 One- stage laser 
cladding with lateral wire 
feeding 
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Fig. 16 Schematic diagram of the cross section of a clad 


Process Parameters 

There are several parameters which govern the laser cladding process. A reasonable 
deposition rate with sound layer quality can be achieved by an optimized set of 
process parameters. For laser cladding with powder, laser power P L , scanning speed 
v v , and powder feeding rate m p are the most important process parameters, which 
determine the quality and geometry of the clad. As for the cladding with wire, the 
main process parameters are laser power P L , scanning speed v v , and powder feeding 
rate m p /wire feeding rate m w . Typical ranges of the process parameters are shown in 
Table 2. 
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Fig. 17 Photos showing the dilution level of laser clad, (a) Low dilution, (b) High dilution 


Table 2 Comparison of the main properties of different high-power lasers 


Process parameter 

Typical range 

Laser power P L 

200-3,000 W (Nd: YAG/diode/hber laser) 

500-10,000 W (C0 2 laser) 

Beam size 

0.5-6 mm (in diameter) 

Power intensity I 

5 * 10 3 -10 5 W/cm 2 

Scanning speed v v 

50-2,000 mm/min 

Powder feeding rate m p /wire feeding rate m w 

1-30 g/min 

Powder carrier gas (shielding gas) V S g 

2-15 L/min Ar 


Materials 

A wide range of materials can be used for direct laser deposition process. Besides 
the normal carbon steels, many other kinds of metals, including tool steels, powder 
metallurgy steels, and Ni-based superalloys, Ti-based superalloys, Cu-based alloys, 
and Al-based alloys, can be used as base material. For the additional materials, 
many kinds of alloys can be used. The commercially available powders and wires 
used for the thermal spraying processes can be utilized in laser cladding. Co-, Ni-, 
Fe-, A1-, Cu-, Ti-based alloys are commonly applied. The powders are more 
commercially available and more flexible to fit for the deposition process than 
wire. However, wire deposition can achieve nearly 100 % material usage, which is 
the main advantage over laser cladding using powder. Laser cladding can be also 
used to generate functionally graded layers. 


Applications 

The applications of laser cladding have been widely explored. It has been success- 
fully introduced into industry for deposition of wear and/or corrosion resistant 
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Fig. 18 Cladding of the cutting blade on a cutting roller, (a) Cutting roller after deposition, 
(b) Cross section, (c) Cutting roller after machinin 


coatings, repair of turbine components, moulds and dies, as well as rapid tooling 
(Leunda et al. 2011; Schubert et al. 1999; Mei et al. 2003; Thivillon et al. 2009; 
Peyre et al. 2008). This process is extremely suitable for the high-mix-low- volume 
production with high value added. The applications can be widely found in nearly 
all the industry sectors, including aerospace, marine and offshore, oil and gas, 
medical, precision engineering and military, etc. 

Example 1 In the paper industry, cutting rollers are applied for cutting and 
punching paper. The cutting blade must have the properties of enough hardness, 
wear resistance, and significant toughness. Normally, it is machined by stock 
removal method using cold work tool steel X165CrMoV12. To reduce the costs 
and manufacturing time, the cutting blade will be deposited with hot work tool steel 
X38CrMoV51 onto normal carbon steel C 45 and machined to the end geometry. 
The deposited cutting blade and the cross section are shown in Fig. 18a, b, 
respectively. The optical examination in cross section proves that the deposited 
layer is free of porosity and crack. The hardness in the layer amounts to 700 HV0.3. 
Figure 18c shows the cutting roller after machining. 





634 


C. Hui-Chi et al. 



Repair in process Repaired aero-engine blade 

Fig. 19 Repair of aeroengine blade, (a) Repair in process, (b) Repaired aeroengine blade 


Example 2 Laser cladding is suitable for applications in which only minimum 
distortion can be tolerated, as well as for materials that are difficult to clad using 
conventional methods, for example, high-temperature-resistant nickel-based alloys 
in gas turbines. Figure 19a shows the process of repairing the edge knife of an 
aeroengine blade made of nickel-based super alloy INI 00. The repaired blade in 
Fig. 19b shows a good dimensional accuracy. 


Selective Laser Sintering 
Introduction 

Selective laser sintering (SLS) is a laser-based 3D additive manufacturing tech- 
nique that builds objects layer by layer from powders using computer-aided design 
(CAD) technology. SLS technique is to some extent similar to other laser fabrica- 
tion processes such as laser engineered net shaping (LENS) (Griffith et al. 2000; 
Keicher et al. 1997), direct metal deposition (DMD) (Dinda et al. 2009), laser metal 
deposition (LMD) (Gasser et al. 2010), etc. All of these techniques incorporate 
features from stereolithography and laser surface engineering, using a CAD model 
to control sequentially thin-layered forming process. But unlike LENS, DMD, and 
LMD, where powders are delivered in a gas stream into the focus of a high-power 
laser beam to create a molten pool, the main difference in the SLS technique is that 
powders are spread from the feed bed across the part bed using a roller. After 
solidification of the melt, a thin solid layer is formed and fused to the previous 
sintered layer. By repeating the process, a nearly full dense structure can be made. 
Schematic of SLS technique is shown in Fig. 20. 

When the laser beam strikes the powder surface, as shown in Fig. 21, the 
laser-powder interaction can be described by the following steps: (i) absorption 
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Fig. 20 Schematic of the 
selective laser sintering setup 
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Fig. 21 Schematic of the 
laser-powder interaction 
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of photons leading to an increase in temperature, (ii) mass transfer or diffusion 
prompts interparticle necking and grain growth, and finally (iii) densification of the 
sintered objects. 

As additional light energy is absorbed by the sintered powder and the temper- 
ature rises beyond the melting point, powder material then undergoes partial or full 
melting. Surrounding particles are dragged into the melt pool to decrease surface 
tension, i.e., reduction of surface energy. As the scanning laser beam proceeds to 
the neighboring powder bed, the melt pool eventually cools down and starts to 
solidify, forming grains on the remelted surface of the previous layer (Das 2003). 
Densification is attained as pores are filled by molten material. Studies have 
demonstrated that SLS can process high- temperature, high-performance materials 
such as Ni- or Ti-based alloys into functional parts. Another advantage of SLS is its 
capability to fabricate complex-shaped objects with minimal post-processing, lead- 
ing to significant reduction in manufacturing and material costs. 
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Laser Sintering Mechanism 


Sintering is a method for fabricating high-density structures from powder materials. 
Laser sintering takes place by laser-induced localized heating as the powder 
absorbs the light energy. Subsequently, heat transfers to the powder bed by con- 
duction and to the surrounding atmosphere by convection/radiation. There are three 
main architectures that are commonly employed for laser sintering: (i) solid-state 
sintering, (ii) melting, and (iii) liquid-phase sintering. Solid-state sintering and 
melting usually apply for a one-component system, although multicomponent 
systems are at times applicable as well. For liquid-phase sintering, two or more 
components are often involved. 

Although the dwell time of the laser beam on individual particle is short, which 
is typically a few milliseconds, full melting of the particle can sometimes occur if 
the temperature exceeds the melting point of the material. Laser parameters and 
substrate conditions should be carefully controlled to prevent particle 
spheroidization or sometimes called balling, a phenomenon where molten powders 
quickly agglomerate and consolidate into a sphere of diameter approximately equal 
to the laser beam size. Balling is caused by poor wetting of the melt on an oxidized 
substrate and can be explained by the surface interfacial energy involved. In order 
to prevent rapid oxidation, the sintering process must be carried out in a vacuum or 
inert gas atmosphere. Another approach to solve the balling problem is to use a 
two-phase system. The powder bed now consists of at least two different powders of 
significantly different melting points. The high-melting-point component forms the 
structural part, and the low-melting-point component acts as the binding interface. 
The laser energy should be high enough to raise the processing temperature 
sufficiently to make the binder material melt and flow by gravitational force 
and/or capillary action in order to fill the gaps between solid particles. 


Applications 

Selective laser sintering is a powerful and versatile technique as a variety of materials 
can be prepared using this method. It is especially well suited to fabricating three- 
dimensional parts aimed for aerospace, automotive, and medical applications. For 
example, aircraft manufacturer European Aeronautic Defense and Space (EADS) 
routinely uses additive manufacturing techniques to fabricate lightweight and reliable 
aircraft parts. As shown in Fig. 22, through innovative design along with SLS 
technique, there is 65 % weight savings for engine cover door hinges on the Airbus 
A380 compared to conventional casting design, which conserve a significant amount 
of fuel, and thus, a lifetime cost reduction can be expected. SLS is also a material 
efficient process as there is minimal wastage of materials. 

Another application is for the automotive industry. As shown in Fig. 23, an air 
intake manifold was produced using a novel quasicrystal-polymer composite. 
Advantages of using this material in SLS process include low porosity and directly 
leak-tight without post-impregnation of resin. Compared to other composites such 
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Fig. 22 Significant weight savings for engine cover door hinges on the Airbus A3 80 using SLS. 
The material used is titanium alloy Ti-6A1-4V (Fachot 2011) 


Fig. 23 Air intake manifold 
created by selective laser 
sintering. The material used is 
AlCuFeB 

quasicrystal-polymer 
composite (Kenzari 
et al. 2012) 



as polystyrene or polyamide filled with carbon fibers, glass particles, or glass fiber 
used in SLS technology, it is found that friction is reduced and wear resistance is 
improved. 


Summary 

High-energy beams have been widely used for various material joining purposes 
today. By using a high-energy beam, a precision welding, cladding, and sintering 
can be carried out easily. In this chapter, fundamentals of laser and electron beam 
welding, laser cladding, and selective laser sintering are discussed systematically. 
Applications of each process are also included in each section. After going through 
this chapter, a basic understanding of each process and its potential application 
should be obtained. 
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Abstract 

This chapter presents the solid-state bonding technologies, in particular the 
thermocompression bonding and thermosonic bonding technologies, which are 
used to form microjoints in the electronics industry. The diffusion bonding 
mechanism and the key bonding conditions required to form reliable joints are 
presented. Moreover, the recent progresses in the thermocompression bonding 
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and thermosonic ball- wedge bonding technologies are highlighted. Lastly, the 
effects of different bonding materials and their surface characteristics on the 
joints’ performance are also discussed. 


Introduction 

Solid-state joining processes involve the bonding of two interfaces without the 
melting of base materials. These processes can be broadly classified as (i) pressure 
processes, (ii) resistance processes, (iii) arc-heated processes, (iv) surface displace- 
ment processes, and (v) diffusion bonding processes (Gould 2008). Figure 1 shows 
the general classifications of the various solid-state bonding processes. 

In this chapter, solid-state joining processes adopted to form microjoints are of 
particular interest. Solid-state bonding technologies particularly the thermocom- 
pression bonding and thermosonic bonding technologies used in the electronics 
industry will be discussed in detail in the forthcoming sections. The bonding 
mechanism (in relation to diffusion bonding) and the necessary bonding conditions 
to form reliable joints will also be presented. Furthermore, this chapter will show- 
case the advances in thermocompression and thermosonic bonding technologies. 


Solid-State Diffusion Bonding 

Diffusion bonding of materials in the solid state is a process to create a joint through 
the formation of bonds at atomic level. This is the resultant of closure of the mating 
surfaces due to localized plastic deformation taking place at elevated temperature 
which aids the interdiffusion at the surface layers of the materials being joined 
(Kazakov 1985). The bonding can be used to form joints between similar and 
dissimilar materials. A solid interlayer can also be used between the mating surfaces. 
This interlayer serves to form solid solution or reacts with the parent materials. 


Mechanisms of Diffusion Bonding 

When two interfaces to be bonded are placed in close proximity, the mating 
surfaces have a large amount of microscopic asperities. The first contact is a 
discontinuous one, such that the interfaces touch each other at the ridges of the 
high spots. In cases without the presence of a diffusion aid, a three-stage mecha- 
nistic model as illustrated in Fig. 2 describes the joint formation. At the first stage, 
upon initial contact with applied pressure, the surface asperities deform mainly due 
to yielding and creep deformation. This leads to intimate contact over a large 
interfacial area with grain boundary at the areas of contact with voids between 
these areas. At the second stage, diffusion dominates over deformation and grain 
boundary diffusion of atoms result in the disappearance of voids. At the same time, 
any remaining voids will be found within the grains due to the interfacial grain 
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Fig. 1 Classifications of the various solid-state bonding processes 



Initial contact of asperity 



1 st stage: Deformation and formation 

of interfacial boundary 



2 nd stage: Migration of grain boundary 

and elimination of pores 



3 rd stage: Elimination of pores by 

volume diffusion 


Fig. 2 Three-stage mechanistic model of diffusion bonding 
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Fig. 3 Surface 
characteristics of a metallic 
surface, revealing the surface 
roughness and presence of 
barrier layers 

Interfacial 

gap 


boundary migration away from the initial joint interface. At the third stage, any 
presence of voids will be eliminated by the volume diffusion of atoms. 

For any metallic materials, their surface characteristics include: 

Surface roughness 

Presence of oxidized or chemically reacted surface layer 
Presence of contaminant layer (like oil, dirt, grease) 

Presence of gas/moisture adsorbent layer 

For any surfaces, there is no absolute smoothness. At microscopic level, the 
smoothest surface can be characterized by a distribution of asperities. These 
asperities can vary in heights, curvature radii, and densities. They act as initial 
contacting sites when two surfaces are brought close together. The asperity summits 
touch before the rest of the spots within the contact area. These asperities undergo 
extensive plastic deformation due to stress concentration at the contact tips, leading 
to bonding. Figure 3 shows the characteristics of a typical metal surface. In order to 
achieve a strong joint, adequate bonding conditions (such as the bonding temper- 
ature, bonding time, and bonding pressure) need to be applied to overcome the 
surface roughness and the presence of the various barrier layers, to expose the 
underlying fresh metallic surface to facilitate diffusion bonding. 

The key mechanisms responsible for diffusion bonding include (i) plastic yield- 
ing, (ii) creep, and (iii) mass transport processes (Derby 1981). 


1 . 

2 . 

3. 

4. 


Layers of oxide and contaminant 



Bonding Conditions 

In order to form a strong and reliable diffusion joint, the following five bonding 
conditions are essential: 

1 . Application of temperature to facilitate plastic deformation and diffusion 

2. Mechanical closeness of faying surfaces through application of pressure 

3 . Application of sufficient holding time 

4. Disruption and dispersion of any surface contaminants 

5. Presence of vacuum or inert bonding environment (dependent on material) 
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Bonding Temperature 

Temperature is the most influential process parameter in the case of thermally activated 
processes like diffusion bonding. A change in temperature will influence the change in 
the process kinetics. Bonding temperature coupled with the application of pressure is 
instmmental to the extent of interfacial contact area taking place during stage 1 (refer to 
Fig. 2). Moreover, the rate of diffusion which governs the shrinkage and elimination of 
voids during stages 2 and 3 (refer to Fig. 2) is controlled by the bonding temperature. 

Diffusion is a phenomenon of material transport by atomic motion. In order for 
diffusion to take place, there must be driving forces like (i) a concentration gradient 
and (ii) temperature. The temperature should be high enough to overcome energy 
barriers to atomic motion. In order for the atoms to move from a lattice site to 
another site, atoms need energy (thermal energy) to break the bonds with their 
neighbors and to cause the necessary lattice distortions during motion from site to 
site. This energy is derived from atomic vibrations. 

Diffusivity can be expressed as a function of temperature as follows: 

D=D 0 e (^) (1) 

where D is the diffusion coefficient at bonding temperature (T), D 0 is the constant 
of proportionality, Q is the activation energy for diffusion, T is the absolute bonding 
temperature, and k is the Boltzmann’s constant. 

A typical diffusion bonding temperature is around 0.5-0. 7 T m (where T m is the 
absolute melting temperature of the parent material) (Kazakov 1985). Though a 
high temperature is necessary under certain circumstances to (i) facilitate the 
disruption and dispersion of surface contaminates/oxides and (ii) to maximize 
plastic deformation and diffusion rate to shrink/eliminate voids, a compromise in 
bonding temperature is still necessary to preserve temperature-sensitive structures 
and to minimize the deformation to maintain dimensional tolerances. 

Bonding Pressure 

The bonding pressure which is applied uniaxially and perpendicularly to the 
bonding surfaces must be sufficient to (i) bring the two mating surfaces within 
interatomic distances, (ii) assist the deformation of surface asperities, (iii) locally 
disrupt any oxide/contaminant layers, and (iv) shrink the voids present. In cases 
whereby the applied pressure is inadequate, voids could still persist in the joint, and 
this weakens the overall joint’s strength. 

As shown in Fig. 2, pressure is critical during stage 1 of the diffusion bonding 
process to permit intimate interface contact. Pressure is also necessary to produce a 
large area of contact at the applied bonding temperature. Although pressure could 
be removed after stage 1 of the bonding process, an early release of pressure before 
the completion of stage 1 could be unfavorable to the process. 

Bonding Time 

The creep deformation of surface asperities and the diffusion process are both time- 
dependent processes. Adequate bonding duration is thus necessary to allow 
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shrinkage and elimination of voids. In addition, to achieve a good joint quality, the 
bonding duration used is closely linked to the bonding temperature and pressure. 

Bonding Surface Condition 

Bonding interfaces which are free of contaminants and oxide are ideal to the 
formation of high strength joints. Proper cleaning (such as oxide removal with 
chemical etching, oxide reduction with forming gas anneal) prior to bonding is 
important to disrupt and disperse the barrier layers present. When fresh, bare metal 
interfaces are exposed, metal-to-metal contact is established, the metallic atoms are 
within the attractive force fields of each other, and a reliable joint is thus created. 

Bonding Environment 

A protective environment could be used to reduce the oxidation of metallic surfaces 
during the bonding process. Bonding under vacuum condition could be used, but 
this adds to the processing costs. Inert gases like argon, helium, and nitrogen are 
used to protect the bonding surfaces. In the case of copper wire bonding, a 
protection gas mixture of nitrogen and hydrogen (in a ratio of 95:5 forming gas) 
is used. 

Advances in Thermocompression Bonding 

Thermocompression bonding is widely used to form interconnections at first-level 
package interfaces. Wire bonding, flip chip bonding, and wafer bonding technolo- 
gies are examples of the thermocompression bonding process. The formation of 
joint is enabled by diffusion which needs the bonding interfaces to be in intimate 
contact. This allows the metallic atomic interaction across the bonding interfaces 
when force and heat are applied. The earliest approach of bonding wires to silicon 
semiconductor devices is by thermocompression bonding. This is followed by the 
development of ultrasonic and thermosonic modes of wire bonding technologies. In 
this section, thermocompression bonding related to flip chip bonding and wafer 
bonding technologies will be discussed. 


Flip Chip Bonding 

A flip chip is a wafer- scale operation whereby the interconnects are formed on an 
entire wafer. The wafer is then diced and the individual dice is picked and placed on 
the substrate. Flip chip bonding takes place by the thermocompression process as 
schematically presented in Fig. 4. In the case of solid-state diffusion bonding, it 
usually involves the direct metal bonding of Au or Cu interconnects by thermocom- 
pression. The Au and Cu interconnects are either plated or formed as stud bumps 
using the wire bonder as shown in Fig. 5. 

In a study by Ang et al. (2006), the interfacial temperature and bonding pressure 
dependences in thermocompression gold stud bonding were established. Results 
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Fig. 4 Diagram showing a 
flip chip thermocompression 
process 







20 WD = 12.0 mm Signal A = SEi EHT = 15.00 kV Mag = 750 X 



Fig. 5 Schematic diagrams and photos showing flip chip interconnects: (a) wire stud bump and 
(b) plated bump 

revealed that there is a critical bonding temperature below which no bonding will 
take place. However, above this temperature, the metal surface becomes thermally 
activated, and the shear strength improves with bonding temperature due to the 
increase in the true bonded area. This critical temperature can be interpreted to be 









648 


S.M.L. Nai et al. 


the onset of the breakup of organic barrier films. Furthermore, pressure dependence 
was demonstrated when the tensile strength of the joint exhibited a maximum 
beyond the critical temperature. 

With continual miniaturization of feature size to meet the ever-increasing 
performance demands on integrated circuits, three-dimensional (3-D) integration 
is the technology which offers the flexibility in system design, placement, and 
routing. A 3-D integrated circuit (IC) is made up of a stack of vertically bonded and 
interconnected chips. The inter-chip connection is done using the through- silicon- 
via (TSV) technology. Cu-Cu bonding is favored by the industry for TSV integra- 
tion as it concurrently forms the necessary mechanical and electrical connections. 
For 3-D interconnects, the Cu-Cu thermocompression bonding technology is the 
most prominent direct metal bonding method. Cu-Cu joints are desired compared 
to joints formed using solder materials as: 

(i) Cu-Cu bond is more scalable and ultra-fine pitch can be achieved. 

(ii) Cu-Cu joints exhibit better electrical, mechanical, and thermal properties as 
no intermetallic compound is formed, unlike for the joints formed by solder 
materials. 

(iii) Cu has better electromigration resistance and can withstand higher current 
density in the future. 

In recent years, much research effort is also placed to achieve low-temperature 
Cu-Cu bonding for (i) thermal budget consideration, (ii) lower thermal-mechanical 
stress, and (iii) alignment control. 

In existing literature, several promising approaches such as the utilization of a 
self-assembled monolayer and a nanoporous metal layer have successfully demon- 
strated the formation of metal-metal joints by thermocompression. 

Use of Self-Assembled Monolayer (SAM) 

Several researchers have reported that by coating a SAM monolayer on metallic 
surfaces such as gold or copper prior to bonding, the bonding temperature required 
for forming joints can be significantly reduced (Li et al. 2009, 2010; Ang 
et al. 2007, 2008a, b, 2009; Chin et al. 2006; Tan et al. 2009). 

Typically, for the case of direct Cu-Cu bonding, a high bonding temperature 
(more than 300 °C) is necessary to forge a reliable joint, due to the oxidative nature 
of copper. In a study by Ang et al. (2009), direct Cu bonding with the aid of self- 
assembled monolayer (SAM) layer was demonstrated under ambient condition, at 
low bonding temperature below 140 °C. The joint formed has a reliable mechanical 
joint integrity of 50 MPa. In this study, electroplated Cu bumps were acid cleaned 
and immersed into ethanolic thiol solution followed by the deposition of a layer of 
undecanethiol. It was reported that the SAM monolayer protects the Cu surface from 
further oxidation after acid clean. In comparison to the bulk Cu oxide layer, the SAM 
monolayer is easily displaced by mechanical deformation at the bonding interface. 
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Fig. 6 Micrograph showing the cross-sectional view of the Au nanobump joint formed at 150 °C 
with bonding pressure of 10 MPa (Extracted from Oppermann and Dietrich (2012)) 


In another study by Chin et al. (2006), SAMs are also applied to improve the 
bondability of Au-Au thermocompression bonding at low temperatures. Au stud 
bumps were successfully bonded to sputtered Au surface with a layer of 
dodecanethiol (DDT) SAMs. Low-temperature bonding ranging from 80 °C to 
180 °C and bonding pressures ranging from 225 to 566 MPa are achieved. The 
presence of the SAM layer was found to passivate the bonding surface after the 
removal of the organic contaminant layer by chemical cleaning. 

Nanoporous Au Bump Joints 

Oppermann and co-workers from IZM (Oppermann and Dietrich 2012) formed 
joints of nanoporous Au bumps. Firstly, nanoporous metal layers are electroplated 
on copper strips and then transferred to Si wafers using a standard wafer bumping 
equipment. The nanoporous gold bumps were deposited by electroplating 
silver-gold alloy. The silver is then etched away to result in a nanoporous gold 
layer as an open porous sponge with porosity of 70-80 %. The porous joints are 
formed by thermocompression flip chip bonding, and a significant reduction in 
bonding temperature and pressure can be achieved with the nanoporous bump 
structure (refer to Fig. 6). The lowest bonding temperature of 150 °C is achieved 
at a bonding pressure of 10 MPa. The bumps formed are reported to have a 
reduction in height without an increase in the bump’s diameter due to the com- 
pressible interface. 
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Wafer Bonding 

Wafer Bonding with Metallic Interlayer 

Wafer bonding technologies can be broadly classified into (i) direct wafer 
bonding (DWB) and (ii) mediated wafer bonding (MWB). For DWB, there is no 
need for any intermediate layer between two plain wafers which are to be bonded, 
while for MWB, as the name suggests, there is a presence of an intermediate 
layer between the two wafers to facilitate the formation of permanent joint. 
Thermocompression bonding is one of the techniques categorized under the 
MWB approach. Figure 7 shows the schematic diagram of a thermocompression 
bonding setup of two wafers with a metal interlayer such as gold or copper (Tsau 
et al. 2002, 2004; Taklo et al. 2004; Tsau 2003; Kim et al. 1995). Firstly, the two 
wafer surfaces are brought to close contact. Heat and pressure are then applied to 
allow interdiffusion to take place. Any oxide present on the surface of the metal 
interlayer will impede the formation of good joint; thus, it must be removed prior 
to bonding. 

Tsau et al. (2004) demonstrated the thermocompression bonding of silicon 
substrates using a bonding layer of gold thin film with modest application of 
temperature and pressure. The following summarizes the findings: 

(1) There is a need for a diffusion barrier layer (such as silicon dioxide, Si0 2 ) 
as the presence of Si at the Au bonding layer will hinder the thermocom- 
pression bonding. The Au film reacts with the underlying Si (Kim 

et al. 1995). 

(2) In order to facilitate interdiffusion at the bonding interfaces, the mating sur- 
faces must be brought into close contact when pressure is applied. Thus, a 
smooth Au film surface is desired so as to utilize low bonding temperatures. 

(3) UV-ozone treatment is recommended to aid the removal of organics present on 
the surface of the wafers. 

(4) Based on the study (Tsau et al. 2004) on the effects of bonding pressure, 
bonding temperature, and bonding time on the bond toughness, it is 
recommended that a bonding temperature of 260 °C and bonding pressure of 
120 MPa can meet the repeatability and yield requirements of the industry. 
Furthermore, bonding time is found to have the least influence on the bond 
quality, and owing to the preference of faster throughput, the bonding duration 
can be as short as 2 min. 

In another wafer-level thermocompression bonding with gold study by Taklo 
et al. (2004), a bond yield of 89 % and bond strength of 10.7 =b 4.5 MPa are 
achieved when the wafers are bonded at 298 °C, with an applied pressure of 4 MPa 
for 45 min. The wafer pair consists of a piece of Coming #7740 (Pyrex) glass wafer 

o 

with a 2,000 A thick Au layer bonded to a Si wafer. The joints are reported to be 
sufficiently strong and dense for MEMS application. 
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Fig. 7 Schematic diagram of a thermocompression bonding setup of two wafers with a metal 
interlayer such as gold or copper 




Fig. 8 Cross-sectional SEM images showing Cu nanorod arrays bonding at (a) 200 °C and 
(b) 400 °C (Extracted from Ko and Chen (2012)) 


Cu Nanorod Array 

In a study by Wang et al. (2009; Ko and Chen 2012), an array of Cu nanorods with a 
diameter of 10-20 nm grown by an oblique angle deposition technique is used to 
achieve low-temperature wafer bonding. It is reported that the surface-melting 
disintegration of the nanorod arrays took place at a temperature significantly lower 
than that of the bulk Cu melting point. With the aid of a wafer bonder, at a bonding 
temperature of 200 °C and an applied pressure of 0.32 MPa, the Cu nanorod arrays 
undergo coalescence. As the bonding temperature increases to 400 °C, a fully dense 
Cu bonding layer with homogeneous structure is achieved (refer to Fig. 8). This is 
attributed to the higher-mobility nature of Cu species associated with Cu nanorods in 
comparison with the use of a Cu blanket film. It is observed that the phase transfor- 
mation temperature of Cu nanorod is considerably lower than its bulk melting point 
of 1,085 °C. Based on the Gibbs-Thomson theory, it is well known that the 
temperature to activate atomic transport decreases with decreasing particle size. 
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Fig. 9 Schematic diagram showing the surface-activated bonding process flow (Adapted from Ko 
and Chen (2012)) 


Surface-Activated Wafer Bonding 

Conventionally for direct wafer bonding, high annealing temperature is neces- 
sary. This often results in thermal stress and the generation of defects. In order 
to overcome the use of high annealing temperature, the surface-activated bond- 
ing approach is developed to achieve room temperature without any annealing 
steps. As schematically shown in Fig. 9, the contaminant layers and surface- 
absorbed molecules are removed from the bonding surfaces by intentionally 
bombarding the surfaces with ion or fast atom beam. When the two surfaces are 
brought into close promixity, the content area propagates spontaneously by 
surface attractive forces between the wafers. This is due to the resultant dan- 
gling bonds on the surface atoms after the surface activation step. The atoms 
being in the unstable state will form strong chemical bonds between the two 
surfaces. 

Room temperature Cu-Cu direct wafer bonding is reported by Kim et al. (2003) 
using the surface-activated bonding method. Thin Cu films are firstly deposited on a 
diffusion barrier layered 8 in. silicon wafer. It is a requisite for direct bonding to 
have clean bonding interfaces. The presence of any oxides and carbon-based 
contaminants on the wafer surfaces will hinder effective wafer bonding. Thus, 
prior to bonding, the Cu surface is activated using a low-energy Ar ion beam of 
40-100 eV. Auger analysis revealed that the strong carbon and oxygen peaks 
detected due to organic contaminants and native metal oxides are mostly removed 
after the cleaning step using Ar ion beam bombardment for 60 s. Furthermore, 
although no thermal annealing is needed to increase the bond strength after bond- 

o 

ing, the bonding is carried out under an ultrahigh vacuum environment (~10 
Torr). Tensile test results revealed that despite bonding at room temperature, high 
bonding strength equivalent to that of bulk material is achieved. The bonded Cu 
interfaces also showed no presence of voids from the transmission electron micros- 
copy (TEM) analysis. 
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Advances in the Manufacturing of Thermosonic Ball-Wedge 
Bonding 

Current Trend of Thermosonic Bonding 

The wire material used in thermosonic bonding has been dominated by the use of gold 
(Au), be it 99.99 % (4N) purity or lesser purity of 99.9 % (3N) and 99 % (2N) until 
recent years. Almost all thermosonic ball bonders designed and manufactured for 
mass production of integrated chip (IC) device assembly before mid- to late 1990 were 
for Au wire only. The majority of size used in device assembly has been around 
25-30 pm (1-1.2 mil) in diameter. The decision on wire diameter followed the ITRS 
wafer technology node road map for ICs and the associated wire bond interconnect 
pad pitch trend. As wafer technology node evolved into smaller dimension, the size of 
the wire also reduced accordingly to ensure that more wires can be packed into the 
device. Before 2008, the industry had been working on reducing Au wire diameter for 
fine technology node as well as for cost reduction. Figure 10 shows how much cost can 
be saved with reducing wire diameter. Just by reducing the wire diameter from, say 
25 to 23 pm (1 to 0.9 mil), a 16 % cost savings can be realized. Compared to, however, 
the cost of using copper (Cu), which is just a tiny fraction of today’s Au price, the cost 
savings by wire diameter reduction is insignificant. 

With the continued increase of Au price over the last 10 years, semiconductor 
package assembly houses have to consider lower cost of bonding wire materials in 
order to remain competitive in the market, especially high-density devices that use 
more than a hundred or even a thousand wires. Gold price increased from around 
US$300 per troy ounce in the beginning of 2002 to well over US$1700 10 years 
later, and at one point of time, it even reached close to US$1900 in September 2011 
(Fig. 11). Using a lower-cost material, such as Cu, can help the assembly houses 
save millions of dollars due to the huge difference in unit cost of the material 
(less than one US dollar per oz). 

Copper wire had been used for many years in power devices where thick wires in 
the dimension of 50 pm or larger have been used and the number of wires per device 
was few. These wires are mainly used with ultrasonic bonding process in which no 
ball formation is required. However, the economic tsunami in 2008-2009 period 
had driven the use of fine diameter Cu wire to be used to replace Au wire for major 
cost savings in IC packages. The size of Cu wire started at 25 pm (1 mil) to replace 
the more expensive Au material of the same diameter used in low-pin count IC 
devices such as SOP, TSOP, etc., but the trend did not stop there and more 
conversion from Au to Cu happened to all wire diameters down to 20 pm 
(0.8 mil) as well as 18 pm (0.7 mil) in recent years. 

When processing Cu bonding wire, changes to the design of ball bonder must be 
carried out because Cu being an easily oxidized material when heated in the 
presence of oxygen, a protection environment must be provided during the melting 
of the wire to form a ball before bonding onto the IC pad. Figure 12 shows an EFO 
(electric flame off) wand design for Cu wire bonder. For Au wire, there is no need 
for oxidation protection due to its inertness to environment. EFO wand is visible 
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Cost Saving Assumption: 

200 I/O QFP LGP = $900/troz 
IIPH = 80 Wire = 25um 
Utilization = 80% 

Wire consumption = 3.5mm/pin 
Annual wire use = 387 km/bonder 


Annual Savfng/bonder= US $ 1 5,500 (with 

23um) 

Annual Saving/bontiers US $ 35,900 (with 
20um) 


Fig. 10 Material cost saving of Au wire by reduction of diameter 


Gold $/oz (Basis London Fixing) 



25 pm (1 mil) — 

23 pm (.9 mil) — 

— 20 pm (.8 mil) 

- 15 pm (.6 mil) *■ 



Fig. 11 Price trend of gold (Au) from 2002 to 2012 

underneath the tip of the wire. Hence the kit, which can be of glass or ceramic, is not 
needed and the Au wire is exposed in air during processing. However, for Cu, 
depending on the machine maker and design model, an EFO kit must be used so as 
to allow the protection gas to be purged to protect the Cu ball during formation to be 
oxide-free. The common protection gas is a mixture of nitrogen and hydrogen in a 
ratio of 95:5 (forming gas), although for Pd-coated Cu wire, nitrogen alone can be 
used. The kit can be an enclosure type or open concept as long as the tip of the wire 
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Fig. 12 Electric flame off kit 
of a K&S Cu bonder with gas 
purging facilities 


N 2 or FG Gas 

inlet tube 


EFO Body 



NVT 

<N, supply] 


EFO firing electrode 
inside ceramic tube 


Ceramic Tube 


0.1-10 Ipm flow meter included in kit 


is inside the gas region during ball formation. The EFO assembly shown in Fig. 12 
is in a Kulicke & Soffa (K&S) Cu wire bonder. The rectangular white object is the 
EFO tube which has an enclosed feature allowing better protection to the tip of the 
wire which is located inside the flow path of forming gas. 


Properties of Thermosonic Bonding Wires 

The elements, Au, Cu, Ag, and Al, in wire form with low electrical resistivity - 1.63 
to 2.7 pO.cm - are commonly used for interconnections (Harman 1997). These four 
elements are face centered cubic in crystal structure with different atomic properties 
(Cullity 1978). Moreover, for a thermosonic ball-wedge bonding, a wire has to 
satisfy the essential factors: (a) free air ball (FAB) formation, (b) deformation of the 
FAB and easy welding to Al bond pad, (c) loop formation, (d) deformation of wire 
and easy welding to lead fingers, and (e) reliability of ball and wedge bonds (first 
and second bond) to attain good electrical interconnections. Among these four 
elements, the inert Au satisfies all the five vital factors and has been used for 
three decades in the field (Harman 1997; Prasad 2004). Next to Au, Cu wire 
bonding technology has matured, especially in the formation of FAB under pre- 
ventive environment. For niche applications, Ag wire bonding is widely used in the 
industries, while Al wire bonding is still in developmental stage to find a suitable 
method for the formation of a good FAB. The following sections mainly discuss the 
salient engineering advancement in the field of Au and Cu wire bonding. 

Composition of Bonding Wires 

High pure Au (5N, 99.999 %) is extremely soft, and for a successful wire drawing 
and bonding, it is usually doped (Qi and Zhang 1997; Chew et al. 2004) with Ca, 
Cu, Be, Pd, Pt, Ag, Ce, etc., in parts per million (ppm). In addition, the annealing 
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process design of 4N and 3N purity Au wire produces a stiff and soft wire suitable for 
bonding, particularly for looping. Later, 2N purity Au wire, a high-reliability product 
without any Kirkendall voids, has been used. Electrical grade oxygen-free high 
conductivity (OFHC) 4N Cu has been used for bonding. Fine wires made using 5N 
and 6N purity Cu are examined in the view of developing a softer Cu wire than 4N 
Cu. Alloyed Cu with IN purity (>90%) with certain elemental addition and Pd-coated 
Cu (PCC) wire with 4N purity Cu core and 4N purity-plated Pd are examined for better 
bondability and reliability (specifically biased HAST/THB) performances. 

Processing of Bonding Wires 

Wire processing (Zompi et al. 1991; Camenschi and Sandru 1980) is an established 
technique since the 1980s. In general, the processing of bonding wire sequences is 
as follows: melting the raw material under preventive atmosphere/vacuum, contin- 
uous casting to thick rods, heavy wire drawing at room temperature, intermediate 
annealing, final wire drawing at room temperature with stringent die management, 
and final annealing and spooling in a clean room environment. 

At present, bonding wires using Au, Cu, Ag, and Al are wire drawn at room 
temperature until 10 pm diameter. The manufacturing yield loss is high on drawing 
the wires in the diameter range of 10-15 pm. Compared to electrical grade Cu wires 
in the market, the bonding wires have to be processed with high quality and tight 
tolerances, as it is guided via a capillary during wire bonding processes. 

Properties of Bonding Wires 

Physical, thermal, electrical, and mechanical properties of the bonding wires are 
tabulated in Table 1. The tensile curves of Au, Cu, PCC, Ag, and Al wires are shown 
in Fig. 13. The tensile test was performed using Instron 5,400 series model. As Au is 
malleable, the fine wires are annealed between 2 % and 6 % elongation (EF) to suit for 
wire bonding. Fine Cu wire (standard (Std)) is harder and stiffer than Au and generally 
annealed to a higher EF of 12-16 %. Soft Cu wire developed by special processing 
technique requires low load to deform under tensile (Fig. 13). Also, the modulus of soft 
Cu wire remains the same as that of Au. Thus, the soft Cu wire gains popularity and is 
being used nearly 50 % of market shares among the bare Cu wires available in the 
bonding wire manufacturing industries. PCC wire is annealed similar to Cu, and Ag 
wire is annealed similar to Au wire. The low melting point of element Al, which forms a 
stable surface oxide film, has to be annealed in batches and not continuous strand 
annealed like the other bonding wires with a high-temperature melting point. 

Table 1 provides the measured resistivity of bonding wires, calculated using the 
basic formula p = R.A/F, where R is the measured resistance in ohms using an 
ohmmeter connected to a fixture, A is the cross-section area of the wire, and F is the 
measured length of the wire. Fusing current is the maximum current carrying 
capacity of a wire, above which the wire fuses (melt). The fusing current is 
measured at room temperature and atmospheric conditions (Table 1). Factors 
such as wire diameter, wire length, wire type (Au, Cu, Ag, Al), and input pulse 
current strongly influence the fusing current. For instance, increase in wire diameter 
or decrease in wire length increases the fusing current (Fig. 14). Fiterature 
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Table 1 Measured properties of 4N purity bonding wires 


Properties 

Au 

Std Cu 

Soft Cu 

PCC 

Ag 

Al 

Density, a g/cm 

19.30 

8.92 

8.92 

9.03 

10.51 

2.70 

Melting point, a °C 

1,063 

1,083 

1,083 

1,081 

961 

660 

Specific heat capacity 
at 25 °C, a J/kg K 

126 

386 

386 

352 

237 

900 

Coefficient of thermal 
expansion (CTE, 

0-30 °C), a pm/m °C 

14.2 

16.5 

16.5 

16.6 

19.1 

23.1 

* Young’s modulus, 
GPa 

80-90 

110-140 

80-90 

95-105 

75-85 

70-75 

^Tensile strength, 
MPa 

140-290 

240-250 

190-210 

200-220 

160-290 

140-200 

^Elongation, % 

2-7 

8-16 

8-16 

10-18 

2-8 

4-8 

Hardness b (HV, 
10 mN/5 s) 

55-70 

100-110 

85-95 

90-105 

60-70 

60-70 

Electrical resistivity, 13 

p£2.cm 

2.36 

1.69 

1.69 

1.70 

1.63 

2.70 

Fusing current, 13 mA 
(300 ms input pulse 
current) 

300 

330 

330 

330 

310 

260 


a l mm wire 

b 20 pm wire diameter 


(Krabbenborg 1999; Mertol 1995; Loh 1983) reports few methods to calculate 
fusing current of fine metal wires. Trend of the predicted and measured reading is 
similar for varying wire diameter, length, and type, yet the measured value is 10 % 
lower than the predicted one. 


Floor and Shelf Life of Bonding Wires 

End users consider floor and shelf life of bonding wires as an important factor. Gold 
is inert, and if it is stored inside the plastic holder in a dirt-free environment, the 
floor and shelf life is generally practiced as 3 months and 1 year, respectively. No 
rigorous floor and shelf life studies are being conducted for Au wires. Copper is 
prone for oxidation and is usually sealed under vacuum. It is recommended to be 
stored away from corrosive industrial atmosphere, such as humid and chlorine-rich 
coastal atmosphere. Similarly, Ag is prone to corrode if the atmosphere is rich in 
sulfur. Moreover, Al wire with stable oxide layer is much safer than Cu and Ag 
wire, yet it is possible to corrode if the environment is corrosive. 

Floor life of a wire is defined as how long the wire can be used after opening the 
vacuum seal, and shelf life is defined as the period that the vacuum-sealed wire 
spools can be stored and used later for bonding. The annealed wires are stable and 
expected to have no significant changes in micro structures and tensile properties. 
Therefore, wire surface oxidation and bonding performance are studied in detail. 
For instance, wire surface oxidation and stitch pull evaluation of 0.8 mil soft Cu 
wire tested for 90 days after opening the vacuum seal and 6 months vacuum- sealed 
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Fig. 13 Tensile curve of 20 |im (0.8 mil) annealed bonding wires 


conditions are shown in Fig. 15. The recommended floor and shelf life of soft Cu 
wire is 7 days and 6 months, respectively. Similarly, the floor and shelf life of A g 
and Al wires are recommended by the manufacturers. 


Surface Characterization of Bonding Wire, Bond Pad, and Lead 
Finger 

The roughness and oxide layer of the two joining surfaces are the two factors that 
influence a good bonding of a wire to bond pad or lead finger to establish an 
interconnection. This section provides data on the surface characterization of 
wire/pad/finger for a good bonding. 
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Fig. 14 Measured fusing current of 4N soft Cu wire for varying diameter and wire length (300 ms 
input pulse current) 
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Fig. 15 Floor and shelf life of Cu bonding wires: (a) and (b) surface oxidation measurement using 
SERA, (c) and (d) stitch pull evaluation 


Surface Oxidation of Bonding Wire 

Except Au, almost all metals readily get oxidized when exposed to air and so as 
Cu. ECI Technology has designed and developed an equipment, QC200, which 
measures oxide layer thickness by sequential electrochemical reduction analysis 
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Fig. 16 Surface oxide 
measured using SERA for 
good and oxidized wires 



(SERA) technique (Tench 1994). The measurement is based on Faraday’s principle 
of electrolytic reduction of a cell. Surface Cu oxides are reduced with hydrogen ions 
in a cell consisting of reference and auxiliary electrodes, borate electrolyte, 
and specimen. The cathode potential is constant for each type of surface film 
(CuO, Cu 2 0). Oxide film reduction and its thickness are proportionate to surface 
area of Cu wire sample, current flow, potential difference, electrolyte, etc. The 
SERA technique is accurate enough to measure the copper oxide layer thickness in 

o 

A. Reduction of Cu 2 0 or CuO occurs at a unique voltage range, and the time taken 
for the reaction is used to calculate the thickness of the layer as per Faraday’s 
equation: 


M * I * t * 10 8 , . 

T = (2) 

n * F * S * d 

where I is current in ampere, t is reduction time measured in sec, M is molecular 
weight in g, F is Faraday’s constant (96,498 coulombs), n is number of electrons, 
S is surface area of immersed Cu section in cm 2 , and d is density of film in g/cm 3 . 
The annealed 4N Cu wires in normal production and laboratory environment 
showed neither presence of Cu 2 S nor detectable. On the other hand, CuO is detected 
when Cu wire is exposed to harsh oxidizing atmosphere (Fig. 16). The system is 
also capable to measure AgO or Ag 2 S on Ag wire surface. 

As a quality control measure, every batch of final annealed Cu wire is examined 
for dirt and loose wires under low power scope (l-10x)as well as surface oxidation 
using SERA. For a good Cu wire bonding, the surface oxide (Cu 2 0) layer thickness 

o 

is observed to be ~20 A. A typical voltage versus time plot of a reduction of 0.8 mil 
Cu wire surface is shown in Fig. 16. Observation of the wire surface in scanning 
electron microscopy (SEM-EDX analysis) at high magnification (200-1 0,000 x) 
revealed a clean, contamination-free surface with no traces of oxygen. 

Surface Contaminants and Oxidation of Bond Pad 

Metals exposed to atmosphere at room temperature form an oxide film thickness of 

o 

20-100 A. Auger analysis of Al-metallized bond pad surfaces clearly revealed an 
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Table 2 Mechanical properties of pure metals and their oxide layers 


Metal 

VHN, 

kg/mm 2 

Threshold deformation, 

% 

Surface 

oxide 

Oxide hardness, 
VHN 

Al 

35-48 

40 

AI 2 O 3 

1,800 

Ag 

25-95 

— 

Ag 2 0 

135 

Au 

20-60 

— 

— 

— 

Cu 

49-87 

78 

Cu 2 0 

160 

Pd 

37-100 

— 

PdO 

— 

Pt 

40-100 

— 

Pt 2 o 3 

— 

Ti 

60-90 

— 

Ti0 2 

— 

Ta 

90-200 

— 

TaO 

— 

Ni 

45-220 

89 

NiO 

480 

Si 

254 

— 

Si0 2 

— 


Extracted from Murali et al. (2007) 


oxygen peak. Depth profile analysis using secondary ion mass spectroscopy (SIMS) 
on bare Cu lead frame surface revealed presence of oxygen until 2.4 nm thickness 
(Murali et al. 2007). Both Al and Cu surfaces possess good bondability. This 
indicates that the aluminum/copper oxide films do exist on the surfaces before 
joining. However, they are broken in the course of wire bonding for a successful 
welding. Hard oxide layers such as aluminum oxide (alumina, 1,800 VHN) seen on 
Al are comparatively more difficult to break than soft metallic oxide layers 
(Table 2) (Murali et al. 2007). During wire bonding, supplied ultrasonic energy 
displaces the capillary by ~8 pm; hence, the FAB compressed by capillary first 
scrubs the bond pad surface as well as breaks the oxide layers (aluminum oxide, 
copper oxide, etc.) of the contact surfaces. The oxide film on Cu FAB and wire 
surface have to be broken in order to weld. In addition, contaminants such as 
deposits of carbon, fine dirt, epoxy outgassing, substrate outgassing, fluorine, 
sulfur, etc., are generally found on the metallized bond pad (Al, Au) and plated 
lead frame finger (Ag, Cu, Pd, Au) surfaces (Murali et al. 2007). Deposition of the 
outgassed compound may cause nonstick on lead (NSOL). 

Surface Roughness and Plastic Deformation of Asperities of Bondable 
Surfaces 

Surface roughness (Ra) of Al- and Au-metallized bond pad surfaces is within the 
range of 0.02-0.09 pm, and bonding to this smooth surface does not cause a 
problem with either Au or Cu ball bonds. Ra measured for plated lead frame finger 
surfaces varies between 0.02 and 1 .4 pm. The surface profile of Al bond pad and 
Ag-plated lead frame measured using 3-D laser microscope are shown in Fig. 17. 
Au-, Cu-, Ag-, and Pd-plated surfaces with Ra of 0.2-1 pm revealed good bonding. 
For better grip of epoxy molding compound to lead finger surface, rough lead 
finger surface is preferred to avoid delamination. But increase in Ra greater than 
1 pm may cause nonstick on lead (NSOL). Considering bare Cu lead frame, 
Ra greater than 0.15 pm causes NSOL. Ra measured by noncontact method on 
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Al-lwt%Si-0.5wt%Cu Bond Pad 





Ag Plated Lead Frame Finger 



Fig. 17 Surface profile of Al bond pad and Ag lead finger observed using Lext-OSL4000, 3-D 
laser microscope 


Cu FAB spherical surface and wire surface revealed around 0.17-0.55 pm Ra 
(Murali et al. 2007). The reported data are measured for the devices used in the 
industries. 

Surfaces assumed to have hills and valleys and the asperity contact are important 
for joining. Deformation at the asperity contact can be elastic or elastic-plastic. 
Welding is not possible below the threshold deformation of asperity. Table 2 
consolidates the hardness and tensile strength of some pure metals, threshold 
deformation of certain metals, and hardness of some oxides. The threshold defor- 
mation for Al is 40 %, while to crack the aluminum oxide film, 1 % or less 
deformation is required (Murali et al. 2007). Hence, oxide film can be easily broken 
by the supplied energies when two surfaces contact with each other. Grain orien- 
tation and surface micro structure may also affect the bondability, especially for 
bare Cu surface. 


Thermosonic Bonding Process 

Wire bonding process involves four important steps: 

(a) Axisymmetric spherical and consistent FAB formation without off-center 

(b) First bond (ball bond) bondability, FAB to deform, weld, and possess intact 
contact with the bond pad without any irregularity 

(c) Looping - bonded wire between ball and stitch bonds to be stiff and not to sag 
during post processing to avoid shorting with adjacent or upper/lower tier wires. 
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It also has to satisfy the dimensional tolerances for long loop, low loop, high 
loop, reverse bonding, leaning, swaying, loop height, fine-pitch bonding, etc. 

(d) Second bond (“stitch” or “wedge” bond) bondability, the wire to deform, weld, 
and possess integral contact with the lead finger. Create a consistent tail end on 
breaking the wire at stitch. 

For the last three decades, Au wire has satisfied the abovementioned steps for the 
diameter from 15 to 75 pm (0.6-3 mil). With hike in Au price, industries are forced 
to find an alternate to Au wire, which must satisfy the above steps to enter the 
family of bonding wires. Undoubtedly, bare Cu, alloyed Cu, and PCC wires started 
replacing Au wire (Breach 2010; QiJia et al. 2010; Zhong 2009). Industries have 
started to investigate Ag wire for niche applications. This section discusses the 
advances in the engineering of wire bonding. 

FAB Formation 

Electric spark to the tail end of the wire melts a controlled length of it, as the 
temperature of the wire reaches higher than the melting point. Due to the surface 
tension, the molten pool rolls up into spherical ball shape and then solidifies 
(Huang et al. 1995). This is termed as “free air ball (FAB)” (Fig. 18a). In 
principle, balance between heat and mass transfer creates the FAB. Heat transfer 
is predominantly by thermal conduction at the neck between the ball and wire 
regions. Secondary mode of heat transfer may be due to thermal convection and 
radiation by air cooling. The shape-up of spherical molten pool satisfies the 
minimum surface area criteria. Rapidly solidified FAB in the atmosphere is 
axisymmetrical to the wire axis (Fig. 18a). The ratio of FAB to wire diameter is 
usually maintained between 1.6 and 2.2. The processing parameters that govern 
the formation of FAB are: 

(a) Electric flame off (EFO) firing current 

(b) EFO time 

(c) EFO voltage 

(d) EFO wand gap (gap between the wire tip and electrode tip) 

(e) Morphology of the wire tip 

(f) Gas flow rate, for bonding wires other than Au 

(g) Plating thickness, for coated bonding wires 

The factors that influence the FAB formation are: 

( 1 ) Reaction of the molten FAB surface to its environment 

(2) Balance of surface tensional force on the molten FAB surface 

(3) Gravitational force 

(4) Composition of the wire 

(5) Density of the wire 

(6) Thermal conductivity of the wire 

(7) Electrical conductivity of the wire 
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Fig. 1 8 Scanning electron microscopic (SEM) observation of (a) good Cu FAB, (b) apple bite Au 
FAB, (c) oxidized Cu FAB, (d) pointed Cu FAB, (e) pointed and oxidized PCC FAB, and (f) ring 
pattern in Cu FAB 


Au wire is inert; on melting no sign of oxidation is observed on the FAB surface, 
thus balancing the gravitational force (weight of FAB) and the surface tension at the 
solid-liquid interface of the FAB. Certain elemental addition in weight ppm can cause 
apple bite structure in Au FAB (Fig. 18b). During alloy design of bonding wires, study 
of FAB formation has to be considered as an utmost factor to qualify the wire. 

Copper FAB 

In the last few years, industries have moved to Cu wire bonding on a large scale of 
manufacturing. Electrical firing of Cu wire at the atmosphere leads to severe surface 
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oxidation (Fig. 18c) and mostly absence of FAB formation. On purging with 
reactive forming gas (N 2 :H 2 , ratio of 95:5), a good Cu FAB is formed. An 
axisymmetrical Cu FAB formation depends strongly on gas flow rate besides 
EFO processing parameters. Nominal forming gas flow rate required to form 
axisymmetrical FAB is 0.5-0. 6 1pm. Gas flow rate greater than 1 1pm causes 
pointed FAB (Fig. 18d) due to rapid solidification, and gas purging lesser than 
0.3 1pm creates surface oxidation and pointed FAB (Fig. 18e). The rapid solidifi- 
cation of high volume of purging forming gas exhibits forced convection of heat 
transfer of Cu FAB. Now and then, at the neck region, few ring patterns are 
observed (Fig. 18f), perhaps molten ball roll-down in steps. Though reason is 
unknown, it could be due to a slight fluctuation in gas flow velocity in the Cu kit 
causing the imbalance between gravitational force and surface tension. 

From the cross-sectioned micro structure, pore-free FAB is apparent (Fig. 19a). 
SEM observation by backscattered mode of etched FAB is shown in Fig. 19b, where 
surface grain boundaries are clear. Columnar grain growth inside the FAB and wire- 
deformed structures are observed for both 4N Au and 4N Cu wires, but no distinct 
heat-affected zone (HAZ) is observed for Cu wire (Fig. 19c, d). A typical grain 
orientation of 4N Cu FAB is shown in Fig. 19e with mostly { 101 } plane of 
orientation. Texture analysis of FAB formed using 3N, 4N, and 5N Cu wires 
exhibited variation in plane of crystallographic orientation of grains (Srikanth 
et al. 2007). High-purity 5N Cu wire showed FAB grains formed with mixed 
modes of { 1 1 1 } and { 001 } plane of orientation, while 3N and 4N Cu wires revealed 
majority of the FAB grains formed in the mode of {001} plane of orientation 
(Srikanth et al. 2007). Published literature on the texture analysis of wire and FAB 
is minimal. Nucleation and growth of grains in the FAB depends on the presence of 
impurity elements and its concentration. Residual stress, density of dislocation 
pileup, and type of atomic defects in the deformed wire due to the presence of 
impurities are the source of grain nucleation and hence resulted in the change of FAB 
texture. Differences in the FAB hardness are expected when the FAB texture varies. 

Pd-Coated Cu (PCC) FAB 

Considering the FAB micro structure of PCC wire, conveniently PCC wire can be 
categorized as a next-generation bonding wire. PCC wire can form an axisymmet- 
rical spherical FAB with the purging gases, inert N 2 or reactive forming. A good 
FAB is attained by purging 0. 3-0.4 1pm of N 2 or forming gas. On electric sparking, 
the core Cu melts and rolls up to form FAB; it is expected that the coated Pd be 
dissolved into the molten Cu as Cu-Pd has a good dissolution (Cu-Pd is 
solid-solution mixture and forms a homogeneous alloy (ASM Handbook 1995)). 
Surprisingly, Pd with high melting point (1,554 °C) than Cu (1,083 °C) has neither 
dissolved into the molten Cu FAB nor wetted and covered the FAB. Instead, Pd 
moved up to the top and then segregated within the neck region of FAB (Fig. 20a). 
This indicates that within a few milliseconds of time of FAB formation, nanometer 
thick coated Pd layer has poor dissolution into the molten Cu. Furthermore, coated 
Pd has melted, non-wetted with molten Cu, shaped along with the FAB and 
accumulated along the top of FAB periphery (near to wire neck region). 
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Fig. 19 Etched micro- and macrostructures of FABs: (a) pore-free, 2 mil cross-sectioned Cu 
FAB, (b) backscattered SEM mode of 1 mil Cu FAB, (c) cross-sectioned structure of 1 mil Au 
revealing FAB/neck/wire regions, (d) cross-sectioned structure of 1 mil Cu revealing FAB/neck/ 
wire regions, and (e) inverse pole figure of 4N Cu FAB processed using 0.8 mil wire 


The macro observation of the PCC FAB using an optical scope showed bluish- 
silvery color for Pd areas and reddish-brown color for exposed Cu regions (Fig. 20a). 
The cross-sectioned FAB also revealed segregation of Pd within the FAB closer 
to wire neck region (Fig. 20b). The optical color contrast images with bluish 
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Fig. 20 PCC FAB processed using 0.8 mil wire: (a) macro color contrast optical view of Pd 
distribution on Cu FAB top hemisphere, (b) cross-sectioned micro structure, (c-e) typical swirl of 
Pd in Cu FABs 


(bluish-silvery) color on EDX exhibited rich in Pd. Some FAB revealed swirl in bluish 
color, thus indicating Pd segregates in a swirl form (Fig. 20c-e). This swirl reveals a 
possibility of the presence of two immiscible liquid pools (i.e., high volume of molten 
Cu non-wetted with low volume of molten Pd) leading to the composite microstruc- 
ture of swirl Pd in a Cu FAB on solidification. Interestingly, the unequal density of 
molten Pd and molten Cu causes different speeds of motion during roll-up into a ball 
due to gravitational effects, resulting in swirl form. The balance of gravitational force 
and surface tension by the two unequal density immiscible molten pools generate an 
axi symmetrical spherical PCC FAB with composite micro stmcture. 
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Progressive cross-section 
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Fig. 21 Progressive sectioning of PCC FAB processed using 0.8 mil wire 

A progressive sectioning of PCC FAB showed segregation of Pd (bluish area) in 
the outer area of the FAB and is absent towards the inner core of the FAB (Fig. 21). 
Varying FAB processing parameters can improve Pd distribution especially on the 
lower hemisphere of the FAB surface. But a complete (100 %) coverage of Pd 
distribution cannot be achieved (Lin et al. 2012; Tang et al. 2010). With an increase 
in EFO current, bluish layer at the neck is reduced and hardness of the PCC FAB is 
increased, indicating that Pd dissolution into Cu is much improved leading to PCC 
FAB micro structure with less in composite structure. 

On etching the PCC FAB using Cu etchant showed unetched layer along the 
periphery of FAB indicating the dissolution of Pd into Cu to a few micron thickness 
(Fig. 21 E-E). However, this Pd-enriched Cu peripheral layer (solid solution of 
Cu-Pd) of FAB cannot be considered as Pd-coated Cu FAB, which is the demand of 
wire bonding industries. Globally, wire bonding end users aim to get a uniformly 
coated Cu FAB with Pd, which would form a Pd line at the interface between the Cu 
wire bond and Al bond pad. But in reality, Pd segregates at the neck of PCC FAB. 
This constitutes a major disadvantage of PCC FAB leading to a nonuniform 
distribution of Pd on the FAB surface. The bluish segregated Pd-rich area is 
unidentified, whether it is a pure Pd or a solid solution Cu-Pd or a CuPd-ordered 
phase. 
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Silver and Aluminum FAB 

Since the 1970s, researchers have worked on A1 wire and reported that the forma- 
tion of a good FAB is difficult to be attained even if purged using inert (N 2 , Ar) and 
reactive forming gases (Onuki et al. 1986). The reduction of surface aluminum 
oxide layer on Al wire by hydrogen (purging forming gas) may not be effective like 
in the case of copper oxide reduction on Cu wire surface using forming gas. 
Continuous forming of aluminum oxide film and its stability finds it difficult to 
remove in an Al wire to form a good FAB. Furthermore, axi symmetrical Ag FAB 
formation is observed on purging the forming gas. Alloyed Cu, alloyed Ag, and 
Pd-coated Ag wires (Tanna et al. 2012) are also examined to replace Au wires. 


Ball Bonding Process (First Bond) 

The steps involved in first bond process are the following: (a) FAB tip comes into 
contact with Al bond pad with certain contact velocity without damaging the bond 
pad, (b) deformation of FAB, (c) mass displacement of Al on bond pad surface, and 
(d) diffusion bonding of deformed FAB and Al bond pad. During the thermosonic 
bonding, the wire bonds undergo up to 8 pm displacement, which are bound to 
cause heating due to the friction between two surfaces in contact. 

Microstructures of Au and Cu wire bonds reveal no sign of melting at the bond 
interface (Murali 2006). An increase in temperature around 200 °C during wire 
bonding is reported. Perhaps, thermosonic wire bonding of these common metals 
undergoes solid-state reaction. The bonded unit is soaked in 20 % NaOH or KOH 
solution to dissolve the remaining aluminum, and then gold aluminide distribution is 
observed by placing the detached bonded ball in a topsy-turvy position (Fig. 22a). It 
is well known that a uniform distribution of gold aluminide (intermetallic coverage 
(IMC)) of about 85 % is recommended. On thermal aging, a nonuniform IMC would 
lead to an erratic intermetallic growth as shown in Fig. 22b, for Au ball bonds. The 
apparent diffusion of atoms is always a combined effect of lattice diffusion and grain 
boundary diffusion. However, the interface reveals mostly the lattice diffusion with 
intermetallic nucleation along the horizontal direction. In addition to IMC study, 
a good ball bond needs to have the following performance characteristics: 


ry 

(a) Ball shear to be greater than 5.5 g/mil 

(b) Ball pull with neck fracture, without bond pad metal lift or bond pad cratering 

(c) Pass dimensional specification of ball bond concentricity 

(d) Minimal Al splash 

(e) Through cross-sectional analysis, maintains at least 30 % of the remaining Al 
thickness underneath the bonded ball 


The intermetallic nucleation at the Cu wire bond interface is of few nanometers, 
revealed clearly in transmission electron microscopy (TEM) studies (Xu et al. 
2009). While the technique of dissolving bond pad and topsy-turvy Cu wire bond 
to observe the intermetallic coverage at the interface does not provide the informa- 
tion as good as Au wire bond because of the formation of copper aluminide in 
nanometer level. Instead, for Cu wire bond intermetallic coverage observations, it is 
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Fig. 22 Intermetallic formation at the wire bond interface: (a) gold aluminide observed on 
topsy-turvy position of Au wire bond at t = 0, after dissolving the Al bond pad; (b) distribution 
of gold aluminide for low, mid-, and high bonding parameter settings, leading to erratic growth of 
intermetallic s at the interface; and (c) copper aluminide observed on Al bond pad after dissolving 
Cu wire (IP - intermetallic phase; note that Cu wire bond was post thermal treated at 175 °C for 5 h) 


practiced to heat-treat the wire bonds at 175 °C to 250 °C for 30 min to 5 h and then 
dissolve the Cu wires using nitric acid and observe the intermetallic coverage on the 
bond pad (Fig. 22c). Though the intermetallic coverage can be seen on the bond 
pad, strictly the area of coverage cannot be considered for representing the nucle- 
ation of intermetallic (distribution of copper aluminide at t = 0 condition) on 
bonding without any heat treatment. While the intermetallic coverage measures 
the nucleation (on bonding) and growth of copper aluminide on heat treatment, in 
contrast to gold aluminide distribution on nucleation. 

Phonon Interaction Mechanism 

Phonon is a quantized mode of vibration of atomic lattice of a solid, and it plays a 
major role on the physical properties such as thermal, electrical, and sound of solid 
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materials. It is a special type of lattice oscillation with the same frequency as vibration 
movements. Quantized behavior results in wave characteristics, that is, lattice vibra- 
tions (thermal energy) move as a strain wave (elastic wave) at the speed of sound. 
Phonon generation is a dislocation movement and the frequency required for disloca- 

o 

tion movement is about 10 Hz (megahertz range). Ultrasonic energy is in the 
frequency range of 10 -10 Hz, the predominant source of supplied energy during 
wire bonding. Hence, fundamentally, the FAB and bond interface materials may 
experience phonon interactions or effects close to it: softening the FAB for an easy 
deformation on compression, consequently influencing a good interdiffusion at the 
interface and welding the FAB to bond pad. Gold is malleable; hence, it is expected to 
be deformed and flow much more easily than Cu on thermosonic bonding. 

Slips, deformation bands, and grain substructures are evident in Cu wire bonds 
(Murali et al. 2003a). Also, modeling of the wire bonds revealed Cu wire has a 
higher stress to flow than Au wire. Sensitive devices with complex structures 
underneath the pad especially designed and fabricated with fragile low-k materials 
suffer from cratering. Hence, the development of soft Cu wire to produce FAB with 
lower work-hardening rate on bonding is being investigated in order to reduce bond 
pad cratering. 

Stitch Bonding Process (Second Bond) 

After first bond, wire is looped following the trajectory of capillary and the last 
thermosonic sequence is the second bond process. The second bond is also termed 
as “stitch bond” or “wedge bond.” It follows the following steps: (a) the wire scrubs 
the plated finger surface the moment it has been contacted, (b) deformation of wire, 
(c) mass displacement of plated material but less significant than Al bond pad 
displacement, (d) diffusion bonding of deformed wire to plated finger, and 
(e) cutting the wire such that it possesses a consistent tail end leading to a consistent 
FAB diameter of the next cycle. A good stitch bonding needs to have the following 
performance characteristics: 

(a) Stitch pull higher than the specification. 

(b) Fracture of heal area on stitch pull. 

(c) Low in mass displacement of the plating material. 

(d) Through cross-sectional analysis, confirm the intact stitch bonding. 

Stitch bonding using Au wires (2N to 4N purity) to Ag-/Au-/Pd-/Cu-plated 
surfaces is normally good with higher stitch pull strength (Harman 1997; Fan 
et al. 1999; Johnson et al. 1999). The process parameters used for stitch bonding 
may vary with different plated surfaces. For example, Cu wire bonding can be 
bonded using soft 4N Cu wire than standard Cu wire (Fig. 23). The stitch pull and 
second bond process window are wider for soft Cu wire than standard Cu wire. PCC 
wire bonding shows a significant benefit on second bonding than any bare Cu wire 
(soft or standard) with high throughput (number of device bonded in an hour also 
referred to as unit per hour (UPH)). The enhancement of diffusion bonding between 
the plated Pd of the wire to the lead finger material is attributed for the second bond 


672 


S.M.L. Nai et al. 




Fig. 23 Stitch bond process window and stitch pull strength of 4N Cu, standard versus soft 


benefit of PCC wire than bare Cu wire. The consistency of tail end and capillary 
contamination may also be added factors for the betterment of PCC wire perfor- 
mance in the second bonding process step. For this reason, PCC wire has taken a 
place in the bonding wire market since 2010. 

The 4N purity A g, 4N purity Al, and alloyed Ag wires can be bonded to plated 
fingers. Wedge to wedge ultrasonic bonding of Al wire for automotive application 
is another versatile topic which is not covered in thermosonic bonding. 

The supply of ultrasonic energy and compressive force work hardens the stitch 
bond that is similar to ball bond. The behavior is obvious in Cu bonds than Au 
bonds. Hardness of soft Cu wire used in the industries is in the range of 85-95 HV. 
After bonding, the stitch bond possesses 1 10-140 HV. The wires from 0.6 to 6 mil 
diameter reveal the same behavior. Observation of slip bands, micro bands, and 
deformation cells (sub-grains) probably created by the application of ultrasonic 
energy is attributed to the work hardening of ball and stitch bonds (Murali 
et al. 2007; Murali et al. 2003a). 
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Reliability of Wire Bonds 

Basically, reliability of wire bonds is tested in two ways: 

(a) Thermal aging (also referred to as high-temperature storage (HTS)) of the 
unmolded device 

(b) Rigorous accelerated stress test of molded device using epoxy molding com- 
pound (EMC) 

The molded device undergoes reliability tests such as thermal aging, thermal 
cycling, thermal shock, highly accelerated stress test (HAST), temperature-humidity 
test (TH/THB), pressure cooker test, reflow, preconditioning, moisture-sensitive 
level, tested biasing 1 to +5 V supply or unbiased, etc. First bond is more sensitive 
for these tests than second bond. Development of any new wire has to be first tested 
for thermal aging, and once it passed this reliability test, further detailed reliability 
investigations have to be carried out. 

The high-temperature stored unmolded device is evaluated for bond pad 
cratering failure. Moreover, the molded device is tested for electrical resistance 
and I-V characteristic curve. Also, the epoxy mold is de-encapsulated (decapped) 
using fuming sulfuric acid and/or concentrated nitric acid. Epoxy molded device 
with Au wire bonds is decapped manually or by using semiautomated unit with a 
mixture of these two acids, whereas for epoxy molded device with Cu, wire bonds 
are decapped only by using semiautomated unit (Murali and Srikanth 2006). But to 
some extent, the Cu wire and its bond can be dissolved. Currently, combination of 
laser and acid decapping approach has proved that epoxy molds can be decapped 
without damaging the Cu wire bonds. Ag wire bonds and A1 wire bonds are also 
examined by this technique. 

Thermal aging of Au ball bonds has been investigated for the void formation at 
the interface causing the failure, whereas the reliability tests such as HAST, THB, 
and PCT have been studied for the failure of Cu ball bonds due to galvanic 
corrosion. These two topics are associated with the interfacial reaction occurring 
at the ball bond on reliability tests and are discussed below. 


Microstructures and Interfacial Reactions of Ball Bonds 

In the 1970s, voids observed at the Au wire bond interface were considered as a 
major research project. Later it was explained that the voids are Kirkendall type 
created due to atomic diffusion at the wire bond interface. From 1990 to 2000, 2N 
alloyed Au wire (99 % purity) was developed by wire suppliers which could avoid 
Kirkendall voids at the Au wire interface. Studies on the interfacial reaction in a 
wire bond on high-temperature storage are vital to understand the stability and 
reliability of Au wire bonds, especially first bond. The interfacial reaction strongly 
depends on the following factors: 

(a) Properties of the joining metals (similar or dissimilar) 

(b) Testing temperature 
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Table 3 Type of second phases formed for different two-phase mixtures 


System 

Type of second 
phases formed 

Compounds 

Au-Al 

Intermetallics 

AuA 1 2 , AuAl, Au 2 A1, Au 5 A1 2 , A1Au 4 

Au-Ag 

Solid solution (S-S) 

No compound is formed, a homogeneous mixture (Ag, Au) 

Au-Pd 

S-S + ordered phase 

AuPd, Au 3 Pd, AuPd 3 , phases formed for specific 
temperature and composition range 

Au-Cu 

S-S + ordered phase 

AuCu, Au 3 Cu, AuCu 3 , phases formed for specific 
temperature and composition range 

Au-Ni 

Eutectic mixture 

Above 810-955 °C solid solution, below 810 °C Au and Ni 
eutectic structure 

Cu-Al 

Intermetallics 

CuAl, Cu 2 A1, A1 2 Cu, Cu 9 A1 4 , Cu 5 A1 3 

Cu-Ag 

Eutectic mixture 

No compound is formed, Cu and Ag eutectic structure 

Cu-Pd 

S-S + ordered phase 

CuPd, Cu 3 Pd, phases formed for specific temperature and 
composition range 

Ag-Al 

Intermetallics 

Ag 4 Al, Ag 2 Al 

Al-Pd 

Intermetallics 

AlPd, AlPd 2 


Extracted from ASM Handbook (1995) 


(c) Testing time 

(d) Environment 

(e) Area of welding 

Wire bond interface has different combinations of dissimilar metal joints. For 
first bond, more than 90 % of the bond pad materials are Al bond pad. In general, the 
Al bond pad compositions are Al-lwt%Si-0.5wt%Cu, Al-0.5wt%Cu, and pure 
Al. On bonding the four types of wire Au, Cu, Ag, and Al to Al bond pad, one 
can expect the Al-Al wire bond to be the most stable one, but it is not in use 
(because of the difficulty in the formation of FAB). On thermal aging, the other 
three types of wire bonds form intermetallics at the interface (Au-Al, Cu-Al, 
Ag-Al). From the phase diagram, the type of second phases that are formed at 
the interface for different two-phase mixtures is provided in Table 3. 

Abundant studies are available on gold aluminide and void formation at Au-Al 
wire bond interface (Harman 1997; Murali et al. 2004). The wire bonds bonded with 
4N Au wires that are in the range of 20-75 pm diameter revealed void growth for a 
storage longer than 2,000 h at 175 °C. The finer the wire diameter, the earlier the 
bond failure occurs; the failure is bond pad peeling/cratering on ball pull due to 
Kirkendall void growth (Murali et al. 2004). Kirkendall voids observed in Au ball 
bond increase the junction resistance (Maiocco et al. 1990). Fattice interdiffusion is 
predominant over grain boundary diffusion leading to a uniform growth of inter- 
metallics along the entire bond interface (horizontal direction). Voids are always 
observed at the gold-gold aluminide interfacial layer suggesting the interdiffusion 
couple between Au and Al: diffusion of Au is faster than Al. 

Nowadays, on thermal aging, wire bonds using 2N Au showed no bond pad 
cratering on ball pull until 10,000 h of storage at 175 °C. Cross-sectioned analysis 
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also showed no significant growth of Kirkendall voids (Fig. 24a). The diffusion rate 
at the interface is slowed down by an appropriate addition of dopants in the 2N Au 
wire. Thermal aging wire bonds of ultrafine wire in the diameter of 10-15 pm with 
2N Au also showed no ball pull lift up to 5,000 h of storage at 175 °C. According to 
JEDEC standard, a reliable bond has to pass without any bond pad lift failure for 
1,000 h of storage at 150 °C. Industries demand the reliability of wire bond for 
longer storage period at higher temperature than 150 °C such as to test at 175 °C or 

200 °C. 

Wire bonds with 4N Cu are stable on thermal aging unlike Au wire bonds, and a 
very slow growth rate of copper aluminide is formed at the interface. The thickness 
of the intermetallics observed at the interface for gold aluminide and copper 
aluminide is shown in Fig. 24b. For the same thermal aging conditions, growth of 
intermetallics is exponential. For 4N Au, within a few hours, thick gold aluminide is 
formed, while for 4N Cu, a micron thick copper aluminide is formed after 1 ,000 h 
for the same temperature of storage. The growth of copper aluminide behavior of 
PCC wire bond is similar to 4N Cu wire bond. 

Wire bonds with alloyed Ag wire have passed 1,000 h of thermal aging at 
175 °C, and the growth rate of silver aluminide is faster than copper aluminide 
formation in 4N Cu wire bonds (Fig. 24b). Interestingly, of all the tested wires, 4N 
Au, 2N Au, 4N Cu, PCC, alloyed Cu, and alloyed Ag, the alloyed Cu showed the 
slowest growth rate of intermetallic (copper aluminide) formation at the interface 
(Fig. 24b). Even at 250 °C of thermal aging alloyed Cu wire bonds, the growth rate 
is much slower than 4N Cu wire bonds. 

In Pd-doped Au wire (2N Au) bonded to A1 bond pad on high temperature, Pd 
segregates at the interface between Au and Au-Al intermetallic layers. Industries 
are expected to have a similar behavior with PCC wire bonds or alloyed Cu wire 
bonds, but no distinct segregation of secondary elemental addition is observed at 
the interface between Cu and Cu-Al intermetallic layers. 

Mechanism of Interfacial Reaction (Intermetallics/Kirkendall Void Formation) 

Interdiffusion of atoms plays a crucial role on the formation of intermetallics. The 
physical, thermal, and atomic properties of the joining metals, especially the 
vacancy-solute binding energy and the atomic radii, have a significant influence 
in the interdiffusion of atoms (Murali 2006; Murali et al. 2003b). Mobility of atoms 
in the substitutional solid solution strongly depends on the atomic radii of the 
joining elements. With regard to Au-Al, Au-Ag, and Ag-Al systems, all the 
three elements have the same atomic radii revealing an absence of lattice mismatch 
and thus providing an unstrained crystal lattice. Therefore, there may be a free flow 
of atoms without any hindrance assisted by vacancy mechanism (Murali 2006; 
Murali et al. 2003b). In other combinations such as Cu-Al, Au-Pd, Cu-Pd, and 
Au-Cu of metallic bonding, in spite of similar prevailing vacancy mechanism, 
difference in atomic radii leads to striving of mobility of atoms, which is expected 
to hinder the growth of intermetallics on thermal aging. The following points are 
discussed on the interdiffusion of atoms of different bonding systems (Murali 
2006): 
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Fig. 24 Thermal aging of bonding wires: (a) cross-sectioned micro structure of 2N Au wire bond 
with absence of Kirkendall voids at the interface even after thermal aging for 10,000 h at 175 °C 
and (b) average intermetallic (aluminide) thickness at the wire bond interface of various fine 
bonding wires bonded to Al bond pad 
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• In Au-Al system, higher vacancy-solute binding energy of gold-vacancy 
pair (Murali 2006; Murali et al. 2003b) and strain-free lattice aid boundless 
atomic mobility, causing a thorough mixing of atoms. In addition, difference 
in electronegativity between Au and Al atoms is higher than other systems. 
Hence, the ionic character of the bond is high, favoring the formation of 
intermetallics. 

• Effortless interdiffusion can occur due to strain-free lattice of Ag-Al system. 

• During bonding Cu wire to Al bond pad, lower degree of vacancy-solute binding 
energy and higher percentage of lattice misfit hinder the mobility of atoms, thus, 
resulting in the absence of formation of intermetallics (t = 0). 

• Similarly in Cu-Au system, due to severe lattice misfit, formation of ordered 
phases may be difficult. 

• In Cu-Pd system, the vacancy mobility is equivalent to Au-Al system. Differ- 
ence in atomic radii between Cu and Pd causes lattice misfit and reduces the 
mobility. In addition, both elements are stiff and more ultrasonic power is 
required to deform and bond. 

• High lattice misfit, low vacancy-solute binding energy, same valence, and 
electronegativity of Cu and Ag lead to the formation of diffusion bonding with 
eutectic mixture. 

• Same atomic radii and valence of Au and Ag and its closer electronegativity 
values lead to the formation of solid solution on mixing and so in the wire bond. 


The concepts of vacancy mechanism and atomic strain-free lattice with effort- 
less mobility of atoms/vacancies are attributed to the growth of the intermetallics 
and voids. Therefore, in order to slow down the growth of intermetallics and 
formation of Kirkendall voids, the rate of interdiffusion has to be reduced (Murali 
2006). This can be achieved by adding elements with a large difference of atomic 
radii to the bonding metals. For example, Y, Sn, Zr, In, Cd, Ni, Cr, Fe, Mn, and Si 
can be added to bond pad Al metallization or bonding wire. Addition of the 
mentioned elements either individually or combined may effectively reduce the 
rate of interdiffusion at the interface. 


Galvanic Corrosion in Cu Ball Bonds 

In the last 2 years, rigorous research work has been carried out on reliability of bare 
Cu wire bond in semiconductor packaging industries. So far the key observations 
are Cu wire bonded to Al bond pad (Al, Al-0.5Cu, Al-lSi-0.5Cu) using BGA 
substrate and tested for HAST (130 °C, 85 %RH, biased and unbiased), and THB 
(85 °C, 85 %RH, biased and unbiased) tests revealed failure in short period (less 
than 100 h). Industries require a stringent criterion to pass as long as 500 h of 
BHAST and THB tests, for instance, biased at +3 to +5 V. The epoxy mold 
compound manufacturers reported (Su et al. 2011) Cl content and pH of the EMC 
are the reason for de-bonding of Cu ball bond from the Al bond pad causing open 
circuit. They also mentioned that Cl ions could be diffused from BGA substrates, 
especially containing high Cl ions of about 60 ppm compared to a green mold 
compound containing less than 20 ppm. 
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It is well known (Fontana 1987) that when two dissimilar metals are 
welded together and exposed to corrosive environment (Cl - , Br - , F - , SO~, NOT, 
OH - , etc.), an electrochemical galvanic corrosion is unavoidable. The rate of 
corrosion strongly depends on: 

(a) Welded dissimilar metals 

(b) Area of welding 

(c) Source of the corrosive medium and its strength 

(d) External bias and its current density and polarity 

Comparing the EMF series (Fontana 1987) with reference to hydrogen elec- 
trode, Cu welded to aluminum has 2 V potential difference. If corrosive environ- 
ment does exist, an electrochemical corrosion could occur at the welded joint 
interface. In a molded device with Cu wire bond, various interfacial layers can be 
observed: 

(a) Cu and Al interface (first bond) 

(b) Al bond pad in contact with EMC 

(c) Cu wire surface in contact with EMC 

(d) Cu and Ag/Au/Pd/Cu interface (second bond) 

Among the interfacial layers, the Cu and Al interface is expected to corrode 
under galvanic conditions than the rest. Any corrosion reaction is electrochemical 
in nature and needs a path for electron to flow, since EMC is an insulator and its 
interface in contact with Al bond pad or Cu wire may not corrode without an 
electron flow path. The second bond interface, welding Cu wire to noble metals 
(Ag/Au/Pd) or Cu, may have less significance on galvanic corrosion. Where Au/Ag/ 
Pd/Cu belongs to cathodic EMF series (unlike Al in anodic series), therefore, the 
EMF potential difference is negligible between Cu and noble metals (Fontana 

1987). 

In this concern, the PCC wire which was extensively examined with an antici- 
pation to solve the galvanic corrosion problem of Cu wire bond failed due to 
inhomogeneity of Pd distribution on the FAB surface (bond interface). The confi- 
dence level of PCC wire bond on BHAST is still in the evaluation stage. Moreover, 
the alloyed Cu wire is examined on these grounds. Table 4 provides BHAST and 
HTS performance of a molded device with alloyed Cu wire bond. The test was 
conducted in collaboration with Sumitomo Bakelite Co., Ltd. using 16pSOP device, 
molded with G-700 series EMC, and biasing +20 V. Several industries have keen 
interest on the alloyed Cu wire, since the corrosion potential of the wire is much 
better than the bare Cu wire. High ball shear 11-13 g/mil for 0.8 mil bare and PCC 
Cu wires reveal survival of wire bonds for longer periods on BHAST compared to 
bonded with 9-10 g/mil . This factor of Cu wire bonds with high ball shear is now 
examined for alloyed Cu wire. 

In addition to Cl ions, SO x ions can corrode Ag wire bonds dramatically if they 
exist in the surrounding environment. EMC manufacturer claim that the presence of 
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Table 4 Reliability of alloyed Cu wire bond 




Electrical 
resistance, ohm 

Results 



0.6 mil 

0.8 mil 

— 

HAST 

R 0 @ HAST, t - 0 (+20 V bias, 130 °C, 85%RH, 
16pSOP, G-700 series EMC molded) 

1.329 

0.913 

— 

Ri68 @ HAST, t 168 h (pass if Ri 68 < 1.2R 0 ) 

1.348 

0.914 

Passed 

HTS 

R 0 @ HTS, t - 0 (16pSOP, G-700 series EMC 
molded) 

— 

0.932 

— 

R 336 @ HTS, t 336 h, (pass if R 336 < 1.2R 0 ) 

— 

0.965 

Passed 


sulfur (S) is less than 10 ppm in green mold compound. Studies on galvanic 
corrosion of A g wire bonds due to Cl ions and SO x ions are utmost important to 
qualify Ag wire; the topic is under evaluation with high priority. 


Future Trends 

Future thermosonic wire bonding will see continual decrease in the use of Au wire 
due to the high precious metal price, and industry is continuously converting this 
material to lower-cost alternatives like Cu, be it bare or coated for oxidation 
protection. There will be increasing use of Ag or Ag alloy wires also for applica- 
tions that Cu wire is unable to replace Au such as in LED which saw application in 
low-end device. For IC devices like memory chip, there has been evaluation and 
qualification ongoing with Ag alloy wires in areas where cost is less critical. Based 
on industry prediction, the demand for Au wire will reduce to less than 50 % of total 
bonding wires used (Fig. 25). Majority of wire material will still be Cu based and 
with complex configuration in composition for bare Cu and plating technology for 
coated wire. Silver alloy wire will see higher growth in low-end LED devices in the 
next few years but not for high-end devices until more confidence is built up on its 
reliability. Industry will continue to evaluate the use of Ag wire for high-speed 
memory device such as DDR4 in which Cu wire is not able to meet the application 
requirement. The composition of Ag alloy wire will continue to evolve for different 
application needs if cost is not a critical requirement. 

Size of wire is also getting finer for high-density packages moving from 20 pm 
(0.8 mil) in 2008/2009 to 18 pm (0.7 mil) in 2010/2011 and even 15 pm (0.6 mil) 
beyond 2012. Even finer diameters of 12 and 10 pm are being evaluated for future 
product and applications. 

The bond pad and substrate materials are also moving more and more to 
nonaluminum type such as Au, Pd-Au, Ni-Pd, and Ni-Pd-Au for better first 
bond reliability while that of metal substrate for second bond will see more usage 
of pre -plated lead frame (PPF) which has Au on Pd and Ni layers. For major cost 
savings, bare Cu lead frame is already been evaluated but not with great challenges. 
On organic substrates, continual development of Au plating technology has been 
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Fig. 25 Graphical illustration showing the demand trend of different bonding wires from 2010 
to 2015 


ongoing to achieve better and more robust stitch pull strength and reliability. 
ENEPIG, or electroless Ni, electroless phosphorus (P) immersion Au plating, is 
the key to future BGA substrate. 

With more variety of wire and bond surface material, the interfacial reaction 
between wire and bond surface is becoming more complicated, and studies needed 
to be carried out to understand the intermetallic compound formation and growth 
between different alloying composition of Cu and Ag wire with bond pad material 
(i.e., first bond) and how their reliability performance can be compared with 
conventional high-purity Au and Cu. The second bond reliability must also be 
studied due to less well-known interfacial reaction between new wire material and 
Ag or Au on bond fingers. 
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Summary 

This chapter begins with an overview of the diffusion bonding mechanisms which 
are instrumental to the solid-state diffusion bonding process. The key bonding 
conditions which influence the formation of a strong and reliable diffusion joint 
are also discussed. Furthermore, the advancements in thermocompression bonding 
and thermosonic ball-wedge bonding are presented. In essence, the information 
provided in this chapter reflects that the solid-state microjoining technologies and 
the joints’ performance are closely related to the type of bonding materials (such as 
the interconnect bumps and bonding wires) and their surface characteristics. 


Acknowledgement For section “Advances in the Manufacturing of Thermosonic Ball-Wedge 
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Abstract 

Joining is an integral part of manufacturing. Microjoining and macrojoining have 
been widely used to join metallic and polymeric parts. The parts are often joined via 
welding. The process often involves melting the parts and adding a filler material 
that resolidifies to form a strong joint. The energy source to melt the parts can be a 
laser beam, an electron beam, friction, or ultrasound. Sometimes pressure is applied 
to enhance the process. Brazing and soldering are also processes commonly used in 
metal joining. As nanomaterials become more prevalent, nanojoining gains impor- 
tance. Nanojoining facilitates the assembly of nano-sized building blocks to form 
practical products. It can also involve the use of nanomaterials to assist joining in 
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bulk materials. In this chapter, the unique properties of nanomaterials owing to the 
extremely small dimensions and joining of nanoparticles and nanowires via laser 
irradiation, solder reflow, application of energetic particles such as an electron 
beam or current, and other methods are discussed. The use of nanomaterials that 
assist joining of bulk materials by reducing the joining temperature is also 
reviewed. The addition of nanoparticles and nanotubes can also enhance the 
properties of the composites formed. 

Introduction to Nanomaterials 

Nanomaterials are stmctures with at least one dimension less than a few hundred 
nanometers. This class of material offers unique features unseen in bulk materials due 
to the quantum characteristics of its properties and its large surface area to volume ratio. 
Quantum effects are produced because electrons are confined in the small dimensions of 
the nanomaterial. Such quantum effects can influence the band stmcture of the material 
and alter the electrical and optical properties of the nanomaterial. The surface area to 
volume ratio increases as the dimensions of a nanomaterial decrease. This property is 
desired in catalytic chemical reactions which occur at surfaces. A given mass of 
nanomaterial will be significantly more reactive than similar bulk material of the 
same mass. Nanomaterials can be broadly classified into three groups: zero-dimensional 
(OD) systems, one-dimensional (ID) systems, and two-dimensional (2D) systems. 

OD systems are nanoscale in all three dimensions. Semiconductor quantum dots 
(e.g., CdSe, InAs, and InP) and metal nanoparticles (e.g., silver nanoparticles and 
platinum nanoparticles) are some examples of OD systems. They can be produced 
via various techniques. One common technique used is wet chemistry. ID systems 
are nanoscale in two dimensions and extended in one dimension. Common exam- 
ples of ID systems include inorganic nanowires (e.g., Si nano wires, GaAs 
nanowires, Ti0 2 nanowires, and ZnO nanowires) and carbon nanotubes. They are 
usually synthesized via physical vapor deposition techniques (bottom-up approach) 
or controlled etching of the bulk material (top-down approach). 2D systems which 
are nanoscale in one dimension and extended in two dimensions are often known as 
nanomembranes. Besides graphene, these systems also comprise of semiconductor 
nanomembranes (e.g., MoS 2 , Sr 2 Nb 3 O 10 , GeS, Si, and GaAs). Nanomembranes can 
be formed by mechanical exfoliation from solids that have naturally layered 
structures (e.g., MoS 2 , Sr 2 Nb 3 O 10 , and GeS), chemical vapor deposition (e.g., 
graphene), or release from an underlying sacrificial layer in a cleverly designed 
solid structure (e.g., Si on insulator and GaAs on AlAs; Rogers et al. 2011). 


Mechanical Properties of Nanomaterials 

Nanomaterials offer mechanical features unavailable in bulk materials. The 
strength of crystals increases as their grain size decreases. Hence, nanomaterials 
(i.e., zero-dimensional, one-dimensional, and two-dimensional materials) exhibit 
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higher strength than bulk materials with coarse grains. This is an effect of grain- 
boundary strengthening (Hansen 2004; Pande et al. 1993; Zhao et al. 2003). A grain 
boundary can effectively impede the propagation of dislocations because more 
energy is required to change the direction of a moving dislocation or move the 
dislocation into an adjacent grain. As more dislocations propagate to the boundary, 
the dislocations accumulate at the boundary and dislocation pileup occurs. This sets 
up a driving force to reduce the energy barrier for diffusion across the boundary. 
Eventually, the dislocations diffuse across the grain boundary and deformation in 
the material results. Decreasing the grain size of the crystals in a material decreases 
the dislocation pileup at the boundary. Hence, the dislocation propagation is 
impeded by grain boundaries which hinders the onset of failure and increases the 
strength of the material. The relationship between the strength and grain size of 
crystals can be described by the Hall-Petch relation: 


o 


y 


< 7 () + 


k 

\fd 


where o y is the yield stress, <j 0 is the material constant that describes the resistance 
of the crystal lattice to dislocation motion, k is the material strengthening coeffi- 
cient, and d is the average crystal grain diameter that make up the material. 

Another interesting mechanical property of nanomaterial, specifically 
nanomembrane, is its mechanical flexibility. Although a specific bulk material is 
brittle, its nano-size counterpart, a nanomembrane of similar material, can be bent 
(Khang et al. 2006). A thin silicon nanomembrane can have a flexural rigidity that is 
more than 1 5 orders of magnitude smaller than its bulk wafer counterpart (Rogers 
et al. 2011). Besides mechanical flexibility, the energy release rate of a 
nanomembrane is very small because energy release rate for thermally driven 
delamination is found to scale linearly with thickness (Rogers et al. 2011). 
Hence, a nanomembrane with very low energy release rate can conform and bond 
strongly to any material. A strong bond between a 200 nm-thick Si membrane and 
bulk Ge wafer was demonstrated (Kiefer et al. 2011). Although Ge and Si have very 
different thermal expansion coefficients, no delamination or fracture was observed 
after a temperature change beyond 350 °C. This provides many exciting opportu- 
nities for integration of various materials to form novel heterogeneous systems. 


Electrical Properties of Nanomaterials 

The electrical properties of nanomaterials differ significantly from bulk materials. 
The size dependence of electrical resistivity of metals is complex because it is based 
on various mechanisms. The electrical resistivity of metals can be described by 
Matthies sen’s rule: 

P=Pl+ Pd 

where p is the net electrical resistivity, p L is the electrical resistivity caused 
by scattering by thermal phonons, and p D is the electrical resistivity caused by 
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Fig. 1 Density of states per unit volume and energy for (a) a 3D bulk material, (b) a 2D 
nanomembrane, (c) a ID nano wire/nanotube, and (d) a OD quantum dot 


scattering by defects that perturb the periodicity of the lattice (Kittel 2005). As the 
electrical resistivity contributed by the individual scattering mechanism is inversely 
proportional to the respective mean free path between collisions, the Matthiessen’s 
rule can be re-expressed as: 


1 


A A 


1 1 

- + 


L 


A 


D 


where A is the mean free path. Decreasing a material’s dimension has two effects. 
The first effect is a reduction of defects which reduces the defect scattering and 
hence electrical resistivity. However, this makes a negligible contribution to the net 
electrical resistivity of metals at room temperature. The second effect of a smaller 
dimension is the additional contribution to electrical resistivity due to surface 
scattering (Hensel et al. 1985; Zhang et al. 2004). When the nanomaterial’s critical 
dimension is smaller than the electron mean free path, the electrons are scattered by 
the surface of the nanomaterial. 

In nanomaterials, the quantized electronic states determine their electrical prop- 
erties. The quantized electronic states can be described by first understanding the 
density of states (i.e., the number of states per interval of energy at each energy 
level that are available to be occupied by electrons) in nanomaterials (Kittel 2005). 
The density of states varies between bulk materials, 2D systems, ID systems, and 
0D systems, as shown in Fig. 1. Parts a, b, c, and d of Fig. 1 show the density of 
states in a 3D bulk material, 2D system, ID system, and 0D system, respectively. 
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Fig. 2 Density of states for a 
carbon nanotube as a function 
of energy. Ell and E22 
denote the energy 
differentials between two van 
Hove singularities, cl and vl 
and c2 and v2, respectively 
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The density of states decreases to zero at low energy in a 3D bulk material 
and increases by a constant value at each subband in a 2D nanomembrane. 
However, in a ID system, the density of states diverges at each subband threshold 
(Deshpande et al. 2009). These are known as van Hove singularities and affect the 
electrical and optical properties of ID systems (which will be discussed later). 
Meanwhile, in a 0D system, there are discrete energy levels which are reflected in 
the quantization phenomena of their electrical and optical properties (which will be 
discussed later). 

The van Hove singularities in ID systems are reflected in the band structure of 
ID systems such as carbon nanotubes. When measuring the tunneling conductance 
of a carbon nanotube using scanning tunneling spectroscopy, peaks in the differ- 
ential conductance are observed at the bias voltages that correspond to the energy 
levels of the nanotube’s van Hove singularities (McEuen 2005). Besides influenc- 
ing the electrical properties, the van Hove singularities also dominate the optical 
absorption and emission of carbon nanotubes. Optical transitions occur between 
two van Hove singularities where the initial and final densities of states are high, as 
shown in Fig. 2. 

A ID system has a finite conductance even when the channel is so short that 
there is no scattering in the channel. This is due to the presence of quantized contact 
resistance. The resistance quantum Rq is: 

R 0 = 2L = 12.906k£2 

J 2e 2 

where h is Planck constant and e is the elementary charge carried by an 
electron (McEuen 2005). The resistance quantum is the result of a finite density 
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Fig. 3 Bandgap of PbS as a 
function of its diameter. 
Square symbols represent 
experimental data. Cross 
symbols represent 
calculations based on the tight 
binding model. Dashed line 
shows calculations using 
effective mass 
approximation. Solid line is 
derived from a hyperbolic 
band model 
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of charge carriers moving across the interface between the ID channel and an 
electrode. When scattering occurs in the ID channel, the overall resistance for 
the ID channel is: 


h l_h 3+ (l- 3) _h h 
~2^3~2^ 3 “ + 

where 3 is the probability for electron transmission through the channel and 
= 1 — 3 is the reflection coefficient (McEuen 2005). The first term in the last 
part of the above equation represents the quantized contact resistance as mentioned 
earlier in the text, while the second term represents the resistance due to scattering 
from barriers in the channel. The second term is zero in a perfect conductor without 
any scattering and this reduces the overall resistance. 

In 0D systems, the density of states is quantized, as mentioned earlier in the text. 
The electron states in the conduction band and the hole states in the valence band 
are also quantized. Hence, the emission and absorption energies in 0D systems are 
discrete, which is similar to the ID systems (Li et al. 2001). According to quantum 
mechanical models, the lowest electron state increases in energy and the highest 
hole state decreases in energy as the size of the 0D system decreases. Hence, the 
energy bandgap increases. An example is shown in Fig. 3. The bandgap of PbS 
increases as its diameter decreases (Wang and Herron 1991). The experimental data 
observed matches closely to the calculations based on tight binding model. 

Besides a tunable bandgap via size control, 0D systems have many other 
interesting electrical properties. One of which is a phenomenon known as Coulomb 
blockade. As described earlier, 0D systems have discrete charge states. The energy 
difference between successive charge states can be very large owing to the coulomb 
repulsion between the electron already present in the system and the electron added 
to the system. When a 0D system (i.e., quantum dot or nanoparticle) is in electrical 
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contact with a metallic electrode, electrons can tunnel into the OD system until the 
electrochemical potential for adding another electron to the OD system exceeds the 
electrochemical potential of the metallic electrode. When transport of electrons into 
the OD system is suppressed because the Fermi level of the electrode lies below the 
electrochemical potential for adding another electron into the OD system, this is 
known as Coulomb blockade (McEuen 2005). The electron can be added to the OD 
system once a gate voltage is applied to shift the electrochemical potential for 
adding another electron lower. Hence, with increasing gate bias, more charge states 
in a OD system can be occupied. This results in Coulomb oscillations in the current 
of OD systems as a function of gate bias (Fuse et al. 2003). The operation principle 
behind a single electron transistor is based on this phenomenon. This transistor can 
be used as an ultrasensitive electrometer. 


Thermal Properties of Nanomaterials 

The larger surface area to volume ratio in nanomaterials results in increased phonon 
scattering at the surfaces and interfaces. Hence, the thermal conductivity of 
nanomaterials is lower when the characteristic length scale in nanomaterials is 
smaller than the phonon mean free path (Tang et al. 2010; Yu et al. 2010). Figure 4 
shows the thermal conductivities of two nanomesh films and three reference films 
as a function of temperature (Yu et al. 2010). The two nanomesh films (labeled NM 
and represented by diamond symbols) show obviously lower thermal conductivity 
than the other three reference films (labeled TF, EBM, and NWA and represented 
by circle and square symbols). 

Besides lower thermal conductivity, ID or 0D systems have the unique property 
of quantized vibrational modes. As an example, carbon nanotubes have various 
distinctive vibrational features. A prominent vibrational feature of carbon 
nanotubes is the radial breathing mode (RBM). The vibrational frequency of the 
radial breathing mode in carbon nanotube measured via Raman spectroscopy can 
provide insights to the radius of the carbon nanotube excited. 

Finally, like the electrical conductance in ID channel mentioned in the section 
earlier, the thermal conductance in ID channel is determined by fundamental 
constants and the absolute temperature when there is no phonon scattering in the 
ID channel. The thermal conductance per ID phonon subband is: 

(i D ) n 2 k\T 

G ‘ h "^/T J 

where k B is the Boltzmann constant, T is the absolute temperature, h is the Planck 
constant, and 3 is the transmission probability for phonons through the structure 
(McEuen 2005). 

The unique and interesting properties of nanomaterials can be exploited in novel 
applications when cleverly engineered, making the field of nanomaterials an excit- 
ing area to explore. 


692 


X. Qi et al. 


Fig. 4 Thermal conductivity 
of two nanomesh devices 
(represented by diamond 
symbols and labeled NM), an 
array of nanowires device 
(represented by square 
symbols and labeled NWA), a 
thin him device (represented 
by solid circle symbols and 
labeled TF ), and a large 
feature size mesh 
(represented by open circle 
symbols and labeled EBM ) as 
a function of temperature 



Temperature (K) 


Nanojoining of 0-2-Dimensional Nanomaterials 


Nanojoining, which is quite new compare to microjoining, is studied mostly on 
research and university communities. So far, there is still no standard way to define 
nanojoining. This chapter mainly focuses on nanomaterial. Therefore, the 
nanojoining will be introduced based on various dimensions of nanobuilding 
blocks. 


Joining of Nanoparticles 

Joining of Single Metal Component Nanoparticles 

Since nanomaterials have their unique size-dependent properties apart from the 
bulk material, new joining properties were shown in the nanosystems. 
Nanoparticles are known as those ultrafine particles sized between 100 and 1 nm. 
In the bottom-up fabrication of nanostructure and nanodevices, welding of metals at 
the nanoscale is likely to have a very important role. Au and Ag nanoparticles, as 
the most typical single metal component nanoparticles, have been widely investi- 
gated. Ag nanoparticles, as reported, are able to sinter together at room temperature 
through a simple process (Wakuda et al. 2007). In that method, bare Ag 
nanoparticles were produced by removing dodecylamine through rinsing Ag 
nanoparticle-coated glass substrates in methanol for 10-7,200 s. The Ag 
nanoparticles started to coalesce as early as 30 s after dipping in the methanol 
(Fig. 5a, b), and coalesce increased with increasing dipping time (Fig. 5c, d). Clear 
connections of the coalesced nanoparticles were found in Fig. 5b. After sintering, 

nr 

the resistivity of the Ag nanoparticle film was lowered to 7.3 x 10 Om. On the 
other hand, for the purpose of controllable coalesce and stable storage of Ag 
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Fig. 5 (a-b) TEM images of the nanostructural changes of Ag nanoparticles in the (a) initial state 
and (b) coalescing of nanoparticles after dipping in methanol for 30 s; (c-d) FESEM images of 
micro structural changes in Ag nanoparticles in the (a) initial state and (b) coalescing of 
nanoparticles after dipping in methanol for 3,600 s 


nanoparticles, different types of materials were coated on the surface of Ag 
nanoparticles, targeting various sintering temperature. Controllable sintering at 
300 °C was achieved by coating organic shell on 11 nm Ag nanoparticles (Ide 
et al. 2005). And the sintering temperature can be tuned to 160 °C or even 100 °C, 
when an organic shell of a mixture of Ag 2 C0 3 and citrate was coated on the bare Ag 
nanoparticles (Hu et al. 2010). These developments are quite promising for flexible 
electronics packaging on plastic substrates. 

Besides thermal sintering, nonthermal processing was also applied in the single- 
component metal nanoparticle systems. It was reported that, the Au nanoparticles 
could be welded together with 100 f. laser pulses (Hu et al. 2009). This welding was 
able to protect the bulk materials by providing welding only on surface with depth 
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Fig. 6 (a-b) TEM images of 
the mixture of (a) bare Au and 
Pt nanoparticles and (b) 3D 
network of Au-bridged Pt 
nanoparticles after laser 
irradiation, (c) Schematic 
illustration of the 
corresponded 3D network 



20 nm 20 nm 


in nanoscale. Moreover, mixture of Ag and Pt nanoparticles were easily welded 
together when irradiated by nanosecond pulsed laser (Mafune et al. 2003). As can 
be seen from Fig. 6, the Au nanoparticles were melted to bridge Pt nanoparticles 
forming 3D network. 

Joining of Multicomponent Nanoparticles 

Multicomponent nanoparticles, a big group of nanoparticles, are also involved in 
the precise control joining in nanorange. As reported, individual CuO nanoparticles 
were controllably constructed into 3D ordered structure by using polyphenylene 
dendrimers (G2Td) as template and bridge (Qi et al. 2009). Figure 7a shows the 
SEM and TEM images of the final polymer-joined 3D ordered structure of CuO 
nanoparticles (RSA). An RSA having the length of 210 =b 40 nm and the width of 
96 =b 30 nm is observed (Fig. 7a). In addition, the TEM image (Fig. 7b) reveals that 
these RSAs are composed of numerous small primary CuO NPs with an average 
diameter of 6.2 =b 0.4 nm (Fig. 7c). These primary CuO NPs are separated from 
each other with an inter-particle distance of ~3 nm that is close to the size of G2Td 
(Zhang et al. 2000). Noteworthy, irradiation of nanoparticles under energetic 
particles, electron, for example, is used not only for structure monitoring but also 
for creating joining in nanorange. The high-energy electron beam of TEM was used 
as the external force to in situ join the individual CuO nanoparticles and induce 
structural transformation between CuO and Cu 2 0 (Fig. 8; Qi et al. 2010). Interest- 
ingly, the intermediate structural transformation from the coexistence of CuO and 
Cu 2 0 to pure Cu 2 0 was monitored by the TEM and associated with FFT. The 
phases (CuO and Cu 2 0) are proven by indexing and comparing these observed FFT 
patterns with the typical patterns represented in Fig. 8b', c'. As illustrated in Fig. 9, 
the phase transformation from CuO to Cu 2 0 under the electron beam irradiation 
happens gradually; the coexistence of CuO and Cu 2 0 phases is observed in Fig. 9a, 
a' at exposure time of 44 min. With increasing the irradiation time at intervals of 
2 min, the FFT patterns coming from the CuO phase progressively disappear from 
Fig. 9a / -f / . Finally, the phase is totally transferred into the pure Cu 2 0 phase 
(Fig. 9f, t = 54 min). 
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Fig. 7 (a) SEM images of G2Td-CuO RSAs. (c) TEM image of a G2Td-CuO RSA with clearly 
view of separated CuO nanoparticles bridged by G2Td. Inset shows the corresponding SAD 
pattern along the (001) axis 



Fig. 8 (a) TEM image and (a') SAD pattern of original G2Td-CuO RSA. (b, c) HRTEM image of 
the selected area of the G2Td-CuO RSA before (b) and after (c) 70 min irradiation, (b', c') The 
corresponding FFT of image b with identifiable CuO SAD pattern (b r ) and identifiable Cu 2 0 SAD 
pattern (c') 
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Fig. 9 A series of in situ TEM images (a-f) and corresponding (001) FFT patterns (a'-f ) taken 
from the same area of G2Td-CuO RSA, starting from the exposure time at 44 min to 54 min, 2 min 
interval, (g) Magnified moire pattern from Fig. 4-1 8a. (g') Simulated moire pattern along the (001) 
axis, obtained from the superposition of CuO (001) lattice structure (h) and Cu 2 0 (001) plane 
lattice structure (i) 
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Fig. 10 Schematics of different cold- welding geometries: (a) head to head and (b) side to side for 
ultrathin gold nanowires; (c) TEM image of two separated nanowire moving toward each other 


Joining of Nanowires 


Joining of Single-Component Metal Nanowire 


Nanowires, which are the structure of nanomaterial that have a thickness or 
diameter constrained to tens of nanometers or even less and an unconstrained 
length, are one of the most studied ID nanostructures. It has shown great promise 
in plenty of applications, including electronics, sensors, optics, and biomedical 
devices (Shi et al. 2001; Zhan et al. 2005). Single metal component nanowires, such 
as Au, Ag, and Pt nanowires, were widely investigated in the nanojoining and nano- 
interconnection. Besides the common joining method through heating, melting, 
and solidification, ultrathin Au nano wires were reported to be joined together 
within seconds by mechanical contact alone at room temperature (Lu et al. 2010). 
There are several possible welding geometries for nanowires to contact each other, 
and “head-to-head” and “side-to-side” joining procedures were performed in 
Fig. 10a, b. A tungsten or gold STM probe was used to manipulate the movement 
of one Au nanowire toward another (Fig. 10c). The head-to-head joining orientation 
was monitored by TEM (Fig. 11). The alignment of the two nano wires was 
continuously adjusted by the STM probe, until they approached each other head 
to head (Fig. 11a, b). As shown in Fig. lie, once the two Au nanowires connected, 
they were welded instantly within 1.5 s. After welding, the crystal structure at the 
junction continuously finalize itself till it was in a free standing state when the STM 
probe was retracted (Fig. lid). However, this perfect cold welding happened in 
single crystalline Au nanowires driven by oriented attachment and assisted by 
surface-atom diffusion, and it is too critical for industrial applications. Thus, 
other methods about welding nanowires through providing exterior energy are 
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Fig. 11 TEM images of the head-to-head welding of two Au nanowires, (a-b) Alignment 
procedure to optimize the contact of the nanowires in their front surfaces, (c) The completion of 
welding process within 1.5 s. (d) Crystal structure finalization of the connection through structure 
relaxation. Scale bars, 5 nm 


considered to be more operable. Fig. 12a, b shows a setting in the SEM chamber for 
the head-to-head welding of two Pd nanowires by passage of a welding current 
(Tohmyoh and Fukui 2009). The joint of the two Pd nano wires was formed by the 
melting and fusion of the Pt at the tips, driven by the Joule heating provided by 
the welding current (Fig. 12c, d). Welds driven by Joule heating were also demon- 
strated in dissimilar metals, such as the formation of heterojunctions through 
welding Pt nanowires to Au wire (Tohmyoh et al. 2007). 

Joining of Multisegmented Metal Nanowires 

The fusion and welding manipulated and observed in SEM and TEM display 
remarkable success in joining nanoparticles and nanowires. However, these methods 
suffer various inextricable limitations, such as high equipment cost and slow 
processing ability, which make them less attractive in real industry applications. 
Thus, new processes, which were inspired by the soldering process in the 
microjoining, were developed. Multisegmented metal nanowires (such as Sn/Au/ 
Ni) were fabricated. Within which, the low-melting-point soldering materials, such 
as tin, nickel, and indium, were added as segments in the nanowires and acting as 
solder to join individual multisegmented metal nanowires. Building nanowire-based 
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Fig. 12 (a) Photograph of the setting for welding two thin Pt nanowires, which is a SEM holder 
with three-axis piezoelectric manipulators, (b) SEM images of two Pt nanowires with lengths 
1 1 and 12 to be joined. Inset shows the morphology and environment of the tips of Pt nanowires, 
(c-d) In situ SEM image of the Pt nanowires before (c) and after (d) welding 


device will be very simple, by simply infrared heating or solder reflow. The 
nanowires soldered together with various structures, as shown in Fig. 13, because 
of the coalescence of the melted solder and their quick solidification when cooling 
(Gu et al. 2004; Cui et al. 2009). Besides the randomly dispersed and soldered 
nanowire networks, large-scale ordered structures were also demonstrated, induced 
by self-assembly or directed assembly techniques. For example, a magnetic field 
was used to assist the assembly technique (Hongke et al. 2006). Typically, a piece of 
patterned silicon substrate with a few drops of ethanol suspension of nanowire on top 
was placed in a magnetic field for 20-30 min to fabricate the well-orientated 
nano wires. Moreover the concentration of the nano wires in between the pads was 
controllable by tuning the concentration of the nanowire suspensions. Therefore, an 
array of Ni pads would be easily connected by a liner chain of assembled nano wires. 

Joining of Carbon Nanotubes 

Carbon nanotubes (CNT), which are a series of carbon based nanowires, take a big 
part and role in the nanojoining of related structure and device fabrication. Similar 
to metal nanoparticles and nano wires, CNT can be joined by the high-energy 
particles from TEM (Fig. 14; Terrones et al. 2002). Under electron beam irradia- 
tion, the carbon atoms were irradiated and the tube structure was reorganized, so 
that the two CNT merged together at the overlapping point. High-energy particle 
can also join CNT together without destroying its structure at the junction; instead 
solder will be used. As shown in Fig. 15, amorphous carbon was used to solder two 





700 


X. Qi et al. 



Solder 



NW 1 X) NW 2^ 


Solder 


NW 


Solder 



Solder 


NW 2 j 


NW Assembly 

-4 

Solder reflow 







Fig. 13 (a) Schematics of the possible formation of assembled and interconnected nanowires 
using multisegmented nanowires, (b) Surface tension-driven self-assembled 2D network of 
multisegmented metal nanowires (Au/Ni/Au) 


CNTs in the SEM (Banhart 2001). CNT strongly attracts hydrocarbon molecules 
from air, and those insulated hydrocarbons can be easily converted to conductive 
amorphous carbon under the beam irradiation in SEM chamber. This makes the 
joining of CNT much simpler and operable. 


Bulk Joining Assisted with Nanomaterials 

Nanotechnology has brought great impact to society in different aspects. As 
mentioned in earlier sections, nanomaterials possess several interesting mechanical, 
chemical, optical, and electrical properties that cannot be observed in their bulk 
counterparts. In the aspect of joining, addition of nanomaterials mainly contributes 
to the joining temperature reduction and properties enhancement. The depression of 
melting point is probably the most striking result from size reduction. It has been 
shown that isolated nanoparticles or nanoscale materials with relatively free surface 
usually exhibit a significant decrease in melting temperature with thermal behavior 
deviates from the corresponding conventional bulk. The underlying physic for this 
phenomenon is that the value of surface energy and vapor pressure of nanoparticles 
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Fig. 14 TEM images of CNT junctions with different shapes - (a) X, (b) Y, and (c) T -created on 
two overlapping CNTs by electron beam irradiation. (a'-b') Schematic illustration of the 
corresponded CNT junctions 

increase proportionately with decrease of particle size (Giilseren et al. 1995). As a 
consequence, nanoparticles experience melting prior to the melting point of bulk 
materials, enabling joining at the temperature that is few hundred degrees lower 
than the melting point. 

Besides the advantage in melting point reduction, various applications also 
demonstrated material properties enhancement of composite structures with the 
aid of nanomaterials. Because of the extremely high surface-to- volume ratio, the 
surface and interfacial properties of nanoparticles/nanoflakes/nanotubes become 
dominant when they are well dispersed in matrices. Consequently, incorporation of 
these finely dispersed nanophase structures in matrices can significantly alter the 
physical and mechanical properties of the hosting materials, for example, higher 
mechanical reinforcement in polymer composites, improved hardness and strength 
of metals and alloys, better ductility, hardness and formability of ceramics, lower 
sintering, and superplastic forming temperature of ceramics, to name a few. 

In this section, how nanomaterials are being added to enhance the joining process 
or joint property is reviewed. Several examples of applications will be elucidated in 
details to highlight the function of nanomaterials in respective applications. 


Addition of Metallic Nanoparticles for Low-Temperature 
Interconnects 

One advantage of adding nano structured materials is to achieve low temperature 
sintering, which is especially applicable for printed electronics manufacturing 
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Fig. 15 SEM images of CNT junctions before (a, c) and after (b, d) soldering by deposition of 
amorphous carbon, (b) large junction due to high contamination and (d) small junction due to low 
contamination 


industry. The current process of making conducting traces on printed circuit boards 
(PCBs) or other flexible substrates is through lamination or electroplating tech- 
niques, followed by patterning using photolithography and etching processes to 
define the trace pattern from blanket conducting film. This is a subtractive process 
which involves the use of harmful chemicals during fabrication and adds to the 
manufacturing cost of the products made. 

Driven by low-cost, low processing temperature, and flexible plastic electronics 
fabrication trend, additive manufacturing processes that enable direct printing of 
various functional films and conducting traces have emerged as the attractive 
techniques, such as screen printing (Gamier et al. 1994; Bao et al. 1997), inkjet 
printing (Ridley et al. 1999; Sirringhaus et al. 2000; Wang et al. 2004), micro- 
contact printing (Zschieschang et al. 2003), nanoimprinting (Ko et al. 2007a), and 
solid state embossing (Stutzmann et al. 2003), to name a few. For many of these 
processes, the printing materials need to be in viscous fluid state, and metal with 
high melting temperature is difficult to be applied. Nevertheless, the evolution of 
nanomaterials enables the printing possibilities by lowering down the melting 
temperature and viscosity. For example, the melting temperature of bulk gold is 
1,063 °C; however, a 2-3 nm-sized Au nanoparticles start to melt at around 
130-140 °C due to the thermodynamic size effect (Buffat and Borel 1976). Making 
use of this melting temperature depression, Ko et al. prepared the metal 
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Fig. 16 Selective laser sintering process of inkjet-printed nanoparticle solution on a polymer 
substrate by scanning with a focused continuous laser. The circle represents Au nanoparticles with 
a self-assembled monolayer (SAM) and the square block indicates a conductor pattern of sintered 
Au nanoparticles. Unsintered nanoparticles are simply washed away by an organic solvent 
(Ko et al. 2007b) (Reprinted with permission from IOP Science) 


nanoparticles suspension in organic solvent and inkjet printed on a polymer sub- 
strate to form nanoparticle micro patterns. Upon the drying of the nanoparticle 
suspension, the printed area is then locally sintered by scanning with a focused Ar 
ion laser which provides energy to heat and results in the transform of the insulating 
metal nanoparticle to conducting trace (Ko et al. 2007b; Fig. 16). 

The same research group also demonstrated the conducting trace formation 
using nanoimprinting. In this case, Au nanoparticles encapsulated by SAM in 
a-terpineol solvent were dispensed on top of the substrate, followed by PDMS 
mold imprinting at 80 °C for 10 min to allow the nanoparticle solution to fill the 
mold and evaporate the organic solvent. After removing the PDMS mold, the 
nanoimprinted sample was heated on hot plate at 140 °C to induce nanoparticle 
melting and transform of nanoparticles into continuous conducting film (Fig. 17; 
Ko et al. 2007a). The introduction of nanomaterials greatly reduces the melting 
temperature of metals for conductive traces formation, making the process com- 
patible to plastic materials and contributing to large extent the recent progression in 
this industry. 


Nanoparticle-Assisted Bonding Process 

Besides blending into the matrix, nanomaterials are also being applied at the 
bonding interface to facilitate fusion bonding at low temperature or pressure. Ide 
and co-workers (2005) compared Cu-to-Cu joining with the aid of 11 nm Ag 
nanoparticles and 100 nm Ag fine particles to demonstrate the “size matters” effect 
on interfacial bonding. Shear strength improvement up to 25-40 MPa was obtained 
under the same bonding condition (573 K, 1 or 85 MPa) as compared to 0.5-10 MPa 
for 100 nm Ag fine particles. Reveal of fracture interfaces showed that even though 
Ag nanoparticles paste includes inhibitory agents of organic elements, the joint had 
a denser sintered layer than the joint using Ag fine particles. The denser interfacial 
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Fig. 1 7 Nanoimprinting process of nanoparticle, (a, b) Dispensing nanoparticle solution on Si 
wafer, (c, d) Pressing PDMS nanoimprinting mold on nanoparticle solution under 5 psi pressure at 
80 °C. (e, f) Removal of mold and induced nanoparticle melting on hot plate at 140 °C 
(Ko et al. 2007a) (Reprinted with permission from the American Chemical Society) 


micro structure and superior adhesion suggested that Ag nanoparticles significantly 
improve the bondability to Cu (Fig. 18) through the formation of void-free metal- 
lurgical bond, as compared to the mere physical contact between the Cu disc and Ag 
fine particles. 

Besides Ag (Ide et al. 2005; Wakuda et al. 2010), Cu (Hiroshi et al. 201 1) and Au 
(Mehrotra et al. 2008) were also being explored and evaluated for nanostmctured 
bonding layers between Cu surfaces. For the case of Cu nanoparticles, joints showed 
high shear strength (~40 MPa) at thermocompression bonding condition of 673 K for 
300 s and 15 MPa, under nitrogen atmosphere (Hiroshi et al. 201 1). Joining conditions 
such as preheating temperature and heating temperature were observed to be cmcial in 
achieving optimized j oint condition. For the case of Au nanoparticles, since they are 
less reactive than Cu nanoparticles, the bonding is relatively easy even in ambient 
atmosphere. By adopting catalytic deposition to obtain non-agglomerated fine and 
uniform Au nanoparticles on Cu surface, one can optimize the bonding strength to as 
high as 100-150 MPa at bonding condition of 643 K for 360 s and 500 MPa, higher 
than that achieved with Cu nanoparticle interface. The good bondability is believed to 
be due to the low reactivity of Au which is less prone to oxidation and grain coarsening 
at elevated temperature, hence resulting in denser and pore-free joint at the bonding 
interface as compared to Cu and Ag nanoparticles. 
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Fig. 18 Fracture surface of joint made using 100 nm Ag fine particles and 1 1 nm Ag nanoparticles 
(Ide et al. 2005) (Reprinted with permission Elsevier) 


In conclusion, the addition of ultrathin nanoparticles to the bonding interface has 
been proven effective in enabling room- to low-temperature bonding by reducing 
the thermal budget due to the high surface energy of the nanoparticles and enhanc- 
ing bonding at low temperature through accelerated diffusion kinetics. 


Addition of Nanoparticles for Property Enhancement in Composites 

Incorporation of finely dispersed nanophase structures in matrices for properties 
enhancement has been applied for decades. It exists in every material system from 
polymers to ceramics and metals, with heaped choices of nanomaterials to be added. 
Owing to the huge variety of nanomaterial-enhanced composite systems, one 
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nanomaterial type, i.e., carbon nanotube (CNT), has been selected as the example to 
illustrate the few key issues in a composite system and the resolutions to it. 

Because of its excellent mechanical strength, superior thermal and electrical 
conductivity, outstanding toughness, elasticity, durability, etc., CNTs have been 
the top choice of nanoscale fibers for performance enhancement in composite 
systems due to the fact that it has better property than the base matrix. However, 
since they are essentially existent at nanoscale level, one has to practice certain 
means or ways to realize their fascinating properties at macroscopic level for real- 
world applications (Bal and Samal 2007). First work on CNT-polymer composite 
was reported by Ajayan et al. which showed that multi walled CNTs (MWNTs) 
have excellent mechanical properties and anisotropy can be induced by a flow in 
such a material. Subsequent reports with other polymer matrices demonstrated the 
improvement in properties, i.e., compressive strength, tensile modulus, conduc- 
tivity, hardness, etc., to different extents with the addition of CNTs (Ajayan 
et al. 1994; Lourie and Wagner 1998; Siochi et al. 2004; Cai et al. 2004; Seo and 
Park 2004). 

Besides polymer composites, CNTs are also being added into metal. The driving 
force for metal matrix composites comes from the need to lightweight, high- 
strength materials, which is especially needed for automotive and aerospace indus- 
tries to reduce the mass of material required for certain load-bearing application. 
Compared to polymer composites where mixing and dispersing of CNTs into 
polymer resins is a common technique, preparation of metal matrix composites is 
much more challenging due to the incompatible nature of CNT with metal. Powder 
metallurgy route is the most popular technique which involves the mixing of CNTs 
with metal powder by grinding or mechanical alloying, followed by consolidation 
and sintering, such as cold isostatic pressing, hot isostatic pressing, and sparking 
plasma sintering (He et al. 2007; Chen et al. 2003; Hulbert et al. 2009; Zhong 
et al. 2003; Xu et al. 1999). The second most popular route is by electrochemical 
deposition technique, which is used primarily for the formation of thin film coatings 
(Arai et al. 2008; Guo et al. 2007), as well as to produce one-dimensional compos- 
ites (Kang et al. 2007). Other methods include conventional melting and solidifi- 
cation methods (Sinian et al. 2005; Goh et al. 2008; Zhou et al. 2007), which are 
more suitable for low-melting-point matrix, and thermal spraying to spray molten 
or semi-molten particles onto a substrate to form coating/deposit by way of impact 
and solidification (Laha et al. 2004; Bakshi et al. 2008). 

The challenges for CNT-metal/polymer composites surfaced after the extensive 
research in these areas. Irrespective of the process steps, it was found that the main 
aspect in obtaining good reinforcement is by achieving homogeneous dispersion of 
CNT in the metal matrix and good bonding at the metal/CNT interface (Bakshi 
et al. 2010). CNT clustering has been the main issue in CNT-reinforced composites 
such as polymer, ceramic, or metal matrix. Because of the high surface area up to 

o 7 

200 nT g , the CNTs experience strong van der Waals force which pulls them 
together to form clusters, which result in the concentration of reinforcement at 
certain points and worsening of overall mechanical properties (Feng et al. 2005; Nai 
et al. 2006), as shown in Fig. 19. Since the elastic modulus, strength and thermal 
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Fig. 19 FESEM images of CNTs/epoxy nanocomposite prepared (a) without using the solvent 
and (b) using the solvent. Severe agglomerations could be observed with improper dispensing and 
mixing method (Song and Youn 2005) (Reprinted with permission from Elsevier) 





Fig. 20 Effect of MWNT content on the (a) impact strength, (b) bending strength, and (c) 
bending modulus of MWNT/epoxy composites (Yang et al. 2009) (Reprinted with permission 
from Elsevier) 
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properties of a composite are related to the volume fraction of the reinforcement, a 
homogeneous distribution of CNT is essential as it translates into the homogeneous 
properties of the composite. 

Interface between the CNTs and the matrix plays an important role in strength- 
ening. A weak bonding interface will result in inefficient utilization of CNT 
properties, facilitating pullout phenomena at low loads due interface failure, lead- 
ing to deteriorated performance. Conversely, a strong bonding interface will facil- 
itate the stress transfer to the high-strength fiber through the interfacial layer 
(Dilandro et al. 1988) and strengthen the composite, although at the expense of 
ductility of the composite. Wettability of the CNTs to the matrices is the key 
governing the interfacial bondability. Many CNT functionalization methods have 
been attempted to modify the surface and attach suitable functional groups so as to 
improve surface wetting and interfacial reactions, maximizing the effectiveness of 
CNT reinforcement as shown in Fig. 20 (Kuzmany et al. 2004; Balasubramanian 
and Burghard 2005; Yang et al. 2009). 

The issues encountered in CNT composite systems are also applicable to other 
nanomaterial-reinforced systems. Depending on the choice of the reinforcement 
and base system, different approaches are needed to guarantee uniform distribution 
of the nanomaterials in the matrices, with proper surface treatment to be carried out 
to facilitate interfacial reactions for strong mechanical and chemical bonding at the 
interface. 


Summary 

In summary, the fundamentals of nanomaterials and its joining -related issues - 
from nanojoining by producing permanent unions or connection between the zero- 
to two-dimensional nano-sized building blocks to the bulk joining assisted with 
nanomaterials for more desirable bonding condition and improved bonding prop- 
erties - were reviewed. Nanojoining of nanomaterials in different components are 
introduced based on dimensional change, i.e., single-component (e.g., Ag, Pt) and 
multicomponent (e.g., CuO) nanoparticles, as well as one-dimensional nanowire 
and carbon nanotubes. Although nanojoining technology is very attractive, there are 
still limitations and a lot of works be done to develop the technology for real 
applications in industrial manufacturing process. Problems to be addressed include 
the limitation of machine operation and critical experimental conditions, low 
productivity, stability and reliability of samples, and so on. Some of them can be 
easily solved in the micro or even bigger range but will be tough in the nanorange. 
Compared to direct nanojoining of materials, nanomaterial-assisted joining appears 
to be more widely accepted and easily blended into existing markets and 
applications. 

While it is exciting to notice the rapid growth in this area, one should also not 
neglect the evolving challenges in health and safety. Moving on, the involvement of 
all stakeholders (governments; international, regional, and national organizations; 
industry groups; public interest associations; labor organizations; scientific 
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associations; and civil society) will be required to identify the impact of 
nanomaterials to health and safety in positive or negative ways and address policy 
issues accordingly. This will help in expediting more adoption of nanomaterials and 
nanotechnologies into our daily life. 
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Abstract 

This chapter introduces the key functions of solder joints and the various 
soldering processes. The evolution from leaded to lead-free solder materials is 
reported. Furthermore, the recent developments of high-temperature solders and 
composite solders due to the ever demanding functional and service require- 
ments are discussed. The properties of different solder materials and their solder 
joints are also presented. Finally, reliability studies of the solder materials and 
their solder joints are discussed in terms of mechanical behavior, temperature 
cycling, and drop impact. 


Introduction 

Solder material serves as an electrical and mechanical interconnect in electronic 
packaging. Interconnects in electronic packaging are generally categorized into 
three levels. The zeroth level is an interconnect on the silicon itself which is made 
during wafer processing. This level of interconnect will not be discussed in this 
chapter as it does not involve the solder material. Level one interconnects are made 
between the die and the substrate and generally form the package. Some examples 
of these interconnects include C4 (controlled collapse chip connection) solder joints 
and wirebonding. Level two interconnects are joints between the substrate and the 
PCB (printed circuit board). These hold the package to the PCB. Examples of these 
include BGA (ball grid array) solder joints and leaded package soldering. 

In the following sections, the key solder joint technologies (like surface mount 
technology, pin-through-hole technology, and flip chip technology) are briefly 
described. This chapter also presents the various soldering processes and the 
evolution of solder materials. Emphasis is also placed on how material character- 
istics of solder impact the solder joint performance. Reliability studies in terms of 
mechanical behavior, temperature cycling, and drop impact are also discussed. 


Surface Mount Technology (SMT) 

In most portable electronics such as mobile phones, SMT packages are very 
common as the manufacturing process for this is very mature, thus making them 
relatively cheap to manufacture. In SMT packages, the silicon (Si) die is 
wirebonded to the leadframe. However, there are limitations on the length of 
wirebonds as longer wirebonds are more susceptible to wire sweep during the 
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Fig. 1 A leadframe package mounted on board using SMT. Schematic diagram showing the 
cross-sectional view of a lead soldered to the copper pad of a PCB 


Fig. 2 A schematic diagram 
showing a leadframe package 
mounted on board using pin- 
through-hole technology 


Cu plating on 
walls of through 
hole 


Encapsulation Si die 
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Wirebond (Level 1) 
Leadframe 
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encapsulation process. Wire sweep is said to have occurred when adjacent 
wirebonds move and come into contact, resulting in short circuit. To prevent this, 
wirebond pads are generally located at the periphery of the die. The legs of the 
leadframe can be bent into gull wings or J- wings and are placed onto the bond pads 
of the PCB which has solder paste printed on it. This assembly is subjected to the 
reflow process, resulting in the formation of solder joints. A schematic diagram of 
the leadframe package mounted on PCB using the SMT is shown in Fig. 1. 


Pin-Through-Hole Technology 

Pin-through-hole technology is another kind of second-level interconnection. In 
general, it has better mechanical reliability than SMT, however it is more expen- 
sive. It is used on leadframe packages with the legs bent straight as shown in Fig. 2. 
Holes are drilled on the PCBs and are plated with Cu and immersion Sn. Following 
this, the leadframe package is placed into the holes and it undergoes a wave 
soldering process such that the solder rises through the holes, connecting the 
leadframe legs to the PCB. 


Flip Chip Technology 

Flip chip technology is used in higher-end devices such as CPUs and server 
products. This type of interconnection is largely preferred due to the large number 
of I/Os present in these products and short interconnect length, which results in 
superior electrical performance. This technology involves the active Si surface 
being flipped over such that it faces the surface of the substrate. An area array of 
under bump metallization (UBM) is fabricated during wafer processing over the 
active circuitry of the silicon. Solder can be either electroplated or printed on the 
UBM. Other than a narrow keep-out zone at the edges of the die, these solder bumps 
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Fig. 3 Schematic diagram of 
a BGA package 


Si die 


Substrate 
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cover the entire surface of the chip. Joints formed using this technology are referred 
to as C4 (controlled collapse chip connection) joints. This is because wetting can 
occur only on the UBM and not on the surrounding Si0 2 passivation, thus effec- 
tively controlling the height of joint. A schematic diagram of this package assembly 
is shown in Fig. 3. 

Besides the C4 joints, ball grid array (BGA) solder joints also undergo flipping 
during its manufacturing process. Unlike C4 joints, BGA joints are second-level 
joints. First, the solder balls are attached to the package. To do this, the package is 
flipped over and solder balls are placed on the pads of the package and undergoes a 
reflow step. Then the package is flipped onto the PCB, and the whole assembly is 
subjected to reflow again. 


Metallurgical Reaction During Soldering 

During soldering, the solder material is joined together without the need to melt the 
base metal (substrate). However, in order to achieve a reliable solder joint, a proper 
metallurgical bond between the two metal surfaces must be formed and wetting 
must take place. In general, wetting is said to happen if the wetting or contact angle 
lies between 0° and 90°. Conversely, if the contact angle lies between 90° and 180°, 
the system is considered to be non- wetting. According to Young’s equation 
(Jacobson and Humpston 2004), the contact angle ( 6 C ) is determined from the 
balance of interfacial tensions at the junction, as shown in Fig. 4: 

Ysf ~ Ysl = Ylf cos @c (1) 

where Ysf is the interfacial tension between the flux and solid base metal, Ysl is the 
interfacial tension between the molten solder and the solid base metal, Ylf is the 
interfacial tension between the molten solder and the flux, and 6 C is the contact 
angle. Figure 4 shows the spreading of molten solder on the horizontal and vertical 
surfaces. Superior wetting is associated with a small contact angle. In order to 
reduce the contact angle, it is useful to reduce Ylf through the application of active 
flux. 

The formation of a solder joint is made possible by (i) introducing 
fluxing agents to remove the oxide layer present on metal surfaces and (ii) using 
an inert environment during the soldering process, to reduce the oxide formation. 
Figure 5 shows the phenomena taking place during the soldering process. Upon 
the activation of the flux, the presence of oxide on the solder and base metal 
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Fig. 4 Schematic diagrams showing the spreading of solder on (a) horizontal and (b) vertical 
surfaces. The contact angle (0 C ) is an indicator of wettability 



Fig. 5 Schematic diagram showing the various phenomena taking place during the soldering 
process 


Fig. 6 Schematic diagram 
showing the cross section of a 
typical solder joint 
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surfaces is removed. This allows wetting to take place as the solder melts. A 
metallurgical reaction of the molten solder and base metal occurs, and there is 
dissolution of the base metal into the molten solder. This reaction is known as 
liquid solder reaction, and it leads to the formation of an interfacial intermetallic 
layer. After the solidification of the solder joint, metallurgical reaction continues 
to take place while the joint is in service, and the reaction is termed as solid-state 
reaction. 

As shown in Figs. 6 and 7, intermetallic compounds (IMCs) are formed at (i) the 
interface between solder and base metal of the substrate and (ii) inside the bulk 
solder material. The formation of such intermetallic compounds can alter the 
micro structure of the solder joints and hence influence their long-term reliability. 
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Fig. 7 Representative FESEM (field emission scanning electron microscopy) image showing the 
presence of interfacial IMC layer and intermetallics inside the solder matrix 


Fig. 8 Classification of 
soldering processes 
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Although the presence of interfacial IMCs is desirable for the formation of a good 
solder joint, studies have shown that excessive intermetallics growth degrades 
interfacial integrity (Ahat et al. 2001; Nai et al. 2009). 


Soldering Processes 

The soldering processes can be broadly classified into two groups, namely, (i) total 
heating method and (ii) partial heating method. Figure 8 shows the various solder- 
ing processes under each group. 

For total heating modes, the entire package and/or printed wiring board is 
subjected to heat (refer to Fig. 9). It is widely used in the industry for large- 
volume production. It is efficient in terms of productivity and is economical. 
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Fig. 9 Schematic diagrams showing the various total heating methods: (a) infrared mode, 
(b) vapor phase heating mode, (c) convection mode, and (d) wave soldering 
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Fig. 10 Schematic diagrams showing the various partial heating methods: (a) soldering iron, 
(b) laser soldering, (c) localized hot air soldering, and (d) pulse heating 


However, heat stresses are induced on the whole device and printed wiring board. 
On the other hand, for partial heating modes, heat is applied to the package leads 
and/or printed wiring boards in a localized manner (refer to Fig. 10). Though less 
heat stress is induced on the device and printed wiring boards, it is not ideal for 
large-volume production. 
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Solder Materials 

Drive Towards Lead-Free Solders 

The implementation of legislative and regulatory actions to restrict the use of lead 
(Pb) and Pb compounds has moved the electronics industry to do away with Pb in 
solder materials. In the electronics industry, the concerns over the use of Pb are 
from the occupational exposure, disposal of electronic assemblies which contain Pb 
and Pb waste derived from the manufacturing processes. There have been many 
studies on lead-free alternative solders in the past decade (Bieler and Lee 2010; 
Puttlitz and Stalter 2004; Chidambaram et al. 2011). These solder alloys can be 
broadly classified into (i) binary, (ii) ternary, and (iii) quaternary alloys. Majority of 
these solder alloys are based on tin (Sn) being the primary or major constituent. In 
the following section “Characteristics of Bulk Solder and Solder Joints,” the 
characteristics of these lead-free solder alloys will be presented in greater detail. 


High-Temperature Lead-Free Solders 

In the past decade, there has been momentous research effort in the development of 
lead-free solders. However, a limited portion of these is related to lead- free high- 
temperature solder alloys. In the field of high- temperature applications (e.g., deep 
well oil and gas logging tools, automotive under the hood electronics and aerospace 
electronics), there is an urgent need for such high-temperature lead-free solder alloys. 
In order to ensure efficient process control of the manufacturing and assembly of the 
soldered components, the melting range of the high-temperature solder alloys has 
been defined by the industry as 270-350 °C (Chidambaram et al. 2010). 

The conventional high-temperature solders used include (i) Pb-Sn alloys which 
contain high amount of Pb (Nousiainen et al. 2006; Kim et al. 2003), (ii) Pb-Ag, 
(iii) Sn-Sb, (iv) Au-Sn, and (v) Au-Si alloys (Suganuma et al. 2009; Zeng 
et al. 2012). Table 1 lists some of the typical high-temperature solders, namely, 
high Pb solders, Au-based solders, Sn-Sb solders, Zn-based solders, and Bi-Ag 
solders. 


Composite Solders 

Stricter functional and service requirements of electronic devices, coupled with 
concurrent advances in the electronics industry towards miniaturization of devices, 
have necessitated for solder joints having enhanced mechanical, thermal, and 
electrical properties. The search beyond the conventional solders has led to the 
development of a new generation of interconnection material. A potentially viable 
way to effectively increase the service temperature capabilities and thermal stabil- 
ity of the base solder materials is the development of composite solders. Composite 
solders are solder alloys with intentionally added reinforcements. 
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Table 1 Lists of typical 
high-temperature solder 
alloys (Nousiainen 
et al. 2006; Kim et al. 2003; 
Suganuma et al. 2009; 
Zeng et al. 2012) 


Alloys 

Composition (wt.%) 

Pb-Sn 

Sn-65Pb 

Sn-70Pb 

Sn-80Pb 

Sn-90Pb 

Sn-95Pb 

Sn-98Pb 

Pb-Ag 

Pb-2.5Ag 

Pb-1.5Ag-lSn 

Sn-Sb 

Sn-5Sb 

Sn-3Sb 

Au-Sn 

Au-20 Sn 

Au-Si 

Au-3. 15Si 

Au-Ge 

Au-12Ge 

Bi-Ag 

Bi-2.5Ag 

Bi-lOAg 

Bi-llAg 

Zn-Al 

Zn-6A1 

Zn-4Al-3 Mg-3.2Ga 

Zn-Sn 

Zn-20Sn 

Zn-30Sn 

Zn-40Sn 


To date, researchers have worked with a variety of reinforcement filler materials 
to synthesize composite solders (Nai et al. 2006, 2008a, b, 2009; Guo et al. 2001; 
Guo 2007; Liu et al. 2008; Babaghorbani et al. 2009; Shen and Chan 2009; 
Geranmayeh et al. 2011; Niranjani et al. 2011; Han et al. 2011, 2012; Tsao 
et al. 2012; Han et al. in press). The filler materials used can be classified into 
(i) elemental metallic particles, (ii) intermetallic particles or intermetallics formed 
from elemental particles through a reaction with Sn during manufacturing or by 
subsequent aging and reflow processes, and (iii) phases that have low solubility in 
Sn or are nonreactive with Sn. The filler materials used also varied in (i) size (e.g., 
micron, submicron, and nanoscale) and (ii) shape (e.g., particles, wires, and 
nanotubes). 

With the advancement of nanotechnology in recent years, various nano-size 
reinforcements are chosen to synthesize the nano-composite solders. These nano- 
composite solders are developed primarily to enhance the service performance of 
solder joints, in particular their creep and thermomechanical fatigue resistances. 
Studies have shown that the nano- size reinforcements are preferred over their 
micron-size counterparts as they could more effectively distribute along the 
Sn-Sn grain boundaries and thus act as obstacles to restrict dislocation motion 
and limit grain boundary sliding (Shen and Chan 2009). In the case of nano- 
composite solders, other reported enhancements include (Shen and Chan 2009) 
(i) reduction in solder’s density, (ii) reduction in the degree of undercooling, 
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(iii) improvement of electrical conductivity, (iv) improvement of wettability of 
solders on substrates, (v) suppression of the growth of intermetallics (in terms of 
interfacial intermetallic compound layer growth and growth of intermetallics inside 
the solder matrix), (vi) refinement of micro structure (in terms of finer grains), and 
(vii) improvement of mechanical properties (in terms of microhardness, 0.2 % yield 
strength, ultimate tensile strength, shear strength, and creep resistance). 

In general, composite solders with improved performance have been broadly 
demonstrated, although the technology is still embryonic and there are technical 
issues to be resolved before adoption by the industry. Nonetheless, there is keen 
industry interest, particularly how production of such materials might be cost- 
effectively adapted to real-life industry processes. 


Characteristics of Bulk Solder and Solder Joints 

In general, solder materials serving as interconnections need to provide the neces- 
sary mechanical, thermal, and electrical integrity in electronic assemblies. As there 
is a wide range of service requirements for solder materials, it is essential to 
understand the performance characteristics of the solder materials. These perfor- 
mance characteristics can be broadly grouped into (i) properties pertinent to 
reliability and (ii) properties pertinent to manufacturing. Figures 11 and 12 show 
the key properties of bulk solder material and their solder joints which are critical to 
reliability and manufacturing respectively. 


Melting Temperature/Range 

The melting temperature of solder materials is a critical property which is closely 
linked to the manufacturing process and the ability of the solder interconnect to 



Fig. 1 1 Key characteristics important to reliability of bulk solder materials and their joints 
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Fig. 12 Key characteristics 
important to manufacturing of 
bulk solder materials and their 
joints 


Reworkability 


Melting 

temperature 



Cost of solder material and 
cost of processing 



Solderability 

/wettability 



Environmental 

friendly 



Ability to make into 
powder/balls and paste 



Viscosity 


Density 


withstand the service temperature. The melting temperature of Sn-Pb solder is 
183 °C. Most of the assembly equipment previously used for Sn-Pb solder tech- 
nology is designed to operate using 183 °C as a reference. Table 2 lists the melting 
points of some solder materials. Although some variations in the baseline temper- 
ature can be accommodated by the equipment currently in use, if the melting point 
of the alternative solder is significantly higher, the existing processing equipment 
will have to be replaced. This will result in expensive investment of new equipment 
which will then translate to an increase in overall product cost. In addition, in an 
electronic assembly, a variety of materials is used. Particularly, there are many 
polymeric materials found in a typical electronic package, and these materials 
might not be able to tolerate the higher manufacturing process temperature. 


Electrical Properties 

Solder serves as an electrical interconnect in devices whereby electrical currents 
flowing into and out of the device need to pass through the solder joints. In order to 
ensure proper functioning of the interconnect, the electrical resistivity of the solder 
materials has to be low so as to allow efficient current flow. For a given alloy 
material, the resistivity value varies with temperature, composition, and micro- 
structure. Table 3 shows the electrical resistivities of some solder alloys and 
microelectronic packaging materials. 


Thermal Conductivity 

When electronic devices are in service, heat is generated by the die. For proper and 
long-term reliable functioning of the device, it is vital to ensure efficient heat 
dissipation from the die to the surroundings through its packaging. One of the 
heat dissipation pathways is through the solder interconnections. In the case of high 
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Table 2 List of lead-free 
solder selection and their 
melting point/range 
(Abtew and Selvaduray 
2000; http://www.kester. 
com/SideMenu/ 
LeadFreeSolutions/tabid/ 

1 45/Default, aspx. 
Accessed 28 Sept 2012; 
Ganesan and Pecht 2006; 
Mahidhara 2000) 


Solder alloy composition 

Melting point/range °C 

Au20Sn 

280 

Sn-5Sb 

236-243 

Sn-3Sb 

232-238 

Sn2.0Cu0.8Sb0.2Ag 

219-235 

Sn25AglOSb 

233 

Sn 

232 

Sn0.7Cu 

227 

Sn2.5AgO.8CuO.5Sb 

217-225 

Sn4.0Ag0.5Cu 

217-224 

Sn3.9AgO.6Cu 

217-223 

Sn-3.5Ag 

221 

Sn2.5Agl.0Bi0.5Cu 

214-221 

Sn-3.5Ag3Bi 

220 

Sn-3.0Ag-0.5Cu 

217-220 

Sn-2.0Ag-0.75Cu 

217-219 

Sn3.8Ag0.7Cu 

217-219 

Sn-3.5Ag0.7Cu 

217-218 

Sn2.5AgO.8CuO.5Sb 

213-218 

Sn2.0Ag3.0Bi0.75Cu 

207-2 1 8 

Sn-3.5Ag0.9Cu 

217 

Sn2Ag4Bi0.5Cu0.1Ge 

216 

Sn-3.5Ag5BiO.7Cu 

210-215 

Sn2Ag3Bi0.75Cu 

210-215 

Sn4.0In3.5Ag0.5Bi 

210-215 

Sn8.0In3.5Ag0.5Bi 

197-208 

Sn9Zn 

199 

Sn8Zn3Bi 

191-198 

Sn20In2.8Ag 

175-187 

Sn57Bi0.1Ag 

138-140 

Sn58Bi 

138 

Sn52In 

118 


interconnect ball grid array (BGA) devices, thermal solder balls which have no 
electrical connections are introduced to function as a heat dissipation medium. 
Table 4 lists the thermal conductivity values of some solder alloys and microelec- 
tronic packaging materials. 


Coefficient of Thermal Expansion (CTE) 

An electronics assembly comprises of many different materials which ranges from 
metals, ceramics, polymers, to composites. The whole assembly will experience 
thermal cycling when the device is in operation. These materials with different 
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Table 3 Electrical 
resistivities of some solder 
alloys and microelectronic 
packaging materials 
(Abtew and Selvaduray 
2000; Hwang 1996; Gray 
1957; Lee et al. 1995; Kang 
et al. 1999) 


Type 

Materials 

Resistivity (pO cm) 

Solder alloys 

Sn-37Pb 

14.5 

Sn-2.8Ag-20In 

17 

Sn-3.5Ag 

10 

Sn-3.5Ag-0.7Cu 

13 

Sn-3.8Ag-0.7Cu 

13 

Sn-3Ag-5Bi 

11.6 

Sn-3Ag-10Bi 

8.8 

Sn-3Ag-3In 

7.7 

Sn-3Ag-5Sb 

10.5 

Sn-3Ag-lZn 

10.4 

Sn-3Ag-3Zn 

4.8 

Sn-5Bi 

11.0 

Sn-lOBi 

17.7 

Sn-5In 

8.3 

Sn-52In 

14.7 

Sn-5Sb 

17.1 

Au-20Sn 

16 

Leadframes 

52Ni-48Fe 

43.2 

42Ni-58Fe 

57.0 

Metallic elements 

Ag 

1.47 

Al 

2.83 

Au 

2.44 

Bi 

119 

Cu 

1.73 

In 

8.37 

Mo 

5.14 

Ni 

7.24 

Pd 

10.21 

Pt 

7.83 

Sb 

37.1 

Sn 

11.5 

Zn 

5.75 


coefficients of thermal expansion (CTE) will expand and contract at different rates, 
resulting in thermally induced stresses during the thermal cycling of the device. 
Table 5 summarizes the CTE values of some solder alloys and microelectronic 
packaging materials. 


Mechanical Properties 

Solder interconnects not only conduct current, they also provide mechanical sup- 
port for electronic devices. Thus, the mechanical properties of the solder materials 
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Table 4 Thermal 
conductivity values of 
solder alloys and 
microelectronic packaging 
materials (Suganuma 
et al. 2009; Abtew and 
Selvaduray 2000; Hwang 
1996; Lee et al. 1995; Bilek 
et al. 2004; Hwang 1994) 


Materials 

Thermal conductivity (W/m*K) at 300 K 

Ag 

429 

Au 

297 

Bi 

8 

Cu 

401 

Ni 

91 

In 

82 

Si 

82 

Sn 

73 

Sb 

24 

AI2O3 

21 

SiC 

210 

AIN 

250 

Sn-37Pb 

50.9 

Sn-3.5Ag 

54.3 

Sn-2.8Ag-20In 

53.5 

Sn-3.5Ag-0.9Cu 

~60 

Sn-3.8Ag-0.7Cu 

Au-20Sn 

57 

Au-12Ge 

44 

Au-3Si 

27 

Sn-5Sb 

48 

Pb-5Sn 

23 

Sn-0.7Cu 

65 


used to form the solder joints are vital to the mechanical reliability of the devices. 
The three key mechanical properties of solders are, namely, 

(i) Strength and stiffness 

(ii) Creep resistance 

(iii) Fatigue resistance 

Under various service conditions such as temperature cycling, vibration during 
transportation, and drop impact conditions, the solder joint experiences tension, 
compression, and shear stresses. 

Shear Mode 

While an electronic device is in operation, the solder joints are subjected to 
mechanical stresses and strains due to the mismatch of coefficients of thermal 
expansion (CTE) between the different materials. Components in devices are 
made up of a variety of materials which have different coefficients of thermal 
expansion. When the device is switched on and off, it experiences thermal cycling 
and this leads to shear stresses as schematically illustrated in Fig. 13. 
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Table 5 CTE values of 
solder alloys and 
microelectronic packaging 
materials (Suganuma 
et al. 2009; Abtew and 
Selvaduray 2000; Hwang 
1996; Hwang 1994; 
Handbook Committee 
1989; Korhonen 
et al. 1993; Spearing 
et al. 1993) 


Materials 

CTE (x 10 S /K) 

Si 

3.0 

SiC 

3. 7-3 . 8 

AI 2 O 3 

7.0 

AIN 

4.03 

FR-4; T < T g 

x, y: 15.8 

FR-4; T < T g 

z: 80-90 

FR-4; T > T g 

x, y: 20 

FR-4; T > T g 

z: 400 

Epoxy resins 

~50 

Bi 

13.4 

Cu 

16-18 

In 

32.1 

Sb 

11.0 

Sn 

22.0 

Sn-37Pb 

21 

Sn-3.5Ag 

22 

Au-20Sn 

16 

Au-12Ge 

13 

Au-3Si 

12 

Sn-5Sb 

23 

Pb-5Sn 

30 

Sn-4.8Bi-3.4Ag 

23 


Silicon Die (ocj) 


Solder (a 2 ) 


Substrate (a 3 ) 



Fig. 13 Solder interconnects 
subjected to shear stresses 
during thermal cycling, due to 
CTE mismatch between the 
die (al), solder (a 2 ), and 
substrate (a3). a denotes the 
CTE of the material 
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Tension Mode 

It is also of interest to determine the extent of tensile deformation the solder joints 
can endure without failure as they can be subjected to tensile loading during the 
handling of electronic devices. The tensile properties of a solder material can be 
derived from a stress-strain curve which consists of an elastic region and a plastic 
region. The elastic modulus ( E , Young’s modulus) can be determined from the 
slope of the elastic region of the curve. Within this elastic region, the sample will 
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return to its initial dimensions when the applied load is removed. When the applied 
load increases and exceeds that of the elastic limit, the solder material will undergo 
plastic deformation whereby the deformation in this region is permanent. The 0.2 % 
yield stress (0.2 % YS) of the solder material is described as the level of stress 
resulting in the onset of plastic deformation. It is determined by drawing a parallel 
line to the slope of the elastic region at 0.2 % of the strain. Under tensile loading, the 
maximum level of stress which the solder material can endure before failure is 
defined by the ultimate tensile stress (UTS). 

Creep 

Creep deformation is a key failure mode of solders due to the high homologous 
temperature of solder materials. The homologous temperature (T h ) is defined as the 
ratio of the temperature of the solder material and its absolute melting temperature, 
T m (in degrees Kelvin) (Callister 2007). When T h is greater than 0.5 T m , the creep 
deformation will be the dominating mode of deformation in metallic materials. For 
most solder alloys, even at room temperature, the homologous temperature is 
greater than 0.6 and this leads to the solder joints being loaded under conditions 
whereby they are prone to creep. 

Creep is a time-dependent plastic deformation under a constant stress in either 
tension or compression. A typical creep curve is made up of three stages as shown 
in Fig. 14. The first stage is the primary creep where the strain rate decreases rapidly 
over time due to work hardening which restricts the deformation. The second stage 
is the secondary creep, also known as the steady-state creep region. This stage is the 
most vital as deformation of the solder materials dominates here. Most of the 
published literature is on steady-state creep. The final stage is the tertiary creep 
where the creep strain rate increases rapidly over time and will result in creep 
rupture of the testing sample. 

Fatigue 

Due to mechanical or thermal cyclic loading, solder joints are also susceptible to 
fatigue failure. This type of solder joint failure can lead to (i) crack nucleation and 


Fig. 14 Graphical 
representation of a typical 
creep curve showing the three 
key stages 



Time, t 
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Fig. 15 Typical dimensions 
of a bulk solder sample used 
for mechanical testing (Pang 
2012 ) 



(ii) microcrack propagation and coalescence (Fine 2004). The fatigue life of a 
solder joint is determined by the number of stress cycles it can withstand before 
electrical failure occurs. 

Samples Used for Mechanical Testing 

For mechanical design, modeling, life prediction, reliability assessment, and pro- 
cess optimization, it is critical that the properties of the solder material must be 
reliable and accurate. However, there are discrepancies in the mechanical property 
database of solder materials due to the absence of uniformity in testing standards. 
Furthermore, there is a lack in correlation between the bulk solder properties and 
that of soldered joints. 

Solder used in electronic industry exists in various forms: (i) solders for oper- 
ations such as wave soldering, (ii) solder paste for operations such as surface mount 
reflow, and (iii) solder balls or solder columns for applications such as ball grid 
array, column grid array, and flip chip packages. The design of test specimen thus 
plays an important role in the representation and validity of the testing result. The 
specimens currently used to determine the mechanical properties of solder inter- 
connect can be grouped into (i) bulk solder, (ii) simplified shear sample, and (iii) 
surface mount technology (SMT) solder joints. 

In general, bulk solder specimens, as shown in Fig. 15, can be used to determine 
the mechanical properties of solder alloys (Pang 2012). However, such testing 
specimens could not accurately model the service life of an actual electronic assem- 
bly due to the size effect. Hence, Darveaux and Banerji (1992) used solder assemblies 
for mechanical testing. The size of the test specimens was very close to the actual 
soldered assemblies. Figure 16 shows some mechanical test specimens which are 
better representatives of the actual solder interconnects (Plumbridge 2004). 


Reliability Case Studies 
Mechanical Study of Bulk Solder 

With the rapid progress in technology, our everyday life becomes more dependent 
on electronics. Hence, the research in the dynamic response of solder interconnects 
to make these electronic devices more robust becomes crucial. Accurate material 
models and test results allow the effective comparison and selection of suitable 
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Fig. 16 Schematic diagrams 
of different test specimens 
used for mechanical testings, 
(a) Single lap shear, (b) 
double lap shear, (c) solder 
ball tension, and (d) ring pin 
shear 




solder materials for various application needs and in extracting mechanical param- 
eters required for finite element modeling. The study by Ong (2005) to obtain the 
properties of Sn-37Pb solder and a lead-free solder material, Sn-3.5Ag from 
compression of the specimens using the Shimadzu AG-25 TB Testing Machine, 
and the Split-Hopkinson Pressure Bar (SHPB) is presented in this section. 


Microstructure of Solder Specimens 

• Microstructure of Sn-Pb Solder at Different Cooling Rates 

Distinctly dissimilar micro structures are obtained under different cooling rates. 
Being poly crystalline structured, the cast solder will develop larger grains at a 
slower cooling rate (0.1 °Cs _1 ) as grains have more time to nucleate. From 
Fig. 17, Sn-37Pb solder samples cast from slow cooling (SC) result in larger 
grains as compared to those formed via moderate cooling (MC) (2-3 °Cs _1 ). 
Lamellar layers with “lighter-toned” lead and “darker-toned” tin phases are 
observed for slow-cooled samples. With moderate cooling, the material has less 
time to resolidify into lamellar layers. As a result, shorter but thicker patches of 
“light” lead phases suspended in “dark” tin are formed. This is due to the 
instabilities of advancing liquid-solid interface resulting in island-shaped phases 
(approximately 5 pm in length) that lack long-range perfection of the lamellar 
structure formed by slow cooling. 

• Microstructure of Sn-3.5Ag Solder at Different Cooling Rates 

Sn-Ag solder, besides having a much higher eutectic temperature of 221 °C, is 
also very different from Sn-Pb in terms of phase fractions and solubility 
behavior of the two phases. Lead (Pb) comprises more than 30 % volume 
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Fig. 1 7 Representative SEM micrographs of Sn-37Pb formed by (a) slow and (b) moderate cooling 

a b 



Fig. 18 Representative SEM micrographs of Sn-3.5Ag formed by (a) slow (SC) and (b) 
moderate cooling (MC) 


fraction in Sn-Pb solder, whereas silver (Ag) comprises less than 4 % of the total 
volume of Sn-3.5Ag solder and forms mainly the Ag 3 Sn intermetallics (Choi 
et al. 2000). Also, Pb-rich phases in Sn-Pb solder are ductile as compared to 
Ag 3 Sn intermetallics which are stronger but more brittle (Kim et al. 2002; 
Maveety et al. 2004). Figure 18 shows slow-cooled (SC) samples produce 
long, well-aligned Ag 3 Sn intermetallic plates/needles. Large Ag 3 Sn precipitates 
are found in the form of platelets/needles, whereas fine Ag 3 Sn precipitates are 
fibrous (Kim et al. 2002). In moderately cooled (MC) samples, Ag 3 Sn platelets 
or needles can be seen to be shorter and less well aligned than in earlier samples 
formed by slow cooling. 

Quasi-Static Compression of Bulk Solders 

For quasi-static compression, the bulk solder specimens are prepared by casting and 
machining methods to produce cylindrical samples approximately 21 mm in length 
and 7 mm in diameter. The Shimadzu AG-25TB Testing Machine is used to 





732 


S.M.L. Nai et al. 


Fig. 19 Young’s modulus 
and yield stresses (0.2 % 
strain offset) of bulk solders 
under quasi- static 
compression 



Sn-37Pb 



SC MC 

Sn-3 . 5 Ag 


a Young’s modulus/GPa □ Yield stress / MPa 


perform the compression tests where specimens were compressed at a strain rate of 
8.3 x 10 -4 s -1 and loaded to 3 % strain (Ong 2005). 

Figure 19 shows that Sn-37Pb bulk solder exhibits a slight increase in Young’s 
modulus when cast at higher cooling rates, while this is the reverse for bulk 
Sn-3.5Ag solder. The Young’s modulus for Sn-3.5Ag is about 60 % higher than 
Sn-37Pb. For the extraction of yield stress, the 0.2 % strain offset method is used to 
standardize yield stress identification. 

Dynamic Compression of Bulk Solders 

The compressive Split-Hopkinson Pressure Bar (SHPB) experimental setup (Hop- 
kinson 1914) is used to determine the dynamic response of solder specimens under 
high strain rate compression. The idea of using two Hopkinson bars to measure 
dynamic properties of materials in compression was developed by Taylor (1946), 
Volterra (1948), and Kolsky (1949). 

The specimens prepared for the Hopkinson bar tests have an aspect ratio of one, 
and specimen lengths range from 2 to 9 mm. Striker bar velocities ranging from 5 to 
15 ms -1 were used with the different specimen lengths to attain strain rates ranging 
from 10 2 to 10 4 s -1 . 

Table 6 shows the mechanical performance obtained from the dynamic com- 
pression of Sn-37Pb and Sn-3.5Ag solders. There is almost no difference in the 
yield stress of dynamically compressed bulk Sn-37Pb solder for the two rates of 
cooling. Dynamic results show distinctly higher stresses than quasi-static ones. 

Sn-3.5Ag solder, on the other hand, shows an increase in average yield stress 
and maximum flow stress with faster cooling rate. MC specimens have a maximum 
flow stress of 180 MPa, while SC specimens only reached a maximum of 150 MPa. 
Examination of the fractured surfaces showed that there is a transition from ductile 
to brittle fracture as strain rate is increased. 

Tension of Bulk Solders 

The transition to lead-free solders is led by the widely adopted Sn-Ag-Cu (SAC) 
eutectic (Henshall et al. 2008). However, some studies have shown that standard 
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Table 6 Results from high strain rate testing of Sn-37Pb and Sn-3.5Ag solders under dynamic 
compression 


Solder 

Cooling 

Average yield stress (MPa) 

Max flow stress (MPa) 

Sn-37Pb 

SC (slow cooled) 

52 

130 at 30 % strain 

MC (moderately cooled) 

53 

140 at 25 % strain 

Sn-3.5Ag 

SC (slow cooled) 

42 

150 at 35 % strain 

MC (moderately cooled) 

47 

180 at 50 % strain 




T = 0 ms 



T - 20 



[=10 ms 

(specimen under tension) 


T = 21 ms (Failure) 


I 


14 ins {Bulk yielding) 


T = 22 ms 



ac 



Fig. 20 High-speed camera photography showing dynamic failure of a bulk solder sample 



SAC alloys such as SAC 405 (Sn-4 %wt Ag-0.5 %wt Cu) have poorer mechanical 
performance than eutectic Sn-Pb under high strain rate conditions (Suh et al. 2007). 

In this section, bulk solder tensile tests data is presented. The tests are performed 
by Su et al. (2010) to characterize the mechanical properties of SAC 105 (Sn-1 %wt 
Ag-0.5 %wt Cu) and SAC 405 (Sn-4 %wt Ag-0.5 %wt Cu) at strain rates ranging 
from 0.0088 to 57.0 s -1 . The bulk solder specimens had a gauge length of 19 mm 
and diameter of 3 mm as shown earlier in Fig. 15. Results show that Sn-Ag-Cu 
alloys exhibit higher mechanical strength with increasing strain rate. Furthermore, 
SAC 105 exhibits about 30 % greater tensile ductility than SAC 405 and Sn-Pb 
solders. Figure 20 shows a series of high-speed camera pictures depicting the 
sequence of the test until specimen failure. 
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Table 7 Material parameters of rate-dependent yield stress and UTS of different solder alloys 


Solder alloys 

bi 

b 2 

Cl 

c 2 

SAC 105 

40.378 

0.0775 

53.093 

0.0810 

SAC 405 

48.741 

0.0596 

67.633 

0.0779 


(a) Effect of Strain Rate on Bulk Solder Material Properties 

It is observed that as the rate of loading increases, the strength of the bulk solder 
material also increases. The trend was similarly observed by both Pang and 
Xiong (2005) and Che et al. (2008). 

For ease of comparison with existing literature, the rate dependence in yield 
stress and UTS is expressed in the following power relationships: 


fyi^Sn-Ag-O.SCu 


Me)* 2 

(2) 

c i( e ) ' 

(3) 


°UTS\Z)sn-Ag-0.5Cu 

The coefficients bj , b 2 , cj , and c 2 are listed in Table 7. 

The coefficients b and c directly relate to the strain rate sensitivity of yield 
stress and UTS respectively. Strain rate sensitivity can be useful in the study of 
the ductile to brittle transition strain rate (DTBTSR) in solder joints, which is 
affected by the sensitivity of bulk solder strength to strain rate. 

(b) Effect of Ag Content on Mechanical Propertiesof Bulk Solder 

The yield stress and UTS of SAC 405 are consistently higher than SAC 
105, while elongation decreases with higher Ag content for lead-free alloys. 
The trend is similarly reported by Che et al. (2008). Sn-Ag-Cu alloys are 
strengthened by the internal stress accumulated due to the difference in elastic 
modulus and volume fraction between Ag 3 Sn and Sn matrix (Reid et al. 2008). 
Higher Ag content will increase the amount of Ag 3 Sn phase (Darveaux and 
Reichman 2006), contributing to the higher strength of SAC 405 (Suh 
et al. 2007). 

(c) Failure Analysis of Fractured Dog Bone-Shaped Bulk Solder Test Specimens 

Figure 21 shows the fractured solder specimens after test. 45° shearing is the 
primary mode of failure at lower strain rates, while necking is mostly observed 
at higher strain rates. 

The study conducted by Su et al. (2010), concluded that solder alloy mate- 
rials such as SAC 105 and SAC 405 exhibited higher mechanical strength 
(in terms of yield stress and UTS) with increasing strain rate as compared to 
that of traditional Sn-Pb solders. Furthermore, rate -dependent properties such 
as mechanical strength exhibited linear relationship with respect to strain rate 
expressed in power scale. For the Sn-Ag-Cu alloys, the bulk solder material 
properties such as strength and ductility are also observed to vary with different 
Ag content. 
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Fig. 21 Photographs of fractured dog bone-shaped bulk solder tensile test specimens, (a) Test 
rate of 10 mm/min, showing 45° shear failure mode, (b) test rate of 22,500 mm/min, showing 
necking failure mode (Su et al. 2010) 


Mechanical Study of Solder Joints 

Mechanical reliability has become an issue (Frear et al. 1999) as electronic com- 
ponents and solder joint dimensions shrink. Solder joints are expected to undergo 
high strain rate conditions especially for products developed for use in shock 
loading environments in the automotive, aerospace (Darveaux and Reichman 
2006), marine and offshore, and portable consumer sectors. The industry has also 
made a transition towards the adoption of lead-free solder alloys, commonly based 
around the Sn-Ag-Cu formulations (Henshall et al. 2008). 

In this section, the results of package pull and shear tests conducted by Su 
et al. (2010) are presented to illustrate the effects of strain rate, silver content in 
Sn-Ag-Cu solder joints, and loading orientation on the reliability of the micro- 
electronic packages. 

Wafer-level chip scale packages (WLCSP) with SAC 105 or SAC 405 alloy 
solder joints are test under room conditions, at 0.5 mm/s (2.27 s -1 ) and 5 mm/s 
(22.73 s' 1 ). Each WLCSP specimen has an array of 12 by 12 solder joints 
sandwiched between a die substrate and a printed circuit board. These WLCSP 
have solder bumps that are 300 pm in diameter and double redistribution layers with 
a Ti/Cu/Ni/Au under bump metallization (UBM) as their silicon-based interface 
structure. 

Results of Solder Joint Array Shear and Tensile Tests 

Solder joints that were fractured during tests were examined to determine their 
modes of failure. Both Darveaux et al. (2006) and Zhao et al. (2007) have identified 
three major modes of solder joint failures for ball grid array packages (BGAs), 
namely, (i) ductile bulk solder failure, (ii) brittle interface fracture at the interme- 
tallic compound (IMC) layer, and (iii) pad matrix failure. Figure 22 shows the 
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Fig. 22 (continued) 
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Fig. 22 Schematic diagrams and fractographs showing (a) bulk solder failure, (b) UBM-IMC 
failure, and (c) pad matrix failure on solder joints (Extracted from (Su et al. 2010)) 


typical failure modes accompanied by cross-sectional schematics to show the 
location of each failure. 

(a) Bulk solder failure. Bulk solder failure occurs by fracturing through the solder 
sphere and is usually detected near the die side interface and can be easily 
identified by the silvery solder residue on the entire pad of the die substrate. The 
failure surface is usually undulating and less smooth than those seen for the 
UBM-IMC failures. 

(b) UBM-IMC failure. The failure crack usually occurs in the intermetallic 
compounds (IMC) which are usually more brittle than the bulk solder (Frear 
et al. 1999) or at the interfaces with the substrate. It can be identified by a 
smooth surface on the die substrate or a characteristic ring step and smooth 
surface at the top of the solder bump. 

(c) Pad matrix failure. Pad matrix failure occurs across the matrix layer of the 
fiber-epoxy polymer composite that makes up the printed substrate or board. It 
can be identified by “cratering” (Henshall et al. 2008) in the board side. 

Bulk solder failure is commonly regarded as a desirable mode of failure to avoid 
catastrophic failures (Darveaux et al. 2006). Wong et al. noted that bulk solder 
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failures correspond to high drop test life of ball grid array packages (BGAs), which 
is more desirable than IMC failures that correspond to low drop test life (Wong 
et al. 2008a). 

Derivation of Average Solder Joint Array Strength 

From the load-displacement raw data, the average stress in the solder joint array 
can be calculated by dividing load over the sum of joint pad area (Darveaux and 
Reichman 2006). The total failure cross-sectional area (A r ) can be obtained as 
follows: 



where i = 1 corresponds to failure mode 1 (e.g., bulk solder failure), i = 2 corre- 
sponds to failure mode 2 (e.g., UBM-IMC failure), etc. a, is the cross-sectional area 
of the failure region in a single solder bump. The number of solder joints that failed 
under each mode {n t ) is determined from the optical images of the fractured solder 
joint array sample after testing. 

The average solder joint array strength (o ave ) is calculated by dividing peak load 
(F max) over the total failure cross-sectional area, A T : 



ave 



F 


max 


A 


T 



Solder joint strength refers to the strength of the entire solder joint that consists 
of a multitude of different materials making up the joint, and thus, the failure of the 
solder joint may not lie within a particular material, but at the interfaces 
(UBM/RDL or solder/IMC). Table 8 lists the solder joint array test results of 
SAC 105 and SAC 405 alloys. 


Table 8 Tensile and shear test results of SAC 105 and SAC 405 solder joint arrays (Data 
extracted from (Su et al. 2010)) 


Solder 

material 

Loading 

orientation 

Test rate 
(mm/s) 

Failure mode count, h . 

Bulk 

solder 

UMB 

IMC 

Cu 

RDL 

Pad 

matrix 

SAC 105 

Tension 

5 

0 

5 

1 

139 

0.5 

0 

33 

1 

111 

Shear 

5 

0 

64 

4 

77 

0.5 

75 

0 

0 

70 

SAC 405 

Tension 

5 

0 

0 

0 

144 

0.5 

0 

6 

0 

139 

Shear 

5 

0 

108 

3 

33 

0.5 

35 

1 

0 

107 
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Effect of Strain Rate 

(a) Ductile to Brittle Transition 

Sn-Ag-Cu solder alloys generally attain higher strength at higher strain rates 
(Pang and Xiong 2005; Che et al. 2008). As the test rate increases, the solder in 
the joints become stronger and more resistant to mechanical loading and the 
applied loading stress will increasingly accumulate at the solder joint interfaces 
(Darveaux et al. 2006) due to their proximity to geometric discontinuities 
leading to brittle interface failures. Darveaux had investigated the ductile to 
brittle transition strain rate (DTBTSR) by performing solder joint array tensile 
tests (Darveaux and Reichman 2006). The ductile to brittle transition strain rate 
(DTBTSR) is defined as the strain rate at which 50 % of the joints fail at the pad 
interface (Darveaux and Reichman 2006). In this study, “ductile to brittle 
transition strain rate” will simply be the strain rate where 50 % of the joints 
fail at the bulk solder. “Interfacial failure” will collectively refer to all the other 
non-bulk solder modes of failure. 

(b) Effect of Strain Rate on Solder Joint Array Strength 

Figure 23 shows the average solder joint array strength of the tested specimens 
(SAC 105 and SAC 405 alloys) at two different test rates (0.5 and 5.0 mm/s) 
under shear and tension. It is observed that the average solder joint array 
strength decreases at higher strain rates, regardless of solder alloy material 
and test orientation. Newman (2005) and Darveaux et al. (2006) had previously 
observed higher strength at higher shear rates in solder ball impact shear tests. 
However, Darveaux et al. also noted that solder joint strength declined at even 
higher strain rates when “interface failures start to occur” (Darveaux 
et al. 2006). 

(c) Effect of Strain Rate on Failure Mode (Under Shear) 

Figure 24 shows the percentage distribution of failure modes observed under 
shear tests. It is observed that there is a decrease in bulk solder failures to 
eventually no such failure at higher strain rates (Darveaux and Reichman 2006). 
UBM-IMC failure is prominent at higher shear application rates due to the 


Fig. 23 Solder joint array 
strength of SAC 105 and SAC 
405 solder alloys at different 
test rates under shear and 
tension (Data from 
(Su et al. 2010)) 
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□ Bulk solder failures ■ UBM-IMC failures □ Pad matrix failures 




Test rate (mm/s) 


Fig. 24 Graphical illustration showing the different failure mode % against test rate under shear 
(Data from (Su et al. 2010)) 


transition from ductile (cohesive) to brittle (interface) failures. Pad matrix 
failure is observed to be increasingly dominant for SAC 105 as the strain 
rates increase for shear. In contrast, SAC 405 has a reduction of pad matrix 
failures but an increase of UBM-IMC failures at the higher strain rate. The 
observed failure modes transition may be a result of the UBM-IMC layer being 
weaker than pad matrix layer at the higher test rate. Figure 25 shows the 
experimental results of SAC 105 superimposed onto the ductile to brittle 
transition strain rate (DTBTSR) model, under shear. 

(d) Effect of Strain Rate on Failure Mode (Under Tension) 

Figure 26 shows the experimental results of SAC 105 superimposed onto the 
DTBTSR model under tension. No bulk solder failure is observed for the two 
test rates which are in contrast to the results obtained from shear. As no bulk 
solder failure is observed under tension, the competing failure is between the 
various interfacial regions such as the UBM-IMC and pad matrix. It was 
observed that there is generally more pad matrix failures at the higher test 
rate of 5 mm/s, which results in correspondingly less UBM-IMC failures. The 
percentage of pad matrix failures increased to almost 100 % at 5 mm/s. 

(e) Effect of Ag Content on Failure Mode 

SAC 105 experiences more bulk solder failures than SAC 405 under 0.5 mm/s 
shear speed. It is expected that higher Ag content will result in higher bulk alloy 
strength under the same test speed; therefore, SAC 105 (1 % Ag) joints are seen 
to obtain lower strength than SAC 405 (4 % Ag) at 0.5 mm/s. For 5 mm/s, the 
trend is contrary as the higher rate has already caused the increase in interface 
failures for SAC 405 that resulted in a decline in overall joint strength. SAC 
405 bulk solder is more load resistant, thereby transferring more loading stress 
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Fig. 25 Experimental result 
of SAC 105 superimposed 
onto the ductile to brittle 
transition strain rate model 
under shear (Adapted from 

(Su et al. 2010)) 


At 0.5 mm/s, 



Fig. 26 Experimental result 
of SAC 105 superimposed 
onto the ductile to brittle 
transition strain rate model 
under tension (Adapted from 

(Su et al. 2010)) 



mm/s mm/s 


Test Rate 


to the interfaces, resulting in more interface failures than in SAC 105 solder 
joints. Kim et al. conducted drop tests and found that IMC cracking at the 
package side was predominant in SAC 405, while SAC 105 had more bulk 
solder failures (Kim et al. 2007). 

(f) Effect of Ag Content on Solder Joint Array Strength 

At higher test rates, solder joint arrays with lower Ag content are better able to 
withstand higher loads before failure occurs. Therefore, solder joints with 
Sn-Ag-Cu alloys of low Ag content are seen to improve the mechanical 
reliability of microelectronic packages under high strain rate conditions by 
reducing stress localizations to material interfaces than would a higher rate- 
sensitive solder like SAC 405. 

(g) Effect of Loading Orientation 

Specimens under tension generally have lower ductile to brittle transition strain 
rate (DTBTSR) than specimens under shear (Zhao et al. 2000). From the results 
of the solder joint array tests, Su et al. (2010) concluded that the average solder 
joint array strength decreased at higher strain rates. It was observed that for 
similar alloys, more interface failures were observed at higher strain rate. High 
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A g content Sn-Ag-Cu joints generally yielded more interface failures and 
lower peak loads than those with lower Ag content. Furthermore, solder joints 
under shear loading were found to produce more bulk solder failures and higher 
average solder joint strength, peak load, and ductility than those under tensile 
loading. 


Mechanical Study of Single Solder Joints 

Solder joint strength testing such as ball shearing (Wang et al. 2003), package peel 
test (Wang et al. 2003; Bragg et al. 2003), and board bending (Lau 1996) while 
useful has certain inherent limitations. Ball shear tests, for example, may show 
wrong failure initiation modes. Similarly, drop impact samples may show solder 
bulk failure (Wang et al. 2003) instead of either pad peel offs or intermetallic failure 
at the solder/pad interfaces which are the usually observed failure modes. In this 
section, the work of Tan et al. (2009) on the testing and failure analyses of single 
solder joints under combined normal and shear loads is presented. 

Micromachining is performed on “as-reflowed” printed circuit board (PCB) 
assemblies to produce the single-joint samples. Figure 27a illustrates the sche- 
matics and dimensions of the fixtures used, and Fig. 27b shows an example of the 
end product after the single-joint specimen is bonded to the fixtures. All joints are 
tested at the rate of 100 pm/min. Loctite™ 4105 kit instant adhesive (cyanoacry- 
late) is used to bond the samples to the fixtures. The samples are then tested on a 
micro-Instron tester. 

(a) Tension/Shear Test Results 

As the loading changes from pure tension to mixed tension/shear and then to 
pure shear for the different batches of test, the joint failure forces decreases. For 
most tests concerning 0° and 30° loading directions, rapid degradation of force 
correlated to pad peel off can be observed, while 60° and 90° tests displayed 
gradual force reduction often revealing solder yield failure in posttest analysis. 



Fig. 27 (a) Schematic diagrams of 4 types of fixture (0°, 30°, 60°, 90°) bonded to single-joint 
specimens, (b) Photo showing a sample bonded on a 60° fixture (Adapted from (Tan et al. 2009)) 
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Fig. 28 Failure mapping and criterion formulation (Extracted from (Tan et al. 2009)) 


(b) Compression/Shear Test Results 

The average results for the various sets of single solder joint tests showed that 
the joint failure forces are the highest for pure tension and pure compression 
under the test rate of 100 pm/min. 

(c) Failure Mapping and Criterion Formulation 

Using the results acquired, a mapping of the joint failure forces such as Fig. 28, 
in terms of tension, shear, and compression components, can be plotted. 

The empirical expression of the failure criterion is as follows: 



where a, b, and c are the constants defining the vertical shift of the plot, the 
averaged value for pure tension/compression failure (performed at 0° and 
180°), and the value for pure shear (performed at 90°). This failure map allows 
component and system manufacturers to design packages and to determine their 
layout locations against overload type of loading environment. 

(d) Failure Analysis of Tested Joints 

0° Pure Tension: Specimens under tension show either gross bulk yielding 
which gradually leads to joint failure or pad peel off that is catastrophic and 
sudden (Fig. 29). Energy dispersive spectroscopy (EDX) did not reveal any 
presence of intermetallics on the failure surfaces of the specimens subjected to 
tensile loading. 

30° Tension/Shear: SEM inspection of the failed specimens (Fig. 30) for 
this loading condition showed that almost all joints fail by bulk yielding near 
the substrate interface, whereby the hillock is characteristically at the leading/ 
trailing edge of the pad in the direction of shear. EDX performed on the failure 
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12 VO X200 tee^ 20 60 SCI 


Fig. 29 (a) Photo showing PCB pad peel off and (b) fractograph showing hillock, after subjected 
to 0° pure tension (Extracted from (Tan et al. 2009)) 





Fig. 30 Fractographs showing (a) an incline view and (b) a close-up view of the fractured 
surface, after subjected to 30° tension/shear (Extracted from ref. (Tan et al. 2009)) 

surfaces revealed fracture occurring at the bulk solder, and no failure at the 
intermetallic is found. 

60° Tension/Shear: The nonuniformity of stress distribution via the appli- 
cation of 60° tension/shear load causes much localized stresses at the pad edges, 
which then causes the joint to fail at substrate interfaces rather than the solder 
bulk. Some failed surfaces (Fig. 31) show failure across the solder/metallic pad 
layer, implying that the fracture surface no longer lies purely at the bulk solder 
region. 

90° Shear: The dominant failure mode for this loading condition is bulk 
solder failure which occurs at very close to the pad interfaces, and no pad peel 
off is observed. 

60° and 30° Compression/Shear and 0° Compression: Optical inspection 
of the failed specimens shows that almost all failures due to compression/shear 
occur at the substrate/solder interface. For 30° and 0° compression loading, the 
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a b 



Fig. 31 (a) Fractograph showing a “smooth” failure surface and (b) EDX elemental analysis of 
the fractured surface, after 60° tension/shear (Extracted from ref. (Tan et al. 2009)) 


specimens do not readily produce a failure surface since the stress states are 
predominantly compressive. General observations show global gross shearing 
and compression of the solder joint. Since for compression tests, only the yield 
strength of the joint is needed to generate the failure map, tests were stopped 
when yield is obvious. 

From the study conducted by Tan et al. (2009), the researchers have dem- 
onstrated that the failure mapping and modes obtained from single-joint tests 
are able to predict board-level joint failure well. This illustrates the independent 
nature of the failure mapping methodology in effectively predicting board 
failures due to combined loading on various board geometries with different 
boundary conditions. 


Temperature Cycling Reliability 

When an electronic package is in service, the solder joints are subjected to a wide 
range of temperatures as a result of the device being powered on and off. As a result, 
these solder joints often fail under cyclic thermal loading as they accommodate the 
difference in thermal expansion (CTE) of electronic materials within the assembly. 
Therefore, in order to predict the reliability of solder joints under service condition, 
it is of paramount importance to investigate the reliability issues of solder joints 
under temperature cycling condition. 

Temperature Cycling Test Conditions 

During the temperature cycling test, the test samples are placed into test chambers 
and are subjected to alternating temperature extremes. It is carried out with the aim 
to test the mechanical reliability of the joints. This is done by designing a test 
vehicle containing daisy chains to form a circuit of typical electrical resistance 
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Table 9 Various 
temperature cycling test 
conditions and cycling 
temperatures (JEDEC Solid 
State Technology 
Association 2005) 



Cycling temperatures 

Test condition 

T mi „ (°C) 

Tmax (°C) 

A 

-55 

85 

B 

-55 

125 

C 

-65 

150 

G 

-40 

125 

H 

-55 

150 

I 

-40 

115 

J 

0 

100 

K 

0 

125 

L 

-55 

110 

M 

-40 

150 

N 

-40 

85 


Test conditions shown in bold are more commonly used in for 
consumer microelectronics products 


0.5- 1.5 £2. The repeated cycling at high and low temperatures subjects the joints to 
fatigue, resulting in crack initiation and propagation in the joints. This causes an 
increase in electrical resistance of the daisy chains in the sample. Both C4 and BGA 
solder joints can be tested in this test. C4 joints are tested in a component level test, 
where the components are not mounted onto PCBs. Large CTE mismatch between 
the die (2.6 x 10 _6 /K) and substrate (20 x 10 _6 /K to 25 x 10 _6 /K) results in large 
shear stresses in C4 joints. This can lead to fatigue failure. Underfill is often used to 
improve the reliability of these joints. To test solder joints, components are 
reflowed onto a PCB and the level two daisy chains are monitored instead of 
level one joints. Several test temperature conditions have been recommended 
(JEDEC Solid State Technology Association 2005), and these are presented in 
Table 9. In electronic packaging of consumer products, test conditions G and J 
are more commonly used. 

Evaluation Study on Five Pad Finishings Using Temperature 
Cycling Test 

Temperature cycling test was carried out on packages with five different pad 
finishings to evaluate their temperature cycling reliability. The substrate pad fin- 
ishes evaluated were (i) electrolytic nickel-gold (NiAu), (ii) organic solderability 
preservative (OSP), (iii) stencil printed solder-on-pad (SoP), (iv) immersion tin 
(ImSn), and (v) electroless nickel-electroless palladium-immersion gold 
(NiPdAu). The loading profile used ranged from —40 °C to 125 °C with ramp 
and dwell times of 15 min each. PCB thickness used was 1.1 mm. 

The number of cycles to failure of each package in each leg was fitted to a 
two-parameter Weibull distribution. The Weibull distribution is commonly used to 
determine mean life of parts for cases where failure occurs as a result of wear-out 
such as solder joints. The two-parameter Weibull probability distribution function 
(PDF) is 
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Fig. 32 Representative Weibull plot showing temperature cycling data from NiAu finishing 
sample (Extracted from (Wei et al. 2008)) 


Table 10 Weibull parameters for the various legs of temperature cycling test (Extracted from 
(Wei et al. 2008)) 


Leg 

Pad finishing 

2 

Package size/no. of balls (mm / 60) 

Weibull parameters 


p 

1 

NiAu 

11 x 10.5 

2912.8 

6.070 

2 

OSP 

11 x 10.5 

1125.0 

6.188 

3 

SoP 

11 x 10.5 

2283.2 

10.488 

4 

ImSn 

11 x 10.5 

2287.2 

10.081 

5 

NiPdAu 

11 x 10.5 

2266.6 

6.734 


M^e- G) 

n W 

where f(t) is the PDF, /? the shape parameter, 77 the characteristic life, and t the time. 

A representative Weibull plot is shown in Fig. 32 for NiAu. The Weibull 
parameters (/? and 77 ) were then extracted and are shown in Table 10. From 77 in 
Table 10, it is clear that NiAu performs the best under TC and OSP the worst. 
Cross-sectional analysis is also performed on failed parts to determine the location 
of failure. A representative cross section of solder joint is shown in Fig. 33 where 
there is a through crack at the top of the joint (Wei et al. 2008). 

Simulation 

Finite element analysis (FEA) of temperature cycling is carried out for the purpose 
of life prediction for BGA solder joint. Typically, commercial FEA software such 
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Fig. 33 Through crack in 
bulk solder at the top of the 
solder joint (Extracted from 
(Wei et al. 2008)) 




Fig. 34 Quarter FEA model of electronic package 


TAT TAA 

as ANSYS or Abaqus is used. The general methodology used in carrying out 
such an analysis is outlined here. 

• Step 1: Creating the FEA model 

A CAD model of the package to be analyzed is usually created and meshed. As a 
rectangular electronic package has two planes of symmetry, a quarter model of 
the package is typically used. Depending on the computational power available, 
a one-eighth model or diagonal slice model may be used for life prediction. 
Typical quarter finite element model used in electronic packaging is shown in 
Fig. 34. 

The details of the solder joints are modeled accurately - especially the 
dimensions of the solder resist opening (SRO) and whether the pad is solder 
mask defined (SMD) or non-solder mask defined (NSMD). Figure 35 shows the 
details of a pad structure of a BGA joint. Failure in temperature cycling occurs 
near the pad (as shown in Fig. 33), and an accurate representation of this region 
will produce better simulation data. 
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Fig. 35 Details of a pad structure of a BGA joint 


• Step 2: Defining Material Properties 

Rate-dependent viscoplastic properties must be defined for the solder alloy in the 
model. The Anand model with nine material properties is commonly adopted. In 
this model, creep is described by the following flow equation (Chang et al. 2006; 
Darveaux et al. 1995): 






where k p is the inelastic strain rate, A is an empirical constant, Q is the activation 
energy, R is the gas constant, f is the stress multiplier, o is the equivalent stress, 
s is the internal state variable, and m is the strain rate sensitivity parameter. 

The evolution equation is given by 



where 





and 






where h 0 is the hardening/softening constant, a is the hardening/softening strain 
rate sensitivity coefficient, is the saturation value of s for a given set of 
temperature and strain rate data, £ is the saturation coefficient, and n is the 
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Table 11 Anand constants for Sn-Pb and SAC 305 solder alloys (Extracted from (Schubert 
et al. 2003; Darveaux 2000)) 


Solder alloy 

^ (MPa) 

Q/R (K) 

A(s-‘) 

£ 

m 

h Q (MPa) 

s (MPa) 

n 

a 

Sn-Pb 

12.4 

9,400 

4e6 

1.5 

0.303 

1,379 

13.8 

0.07 

1.3 

SAC 305 

45.9 

7,460 

5.87e6 

2 

0.0942 

9,350 

58.3 

0.015 

1.5 


s 0 is the initial value of s, used to compute B in Eq. (10) 


Fig. 36 Loading condition 
of three temperature cycles 
which is applied on the model 



TCJ 




Time (min) 


strain rate sensitivity coefficient for the saturation value of deformation resis- 
tance. The Anand constants for Sn-Pb and SAC 305 are given in Table 11. 

In addition to the properties of the solder joint, the orthotropic properties of 
the board and substrate, as well as the CTE values, need to the measured and 
input into the model. 

• Step 3: Defining Loading Conditions 

The model is then subjected to three temperature cycles as shown in Fig. 36. 
Symmetry boundary conditions are specified along the cut boundaries of the 
symmetric models. 

• Step 4: Analyzing the Results 

Once the computer simulation has completed running, the results can be ana- 
lyzed. Typically the Mises stress contour and the creep strain energy density 
contours (shown in Fig. 37) are used to determine the location of the joint with 
the highest damage. 

For the purpose of life prediction, the creep strain energy density (A W) in the 
third cycle is used as the reference stress. A W is given by 



where Aw, is the change in creep strain energy density for element i in the region 
of interest, V t the volume of element /, and n the number of elements in the 
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Fig. 37 (a) Mises stress 
contour and (b) creep strain 
energy density on solder 
joints 


a 


1.02085 

67.9691 



•• 5 : 





1.02085 0iirnrr 15.8983 30.7756 45.653 r „ 60.5304 „ nrnA 

8.45955 23.3369 38.2143 53.0917 67.9691 


b 

.024237 

.243E-11 

1.55969 




.243E-11 


.173299 


.346598 


.519897 


.693196 


.866495 


1.03979 


1.21309 


1.38639 


1.55969 


region of interest. Aw, and V ) are extracted from the software for all elements in 
the region of interest. The A W can then be computed using Eq. 12. 

Volumetric average of creep strain energy is used to reduce the effect of 
singularities at joint corners. The region of interest was defined as the neck of the 
solder joint as shown in Fig. 38. Typically it may consists of one or two layers of 
elements at the top of each joint. 

• Step 5: Correlation with Experimental Results 

The energy density-based life prediction model based on the power law is 
typically used and is shown below (Anand 1985): 

N f = (CAW)~ n (13) 

where Nf is the number of cycles to failure, C is a constant, n is an exponent that 
is close to 1 (Syed et al. 2008, 2007), and AIT is the creep strain energy density 
accumulated per cycle. 
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Fig. 38 Creep strain energy density contour on a single solder joint and region of interest used to 
determine volume-averaged AW 


Correlation with experiments is carried out by plotting number of cycles to 
failure against AW. This plot is then fitted to Eq. 13 so that the constants C and 
n may be determined. Determining the constants based on a larger data set will 
give the model greater confidence. Note that C and n will vary with solder alloy, 
pad finishing, and package type. 

• Step 6: Life Prediction 

Once the constants C and n in Eq. 13 are determined, life prediction can be 
carried out. FEA models need to be created for the packages in question and the 
AW extracted from the models. Using Eq. 13 and the constants determined in 
Step 5, BGA joint life may be calculated. 


Drop Impact Reliability 

With the advancement of technology in consumer products and the increase in 
popularity of portable electronic items like laptops and mobile phones, the failure of 
solder joints through accidental drop impact has increasingly become a concern. In 
an effort to minimize this failure, product-level drop impact testing is actively being 
carried out by the manufacturers of such products. In addition to product-level drop 
testing which simulates field conditions better as a result of the orientation of 
impact, board-level drop test is also carried out. Product-level tests are more 
difficult to analyze as during some drops, the product might bounce and this 
would result in multiple impacts on the product. Furthermore, conducting such 
testing only at the end of the entire assembly chain will be a costly affair if problems 
are encountered and redesign is necessary. Hence, board-level drop tests are first 
carried out on the PCB with the component attached onto it. The PCB is mounted 
onto a drop table using screws, and there is no change in its orientation during the 
test. The impact force causes differential flexing between the board and the chip, 
thus resulting in failure of the joint (Wong et al. 2002, 2008b). 
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Solder Materials 

Lead-free solder alloys (SAC 305 and Sn-3.5Ag) are identified as lead-free replace- 
ments due to their superior creep fatigue resistance. In order to determine the drop 
impact conditions suitable for these solder alloys, these materials need to be 
characterized at higher strain rates (Siviour et al. 2005; Wong et al. 2008c). Drop 
impact strain rates on a joint have been found to range from less than 1-200 s -1 
(Wong et al. 2008c). Various solder alloys were characterized at strain rates ranging 
from 0.005 to 300 s -1 by Wong et al. (2008c). An Instron micro-force tester was 
used at strain rates of 0.005 s -1 and 12 s _1 , while a drop weight tester (involving a 
drop weight that free-falls onto the test specimen) was developed for strain rates 
between 50 and 300 s -1 (Wong et al. 2008c). 

The test results revealed that Sn-3.5Ag and the SAC 305 solders exhibit higher 
flow stress as compared to that of the Sn-37Pb. Sn-3.5Ag and the SAC 305 solder 
are also more strain rate sensitive. When solder is reflowed onto a copper pad, it 
forms a metallurgical joint comprising of the interfacial intermetallic compound 
(IMC), Cu 6 Sn 5 . This is a brittle intermetallic compound which fails suddenly when 
overloaded. The higher flow stress at high strain rates induces stress in the Cu 6 Sn 5 
IMC as the bulk solder does not yield. For the cases of Sn-37Pb and SAC 
101 solders, they exhibit comparable flow stresses which are considerably lower 
than that of Sn-3.5Ag and the SAC 305 solders. As a result, both Sn-37Pb and SAC 
101 solders yield when stressed, resulting in bulk solder failure. The interfacial 
IMC is not subjected to high stress which causes it to fracture. In the case where 
drop impact reliability is required, SAC 101 solder is a good replacement to 
Sn-37Pb solder due to its lower flow stress. Figure 39 shows the stress-strain 
curves for SAC 101, SAC 305, and eutectic tin-lead solder. 

Board-Level Drop Test (BLDT) 

The Joint Electronic Device Engineering Council (JEDEC) standard JESD22-B 111 
(JEDEC Solid State Technology Association 2003) outlines a board-level drop test 
that is commonly carried out. It specifies the test board thickness and dimensions, 
layout, and location of components. It recommends using a drop tower with a 
special base plate for mounting the board with standoffs and four screws - one at 
each corner. When the base plate is released from a certain predetermined height, it 
falls freely onto a surface inducing a repeatable impact force on the base plate. For 
portable mobile products such as mobile phones, the recommended shock pulse 
on the component is a peak acceleration of 1500G and pulse width of 0.5 ms. 
A schematic drawing of the setup is shown in Fig. 40. 

In the study conducted by Wong et al. (2009), board-level drop tests were carried 
out on 12 solder material systems consisting of five solder alloys and two pad 
finishes as shown in Table 12. The cumulative distribution of drops (shocks) to 
failure for each material system is graphically represented in Fig. 41. Three main 
conclusions are drawn from this work. Firstly, longer life corresponds to ductile 
failure, while short life corresponds to intermetallic compound failure. Secondly, as 
flow stress of the solder increases, the drop test life decreases. Thirdly, doping SAC 
101 with minute amounts of Ni is found to enhance the drop test life. 
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Fig. 39 True stress-strain curves of (a) SAC 305, (b) SAC 101, (c) Sn-37Pb 
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Fig. 40 Schematic drawing of a drop test setup 


High-Speed Cyclic Bend Test (HSCBT) 

A tester which applies a sinusoidal cyclic bending directly onto the PCB has been 
developed by Seah et al. (2006). This tester can apply various bending cycles, each 
at different amplitudes - just like the load spectrum experienced by the PCB during 
a drop impact. A schematic drawing of the tester is shown in Fig. 42. 

The 12 solder systems in Table 12 were subjected to high-speed cyclic bend test 
(Wong et al. 2009). The resultant cumulative distribution of cycles to failure for the 
material systems tested is presented in Fig. 43. Results show that this test 
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Table 12 Twelve material systems consisting of six solder alloys and two pad finishes (Wong 
et al. 2009) 


Solder alloy 

Pad finish 

1. Sn-37Pb (Sn-Pb) 

1. Electroless nickel immersion gold 
(ENIG) 

2. Sn-3.5Ag (Sn-Ag) 

2. Organic surface preservative over 
copper (OSP) 

3. Sn-1.0Ag-0.1Cu (SAC 101) 


4. Sn-3.0Ag-0.5Cu (SAC 305) 


5. Sn-1.0Ag-0.1Cu with 0.02 % Ni and 0.05 % In 
(SAC 101(d)) 


6. Sn-3.0Ag-0.5Cu with 0.05 % Ni (SAC 305(d)) 
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Fig. 41 Cumulative distribution of drop test life for the material systems tested (Extracted from 
(Wong et al. 2009)) 
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Fig. 43 Cumulative distribution of cycles to failure for the material systems tested (Extracted 
from (Wong et al. 2009)) 


reproduces the failure modes observed in board-level drop test. There is also a good 
correlation with the drop test for the 12 material systems studied, in terms of the 
replication of failure mode and the performance of material systems. The high- 
speed cyclic bend test is a faster test than the conventional board-level drop test 
taking into consideration the mode of operation of these two testers. 

Ball Impact Shear Test 

Conventional solder joint shear tests performed at low speeds induce failure in bulk 
solder. Ball impact shear test (refer to Fig. 44) uses a high-speed shear tester with 
solder ball shear speeds of up to 1 ms -1 . This high shear speed test induces failure 
in the brittle interfacial IMC layer of the joint, making this test more suitable than 
the conventional shear test, where drop impact reliability is a concern. 

In this study by Wong et al. (2008a), the 12 material system as listed in Table 12 
were subjected to ball impact shear test. The cumulative distributions of the peak 
load and the total fracture energy is shown in Fig. 45. 

Results show that there is no universal correlation between the ball impact shear 
test and the board-level drop test. This is evidently shown in the material perfor- 
mance results presented in Figs. 41 and 45. This is because the failure mode in drop 
test is not consistently reproduced in the shear test. The high-speed shear test is a 
useful tool for testing quality of solder joints and pad finish in a manufacturing 
environment. 

Mechanical Simulation 

Drop impact simulation is also carried out for the purpose of drop life prediction of 
BGA solder joints (Syed et al. 2007). The steps taken for drop impact simulation are 
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Fig. 44 Schematic drawing 
of a shear test setup 
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Fig. 45 Cumulative distributions of peak load and total fracture energy for each material system 
tested (Extracted from (Wong et al. 2008a)) 


similar to that in temperature cycling. Some aspects of the model such as loading 
conditions and material properties need to be varied to capture the behaviors of the 
board and solder joint adequately. 
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Step 1: Creating the FEA Model 

Similar to the temperature cycling simulation, a quarter model of the package is 
created and meshed. 

Step 2: Defining Material Properties 

Strain rate-dependent material properties (as shown in Fig. 39) are defined for 
the solder alloys. 

Step 3: Defining Loading Conditions 

There have been three methods proposed to define the loading conditions in drop 
impact, namely, (i) the free-fall of a drop table, (ii) the input-G method, and (iii) 
the excitation of sample supports. The most tedious is the free-fall of the drop 
table as it requires the entire test setup to be simulated. In this simulation, the 
drop table with the test board mounted on it using the standoffs is modeled and 
subjected to a gravity loading. This method is used when the input acceleration 
pulse is not known. 

Tee et al. (Luan and Tee 2004; Tee et al. 2004, 2005) proposed the input-G 
method. With this method, none of the other supporting structures such as the 
standoffs or the drop table need to be modeled, thus reducing the computational 
time greatly. A measured acceleration pulse, or G-level, is applied to the 
mounting holes of the board. 

The other simulation method, developed by Yeh et al. (Yeh and Lai 2006; Yeh 
et al. 2006), is called the support excitation scheme. Only the test board needs to 
be modeled. However, instead of applying an acceleration pulse as in the input- 
G method, body forces are applied to the test board. The body force is the mass 
matrix of the board multiplied by the acceleration pulse. 

Step 4: Analyzing the Results 

A suitable parameter such as volume-averaged plastic strain or plastic work at 
the neck of the solder joint is then correlated to the number of drops to failure. 
Step 5: Correlation with Experimental Results 

The plastic strain-based life prediction model based on the power law is typically 
used for cases where failure occurs through bulk solder. This is shown in Eq. 14 
(Syed et al. 2007): 


D f =A(s p ) h (14) 

where Df is the number of drops to failure, A is a constant, b is an exponent, and & p 
is the plastic strain. In cases where failure occurs through the intermetallic 
compound (IMC), the plastic strain in Eq. 14 is replaced by peeling stress (Luan 
and Tee 2004). 

Step 6: Life Prediction 

Similar to temperature cycling reliability simulation study, once the constants 
A and b in Eq. 14 are determined, life prediction can be carried out. FEA models 
need to be created for the packages in question, and the plastic strain (e^) is 
extracted from the models. Using Eq. 14 and the constants determined in 
Step 5, the number of drops to failure may be calculated. 
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Summary 

This chapter provides an overview of the role of soldering in electronic packaging 
and the key soldering processes. Driven by environmental concerns of the use of 
lead and legislative implementation to ban lead, solder materials evolve from lead- 
containing to lead-free materials. Furthermore, high-temperature solders and com- 
posite solders are developed in recent years to cater to the harsher service and 
functional requirements of interconnects. The key characteristics which are critical 
to the reliability and manufacturability of solder materials are also discussed. 
Lastly, in order to provide a better understanding of the reliability of bulk solder 
materials and solder joints, reliability studies in terms of mechanical behavior, 
temperature cycling, and drop impact are discussed in this chapter. 
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Abstract 

The chapter gives a brief introduction of adhesive bonding technology. Empha- 
sis is placed on understanding of the fundamental mechanisms of bonding. 
Various types of adhesives are compared, and the techniques to modify polymer 
surface for improving the adhesion strength are described in detail. Examples are 
given to show important applications of adhesive bonding technology in auto- 
motive and aerospace industries. 


Introduction 

Joining of materials and components can be achieved by welding, brazing, and 
adhesive bonding. Compared with welding and brazing, adhesive bonding has the 
advantage of relatively low bonding temperature. It has been increasingly used in 
industry and become as important as the other joining techniques in recent years. 
The chapter aims to give a brief introduction of adhesive bonding technology. 


Fundamentals of Adhesive Bonding 

Adhesion is defined as the interfacial force of attraction between two materials. 
These physical attractions are mainly due to van der Waals forces and electrostatic 
forces. 


Mechanism of Liquid-solid Adhesion 

In liquid penetrant testing, there are usually three surface and interfaces involved, 
the solid-gas interface, the liquid-gas interface, and the solid-liquid interface. 
When a liquid spreads over a solid surface, two conditions must be met. Firstly, 
the surface energy of the solid-gas interface must be greater than the combined 
surface energies of the liquid-gas and the solid-liquid interfaces. Secondly, the 
surface energy of the solid-gas interface must exceed the surface energy of the 
solid-liquid interface. The adhesion mechanism between liquid and solid substrate 
has been explained on the basis of mechanical interlocking, adsorption, diffusion, 
and electrostatic theory. These theories can be used, solely or in combination with 
each other, to describe almost any kind of adhesion phenomenon (Nihlstrand 1996). 

The mechanical interlocking theory of adhesion states that good adhesion occurs 
only when a liquid penetrates into the pores, holes and crevices, and other irregu- 
larities of the adhered surface of a substrate and locks mechanically with the texture 
of the substrate (Allen 2005). Pretreatment methods applied on surfaces enhance 
adhesion (Clearfield et al. 1991) as they result in micro-roughness on the adherend 
surface, which can improve adhesion strength by providing mechanical 
interlocking. Beyond mechanical interlocking, the enhancement of adhesion 
strength is due to the roughing of the adherend substrate. Further, surface roughness 
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increases surface area, improves kinetics of wetting, and increases plastic defor- 
mation for better adhesion (Evans et al. 1979; Jennings et al. 1972). The size and 
shape of the special features created on the surface have an influence on the 
adhesion, providing a tortuous path which prevents the separation of the adhesive 
from the adherend (Fisher 2005). However, this theory is not able to explain the 
good adhesion strength attained in some cases between smooth surfaces. 

The diffusion theory attributes the adhesion of polymeric materials to the 
interpenetration of chains at the interface. The major driving force for polymer 
autohesion and heterohesion is due to mutual diffusion of polymer molecules across 
the interface (Voyutski 1963). To describe self-diffusion phenomenon of polymers, 
several theories have been proposed: entanglement coupling (Klein 1979), 
cooperativity (Edwards and Grant 1973), and reptation (Brochard and De Gennes 
1980). The reptation model (Wu et al. 1986) has been applied to study tack, green 
strength, healing, and welding of polymers. Some evidence may have demonstrated 
that the interdiffusion phenomenon exists in mobile and compatible polymers and 
may promote the intrinsic adhesion. The diffusion theory, however, has found 
limited application where the polymer and adherend are not soluble (Skeist 1990) 
or the chain movement of the polymer is constrained by its highly cross-linked, 
crystalline structure or when it is below its glass transition temperature. 

According to the electrostatic theory of adhesion, the force of attraction between 
the liquid and substrate is attributed to the transfer of electrons across the interface 
creating positive and negative charges that attract one another. The result is the 
creation of an electrical double layer (EDL) at the interface (Allen 2003). These 
electrostatic forces at the interface account for resistance to separation of the liquid 
and the substrate. But, this theory also could not be accepted due to the fact that the 
EDL could not be identified without separating the adhesive bond. Also, as argued 
by many researchers (Roberts 1977; Possart 1988), the effect of the electrical 
double layer on the adhesive bond strength was exaggerated. 

The adsorption theory states that adhesion results from intimate intermolecular 
contact between two materials and involves surface forces that develop between the 
atoms in the two surfaces (Ramarathnam et al. 1992). These forces may be due to 
physical adsorption which is mainly van der Waals forces or secondary forces. 
These adhesion forces may also be due to acid-base interaction or hydrogen bond 
(Allara et al. 1986; Fowkes 1987). 

To obtain good adsorption, intimate contact must be reached such that van der 
Waals interaction or the acid-base interaction or both take place; hence, good 
wetting is essential. 


Wetting 

Wetting ability of a liquid is a function of the surface energies of the solid-gas 
interface, the liquid-gas interface, and the solid-liquid interface. The intermolecular 
bonds or cohesive forces between the molecules of a liquid cause surface tension. 
The adhesion forces between the liquid and the second substance will compete 
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Fig. 1 Contact angle on a 
solid surface (Shikhmurzaev 

1993) 



Table 1 Relationship of contact angle with wettability 


Contact angle Degree of wetting 

Strength of: 

Sol./liq. interactions 

Liq./liq. interactions 

0-0 

Perfect wetting 

Strong 

Weak 

0 < 0 < 90° 

High wettability 

Strong 

Strong 

Weak 

Weak 

90° < 0 < 180 

Low wettability 

Weak 

Strong 

0- 180° 

Perfectly non-wetting 

Weak 

Strong 


against the cohesive forces of the liquid. Liquids with weak cohesive bonds and 
a strong attraction to another material (or the desire to create adhesive bonds) will 
tend to spread over the material. Liquids with strong cohesive bonds and weaker 
adhesion forces will tend to bead up or form a droplet when in contact with another 
material (Park and Seo 2011). One way to quantify a liquid’s surface wetting 
characteristics is to measure the contact angle of a drop of liquid placed on the 
surface of an object. 

According to Young’s equation, the surface tensions (liquid/vapor: y LV , solid/ 
liquid: y S L, and solid/vapor: y sv ) at the three phase contacts are related to the 
equilibrium contact angle 0 through 

Ysv = Ysl + Ylv- C0S ^c (1) 

The contact angle (9 C ), as seen in the Fig. 1, is the angle at which the 
liquid-vapor interface meets the solid-liquid interface. The contact angle is deter- 
mined by the resultant between adhesive and cohesive forces. 

The one important factor that influences adhesion strength is the ability of the 
liquid to spread uniformly on the substrate (Al-Zahrani et al. 2009). 

For spontaneous wetting to occur: 

Ysv > = Ysl + Ylv (2) 

However, this is an ideal case, and it is not possible with any polymer. 

The tendency of a drop to spread over a flat solid surface increases as the contact 
angle decreases. Thus, the contact angle provides an inverse measure of wettability 
(Sharfrin et al. 1960) as shown in Table 1. 


18 Adhesive Bonding Technology 


769 


Physically Hardening Adhesives, Chemically Curing Adhesives, 
Structural Adhesives, and Functional adhesives 

An adhesive or glue is a mixture in a liquid or semiliquid state that adheres or bonds 
items together. Adhesives cure (harden) by either evaporating a solvent or by 
chemical reactions that occur between two or more constituents (Todd 
et al. 1994; Lau et al. 2002). The adhesives are classified into different types. 
A detailed description of the different types of adhesives has been given in Table 2. 

Choice of correct adhesive is critical in order to produce strong and durable 
bonds (Vodicka 1996). Unlike thermoplastics, thermosetting adhesives do not melt 
or flow on heating but become rubbery and lose its strength. The molecular chains 
present in thermosetting adhesives undergo irreversible cross-linking during curing 
(Mays et al. 1992). The most familiar thermosetting adhesives are the family of 
epoxies. Limitations of the above adhesives for joining of different materials have 
been solved by epoxy adhesives, and presently these adhesives are widely used. 

It is evident from literature that thermoplastic resins can withstand temperatures 
comparable to many aerospace graphite/epoxy prepregs (177 °C/350 °F). Adhesives 
that have been used successfully in aerospace are typically epoxy-based adhesives 
with the brands such as FM300 for both PPS and PEEK, FM 37, and FM 87. 

Kinloch AJ (1986) and Kinloch described the use of a wide range of epoxy- and 
acrylic-based adhesives for bonding applications: Hysol 9309.3; FM73M; 
Permabond F241, F245, F246, and V501; and Bostik M890 and M896. An exten- 
sive review of adhesives for PEEK/graphite composite bonding is also given in, and 
it was found that FM300 recorded a close second highest lap shear strength value of 
23 MPa to AF- 163-2 K adhesive (24.5 MPa) when tested at 298 K. 

It is to be noted that aviation and space industries require special types of 
adhesives as one of the main problems with the application of adhesive for 
spacecraft and supersonic and hypersonic aircraft is high temperatures encountered 
by them in space. This is caused by the aerodynamic friction heating of the structure 
as it moves through the air. The temperatures vary from approximately 50 °C at 
Mach 1, 200 °C at Mach 2, 450 °C at Mach 3, and to above 800 °C at Mach 3.5 
(Shaw and Tod 1994). Therefore, the use of basic organic resins as adhesives 
become very difficult above Mach numbers of 2 (Shaw and Tod 1994), and 
consequently, high temperature-resistant adhesives are obviously required. 

An epoxy adhesive, Duralco 4703 is one such option which can withstand harsh 
conditions. The adhesives mixed with hardener are employed on the substrate 
surface in liquid form (Kinloch 1986). Their low viscosity, good wettability, 
penetrating ability, and the presence of highly polar groups in the molecule help 
the liquid to spread over the substrate leading to good adhesive joint especially with 
metals, ceramics, glass, etc. 

Epoxy resin is the most commonly used polymer matrix for advanced composite 
materials. Many attempts have been made to modify the properties of epoxy by the 
addition of either rubber particles so that the matrix-dominated composite proper- 
ties are improved. The addition of rubber particles improves the fracture toughness 
of epoxy, but decreases its modulus and strength. The addition of fillers, on the 
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Table 2 Classification of adhesives 


Types 

Description 

Examples 

General classification 

Thermosetting 

Are cross-linked by strong covalent 
intermolecular bonds, forming one giant molecule 

Epoxies, polyesters, 
polyimides, phenolics 

Thermoplastic 

Molecules of most of thermoplastics combine long 
polymer chains alternating with monomer units 

Polyamides, 
cyanoacrylates, 
polyacrylates, 
polyvinyl acetate 
(PVA) 

Elastomeric 

Consists of long lightly cross-linked molecules 

Natural rubbers, 
silicones, acrylonitrile 
butadiene (nitrile), 
neoprene, butyl, 
polyurethane, styrene- 
butadiene 

On the basis of function 

Structural 

adhesives 

Used to bond materials which must withstand 
loads or stresses acting to compromise structural 
integrity 

Polyurethane, 
polysulfides, polyester, 
acrylic special 
formulations of 
cyanoacrylates, 
anaerobics, and epoxy 
adhesives 

Nonstructural 

adhesives 

Are not capable of supporting appreciable loads 
and are generally required to locate parts in an 
assembly 



On the basis of curing method 
One-part adhesives 


Heat-activated 

curing 

adhesives 

Adhesives of this type consist of a ready mixture 
of two components and harden from melt for 
thermoplastics 

Epoxies, urethanes, 
polyimides 

Light/UV- 

activated 

curing 

adhesives 

Light-activated adhesives are cured under a 
visible or UV light of appropriate wave length. 
Adhesives of this type usually contain photo 
initiators enhancing curing reaction 

Acrylics, 

cyanoacrylates, 

urethanes 

Moisture- 

activated 

curing 

adhesives 

These adhesives are cured when reacting with 
a moisture present on the substrate surface or 
in the air 

RTV silicones, 
cyanoacrylates, 
urethanes 

Pres sure - 
sensitive 
adhesives 
(PSA) 

Adhesives of this type do not cure. Adhesive 
bonding forms as a result of a pressure applied to 
the substrates. The adhesion strength is 
determined by the pressure applied to the 
substrates 

Self-stick tape 

Two-part adhesives 


A two-part adhesive is cured when its two 
(or more) components are mixed. The components 
react chemically forming cross-links of the 
polymer molecules 

Epoxies, urethane, 
acrylics, silicones 


(i continued ) 
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Table 2 (continued) 

Types Description 

Description of common adhesives 


Name 

Description 

Advantages 

Polyurethane 

Formed by cross- 
linking highly reactive 
isocyanates with 
various polyols and is 
frequently used for 
joining 

Shows low viscosity, 
good wettability, and 
penetrating ability 
which help the 
adhesive to spread over 
the substrate surface; 
good cohesive 
strength; high impact 
resistance; and good 
flexibility at low 
temperatures 

Polysulfide 

Obtained by the 
reaction of sodium 
polysulfide with 
organic dichlorides 
such as dichloro 
diethyl formal and 
ethylene dichloride 

Adhere well to almost 
every adherend 

Polyesters 

Reaction products of 
dibasic acids with 
polyfunctional 
hydroxyl materials 

Often used as hot melt 
adhesives 

Acrylic 

Synthesized from a 
wide selection of 
acrylic and 
methacrylic ester 
monomers 

Capability to adhere to 
most metals and 
polymers. This is due 
to their low viscosity, 
good wettability, and 
penetrating ability 
which help the liquid to 
spread over the 
substrate surface 

Cyanoacrylates 

Known as instant 
adhesive or super glue, 
a cyanoacrylate is also 
from the acrylic family 
tree, but having a 
completely different 
cure system 

A very rapid curing 
adhesive, they begin to 
form polymer chains 
immediately on 
contact with the water 
vapor on the surface of 
the part, versatile 


Examples 

Disadvantages 

Processing problems 
which arise due to their 
reaction with water and 
their gaseous nature, 
requiring special 
attention to diffuse the 
catalysts in to the resin. 
Limited temperature 
resistance. Sensitive to 
moisture in cured and 
uncured states 

Poor cohesive and 
shear strength 


Becomes highly rigid 
and shows poor 
adhesion with metals, 
high shrinkage rate in 
comparison to other 
adhesives 

Shows poor cohesive 
and shear strength and 
consequently restricts 
their application for 
joining of materials 


Bonds to skin instantly, 
posing safety risks. 
Unpleasant fumes, 
poor peel strength, 
poor resistance to 
impact loads, have 
limited gap cure, poor 
durability on glass, low 
temperature resistance 
(loses strength at about 
185 °F), and poor 
solvent resistance 


(i continued ) 
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Table 2 (continued) 


Types 

Description 

Examples 

Anaerobic 

Derived from 
methacrylates, 
(Plexiglas), the basic 
ingredient in anaerobic 
adhesives is a special 
liquid of small 
molecules which can 
combine chemically to 
form a polymer or 
group of molecules 

Very versatile and are 
used in a wide variety 
of applications 

Is brittle, limited gap 
filling, cure rate is 
surface dependent, can 
stress crack some 
thermoplastics, and are 
very expensive 


other hand, improves the modulus and strength of epoxy, but decreases its fracture 
toughness. The heat-deflection temperature of epoxy is also improved by the 
addition of fillers. In recent years, micro- and nano-scaled particles have been 
considered as filler material for epoxy to produce high-performance nano compos- 
ites with enhanced properties. 


Surface Modification of Polymer 

Polymers very often do not possess the desired surface properties in terms of strong 
adhesive bonding. They are hydrophobic in nature and in generally exhibits insuf- 
ficient adhesive bond strength due to relatively low surface energy (Shenton and 
Stevens 2001). It is established that for successful application of polymeric com- 
posite materials to form structural parts using adhesive bonding, they need to have 
special surface properties like hydrophilicity (Bhowmik et al. 2006). 

Several conventional methods like chemical treatment, thermal treatment, 
mechanical treatment, which are used for modifying the polymeric surfaces, suffer 
from problems of uniformity, reproducibility, and cost-effectiveness. The need for 
adequate adhesion by activating the polymeric surface without affecting the bulk 
properties of the polymer has resulted in the development of the plasma surface 
modification industry which offer uniform, reproducible, economic, and environ- 
mentally friendly alternative (Goldman et al. 1985). The electrical treatment under 
atmospheric pressure plasma and low pressure plasma (glow discharge) are popular 
techniques which have been employed to modify the surface of polymers. The 
different treatment methods such as (i) chemical treatment, (ii) thermal treatment, 
(iii) mechanical treatment, and (iv) electrical treatment along with their merits and 
demerits are discussed in the following subsections. 


Chemical Treatment 

The influence of chemical environment on polymers includes the surface alteration as 
well as other deep modifications in surface layer. The wetting characteristics of a 
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polymeric surface is improved by introduction of polar groups on the surface when it is 
treated with a heavily oxidative liquid chemical such as chromic anhydride with 
tetrachloroethane, chromic acid with acetic acid, chlorate sulfuric acid, potassium 
dichromate with sulfuric acid, and subsequently exposed under suitable conditions. 
Oxidation may also be carried out with nitrogen oxide, cycloalkyl chromate, potas- 
sium permanganate, and sodium hypochlorite. Chlorosulfonation and flame treatment 
in the presence of halogen could also be used. Bajpai et al. (2002) have reported the 
surface modification of PMMA: PVDF blend system under chemical environment of 
some organic liquids, viz., benzene, toluene, xylene, and acetone. Griffiths (1999) in 
his report has shown that there are a number of chemical reactions which change the 
surface chemistry of PEEK thereby enhancing adhesive bonding strength. 

Carbon fiber filled PEEK was chemically etched by using a composition of 7 g 
K 2 Cr 2 0 7 + 12 g H 2 0 + 150 g H 2 S0 4 . This resulted in an increase of adhesive 
strength. Chromic acid etching increases the bondability of a plastic by introducing 
reactive sites, such as hydroxyl, carbonyl, carboxylic acid, and -S0 3 H groups, to 
the plastic surface and forming rootlike cavities which provide sites for mechanical 
interlocking. The effects of this treatment vary from substrate to substrate. For 
example, increasing the etch time and temperature increases only the etch depth 
when etching polypropylene. On the other hand, both the degree of oxidation and 
etch depth increase with time for polyethylene. All these reactions take place in 
vessels under very strict time-temperature-pressure conditions which are difficult to 
handle and not feasible to be used on an industrial scale. Moreover they require 
expensive waste disposal and may also pollute the environment. 


Thermal Treatment 

Thermal treatment plays an important role in modifying the nature and concentra- 
tion of surface functional groups. Thermal treatments have been used to produce 
activated carbons with basic character, and such carbons were effective in the 
treatment of some organic hydrocarbons. Heat treatment of carbon in an inert 
atmosphere or under inert atmosphere (hydrogen, nitrogen, or argon) flow could 
increase carbon hydrophobicity by removing hydrophilic surface functionalities, 
particularly various acidic groups. 

Thermal treatment also oxidizes the surface of polymers by the introduction of 
polar groups like carbonyl, carboxyl, or amide when the plastics are exposed to a 
blast of hot air. It also utilizes a free radical mechanism accompanied by chain 
scission and some cross-linking which helps in improving the wettability of the 
surface . 


Mechanical Treatment 

Mechanical abrasion process has been used for surface pretreatment of polymers 
because it provides an effective and reliable way of ensuring improvement in 
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adhesion of polymer surface. It serves to increase the surface area of the material by 
“roughening the exposed areas; prior to adhesive bonding.” Surface roughening 
increases bondability by dramatically increasing the number of mechanical 
interlocking sites. Mechanical abrasion is performed by dry blasting, wet blasting, 
or hand/machine sanding. However, for low energy plastics, opposite effect can 
occur. 

Bhowmik et al. (2009) investigated the effect of mechanical polishing prior to 
surface modification of substrates of HDPE and PP sheets by exposure to DC glow 
discharge, on the surface energy, and their adhesive joint strength to steel. 
Mechanical polishing of the HDPE and PP sheets by abrading with 800-grade 
emery paper prior to glow discharge treatment increased the adhesive joint 
strengths over those observed in case of unpolished polymers exposed to glow 
discharge. However, the use of prior mechanical polishing increases the joint 
strength only by a little more than 10 % compared to a five to seven times increase 
in strength observed as a consequence of exposure to glow discharge of untreated 
samples. These processes are operator sensitive, labor intensive, produces dust, 
and difficult to perform on the high-production volumes. The abrasive materials 
which are used to remove particulates or residues are hazardous and require 
disposal. 


Electrical Treatment 

Electrical treatment by generation of ionized plasma is a suitable technique which is 
used to modify polymer surface without affecting the bulk properties. The term 
“plasma” was coined by Irving Langmuir. A distinct fourth state of matter, it is 
broadly defined as partially or wholly ionized gas with approximately equal number 
of positively and negatively charged particles. Ionized means at least one electron is 
not bound to an atom or molecule, converting the atoms or molecules into posi- 
tively charged ions. Plasma contains active species, such as electrons, ions, radicals, 
photons, etc., which initiate chemical and physical modification on the polymer 
surface by making the surface electrically conductive, internally interactive, and 
strongly responsive to electromagnetic fields. Ions and electrons present in plasma 
break polymer chains due to their high kinetic energy. At the same time, free 
radicals of plasma modify the chemical properties of polymers by introducing 
functional groups on to polymer surface. There are mainly two types of plasmas: 
(i) thermal plasma and (ii) cold plasma. 

Thermal plasma is used to destroy solid, liquid, and gaseous toxic halogenated 
and hazardous substances or to generate anticorrosion, thermal barriers, antiwear 
coatings, etc. Cold plasmas are used for surface modifications of materials, ranging 
from the simple topographical changes to the creation of surface chemistries and 
coatings that are radically different with respect to the bulk material. Cold plasmas 
are generated by glow discharges at reduced pressures of 0.01-10 torr, and pressure 
of around 1 torr is sufficient for surface modification of polymers. Low pressure 
plasma using glow discharge and atmospheric pressure plasma using corona 
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discharge are more convenient methods for surface modification of polymers in 
order to enhance their adhesion characteristics, and views of various researchers 
have been outlined in the following sections. 

Low Pressure Plasma (Glow Discharge) 

A glow discharge is an ionized gas consisting of equal concentrations of positive 
and negative charges and a large number of neutral species. It is generated by 
applying a potential difference (of a few 100 V to a few kV) between two electrodes 
that are inserted in a cell (or that form the walls of the cell). The cell is filled with a 
gas (an inert gas or a reactive gas) at a pressure ranging from a few mTorr to 
atmospheric pressure. Due to the potential difference, electrons that are emitted 
from the cathode by the cosmic radiation are accelerated away from the cathode and 
give rise to collisions with the gas atoms or molecules (excitation, ionization, 
dissociation). The excitation collisions give rise to excited species, which can 
decay to lower levels by the emission of light. The ionization collisions create 
ion-electron pairs. The ions are accelerated toward the cathode, where they release 
secondary electrons. These electrons are accelerated away from the cathode and can 
give rise to more ionization collisions. In its simplest way, the combination of 
secondary electron emission at the cathode and ionization in the gas gives rise to 
self-sustained plasma as shown in Fig. 2. 

Due to the various collision processes in the plasma, a large number of different 
plasma species are generated: electrons, atoms, molecules, several kinds of 
radicals, several kinds of (positive and negative) ions, excited species, etc. 
These active species activate the surface of the polymer and cause the surface 
layer to be oxidized or cross-linked enabling them to become significantly active 
for participation in adhesion. There are different variants to glow discharge 
plasma. 

• Direct current (dc) 

• Radio frequency (rf) 

• Microwave (mw) 

In direct current (dc) glow discharge, a continuous potential difference is applied 
between cathode and anode, giving rise to a constant current. However, this setup 
gives problems when one of the electrodes is nonconducting; as due to the constant 
current, the electrodes get charged up which may lead to burnout of the glow 
discharge. This problem is overcome by applying an alternating voltage between 
the two electrodes, as in the capacitively coupled radio-frequency (cc rf) glow 
discharge. The charge accumulated during one half of the cycle is neutralized by the 
opposite charge accumulated during the next half cycle in the rf glow discharge. RF 
plasmas can be sustained at lower gas pressure and are characterized by higher 
ionization efficiency as compared to DC plasmas. 

The majority of plasma processing has been carried out at low pressure in 
a vacuum chamber and viewed as a necessary processing requirement. In principle 
and practice, however, atmospheric pressure plasmas provide a critical advantage 
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Fig. 2 Generation of plasma (Bogaerts et al. 2002) 


over widely used low pressure plasmas as they do not require expensive 
and complicated vacuum systems. Without a vacuum system, the cost of material 
processing is reduced substantially, and material issues related to vacuum com- 
patibility are not of concern. Therefore, the use of atmospheric pressure plasmas 
will greatly expand the current scope of materials processing. 

Atmospheric Pressure Plasma 

Atmospheric pressure plasma is gaining larger acceptance than other plasma 
techniques because they are easy to integrate into existing production lines and 
they can treat specific parts of a substrate selectively. Also, in contrast to most 
corona treatments (and dielectric barrier discharges), atmospheric pressure plasma 
is not limited to flat and thin substrates but can also be used for large three- 
dimensional structures. Atmospheric-pressure plasma (or AP plasma or normal 
pressure plasma) is the name given to the special case of plasma in which the 
pressure approximately matches that of the surrounding atmosphere - the so-called 
normal pressure. 

These are generated by AC excitation (corona discharge and plasma jets). By 
means of high-voltage discharge (5-15 kV, 10-100 kHz) in the plasma jet, a pulsed 
electrical arc is generated. A process gas, usually oil-free compressed air flowing 
past this discharge section, is excited and converted to the plasma state. This plasma 
then passes through a jet head to arrive on the surface of the material to be treated. 
The jet head is at earth potential and in this way largely holds back potential- 
carrying parts of the plasma stream. In addition, it determines the geometry of the 
emergent beam. It is observed that during atmospheric plasma discharge, ion 
bombardment physically and chemically removes oxides, and reducible compounds 
from surfaces and many other contaminations get vaporized. In addition, gas 
molecules are accelerated to an excited state, releasing active chemical-free 
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radicals and UV energy. Free radicals activate chemical reactions on surfaces, 
inducing intermolecular cross-linking. When compared to corona discharges, atmo- 
spheric plasmas produce significantly more homogeneous and uniform surface 
activation across material surfaces, increase the micro-roughness of surfaces, with 
introduction of active species. 


Adhesive Bonding Processes and Test Methods 

Adhesive bonding is prepared essentially with lap shear and tensile configuration. 
In general, materials of high rigidity, peel configuration is practically not possible. 
The detail of lap shear and tensile configurations is described in sections “Lap Shear 
Joint Preparation” and “Butt Joint Preparation,” respectively. 


Lap Shear Joint Preparation 

Rectangular configurations, having dimensions length x width x thickness: 150 x 

Q 

25 x 3 mm , are preferred for tensile lap shear testing. Prior to the preparation of an 
adhesive bonded joint, degassing of adhesive is an essential factor and generally 
carried out under a pressure of 1 Pa for 10 min. However, pressure and time of 
degassing varies subject to viscosity of adhesive. If there is any, excess adhesive 
present at the interface is expelled by mechanical pressing of the joint, which 
resulted in a joint having an adhesive of uniform thickness. Pressure is applied to 
the lap joint during the curing cycle by two binder clips. 


Butt Joint Preparation 

Rectangular specimens, having dimensions length x width x thickness: 150 x 25 x 

o 

6 mm , are preferred for tensile testing with butt configuration. 


Tensile Lap Shear Testing and Tensile Testing 

Lap shear determines the shear strength of adhesives for bonding materials when 
tested on a single-lap-joint specimen. The single lap shear configuration is shown in 
Fig. 3a and is a common joint design for lap shear tensile test, and tensile 
configuration is shown in Fig. 3b. 

Both the lap shear testing and tensile testing of butt joints can be carried out 
using computer-controlled testing machine, ZWICK 2010 or INSTRON as shown 
in Fig. 4, under a load cell of 50 kN. The specimens are loaded in tension at a test 
speed of 5 mm/min. Seven specimens of each joint should be tested and the mean 
value with standard deviation should be reported in the results. All tests should be 
performed at room temperature of 25 °C and at 50 % humidity. 
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Fig. 3 Schematic illustration 
of (a) single lap joint before 
and after application of 
tensile stress and (b) butt joint 
configuration 



Unstressed 


Stressed 





Fig. 4 Tensile testing 
machine 



Adhesive Bonding and Their Applications in Automotive 
and Aerospace Industries 

Adhesively bonded polymeric composite materials are frequently replacing metal- 
lic materials in many applications due to their excellent mechanical, thermal, 
cryogenic, and chemical properties and are being extensively used in automotive 
and aerospace industries not just for their ease in manufacturing and low unit cost 
but also for their potentially excellent tribological performance in engineered 
forms. 

The automotive industry faces increasing pressure to maximize performance 
while minimizing weight and cost. Additional considerations include noise abate- 
ment, new safety standards, and protecting the environment from pollution. The 
recent rise in oil prices has made these technological advances important for 
competitiveness in the global automobile market (Toyota 2005; Fuyuno 2005). 
One way of improving vehicle efficiency is the reduction of vehicle weight as 
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Fig. 5 Comparison of SEA of different materials (Sheshadri 2006) 
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light-weighting of vehicles not only enhances fuel efficiency but also lowers 
vehicle emissions thereby improving driving performance. Safety of passengers 
has always been a major concern for automobile engineers. Materials which have 
high absorption of energy offer much resistance during crash/accidents. This is 
measured in terms of specific energy absorption (S.E.A) (Sheshadri 2006). 

It is given by 


SEA = W/Vp (3) 

where W = total energy absorption, V = volume of crushed material, p = density of 
the material. 

The SEA of different materials is compared in Fig. 5. From the above figure it is 
clear that a structure built with a polymer composite fabricated by adhesive bonding 
is six to eight times safer than a structure built with metals. In event of a crash, the 
energy is absorbed by the structure made of polymer composite through plastic 
buckling. As a result, the impact of the crash is reduced at the other end of the 
structure. 

Many studies have been carried out to explore the possibilities of using adhesive 
bonding in the exterior body panels, frameworks/chassis, bumpers, drive shafts, 
suspension systems, wheels, steering wheel columns, and instrument panels of 
automotive vehicles. Both thermoplastic and thermosetting polymers are being 
used as composites fabricated by adhesive bonding, and they find wide scale 
applications in automotive sector as given in Table 3. 
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Table 3 Polymeric composites fabricated by adhesive bonding for applications in auto parts 


S. no 

Company 

Polymeric composite used 

Application/remarks 

1 . 

Convair 
Division of 
General 
Dynamics 

(1979) 

Graphite and Kevlar fibers (2:1 
by parts) and epoxy resin (32 % 
by wt) 

Reduced the weight of a 
composite GM heavy truck 
frame by 62 % in comparison to 
steel for the same strength and 
stiffness 

2. 

Ford Motor 
Co. (1981) 

Combination of vinyl ester- 
based SMC and XMC and a 
glass fiber-reinforced 
polypropylene 

Material for the development of 
a composite rear floor pan for an 
Escort model 

3. 

Mazdas (1982) 

Hybrid glass/carbon fiber 
composite 

Provided more weight savings, 
lower maintenance cost, reduced 
level of noise and vibration, and 
higher efficiency of the drive 
shafts compared to their metal 
counterparts 

4. 

Ford Motor 
Co. (1985) 
Econoline van 

Graphite composite (20 % 
carbon fiber and 40 % E-glass 
fiber in a vinyl ester resin 
system) 

Drive shaft, corrosion resistant, 
and weighed 61 % less than the 
steel shaft it replaced 

5. 

Pontiac’s Fiero 
(1986) 

Polyurethane (PU), XBR alloy, 
alloyed with an acrylate 

Rear quarter panel 

6. 

Chevrolet 
Corvette Ford 
and GM (1987) 

Fiber glass-reinforced 
polypropylene 

Bumper beams of passenger cars 

7. 

Ford Taurus 
and Mercury 

Stable (1994) 

Glass fiber-reinforced 
copolymer polyacetal 

Front suspension stabilizer. 
Provided a 42 % weight 
reduction and 33 % cost 
reduction over replaced steel 
part 

8. 

Mercedes 

CL500 (2000) 

Polyphenylene oxide/ 
polyamide filled with carbon 
fibers 

Provided charge dissipation 
during the electrostatic painting 
process of mirror housings 

9. 

Porsche’s, 

Carrera 

4 model (2005) 

Du Pont’s Bexoly V 
thermoplastic polyester 
elastomer 

Injection molded front and rear 
fascias 

10. 

Toyota (2007) 

TPO (thermoplastic olefin) 
consisting of polypropylene 
islands in rubber matrix 

Outdoor applications such as 
roofing 

11. 

Skoda Fabia 
(2010) 

Daplen polypropylene 
(PP) grades 

For dashboard, door panels, 
trunk cladding. Provided scratch 
resistance, a balance of stiffness 
and impact strength, thermal 
expansion 


The most common thermosetting polymers that are being used are epoxy, cross- 
linked polyurethanes, unsaturated polyester, and phenol formaldehyde (Al-Zahrani 
et al. 2009). Among thermoplastics, polyethylene (PE), polypropylene (PP), and 
polyvinylchloride (PVC) are commonly used in both plastic and composite auto parts. 
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Thermoplastic resins offer a number of advantages over conventional thermo- 
setting resins like lower cycle time, high service temperature, excellent chemical 
and impact resistance, low coefficient of thermal expansion, excellent fire, smoke 
and toxicity performance, good fatigue performance, low wastage, and recyclability 
(Patel 2009). Their low level of moisture uptake results in less degradation of 
mechanical properties under hot and wet conditions (Al-Zahrani et al. 2009). 
They can be remelted and remolded, therefore, can be recycled which is most 
unlikely for thermosets. They have the ability to create more complex shapes both 
online and in subsequent forming operations. Most thermoset resins are relatively 
brittle, and most thermoplastics are extremely ductile. This ductility gives the final 
composite greater impact resistance and damage tolerance. 

The structural polymer composites made from epoxy and polyester thermoset- 
ting although have many beneficial properties such as low density, good mechanical 
properties combined with good insulation, and environmental resistance; they 
suffered from chemical instability, as the impregnated intermediate or prepreg 
has a limited shelf life (Vodicka 1996). Due to these properties thermoplastic 
polymers are preferred over thermosets for high-performance applications 
(Vodicka 1996). 

Thermoplastics are further classified in to three categories: commodity thermo- 
plastics, engineering thermoplastics (ETP), and high-performance thermoplastics 
(HPTP). A distinguishing feature of HPTP is that their heat-deflection temperatures 
are above 200 °C, which is 50-100 % higher than standard engineering thermo- 
plastics. High-performance thermoplastics (HPTPs) are used in specialized appli- 
cations that require a combination of extraordinary properties. HPTPs show 
superior short- and long-term thermal stability (higher melting point, glass transi- 
tion temperature, heat-deflection temperature, and continuous use temperature), 
chemical and radiation resistance, resistance to burning, and improved mechanical 
properties (stiffness, strength, toughness, creep, wear, and fatigue). 

Most high-performance thermoplastics are semicrystalline in nature as their 
levels of crystallinity never exceed above 90 %. Crystallinity in high-performance 
polymers is an important factor and shows strong influence on chemical and 
mechanical properties. In broad terms, crystallinity tends to increase the stiffness 
and tensile strength, while amorphous areas are more effective in absorbing impact 
energy (Vodicka 1996). 

High-performance engineering thermoplastics that are commonly used in vari- 
ous applications are polyphenylene sulfide (PPS), polyether ether ketone (PEEK), 
polyetherimide (PEI), and several formulations of polyamide, commonly called 
nylon (Amin and Amin 2011). 

Polyether ether ketone (PEEK) (service temperature — 250 °C to +300 °C, tensile 
strength: 120 MPa) is a high-performance thermoplastic polymer which is gaining 
significant interest in aerospace and automotive industries. PEEK is a lightweight 
polymer and offers excellent mechanical performance at high temperatures. It can 
successfully replace metals and other polymers due to its unique combination of 
outstanding wear performance, processing flexibility, and excellent chemical resis- 
tance (Thomas and Visakh 2011). It is evident from Fig. 5 that the carbon/PEEK 
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composite demonstrates the highest specific energy absorption (SEA). Thus, com- 
posites made from PEEK are a better option for applications in automotive industry. 

Aerospace engineers typically specify those materials that can withstand harsh 
environments and reduce manufacturing costs and offer processing flexibility. The 
changeover from the use of aluminum materials to polymers in the aerospace 
industry came from the availability of lightweight polymer composites based on 
reinforcements. 

The use of lightweight composite parts fabricated by adhesive bonding have 
numerous application in aviation such as doors, rudders, elevators, ailerons, 
spoilers, flaps, fairings, etc., in passenger aircrafts such as in Boeing 747 and 
767 (Mazumdar 2002). 

Commercial aircrafts such as Boeing 757 and 767 contain less than 5 % 
advanced composites by weight, while military aircraft like GA-18A contains 
1-20 %. New aircraft designs such as the Boeing Dreamliner 787 and Airbus 
A350 use more composite materials and place a major premium on weight reduc- 
tion consequently significant application of adhesive bonding as such. 

Epoxy adhesives are widely used in structural aerospace applications due to their 
ability to withstand temperature as high as ~250 °C. Epoxies are superior to 
polyester and vinyl ester resins in their resistance to moisture and other environ- 
mental influences (Lubin and Peters 1998). 

Bismaleimide adhesives, like epoxies, are easy to handle and process but are 
more resistant to fluctuations in hot/wet conditions as compared to epoxies. 
Bismaleimides do not generate volatiles during cure, thus resulting in less porosity 
(da Silva and Adams 2005). These materials have high temperature resistance and 
withstand temperatures of 300 °C, but they are very brittle and are prone to micro- 
cracking (Zhuang 1998). 

Polyimide adhesive exhibits better performance than bismaleimide in response 
to hot/wet temperature extremes. They are widely used for high temperature 
applications as some of their grades can sustain temperature over 300 °C. The 
cured materials typically exhibit high glass transition temperatures, good chemical 
resistance, and high modulus and creep resistance (Zhuang 1998). The limitation of 
polyimide is that it is very brittle and release volatiles during curing, which pro- 
duces voids in the resulting composite. 

Adhesively bonded polymers have successfully replaced various metallic com- 
ponents in automotive and aviation industries due to their light weight. However, 
the use of adhesives and polymeric materials in primary structures of an automotive 
and aerospace is still limited due to high temperatures encountered by combustion 
gases in automotive and aerodynamic heating especially in the case of aerospace. 
This limitation can be overcome by the application of high temperature resistant 
adhesives, i.e., dispersing thermally stable nanofillers in to an adhesive. 

The inclusion of nanofillers could increase the decomposition temperature of 
adhesives indicating that the thermal decomposition of the matrix is retarded by the 
presence of the nanoreinforcement. Therefore, dispersion of appropriate nanofillers 
would be of great interest for automotive and aerospace industries as per as 
application of adhesive bonding is concerned. 
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Summary 

Fundamental mechanisms of adhesive bonding are explained in terms of 
liquid-solid adhesion and surface wetting. Various types of adhesives and polymer 
surface modification techniques are described. Proper choice of adhesive and 
surface modification helps to increase the adhesion strength. Adhesively bonded 
polymers have successfully replaced various metallic components in automotive 
and aviation industries due to their lightweight and excellent properties. For high 
temperature applications, the adhesively bonded joints can be improved by the 
application of high temperature resistant adhesives and inclusion of nanofillers. 
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Abstract 

The process and science of machining is introduced here. Machining is a 
versatile process widely used in the manufacturing industry to process raw 
materials of various types to impart shape and finish to products. While typically 
used as a secondary shaping process (primary shaping being done using casting, 
forming, etc.), it is also often used as an all-in-one primary process for 
prototyping. By its very nature machining process removes unwanted material 
to effect the shaping and surface generation. The nature of this material separa- 
tion and its consequences are discussed in this chapter. Energy is needed for such 
material, and some fundamental aspects of how to provide the needed energy for 
causing the material removal are also discussed. The reader will obtain a basic 
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intuitive idea of why the machining processes remove material in a typical 
fashion generating so-called chips, and the factors that affect the quality of the 
shape and surface generated will be highlighted. Some basic nomenclature of the 
process and cutting tools are used to quantify the deformation involved in 
machining and show the process mechanics. Experimental techniques used to 
study this process and some research trends in machining are also shown towards 
the end of this chapter. 


Introduction to This Volume 

A study of the machining process or removing material can be holistically under- 
taken by considering the various process components that contribute to it. These are 
highlighted in the rest of this paragraph. Various raw engineering materials have 
been processed using the machining process. In order to process this set of varied 
materials, a variety of cutting tools (including use of fixed and loose abrasives) have 
been designed and fabricated. The machining process is often facilitated by the use 
of coolant! lubrican t. This has evoked recently the issue of environmen tally friendly 
machining. The entire process of machining needs a solid structure called the 
machine tool to hold and move the cutting tool/work material. Such machine 
tools’ and process dynamics have to be understood and monitored and controlled 
in a high-volume production environment and also their flexibility studied in a 
low-volume high-mix environment. The very act of holding these two forms an 
entire subject of study called fixturing. Programming and use of the machine tools 
have triggered various virtual machining software tools. Besides such tools, the 
process mechanics can be understood using simulation and modeling tools that have 
also been developed and utilized. Machining has also been undertaken at varied 
length scales including at the micro- and nanoscales. It is hoped that the reader by 
undertaking a study of these various components will have a good feel for this 
important aspect of manufacturing. 

Certain aspects of the machining process are not covered in this volume due to 
time and resource constraints. These include the topics of cutting tool materials, 
tool and workpiece holding (fixturing), the field of ultraprecision machining, and 
the recently discussed aspects of microfactory concept. The reader will be referred 
to the literature as and when necessary for these topics. 

Introduction to Machining 

The history of machining goes back to the middle of the eighteenth century when 
there was a need for making metallic parts of the steam engine; it is noted that prior 
to this there were some machining operations used to craft wood. Since then, the 
field of machining has come a long way indeed. The basic need for machining has 
remained the same: machining is a manufacturing process that helps provide shape, 
dimensions, and in some cases properties to the raw material to produce an intended 
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Raw material Functional shape 


Fig. 1 An illustration of shape creation by removing material 


component. The machining process is a versatile method that accomplishes this by 
removing material from a bulk raw material. This is illustrated in Fig. 1. To create 
the functional shape shown on the right, a block of material needs to be removed 
from the raw material on the left. 

Material can be removed using various forms of energy: mechanical, light, 
electrical, chemical, etc., and combinations thereof. Here the focus is on using the 
kinetic energy/dynamic interaction of a moving mass (cutting tool and or work 
material) focused using a pointed cutting edge. Combinations of forms of energy 
with at least one of them in the mechanical form are also considered in this volume 
(called as hybrid processes). Widely used from wood/stone carving involving hand 
motions to ultrahigh precision smooth computer-programmed motion machining of 
precision optics, the machining process essentially involves using a cutting tool that 
is moved using specific motion paths to remove material in areas deemed unwanted 
so that the material left behind is in the shape desired. Machining can effect such 
material removal at different scales - from very large volumes of removed material 
such as in large meter-long aerospace structures and prototype making to very 
miniscule amounts in micro- and nanometer scales. The material removal process 
essentially produces new surfaces (three new ones in the example of Fig. 1); these 
new surfaces when connected in the right geometry and orientation produce the 
desired overall shape. Not only is the shape important in the geometrical sense but 
also the shape is needed along with desired surface condition usually in the form of 
surface roughness and subsurface properties. The machining process is often used to 
impart very fine surface roughness. Various engineering and nonengineering mate- 
rials of varied material properties have been shaped using this process. Engineering 
materials include steels, aluminum alloys, other nonferrous alloys, titanium alloys, 
nickel alloys, polymers, ceramics, etc. Nonengineering materials include food 
materials such as cheese, meat, biological tissues, etc. The very process of removing 
the undesired material may involve a lot of effort especially if tough engineering 
materials are involved; this in turn entails the challenge of using even tougher cutting 
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Fig. 2 Broadly considering, there are various by-products of machining 


tool materials. On the other hand, if soft nonengineering materials are involved, the 
challenge is a different one of finely focusing the mechanical energy by the use of 
extremely sharp cutting tools to obtain the desired material separation effects. Apart 
from the challenge of choosing and designing suitable cutting tool materials, 
substantial other challenges include separating the material without damaging the 
leftover material significantly. In most applications of machining, the removed 
material is often discarded and the leftover shape is what is often desired. In a 
very few cases, the removed material (called chip) can be utilized. Broadly speaking, 
there are many outputs or by-products of machining (Fig. 2). The shaped void space 
that is generated after removing the material is most popular output utilized. Besides 
this, the next popular closely output is the nascent surface generated. Burrs are often 
undesirable outputs that occur due to the nature of deformation. The chip with its 
characteristic microstructure is often discarded but can be utilized in some cases. 
The subsurface deformation often seen as detrimental can be used, under controlled 
conditions, to impart stresses or to harden the surface (e.g., grinding hardening). 

Let us now consider some insights into the machining process and how material 
removal happens. 


Cutting Tool 

The basic shape of the cutting tool is a wedge. The kitchen knife and office scissors 
are examples of common cutting tools that have a wedge shape (Fig. 3) with two 
faces meeting at a sharp edge called the cutting edge. The edge and its sharpness 
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Fig. 3 Basic shape of cutting tool is a wedge. A soft material like cheese can be cut by a thin sharp 
knife, while engineering materials need thicker and sometimes blunt cutting tools 


focus the mechanical energy into the material facilitating removal. While the 
kitchen knife is thin (small wedge angle between the two tool faces) and is 
sufficiently strong enough to cut soft materials, engineering materials are much 
stronger, and hence, there is a need to make the tools stronger both geometrically 
and using materials that are inherently stronger. Geometrical strength can be 
imparted to the cutting tool many ways: 

• By increasing the wedge angle and most practical cutting tools are of this shape 

(Fig. 3). 

• By changing the geometrical shape of the face of the tool; the cutting edge does 
not have to be straight, it can be curved; round-shaped tools typically have high 
strengths against impact (Fig. 4). 

• By altering cutting edge sharpness. A sharp edge is susceptible to breakage, 
while a relatively blunt edge can take impact forces well. 


Besides the few tool geometrical parameters introduced, there are many others 
to be considered in manufacturing the tools, in chip evacuation. Detailed geomet- 
rical treatment of cutting tools can be seen in the literature (Bhattacharya 1984). 
Besides the well-defined geometrically shaped cutting tools, irregular-shaped 
abrasive particles ranging in size from a few millimeters to a few micrometers or 
less can also be used to effect material removal (Fig. 5). When using such tiny 
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Fig. 4 The shape and sharpness of cutting tool are important factors to consider for the strength of 
the cutting tool 


Both regular finite 




Abrasive grain 


Irregularly shaped hard abrasive 


shaped cutting tools 
can irregu lar 
abrasive grains can 
be used in 
machining 


Fig. 5 Irregular-shaped tiny abrasive particles can also be used for removing material in machin- 
ing besides well-defined geometrically shaped cutting tools 
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abrasive grains, the question comes as to how to hold these particles and provide 
them necessary kinetic/impact energy to cause material removal. There are differ- 
ent ways to accomplish this. Processes that use such abrasive grains fall under two 
categories: grinding and polishing. In grinding processes, the thousands of abra- 
sive particles are bound together using binders to form rough geometrical shapes 
such as cylinders. The surface of these geometries will have sharp points of 
abrasives sticking out, and when (typically) rotated at high speeds, the tangential 
speeds provide the needed kinetic/impact energies. In polishing processes, a single 
coating (on paper or cloth) of abrasives or loose abrasives or in the form of slurry is 
used to cause material removal. The needed energy is provided by rotation of a 
backing pad or the abrasives are simply projected onto the work surface using high 
pressure air or gas medium. Various processes that use such abrasives include 
grinding, disk polishing, belt polishing, vibratory media finishing, abrasive jet 
machining, lapping, and ultrasonic machining. A separate chapter in this volume 
considers the use of such processes. 

A vast amount of cutting tool materials have been developed from high- 
speed steels to more popularly used carbides and now ceramics (e.g., aluminum 
oxide) and even cubic boron nitride and diamond-based cutting tools are now 
available. More details on cutting tool materials are available in books and 
handbooks (Oberg et al. 2008; Trent 1991) and from cutting tool manufacturers 
(Kennametal 2013). 


Mechanical Energy 

Just having a cutting tool is not sufficient to cause material removal to occur. 
Energy must be provided; the cutting tool merely focuses the energy via its sharp 
cutting edge onto the work material. The mechanical energy needed is provided in 
the form of motion or kinetic and impact energy, more specifically relative motion 
between the cutting tool and the work material that is being processed. The question 
of how much energy is really needed to cause material removal to occur has been 
extensively investigated although its relation to the cutting speed to be imparted to 
the cutting tool has not been linked very well. In the early days of machining, linear 
motion was the most prevalent form. The cutting tool is moved linearly to cause the 
cutting action to occur (Fig. 6). This is one of the simplest form of cutting actions. 
Depending on how the cutting edge is oriented, the cutting action is called orthog- 
onal or oblique (Fig. 7). 

Orthogonal is the simplest, often used in research to understand process mechan- 
ics, but least practical, while oblique is complex but the most commonly used form 
of cutting in practice. There is however a major limitation to linear motion (Fig. 8). 
Typically the tool has to return back at some point of time to complete cutting in 
one area of cutting and to continue cutting in other areas of the work material. 
This means that the tool has to accelerate, reach a steady cutting speed and ramp 
back down, stop, and turn back. High cutting speeds are not possible with this 
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Fig. 6 Linear cutting motion to remove material 
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Fig. 7 Orthogonal versus oblique cutting 

arrangement since it requires high acceleration and deceleration rates to have a 
sizeable distance of constant cutting speed and length. However, higher cutting 
speeds enable material to be removed at a much faster rate, a much desirable factor 
in production. 
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Fig. 9 Use of rotary cutting 
motion, the most often used 
method of imparting energy 
to the cutting tool and work 
material 


Rotate Tool 


V 
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The speed limitation is overcome by the use of rotary cutting motion (Fig. 9). 
The cutting speed is then simply the tangential speed of the tool at the location of 
the cutting edge relative to the center of rotation. Such rotary cutting motion is the 
most often used method of imparting the relative motion to the cutting tool and 
work material; the rotary action combined with the oblique form of machining is the 
most practiced one. Note that the work material can also be rotated instead of the 
cutting tool. It will be later discussed that linear motions are still needed in 
machining processes to cause the work material to be “fed” into the tool (called 
feed speeds or feed velocity); this motion is not however what causes the actual 
material removal action to occur. 


Material Removal in Smaller Chunks 

Now the question arises on how to perform the material removal using the cutting tool. 
Considering the same example as in Fig. 1, let us brainstorm and think of several ways 
to remove the unwanted material; three methods are illustrated in Fig. 10. The first 
method (Fig. 10a) involves removing the rectangular-shaped block of material by 
removing four thin rectangular areas on the four sides, and this releases the material to 
be removed. The second method (Fig. 10b) involves using a sharp tool to scoop the 
entire material out in one shot. In the third method (Fig. 10c), material is removed in 
the form of a series of small discrete chunks. The first method is adopted when the 
amount of material to be removed is very large and if accessibility allows such thin 
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Fig. 10 Three general methods of material removal to create the desired shape shown in Fig. 1 


slabs of material to be removed all around (e.g., trepanning allows a large through-hole 
to be made using this method). Such thin slabs are in turn usually removed in the form 
of small discrete chunks similar to the third method. 

The second method is adopted when the amount of material to be removed is 
small (micromachining) and also in laboratory conditions for a fundamental under- 
standing of the machining process. In most practical machining operations, the third 
method is most common to keep the material-removal-effort levels manageable, 
and the removed small chunk of material is called a chip. Such chips are seldom in 
the form of well-defined geometries as illustrated in the figure; the chip geometry in 
reality is a function of the complex cutting tool motion path, work geometry, and 
also the deformation behavior of the material. The method of partitioning the 
material into small removal chunks can be a complex geometrical problem to be 
solved to plan the machining process; this is often undertaken by computer pro- 
grams that aid in the planning of the tool motion paths. 

To further illustrate the idea of removing material in smaller chunks, let us 
consider a few examples. A cylindrical piece of raw material needs to be changed to 
a stepped cylinder (Fig. 11). This requires that a cylindrical tube of material needs 
to be removed. This can be removed in many ways. It is very rare to remove this 
cylindrical tube of material in tact as one full piece. In method A, the cylindrical 
tube is removed as series of cylindrical tubes with each tube peeled off in a helical 
fashion just like peeling the skin of an orange. In method B, the cylindrical tube is 
removed in the form of a series of thin tubular disks, each disk in turn being 
removed in the form of a spiral, just like peeling an adhesive tape from its roll. 
Method A is in practice called the turning operation, while method B is a grooving 
operation. It also becomes obvious here that the rotary cutting motion is easily 
provided by rotating the work material rather than the cutting tool. 
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Fig. 11 Illustrating different ways to remove a cylindrical tube of material off a cylinder. This 
process is essentially the turning process 



Fig. 1 2 Illustrating different ways of removing a rectangular slab of material off the surface of a 
block, (a) refers to the planing process, while (b) refers to the milling process 


Let us consider another example of options available in removing material. 
Here, a planar thickness of material needs to be removed from the top of a 
cuboidal- shaped work material (Fig. 12). There are two (or more) ways to do 
this. Figure 12a shows the use of linear cutting motion paths to remove a long 
rectangular thin layer at a time. There are many such layers to be removed. 
Figure 12b shows the use of rotary cutting motion. In this option, the material is 
partitioned into rectangular slabs of material, but each slab is in turn removed in 
smaller chunks by the rotary action of the cutting tool in the form of arc-shaped 
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Fig. 13 The hole drilling process uses opposing wedges that remove the material gradually in a 
helical fashion 

smaller chunks of material. It is clear that the arc shape is due to the rotary cutting 
action and results in smaller so-called chips compared to the linear cutting motion 
which will generate larger chips. 

Even the commonly used drilling process gradually peels away material a little 
at a time in the form of a helix (Fig. 13). Thus, an important tenet of the machining 
process is seen here: material is removed in partitioned smaller chunks and not in 
one clean cut of the entire material to be removed. The main reason for this 
partitioning is the limitations in cutting tool strengths and energy needed for 
material removal to be effected. Why is there such a large need for energy to 
remove material? This is because of the way material is removed in machining as 
will be explained in the following section. 


The Removed Material: Chip 

Consider the examples of material removal shown in Fig. 10. The method shown in 
Fig. 10a, while being least practical, involves the least amount of energy needed to 
remove the entire unwanted material, whereas the two other two methods, while 
being more practical, involve significantly higher amounts of energy. Why is this 
so? In the former, only a small amount of material (representing the thin slabs) is 
actually deformed, damaged, and removed by the cutting process. In the latter two 
cases, the entire volume of material to be removed is not simply removed but 
removed along with being subjected to deformation and damage. This effort also 
results in damage to the surface from which the material is removed. To illustrate 
this point, consider two scenarios shown in Fig. 14. Figure 14a shows material in 
the form of a tape being removed from a surface. In this process, the peeled 
tape separates clearly at the interface, and the tape itself is not damaged during 


799 


19 Science of Machining 



Fig. 14 Two scenarios of material removal: (a) peeling a tape off a surface and (b) removing 
material off a surface using a cutting tool 

removal - its material structure remains more or less the same after removal. 
Figure 14b shows material being removed by machining using a cutting tool 
moved in linear motion. An examination of the removed material shows that the 
entire removed material has a different material structure compared to its condition 
before removal. 

Not only has the micro structure changed, the chip is substantially thicker than 
the thickness of the original material that was intended to be removed. This is quite 
remarkable and is a consequence of the tool geometry and nature of material 
removal. Continuing the tape analogy, consider now that the glue used in the tape 
is now substantially stronger. The effort involved in peeling the tape would not 
distort the tape significantly; something similar seems to happen in machining also. 
In this analogy, the tape and the base material are two different materials. Imagine 
now that the strength of the glue now approaches the inherent strength of the single 
base material so that both the tape and base material are one and the same. One can 
then get a feel for the amount of effort needed to peel off this layer from the base 
material. Such an effort will distort and damage the removed material and even 
perhaps the surface of the base material; both these effects are seen in machining of 
strong engineering materials (Fig. 15). 

Hence, in machining, the material is not just simply peeled off but undergoes 
complex deformation in its entire volume; this causes the energy needed to be 
substantially higher than that needed to simply peel it off. The real nature of the 
deformation involved is better illustrated in Fig. 16. The figure shows a snapshot in 
the middle of a cutting tool machining a steel material. The surface of the steel 
material has been polished and etched to reveal its micro structure. The grains and 
phases are visible as gray and white shades of color. In this figure, as the work 
material travels from the left towards the tool on the right (relatively speaking), the 
micro structure of the material is seen to change substantially as it crosses 
the magenta line sketched in. In an almost invisibly thin zone (represented by the 
magenta line), the material seems to have undergone an extreme amount of 
deformation before becoming the removed chip. 
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Fig. 15 Tape analogy to show how material removal involves distortion and damage 


Fig. 16 Showing nature of 
deformation in the removed 
material in machining 
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Fig. 17 Machining nomenclature 

What is the amount of deformation and what rate at which does this occur? 
To get a feel for this, let us first define some basic nomenclature to use. This is 
shown in Fig. 17. The rake angle, a, and clearance angle, 0, define the tool and its 
angular position. The thickness of material to be removed is t 09 also called as uncut 
chip thickness (thickness before it got cut), while t c is the chip thickness. The ratio 
r of the former to the latter is always less than one since the chip ends up thicker. 
The speed with which the tool is moved relative to the work material is V, and the 
width of the material being cut is w. Chip deformation can be determined by using 
a simple thin shear plane model; the magenta line in Fig. 16 is called the shear 
plane, and the angle it makes with the horizontal is the shear angle <fi. (For more 
complex chip formation models, the reader is referred to the literature (Shaw 

2005).) 

Consider now an element in the work material that approaches the shear plane 
(see Fig. 18). As it crosses the shear plane, the presence of the cutting tool is felt, 
and a shear force F s acts on this element to shear it and pass it over the rake face of 
the cutting tool. The amount of shear strain can be quantified as shown using 
triangle ABC. Values of shear strain range from 2 to 4, representing an enormous 
amount of deformation (compared to common tensile tests of engineering mate- 
rials). By the earlier argument of a very thin shear plane (say, few 10s of micro- 
meters) and cutting velocities in the range of 60-150 m/min, the strain is seen to 
happen at a rate of about 10 4 or higher - again very rapid strain rates (compared to 
tensile tests or even ballistic impacts). The behavior of the material under such 
extreme conditions is often unknown and approximated using behavior at lesser 
known strain rates. 

There are three important consequences of this heavy deformation: temperatures 
can rise significantly in the vicinity of the shear plane, micro structural changes 
occur in the chip, and micro structural changes can also occur in the newly formed 
machined surface. It is expected that around 90 % of the mechanical deformation 
work that occurs here gets converted into heat. Chips can get visibly red hot, and 
temperatures can rise above 700 °C. There are predominantly two sources of heat: Q 1? 
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Fig. 18 Shear strain undergone by the chip as it crosses the shear plane 


Fig. 19 Flow of heat in the 
machining process 



heat that is generated during the shearing process at the shear plane, and Q 2 , 
heat that is generated by frictional rubbing of the chip on the rake face. Part of 
the heat Q1 goes into the work material and part of it into the chip. Similarly, part of 
the heat Q 2 goes into the tool and part of it into the chip. Ideally, to protect both the 
tool and work material, it is preferable to have no heat going into them, but that is 
practically not achievable. Usually about 15 % of heat escapes into the work and 
tool. Highest temperatures in the chip are normally seen on the rake face of the tool 
at the chip-tool interface. 

An issue of significant controversy in machining is the question of whether 
material fractures near the tool cutting edge during the chip formation or not, 
especially in engineering materials such as metals that exhibit ductile behavior. 
That the material has to separate at the line AC (Fig. 18) before it can be sheared 
while seemingly obvious is seriously debated in the literature (Shaw 2005; Atkins 
2009; Madhavan et al. 2000; Fig. 19). 
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Fig. 20 The role of fracture and material separation leading to chip formation. Conditions to 
suppress or induce fracture to better control the machining process 


Material separation by fracture is readily visible while machining brittle mate- 
rials such as silicon, while it is not so easily visible while machining ductile 
materials such as steels. Successful chip formation while machining of brittle 
materials requires the use of fracture suppression techniques (Fig. 20). Using a 
blunt tool and use of hydrostatic pressurized environments are such methods 
proposed. Fracture enhancement techniques to enhance machining of ductile mate- 
rials, while newly proposed in the literature, have been applied very little due to 
practical difficulties. 

Another set of topics of contention regarding the chip formation is that of force 
equilibrium and velocities of flow during machining (Fig. 21). Let us first consider 
the problem of force equilibrium. Based on separating the chip as a separate rigid 
body, three sets of forces can be considered as acting on the chip or on the tool. 
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Fig. 21 Left : force equilibrium diagram. Right : velocity diagram to meet principles of kinematics 


One set represents the frictional rubbing of the chip on the rake face of the tool 
(Ff, Nf ), the second set being the forces acting on the shear plane (F s , N s ), and the 
third set being parallel and perpendicular the direction of cutting velocity (F c , F t ). 
The latter set of forces is commonly measured using force sensors in machining 
experiments. These three sets of forces are in equilibrium, and knowing some of 
these (experimentally or otherwise), one can determine the others. The equilibrium 
of forces shown in the circle form is famously called Merchant’s force circle 
diagram. 

Consider next the problem of velocity equilibrium. There are three velocities 
that are postulated to meet the principle of kinematics and hence must form a closed 
velocity triangle as shown (Fig. 21). The first is the cutting speed, the second being 
the velocity with which the chip moves up the rake face and the third being the 
so-called shear velocity which is the velocity of the chip relative to the shear plane. 
Again, with some known quantities, the unknowns can be determined. While some 
of these concepts have been actively challenged (Astakhov 1999), they are contin- 
ued to be taught in basic manufacturing textbooks (Kalpakjian and Schmid 2008; 
Groover 2013). 


Energy and Power Needed in Machining 

Energy is consumed in machining in four main regions: 

• Shearing the chip 

• Frictional rubbing 

• New surface formation 

• Chip momentum change 

The shear energy, used in deforming the chip intensely, by far is the most 
dominating term, followed by frictional rubbing energy dissipated in rubbing of 
the chip on the tool and the tool on the work surface. Traditional models of 
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Fig. 22 Energy, E, consumed in machining is predominantly in two areas: shearing and friction. 
The energy is also commonly stated in specific cutting energy, u, energy (or rate) per unit volume 
of material removed (or rate - MRR: material removal rate) 


machining have typically neglected the remaining two consumers of energy. The 
machining process by its very nature produces new nascent surfaces, and energy is 
needed to create new surfaces on any material. The simple act of bending the 
material and deflecting it up as the chip on to the rake face also requires energy. The 
argument here being that the energy consumed in shearing and friction completely 
dwarfs the other two energies, and hence, the latter two can be safely ignored. As 
mentioned earlier about the argument of chip formation where there is debate on 
how exactly the material gets separated as the chip, similarly this partitioning of 
energy (neglecting the surface energy of formation) is a topic of some fundamental 
debate (Atkins 2009; Groover 2013; Fig. 22). 


Newly Formed Surface 

As pointed out in Fig. 2, the newly formed nascent surface is an important 
by-product of machining and is critical to the functionality of many manufactured 
components (Griffiths 2001) and hence is worth a closer study. Machining produces 
surfaces of various shapes. Two key motions of the cutting tool vis-a-vis the work 
surface results in this surface generation: cutting speed and feed speed. The cutting 
speed motion is the key motion that provides the needed mechanical energy to 
cause chip formation material separation. As explained in the beginning of this 
chapter, energy and tool strength considerations require us to remove material in 
smaller chunks, and hence, the shape of the cutting tool is such that a given cutting 
motion can remove only a small amount of material in the workpiece. This requires 
the cutting tool to be moved, in addition to this cutting motion, to various regions of 
the work surface - this is called the feed motion; the cutting tool is “hungry” and 
“feeds” on the work material. The two motion paths thus needed are referred to us 
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Fig. 23 Surfaces in machining are generated by a combination of two motions 


as generatrix and directrix (Ghosh and Mallik 1985). These two motions can be 
either provided to the tool or work or relative between both. When these two 
motions are in a straight line path, a planar surface is generated. When the 
generatrix is a circular motion path and the directrix is a straight line in a perpen- 
dicular plane, a cylindrical surface is generated (Fig. 23). If the directrix is a straight 
line but in the same plane, then a set of circles is created which will form a planar 
surface. The reader is left with the following puzzle: if the generatrix and directrix 
are both circular but in perpendicular planes, what shape of surface does the 
machining process generate? 

Besides the shape of the surface generated, the quality of the surface is of a key 
concern. The quality of the surface is determined by both external on-the-surface 
features and internal subsurface (below the surface and inside the material) features. 
External on-the-surface aspects include surface finish, presence of cracks, etc., 
while internal subsurface aspects include stresses, micro structural changes (e.g., 
white layer), etc. A close-up view of a typical newly formed machined surface is 
shown in Fig. 24. Typical scratch marks that are characteristic of the cutting edge 
roughness can be seen replicated on the machined surface. Also seen in this 
particular case are many pits where material seems to have been tom out. This is 
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Fig. 24 Newly machined surface observed in a scanning electron microscope 



V — 60 m/min V — 80 m/m in V = 1 00 m/min 

Fig. 25 Example of microstructural change in the subsurface in hardened steel machined with a 
carbide cutting tool 


a consequence of the chip formation process as the material is heavily deformed and 
fractured ahead of the cutting edge. 

As explained earlier, the chip formation process involves heavy deformation in 
the form of large strains at high strain rates. The effect of these deformations is felt 
in the form of temperature rise and microstructural changes on the work surface that 
is thermomechanical in nature. Microstructural changes can include recrystalliza- 
tion, grain distortion, and phase changes that can occur. Of course the nature of such 
deformation depends on the work material properties (Trent 1991). An example of a 
microstructural change is shown in Fig. 25. When hardened steel (more than 
45 HRc) is machined with a carbide-based cutting tool, intense heat is generated, 
and a combination of this heat with mechanical effects results in the formation of a 
nanocrystalline martensite layer. This layer is brittle and is prone to fatigue effects 
especially if the surface is part of a bearing, for example. This micro structure 
change is seen to happen with higher intensity when cutting speeds are high or 
when the cutting tool wears out even a little. Hence, economic considerations also 
require efforts to reduce the formation of such subsurface changes. 
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Table 1 Experimental techniques used in machining 


Parameter observed 

Typical sensors used 

Forces 

Multicomponent force dynamometers 

T emperature 

Thermocouples, infrared cameras 

Stress waves 

Acoustic emission sensors 

Deformation (in situ) 

High-speed cameras 

Deformation 

(off-line) 

Microscopes (metallurgical analysis), micro-/nanoindentation analysis 

Chip-tool contact 

Optical observations through the tool 

Chip formation nature 

In situ machining inside a scanning electron microscope 

Chip root 

Quick- stop devices 

Cutting tool wear 

Microscope 

Surface generated 

Prohlometers, microscopy 

Process monitoring 

Signal processing techniques associated with sensor signals 


Experimental Investigations of Machining 

Many experimental techniques are used to study the process fundamentals and 
mechanics of the machining process. A few are described here. The reader would 
benefit from exploring the literature in this area (Shaw 2005; Childs et al. 2000). 
Experimental techniques for both direct in situ and off-line observations and 
indirect observations are used to study the process. Typical process signatures 
measured include forces, temperature, deformation in the material, and acoustic 
emissions. Table 1 provides the reader some idea of the various experimental 
techniques used to study the machining process. 

Recent Developments in Machining 

Machining, while seen largely as a traditional manufacturing process, has contin- 
ued to be investigated and explored for research and development in many ways. 
One area of research is in reducing the length scales of material removal. Machin- 
ing at the micro- and nanoscales has recently received widespread attention 
(Online document 2014). Interesting scaling effects are observed in machining 
when the scales drop down to a micrometer or less (Melkote and Subbiah 2013). 
Various aspects of micromachining are also discussed in a separate chapter 
elsewhere in this volume. The idea of using microfactories with tabletop-sized 
machines is also being actively explored (Honegger et al. 2006) with several 
companies developing compact micromachines (Online resource 2014; Online 
resource 2014). New engineering materials (Subbiah and Melkote 2013) being 
developed are constantly being investigated for their ability to be processed by 
machining. This ability, often called machinability, is investigated from many 
fronts: surface generation quality, reduced cutting tool wear, etc. A separate 


19 Science of Machining 


809 


chapter in this volume also discusses various engineering materials that are being 
processed by machining. Novel applications of machining are increasingly being 
considered to generate nanostructures, to embed information on the surface, and to 
generate interesting nano structured materials using the chips formed. Process 
monitoring techniques are increasingly being sought after by the industry in 
association with process automation especially in the field of abrasive grinding 
and polishing. Polishing tools that were hitherto used without any control (some- 
times even used manually) are being investigated for more material removal 
control in automated environments using robots. 


Summary 

This chapter introduced the reader to the basic idea of the machining process as a 
way to shape raw material by removing unwanted material. The reader is initially 
introduced broadly to the various by-products of machining with the void shape 
generated and surface formed being the most sought-after by-products. The funda- 
mentals of the process mechanics are introduced; more detailed process mechanics 
descriptions can be found in the literature (e.g., (Shaw 2005)). One of the important 
overlooked aspects of machining is that material is removed in small chunks; the 
tool typically nibbles away the material gradually. This is a direct consequence of 
the constraints in energy and cutting tool strengths, especially when dealing with 
new high-strength engineering materials. The other important aspect to remember 
is that substantial energy is spent in deforming the chip unnecessarily; this is a 
direct consequence of the tool geometry (again dictated by tool strength require- 
ments) that can be used. The large deformations induced in the chip also affect the 
surface being generated and the life of the tool itself. It is much desired but 
impossible to machine engineering materials with a thin knife-type cutting tool; it 
will save energy and reduce effort enormously. It is also highlighted here that both 
cutting tools of finite well-defined geometry and those in the form of irregular 
abrasive particles are commonly used to cause material removal in various engi- 
neering materials. 
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Abstract 

Machining operations are among the most versatile and accurate manufacturing 
processes in terms of its capability to produce diverse and complex geometric 
features. This chapter takes its focus on the machining processes utilizing sharp 
cutting tools to remove materials from the workpiece by shear deformation. 
The main principle of these processes is providing the relative motions between 
the cutting tool and the workpiece, which is accomplished through machine 
tools. The machine tools are discussed and categorized based on the employed 
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cutting tools: single-point cutting tools, multipoint cutting tools, or grinding 
wheels. Machining centers, which have flexibilities to perform various machin- 
ing operations with different cutting tools on more than one workpiece, are also 
discussed. Drives and controls are responsible to respectively provide and 
regulate the motions of the machine tool components. Thus, industrial designs 
and technologies of spindle drives and feed drives, as well as numerical controls, 
are described in this chapter. Finally, different mechanical designs, in terms of 
the kinematic mechanism to provide the relative motions between the tool and 
the workpiece, have been introduced to the manufacturing floors. Each of these 
designs has penetrated today’s manufacturing floor bringing certain superiority 
as compared to the conventional machine tools. This chapter discusses important 
characteristics of a few of these emerging machine tool designs, namely, indus- 
trial robots, parallel and hybrid kinematic machine tools, and reconfigurable 
machine tools. 


Introduction 

Development of machine tools for manufacturing was closely related to the Indus- 
trial Revolution, which is marked by the successful development of steam engine 
by James Watt in 1776. When he started designing the steam engine in 1763, his 
primary challenge was to make the bore of the cylinder sufficiently accurate to 
prevent the steam from escaping around the piston. The initial cylinder was 
manufactured from sheet metal, which certainly did not work properly. The prob- 
lem was eventually solved by John Wilkinson when he invented a waterwheel- 
powered horizontal -axis boring machine around 1775. This machine is often 
recognized as the first machine tool. 

Since then, machining process and machine tool have always been essential in 
product design and manufacturing. Some even argue that machining is the most 
versatile and accurate of all manufacturing processes in terms of its capability to 
produce diverse geometric features. Further, parts manufactured by casting, 
forming, or other shaping processes often require machining operations before 
being assembled for the final product. This final machining operation is required 
because the parts may have external and internal profiles, as well as sharp corners 
and flatness that cannot be accurately achieved by forming and shaping processes. 
Also, because many engineering components or parts need to be interchangeable, it 
requires control of dimensional accuracy, which can be achieved through machin- 
ing process. Therefore, critical choices have to be made during product design and 
manufacturing planning about the extent of casting, shaping, or forming against the 
extent of machining to be done on a workpiece. 

Machining is the general term used to describe the material removal process 
from a workpiece, which is a wide topic. In the interest of a focused discussion, the 
scope of this chapter is limited to the processes using sharp cutting tools to form a 
chip from the workpiece by shear deformation. This chapter also describes the 
machine tools to perform these machining operations. Besides the operations 
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Fig. 1 Coordinate system for motions of (a) tool and (b) workpiece 


described in this chapter, other mechanisms can be applied to remove material other 
than using sharp cutting tool. These alternative mechanisms may involve the use of 
mechanical (e.g., ultrasonic or water jet machining), thermal (e.g., electrical dis- 
charge, electron beam, laser beam, arc, or oxyfuel machining), electrical, or chem- 
ical energy (e.g., etching) or even a combination of these energies (e.g., 
electrochemical machining). 

The main principle of all machining processes is providing suitable relative 
motions between the cutting tool and the workpiece to produce the characteristic 
geometry. Generally, two kinds of relative motion must be provided either by a 
machine tool or manually; they are called the primary motion and feed motion. The 
primary motion is the main motion that causes relative motion between the cutting 
edge and workpiece. The feed motion is one that when added to the primary motion 
leads to a repeated or continuous chip removal to create a machined surface with the 
desired geometric characteristics. The resultant of these two tool motions is called 
the resultant cutting motion. For example, in cylindrical turning on lathe, the 
primary motion is the rotation of the workpiece on its axis, and the feed motion is 
the motion of the cutting tool along the axis of the workpiece. The secondary 
principle is imparting the shape of the cutting tool to the workpiece in order to 
create part geometry. The two principles are sometimes combined in one operation. 
For example, in slot milling, the primary motion is the rotation of the mill on its 
axis, and the feed motion creates the slot. The width and shape of this slot are 
dependent on the geometry of the cutting tool. 

The International Organization of Standards (ISO 841 2001) has recommended a 
system of machine tool axes to describe the motions of both the tool and workpiece 
shown in Fig. 1. The system was established for the purposes of programming 
numerically controlled machine tools. However, it can also be adopted to describe 
general machine (Boothroyd 1975). 

The three coordinate axes X , 7, and Z refer to possible linear motions of the tool, 
and motions A, B, and C refer to possible rotary motions of the tool about those 
axes, respectively. The Z axis is arranged parallel to the axis of the spindle of the 
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machine, which normally provides the primary motion. If the machine has no 
spindle, the Z axis is arranged perpendicular to the worktable. Motion in the 
positive Z axis increases the distance between the workpiece and tool holder. The 
X axis is parallel to the worktable and, wherever possible, should be horizontal. For 
machine tools with rotating workpieces, the X motion is radial, where positive 
X axis is defined as the tool motion when it moves further off the axis of rotation of 
the workpiece. On machine tools with rotating tools, if the Z axis is horizontal, the 
positive X axis is to the right when viewed from the main spindle toward the 
workpiece; if the Z axis is vertical, the positive X axis is to the right when viewed 
from the main spindle toward the machine column. The positive Y motion is 
selected to complete the right-hand coordinate system. 

For the motion of the workpiece, a similar set of axes are employed. However, 
the corresponding letter is primed for differentiation, and the sign is reversed so that 
both sets of axes use the same sign convention to represent the same relative 
motion. For example, a tool approaching a workpiece can be represented by the 
motion of either the tool in the positive X direction or the workpiece in positive X’ 
direction. It is important to note that the X'TT coordinate system does not follow 
the right-hand rule. 

Section “Machine Tool Operations” describes some common machine tools 
along with the operations, which are grouped in three based on the employed cutting 
tools. These categories are based on the use of single-point cutting tools, multipoint 
cutting tools, and abrasive wheels. The machining processes based on the relative 
motions of the machine tools are correspondingly illustrated. Besides dedicated 
machine tools, this section also describes machining centers, which have flexibilities 
to perform various machining operations with different cutting tools on more than 
one workpiece. Key components of these machine tools based on their functions are 
grouped into the supporting structures, drivers, and controls. Drivers and controls 
will be elaborated in section “Machine Tool Structures.” Section “Nontraditional 
Machine Tools” highlights several emerging machine tools used in industries. 
Finally, concluding remarks will be presented. 


Machine Tool Operations 

Machines with Single-Point Cutting Tools 

Single-point cutting tools are those having one sharp cutting edge attached to the 
shank. The cutting edge is intended to perform cutting, produce chip, and create 
machined surface on the workpiece. The geometry of a typical right-hand single- 
point tool is shown in Fig. 2. 

The tool angles have important functions in cutting operations. The rake angle 
controls the chip flow direction and strength of the tool tip. Positive rake angles 
improve the cutting operation by reducing forces and temperatures, but reduce the 
strength of the tool, as the small angle may cause the cutting edge to chip away. In 
the general case of oblique cutting, there are two rake angles, namely, side and back 
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edge angle angle 

Fig. 2 Geometry of single-point cutting tool 


rake angles. As cutting takes place on the side of the tool, the side rake angle is of 
primary importance. The back rake angle also affects the ability of the tool to shear 
the work material and form the chip. The relief angle controls the tool interfacing 
with the workpiece. Too large relief angle may cause the tool to chip off, while too 
small angle may result in high frictional forces due to rubbing between the flank 
surface and workpiece causing excessive flank wear. The cutting edge angle affects 
the chip formation, tool strength, and cutting forces. The nose radius affects the 
surface finish and tool tip strength. Smaller radius creates rougher surface finish on 
the workpiece and weaker tool tip. However, large radius can lead to excessive 
force and tool vibrations. 

Center Lathe 

Figure 3 shows a center lathe with its principal components. This versatile machine 
tool, which is often called as the engine lathe, is mainly used for low to medium 
production. The term “engine” dates from the time when these machines were 
powered with overhead pulleys and belts, driven by steam engines. Nowadays, 
various types of automation have been added to the lathes to improve efficiency and 
accuracy for repetitive operations. 

A center lathe consists of a horizontal bed or base supporting all other major 
components. The headstock, which is fixed to the bed, is equipped with motors, 
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Fig. 3 Centre lathe (Courtesy: Knuth Machine Tools USA, Inc.) 


pulleys, and V-belts that rotate the spindle, which rotates the workpiece at various 
speeds. Levers on the front of the headstock are to select the rotational speeds. The 
headstock has a hollow spindle to which the workholding devices, such as chucks 
and collets, are attached. Opposite the headstock is the tailstock, which can slide 
along the ways and be clamped at any position. A center is mounted in the tailstock 
to support the other end of a long workpiece. A short workpiece is normally 
supported only by the chuck. 

The single-point cutting tool is held in a tool post fastened to the cross slide. 
The assembly of the tool post and cross slide is referred as the carriage. The 
carriage is designed to slide along the guides in order to feed the tool parallel to 
the axis of rotation. The guides are tracks along which the carriage rides. They are 
made with great precision to achieve a high degree of parallelism relative to the 
spindle axis. 

The feed rod, which is powered by a set of gears from the headstock, rotates 
during the operation of the lathe and provides movement to the carriage and the 
cross slide. The leadscrew is especially useful to accurately machine threads on a 
workpiece. It is engaged with the carriage by a closing split nut. 

Primary motion in a lathe is the rotation of the workpiece ( C' motion), which is 
provided by the spindle. The feed motion can be provided by driving the carriage 
along the bed (Z motion). Alternatively, the carriage can remain stationary; the tool 
post is driven across the carriage ( X motion) using a leadscrew in the cross slide. As 
an example, Fig. 4 shows a cylindrical surface being generated on a workpiece by 
the rotation of the workpiece (— C’ motion) and the movement of the carriage along 
the lathe bed (— Z motion). This operation is known as cylindrical turning. 

The feed motion setting on the lathe is the distance moved by the tool during 
each revolution of the workpiece. Once chosen, the feed setting remains constant 
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Fig. 4 Cylindrical turning on 
centre lathe 



D 


m 


regardless of the spindle speed. The feed/ is defined in mm/rev as the displacement 
of the tool relative to the workpiece. To machine a cylindrical workpiece from one 
end to another, the machining time t m is given by 

_ L 

tm ~fN 

where t m is the machining time in minute (min), L is the length of the machined 
surface in mm, and N is the rotational speed of the workpiece in revolutions/minute 
(rev/min). 

It should be noted that t m is the time for one pass of the tool along the workpiece. 
This single pass does not necessarily mean that the whole machining operation is 
completed. Further, this does not include the time required for tool approach and 
retraction. Machine tools are now designed and built to minimize the total operation 
time by minimizing the number of tool passes as well as the distance and time for 
tool approaching and retraction. As shown in Fig. 4, for each workpiece revolution, a 
ring-shaped layered of material is removed. The cross-sectional area of this layer is 
the product of the feed/ and depth of cut d in mm, which is given by d = ( D 0 — D m )/ 2. 
D 0 is the initial diameter of the workpiece in mm, while D m is the diameter of the 
machined surface in mm. The product of this area with the average circumference of 
the ring, which is nD ay , where D av = ( D 0 + D m )l 2, gives the amount of material 
removed in one revolution. Thus, the material removal rate (MRR) is given by 

MRR = nD ay dfN 

The maximum surface cutting speed is defined as V max = nDJSl, while the 
average one is F av = nD av N. 

Besides cylindrical turning, a variety of other machining operations can be 
performed on a center lathe using single-point tool. These processes, which produce 
parts with radial symmetry but vary in terms of feed motion, are illustrated in Fig. 5: 
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Fig. 5 Other machining operations on centre lathe 


(a) Facing. By moving the cross slide, the cutting tool is fed perpendicular to the 
axis of rotation into the rotating workpiece to produce a flat surface at the end of 
the workpiece or to produce grooves for O-ring seats. 

(b) Taper turning. By moving both the carriage and cross slide, the cutting tool is at 
an angle, thus creating a tapered cylindrical or conical shape. 

(c) Contour turning or profiling. Instead of feeding the cutting tool along a straight 
path, it follows a contour that is other than straight. 

(d) Chamfering. The cutting tool is used to cut an angle on the corner of the 
workpiece, creating a chamfer. 
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Fig. 6 Horizontal boring machine (Courtesy: Bravo Industries) 

(e) Parting or cutting off. Similar to facing, the cutting tool is fed radially into the 
rotating workpiece. This time, the tool is fed somewhere along the length of the 
workpiece, cutting a piece from the end of the workpiece. 

(f) Threading. The cutting tool is fed parallel to the axis of rotation at a large 
effective feed rate to create threads at the outside surface of the rotating 
workpiece. For this operation, the single-point tool has a geometry that shapes 
the thread. 

(g) Boring. The cutting tool is fed parallel to the axis of rotation on the inside 
diameter of an existing hole in the workpiece. 

There are other machining operations that can be performed on a lathe with tools 
other than the single-point one. An example is form turning, which utilizes a 
specially designed multipoint tool called a form tool. Drilling can also be performed 
on the lathe using a drill bit, which is also a multipoint tool. 

Boring Machine 

Similar to turning, boring uses a single-point tool to remove material from a 
rotating workpiece. However, boring is an internal turning operation, which is 
performed on the inside diameter of an existing hole. Consequently, boring can 
be performed on lathes as discussed in the previous subsection in Fig. 5 g. None- 
theless, lathes have limitations in handling heavy and/or non-cylindrical workpiece. 

Boring machine can be horizontal and vertical, which refers to the orientation of 
the axis of rotation of the machine spindle that provides the primary motion. A 
horizontal boring machine is shown in Fig. 6. There are two possible setups in this 
machine. The first setup is depicted in Fig. 7a. Similar to lathe, this setup mounts the 
workpiece on the rotating spindle. The cutting tool is attached to a cantilevered 
boring bar, which is mounted on the worktable so that it can be fed into the 
workpiece. This setup is not preferred because the boring bar must be very stiff 
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Fig. 7 Horizontal boring 
operations with (a) rotating 
workpiece and (b) rotating 
boring bar 
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to avoid deflection and vibration during cutting. Also, this setup still has limitation 
in handling large and heavy diameter workpiece. The second setup is depicted in 
Fig. 7b. Similar to the first setup, the tool is mounted to a boring bar. However, this 
long bar can be supported on its two ends, increasing the tool stiffness. The 
workpiece is mounted on the worktable, which can be moved in the feed and/or 
cross-feed directions. This setup is more versatile as it can handle heavier work- 
pieces. A facing operation can be carried out on a horizontal boring machine, using 
either of the two setups. However, it requires a special tool holder, instead of a 
boring bar. 

A vertical boring machine is shown in Fig. 8, which is used for large and heavy 
workpiece, generally larger and heavier than that of the horizontal boring machine. 
The setup for this machine is shown in Fig. 9. The workpiece is mounted on the 
worktable, which is rotated by the spindle providing the primary motion 
(C’ motion). The cutting tool is mounted on the toolhead, which can be fed either 
perpendicular (Z motion) or parallel ( X motion) to the axis of the workpiece 
rotation. Facing the top surface of the workpiece can also be done using this 
machine. 

Shaping Machine 

Shaping machine is used to yield flat machined surfaces. A horizontal shaping 
machine is shown in Fig. 10. The single-point tool is attached on the toolhead, 
which is mounted on the end of the ram. The ram linkage mechanism provides the 
primary linear motion ( X motion) of the toolhead as shown in Fig. 11. The forward 
stroke of the mechanism performs the cutting, while the return stroke lifts the tool 
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Fig. 8 Vertical boring 
machine (Courtesy: Weaver 
Mabbs Engineering, Ltd.) 
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slightly to provide clearance and then resets the next pass. The crank of the 
mechanism can be either mechanically or hydraulically driven. To minimize 
the overall machining time, the mechanism is designed to yield higher speed on 
the return stroke than on the forward stroke. The length of the stroke can be adjusted 
to achieve the desired machining geometry. On completion of each return stroke, 
the worktable, on which the workpiece is mounted, is fed laterally (Y motion) to 
facilitate the next cutting cycle. The height of the worktable can be adjusted to 
modify the depth of cut. The cutting mechanism of horizontal shaping is shown in 
Fig. 12. Feed on shaping machines is defined in mm/stroke. 

A vertical shaping machine, which is also called a slotting machine, is shown in 
Fig. 13. The machine configuration and cutting mechanism, shown in Fig. 14, are 
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similar to that of the horizontal machine, except for the orientation of the primary 
motion. The ram mechanism of this machine gives vertical forward and return 
strokes to the cutting tool (Z motion). Thus, this machine is used to yield flat vertical 
surfaces. 
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Fig. 13 Vertical shaping machine (Courtesy: Ajax Machine Tools International, Ltd.) 


Fig. 14 Vertical shaping 
mechanism. 


Primary 



Planing Machine 

The shaping machine is unsuitable for generating flat surfaces on very large 
workpiece because of the limitations on the stroke and overhang of the ram 
mechanism. This problem is solved in the planing machine by applying the linear 
primary motion to the workpiece (X’ motion), instead of the tool. A two-column 
planing machine is shown in Fig. 15. The toolhead is mounted on the crossrail, 
which facilitates appropriate intermittent feed motion ( Y motion). The column 
supports this crossrail. Each column has a feed rail, which facilitates vertical 
motion (Z motion) that controls the depth of cut. A side toolhead, which moves 
along the vertical column, can be installed if machining the side of the workpiece is 
required. Multiple (more than two) toolheads can be mounted either on the crossrail 
or column to allow multiple cuts on each pass. The cutting mechanism of planing is 
shown in Fig. 16. 
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Fig. 15 Planing machine (Courtesy: XingTai XinRong Machinery Co., Ltd.) 


Fig. 16 Planing mechanism 
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Machines with Multipoint Tools 

A multipoint cutting tool has a series of two or more cutting edges. These tools have 
a variety of designs dependent on the machine tools and machining process, most of 
which will be covered on this section. Nonetheless, the geometric terms, such as the 
rake and flank angles and surfaces, defined earlier for the single-point tool are still 
applicable to each cutting edge of a multipoint tool. The cutting action at a selected 
point on one cutting edge engaging the workpiece at a given instant is also the same. 
Majority of multipoint tools are to be rotated. Thus, they have either a taper 
(conical) or parallel (cylindrical) shank for holding purposes or a bore through 
which a spindle or arbor can be inserted. 

Drilling Machine 

The most common form of drilling machine is the bench drill shown in Fig. 17. As 
its name implies, this machine is normally bolted down to a bench. Workholding is 
normally accomplished by clamping the workpiece in a vise, fixture, or jig onto the 
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Fig. 17 Bench drill (Courtesy: Ozito Industries) 


worktable or onto the base. The worktable can be moved up and down the vertical 
column and then clamped at the preferred height. 

The cutting tool is normally mounted on a three jaw chuck, which is rotated by 
the spindle as the primary motion (C motion). The chuck can be moved up and 
down manually by a feed handle via a rack and pinion mechanism, which provides 
the continuous feed motion (Z motion). The workpiece is always kept stationary 
during each machining cycle. On a bench drill of Fig. 17, the workpiece must be 
manually shifted to drill other locations. 

Drilling is commonly performed using a twist drill to generate an internal 
cylindrical surface. An example of cutting process using twist drill is shown in 
Fig. 18. This cutting tool normally has two cutting edges, each of which ideally 
removes an equal amount of workpiece material. Drilling using the tools presented 
herein can also be performed on a lathe. The difference is that in lathe, the primary 
rotating motion is provided to the workpiece, while the tool is fed into the 
workpiece. 

For convenience, the cutting speed V in drilling is generally defined in mm/min 
as the speed at the outside diameter of the twist drill: 

V = nDN 

where D is the diameter of the twist drill in mm and N is the spindle rotational speed 
in rev/min. It is defined so for convenience, although much cutting process also 
occurs at lower speed close to the axis of tool rotation. The cutting speed is zero at 
the tip of the twist drill, which is in the form of a short chisel edge. In drilling a new 
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Fig. 18 Drilling mechanism 



Primary motion ( C) 



hole, this chisel edge must force its way through the workpiece, resulting in a poor 
cutting condition. However, because this contact area is going to be removed, it has 
little effect on the quality of the hole produced by the cutting edges. The chips 
removed by the cutting edges take a helical form and travel up the helical flutes. 
Feed /in drilling is specified in mm/rev. The machining time is given by 

_ L 

tm ~w 

where L is the distance traveled by the cutting tool in mm. The material removal 
rate (MRR) can be obtained by multiplying the cross-sectional area of the produced 
hole with the feed rate: 


MRR = "-DifN 

where D m is the diameter of the machined surface in mm. If an existing hole of 
diameter D 0 is to be enlarged: 

MRR = \ K - D 2 o )fN 

Several other machining operations can be performed on a drilling machine. The 
most common ones are illustrated in Fig. 19: 

(a) Reaming. The cutting tool, the reamer, is similar to a drill but has several 
cutting edges with straight or helical flutes. It is intended to remove a small 
amount of workpiece material, but it improves the accuracy and surface finish 
of a previously drilled hole. 
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Fig. 19 Machining operations related to drilling 


(b) Counterboring. Counterboring provides a stepped hole, in which a larger 
diameter hole concentrically follows a smaller diameter one. A counterbored 
hole is used to seat bolt heads into a hole so that the heads do not protrude above 
the surface. 

(c) Countersinking. This process is similar to counterboring, but a countersunk 
hole is cone shaped to seat flat head screws and bolts. 

(d) Center drilling. This process is also called centering, which is to produce a 
shallow, conical hole with clearance at the bottom. This center hole can guide 
the subsequent drilling to prevent the drill tip point from wandering as the hole 
is started. 

(e) Spot facing. It is intended to provide a flat surface around the end of a hole, 
which is perpendicular to its axis. This flat surface is used to seat a washer 
or nut. 

To drill many holes on heavy workpieces using larger drills, a radial-arm drilling 
machine, shown in Fig. 20, is normally used. The workpiece is located on the base 
of the machine. The drilling head and motor can be positioned along an arm. 
Further, this arm can be positioned along the vertical column and swung in a 
horizontal plane about the column, allowing the tool to reach significant distance. 

Taps and Dies 

Internal screw threads on an existing hole are produced by tapping. A tap is a 
threading tool with multiple cutting teeth, as shown in Fig. 21. Each cutting edge 
removes a small layer of material to form the thread. The fully shaped thread on the 
tap serves to clear away the chips. Similar to drilling process, tapping operation is 
performed by providing rotating primary and linear feed motions either on the 
cutting tool or workpiece. Thus, this operation can be done on a drilling machine or 
a lathe, respectively. However, since most tapping operations require low rotational 
speed, they are often performed manually, in which the tap is attached to a T-handle 
tap wrench, either via a chuck or a gripper, as shown in Fig. 22. 
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Fig. 20 Radial drill press (Courtesy: Knuth Machine Tools USA, Inc.) 


Fig. 21 Tap (Courtesy: 
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A die is shown in Fig. 23. It has the same cutting action as a tap but is designed to 
produce external threads on a cylindrical surface. Most of these operations are also 
performed manually. The die wrench is shown in Fig. 24. Nonetheless, external 
threading using a die can be carried out on a lathe, in which the tailstock is replaced 
with a turret that holds the cutting tool (i.e., the die). 

Milling Machine 

Milling is considered as the most versatile machining process capable of producing 
various three-dimensional shapes. In milling, the rotation of the multipoint cutting 
tool, which is the primary motion, has an axis that is perpendicular to the direction 
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of the feed motion. This is a main distinction of this process from drilling. 
The cutting tool in milling is called a mill. The cutting edges are called teeth, 
which can be either a solid extension of the mill or indexable inserts attached to the 
cutter body. Milling is an interrupted cutting process as revealed by the generated 
short chips. On the contrary, the previously described machining processes are 
fairly continuous ones that produce long chips. Thus, the material and geometric 
properties of mills are to be designed to withstand repetitive cycles of impact forces 
and thermal shocks. 

Based on the orientation of the mill with respect to the machined surface, 
milling operations can be categorized as peripheral and face millings, which are 
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Two basic milling operations: (a) peripheral milling and (b) face milling 


shown in Fig. 25. In peripheral milling, the axis of the tool is parallel to the 
machined surface. In face milling, the axis of the tool is perpendicular to the 
machined surface. 

Based on the direction of the feed motion with respect to the rotation of the 
cutting tool, milling operations can be categorized as up and down millings, which 
are shown in Fig. 26. In up milling, also known as conventional milling, when 
cutting the workpiece, the teeth motion is in the opposite direction to the feed 
motion. In down milling, also known as climb milling, the teeth motion is in the 
same direction as the feed motion during cutting. Consequently, the chip formed in 
up milling increases in thickness, while the one in down milling reduces throughout 
the cut. Despite exaggeration in Fig. 26, the chip of up milling is longer than that of 
down milling (Groover 2007). 
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Fig. 26 Two basic milling 
operations: (a) up milling and 
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Fig. 27 Horizontal milling machine (Courtesy: CNC Establishment) 

Column-and-knee-type machines are the most commonly used ones for general 
purpose milling. Herein “column” and “knee” refer to the two main components of 
these machines. They can be classified as either horizontal or vertical milling 
machine based on the orientation of the main spindle. The aforementioned catego- 
rizations of milling operations in Figs. 25 and 26 remain valid for these two 
machines. A horizontal milling machine is shown in Fig. 27. The key components 
of this machine, which serve the same functions as those of the vertical milling 
machines are: 

• Worktable, on which the workpiece is mounted. The worktable moves longitu- 
dinally with respect to the saddle. 

• Saddle, which supports the table. The saddle moves transversely. 

• Knee, which supports the saddle. The knee moves vertically to adjust the depth 
of cut. 
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Fig. 28 Peripheral milling operations 


• Column, which support the spindle. 

• Head, which contains the transmission system (e.g., belts and pulleys) from the 
spindle motor to the spindle nose. 

The arbor is a shaft that supports the mill and driven by the spindle. If necessary, 
some spacers can be mounted on the arbor to separate two or more mills. The 
overarm is to support the arbor. For a horizontal milling machine, the table provides 
the X ’ motion, the saddle provides the Z’ motion, and the knee provides the T 
motion. 

The horizontal milling machine is well suited for peripheral milling. Several 
processes related to peripheral milling can be performed on this machine based on 
the size and position of the mill with respect to the workpiece. The operations, in 
which the feed motion is either into or out of the paper, are shown in Fig. 28: 

(a) Slab milling. This is the simplest form of peripheral milling, on which the width 
of the tool extends beyond both sides of the workpiece. 

(b) Side milling. The mill, which can either be wider or narrower than the work- 
piece, is used to machine the side of the workpiece. 

(c) Slot milling. The mill, which is narrower than the workpiece, is used to create a 
slot in the workpiece. If a thin mill is used to cut the workpiece in two, the 
process is called saw milling. 

(d) Straddle milling. This process is the same as side milling, but two mills are used 
at the same time to cut both sides of the workpiece. 


The shape of the mill determines the shape of the machine surface, which is 
referred as form milling. Various shapes of mills that can be mounted on the arbor 
of a horizontal milling machine are shown in Fig. 29. Cylindrical mill in Fig. 29a is 
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Fig. 29 Mills for peripheral milling (Courtesy: Tool Masters) 


suitable to create flat surfaces, angle mill in Fig. 29b is suitable to create angular 
surfaces (e.g., chamfers and serrations), convex mill in Fig. 29c is suitable to create 
concave surface, and concave mill in Fig. 29d is suitable to create convex surface. 

The cutting speed V for milling in mm/min is the peripheral speed of the mill, 
which is defined as 


V = nDN 

where D is the diameter of the mill in mm and N is the spindle rotational speed in 
rev/min. The feed /in milling is usually defined in feed/tooth, which represents the 
size of chip removed by each tooth measured parallel to the feed direction (i.e., the 
chip load). The relationship between feed/tooth and feed rate/-, which is the linear 
speed of the workpiece in mm/min, is defined as 

fr = Nn f 


where n is the number of teeth. 
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Fig. 30 Relative motion between a slab mill and the workpiece 


The machining time of a workpiece of length L in mm must consider the distance 
traveled by the mill before fully engaging the workpiece. For a slab milling with 
depth d in mm, as shown in Fig. 30, the distance for the center of the mill to reach 
the closest edge of the workpiece is given by 

1= y/d(b - d) 


Thus, the machining time is 



/ + L 



The material removal rate in milling is determined by the product of the 
machined cross-sectional area and the feed rate. For a slab milling in Fig. 30 cutting 
a workpiece of width w in mm, the material removal rate (MRR) is defined as 


MRR = wdf r 

It is important that the MRR formulation neglects the initial entry of the mill 
before the full engagement with the workpiece because it is based on the assump- 
tion that / is much smaller than L, which is not generally true. This equation can be 
applied to the other milling operations by making the proper adjustment in the 
computation of the machined cross-sectional area. 

A column- and-knee-type vertical milling machine is shown in Fig. 31. 
The table provides the A’ motion, the saddle provides the T motion, and the knee 
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Fig. 31 Vertical milling machine (Courtesy: Harbor Freight Tools) 


provides the Z’ motion. Similar to its horizontal counterpart, these degrees of 
freedom allow machining of horizontal, vertical, and inclined surfaces. 

The vertical milling machine is well suited for face milling. Several processes 
related to face milling, which can be performed on this machine, are shown in 
Fig. 32: 


(a) Conventional face milling. In this process, the diameter of the mill is longer 
than the width of the workpiece. 

(b) Shoulder milling. The mill overhangs at the side of the workpiece on which the 
machining takes place. Hence, this process creates a shoulder feature. 

(c) End milling. The diameter of the mill is less than the width of the workpiece, 
creating a slot in the workpiece. 

(d) Profile milling. The periphery of the mill is used to machine the outside 
periphery of the workpiece. In contrast to the previous three milling process, 
the end of the mill does not touch the workpiece. 

(e) Pocket milling. This is a derivative of end milling, in which the end mill is used 
to mill shallow pockets. 

(f) Surface contouring. This process uses a ball-end or ball-nose mill to 
machine three-dimensional surfaces (Dhupia and Girsang 2012). To do so, 
this end mill is fed with respect to the workpiece across a curvilinear path at 
close intervals. 


The cutting tools for the vertical milling machine come in various shapes and 
configurations. Each tool is specially designed to optimize a specific milling 
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Fig. 32 Facial milling operations (d), (e), and (f) (Courtesy: Sandvik Coromant) 


Fig. 33 Face mill (Courtesy: 
Kennametal, Inc.) 
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operation on certain materials. A face mill in Fig. 33 consists of a cutter body and 
replaceable inserts. The inserts act as the cutting edges to be mounted on the cutter 
body. For conventional face milling, the machining mostly occurs at the end of the 
mill. To minimize the overall machining time, some cutter bodies are of large 
diameter and can accommodate up to 10 or 12 inserts. Long edge mill in Fig. 34 is 
optimized to perform shoulder and profile milling operations. The design of this 
mill is in contrast with that of the face mill because most machining of this process 
occurs at the cutting edges on the periphery of the mill. Solid end mills, such as 



20 Machine Tools for Machining 


837 


Fig. 34 Long edge mill 
(Courtesy: Sandvik 
Coromant) 
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Fig. 35 End mills (Courtesy: 
Kennametal, Inc.) 



shown in Fig. 35a, are versatile because end mills can be used to perform almost all 
the previously mentioned face milling operations. However, due to design and cost 
considerations, such end mills do not generally come in large sizes. Square-end 
and ball-end mills are the two most commonly used end mills. Indexable end mills, 
shown in Fig. 35b, are cheaper alternatives for the solid end mills, which are 
specialized for end milling operations. 

Possible relative motions between the mill and workpiece in face milling oper- 
ation are shown in Fig. 36. It is customary to allow for some distance / in mm for the 
mill to approach the workpiece and some distance k in mm for the mill to leave the 
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Fig. 36 Approach and overtravel distances for two cases of face milling operations 


workpiece. In both possible motions, k = /. The first motion is when the mill is 
centered over the rectangular machined surface. Thus, as shown in Fig. 36a 

k = l = - 

2 

The second one is when the mill overhangs at one side of the workpiece. As shown 
in Fig. 36b 

k — l — \/w{D — w) 

For either motion, the machining time is given by 



2 Z + L 

fr 


Besides column-and-knee-type milling machines, there are bed-type and 
planer-type milling machines suitable for larger workpieces and higher production 
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rates. In bed-type machines, the worktable is directly mounted on the base or bed. 
Thus, there is no knee and the workpiece can only be moved in longitudinal 
direction. However, these machines have high stiffness, allowing higher feed 
rates and depths of cut; thus, higher material removal rates. Single-spindle 
bed-type machines are called the simplex mills, and the spindle can be either 
horizontal or vertical. Two-spindle bed-type machines are called the duplex mills, 
and the three- spindle ones are understandably called triplex mills. Multiple 
spindles allow simultaneous machining of two or three workpiece surfaces. 
Planer-type mills are similar to the bed-type ones, but for even larger workpiece 
and higher production rate. Their general appearance resembles that of the 
planing machine shown in Fig. 15, but for milling instead of planing. Accordingly, 
one or more spindles for mills are mounted instead of toolheads for single-point 
tools. 


Broaching Machine 

Broaching machine uses a broach, which is a multipoint cutting tool that is moved 
linearly with respect to the workpiece in the direction of the axis of the broach. The 
workpiece is held stationary throughout the machining process. Most broaching 
machines can be classified as either horizontal or vertical machine based on the tool 
axis, along which the direction of the primary motion of the tool is. A manually 
powered vertical broaching machine, or a broaching press, is shown in Fig. 37. For 
higher production rates and harder materials, hydraulically powered broaching 
machines are normally used. The mechanism of a broaching process is shown 
in Fig. 38. 

Broaching is most often utilized to machine keyways. A variety of features can 
be machined through either internal or external broaching. External broaching is 
performed on the outside surface of the workpiece. Internal broaching is to machine 
internal surface of a hole. For internal broaching, a starting hole must be present so 
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Fig. 39 Shapes machined from internal broaching 


that the broach can be inserted. Some possible features from internal broaching are 
shown in Fig. 39. Similar features can also be applied on the external surface. 

In broaching, the feed /is provided by staggering of the teeth on the broach, in 
which each tooth removes a thin layer of material or cut per tooth. As the machined 
surface is usually produced during one pass the tool, the machining time£ m in min is 
given by 



L 

V 


where L is the length of the broach in mm and V is the speed of the broach in 
mm/min. 
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Machine with Grinding Wheels 

Grinding is a material removal process that uses a wheel containing abrasive 
particles. Every grinding machine has a spindle which is rotated at high speed 
and on which the grinding wheel is mounted. The spindle is supported by bearings 
and mounted in a housing. This assembly is called the wheel head. The grinding 
wheel is usually disk shaped, although it also comes in a cup shape. Besides 
abrasive particles, the wheel also consists of bonding material that holds the 
particles in place and establishes the shape and structure of the wheel. An interest- 
ing property of the grinding wheel is that it is self- sharpening. As the wheel wears, 
the abrasive particles become dull. Then, these dull particles either fracture creating 
fresh cutting edges or are pulled out of the wheel surface exposing new grains. 

Grinding wheels are rarely used in rough cutting where material removal rate is 
the important factor. They are more commonly used to finish parts, which have 
already been machined by other operations. In this stage, the resulting surface finish 
is the criterion. Grinding wheels are also used in tool rooms to form the geometries 
on cutting tools. Four major grinding machines are discussed in this section, based 
on the orientation of the workpiece and the grinding wheel. 

Surface Grinding Machine 

Surface grinding is normally used to grind flat surfaces, using either the periphery 
or flat face of the grinding wheel. The workpiece is normally held horizontal, and 
the relative motion of the workpiece is achieved either by reciprocating the work- 
piece or by rotating it. Four possible combinations of wheel orientations and 
workpiece motions are shown in Fig. 40. 

A horizontal spindle grinding machine with reciprocating table, shown in 
Fig. 41, is the most common grinding machine. The horizontal spindle provides 
the primary motion (C motion). The reciprocation of the worktable, on which the 
workpiece is mounted (A’ motion), provides the feed motion. This motion is called 
the traverse motion. Further, feed motions can be applied either by moving the 
wheel vertically with respect to the column (7 motion), which is known as infeed 
motion, or by moving the worktable parallel to the machine spindle (Z’ motion), 
which is known as cross-feed motion. The cross-feed motion of this machine is 
intermittent, applied after each pass of the feed motion of the table. The definition 
of these motions for other surface grinding operations is shown in Fig. 40. 

There are two grinding operations based on the relative motion of the wheel with 
respect to the machined surface, as shown in Fig. 42. Traverse grinding operation is 
when the wheel is fed parallel to the surface being machined. Plunge grinding 
operation is when the wheel is fed perpendicular to the surface being machined. 
Plunge grinding is normally applied using a formed grinding wheel to impart the 
shape of the grind wheel on the workpiece. 

Rotary table obviates the need of traverse motion. The rotary table is more 
commonly coupled with vertical spindle because this configuration can achieve 
higher material removal rates owing to the relatively larger surface area between 
the wheel and the workpiece. An example of this machine is shown in Fig. 43. 
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Fig. 40 Different types of surface grinding operations 
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Fig. 41 Horizontal spindle surface grinding machine with reciprocating table (Courtesy: 
Prayosha Enterprise) 


Cylindrical Grinding Machine 

Cylindrical grinding is normally used to grind cylindrical surfaces. The surface can 
be either an external or an internal surface, as shown in Fig. 44. An external surface 
cylindrical grinding machine is shown in Fig. 45. The cylindrical workpiece is 
supported and rotated between the headstock and tailstock. This assembly is 
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Fig. 42 Horizontal spindle 
grinding operations: 

(a) traverse grinding and 

(b) plunge grinding 


a 




wheel 


Traverse motion 


b 


Gri nding 
vvh ee I 


► 


I n feed 
motion 


▼ 


Machined 
s li r face 





Rotational spindle 




si] r fa cc 



miece 


Fig. 43 Vertical spindle 
grinding machine with rotary 
table (Courtesy: DCM Tech, 
Inc.) 
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mounted on a worktable that is reciprocated horizontally (Z’ motion). This table 
provides the traverse motion. The headstock provides low rotational speed 
(i C motion) to the workpiece. The axis of the wheel head is horizontal, parallel to 
the axis of the workpiece. The wheel head provides high rotational speed to the 
grinding wheel (C motion). Motion can be applied to the wheel head in the direction 
normal to the axis of the workpiece ( X motion), which is the infeed motion. 
For this machine, traverse grinding operation moves the wheel head perpendicular 
to the axis of the workpiece, resulting in workpiece with smaller radius or thickness 
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Cylindrical grinding operations: (a) external surface and (b) internal surface 


as shown in Fig. 44a. In contrast, plunge grinding operation forms a groove on 
the external surface of the workpiece as the wheel is moved only in the infeed 
direction without any traverse motion. 

An internal surface cylindrical grinding machine is shown in Fig. 46. It is designed 
to produce an internal cylindrical surface. The wheel head supports a horizontal 
spindle, on which a relatively small cylindrical grinding wheel is mounted. To 
accommodate the small wheel diameter, the spindle must rotate at very high speed 
(C motion). Reciprocated motion can be provided to the wheel head parallel to the 
spindle axis (Z motion), which acts as the traverse motion. Horizontal motion can also 
be applied to the wheel head normal to the spindle axis ( X motion), which is the infeed 
motion. The workpiece is mounted in a chuck and rotated by the head stock at low 
speed (C' motion). For this machine, traverse grinding operation moves the wheel 
head perpendicular to the axis of the workpiece, resulting in workpiece with smaller 
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Fig. 45 External cylindrical grinding machine (Courtesy: Knuth Machine Tools USA, Inc.) 
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Fig. 46 Internal cylindrical grinding machine (Courtesy: Paragon Machinery Co., Ltd.) 

thickness (i.e., longer internal radius). Plunge grinding operation, which has no 
traverse motion, forms an internal groove in the workpiece. 

Centerless Grinding Machine 

This machine is an alternative for grinding external and internal cylindrical sur- 
faces. The main difference is that the workpiece is not held between centers (i.e., 
this machine has no headstoek nor tailstock). This yields in reduced work handling 
time, which promotes this machine for high production rates. An external centerless 
grinding machine is shown in Fig. 47. Its setup, from end and side views, is depicted 
in Fig. 48. Besides a grinding wheel, it has a regulating roll, which rotates at much 
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Fig. 47 External centerless grinding machine (Courtesy: Shenzhen Henfux Technology, Ltd.) 
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lower speed. This roll can be slightly inclined to certain angle 6 to control the 
through feed of the workpiece, which rests on a rest blade. The through feed /in 
mm/min can be estimated as (Kalpakjian and Schmid 2010) 

/ = nDyNy sin 6 

where D r and N r are respectively the diameter in mm and rotational speed in 
rev/min of the regulating roll. 

The setup of a typical internal centerless grinding machine is shown in Fig. 49. 
Instead of a rest blade, a holding roll and a support roll are used to maintain the 
position of the workpiece. The position of the holding roll can be adjusted to 
accommodate the size of the workpiece. The regulating roll is tilted at a small 
inclination angle to control the feed of the workpiece. For this machine, it is not 
possible to feed the workpiece through because one end is occupied by the wheel 
head. 

Creep Feed Grinding Machine 

Creep feed grinding machine has the same exact configuration as the ones in 
Figs. 41 and 45. Thus, it can be applied in both surface and external cylindrical 
grindings. Creep feed grinding allows machining at very high depths of cut, from 
1,000 to 10,000 times greater than in conventional surface grinding. However, to 
accommodate such depths, the grinding is performed at very low traverse speeds, 
reduced by about the same proportion as the depths of cut; hence, it is referred as 
“creep feed” grinding. 


Machining Centers 

The machine tools in the preceding discussions are designed to perform limited 
(often only one) types of operation, either manually or with certain level of 
automation. However, manufacturing products or parts usually require different 
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Fig. 50 Tool changer (Courtesy: Quick Jet Machine Co., Ltd.) 


consecutive machining operations, such as milling, drilling, or threading, on vari- 
ous surfaces. None of the machine tools described so far is capable to individually 
produce such parts. In a highly automated manufacturing floor, those parts are 
achieved by transferring them through a transfer line, in which a number of machine 
tools are arranged in sequence. The parts are transferred from one machine tool to 
another until all required operations are completed. This methodology is commonly 
used in high volume productions. 

For some types of product and in certain situations, such as for the type of 
products that change rapidly, the prescribed transfer lines with dedicated machine 
tools are not feasible or economical. To tackle this problem, machining centers 
were introduced in the late 1950s. A machining center is a highly automated 
machine tool capable of performing multiple machining operations on different 
surfaces and along almost all directions on a workpiece. The high level automation 
is generally achieved by computer numerical control (CNC). 

Several key features of the machining centers are automatic tool and pallet 
changing and positioning. Each machining center is equipped with a tool storage, 
which generally can take more than 15 cutting tools. Each tool is mounted on its 
own tool holder and equipped with certain identification (e.g., bar code, tag code, 
and memory chip). The cutting tool must be changed so that within this center either 
a tool with different dimension can be used or one machining operation can be 
performed after another. Tool exchange and attachment to the spindle are done 
automatically by the tool exchange arm, as shown in Fig. 50. Further, some 
machining centers are equipped with more than one pallet, on which the workpiece 
is placed in a machining center. Multiple pallets with automatic changing allow the 
machinist to unload the machined part and load the next part while the current part 
is being machined. In both tool and pallet changing, the cutting tool is automatically 
positioned with respect to the workpiece. Machining centers are generally equipped 
with touch probes, which can be loaded into the spindle, to determine the reference 
surfaces of the workpiece or to inspect the machined surfaces online. Some 
machining centers are also equipped with sensors for monitoring purposes, such 
as to indicate if the tools are worn or to monitor force and vibration experienced by 
the spindle. 
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Fig. 51 Turning center 
(Courtesy: DMG Mori Seiki 
Co., Ltd.) 



Machining centers can be classified as vertical, horizontal, or universal machin- 
ing center, based on the spindle orientation. Vertical and horizontal machining 
centers typically utilize rotating multipoint cutting tools, such as drilling and 
milling. Vertical machining centers are suitable to machine flat top surfaces and 
deep cavities on a workpiece. Horizontal machining centers are suitable to machine 
a number of surfaces on the side of the workpiece. Another type of horizontal 
machining center is turning center, which is a numerically controlled lathe with 
multiple turrets (i.e., tool storage) and multiple horizontal spindle heads as shown in 
Fig. 51. This machining center is specialized in performing various turning -related 
operations, as illustrated in Fig. 5. Universal machining centers are equipped with 
both vertical and horizontal spindles, which are capable of machining all surfaces of 
the workpiece as the spindle can be swiveled to any angles between horizontal and 
vertical, as shown in Fig. 52. 


Machine Tool Structures 

Machine tools consist of stationary and moving bodies, as well as the drives. The 
stationary bodies include the machine base or bed, columns, and housings. These 
bodies are to support, cover, and protect the moving bodies, such as the worktables, 
spindles, gears, and bearings. In general, the dimensions of machine tool structures are 
overestimated to provide high rigidity, thermal stability, and damping. These proper- 
ties are necessary to minimize static and dynamic deformations during machining. 
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Fig. 52 Universal 
Machining Center (Courtesy: 
Gebr' uder Heller 
Maschinenfabrik, GmbH) 



Machine tool drives provide motion to the moving bodies. The drives can be 
categorized into two main groups, which are the spindle and feed drives. This 
section describes the building components of each drive and the numerical control 
system that regulates the motions of each drive. 


Spindle Drives 

As discussed in the previous section, the spindle drives are to rotate either the cutting 
tools, as in the case of drilling, milling, and grinding, or the workpiece, as in the case of 
turning. The machine tool spindle drive provides the relative primary motion between 
the cutting tool and the workpiece, which is necessary to perform the material removal 
operation. The spindle drive must provide sufficient angular speed, torque, and power 
to the spindle shaft, which is held by roller or magnetic bearings in the spindle housing. 
Therefore, the characteristics of the spindle have significant impacts on the machine 
tool performance and quality of the machined parts. 

Industrial spindle drives commonly come in three designs, as shown in Fig. 53. 
Active cooling is often required, generally implemented through air or water 
cooling. Conventional spindles are popular in machine tools due to their modular 
integration to machine tools. It consists of a motor externally driving the spindle 
shaft through either a V-belt or a coupling. There may be a single-step gear reducer 
and a clutch between the electrical motor and spindle shaft. This design is normally 
applied for low to medium spindle speed up to 15,000 rev/min. 

The developments in rolling element materials, bearing design, lubrications, elec- 
trical drive, and power electronics have allowed the construction of compact direct 
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Fig. 53 Spindle drive designs (Courtesy: Siemens) 
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Fig. 54 Internal components of motor spindle (Courtesy: GMN Paul M "uller Industrie GmbH & 
Co. KG) 


drive spindles, which do not require mechanical transmission elements. These spin- 
dles are also called as motor spindles, which consists of elements shown in Fig. 54. 
The reduced inertia and friction, as well as rigidity and lower vibration of this compact 
design, encourage its implementations for high-speed machining. Recent trends in 
design and development of spindle drives are reviewed in (Abele et al. 2010). 

Industrial spindles are equipped with sensors for condition monitoring. The key 
measurements include temperature of the motor stator, temperature of the bearings 
temperatures, rotational speed, mechanical torque, motor currents, and electrical power. 


Feed Drives 

The feed drives move the machine structures, on which the workpiece or cutting 
tool is mounted to a desired location. The structures are driven along a rail or a 
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Fig. 55 Feed drive designs: 
(a) linear motor and rotary 
motor with (b) ball screw and 
(c) rack pinion 
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guide. Performance of the drives and guides in terms of speed and positioning 
accuracy plays significant role in material removal rate and the product quality 
obtained from the machine tool. 

Feed drives are powered by either linear motors or rotary motors with ball screw 
or rack pinion, as shown in Fig. 55. The ball screw mechanism can be connected to 
the rotary motor directly or through gear reduction, which is necessary to amplify 
the torque for large heavy-duty machine tools. Conventional machine tools have 
multiple gear reduction steps to obtain the desired feed rate. In term of control, the 
reference feed rate specified in the NC program is combined with the acceleration 
and jerk limits of the feed drives. A real-time trajectory generation algorithm then 
sends discrete position commands to the feed drive. The corresponding digital 
velocity commands are then converted into electrical signals, which are fed to the 
amplifier and motor of the drive. 

The guides or rails are to direct the motion of the moving parts in machine tools. 
The guides must be of a high degree of accuracy as it directly affects the machining 
quality (Dhupia et al. 2007, 2008a). Further, they should have good toughness, 
damping, load capacity, and stiffness to withstand the loads, impacts, and vibrations 
during the machining process. Finally, as moving components, they need to possess 
wear resistance and low friction characteristics to avoid gripping, stick-slip 
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Fig. 56 Schematic of components of (a) open-loop and (b) closed-loop numerical controls 

phenomena, and surface deterioration. There are various types of guides, some of 
which are friction guides (or slideways), rolling guides, hydrostatic guides, and 
magnetic levitation guides. Different technologies of guides and feed drive assem- 
bly designs are reviewed in detail in (Altintas et al. 2011). 


Numerical Controls 

Control in machine tool is achieved through numerical control, which utilizes 
coded instructions in the form of alphanumeric (numbers and letter) commands to 
regulate the motion of machine components. The control system interprets these 
commands and converts them to the appropriate control signals. There have been 
two types of numerical controls, namely, direct numerical control (DNC) and 
computer numerical control (CNC). In DNC, a central computer is used to directly 
control all machine tools in a manufacturing floor in a step-by-step fashion. The 
main drawback is that all machines may stop if the main computer suffers from 
faults. The development of CNC stems back from the availability of low-cost 
programmable microcontrollers. In CNC, a microprocessor or microcomputer is 
an integral part of the control. This allows each machine tool to have an onboard 
computer, through which the operator can independently prepare, store, and exe- 
cute programs for the machine. CNC is the most widely used NC today because of 
higher flexibility of the programmable controller and greater accuracy resulting 
from a higher sampling rate and computational speed of dedicated controller. 
Distributed numerical control is now used in complex modern manufacturing 
system, in which a central computer is used as to control and coordinate a number 
of individual CNC machine tools. 

Motion control in NC machine can be achieved through either open-loop or 
closed-loop system, as shown in Fig. 56. In open-loop system, the controller 
delivers the signals to the drives, but the motion and final position of the machine 
component are not checked for accuracy. In closed-loop system, the machine is 
equipped with variety of sensors, transducers, and counters to give feedback 
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Fig. 57 Movement of 
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regarding the status of the machine component to the controller. Digital-to-analog 
converter (DAC) is essential to interface the controller with the drives. 

There are two basic types of NC system, which are point-to-point and continuous 
path, as shown in Fig. 57. Recent trends in design and development of NC for 
machine tools are reviewed in (Neugebauer et al. 2007; Dhupia and Ulsoy 2011). 

In point-to-point, or positioning, the component (spindle head or worktable) is 
moved to a programmed location without considering the path from one location to 
another. Thus, the machine accelerates initially to maximum velocity to minimize 
the machining time and then decelerates to reach the specified position. This system 
is appropriate in operations in which positioning and machining take place sequen- 
tially, such as drilling. 

In continuous path, or contouring, the moving component is controlled to follow 
a prescribed path. Thus, this control requires accurate position control and synchro- 
nization of velocities. This system is widely used in milling and turning operations. 
In building the path, the controller can compensate for the cutting tool geometry 
(i.e., the tool shape, size, and wear). Also, the cutting tool location within the path, 
or the interpolation, occurs incrementally. This interpolation can be linear, higher- 
order polynomial, or circular. Linear interpolation moves the machine component 
along a straight line. Theoretically, linear interpolation can produce any path profile 
by reducing the increment between adjacent points. However, this interpolation 
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may require a large amount of data point, which is computationally expensive. 
Higher-order polynomial interpolation estimates the path using curves of higher 
polynomial equations (i.e., parabolic, cubic, or higher polynomials). Finally, cir- 
cular interpolation moves the component along a circular arc of different radius. 


Nontraditional Machine Tools 


Some alternative machine tool designs have emerged in recent decades, each of 
which comes up with certain strength as compared to the dedicated machine tools or 
CNC machining centers described earlier. Research and development on these 
machine tools have been active to promote industrial acceptance by tackling the 
limitations, improving the reliability, and reducing the costs. This section aims to 
highlight a few important machine tool technologies, which have established their 
presence and have been envisioned to have potentially high penetration into the 
manufacturing floor. 


Industrial Robots 

An industrial robot is an automatically controlled, reprogrammable, 
multifunctional, dextrous machine that is often designed following certain human- 
like features. The most apparent feature is the robotic arm, which is called the 
manipulator. Combined with the use of intelligent control, the manipulator is able 
to perform various tasks in manufacturing floor, such as picking up, moving, and 
placing down materials or tools, spray-painting, spot welding, and certainly 
machining, which is the interest of this chapter. The basic components of an 
industrial robot are manipulators, drives, end effectors, and control system. 

The manipulator is the mechanical unit that provides the trajectory motion so 
that the end of the manipulator can reach a point in space with certain orientation. It 
consists of joints and linkages. A joint allows relative motion between the input and 
output links. The joints can be classified as linear or rotating. The coordinated 
motion of these joints allows an industrial robot to move, position, and orient an 
object. For a general industrial robot, an independent actuator provides the motion 
of each degree of freedom (DOF) of each joint. The drives can be electric, 
pneumatic, or hydraulic. 

There are five basic manipulator configurations, as shown in Fig. 58. They are 
(a) Cartesian/rectilinear/gantry robot, (b) cylindrical robot, (c) polar/spherical 
robot, (d) articulated/revolute/jointed robot, and (e) SCARA (Selective Compliant 
Assembly Robot Arm). 

The end of the manipulator is attached with an end effector to perform the 
required task on an object. If a robot manipulator is designed as having a function of 
the human arm, the end effector is designed having one of the hand. The location of 
the end effector for each manipulator configuration in Fig. 58 is marked by the 
black surface. 
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For machining, the manipulator is to provide the relative motion between the 
cutting tool and workpiece. The end effector can be equipped with either a single- 
point or multipoint cutting tool. If required, a spindle can be attached on the end 
effector, as shown in Fig. 59a. Another option is to mount the workpiece on the end 
effector, while the cutting tool (on a spindle) is mounted on a worktable, as shown 

in Fig. 59b. 

The control system receives input, either from sensors or programmed codes, 
and delivers commands to the actuators to regulate the movement of the robot. 
Industrial robots can be classified based on the commanded input. 

(a) Sequence robot is the traditional pick-and-place robot, which can be 
programmed for a specific sequence of operations. The input only contains a 
cycle of motion from one location to another that must be repeated. 

(b) Playback robot repeats the motion performed by the user. The user can teach the 
motion to the robot by either directly handling its end effector of the robot or 
using other inputs (e.g., joystick). The control system records the path and 
corresponded motion of each joint and then repeats them continuously. 

(c) Numerically controlled robot receives user-defined inputs and operates much 
like a numerically controlled machine tool. 

(d) Intelligent, or sensory, robot utilizes various sensors, such as visual and tactile 
sensors, to observe and evaluate its environment based on perception and 
pattern recognition. The robot makes decision on the next move in real time 
based on these inputs. 

An important consideration of an industrial robot is the size of its workspace. 
Workspace is defined as the space or volume within which a manipulator can 
position its end effector. In general, the workspace is important to determine 
whether a robot is suitable for certain application. The workspace of different 
manipulators is shown in Fig. 58. 

The major drawback of conventions machine tools is their limited workspace. 
For example, a typical three-axis milling machine resembles the Cartesian robot 
in Fig. 58a, which has a workspace in the shape of a rectangular prism. This 
imposes limits on the shape and dimension of the machined workpiece. Other 
DOFs must be added to machine tool to enlarge its workspace. On the other hand, 
industrial robots for machining generally take a form of articulated robot of 
Fig. 58d, which also has three DOFs. Not only does this robot have larger 
workspace, but also, due to its extensive flexible kinematics (i.e., higher accessi- 
bility), it is often capable of machining parts with intricate details and complex 
shapes, which would require special fixtures and techniques to produce using 
conventional machine tools. 

Nonetheless, the biggest shortcoming of industrial robots to carry out machining 
operations is their structural flexibility. The links and joints of an industrial robot 
are much less rigid than those of the conventional machine tools. Therefore, the 
accuracy and repeatability of the movements of a robot are inferior to its machine 
tool counterpart. This flexibility also results in a low structural natural frequency of 
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Fig. 59 End effector for robotic machining: (a) spindle and tool holding (Courtesy: Staubli Inc.) 
and (b) workpiece holding (Courtesy: ABB Inc.) 


the industrial robot, which is easily excited during typical machining operations and 
creates large amplitude vibration due to resonance. This resonant vibration is 
unwanted because it produces poor surface quality, breaks the cutting tool, and 
damages the robot. 
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Parallel and Hybrid Kinematic Machine Tools 

As discussed previously, industrial robots utilize serial kinematic architecture, 
which is a series of links or body connected by joints from the base to the end 
effector. The main advantages of serial mechanism are its large workspace and 
simple kinematics. CNC machine tools are also built based on the same architec- 
tures, where every additional DOF is mounted on the previous one. The main 
difference is that CNC machine tools are built using massive structures and wide 
bases to ensure rigidity and thus, accuracy at the cutting tool tip. However, the 
limitations of this mechanism can no longer be concealed as the trend moves toward 
high-speed machining. It is because the heavy structures of the serial mechanism 
are still susceptible to bending loads and vibrations when they move at high speed. 

In a parallel kinematic architecture, all links (i.e., the struts) are connected to the 
fixed base and to the same moving platform, which acts as the end effector. The 
architecture provides higher stiffness and lower moving masses, which is an 
attractive design for demanding tasks such as high-speed machining. Most existing 
parallel kinematic machine tools (PKMTs) can be classified into two main catego- 
ries based on the joint between the links and ground (or base), as shown in Fig. 60. 
The one in Fig. 60a has multi-DOF joints and variable-length struts while the one in 
Fig. 60b has fixed length struts with gliding joints. 

PKMTs have been promoted for several advantages. They have greater load 
carrying capacities as the total load can be shared by a number of parallel links. 
Further, the links can be lightweight and very stiff to minimize inertia, increase 
structural rigidity and payload-to-weight ratio, and improve dynamic perfor- 
mances as higher bandwidth can be achieved. PKMTs are intrinsically more 
accurate because the errors reflected at the end effector are the average error of 
the joints instead of the cumulative error as in the case of their serial counterparts. 
The abundant use of multi-DOF (i.e., spherical and universal) joints in PKMTs 
ensures that the links experience only compressive or tensile loads, but no shear 
forces and no bending nor torsional moments. This reduces the deformation of the 
platform, even under high loads. Fully/purely PKMTs have as many degrees of 
freedom as the number of struts. An example of which is shown in Fig. 61. The 
spindle is mounted on the platform, which are moved by three parallel struts. This 
configuration is widely used in high-speed drilling or tapping. 

Parallel kinematics is also referred as the closed-loop kinematics, while its 
serial counterpart is called the open-loop kinematics. Consequently, the kinematic 
relationships of PKMTs are considerably more complex. The presence of passive 
(i.e., no actuator) and multi-DOF (or universal) joints further complicates the 
kinematic analysis. Thus, the control and calibration of PKMTs suffer from a 
heavy computational burden. PKMTs are also prone to either serial or parallel 
singularities. Singularities are scenarios in which achieving a platform trajectory 
requires extreme values of either joint velocities or joint forces. Serial singularities 
are machine configurations where a finite platform velocity requires infinite 
joint velocities. In these scenarios, the velocity amplification is null while the 
force amplification is infinite. Parallel singularities are configurations where 
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the platform becomes uncontrollable. In these scenarios, the velocity amplification 
is very large, while the force amplification is close to zero. On top of these 
drawbacks, the PKMTs have irregular workspace and limited range of motion, 
particularly the rotational one. This limits the part size and shape that can be 
machined. 

To solve the issues pertaining to the fully PKMTs, hybrid kinematic or serial- 
parallel kinematic machine tools are introduced. They combine the fully parallel 
mechanisms with additional serial mechanism. Figure 62 shows a widely used 
hybrid configuration for 5-axis milling tools. In this configuration, a two-DOF 
serial mechanism is mounted on a parallel mechanism resembling the one shown 
in Fig. 61. 


Reconfigurable Machine Tools 

Today’s competitive markets are marked by a trend toward short product life cycle 
and rapid change in the customer’s demand for product type and quantity. This 
requires a rapid response from the manufacturing system. The earliest 
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Fig. 62 Hybrid kinematic 
machine tool (Courtesy: PKM 
Tricept S. L) 
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manufacturing systems, which are known as the Dedicated Manufacturing Systems 
(DMS), are no longer suitable to meet this trend. This is because such systems are 
built to optimize the manufacturing process and tools around a specific part or 
product. Often, such systems produce a single part by operation of several machine 
tools at the same time. Consequently, they are economical for high volume and high 
production rates, but cannot be converted to handle new products. On the other 
hand, the flexible manufacturing systems (FMS) use multipurpose CNC machine 
tools. These machine tools are able to accommodate a large variety of parts, which 
may have not been specified at the design stage. Thus, the systems may produce a 
wide range of different products at changeable volume. However, FMS are expen- 
sive and handle relatively smaller capacity. Industrial experiences (Mehrabi 
et al. 2002) have also shown that the flexibility of the machines is often 
underutilized over their life cycle. FMS generally lack the efficiency and robustness 
compared to DMS and wasted resources make FMS uneconomical for many 
production situations. In summary, DMS have high productivity but low flexibility, 
while FMS have the opposite characteristics, which are low productivity and high 
flexibility. 

The reconfigurable manufacturing system (RMS) (Koren et al. 1999) is a new 
concept that bridges the gap between the DMS and FMS. It is defined as a system 
“designed at the outset for rapid change in structure, as well as in hardware and 
software components, in order to quickly adjust production capacity and function- 
ality within a part family in response to sudden changes in market or regulatory 
requirements.” In other words, this system is designed to provide just enough 
flexibility (i.e., functionality) to produce an entire part family and to grow and 
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Fig. 63 Cylinder head family: (a) V6 and (b) V8 


change within the scope of its lifetime to respond to market changes relatively 
quickly. The aim of RMS is to achieve “exactly the capacity and functionality 
needed, exactly when needed.” 

While a 3-axis CNC machine can be the basic module for reconfiguration for 
many manufacturing systems, the Reconfigurable Machine Tool (RMT) can bring 
the principle of reconfiguration to the component level for a manufacturing system 
(Landers et al. 2001). Consider an example of choosing a machine tool for the 
finishing milling process of automotive cylinder head with inclined surfaces. 
Various parameters in this process, including tooling tolerances and process param- 
eters, were the same for all inclined surfaces. However, the surfaces to be milled are 
at different angles with respect to the horizontal: 30° (for V6 cylinder head) and 45° 
(for V8 cylinder head), as shown in Fig. 63. A dedicated machine tool solution 
would need a customized station for each angle. Thus, considering the product 
varieties and changing product demands, one will most likely lean toward the CNC 
machines. However, a commercial CNC machine tool solution would require a 
5-axis CNC machine tool with orthogonal kinematics. As an alternative, one now 
can consider a reconfigurable solution where the machine tool is capable of 3 -axis 
kinematics with one additional passive DOF available for reconfiguration 
depending on the surface inclination. 

Figure 64 shows several possible RMT configurations based on a three-DOF 
kinematics. The first row shows the pivot-type RMTs in which the axis can be 
rotated along a certain axis. The second row shows the arch-type ones in which the 
spindle can be oriented along an arch. These various machine concepts were 
evaluated based on lowest natural frequency and workspace. The pivot-type 
RMTs had the advantage of having lower moving mass and, therefore, were stiffer. 
The lowest natural frequencies obtained for pivot-type RMTs were higher than 
those of the different arch-type RMTs. However, the workspace of pivot- type 
RMTs was restricted at large reconfiguration angles. 

The first full-scale prototype of side-mounted arch-type RMT (Dhupia 
et al. 2006, 2008b), based on Fig. 64f, is shown in Fig. 65. It was built in 1999 in 
the Engineering Research Center for Reconfigurable Manufacturing Systems 
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Fig. 64 Various reconfigurable machine tool configurations 


Fig. 65 An arch-type 
reconfigurable machine tool 



(ERC/RMS) at the University of Michigan to machine a part family of products 
with inclined surfaces, which exist in some automotive engine blocks or cylinder 
heads. The machine tool has three controlled DOFs along its column, along the 
spindle axis and along the table axis. One manually reconfigurable passive motion 
is the angular reconfiguration motion of the spindle, which allows reconfiguring the 
spindle’s angular position into five predesigned locations, at —15°, 0°, 15°, 30°, and 
45°. Thus, this machine can be used on different inclined surfaces. 
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The arch-type RMT is a non-orthogonal machine that may have different 
characteristics at each configuration of the machine. The machine is designed to 
drill and mill on an inclined surface in such a way that the tool is perpendicular to 
the surface. 


Summary 

There are two main principle of all machining process. The first principle is to 
provide suitable relative motions between the cutting tool and the workpiece to 
produce the characteristic geometry. The second one is to impart the shape of the 
cutting tool to the workpiece in order to create part geometry. This chapter has 
discussed the commercially available machine tools and machining processes based 
on these two principles. The majority of machine tools can be differentiated based 
on the employed cutting tools. In specific, this chapter has highlighted the machine 
drives and control, which are responsible to provide and regulate the necessary 
motions of the machine tool components. The development of machine tools has 
been moving toward increasing productivity and versatility. This chapter has also 
discussed a few promising nontraditional machine tools, namely, industrial robots, 
parallel and hybrid kinematic machine tools, and reconfigurable machine tools. 
Each of these designs has entered today’s manufacturing floor bringing certain 
superiority as compared to the conventional machine tools. 

Machining and machine tool will always be key components of a manufacturing 
system. Thus, research and development in this area will remain relevant and 
impactful to improve the efficiency, improve the reliability, and reduce the cost 
of machine tools and machining processes. 
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Abstract 

Machining dynamics has been a critical academic and industrial discipline to 
determine a chatter-free cutting condition as well as to provide an insight into the 
vibration-resistant design of a machining system. This article presents the 
introduction, the modeling of machining dynamics in frequency and time 
domain, the simulation examples, and the application of machining dynamics 
into industries. The advantage of the frequency -domain solution is to rapidly 
generate stability lobes over the wide range of spindle speed and cutting depths 
and avoid computationally costly numerical solutions at the expense of ignoring 
the nonlinearities in comparison to the time-domain solutions. The time-domain 
model allows prediction of cutting forces, torque, and vibrations during machin- 
ing, which is essential in planning the operations without overloading the tool, 
machine, and workpiece for a given set of cutting conditions. The time-domain 
simulation estimates the physics of the processes and allows the analysis of time- 
varying parameters, by incorporating the nonlinearities caused by material 
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behavior, and tool geometry variations along the cutting edge. Both models can 
provide a map of chatter-free cutting conditions and a fundamental guide for 
process planners. Engineers can use the simulations to perform the analysis of 
various tool geometry, cutter-part engagement, and cutting conditions and avoid 
chatter vibrations. Finally, this article shows the actual application of the models 
into machining industries. 


Introduction 

For the past few decades, CAM software packages have been exclusively used to 
generate tool path. However, they do not have any further function to estimate the 
cutting performances such as cutting force, machined surface, and cutting stability. 
Traditionally, the process planners frequently have to resort to empirical methods or 
trial-and-error methods to find acceptable cutting conditions in a cutting process. 
Empirical methods are typically developed based on experimental data. Since an 
empirical model is only valid in the range over which data have been collected, it is 
difficult to extrapolate the results from one set of machining conditions to a new set 
of conditions. The cutting conditions, which are determined by empirical or trial- 
and-error methods, may often result in under-/overcuts or cutter breakage due to 
excessive cutting force. Thus, most operators naturally become conservative in the 
selection of process parameters to assure part quality and to avoid such undesirable 
results. However, this comes at the cost of lowering the productivity. To overcome 
this problem, one of the major current research trends in this area is the develop- 
ment of a system that is capable of simulating the actual machining process, rather 
than simple geometric verification of tool paths. 

Machining chatter often becomes a big hindrance to high productivity and machin- 
ing quality. So engineers cannot figure out the way to avoid the severe vibration caused 
by the combination of cutting conditions and machines tools. It is a well-known 
problem in CNC machining companies, and engineers still depend on trial-and-error 
approach to figure out optimal cutting conditions and proper tooling combinations. 

Machining is a variety of material removal processes in which a cutting tool 
removes unwanted material from a workpiece to produce the desired shape, but the 
surface quality is often deteriorated by the dynamic response of a machining system. 
For example, Fig. 1 shows the comparison in machined surface quality generated by 
end milling operation. Figure lb illustrates the fine surface quality and roughness as 
compared to the rough surface quality affected by a tool vibration in Fig. lc. 

This article aims to introduce the methodology for machining chatter prediction 
prior to real machining so that engineers can avoid the vibration and find optimal 
cutting condition which results in higher surface quality and productivity. There has 
been research works done on the mechanics and dynamics of milling processes. 
Tobias and Tlusty proposed first chatter stability laws in frequency domain (Tobias 
and Fiswick 1958; Tlusty and Polacek 1957). Sridhar devised the time-domain 
solution with two coupled, delayed differential equations with time-varying coeffi- 
cients (Sridhar et al. 1968). Minis and Yanushevsky developed the first analytical 
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Fig. 1 Machined surface quality generated by end milling operation from Mitsubishi Materials 
(http://www.mitsubishicarbide.com/mmc/en/product/catalog/catalog.html). (a) End milling oper- 
ation (b) Fine surface quality (c) Rough surface quality due to a tool vibration 


solution of milling stability using Floquet’s theory by advancing from Sridhar’s 
formulation (1993). Altintas and Budak developed a general- and closed- form solu- 
tion of the milling stability in frequency domain (Budak and Altintas 1998a, b). 
Recent articles mention that added stability lobes are needed to accurately simulate 
low radial immersion cutting during high-speed machining (Insberger and Stepan 

2000; Davies et al. 2000; Bayly et al. 2003; Merdol and Altintas 2004). Olgac 
et al. considered both single and multiple time delays and validated their stability 
law on uniform and variable pitch cutter results (Olgac and Sipahi 2005). Ko and 
Altintas proposed mechanistic and dynamic models in time and frequency domain for 
plunge milling processes (2007a, b; Altintas and Ko 2006). The developed stability 
model can predict a torsional-axial vibration as well as lateral vibrations, while the 
previous plane milling model basically focuses only on lateral vibrations. 
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Section on “Modeling of Machining Dynamics” explains the critical procedures 
for frequency- and time-domain modeling of machining dynamics by illustrating 
the examples of milling and plunge milling models. Section on “Application of 
Machining Dynamics” shows the actual application of the modeling to manufactur- 
ing processes in industries. Section on “Summary” finally summarizes this article 
which is useful to understand the modeling and the application of machining 
dynamics for actual application to industries. 


Modeling of Machining Dynamics 

As shown in Fig. 2, while a cutter rotates and generates a chip, the dynamic 
response of a cutter leaves the vibration marks on the machined surface. While a 
chip is moved away, the phase difference between previous and present vibration 
waves may cause the oscillation of dynamic chip thickness. Modeling of machining 
dynamics should incorporate the formulation of the vibration phase difference 
which is reflected on dynamic chip thickness. 

The modeling of machining dynamics is illustrated in Fig. 3. Firstly, cutting 
force is formulated considering cutting force coefficients (/:), dynamic uncut chip 
thickness (/z), and cutting depth (d). Cutting force coefficients ( k ) can be accurately 



Milling operation and cutting mechanics 



Influence of the dynamic response of a cutter on the machined surface 


Fig. 2 Illustration of machining dynamics, (a) Milling operation and cutting mechanics, 
(b) Influence of the dynamic response of a cutter on the machined surface 
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Fig. 3 Modeling of machining dynamics ( k cutting force coefficients, h dynamic uncut chip 
thickness, d cutting depth) 


calculated by considering three components: the workpiece material, uncut chip 
thickness, and cutter geometry. The relationship between the uncut chip thickness 
and cutting force coefficients can be derived using the cutting force model, the 
measured cutting forces for a given workpiece material, and cutter geometry. The 
resultant cutting force in metal cutting is distributed over the area where the cutter 
contacts the chip and workpiece. Since no cutting tool is perfectly sharp, the cutting 
edge can be represented by a cylindrical surface between the tool flank and the rake 
face. Neither the force acting on the tool edge nor the force that may act on the tool 
flank contributes to removal of the chip, and these forces will be referred to collec- 
tively as the plowing force. The existence of the plowing force results in certain 
important effects and can explain the so-called size effect. This term refers to the 
increase in specific cutting energy at low values of undeformed chip thickness. Thus, 
to accurately predict the cutting forces, the cutting force coefficients with size effect 
should be considered and appropriately modeled. Since the methodology for identi- 
fying the cutting force coefficients has been introduced by numerous articles (Ko and 
Cho 2005; Ko et al. 2003), this article omits the explanation of how to determine 
cutting force coefficients. Section on “Cutting Force Formulation” summarizes the 
formulation of the cutting force model with the identified cutting force coefficients. 

Dynamic uncut chip thickness Qi) can be simulated by tracking the cutting edge 
positions at the present and previous vibrations, which reflects the vibration phase 
difference. The modeling of dynamic uncut chip thickness is a key to the success of a 
chatter modeling in machining processes. The dynamic uncut chip thickness models 
for time- and frequency-domain solutions are explained in section on “Dynamic Chip 
Thickness Model.” Finally, the machining stability and vibrations can be estimated 
using the identified transfer function and the predicted cutting forces. Section on 
“Stability Lobe Generation” illustrates the stability lobe generations, and section on 
“Examples of Simulation” shows the examples of dynamic simulations. 
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Cutting Force Formulation 

The geometry and parameters of a milling cutter are shown in Fig. 4. D 1? D 2 are the 
shank and effective cutting diameters of the cutter, respectively, ay, y / r , C/y, <fi p are 
the axial rake, cutting edge, axial relief angles of the tool, and cutter pitch angle, 
respectively. 6> r , A 6 r are the absolute and incremental rigid body rotation angle of 
the cutter, respectively, a is the radial depth of cut. A a is the length of edge element 
along the radial direction. 

The cutter cuts the metal at the cutting edges of the flutes as it proceeds into the 
metal in feed direction. The tangential cutting force (F t ) is in the direction of cutting 
speed and distributed along the cutting edge. The radial force (F r ) is in the radial 
direction of a cutter, and the axial cutting force (F a ) acts along the cutter axial 
direction (Z axis). In order to consider a general case, the cutting edge is divided 
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Fig. 4 The geometry and parameters of a milling cutter, (a) Flat end-milling variables, 
(b) Geometry and coordinates of a sample plunge mill 
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into finite number of small differential elements in the cutting edge curve. The chip 
load and corresponding differential loads for each edge element are evaluated and 
digitally integrated to predict the total forces in three directions. 

The angular position of the points along the cutting edge of the cutter is 
evaluated from Fig. 4. The angular position (</> 7 ) of the tooth j is described by 

<t>j = <Po + (j~ i )<t> P , 4> P = (i) 

where </> 0 is the rotation angle of the reference tooth measured from y axis and N is 
the number of teeth on the cutter. The area of chip cut by a differential element is 
h(<fij)ka where h(c/)j) is the instantaneous chip thickness removed by edge segment 
with a differential length of A a. 

The tangential (F t J) , radial (F a j), feed forces (FfJ), and torque (T c j) acting on the 
differential cutting edge element k of tooth j are given by 


dF tj = K tc h{4>j)Aa + K te ka, 
dFfj = Kf c h Aa + Kf e Aa, 




where the torque arm is r k = (k — 1/2) • A a + l for plunge milling and r k = cutting 
radius for plane milling (K tc , K ac , Kf c ) and (K te , K ae , Kf e ) are the material and tool 
geometry-dependent cutting and edge force coefficients, respectively. 

The tangential, radial, and feed forces acting on the cutting edge can be 
transformed into three orthogonal force components in Cartesian coordinates of 
the cutter axes as follows: 



The total instantaneous cutting forces and torque are evaluated by digitally 
integrating the contributions of all differential edge segments of the inserts which 
are in cut when the reference tooth is at angular position 


r 0 ) 

J FyM o) 
I f-M 

{ TcM 0 ) 


N-l M - 1 




j = 0 k = 0 



where N, M are the number of teeth on the cutter and the number of elements on 
each cutting edge, respectively. The reference immersion angle is incremented as 
the spindle rotates </> 0 = Fit where Q is the angular speed of the spindle. 
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Dynamic Chip Thickness Model 
Formulation for Time-Domain Solution 

The numerical solutions are based on the first principles of metal cutting laws, and 
they are used to simulate the correctness of the mathematical models describing the 
physics of the system. The rigid body motion and vibrations are modeled for the 
prediction of dynamic chip thickness variation along the cutting edge. The cutting 
forces acting on the tool and workpiece are evaluated from the chip thickness 
distribution. The size effect of cutting force coefficients, varying cutting edge 
geometry, and loss of contact between the tool and workpiece can be considered 
as nonlinearities and deviations from the process parameters. The degree of 
nonlinearity and the influence of geometric variations on the parameters of the 
delayed differential equations, which govern the stability of the process, are also 
measured through numerical simulations. The limits of process linearization for 
linear frequency or time-based chatter stability solutions are assessed through 
numerical modeling of the complete cutting process. Furthermore, the prediction 
of cutting forces acting on the bearings, spindle shaft, tool, and holder is important 
to prevent overloading of the machine elements. Hence, both process planner and 
spindle designer require the time-domain, numerical simulation of the process for 
improved design and productive machining. 

In time-domain solution, the cutter center position can be tracked using rigid 
body motion and vibrations as defined in Eq. (5): 


r x c {t) ) 

J Y c (t) I 

\m\ 

CUTTER CENTER 


< 


' R r • sin cf) 

R r • cos (j) 
N • c • Qf 

2 n 


> + 


£lt 


RIGID BODY 



VIBRATIONS 



where R r represents the radial eccentricity of the cutter center. The vibrations are 
estimated from the frequency response function (FRF) ([O]) of the structure 




It must be noted that the terms inside the FRF matrix represent the flexibilities in 
each direction, and some of them, especially the cross terms in translational 
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directions, may be zero depending on the cutter geometry and machine configura- 
tion. However, a general formulation is considered here since the proposed model is 
applicable to plunge milling and indexed boring cutters which may have various 
flexibility configurations depending on each application and cutter type. Each FRF 
term is represented in Faplace domain as 




E 

h = 1 



S 2 + 2 £h°>nhS + m nh 2 



where co nh , k h , and £ h are the natural frequency, modal stiffness, and damping ratio 
of mode number h, respectively. The vibrations (Eq. 6) are estimated by applying 
the predicted cutting force at each discrete time interval on the FRF (Eq. 7) by using 
the 4th-order Runge-Kutta integration. 

Depending on the cutter geometry, cutting insert type and distribution, and 
cutting conditions, the tool may experience lateral (x,y), axial (z), and torsional 
(0) vibrations. In plane milling process, lateral vibrations are dominant and axial and 
torsional vibration may be neglected. However, in other hole-making processes such 
as plunge milling, drilling, and multi-insert boring, the axial and torsional vibrations 
should be considered due to high torque generations. The general formulation of 
dynamic chip thickness, which contains both rigid body motion and structural 
vibrations, is formulated based on the kinematics of milling processes. Montgomery 
and Altintas (1991) developed exact kinematic model of dynamic milling by digitiz- 
ing the surface finish at discrete time intervals. Although their model accurately 
predicts the exact chip thickness with regenerative vibration effects in milling 
process, it is computationally costly and not robust when torsional vibrations are 
included. An alternative but still an accurate kinematic model of dynamic chip 
thickness can be formulated here by considering the stmctural vibrations in the lateral 
(x,y), axial (z), and torsional (0) directions at the cutter center as Ko and Altintas have 
lately proposed (2007b). The translational vibrations at the cutting edge locations are 
simply predicted by using the tool geometry. The torsional vibrations (6) are defined 
at the cutting edge but relative to the spindle axis passing through the cutter center. 

The coordinates of the points along the cutting edge are evaluated by using the 
cutter geometry information given in Fig. 5. As the cutter rotates and vibrates, the 
cut surface is digitized at discrete time intervals by tracking the positions of discrete 
cutting edge elements. The coordinates (X e , Y e , Z e ) of a point along the cutting edge 
of tooth j are expressed as 


X eJ (t) = X c (t) + r k ■ sin (fy), Y eJ = Y c (t ) + r k ■ cos (cpj), 

z e,j(t) =Z C + R aj + (r k - /) • tan y/ r 



where the angular immersion angle <pj = £lt + (j — \)<p p is updated with time t, and 
R aj represents axial runout of the cutting edge. 
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Fig. 5 Dynamic chip thickness ( h ) calculation due to previously and presently generated vibration 
marks (X c , Y c ), present cutter center position; (X cm , Y cm ) , previous cutter center position 
(a) Dynamic chip thickness (h) in plane milling, (b) Dynamic chip thickness ( h ) in plunge milling 
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The Z coordinate of the cutting edge of the previous tooth (j — 1) is given as follows: 

Ze,j— 1 ^cj— 1 H - Id ' tan Iff r R a j— ^ l d 

= \f (V,; — Xe,j-l ) 2 + (Y e j — Y e j-i ) 2 (9) 

For plunge milling processes, when the present (j) and previous (j — 1) cutting 
edge elements are at the same radial location (X e j-\ = X e j, Y e j = Y e j-i), dynamic 
chip thickness is evaluated by simply subtracting the z coordinates of the present 
and previous teeth, see Fig. 3. 


hki&ji 0 ) — Z e ,j - 1 Z e ,j ( 10 ) 

For plane milling processes, dynamic chip thickness is evaluated by subtracting 
the radial distance of the present and previous teeth from the present cutter center as 
follows. 


hk{4>j(t)) = y/(x e - X c ) 2 + (Y e - Y c f - yj ( X em - X c f + ( Y em - Y c f (11) 

If the resulting chip thickness is negative due to tool jumping out of the cut, it is 
forced to become zero to consider saturation nonlinearity in the process. Since the 
cutter center history is updated and saved in an array as a function of the cutter 
rotation angle (fy) or time (0, the dynamic chip loads, which include both rigid 
body motion and regenerative vibrations, are easily evaluated with improved 
computational efficiency and numerical robustness. The process simulation allows 
complex tool geometry, runout, time-varying directional factors, chip thickness, 
and various cutting force coefficients, which are important in analyzing the process 
physics and accurate prediction of cutting loads for a given application. By input- 
ting the dynamic chip thickness Eqs. (10) or (11) into dynamic forces Eq. (4), the 
dynamic forces and torques can be predicted. 

Formulation for Frequency-Domain Solution 

The advantage of the frequency-domain solution is to avoid computationally costly 
numerical solutions at the expense of ignoring the nonlinearities in comparison to 
numerical solutions. Since the frequency-domain stability analysis requires a 
closed-form relationship between the milling process and vibrations, some linear- 
ization of the process is inevitable while not sacrificing the prediction accuracy and 
fundamentals of governing physics beyond acceptable limits. 

The dynamic chip thickness removed by tooth ( j) can be expressed as hj(t) = c + 
h d (t), where c is the feed per tooth in z direction and represents the rigid body 
motion. Since the rigid body motion affects only static deflections and forced 
vibrations but not the stability, the feed rate can be dropped from the stability 
analysis. The dynamic, regenerative chip thickness h d {t) is influenced by the present 
and past vibrations and must be expressed as an analytical function. 
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For plunge milling processes, since the chip thickness is measured from the axial 
direction of the cutter, the dynamic chip load is expressed as 



where gj is a unit step function that determines whether the tooth is in or out of cut, i.e., 
gj = 1 when the insert is in cut and zero otherwise depending on the immersion 
conditions of the cutter. A r = r(t) — r(t — T), r — » x, y, z, 0 terms are the regenerative 
vibrations in four directions; T is the tooth period. The cutting coefficients are normal- 
ized with respect to the most dominant gain, the tangential cutting force coefficient^, 
i.e., K a — K a JK tc , Kf=KfJK tc . The edge forces are not dependent on the regenerative 
chip thickness; hence, they can be dropped from the stability expressions. 

The dynamic chip thickness (Eq. 12) is substituted into the cutting force expres- 
sions (Eq. 4), where the torque arm is assumed to be measured from the midpoint of 
the cutting edge of the insert ((D 2 + 2/)/4) since the actual depth of cut is an 
unknown variable in the stability. 

For plane milling processes, the total dynamic chip load can be evaluated by 
considering the influence of vibrations in x, y and z directions by neglecting 
torsional vibration: 


hd h x T~ hy T~ h z 

The dynamic part of the total dynamic chip load can be evaluated as 



A h x + A hy 4 - A h z 


= Ax sin cj) + Ay cos (ft + Az. 

tan y/ r 



where Ax = x(i) — x(t — T), Ay = y(t ) — y(t — T ), and Az = z(t ) — z(t — T ) are the 
regenerative vibrations and T is the tooth period. 


Stability Lobe Generation 


For plunge milling operation, the dynamic chip thickness (Eq. 12) is substituted into 
the cutting force expressions (Eq. 4), where the torque arm is assumed to be 
measured from the midpoint of the cutting edge of the insert (( D 2 + 2/)/4) since 
the actual depth of cut is an unknown variable in the stability: 



where R avg = (D 2 + 2/)/4 for plunge milling and R avg = cutter radius for milling 
process. 
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For plane milling operation, by substituting the dynamic part of the chip 
thickness and constant cutting force coefficients, the dynamic forces in Eq. 4 can 
be expressed as 



where a is the axial depth of cut and the instantaneous immersion of a tooth j is 
(fij (p + j (f) p and K r K rc /Kf C , K a K ac /Kf C . 

Equations 15 and 16 can be expressed in matrix form {E(7) } = aK tc [A(t )] { A(t)}, 
where [A(t)] is a periodic, time-varying directional factor matrix due to cutter 
rotation (4)j(t) = £lt + ( j — Terms in [A(t)] are dependent on the tool 

geometry and engagement of the mill with the part. The time variation of the 
directional factors requires either multifrequency solution as proposed by Budak 
and Altintas (1998a, b) or time-domain solution of delayed differential equations as 
presented in (Insberger and Stepan 2000; Davies et al. 2000; Bayly et al. 2003). 
However, Merdol and Altintas (2004) showed that when the periodic directional 
factors do not contain short-impulse-type waveforms and the spindle speed is not 
high enough to lead to flip bifurcation, the mean values of the directional factors 
lead to as accurate solution as the case when the time-varying factors are fully 
considered. Since [A(t)] is periodic at tooth-passing interval ( T ) or pitch-angle 
interval (fi p , the mean values can be evaluated as 


' T l Cfi ex 

[A(t)\dt = — 
o Qp J 4> st 

Even at high speeds and short engagements where the directional factors have 
high harmonics, the flip bifurcation creates small added stability pocket near the 
first lobe which corresponds to the highest spindle speed region. Such a situation is 
not applicable to roughing or semifinishing where the directional factors do not lead 
to flip bifurcation of the modes. Nevertheless, if any unforeseen application 
requires the inclusion of higher-order Fourier terms, the multifrequency model of 
the proposed stability law can be applied as shown in (Merdol and Altintas 2004). 

The milling system oscillates at the chatter frequency (< oo c ) when it is critically 
stable. The harmonic response of the structure at time (t) and previous tooth period 
(t — T) can be expressed as 



mm 


N 
2 n 


a 


(17) 



[®(iffl)]{F(f)},{r(f- T)} = [® {ico)\e 


-icoT 


HOI 



By substituting the regenerative vibrations (Eq. 18) into dynamic milling 
(Eqs. 15 and 16) with average directional factors (Eq. 17), the following stability 
formulation is established: 
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By defining the oriented frequency response function matrix (O 0 ) and the 
eigenvalue (A) of the characteristic equation as 



the stability of the system is evaluated from the eigenvalues (A) of the characteristic 
equation 


det[[/] + A[OoO 


C 




Since the oriented transfer function matrix (O 0 ) is fourth order, the eigenvalues, 
which lead to critically stable cutting conditions, are found from the roots of the 
following polynomial: 


( 2 qA Cl\ A “l - <22 A T (23 A 1 
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( 22 ) 


Each eigenvalue has a real and imaginary part: 
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The critical depth of cut (< 2 i im ) and spindle speed (ft) are found from Eq. 16: 
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The eigenvalues (A) are solved by scanning possible chatter frequencies from 
the FRFs to generate pairs of critical depth of cut and corresponding speeds for each 
lobe k = 1,2,3,. . ., k. 


Examples of Simulation 
Plane Milling Chatter Simulation 

The proposed chatter stability law for milling process is experimentally verified by 
conducting milling tests for the single tool path. The experimentally identified 
modal parameters of the transfer function matrix are also given in Table 1, where 
the end mill cutter with two teeth (diameter =10 mm, rake angle =10 deg., helix 
angle = 30 deg.) is used in machining the aluminum alloy A16061-T6. And the 
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Table 1 Modal parameters of the end mill attached to Roeders 760 



Mode no. 

Natural frequency 

Damping ratio (f) 

Modal stiffness (k) 


1 

4,210 

0.016 

30.100 N/pm 

yy 

1 

4,220 

0.015 

23.300 N/pm 

Zz 

1 

826 

0.022 

185.00 N/pm 


cutting force coefficients are K tc = 1,157 (N/mm ), K ac = 1,100 (N/mnr), and 
K fc = 400 (N/mm 2 ). 

The analytically predicted chatter stability lobe is presented in Fig. 6. And the 
machined surface at A, B, C, and D conditions are listed, and cutting conditions A 
and C are identified as chatter phenomenon with clear chatter marks as predicted in 
the frequency-domain stability law. 

In the chatter prediction algorithm, the chatter frequency may vary due to the 
contact stiffness. Since the exact shifted frequency is not predictable, it may be 
useful to use a chatter frequency from FFT of the measured forces/vibrations. The 
presence of chatter is identified using chatter marks, and FFT of the measured 
vibration signal. 

This proposed model can be applied to pocket milling process, which has the 
tool path as shown in Fig. 7. NC code is generated using CAM software for 
machining along the given tool path with axial depth of cut 2 mm. And the entry 
and exit angles (EEA) of cutters can be calculated based on boundary information 
of cutter and workpiece. Figure 8 illustrates EEA calculated from tool path infor- 
mation. It is shown that the entry angle gets smaller as cutter approaches pocket 
corner. 

According to the decrease of the entry angle, the critical axial depth of cut from 
stability law is reduced as illustrated in Fig. 9. For the given set of the end mill and 
workpiece, the frequency-domain solution can generate this geometry-dependent 
stability array CADOC [RPM, EEA] before NC simulation. When the developed 
algorithm detects the change of EEA in geometric simulation, it can compare 
CADOC [RPM, EEA] against ADOC [RPM, EEA]. 

For example, when spindle speed for pocket milling is 17,700 rpm, CADOC 
varies according to entry angles as shown in Fig. 9. ADOC (2 mm) of the given NC 
code becomes higher than CADOC as the cutter moves into the comer of pocket 
geometry at 17,700 rpm. In detail, when entry and exit angles are 1.2 rad and 3.14 
rad, respectively, CADOC [17,700 rpm, EEA] = 1.8 mm which is smaller than the 
axial depth of cut 2 mm of the given NC code. So the algorithm predicts that the 
machining becomes unstable at 17,700 rpm and EEA (1.2, 3.14). 

However, when spindle speed for pocket milling is 22,000 rpm, CADOC does 
not become less than ADOC (2 mm) even though the cutter moves into comer. In 
detail, when entry and exit angles are 1.2 rad and 3.14 rad, respectively, CADOC 
[22,000 rpm, EEA] = 4.25 mm which is larger than the axial depth of cut 2 mm of 
the given NC code. So the algorithm forecasts that the machining stays stable even 
though the cutter encounters the corner. 
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Fig. 6 Stability lobe and 
measured machined surfaces 
(Ko and Shaw 2009). (a) 
Condition A: 17,700 rpm, 
ADOC = 3 mm (b) Condition 
B: 18,800 rpm, ADOC = 

3 mm (c) Condition C: 20,300 
rpm, ADOC = 3 mm (d) 
Condition D: 22,000 rpm, 
ADOC = 3 mm 
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Condition A: 17700 rpm, ADOC = 3 mm 



Condition B: 18800 rpm, ADOC = 3mm 



Condition C: 20300 rpm, ADOC = 3 mm 






Condition D: 22000 rpm, ADOC = 3 mm 


In order to verify the predictions from the proposed algorithm, the experimental 
pocket machining with different spindle speeds were performed. The machined sur- 
faces and FFT of the cutting force component, F x measured at the comer area are shown 
in Figs. 10a- 1 and a-2, respectively. Figure 10a illustrates that the tool starts chattering 
with around 4,360 Hz as the end mill gets into comer with 17,700 rpm even though the 






21 Machining Dynamics in Manufacturing 


883 


Fig. 7 The tool path of 
pocket milling process: radius 
= 5 mm, axial depth of cut 
(ADOC) = 2 mm 
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Fig. 8 Entry angle variation 
according to cutter movement 
into corner 




Fig. 9 The change of CADOC according to spindle speed and EEA (Entry and Exit Angles) 
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Cutting condition; 17700 rpm, Feed per tooth 0.05 mm, DOC - 2mm 




(b-l) Machined surfaces (no chatter at all) 
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Fig. 10 Experimental results under different conditions for pocket machining (Ko and Shaw 
2009). (a) Cutting condition: 17,700 rpm, feed per tooth 0.05 mm, DOC = 2 mm. (a-1) Machined 
surfaces (chatter at cornering section, no chatter at straight tool path section), (a-2) FFT of cutting 
force component Fx measured at the corner, (b) Cutting condition: 22,000 rpm, feed per tooth 0.05 
mm, DOC = 2 mm. (b-l) Machined surfaces (no chatter at all) (b-2). FFT of cutting force 
component Fx measured at the corner 


straight tool path section does not generate chatter. However, Fig. 10b illustrates that 
the machining stays stable even though the tool encounter the comer with 22,000 rpm. 
Figures 10b- 1 and b-2 show stable machined surfaces with no chatter marks and FFT of 
F x only with tooth-passing frequency, respectively. These experiments results match 
with the predictions using the proposed method. 
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Plunge Milling Chatter Simulation 

Plunge milling tests at n = 1 ,000 rev/min spindle speed but at various immersion 
and feed rates were performed in Fig. 11. The predicted and measured cutting 
forces are in good agreement, which indicates the correctness of rigid body kine- 
matics of plunge milling, chip thickness evaluation from digitized surfaces cut by 
successive teeth, the process mechanics, and linear cutting coefficient model. One 
insert had 20 pm axial and 1 .5 pm radial runouts, which are measured relative to the 
reference insert and included in the time-domain simulation model. 

The cutting force coefficients for the aluminum material A17050-T7451 were 
identified as follows: K tc = 956 N/mm 2 , K a = 396 /K tc , and Kf— 31\/K tc . The modal 
parameters identified for the given tooling are listed in Table 2. The stability lobes 
predicted by the proposed theory are given in Fig. 12. The presence of chatter is 
identified when the resulting cut surface is rough, and the process is dominated by a 
high-pitch sound which does not correspond to the harmonics of tooth-passing 
frequency, but close to one of the natural frequencies. More than 100 plunge 
milling tests were performed to validate the theory, and overwhelming number of 
predictions were in good agreement with the experiments. The lobes were domi- 
nated by the torsional-axial coupled mode (z e ) around 12,000 Hz. If this torsional- 
axial coupled mode (12 KHz) is disabled in the simulation, the lobes, which 
limit the axial depth of cut most, disappear and are replaced by less dominant 
lateral modes. 

Sample cutting force and vibration measurements at unstable (Point A) and 
stable (Point B) conditions are shown as an example. The unstable cutting condition 
at Point A corresponds to spindle speed of 16,000 rev/min and 5 mm radial depth 
of cut. The chatter occurs due to the coupling of torsional-axial vibrations at 
12,360 Hz. Although the lateral cutting forces F x and F y are significantly cancelled 
due to the symmetry of the geometry, the torque generated by each tooth is summed 
as the total torque. The dynamic torque leads to torsional vibrations, which in turn 
cause axial vibrations due to coupling effect. The spindle speed and radial depth of 
cut at stable cutting condition (Point B) were 17,142 rev/min and 5.75 mm, 
respectively. The measured cutting forces and sound spectrum are dominated by 
the spindle speed (285.7 Hz) due to runout. 

It is important to predict the amplitude and frequency of cutting forces and 
elastic displacements in machining under stable, chatter and forced vibration 
conditions. The dynamic loads are transmitted to the spindle bearings and tool 
holders, and the spindle designer must forecast their magnitude by considering 
chatter conditions. Similarly, excessive forced vibrations may violate the dimen- 
sional tolerances of the part, although the process may be stable. Furthermore, the 
accuracy of linear frequency-domain solutions is best checked against the time- 
domain simulations where all the process nonlinearities can be considered. 

The stability of the same plunge milling operation is tested by conducting time- 
domain simulations at 100 rev/min increments at the spindle speed range of 
13,500-16,500 rev/min. The depths of cuts are 2.5, 4.0, and 5.5 mm, and the 

stability is assessed by checking the amplitude history of chip regeneration, cutting 
forces, and vibrations, as well as their spectrum. The time-domain prediction results 
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Table 2 Modal parameters of the plunge mill attached to Mori Seiki SH403 with HSK 
63 interface 



Mode# 

Natural frequency (Hz) 

Damping ratio (f) 

Dynamic stiffness (2 kQ 


1 

508 

0.024 

5.68 N/pm 


2 

3,040 

0.024 

1 .99 N/pm 


3 

4,035 

0.018 

2.42 N/pm 

yy 

1 

515 

0.051 

4.52 N/pm 


2 

3,100 

0.030 

4.90 N/pm 


3 

3,961 

0.018 

1.87 N/pm 

Zz 

1 

321 

0.048 

23.30 N/pm 


2 

405 

0.038 

59.80 N/pm 

zo 

1 

12,090 

0.0015 

0.0649 Nm/pm 

9e 

1 

12,000 

0.0024 

53.80 Nm/rad 


correspond well with the lobes predicted by the analytical solution and experiments 
shown in Fig. 12. The cutting conditions at unstable Point A and stable Point B were 
simulated and given in Figs. 13 and 14, respectively. Similar to the force measure- 
ments collected during the experiments (Fig. 12), the simulated forces under the 
same cutting condition of Point A show chatter at 12,137 Hz torsional-axial 
coupling mode frequency (see Fig. 13). The small discrepancy between the simu- 
lated and measured chatter frequency is mainly due to resolution of FFT and 
numerical integration of the delayed differential equations in evaluating the chip 
thickness. The simulated forces under the same cutting condition of Point B are 
stable with constant axial force as shown in Fig. 14. 


Application of Machining Dynamics 

As shown in the Fig. 15, the cutting can become stable or unstable (chatter) 
according to cutting condition for the given set of cutting tool and workpiece. In 
case of unstable cutting, the additional time and resources of removing chatter may 
be wasted in trial-and-error approach. Machine tools’ dynamic parameters, work- 
piece materials, and cutter’s dynamic performance should be considered for 
improving machining productivity and machining quality. Also, the cutting steps 
such as rough, semifinish, and finish cutting should be reflected on cutting strategy. 
So, extensive machining expertise including machine tool dynamics, cutting 
mechanics/dynamics, cutting tool/workpiece material information, computer- 
aided manufacturing, and experimental approach is required for machining optimi- 
zation. The modeling considers these requirements into the stability lobe genera- 
tions by focusing on machining vibration and cutter deflection problem which 
machining industries often encounter. 

The vibration parameter of the tooling or workpiece structure can be identified 
through impact hammer and accelerometer. The analysis of transfer function 
between impact hammer force signal and acceleration signal is transferred into 
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Fig. 14 Predicted cutting 
forces for cutting condition B 
(cutting conditions: full- 
immersion plunge milling 
mode, spindle speed = 16,000 
rpm, feed per tooth = 0.075 
mm/tooth, radial depth of cut 
= 5 mm) (Ko and Altitntas 
2007a, b) 



stability lobe generation software module. The stability lobe considers machine 
tool, tooling information (cutter geometry: rake, helix, relief angles), workpiece 
material and so on as illustrated in Fig. 16. This stability lobe is generated based on 
physical cutting simulation including dynamic cutting force model and machining 
system’s transfer function and so on. 

In Singapore, SIMTech, Ko has been working for local precision engineering 
companies to customize a technology to integrate the vibration mode of each 
machine tool into process simulation and optimization for high productivity. 
Through modeling and simulation, DYNACUT, innovative software as shown in 
Fig. 17, is developed that incorporates the vibration mode information to identify 
optimal machining parameters that avoid machining chatter without sacrificing 
machining productivity. Elimination of chatter marks on the machined surface 
resulted in higher surface integrity without the need for tedious, often manual, 
secondary re-machining operation such as polishing. 

As shown in Fig. 18, engineers’ hammer tests on the tooling or the workpiece 
identify transfer function of the machining system, which is integrated into the 
milling dynamic model to predict cutting force, deflection/vibration, power, and 
torque considering the cutter geometry, workpiece material, and system’s fre- 
quency/dynamic stiffness. Through modeling and simulation of machining process 
based on machining mechanics and dynamics, a toolkit is developed that incorpo- 
rates the vibration mode information to identify suitable machining parameters in 
order to avoid a machining chatter without sacrificing machining productivity as 
shown in Fig. 19. 

As shown in Fig. 19a, cornering milling often encountered chatter phenomenon 
in mold base manufacturing, and chatter marks can appear at the cornering of long 
depth. So the cornering milling simulation needs to be incorporated based on the 
application of milling chatter solution which can effectively remove the chatter 
marks as shown in Fig. 19b. So the user interface should include option for 
cornering milling process by considering the cornering radius as well as the width 
of cut, spindle speed, depth of cut, and feed rate. 
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Hammering Test 
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Workpiece Information 

Name (lAforkpiece material index) 
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Spindle rpm or Cutting Speed 
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Feed rate 


Fig. 16 Input parameters/database information 
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Fig. 17 Quick milling optimization software, DYNACUT developed by Ko 


Local companies have applied the technology to actual mold machining for case 
study for comparisons of machining times and ensuring surface quality. The case 
study is able to validate the applications at machine shop floor. Semifinishing 
and finishing processes have been tested by engineers using the quick milling 
vibration solver, and existing conditions have been compared with recommended 
conditions from quick milling vibration solver. The mold material is S50C. 
Figure 20 shows the overall machining results and successful finishing quality of 
the actual product with the application of the recommended conditions from the 
developed models. 
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Improve productivity & 
surface quality 

Enhance sustainability 


Fig. 18 Overall configuration of the quick milling vibration solver 


According to the machining results, machining time has been reduced around 
30 % with cutting conditions by applying the proposed modeling while good 
surface quality and tool life were maintained. So this test validates that this 
technology can improve productivity while maintaining sustainability in terms of 
surface quality and tool life. 

Companies have benefited from the technology in: 

• Reduction of machining time 

• Improved surface quality of its products 

• Increased capability of its trained engineers through enhanced, skills, and under- 
standing of machining phenomena 

The technology is tuned into SME environment where daily productivity is 
critical. It allows users to build their own machine tool/tooling’s dynamic param- 
eters into customized database structure. Company engineers have been trained on 
the use of the solver, to adopt the solution to milling machines on the production 
floor. The operation director, who led the project in the company, mentioned in his 
appreciation letter, “With the success in implementation of the Vibration Solver, 
the technology helps us to improve the machining surface quality against machin- 
ing chattering marks and to optimize cutting condition for higher productivity in 
our precision machining works.” 
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Fig. 20 The overall 
machined geometry and 
results with cutting condition 
recommended by quick 
milling vibration solver for 
real production (Material 
S50C) [in collaboration with 
Yudowanco Pte Ltd.] (a) 
Hammering tests using quick 
milling vibration solver (b) 
Machined results 




Summary 

Even though computer numerical control (CNC) machining technology is well 
developed, machining systems still require long periods of trial and error to 
optimally produce a given new product design or component. There is still a risk 
of limited knowledge of machining results according to machining conditions as 
well as machine tools at the production stage. So, machine tool operation still relies 
on engineer’s experiences and know-how. These deficiencies cause significant 
delays in time to market, increase cost, and reduce productivity. This article 
introduces the simulation of the machining dynamics for high quality and produc- 
tivity in manufacturing process. The simulation technology can be used to over- 
come the limited operations and low productivity due to machining chatters. 

For modeling of machining dynamics, the dynamic uncut chip thickness should 
be modeled in order to reflect the regenerative vibration due to the inner and outer 
modulations. With introduction of the dynamic uncut chip thickness determined 
by the present and previous positions of cutting edges, the regenerative effect can 
be considered into the machining stability models. This article explains the 
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formulations of the frequency- and time-domain models. The frequency-domain 
solution adopts the linearization of process variables, which results in rapid gener- 
ation of chatter- free cutting conditions. The solution has been widely implemented 
in machining industries as a quick guide for process planners. The time-domain 
model allows prediction of cutting forces, torque, and vibrations, which is essential 
in planning the operations without overloading the tool, machine, and workpiece 
for a given set of cutting conditions. The time-domain simulation verifies process 
physics and allows sensitivity analysis of time-varying parameters, nonlinearities 
caused by material behavior, and tool geometry variations along the cutting edge. 
So the model allows the analysis of various tool geometry, cutter-part engagement, 
and cutting conditions and predicts process performance measures such as forces 
and vibrations. 

The application sequence is to first use the frequency-domain model to see the 
chatter stability lobes over wide ranges of spindle speed and cutting depth and to 
finally apply time-domain model in order to confirm the condition or the selection 
of tooling with prediction of cutting forces, torque/power, and surface errors. The 
presented modeling can fulfill the following objectives for manufacturing 
industries: 


Improve machining surface quality against machining chattering marks 

- Increase productivity by eliminating the polishing processing to get rid of 
chatter marks on workpiece 

- Enhance machining accuracy such as surface roughness, form error, and 
dimensional error 

Enhance competency over productivity & machining know-how 

- Immediate identification of optimal cutting condition for higher productivity 

- Save material cost and machining time by selecting a right set of spindle 
speed and cutting depth 

- Transfer the know-how to solve machining vibration problem through train- 
ings and toolkit 

Reduce workpiece damage and tooling/spindle wear 

- Preserve workpiece cost against vibration 

- Prolong tooling/machine tool life and reduce machine downtime due to the 
wear and tear of spindle unit as a result of machining vibration 

- Improve the machining performance by avoiding vibration with stable 
condition 

Equip high value toolkit to support engineers 

- Stay competitive in terms of manufacturing costs against low-labor-cost 
countries 


- Upgrade engineers’ performance with the application of the portable and 
quick machining vibration solver 

- Overcome the shortage of the experienced engineers against machining 
vibration issues 
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Abstract 

There are thousands of materials available for engineering applications. 
Machinability is an indicator of one engineering material how easy or diffi- 
cult to be machined using a cutting tool to achieve an acceptable surface 
finish, which could be considered as a material property. Engineers are often 
challenged to find ways to improve machinability without harming material 
performance, which are much focused on the machining efficiency and 
productivity. However, unlike most material properties, machinability cannot 
be simplified into a unique work material property, but rather considering as a 
resultant property of the machining system which is mainly affected by work 
materiaEs physical properties, heat treatment processes, and work-hardening 
behavior, as well as cutting tool materials, tool geometry, machining opera- 
tion type, cutting conditions, and cutting fluids. When assessing a material 
machinability, all other aspects of the machining system must be considered 
concurrently. An understanding of the interactions between tool and work 
materials at the tool-work interface would benefit to machining behavior and 
machinability. Tool material and cutting speed perhaps are the two most 
important parameters for engineering material machinability assessments. 
Materials with good machinability require little power to cut, can be cut 
quickly, easily obtain a good surface finish, and do not wear the cutting 
tool fast. Engineering materials could be developed with improved machin- 
ability or more uniform machinability through microstructure modification 
and chemical components adjustment. Advance developed tool materials with 
high thermal hardness and wear resistance would improve the material 
machinability. 



22 Machinability of Engineering Materials 


901 


Introduction 

Engineering Materials 

There are thousands of materials available for engineering applications. Based on 
the atomic bonding forces of a particular material, all engineering materials can be 
grouped into four classes: metallic, ceramic, polymeric, and composite which is 
created by a combination of different materials. Within each of four classes, 
materials can be further organized into groups based on their chemical composition 
or certain physical or mechanical properties. Below is a list of four general groups 
of materials and their distinctive properties. 

(A) Metals and alloys - inorganic materials compounded one or more metallic 
elements, including ferrous metals and alloys (iron, carbon steel, stainless 
steel, tool steel, and steel alloy) and nonferrous metals and alloys (aluminum, 
copper, brass, magnesium, nickel, titanium, and superalloys): 

• Usually having crystalline structure 

• High thermal and electrical conductivity 

• High strength and elastic modulus 

• Good ductility and toughness 

• Strengthened by alloying and heat treatment 

• Corrosion resistant 

(B) Ceramics and glasses - inorganic materials consisted of both metallic and nonme- 
tallic elements bonded together, including glass, carbide, glass ceramic, and graphite: 

• High melting point 

• High chemical stability 

• High hardness and thermal strength 

• Brittleness 

• Low electrical conductivity 

(C) Polymers - organic materials consisted of long molecular chains or networks 
containing carbon, including thermoplastic, thermosetting plastic, and elastomer: 

• Low density and low rigidity 

• Low electrical conductivity 

• Good strength- to -weight ratio 

• Corrosion resistance, but does not apply to high temperature 

• Good compatibility with human tissue 

(D) Composites - consisted of two or more above distinct materials together, 
including metal-matrix composite, ceramic matrix composite, and fiber rein- 
forcement composite: 

• Consisting a matrix and a reinforcement 

• Combining the best properties of each component material 
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Definition of Machinability 

The term “machinability” is a property of a material which governs the ease or 
difficulty with which the material can be machined, achieving an acceptable surface 
finish using a cutting tool. Materials with good machinability require little power to 
cut, can be cut quickly, easily obtain a good surface finish, and do not wear the 
cutting tool fast. Such materials are said to be free machining. Thus, all engineering 
materials machined may be classified as (Shaw 1983): 

1 . Easy-machining materials (aluminum and copper alloys) 

2. Ordinary wrought steels and cast irons 

3. Difficult-to-machine materials 

Manufacturing and production engineers are often challenged to find ways to 
improve machinability without harming material performance, which are much 
focused on the machining efficiency and productivity. However, unlike most 
material properties, machinability cannot be simplified into a unique work mate- 
rial property which is not a commonly accepted parameter used for its measure- 
ment of machining performance, but rather considering as a resultant property of 
the machining system which is mainly affected by work material’s physical 
properties such as micro structure, grain size, heat treatment, chemical composi- 
tion, fabrication, hardness, yield strength, tensile strength, Young’s modulus, 
thermal conductivity, thermal expansion, and work hardening. Other factors also 
affecting material machinability include cutting tool materials, tool geometry, 
machining operation type, cutting conditions, and cutting fluids, as well as 
machine tool, fixture, and jig. 


Machinability Criteria 

There are many factors affecting material machinability, but no widely accepted 
way to quantify it. The basic machinability of a material is a function of its (Shaw 

1983): 

• Chemistry 

• Structure 

• Compatibility with tool materials 

The basic machining characteristics of all engineering materials are quite dif- 
ferent due to the differences in their chemical and physical properties. Such a wide 
variety of engineering materials will also have different machinability depending 
upon machining conditions and tool materials used. As the machinability is more 
like a resultant property of the machining system, to assess one material’s machin- 
ing performance, i.e., machinability, may utilize one or more of the following 
criteria (Trent and Wright 2000): 
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1. Tool life - is the measurement of how long a tool lasts when cutting a material. 
The Taylor equation for tool life expectancy provides a good approximation: 



where V c is cutting speed, 

T is tool life, and 

n and C are constants found by experimentation or published data; they are 
properties of tool material, workpiece, and feed rate. 

2. Martial removal rate (MMR) - is the volume of material removed by the cutting 
tool per unit time. For milling process, the material removal rate of the process 
can be obtained by 

Rmr = vfd (2) 

where R mr is material removal rate, 
v is cutting speed, 

/ is feed rate, and 
d is depth of cut. 

3. Machined surface finish - also known as surface texture, is a characteristic of a 
surface including, surface roughness, and waviness. One of the most effective 
surface roughness measurement, commonly adopted in general engineering 
practice, is known as arithmetic mean value R a , an arithmetic average of the 
absolute values as following: 





where R a is arithmetic mean surface roughness. 

It gives a good general description of the height variations in the machined 
surfaces. The surface profile contains n equally spaced points along the trace, and 
is the vertical distance from the mean line to the point. Height is assumed to be 
positive above the mean line, and negative below the mean line. 

4. Cutting forces - required for a tool to cut through a material directly related to 
the power consumed. Therefore, tool forces are often given in units of specific 
cutting energy, P S9 i.e., energy required to remove unit volume of material: 


P s = F/fd 



where F is cutting force. 

5 . Cutting power - is the power consumed by the machine tool during machining 
operations to remove the unwanted material from the workpiece material in the 
form of chips, which can be expressed as 



where P m is cutting power. 
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6. Chip formation - chips formed and flowed along rake face around the cutting 
tool and chip shapes influenced by work materials, machining operation type, 
cutting tool geometry, and cutting conditions. Normally, there are three basic 
types of chip formed during machining processes: continuous, discontinuous, 
and continuous with built-up edge (BUE). For some extreme cases, types of 
chips formed may change from continuous to continuous with BUE and further 
to discontinuous even cutting of the same work material with the same cutting 
tool just by changing its cutting parameters. 

A material may have good machinability by one criterion, but poor by another. 
Also, relative machinability may change when a different type of machining 
operation is being carried out, for example, turning versus milling, or the tool 
material used being changed. Due to many factors involved, it is best to state the 
comparison when two materials are compared to their machinability. 


Material Property Effects on Machinability 

The best machinability can be achieved in a material if there is: 

• Not too high ductility 

• No high work (strain) hardening 

• Not too much abrasiveness 

It is not easy to define the material machinability without the help of the original 
material’s properties. But certain trends can be found for certain materials. 


Hardness 

Hardness is closely related to strength. It is the ability of a material to resist a 
localized deformation from indentation, scratching, penetration bending, abra- 
sion, and wear. Alloying techniques and heat treatment would help to achieve 
high resistance to the localized deformation, i.e., high hardness achieved after 
alloying or heat treatment. It is measured on special hardness testers by 
measuring the resistance of the material against penetration of an indent or of 
special shape under a given load. Hardness measurements are widely used for 
material quality control because they are fast and considered to be nondestruc- 
tive tests. The different scales of hardness are Brinell hardness, Rockwell 
hardness, Vickers hardness, etc. Hardness can be an indicator of material 
machinability, but hardness alone is not consistent as a measure of material 
machinability. 

The material’s machinability varies with its micro structure and hardness (Finn 
1989). A work material having a low hardness and low work-hardening rate pro- 
duces small cutting forces and low contact pressure at the tool-chip interface, 
consequently causing a low temperature rising at the cutting region, hence, shows 
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a good machinability. Low hardness values make a material easier to machine. 
However, low hardness is a disadvantage if it is associated with high ductility. 

Apparently, a material is chosen for a particular application primarily due to its 
properties and functions, and generally, its machining performance is becoming a 
secondary factor in this selection. Hardness is changed in accordance with its 
micro structure. Therefore, control of work material micro structure is particularly 
important as it is one property which can be influenced by the material manufac- 
turer, to some extent. Hard particles like carbides and nitrides are undesirable for 
materials as they are the cause to a faster abrasive tool wear. If a slight degradation 
of material properties would not much affecting on its application, choosing an 
easy-to-machine material will benefit to the reducing of machining cost. 


Yield Strength 


Stress is a resistance against the internal force acting within a deformable object. 
Quantitatively, it is a measure of the average force per unit area of a plane within the 
object on which internal forces act. These internal forces arise as a reaction to external 
loads applied to the object. Normal stress o is on a plane perpendicular to the 
longitudinal axis of the object, which can be calculated using the following equation: 



where P is external applied load and 
A is original cross-sectional area. 

A strain is a normalized measurement of the deformation representing the 
displacement between particles in the object relative to a reference length. Normal 
strain e in the longitudinal direction is the total deformation divided by the original 
length of the object, which can be calculated using the following equation: 



where S is change in the object’s length and 
L is original object length. 

As shown in Fig. 1, the stress a is shown as a function of strain s and the initial 
portion of the stress-strain curve for the most materials used in engineering structures 
is a straight line. That is, for stress-strain curve straight line portion, the stress o is 
directly proportional to the strain s, of which this relationship is known as Hooke’s law 
in terms of continuum mechanics and was first recorded by Robert Hooke, an English 
mathematician, in 1678. Therefore, for an object subjected to a uniaxial load, its strain 
s is linearly proportional to its tensile stress o by a constant factor E\ hence, 

a = Es (8) 


where E is elastic modulus. 
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Fig. 1 Stress-strain curve showing a typical yield behavior for engineering materials, (a) Ferrous 
alloys, (b) Nonferrous alloys 


Hooke’s law only holds for some materials under certain loading conditions, which 
is only valid for it throughout its elastic range, i.e., for stresses below yield strength. 

Yield stress is an indicator of the stress level where the material begins to deform 
plastically. This stress level point is called as yield point where some fraction of the 
deformation will be permanent and nonreversible, and the material no longer 
returns to its original shape and size after the applied stress release. However, not 
all material has a well-defined yield point. In the absence of a distinct yield point, a 
0.2 % offset is used to obtain an approximate yield point. Figure 1 is a stress-strain 
curve showing typical yield behavior for nonferrous alloys: (a) ferrous alloys and 
(b) nonferrous alloys. 

In general, material hardness and yield strength are related. A material with high 
yield strength requires a high level of force to initiate chip formation in a machining 
process. Materials with relatively high strengths will be more difficult to machine 
and will reduce tool life. The same as hardness, a work material having low yield 
strength produces small cutting forces and low contact pressure at the tool-chip 
interface, consequently causing a low temperature rising at the cutting region, 
hence, shows a good machinability too. 


Fracture Toughness 

Fracture mechanics is one field of mechanics concerned with the study of crack 
propagation in materials, which was developed during World War I by an English 
aeronautical engineer, A. A. Griffith, to explain the failure of brittle materials. 
Fracture is a separation of an object or a material into two, or more, pieces under 
the action of stress. Rather than focusing on the crack-tip stresses directly, Griffith 
employed an energy-balance approach that has become one of the most famous 
developments in material science, which can be used to calculate the driving force 
on a crack and characterize the material’s resistance to fracture. 
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Sliding Mode 


Fig. 2 Three possible modes of crack extension in materials 



Fracture toughness is one of the most important material properties for all 
actually designed applications, which is an indicator of the stress amount required 
to propagate preexisting defects as there is no perfect material in the nature. All 
materials more or less contain some flaws which may have appeared as cracks, 
voids, metallurgical impurity, twins, segregations, dislocations, discontinuities, or 
some combination thereof. 

It has been shown that the two most important defects affecting the material 
failure are cracks and dislocations (Ewalds and Wanhill 1986). Usually, there are 
three possible modes of crack propagation generally identified by the subscripts I, 
II, and III as shown in Fig. 2: mode I also called the opening mode, mode II usually 
called the sliding mode, and mode III also referred to as the tearing mode. In 
practice, among these three modes, the most applicable is the opening mode (mode 
I). Crack propagation in materials has been the subject of many researches in the 
field of linear elastic fracture mechanics and fatigue being well documented. 

Irwin found that if the size of the plastic zone around a crack is small compared 
to the size of the crack, the energy required to grow the crack will not be critically 
dependent on the state of stress at the crack tip. He showed that for an opening mode 
crack (mode I), the strain energy release rate G for crack growth and the stress 
intensity factor Kj are relevant, which can be expressed using the following stress 
field equations (Irwin 1957): 


for plane stress 

i/\K 2 

— for plane strain 

where G is strain energy release rate or energy dissipation, 

Gj is strain energy release rate for a mode I crack (opening mode), 

v is Poisson’s ratio, and 

Kj is stress intensity factor in mode I. 
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Fig. 3 A plate of finite width 
subjected to uniform stress 





An internal through crack of length 2a is situated in a plate of finite width W and 
is subjected to a stress o at the boundary as shown in Fig. 3. The stress intensity 
factor Ki is expressed as 


Kf = Yo\fna 



where Y is a geometric factor and 
a is half crack length. 

Irwin also showed that the strain energy release rate of a planar crack in a linear 
elastic object can be expressed in terms of the mode I, mode II (sliding mode), and 
mode III (tearing mode) stress intensity factors for the most general loading 
conditions. 

As a property parameter of work materials, fracture toughness K c expresses the 
ability of a material to resist the growth of a preexisting crack or flaw. The stresses 
at the tip of a crack are much larger than that in the uncracked materials. It is widely 
accepted that the stress intensity factor K I and fracture toughness K c largely affect 
crack propagations for materials (Kendall 1976). The fracture toughness K c is 
expressed as 




for plane stress 
for plane strain 



where G c is critical energy release rate. 

If G, > Gc, this is the criterion for which the crack will begin to propagate. Irwin 
modified Griffith’s solids theory by using stress intensity factor K/ replacing strain 
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energy release rate Gj and fracture toughness K c replacing surface weakness energy 
G c . Thus, when the stress intensity factor K I reaches a critical level of fracture 
toughness K c , i.e., Kj > K c , the crack propagates and then fracture occurs. Low 
fracture toughness contributes to easy crack propagation which results in reduction 
in the difficulty of producing broken chips. Hence, a low fracture toughness value 
makes a material easier to machine, showing a good machinability. 


Temperature Effects on Machinability 

The power consumed in metal cutting is largely converted into heat around the 
cutting region. High temperatures are generated in the region of the cutting edge, 
which have a controlling influence on the rate of tool wear and on the friction 
between the chip and tool (Boothroyd and Knight 1989). Temperature rise at the 
tool-work interface, varied with the composition of the work material, is directly 
affecting the cutting tool performance and significantly influences in limiting the 
rates of metal removal when cutting iron, steel, and other metals and alloys of high 
melting point. Temperature distribution at the tool-work interface was determined 
by changes in microhardness or micro structure in the heat-affected regions of work 
material. 

During the orthogonal cutting of mild steel, the mean share zone temperature 6 S 
increases slightly with increasing cutting speed and then tends to become constant. 
Since the temperature rise ( 6 m ) at the tool rake owning to friction heat in the 
secondary deformation zone increases rapidly with the increasing cutting speed, 
as a result, the maximum tool-work interface temperature ( 6 m + 6 S ) increases 
rapidly with the increasing cutting speed too (Boothroyd and Knight 1989). For 
example, the maximum tool-work interface temperature can reach as high as 
1,000 °C when rough cutting of steel. 

The high cutting temperature has a controlling influence on the work material 
mechanical properties as shown in Figs. 4 and 5. Figure 4 shows the effect of 
temperature on material’s strength and ductility, and Fig. 5 shows the effect of 
temperature on the material’s hardness of different engineering materials. Nor- 
mally, most material tensile strength and yield strength will decrease with the 
increasing temperature. For example, yield stress (YS) and ultimate tensile strength 
(UTS) of steel with 75 % ferrite, 15 % bainite, and 10 % retained austenite in 
volume decrease with an increasing temperature. There exists a temperature region 
(120^400 °C) where a reduction of strength with increasing temperature is retarded 
or even slightly increased (Akbarpour 2007). 

Ductility or elongation of most engineering materials will increase with the 
increasing temperature. But increasing temperature will cause material hardness 
decreasing. Experimental results of tensile stress-strain curves obtained at temper- 
atures of 260 °C and 480 °C for aluminum alloy 7000 T4 applying a constant strain 

O 1 

rate of 10 s showed that the lower temperature specimen underwent a moder- 
ately ductile failure with regions of both strain hardening and necking before the 
final sudden fracture, of which elastic modulus is 63 GPa, 0.2 % yield strength is 
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Fig. 4 Effect of temperature 
on material’s strength and 
ductility 
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Fig. 5 Effect of temperature 
on material’s hardness 



180 MPa, and ultimate strength is 233 MPa. The higher temperature specimen 
experienced a highly ductile failure with no sudden fracture, of which elastic 
modulus is 37 GPa, 0.2 % yield strength is 68 MPa, and ultimate strength is 
73 MPa (Codrington et al. 2009). 

Figure 6 shows the typical effects of temperature on stress and strain for 
aluminum alloy 7075-T6: (a) strain obtained under three constant stresses heated 
rapidly at widely different temperature rates and (b) stress-strain curve at three 
different temperatures with a constant strain rate. The slant portions of the curves 
give mainly the effects of thermal expansion. The sharp downward turn at the right 
of each curve gives the effects of viscosity. 

Overall, when machining of whatever materials like metals, polymers, compos- 
ites, or ceramics, there always is accompanying with a temperature rising in the 
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Fig. 6 Effects of temperature on material stress and strain 
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Fig. 7 Schematic diagram of suitable cutting speed for different materials 


cutting region. For example, when rough cutting mild steel, the temperature rising 
in the cutting region can reach as high as more than 1,000 °C. Definitely, such high 
temperature rising will change the work material mechanical properties such as 
microhardness, tensile strength, yield strength, ultimate tensile strength, ductility, 
and strain-stress status and frequently, material micro structure changing also 
happens. As a result, work material machinability will be changed accordingly, 
which would be a compound result of all kinds of effects including work material 
mechanical and thermal properties, microstructure, as well as cutting tool thermal 
behavior. 

Figure 7 shows a schematic diagram of suitable cutting speed including normal 
machining and high-speed machining for different work materials, which indicates 
the differences of machinability for different work materials from the cutting speed 
aspect. For example, cutting with a speed of 200 m/min is considered as a normal 
machining for fiber-reinforced plastics, aluminum, bronze, and brass, but it is 
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considered as a high-speed machining for titanium alloys and nickel alloys. That is, 
tool wear is very fast when cutting superalloys like titanium and nickel alloys even 
though the cutting speed is considered a relatively low cutting speed for aluminum 
and copper. Therefore, to the two extreme cases, superalloys including titanium 
alloys and nickel alloys are classified as difficult-to-machine materials, and mate- 
rials like aluminum and copper alloys are classified as easy-machining materials. 

There are great achievements on development of cutting tools in the past few 
decades, including coated carbides, ceramics, cermets, cubic boron nitride (CBN), 
and polycrystalline diamond (PCD), which have been used to improve the machin- 
ing performance of cast irons, steels, and nickel alloys. The trends to develop novel 
cutting tools should consider the following technical aspects, especially for machin- 
ing of titanium alloys which currently is not much successful (Ezugwu and Wang 

1997): 

1 . High thermal hardness against high stresses involved 

2. Good thermal conductivity to minimize cutting temperature gradients and ther- 
mal shock 

3. Good chemical inertness to depress the tendency to react with titanium 

4. Toughness and fatigue resistance to withstand the chip segmentation process 

5. High compressive, tensile, and shear strength 


Aluminum and Aluminum Alloys 

Aluminum is a silver white metal element with chemical symbol A1 and atomic 

o 

number 13. Aluminum is remarkable for the metal’s low density of 2.70 g/ cm and 
for its ability to resist corrosion due to the phenomenon of passivation. Aluminum is 
the third most abundant element and the most abundant metal in the Earth’s crust. 
Majority applications of aluminum and its alloys are made of structural compo- 
nents, which are vital to the aerospace industry and are important in other areas of 
transportation due to their high strength-to-weight ratio, where lightweight or 
corrosion resistance is required. Due to low density, excellent recycling potential, 
and continuously improvement of mechanical properties by metallurgists, the 
demand of aluminum and its alloys in all the industries especially in transport 
industry is increasing. However, pure aluminum is too soft for such applications, 
and it does not have the high tensile strength that is needed for aerospace and 
transportation industries. 

Generally, the microconstituents present in aluminum alloys significantly influ- 
ence the machining characteristics. Nonabrasive constituents have a beneficial 
effect while insoluble abrasive constituents demonstrate a detrimental effect on 
tool life and machined surface quality. In fact, aluminum and aluminum alloys are 
considered as the family of materials offering the highest levels of machinability, as 
compared to other families of lightweight metals such as titanium and magnesium 
alloys. In general, they belong to the most machinable materials in small-to-medium 
volume production and mass production, which can be machined relatively easily. 
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As the melting temperature of aluminum (around 659 °C) and aluminum 
alloys is relatively low, consequently, the machining temperatures are generally 
low and never being high enough to damage the cutting tools’ micro structure 
such as high-speed steel tools and carbide tools. High cutting speed can be used 
up to 600 m/min when carbide tools are being used and 300 m/min for high-speed 
steel tools. Cutting speed can be achieved as high as 5,000 m/min when using 
poly crystalline diamond (PCD) tools. Tool wear occurs generally in the form of 
flank wear when machining of aluminum and aluminum alloys. Good tool life can 
be obtained as long as 20 km when PCD or single crystalline diamond tools 
being used. 


Pure Aluminum 

For commercial pure aluminum like many other pure metals, in spite of the lower 
share strength, the cutting forces are generated higher than those for aluminum 
alloys particularly at low cutting speeds due to a very strong adhesion between the 
cutting tool and work material. The contact length on the rake face tool-chip 
interface is very long, and no built-up edge (BUE) is present when machining of 
commercial pure aluminum. Long and ductile chips are formed regardless of the 
cutting tool geometry used. High tendency of aluminum to adhere to cutting tools 
produces a high risk leading to tool breakage during machining of commercial pure 
aluminum. 

As the chips formed during machining of commercial pure aluminum are 
continuous, strong, and not readily broken, it always clogs the flutes and spaces 
between teeth or accumulates and coils at the tool tip for single-point cutting. 
Therefore, the cutting tool needs to be sharp and have large positive rake angle 
with a chip breaker, and sometimes, specially designed cutting tools are required for 
cutting of aluminum. 


Aluminum Alloys 


Aluminum alloys are the alloys in which aluminum (Al) is the predominant 
elements. Typically, the technically alloying elements used for aluminum alloys 
are silicon, magnesium, manganese, zinc, and copper, targeting to improve fluidity, 
strength, ductility, castability, work hardening, etc. Aluminum alloys can be 
broadly categorized as the cast alloys and wrought alloys. Wrought aluminum 
alloys are generally divided into the heat-treatable and non-heat-treatable groups. 
Wrought alloys account for about 85 % of aluminum use and are mainly made for 
the forming products, for example, rolled plates, foils, and extrusions. The most 
important cast alloys are Al-Si alloys, where the high levels of silicon (4.0-13 %) 
contribute to give good casting characteristics. 

It is well known that chemical composition, structural defects, and alloying 
elements have significant effects on the machinability of aluminum alloys. High 
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levels of magnesium (Mg) increase the cutting forces while maintaining the same 
level of hardness, and a low percentage of copper (Cu) in aluminum alloys 
decreases the cutting force (Tash et al. 2006). Alloys containing more than 10 % 
Si are the most difficult to machine because hard particles of free silicon cause rapid 
tool wear. The presence of hard silicon particles in the alloys decreases machin- 
ability due to accelerated abrasive wear of carbide tools and chipping of PCD tools, 
which typically are limited to low cutting speeds both for carbide and PCD tools. 
When dry cutting aluminum alloys, the major problems encountered are the BUE 
generated at low cutting speeds and sticking at high cutting speeds, hence, the need 
for special tool geometries. Different heat treatment methods can be used to 
improve the machinability of aluminum alloys, which increase hardness and reduce 
the built-up edge (BUE) tendency during machining processes. Heat treatment of 
Al 6061, especially aging, influences the cutting forces only at low cutting speeds, 
but the influence is negligible at high cutting speeds because of the low temperature 
rising found in the cutting region. 


Copper, Brass, and Copper Alloys 

Copper is a reddish orange color metal element with chemical symbol Cu and 
atomic number 29, which is a comparatively heavy metal with a density of 8.98 

o 

g/ cm . It is soft and malleable and a highly ductile metal with a face-centered cubic 
structure, but it has a high melting point of 1,083 °C. Copper is one of the earliest 
metals known by human and have been used for thousands of years. The attractive 
properties which made copper very wide applications are good corrosion resistance, 
attractive color, excellent workability, and the best electrical and thermal conduc- 
tivity of any commercially available metals. Today, more than 50 % of the copper 
produced is used in electrical and electronic applications. 

Copper alloys are metal alloys that have copper as their principal component. 
Typically, the main alloying elements used for copper alloys are zinc, tin, 
aluminum, silicon, and nickel. There are as many as 400 different copper and 
copper alloys loosely grouped into the categories: copper, brasses, bronzes, nickel 
copper, copper-nickel-zinc (nickel silver), leaded copper, and special alloys. 
Table 1 lists the principal alloying element for most common types used in modern 
industry. 


Copper and Copper Alloys 

Pure copper is ductile and weak but can be strengthened by alloying, mechanical 
working, and in a small number of cases, precipitation hardening. The same 
behavior as pure aluminum, pure copper is difficult to machine because of the 
high friction forces between the chips and cutting tool. In general, copper alloys 
have a good machinability because the temperature rising generated by shear in the 
cutting zone is not high enough to make a very serious effect on the tool life and 
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Table 1 Classification of copper and its alloys 


Alloy group 

Main alloying element 

Copper alloys 

No deliberate alloying additions 

Brasses 

Zinc (Zn) 

Bronzes 

Tin (Sn) 

Silicon bronzes 

Silicon (Si), Zinc (Zn) 

Aluminum coppers 

Aluminum(Al), Iron (Fe) 

Beryllium coppers 

Beryllium (Be), Cobalt (Co), Nickel (Ni) 

Copper-nickels 

Nickel (Ni) 


cutting performance. The copper and its alloys can be classified into three groups 
based on their machining characteristics (Mills and Redford 1983): 

• Free-machining alloys 

• Difficult-to-machine alloys 

• Intermediate-machining alloys 

Both carbide and high-speed steel tools could be used for machining of copper 
and its alloys. No built-up edge (BUE) occurs when cutting high-conductivity 
copper. Tool wear occurs generally in the form of flank wear or crater wear or 
both when machining of copper and its alloys. Good tool life can be obtained as 
long as 10 km when PCD or single crystalline diamond tools are being used. Cutting 
forces are very high particularly at low cutting speed. The machinability of copper 
and its alloys are often based on the type of chips formed rather than tool life 
because they have only moderate shear strength. The machining behavior of copper 
is somehow improved by cold working and alloying. 


Brass 


Brass is an alloy of copper and zinc, and zinc content can be up to 43 %, which has 
higher malleability than bronze or zinc. The relatively low melting point of brass 
(900-940 °C, depending on composition) and its flow characteristics make it 
relatively easy to cast. Today almost 90 % of all brasses are recycled. By varying 
the proportions of zinc and copper, its properties can be changed. If zinc content is 
less than 35 %, the brass will be solidified in a single-phase micro structure, a-brass. 
A higher zinc concentration produces a two-phase micro structure, i.e., a-P brass. 

When cutting a-brass, the cutting forces are lower. There is a great reduction of 
cutting forces when cutting a-P brass. Greater p phase in brass, lower cutting force. 
For brasses containing between 62 % and 65 % copper, lead additions up to 3 % 
give a continuous improvement in machining characteristics. Due to the lower 
cutting forces and machine power consumption in cutting of both a-brass and a-P 
brass, together with slow tool wear rate, brasses are classified as easy-machining 
materials, i.e., high machinability. 
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Oxygen-Free Copper 


High purity copper is produced by electrolytic refining. Oxygen-free copper (OFC) 
generally refers to a group of wrought high-conductivity copper alloys that have 
been electrolytically refined to reduce the level of oxygen to 0.001 % or below. For 
industrial applications, oxygen-free copper is valued more for its chemical purity 
than its electrical conductivity. Oxygen-free copper is often used as the work 
material for optical components due to its material properties and favorable dia- 
mond machinability because optical grade surfaces can be achieved with single- 
point diamond cutting. When cutting of oxygen-free copper, the cutting force 
varieties are very sensitive to the grain orientation and the transition of the cutting 
tool across grain boundaries of the alloys. 


Copper-Nickels 

Copper-nickel or cupronickel is an alloy of copper containing nickel and other 
strength elements, which is highly ductile and single a phase micro structure alloy 
as copper and nickel are mutually soluble in the solid state over all ranges of 
compositions. The two most popular copper-rich alloys contain 10 % and 30 % 
nickel, which have excellent resistance to seawater corrosion. The corrosion resis- 
tance of copper-nickel alloys is due to a protective film formed when in contacted 
with seawater. As a result, copper-nickels are used for many marine hardware and 
seawater system. 

The machining properties of copper-nickel alloys are similar to many other high- 
strength copper-based alloys such as aluminum bronze and phosphor bronzes. 
When machining copper-nickel alloys, continuous chips are formed with no seg- 
mentation over the all cutting speed range. The workpiece materials are strongly 
bonded to the cutting tools over the whole contact areas for both high-speed steel 
and carbide tools. The cutting forces are very high at the low cutting speed. When 
increasing the cutting speed, cutting forces dropped rapidly. No built-up edge 
(BUE) is formed at low cutting speed. Flank wear and crater wear happen when 
cutting with high-speed steel tools. 


Beryllium Copper 

Beryllium copper, also called as copper beryllium, is a copper alloy with beryllium 
content around 0.5-3 % and other alloying elements, which can be age-hardenable 
and heat-treated to improve its strength. The nonmagnetic and non-sparking beryl- 
lium copper alloys with high strength and good electrical and thermal conductivity 
are used to make tools for the safe utilization in environments where there are 
explosive vapors and gases, such as on oil rigs. Meanwhile, they can be used to 
make springs, electronic connectors, bearings, molds, and corrosion-resistant 
hardware. 
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All beryllium copper alloys are readily machinable using conventional machin- 
ing processes with either high-speed steel or carbide tools. Because of the machin- 
ing characteristics of beryllium copper alloys, cutting tool life is excellent. In 
general, tool wear is proportional to speed, feed rate, and depth of cut. Inhalation of 
airborne beryllium may cause a serious lung disorder in susceptible individuals, as 
beryllium compounds are toxic and cumulative poisons. That is why there is a 
great safety concern on handling and machining beryllium copper alloys. In fact, as 
finished parts, beryllium copper presents no known health hazard. 


Iron, Steel, and Steel Alloys 


Iron is a lustrous silvery gray color metal element with chemical symbol Fe 

Q 

and atomic number 26. It has a density of 7.874 g/cm and a high melting point 
of 1,358 °C. Its micro structure is body-centered cubic (a iron) at room temperature 
and transforming to face-centered cubic (y iron) just above 900 °C. Iron has 
relatively low shear strength but high ductility with poor machinability. Iron alloys 
with less carbon content are known as steel. Steel basically is an alloy of iron with 
up to about 1.5 % carbon. 


Cast Irons 

Cast iron is a ferrous alloy containing 2. 0-4. 5 % carbon (C). Silicon (Si) is another 
main alloying element for cast iron, with the amount ranging from 1.0 % to 3.0 %. 
The high carbon content of cast irons leads to a reduction of melting temperature 
compared with steels. With relatively low melting point, good fluidity, castability, 
excellent machinability, resistance to deformation, and wear resistance, cast irons 
have become a popular engineering material widely used in different industries for 
making pipes, cylinders, machine tool beds, and automotive parts. 

Historically, the first classification of cast iron was based on its fracture: white 
iron and gray iron. With the advent of metallography, cast irons can be classified 
most exactly based on micro structural features. Here are the most common cited 
classifications used today: 

• White iron - a micro structure containing cementite phase, Fe 3 C 

• Gray iron - flake graphite in a matrix of ferrite, pearlite, or austenite 

• Nodular iron (also called ductile cast iron) - spheroidal graphite in a matrix of 
pearlite, ferrite, bainite, or martensite 

• Malleable iron - a white iron being heat-treated at about 900 °C to improve 
ductility 

Cast iron’s properties are changed by adding various alloying elements. The 
majority of engineering cast irons are of the ferritic or pearlitic types. 
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Cast irons show a wide range of machining behavior which depends upon 
composition and micro structure. Overall, cast iron has a good machinability : high 
material removal rate, low tool wear rate, relatively small cutting forces, small 
fragment chips formed, and low power consumption. Because the chips formed are 
not continuous, the length of contact on the rake face is very short. When cutting 
cast irons with carbide or high-speed steel tools, a built-up edge is formed and a 
very good tool life can be achieved even at high cutting speeds. Ceramic tools have 
been developed and utilized for machining cast irons mainly in mass production. 
Somewhat, machining of cast irons is a dirty and dusty operation as it throws a fine 
graphite spray into the air. 


Carbon Steels 

Steels vary greatly in chemical content and micro structure. The main interstitial 
alloying constituent for carbon steel is carbon. Generally, steels are classified 
according to their carbon content: 

• Low carbon steel, with less than 0.3 % carbon 

• Medium carbon steel, carbon content between 0.3 % and 0.8 % 

• High carbon steel, carbon content above 0.8 % and below 1.5 % 

Alloying elements in steel such as carbon, manganese, and chromium increase 
its strength. As the carbon content increases, steel can be harder and stronger 
through heat treatment, but being less ductile. This influences the cutting stresses 
and cutting temperature generated. Cutting force is reduced by the addition of 
alloying elements. Low carbon steels have low hardness and high ductility with 
the tendency forming built-up edge (BUE) adhered to the cutting tool strongly, 
which leads to reducing tool life and poor surface finish. Its machining behavior can 
be improved by increasing strength and reducing ductility through cold-drawing. 
Higher carbon content improves the machinability in such a way that hardness is 
increased moderately and ductility decreased. 

High-speed steel and carbide tools have been used to cut off a very 
high percentage of carbon steels. Using high-speed steel tools to machine steels 
with hardness higher than 300 HV is very difficult. Carbide tools can be used 
to cut steels with higher hardness. But when the hardness exceeds 500 HV, 
carbide tool life becomes very short and permissible cutting speeds are very 
low. To cut hardened steels, the cutting tools must retain their hardness and 
yield strength at the elevated cutting temperatures. Ceramic tools can be used to 
cut steels with hardness of 600-650 HV. Using cubic boron nitride (CBN) tools 
can machine fully hardened steels with large material removal rate and long tool 
life. But CBN tool costs also very high. Therefore, it is not surprising that the 
engineering industries always request for continuous improvement of steel’s 
machinability while maintaining the mechanical properties to ensure its 
performance. 
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Stainless Steels 

Stainless steels, also called corrosion-resistant steels, are iron-based alloys 
containing 10.50 % or more chromium (Cr). These steels achieve their “stainless” 
characteristics as a result of the invisible and adherent, chromium-rich oxide film 
that forms the material’s surface. Stainless steels can be broadly divided into five 
groups by the dominant room temperature microstructure: austenitic, ferritic, mar- 
tensitic, precipitation hardening, and duplex types. Over the past few decades, 
stainless steels have been widely used in a variety of applications such as chemical, 
petroleum, nuclear power, marine, transportation, hospitals, dairies, food industry, 
etc. Austenitic alloys are the most widely used one among all stainless steels in both 
cryogenic environment and elevated temperatures due to easy fabrication and 
corrosion resistance. 

Stainless steels are considered to be more difficult to machine than carbon steels 
due to high tensile strength, high ductility, high work-hardening rate, low thermal 
conductivity, and abrasive character, which lead to higher tool wear rate, difficul- 
ties with chip breakability, and poor surface finish. Austenitic stainless steels are 
strongly work hardening. A built-up edge is formed during cutting of austenitic 
stainless steels even in a relatively lower cutting speed than cutting of carbon steels. 
They bond very strongly to the cutting steel, and chips often remain stuck to the 
tool, causing the tool fragment once the chip broke away. Typically, austenitic 
stainless steels are more difficult to machine than ferritic and martensitic stainless 
steels. Duplex stainless steel is similar to austenitic stainless steel but harder to 
machine due to its high annealed strength. Generally, TiN-TiCN-coated carbides 
are recommended for machining of stainless steels using lower cutting speeds and 
material removal rates than carbon steels, larger depth of cut than work-hardening 
layer thickness, and flush coolant. 


Titanium and Titanium Alloys 

Titanium is a silver color metal element with chemical symbol Ti and 

o 

atomic number 22. It has a low density of 4.506 g/cm and a high melting point 
of 1,668 °C. The two most useful properties of titanium are corrosion resistance and 
the highest strength-to-density ratio against any metals. Titanium is 30 % stronger 
than steel but nearly 43 % lighter and 60 % heavier than aluminum but twice as 
strong. Titanium has a relatively low thermal expansion coefficient and fairly 
hardness although not as hard as some heat-treated steel, is nonmagnetic, does 
not exhibit a ductile-brittle transition, and has a good biocompatibility and a poor 
conductor of heat and electricity. However, oxygen and nitrogen are absorbed by 
titanium rapidly at the temperatures above 500 °C, which leads to potential embrit- 
tlement problems. 

Titanium can exist in a close-packed hexagonal structure, i.e., a-phase, at room 
temperature, and change to body-centered cubic structure, i.e., p-phase, at 883 °C. 
Alloying elements in titanium alloying tend to stabilize either a-phase below the 
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transformation temperature of 883 °C or allotrope P-phase above the transformation 
temperature of 883 °C. Generally, according to their crystal structures at room 
temperature, titanium alloys are classified into four groups: a-alloy, near a-alloy, 
P-alloy, and a + p alloy. The principal alloying element in titanium a-alloys is 
aluminum, and titanium a-P alloys are normally containing more than two elements 
including aluminum, copper, molybdenum, silicon, vanadium, and zirconium. The 
most widely used titanium a + p alloy is Ti-6Al-4 V, which contains 10-50 % 
P-phase in volume. 

Titanium alloys have excellent mechanical and exploitation properties such as 
high strength-to-density ratio, high corrosion resistance, high fatigue and cracking 
resistance, and ability to withstand moderately high temperatures without creeping, 
which have been widely used in aerospace industries as structural materials for 
supersonic aircraft and spacecraft and non-aerospace sections such as military, 
automotive, and sporting goods. Due to being biocompatible, nontoxic, and not 
rejected by the human body, titanium alloys are also very popular in medical 
applications including surgical implements and implants like joint replacement 
which can last up to 20 years. 

Titanium alloys are classified as difficult-to-machine materials due to their poor 
machining behavior, which are attributed to their unique physical, chemical, and 
mechanical properties: 

• Very low thermal conductivity (one-sixth of steel), causing the most heat 
generated concentrated on the cutting edge and tool rake face, which results in 
high temperature gradients at the tool-chip interface. 

• Relatively low modulus of elasticity, tending to move away from the cutting 
point, which can lead to distortion and chattering during machining. 

• Tendency of a strong alloying or reacting chemically with oxygen, nitrogen, and 
materials in the cutting tools at elevated cutting temperatures, causing galling, 
welding, smearing and abrasive wear along with rapid destruction of the cutting tool. 

• Relatively small contact area on the cutting tool rake face, resulting in very high 
contact pressure compared with the relatively high strength, raising the cutting 
temperature and increasing temperature gradient at a much localized portion of 
the cutting tool. The combination of high contact pressure and maximum cutting 
temperature rising very close to the cutting edge results in rapid tool breakdown. 

• Retaining its strength and hardness at elevated temperatures. Very high mechan- 
ical stresses occur in the immediate vicinity of cutting edge when machining 
titanium, contributing to cutting tool wear. 

Obviously, titanium and its alloys should be machined at lower cutting speed 
than those steels having similar hardness. The cutting forces are only slightly higher 
than those needed to machine steels, but their metallurgical characteristics make 
somewhat more difficult to be machined than steels with equivalent hardness. The 
combination of titanium’s poor thermal conductivity, low elastic modulus, small 
cutting contact area, strong alloying tendency, and chemical reactivity with cutting 
tools is a detriment to tool life and metal removal rate. 
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Table 2 Ti-6A1-4V alloy 
chemical compositions 


Chemical elements (%) 

Content 

Aluminum (Al) 

5.5-6.75 % 

Vanadium(V) 

3. 5-4. 5 % 

Iron (Fe) 

<0.25 % 

Hydrogen (H) 

<0.015 % 

Titanium (Ti) 

Balance 


Pure Titanium 

Commercial pure titanium is available in the market in a range of grades depending 
on oxygen, nitrogen, and carbon. Its hardness and strength increases and ductility 
decreases when the content of the abovementioned elements is increased. Histor- 
ically, titanium has been perceived as a difficult-to-machine material. Along with 
wide applications in many industries, experience and knowledge accumulated, 
titanium can now be machined no more difficult than stainless steel 316. High- 
speed steels are widely used for machining of titanium because of their flexibility 
and lower cost than cemented carbides. Titanium chips are continuous but typically 
segmented. The main problem of machining titanium is short tool life. As cutting 
speed is increased, cutting tool life is dramatically decreased. 

Slender components tend to deflect under the cutting contact pressure causing 
chatter, tool rubbing, and hence tolerance problems. Rigidity of the entire machin- 
ing system is consequently very important. Generally, titanium machining behavior 
can be improved through decreasing temperatures generated at the tool face and 
cutting edge by considering the following aspects: 

• Use sharp tools and positive cutting edges. However, complete tool failure 
occurs rather quickly after a small initial amount of wear takes place. 

• Use low cutting speeds. Tool tip temperatures are affected more by cutting speed 
than by other cutting variables. 

• Apply high feed rates. Cutting temperature is not much affected by feed rate 
compared to cutting speed. 

• Use flush coolant. Cutting fluids used require special consideration due to having 
chlorine ions may cause stress-corrosion cracking. 


Ti-6AI-4V 

Ti-6A1-4V, the most common titanium a + p alloy, is predominately used in the 
aerospace industries as airframes and engine components, accounting for over 45 % 
of the total titanium production. Its chemical compositions are listed in the follow- 
ing Table 2 and the mechanical properties of Ti-6A1-4V are listed in Table 3. 

When machining of titanium alloys, the heat-affected zone is much smaller and 
much closer to the cutting edge because of the thinner chips produced due to a short 
contact length. A large proportion about 80 % of heat generated during the 
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Table 3 Ti-6Al-4 V alloy 
mechanical properties 


Mechanical properties 

Values 

Density 

4.42 g/cm 3 

Melting point 

1,650 °C 

Hardness 

340-360 HV 

Ultimate tensile strength 

895-1,035 MPa 

0.2 % Yield strength 

825-965 MPa 

Elongation 

8-10 % 

Thermal conductivity 

7. 3-7. 5 W/m • K 

Elasticity modulus 

110 GPa 


machining of Ti-6A1-4V is conducted into the cutting tool because it cannot be 
transferred into fast-flowing chips due to a low thermal conductivity, which causes 
high tool-tip temperatures up to about 1,100 °C (Ezugwu and Wang 1997). Tita- 
nium is an extremely reactive metal and has a strong chemical reactivity with 
almost all chemical elements within the cutting tools. Fine particles of titanium 
can be ignited and burnt. The chips have a tendency to pressure weld to cutting 
tools, causing severe dissolution-diffusion wear at elevated cutting temperature. 
Notching, flank wear, crater wear, chipping, and catastrophic failure are the prom- 
inent failure modes for machining of Ti-6A1-4V. However, the use of flood coolant 
in most machining operations can eliminate this danger to a large extent. Using 
through spindle coolant directed to the cutting edges would improve cutting tool 
life and could even utilize a high cutting speed. 


Nickel and Nickel Alloys 

Nickel is a silvery white lustrous metal element with chemical symbol Ni and atomic 
number 28. Its melting point is 1,454 °C, lower than iron of 1,535 °C. It is hard and 
ductile with a face-centered cubic structure and does not undertake any transformation 
in its basic crystal structure up to its melting point. The metallurgy of nickel alloys is 
complex. Pure nickel can be alloyed with copper, chromium, iron, titanium, molyb- 
denum, aluminum, niobium, and tungsten. Solid solution-strengthened alloys includ- 
ing those containing copper and chromium demonstrated excellent corrosion 
resistance to seawater, hydrofluoric and sulfuric acids, and alkalis. Alloying elements 
are added to import high temperature strength through three basic mechanisms: 

• Solid solution strengthening (Cr, W) 

• Precipitation strengthening through formation of stable intermetallic compounds 

(Al, Ti, Nb) 

• Precipitation strengthening and high temperature oxidation resistance (Ta) 

Nickel-based alloys, such as commercially available grades Inconel, Incoloy, 
Nimonic, Rene, and K-Monel, are generally known to be difficult-to-machine 
materials because of their high hardness, high strength at high temperature, strong 
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affinity to react with tool materials, and low thermal conductivity. When machining 
nickel and nickel alloys, the maximum temperature rising occurs at the cutting tool 
tip, rather than on the rake face and a distance away from the cutting edge for steel 
cutting. No relatively cool region at the cutting tool edge is presented when cutting 
nickel and nickel alloys. High temperature generated tends to deform the main 
cutting edge and may result in rapid collapse of the cutting tool. That is another 
reason why nickel and nickel alloys are classified as difficult-to-machine materials. 


Pure Nickel 

Compared with nickel alloys, commercial pure nickel has high electrical and 
thermal conductivity and also has a poor machinability in terms of the almost all 
machinability criteria. Cutting forces are higher compared to cutting pure iron and 
copper. Contact length on the tool rake face is very large. No built-up edge is 
formed during cutting of pure nickel. High temperature rising at the chip formation 
zone results in a high tool wear rate. Tool failure is caused by rapid flank wear and 
cutting edge deformation at relatively low cutting speed. Tool life is relatively short 
and the maximum material removal rate is low. 

Compared with high-speed steel tools, carbide tools have a high compressive 
strength at high temperature, which can be used for cutting nickel and its alloys at a 
much high speed. Its tool wear is mainly on flank face by diffusion wear and defor- 
mation wear, rather than crater wear. However, carbide tools are not recommended for 
cutting of pure nickel generally. Because there is very strong bonding between nickel 
chips and tool faces, causing tearing and damages once the chips are removed. 


Inconel 718 

Inconel refers to a family of austenitic nickel-chromium-based superalloys. They 
are corrosion- and oxidation-resistant materials well suited for service in extreme 
environments subjected to pressure and heat. Inconel 718 is a precipitation- 
hardenable nickel-chromium alloy and an extremely difficult-to-machine material 
- largely due to its higher yield strength compared and age-hardened. Its chemical 
compositions are listed in the following Table 4. 

The mechanical properties of Inconel 718 are listed in Table 5. The key features 
of Inconel 718 include: 

• High resistant to chloride and sulfide stress corrosion cracking 

• High strength in the aged condition 

• Good corrosion resistance 

Inconel is a difficult-to-machine material due to the high work-hardening rate and 
the presence of hard abrasive particles such as titanium carbide, niobium carbide, 
and Ni 3 AlTi phase. After the first machining pass, work hardening tends to 
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Table 4 Inconel 

718 chemical compositions 


Chemical elements (%) 

Minimum 

Maximum 

Nickel (Ni) 

50.0 

55.0 

Chromium (Cr) 

17.0 

21.0 

Iron (Fe) 

— 

Balance 

Niobium + Tantalum (Nb + Ta) 

[4.75 

5.5 

Molybdenum (Mo) 

2.8 

3.3 

Aluminum (Al) 

0.65 

1.15 

Cobalt (Co) 

— 

1.0 

Copper (Cu) 

0.2 

0.8 

Manganese (Mn) 

— 

0.35 

Titanium (Ti) 

— 

0.3 

Silicon (Si) 

— 

0.3 


Table 5 Inconel 

718 mechanical properties 


Mechanical properties 

Values 

Density 

8.19 g/cm 3 

Melting point 

1, 260-1, 336~°C 

Hardness 

30-40 HRC 

Ultimate tensile strength 

150 ksi 

0.2 % Yield strength 

120-140 ksi 

Elongation 

20 % 

Thermal conductivity 

11.4 W/m • K 

Elasticity modulus 

29 x 10 6 psi 


plastically deform either the workpiece or the tool on subsequent passes. For this 
reason, age-hardened Inconel 7 1 8 are machined using an aggressive but slow cutting 
with a hard tool, minimizing the number of cutting passes required. It is usual to use 
low cutting speed to reduce tool wear rate. If the feed rate is too slow, the tool is 
continuously cutting through the work-hardened layer generated by the previous 
cutting passes. If the feeding is too fast, the cutting forces acting on the cutting tool 
could be too large causing catastrophic failure. Inconel 718 also retains its strength at 
elevated temperatures which results in high cutting forces even at a high cutting 
speed due to high temperatures generated. The usual turning speeds for Inconel 
718 is 20-35 m/min, and the feed rate is chosen in the range of 0. 18-0.25 mm using 
uncoated carbides K10/K20. Alternatives to carbides for turning are non-oxide 
ceramics and poly crystalline cubic boron nitride (PCBN). When using PCBN 
tools, the speed used for cutting of Inconel 718 can be higher, up to 300-600 m/min. 


Polymers 

A polymer is a chemical compound or compound mixture consisting of large 
macromolecules composed of many repeating structural units created through a 
process of polymerization. These molecules are covalently bonded chains of atoms. 
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Table 6 Mechanical 
properties of PMMA 


Mechanical properties 

Values 

Density 

1.18 g/cm 3 

Melting point 

160 °C 

Glass transition temperature ( T g ) 

105 °C 

Ultimate tensile strength 

30 MPa 

Young’s modulus 

2.2 GPa 

Elongation 

1.4 % 

Thermal conductivity 

0.18 W/m-K 


Unless they are cross-linked, the macromolecules interact with another by weak 
secondary bonds and entanglement. The mechanical properties and thermal behav- 
ior of polymers are influenced by several factors, including the composition and 
structure of the macromolecular chains and their molecular weight. Compared with 
metals, polymers have much lower strengths and modules, low density, poor 
thermal and electric conductivities, good corrosion resistance, and mould ability, 
but they can be deformed to a great extent before failure happened. 

Another common name of many synthetic polymers is plastic, suitable for 
shaping and molding. Industrial polymers are broadly grouped into three catego- 
ries: thermoplastics, thermosets, and elastomers. Thermoplastics are the polymers 
that do not undergo chemical change in their composition when heated, where the 
cycle of heating to soft and cooling to firm can be repeated. Thermosets can be 
softened and formed to the shape once when heated, but solidified permanently 
when cooled down to room temperature. That is, the thermosetting process is 
irreversible. Elastomers are polymers with viscoelasticity, generally having low 
Young’s modulus and high yield strength compared with others. Polymers have 
been widely used in almost everywhere, such as different industries, sports, med- 
icine, consumer service, etc. 


PMMA 

Polymethylmethacrylate (PMMA) is a strong transparent thermoplastic, often used 
as a lightweight or shatter-resistant alternative to glass and sometimes called as 

Q 

acrylic glass. Its density is around 1.18 g/ cm , which is less than half of glass. It also 
has good impact strength, higher than glass. PMMA of 3 mm thickness can transmit 
up to 92 % of visible light and give about 4 % reflection from each of its surfaces on 
the refractive index of 1.4914 at 587.6 nm. Its molecular formula is (C 5 0 2 H 8 )„. The 
mechanical properties of PMMA are listed in Table 6. PMMA is a versatile material 
and has been used in a wide range of fields and applications. PMMA is an economic 
alternative to polycarbonate (PC) when extreme strength is not necessary. 

PMMA has the highest surface hardness among all common thermoplastics. The 
scratch resistance can compare to the metal aluminum and brass. Overall, PMMA 
has a fairly good machinability, which can be machined using standard cutting 
tools. Both cutting speed and feed rate have to be moderately high. At moderate 
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Table 7 Mechanical 
properties of PC 


Mechanical properties 

Values 

Density 

1.20-1.22 g/cm 3 

Melting point 

155 °C 

Glass transition temperature (T g ) 

147 °C 

Ultimate tensile strength 

55-75 MPa 

Young’s modulus 

2. 0-2.4 GPa 

Elongation 

80-150 % 

Thermal conductivity 

0.19-0.22 W/m • K 


high speed, heating is minimal and so there is no softening of the workpiece. Too 
high feed rate causes work material chipping and cracking. Too high cutting speed 
makes work material soften and poor surface finish generated. Uncoated carbide 
and high-speed steel tools with a good cooling are recommended for cutting of 
PMMA as less heat generated using the sharp edges, i.e., small cutting edge radius. 
Large burrs are formed when increasing depth of cut. PMMA is less ductile, and 
hence the burr generation is less formed as compared with PC. Up-cut mode 
generated less burr formation for most cutting conditions as compared to down- 
cut mode. Surface finish could be improved when small depth of cut and low feed 
rate was implemented. As there is a material spring back after cutting load released, 
geometrical control may be a potential issue for very precise machined PMMA 
parts (Ampara et al. 2012). 


PC 

Polycarbonates (PC), known by the trademarked names Lexan, Makrolon, 
Makroclear, and others, are a particular group of thermoplastics. They are easily 
worked, molded, and thermoformed. PC is a transparent material with a high impact 
resistance, high stiffness, good dimension stability, and good electrical properties. 
Because of these properties, polycarbonates find many applications. The mechan- 
ical properties of PC are listed in Table 7. Its repeating chemical structure unit is 
showed as below: 



Like PMMA, PC has a fairly good machinability, which can be machined by 
standard cutting tools. Both cutting speed and feed rate cannot be too high. Because 
workpiece chipping and cracking occur when cutting with very high feed rate; and 
work material is softened to produce poor surface finish when cutting with very 
high speed. To minimize the heat generation during the cutting, it is recommended 
to use a good cooling and sharp cutter like high-speed steel and uncoated carbide 
tools. PC is more ductile as compared to PMMA, and hence more burrs were 
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Fig. 8 Schematic diagram of ductile to brittle transition for cutting of brittle materials 


formed along the edge of machined feature of PC compared to PMMA particularly 
at large depth of cut. Relative large burrs are formed when increasing depth of cut 
even at the same feed rate. While at low feed rate and large depth of cut, rubbing 
and compression of material take place instead of cutting, and this generates more 
burr. Additionally, cutting modes play a role on the amount of burr formation. 
Up-cut mode generated less burr formation for most cutting conditions as compared 
to down-cut mode (Ampara et al. 2012). 


Brittle Materials 

Brittle material is a group of materials of which there is little or no plastic 
deformation when subjected to stress before failure. Therefore, a non-ductile 
material is considered to be a brittle material. Usually the tensile strength of 
brittle materials is only a fraction of their compressive strength. Brittle materials 
include most ceramics and glasses and even some polymers. Brittle materials 
have been widely used in the different areas of the industries due to their unique 
physical, chemical, optical, and mechanical properties. Unfortunately, brittle 
materials are very difficult to be machined and often fractured during machining 
process. 

All brittle materials will undergo a transition from brittle machining regime to 
ductile machining regime if the machining in feed rate is made small enough. 
Figure 8 is a schematic diagram of ductile to brittle transition model for cutting of 
brittle materials, which shows a projection of the cutting tool, perpendicular to the 
cutting direction where / is the feed rate, y c is the average surface damage depth, 
and Z e is the distance from the tool center to the fracture-pit transition on the uncut 
shoulder. When the depth of cut used for cutting of brittle materials is less than the 
critical chip thickness or critical undeformed chip thickness, ductile regime 
machining behavior will be achieved. Otherwise, brittle regime machining will be 
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obtained when the depth of cut is larger than the critical chip thickness or critical 
undeformed chip thickness (Liu et al. 2007a). 

According to the Griffith fracture propagation criterion, a critical depth-of- 
indentation of brittle materials can be predicted using the following expression 
(Ewalds and Wanhill 1986): 



where d c is critical depth for indentation or critical undeformed chip thickness for 
ductile regime machining, E is elastic modulus, H is material hardness, K c is 
fracture toughness, and u is brittle-to-ductile transition factor of the individual 
material, depending upon the material properties. Equation 13 indicates that in 
cutting of brittle materials, there is a critical value of undeformed chip thickness or 
depth of cut, below which the chip formation will be in ductile mode. Otherwise, 
the chip formation is in brittle mode. The critical value can be predicted as it is a 
function of work material hardness, elastic modulus and fracture toughness, which 
would vary with the elevated temperature in the cutting region. 


Glasses 

Glass is an inorganic material supercooled from the molten state to a rigid condition 
without crystallizing (or amorphous). It is typically very brittle and very weak in 
tension but quite strong in compression, having excellent properties such as opti- 
cally transparent, good corrosion resistance, temperature stability, nonporous, var- 
ious reflective indices homogeneity, high hardness, and durability. It has been 
widely used in optomechatronic systems, laboratory application, wafer, semicon- 
ductor components, optical lenses, windows of buildings, artwork, etc. (Liu 
et al. 2005). The most familiar type of glass is soda lime glass composed of about 
75 % silica (Si0 2 ), sodium oxide (Na 2 0), and lime calcium oxide (CaO). 

As glass is hard and brittle, it is very difficult to be cut and often be fractured 
during machining. Commonly used machining methods for glass are abrasive 
processes with either fixed abrasive like grinding and loose abrasive like abrasive 
jet machining and ultrasonic machining, where super hard particles like diamond, 
CBN, alumina and silicon carbide are used as the abrasive grit. In nature, they are 
brittle machining mode to remove the material by fracturing and crack propagation, 
such that the machined surface is not as good as for industrial application. Subse- 
quently, polishing method is introduced to remove those damaged layer containing 
fine cracks to improve the workpiece final surface finish. Again, the abrasive type 
used for polishing can be diamond and CBN grit. However, the material removal 
rate for those machining methods is very small and particularly polishing is 
extremely time-consuming. 

Ductile mode machining of brittle materials has attracted a lot of interests from 
both engineers and academics, because a crack-free surface can be obtained and 
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subsequent polishing is no longer necessary. As early in 1975, improvement in 
precision diamond grinding mechanism allowed the first reproducible evidence of 
grinding ductility in brittle glass workpiece (Huerta and Malkin 1976). 
Ultraprecision cutting has carried out to investigate the cutting performance of 
ZKN7 glass and soda lime glass. Ductile mode cutting of ZKN7 glass has been 
achieved with single-crystal diamond tool when depth of cut is less than 600 nm 
(Fang and Chen 2000). Ductile mode cutting of sod lime glass has been achieved 
with polycrystalline diamond tool when depth of cut is less than 1 pm (Liu 
et al. 2005). However, this small critical undeformed chip thickness is a bottleneck 
for the industrial application of ductile regime machining. Ultrasonic vibration- 
assisted ultraprecision machining method was invented in 1992 to improve glass 
material ductile cutting performance by applying an ultrasonic vibration on a 
single-crystal diamond tool along the cutting velocity direction (Moriwaki 
et al. 1992). In both the conventional cutting and ultrasonic vibration-assisted 
cutting, the machined grooves formed from ductile mode to brittle mode along 
with the depth of cut increasing and exceeding a critical value. Ultrasonic vibration 
could improve the ductile cutting performance of glass up to 1 .4 pm critical depth 
of cut, compared with the critical depth of cut 200 nm obtained in the conventional 
cutting. Studies on ductile mode cutting explore innovated ways continuously to cut 
glasses using a convention CNC machine tool with TiAlN-coated carbide end mill 
(Arehn et al. 2007). 


Silicon 


Silicon is a tetravalent metalloid element with chemical symbol Si and atomic 

o 

number 14, having a low density of 2.329 g/cnr and a high melting point of 
1,414 °C (see Table 8). Single-crystal silicon or monocry stalline silicon is 
the base and most important technological material of the electronic industry. 
Silicon reigns supreme as a semiconductor material mainly because of its ease of 
manufacturing. Single-crystal silicon is not only a dominant substrate material for 
the fabrication of micro electro and micro mechanical components but also an 
important infrared optical material. 

Silicon wafers are fabricated from the single-crystal silicon ingot through slic- 
ing, edge grinding, finishing, lapping, and polishing. After integrity circuit 
(IC) fabrication, back thinning and dicing of wafers into devices are the subsequent 
fabrication processes. Actually, ah those wafer fabrication processes are basically 
grinding or abrasive processes. The main problems of these grinding processes are 
the subsurface damage and cracks. The average thickness of the wafer subsurface 
damage layer is more than 6 pm even after fine grinding (Liu et al. 2009). It is 
obviously too costly and time-consuming to remove the subsurface damaged layer 
by the subsequent heave chemical-mechanical polishing (CMP) process. 

Tremendous affords have been made on ductile regime grinding and ductile 
mode cutting of single-crystal silicon wafers, targeting to achieve less subsurface 
damage so as to speed up the whole silicon wafer fabrication processes. Nanometric 


930 


H. Zheng and K. Liu 


Table 8 Main mechanical 
properties of single-crystal 
silicon 


Mechanical properties 

Values 

Density 

2.329 g/cm 3 

Melting point 

1,414 °C 

Ultimate tensile strength 

350 MPa 

Young’s modulus 

150 GPa 

Thermal conductivity 

148 W/m • K 


cutting has been carried out to investigate the ductile cutting performance of 
single-crystal silicon wafers using an ultraprecision lathe with single-crystal 
diamond tools of different cutting edge radii. There is an upper bound of the 
tool cutting edge radius for ductile mode chip formation in cutting of silicon. 
Cutting with tool edge radius larger than the upper bound would not be in a ductile 
mode (Are fin et al. 2007). There is a critical undeformed chip thickness for a 
given tool with the tool cutting edge smaller than the upper bound, beyond 
which the chip formation changes from ductile mode to brittle mode. The 
ductile-brittle transition of chip formation varies with the tool cutting edge radius. 
The larger the cutting edge radius, the larger the critical undeformed chip thick- 
ness for the ductile-brittle transition in the chip formation (Liu et al. 2007b). 
Ductile cutting of silicon can be achieved when the cutting edge radius does not 
exceed the upper bound value and the undeformed chip thickness is less than the 
critical value. Cutting mode is mainly determined by tool edge radius and 
undeformed chip thickness used, which has a significant effect on the R a values 
of surface roughness for the machined workpiece. The machined silicon 
wafer surfaces achieved by ductile cutting were free of micro cracks and fracture 

(Liu et al. 2009). 


Ceramics 

Ceramics represent a broad range of inorganic and nonmetallic materials, typically 
oxides, borides, carbides, and nitrides; have ionic and covalent bond character and 
extremely high melting point, >2,000 °C, that are commonly electrical and thermal 
insulators; and are composed of more than one element. Ceramics are generally 
brittle, porous forms and weak in tension, but in particular, they can be very hard 
and wear resistance once fully dense forms made, being excellent cutting and 
grinding materials for metals. Fully dense ceramics can be very efficient thermal 
conductors, but porous ceramics can be very efficient heat insulators. Thus, 
ceramics are a class of important materials useful in both traditional and advanced 
applications. Commercial and engineering ceramics include alumina (A1 2 0 3 ), sil- 
ica, silicates, silicon carbide (SiC), tungsten carbide (WC), silicon nitride (Si 3 N 4 ), 
sialons, zirconia (Zr0 2 ), etc. Many ceramics are chemically very inert, strong, and 
light, making very suitable for biomedical implants. Advanced ceramics are also 
used in the medicine, electrical, and electronics industries. They are generally very 
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Table 9 Main mechanical 
properties of tungsten 
carbide 


Mechanical properties 

Values 

Density 

15.63 g/cm 3 

Melting point 

2,870 °C 

Vickers hardness 

13.7-14.7 GPa 

Transverse rupture strength 

2,150 MPa 

Young’s modulus 

530 GPa 

Specific heat capacity 

39.8 J/mol • k 

Poisson’s ratio 

0.24 

Thermal conductivity 

110 W/m • K 


strong and brittle, such that they are very difficult to be cut or machined. Grinding, 
polishing, ultrasonic machining, laser machining, and water jet machining are the 
commonly used machining methods for ceramics applications. 

Tungsten carbide (WC) is an inorganic chemical compound, often simply called 
carbide and also called cemented carbide, hard metal, or tungsten-carbide cobalt, 
containing equal parts of tungsten and carbon atoms. It is a metal matrix composite 
where tungsten carbide particles are the aggregate and metallic cobalt serves as the 
matrix. Table 9 lists its main mechanical properties. Tungsten carbide has a 
prosperous future for the sake of its excellent properties such as high hardness, 
high wear resistance, and good toughness. It has been widely used in the industry 
for cutting tools, moulds and dies, and other devices. Manufacturers use tungsten 
carbide as the main material in some high-speed drill bits, as it can resist high 
temperatures and is extremely hard. 

When grooving of tungsten carbide using a CNC lathe with CBN tools, there is a 
transition from ductile mode cutting to brittle mode cutting as the depth of cut 
continuously increased. Here, the critical depth of cut for grooving of tungsten 
carbide is 4.8 pm. When the depth of cut was smaller than this critical value, the 
chip formation was in ductile mode. Otherwise, when the depth of cut increased in 
exceeding the critical value, the mode of chip formation changed from ductile to 
brittle (Liu and Li 2001). The tool wear occurs mainly at the tool flank in cutting of 
tungsten carbide, which is dominated by diffusion wear, adhesion wear, and 
abrasion wear. The higher the cutting speed, the larger the tool wear VB 


max 


and 


the shorter the tool life. The cutting forces increase gradually with the increase of 
the cutting distance and the tool flank wear. However, the machined workpiece 
surface roughness shows no significant change with the progress of the tool flank 
wear (Liu et al. 2003). But the material removal rate in ductile mode cutting of 
tungsten carbide using conventional machining methods remains very small and not 
efficient. Ultrasonic vibration-assisted cutting has been used to improve the ductile 
cutting performance of tungsten carbide. In comparison with conventional grooving 
of tungsten carbide, the critical depth of cut for the transition from ductile to brittle 
in ultrasonic vibration assisted grooving is around 16 pm, several times larger than 
that in conventional grooving (Liu et al. 2004). 
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Composites 

Composite materials, often shortened to composites, are engineered or naturally 
occurring materials composed of two or more distinct phases: matrix phase and 
dispersed phase, with significantly different physical or chemical properties from 
those of any constituent materials. The primary phase is called matrix with a 
continuous character, which is usually more ductile and less hard phase. It holds 
the dispersed phase and shares a load with it. The second phase is called dispersed 
phase embedded in the matrix phase in a discontinuous form. Dispersed phase is 
usually stronger than the matrix phase and thus sometimes called it reinforcing 
phase. 

In general, according to dispersion phase, composites can be segmented into 
three types: fiber reinforced, particle reinforced, and dispersion strengthened. 
According to the matrix phase, composites can be classified into three groups: 
metal matrix composite (MMC), ceramic matrix composite (CMC), and polymer 
matrix composite (PMC). Commonly, composites have superior high strength-to- 
weight ratio and resistance to corrosion and fatigue. The machinability of compos- 
ites is very much diverse based upon the matrix materials and reinforcement 
materials. 


Metal Matrix Composites 


Metal matrix composites (MMCs) are composed of a metallic matrix and a dis- 
persed ceramic or metallic phase, which may be classified by the morphology of 
matrix reinforcement as dispersion strengthened, particle reinforced or fibre 
reinforced. Typically, the metal matrix materials of MMCs are aluminum, magne- 
sium, iron, cobalt, copper, and titanium, while the reinforcement materials are 
oxides, carbides, and graphites. The most widely used MMCs are based on alumi- 
num as a matrix, reinforced with particulate silicon carbide (SiC), which is rela- 
tively cheap and has a low density. The stiffness of aluminum may thus be 
increased by more than 50 % in particulate composites. Their high strength and 
stiffness make them being particular attractive materials for aerospace structures. 

MMCs can be machined using conventional techniques such as turning, drilling, 
milling, and grinding, and nonconventional machining methods including electro- 
discharge machining (EDM), electro-chemical machining (ECM), laser machining, 
and abrasive jet machining. But most commonly used tools are diamond-based 
cutting tools such as poly crystalline diamond tooling (PCD), and CVD diamond- 
coated tools due to ceramic particles caused abrasion wear and adhesion wear, 
although cubic boron nitride (CBN), alumina, silicon carbide, and tungsten carbide 
tools are also used. Built-up edge occurs when cutting of MMCs especially at low 
cutting speed. The tool life decreases while the surface finish slightly improved with 
an increase in cutting speed, since the cutting temperature increases with the cutting 
speed. The excessive tool wear and damage associated with machining of long-fiber- 
reinforced MMCs result in the process being uneconomical (Davim 2012). 
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In general, aluminum MMCs reinforced with alumina are more difficult to machine 
than SiC -reinforced aluminum MMCs. 


Ceramic Matrix Composites 

Ceramic matrix composites (CMCs) are composed of a ceramic matrix and embed- 
ded fibers of other ceramic material (dispersed phase) to overcome the major 
disadvantages of conventional technical ceramics, namely, brittle failure and low 
fracture toughness, and limited thermal shock resistance. CMCs have attractive 
high temperature properties, but many are brittle. Therefore, their applications are 
mainly in fields requiring reliability at high temperatures and resistance to corrosion 
and wear, such as cutting tools, wear-resistant parts, aerospace engine parts, and 
military parts, as well as energy -related components. They are generally very 
difficult to be cut or machined due to high hardness at elevated temperatures. 
Grinding, polishing, electro-discharge machining (EDM), ultrasonic machining, 
laser machining, and water jet machining are the commonly used machining 
methods for CMCs applications. 

The inherent challenge in machining of CMCs is the excessive tool wear and 
subsequent subsurface damage. Increasing the particle volume fraction results in 
fast tool wear and subsequently affected the surface finish of the machined work- 
piece. A higher tool wear is a result of the increased hard ceramic particle volume 
fraction in CMCs. PCD tools are the most preferred, while carbide tools are 
preferred over ceramic tools. Cutting speed, feed rate and depth of cut have a 
similar effect on tool life and surface finish in machining of CMCs compared to 
those in machining of metals although some differences are noticeable due to the 
ceramic particles. In case of using carbide tools, a low cutting speed and a high feed 
rate are utilized to maximize the tool life (Dandekar and Shin 2012). 


Polymer Matrix Composite 

Polymer matrix composites (PMCs) are composed of a matrix from thermoset or 
thermoplastic and embedded glass, carbon, steel, or Kevlar fibers (dispersed phase), 
characterized by the morphology of the reinforcement: filler strengthened or fiber 
reinforced. They comprise a thermoset matrix, which is reinforced with fibers or 
whiskers of a material of very high tensile strength. PMCs reinforced by long fibers 
are an important class of materials in advanced structural applications due to their 
lightweight, high modulus and high specific strength. The use of PMCs is very 
diverse and far reaching almost all industry sectors with an application ranging 
from commercial aircraft to sporting goods. 

However, because of their anisotropic and heterogeneous nature, PMCs are 
difficult to machine. They contain strong and very abrasive fibers held together 
by a relatively weak and brittle matrix, being more prone to damage during 
machining than conventional metals (Campbell 2006). Carbides, coated carbides, 
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PCD, and diamond abrasive tools are commonly used for machining of PMCs. The 
actual depth of cut are very different from nominal depth of cut in cutting of PMCs, 
because a part of PMCs is pushed away from the cutting tool during machining and 
sprang back partially after the cutting load released. Machined composite surfaces 
often contain mechanical damages such as delamination, fiber pullout, fiber 
fuzzing, matrix de-cohesion, matrix chipping, cracking, and fiber dislodgements, 
as well as thermal damage and chemical damage. Debonding and fiber breakage can 
also take place easily. Due to the much lower thermal conductivity of the fibers 
(especially glass and aramid), heat buildup can occur rapidly and degrade the 
matrix, resulting in matrix cracking and even delamination. That is why it is 
important to minimize forces and heat generation during machining of PMCs. 
Generally high cutting speeds, low feed rates, and small depths of cuts are used 
to minimize damage (Davim 2009). Other machining methods such as laser 
machining and abrasive jet machining are also commonly used for PMC 
applications. 


Machining Economics 

Production costs and production rates create a crucial interest to the all manufactur- 
ing engineers. Metal cutting economics is to a great extent about making the best 
use of production resources (Boothroyd and Knight 1989). The variables affecting 
the economics of a machining operation are numerous which include the tool 
material, raw material, over head investment, machine tool capacity, operating 
skill, and the cutting conditions. 


Basic Terms 

In the following discussion, two terms related to machining economics are defined 
as: 

1. Production cost: total average cost to perform the machining operation on a 
component using one machine 

2. Production time: average time taken to produce one component 

Manufacturing is a complex system and usually no single machining operation 
can complete a component fabrication. A high efficiency or production rate would 
probably mean low production cost. But these two facts must be taken into account 
separately, because the machining conditions providing maximum production rate 
may be the conditions giving minimum production cost. In practice, the optimum 
machining conditions can rarely be achieved due to some constraints: 

• Tool and work material variability 

• Range of speeds, power, feed, and stability of machines 
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• Inadequate machine tools 

• Finish, surface integrity, and dimensional accuracy required 

• Insufficient cost data 

The problem is to adjust the machine consistent with constraints so that produc- 
tion can be carried out as close to optimum machining conditions as possible. And 
also optimum conditions for one machining operation may not be the optimum 
conditions for a complete manufacturing system. 


Machining Performance Criteria 

In general, a component manufacturing will involve several machining operations 
using a variety of machine tools, and most machine tools are used for more than one 
type of component. Thus, the actual economic optimum for any machining opera- 
tion must consider other processes which will be performed on that component. The 
manufacturing company is more interested in the overall profit rather than the 
individual machining operations and single component in a given period. 

The economic criteria normally considered for practical machining operations to 
choose cutting speeds, feeds, tool geometry, tool and work materials, cutting fluids, 
and machine tools are: 


• Minimum cost: increasing the batch size always reduces the cost of production 
and the economic batch size for the appreciate machine is depended on both the 
cutting conditions and work-tool machinability. 

• Maximum production: increasing the batch size increases the maximum 
production rate. 

• Maximum profit rate (maximum efficiency): the optimum conditions for maxi- 
mum profits lies close to the minimum-cost conditions. 

An estimation of cost factors is important as it would affect the optimum 
machining conditions selected. The calculating methods for those costs could 
vary from different manufacturing systems to different components. Basic elements 
of cost estimation applicable to most cases can be expressed as 



C m + C t + C s + C 


n 



where C pr is the average cost per piece, C m is the machining cost, C t is tooling cost, 
C s is tool-changing cost, and C n is nonproductive or idling cost. 

Figure 9 shows the qualitative plots on the effect of cutting speeds on 
manufacturing costs. Details of the manufacturing costs are: 


1 . Machining cost is direct labor and machine cost with overhead for machining of 
one component, including operator wage rate and benefits, depreciation rate of 
machine tool, machine overhead like machine servicing and maintenance cost, 
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Machining cost 





Fig. 9 Effect of cutting speed on manufacturing costs 

power consumption, etc. The machining time is determined by the types of 
machining operations and cutting conditions used. As shown in Fig. 9a, increas- 
ing cutting speed will decrease the machining cost as the machining time is less. 

2. Tooling cost is the cost of cutting tools and regrinding of regrindable tools, etc., 
for machining of one component. As shown in Fig. 9b, increasing cutting speed 
will also increase the tooling cost as the tool wear is fast and more cutting tools 
are needed to machine the component. 

3. Tool changing cost is the cost incurred for removing the worn tool from the 
machine tool, placing a new tool and setting the new tool in the proper position 
for machining of one component. As shown in Fig. 9c, increasing cutting speed 
will also increase the tool changing cost as more cutting tools used need more 
tool changing times. 

4. Nonproductive or idling cost is the cost not directly associated with manufactur- 
ing operations, including the cost of loading and unloading component, the 
idling time cost, and other non-cutting time costs not included in the total cost 
per component. As shown in Fig. 9d, increasing cutting speed will not change the 
nonproductive or idling cost as this cost incurred for running the company is not 
associated with the manufacturing operations, regardless of job type, batch, and 
volume. Figure 10 shows the qualitative plots on the effect of cutting speed on 
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Fig. 10 Effect of cutting 
speed on total 
manufacturing cost 



total manufacturing cost. Note that there is an optimum machining speed for 
manufacturing cost. The cutting speed close to the optimum condition is known 
as the high-efficiency machining range. 


Other factors may limit the selection of optimum machining conditions includ- 
ing maximum machining power and maximum machining force as 


Cutting_Power = cvf a a 'P 


( 15 ) 



F = kf a a p (1 6) 

where c, a, and /? are constants, /is feed rate, v is cutting speed, and a is depth of 
cut. All those cutting speed, feed rate, and depth of cut selection must not exceed 
the available cutting power and machining force. 


Summary 

In this chapter, machinability has been introduced to indicate one engineering 
material how easy or difficult it is by machining to achieve an acceptable surface 
finish. Machinability could be considered as a material property, but not a unique 
property. When assessing a material machinability, all other aspects of the machin- 
ing system must be considered concurrently. An understanding of the interactions 
between tool and work materials at the tool-work interface would benefit to 
machining behavior and machinability. Tool material and cutting speed perhaps 
are the two most important parameters for engineering material machinability 
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assessments. Engineering materials could be developed with improved machinabil- 
ity or more uniform machinability through micro structure modification and chem- 
ical components adjustment, making it easy to machine and achieving low tool 
wear rate, improved chip formation, and reduced cutting forces. Advance devel- 
oped tool materials with high thermal hardness and wear resistance would improve 
the material machinability. 
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Abstract 

In the modem manufacturing industry, monitoring the machining process and 
tool condition is becoming increasingly important in order to achieve better 
product quality, higher productivity, higher process automation, and lower 
human labor costs. This chapter introduces the fundamental technologies and 
state-of-the-art development for machining process monitoring. After a brief 
introduction of the background, in section “Measurands and Sensors,” the 
commonly used measurands for machining process monitoring are presented, 
including motor power and current, force, torque, acoustic emission, vibration, 
image, temperature, displacement, strain, etc. The corresponding sensors for 
these measurands, and the requirement for signal conditioning, are also 
discussed. Signal conditioning includes amplification, filtering, converting, 
range matching, isolation, and other processes to make sensor output suitable 
for data acquisition and signal processing. Knowledge of data acquisition, which 
is the process of sampling sensor signals and converts the resulting samples into 
digital numeric values that can be manipulated by a computer, is provided in 
section “Data Acquisition.” Some key concepts such as analog-to-digital con- 
version, quantization, sampling rate, Nyquist sampling theorem, and aliasing are 
explained. Section “Signal Processing” introduces the essential signal process 
techniques, including the time domain analysis, frequency domain analysis, 
time-frequency domain analysis, and artificial intelligence approaches such as 
artificial neural networks, fuzzy logic, etc. Detailed machining process monitor- 
ing strategies and approaches, together with some examples and case studies, are 
provided in section “Monitoring Strategies and Approaches,” which covers the 
topics of tool wear estimation, tool breakage detection, chatter detection, surface 
integrity, and chip monitoring. 


Introduction 

The manufacturing industries are facing high competition nowadays. Manufactur- 
ing enterprises have to cope with the challenging demands of increased product 
quality, greater product variability, shorter product life cycles, reduced cost, and 
higher process automation. More sophisticated manufacturing systems together 
with higher level of quality requirements of manufactured components should be 
achieved. 

Machining is in fact the most important of the manufacturing processes in terms 
of annual dollars spent. The term of machining describes a variety of material 
removal processes, such as drilling, turning, milling, etc., in which a cutting tool 
removes unwanted material from a workpiece in the form of chips to produce the 
desired shape. Machining problems, such as cutter breakage, excessive wear, and 
chatter, impede production consistency and quality. It is important to understand 
the modes of possible failure so that their early symptoms may be detected and 
analyzed, warnings sounded, and possible corrections be made as soon as possible. 
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Monitoring the machining process can therefore serve to detect these problems for 
cutting and machine tool, check and safeguard machining process stability, avoid 
machine tool damage, and maintain machining tolerance on the workpiece to 
acceptable limits by providing a compensatory mechanism for tool wear offsets. 
In a condition monitoring system, sensors are used to measure the process phe- 
nomena such as force, vibration, temperature, vision, etc., and signal processing 
techniques are used to determine the state of the processes, so as to enable process 
optimization and control. 

In machining operations, static or dynamic errors may exist. Most of the static 
errors, such as machine errors, tooling inaccuracies, and part tolerances, can be 
overcome by datum detection and machine calibration. However, dynamic errors 
that affect the product quality severely cannot be rectified through calibration or 
computer numerical control (CNC) technology. For example, tool- workpiece vibra- 
tion, especially chattering, is a common detrimental phenomenon and a major 
obstruction in achieving higher productivity and better accuracy. Tool wear is 
also a dynamic factor that affects the final finish. It requires online compensation 
for dynamic errors to achieve desired results, such as mechatronic solutions that are 
typified by synergistic integration of customized mechanics, sensors, modeling and 
control, and product and process information. Then machining process monitoring 
system with sensors and relevant signal process strategies will be a key component 
of the solution. 

There are many challenges in building an effective tool condition monitoring 
system. The complex interactions between machines, tools, workpieces, fluids, 
measurement systems, materials, humans, and the environment in cutting opera- 
tions require that sensors be employed to insure efficient production and protect 
workers and the environment. This problem is even worse in new and enhanced 
production processes and tooling. In addition, the function of the system may differ, 
depending on whether the monitoring system is built into machine tool as a feature 
or requested by users as the addition of a system. These sensing systems are often 
built for protection of the machine tool or spindle from unexpected events or failure 
(Byme et al. 1995; Liang et al. 2004). 

In the machining process, there are several process variables, such as cutting 
forces, vibrations, acoustic emission, noise, temperature, surface finish, etc., that 
are influenced by the cutting tool states and the material removal process condi- 
tions. The variables that are prospectively effective for machining process moni- 
toring can be measured by the application of appropriate sensors, such as 
dynamometer, accelerometer, current sensor, etc. Signals measured by sensors are 
always analog, which should be conditioned and converted to digital signal for 
future processing. Functional signal features correlated with tool state and/or 
process conditions are then generated using a number of methods. These features, 
sometimes coupled with experimental results, are finally fed to and evaluated by 
cognitive decision-making support systems for the final diagnosis. This can be 
communicated to the human operator or fed to the machine tool numerical controller in 
order to suggest or execute appropriate adaptive/corrective actions (Teti et al. 2010). 
The framework of a machining process monitoring system is shown in Fig. 1 . 
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Fig. 1 Intelligent monitoring system for machining processes 

This chapter will introduce the fundamentals of machining process monitoring. 
It will cover different sensors used in the monitoring of machining processes, data 
acquisition and signal preprocessing methods, signal processing techniques used for 
the sensory feature extractions using the relevant sensory information, and com- 
monly used monitoring strategies. 


Measurands and Sensors 

The progress of a pending failure in machining process can be monitored by direct 
and indirect means. Direct monitoring methods are based on dimensional measure- 
ment of the machined component or the progress of the tool wear. For example, it 
may consist of laser, optical, and ultrasonic sensors to provide a direct measurement 
of tool wear development. These methods are still very expensive and difficult to 
apply in the industrial machining process environment. Sometimes the failure mode 
cannot be tracked or is too expensive to monitor effectively by direct means. Then 
indirect monitoring methods will be used. Relatively, indirect measuring systems 
are more economical, which measure suitable auxiliary quantities, such as force, 
vibration, and current, and derive the actual quantity from empirically determined 
correlations. The material removal process in machining with a cutting tool gener- 
ates significant cutting forces between the cutting tool and the workpiece. To 
maintain the machining operation, torques have to be created on the spindle and 
the drive motors. Excessive cutting forces and torques can cause undesired tool, 
workpiece, and machine structural deflections, tool-workpiece vibration, tool fail- 
ure, spindle stall, etc., that affect the machining process and the machined product 
quality. Therefore, the cutting forces, torques, and power are often monitored as 
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Fig. 2 Measurands commonly used for machining process monitoring 


indirect measurements of other process phenomena (Teti et al. 2010; Jemielniak 
1999; Abellan-Nebot and Romero Subiron 2010). 

A sensor is actually a converter that converts energy from one form to another. It 
measures a physical quantity in such a way by converting it into a signal which can 
be read by an observer or by an instrument. The energy forms relevant to machining 
processes include mechanical energy, electrical energy, thermal energy, and radiant 
energy. The input quantities or properties that are to be measured by sensors are 
called measurands. The common measurands for machining process monitoring 
include force, torque, power, vibration, noise, acoustic emission, temperature, 
displacement, image, etc. 

Figure 2 shows the commonly used measurands in machining monitoring sys- 
tems. Basically, four types of sensors are most widely used to monitor machining 
process: force/torque sensors, accelerometers, AE sensors, and motor power and 
current sensors. This section introduces the fundamentals and applications of some 
commonly used measurands and sensors. 


Motor Power and Current 

The measurement of the spindle drive power can be used to detect tool wear, collision, 
or tool breakage, which is regarded as technically simple. From the measurements of 
current, line voltage, and phase shift, motor power can easily be calculated. It is even 
possible to gain information about the actual power demand of the drives from the 
machine tool control without additional sensors. A power sensor EPT 20 from 
Prometec Company is shown in Fig. 3. It uses the three voltages/currents from the 
three power lines to calculate the effective power of the drive. EPT 20 measures the 
three voltages on the phases and utilizes two hall sensors to measure the current in two 
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Fig. 3 A power sensor EPT 
20 (Courtesy of Prometec) 



Fig. 4 A power sensor 
PS100DGM (Courtesy of 
Montronix) 



phases and calculates the current of the third phase from the two measured currents. 
The measurement apparatuses of motor-related parameters to detect malfunctions in 
the cutting process do not disturb the process itself and are easy to retrofit. 

Since the power required for cutting is only a portion of the total power 
consumption, the sensitivity of power/current monitoring is thus limited. Further- 
more, the slow response speed is a severe limitation in many machining applica- 
tions. In small diameter machining, the amount of spindle power consumed for 
material removal may be very small. This can cause great errors in force or torque 
measurement, although additional signal amplifier has been used recently to solve 
the problem. As shown in Fig. 4, the PS100DGM from Montronix is a power sensor 
with onboard microprocessor controlled gain, offset, and nonlinear filtering. These 
features allow the sensor to work well in applications using both large and small 
diameter tools. 


Force 

Cutting force is considered to be the variable that best describes the cutting process, 
which is due to the high sensitivity and rapid response of force signals to changes in 
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Fig. 5 A piezoelectric 
dynamometer (Courtesy of 
Kistler) 



cutting states. The information derived from the force signal can be used to evaluate 
tool condition (tool wear, tool breakage/chipping), cutting condition (chatter vibra- 
tion), and the quality and geometric profile of the cutting surface and to validate 
analytical process models and predict/detect tool failure. So monitoring of cutting 
forces is highly desirable. 

Strain gauges, piezoelectric quartz force transducers, and dynamometers are 
available for force measurement. The principle of force sensors is generally that 
sensing elements convert the applied force into deformation of an elastic element. 
The two main sensor types are piezoelectric-based and strain-based sensors. Pie- 
zoelectric sensors use the piezoelectric effect, in which the material generates a 
corresponding electric charge proportional to the force applied on it. There are two 
types of piezoelectric materials that are commonly used: natural quartz and 
man-made polycrystalline ceramics. Each material offers certain benefits, and 
material choice depends on the particular performance features desired of the 
dynamometer. A widely used piezoelectric dynamometer is shown in Fig. 5. 

Force transducers based on strain gauges have a so-called spring element or 
loaded member, which can convert the forces to be measured into strain as 
reproducibly and linearly as possible. Since piezoelectric sensors are based on the 
piezoelectric effect, it is difficult to measure static forces over a long period without 
drift. Strain gauge force transducers can offer reasonably high-frequency response 
and long-term stability; however, the natural frequency is relatively low, and the 
displacement can be considerable. A new development is a combined tool dyna- 
mometer, using the best features of strain gauge and piezoelectric sensor types: 
strain sensing for static forces and a piezoelectric thin film accelerometer for 
dynamic forces (Kim and Kim 1997). 

In addition, there are other force sensor types recently developed, such as 
microforce sensors using silicon microfabrication technology, sensors using 
surface acoustic wave (SAW), or ferromagnetic materials to sense the force 
(Teti et al. 2010). Although initial results show less accuracy in comparison to 
piezoelectric-based force measurement systems, the techniques offer advantages as 
no direct contact is required to the structure surface, making it ideal for force 
measurements during rotating machining. 
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Fig. 6 Different torque sensors: (a) piezoelectric torque sensor Type 9275 (Courtesy of Kistler); 
(b) strain gauge sensor Type 4504B (Courtesy of Kistler); (c) AccuTorque torque sensor (Courtesy 
of Montronix); and (d) SAW sensor (Courtesy of Schott) 


Torque 

A torque sensor is a device for measuring and recording the torque on a rotating 
system. The working principles of torque sensors are similar to force sensors, where 
sensing elements convert the applied force into deformation of an elastic or 
magnetic element. Torque sensors can achieve much higher accuracy in tool and 
process monitoring compared with measuring the power consumed by the spindle 
motor. It is especially useful for tapping and multi- spindle applications where 
power monitoring is often not sufficient, as only a small percentage of the available 
motor load is used in each cut. The sensor can monitor tool wear, tool breakage, did- 
not-cut condition, thread depth, oversized or undersized predrilled holes, and 
damaged or missing threads on taps. 

Figure 6 shows some torque sensors from different manufacturers. In Fig. 6a is 
the most commonly used piezoelectric torque sensor Type 9275 from Kistler, while 
in Fig. 6b is a strain gauge sensor Type 4504B also from Kistler. Figure 6c shows a 
recently developed torque sensor using magnetic properties, the AccuTorque from 
Montronix. It converts mechanical shaft torque into a linearly proportional 
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Fig. 7 Piezoelectric 
accelerometers (Courtesy of 
Briiel & Kjaer) 




magnetic field and transforms this field into a linearly proportional electrical signal, 
thus providing a noncontact means of gathering shaft torque information. Figure 6d 
shows a surface acoustic wave (SAW) torque sensor provided by Schott. As no 
batteries or wires are required, this sensor can be used in applications that tradi- 
tional sensors cannot be applied, such as on rotating shafts or in environments 
where access to the sensors is difficult or potentially hazardous. 


Vibration 

During the machining process, the interaction between the cutting tool and work- 
piece produces vibrations. Spindle rotation and cutting force oscillations also 
produce vibrations. As cutting tool wears or chatter occurs, the vibration charac- 
teristics, such as amplitudes and frequencies, will change relatively. Many failure 
modes in machining process can be reflected in the vibration signals. In addition, 
vibration can be easily measured. It has been widely used in machining process 
monitoring. 

Vibration can be measured using acceleration, velocity, or displacement 
sensors. To monitor the machining process, the accelerometer is regarded as one 
of the most convenient sensors for vibration measurement. An accelerometer is a 
device to measure acceleration. Among a large variety of sensing principles for 
vibration detecting, piezoelectric transduction is the most common type. A piezo- 
electric accelerometer utilizes the piezoelectric effect to measure dynamic changes 
in mechanical vibration. It can measure the mechanical vibration of the machine 
structure resulting from the cutting process, typically up to 10 kHz. Examples of 
piezoelectric accelerometers are shown in Fig. 7. 

A piezoelectric accelerometer is small in size and easy to be installed on new or 
existing machine tools. It can meet very well the requirements due to the tough 
environmental conditions in the machine tool in terms of splash protection, 
moisture-proofing, resistance to aggressive media, and resistance to flying chips. 
Because of these advantages and its comparatively low price, this type of sensor is 
often used for tool condition monitoring to detect missing tools, broken tools, out- 
of-tolerance parts, machine collision, and severe process faults. It is also possible to 
monitor excessive vibration on bearings or spindles (Jemielniak 1999). 
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Fig. 8 An AE sensor and a measurement setup 


Acoustic Emission 

Acoustic emission (AE) is commonly defined as a transient elastic wave generated 
by the rapid release of energy from localized sources in a material or on its surface. 
The major sources of AE in machining are friction on the rake face and the flank, 
plastic deformation in the shear zone, crack formation and propagation, impact of 
the chip at the workpiece, and chip breakage. Within the crack formation, a high- 
amplitude signal occurs due to tool breakage, which makes the AE sensor a very 
useful tool for breakage detection. 

Piezoelectric AE sensor is one of the particularly suitable sensors to measure AE 
signal in machining process monitoring. It can be easily installed on both new and 
existing machines. With very wide sensor bandwidth, up to 1 MHz, AE can detect 
most of the phenomena in machining. Nevertheless, appropriate approaches for 
data acquisition and signal processing need to be carefully selected. Band-pass 
filters are used to choose appropriate frequency ranges; thus, there exists great 
flexibility for AE detection. Typically, AE sensor consists of the sensor housing, the 
piezoelectric sensing element, and the built-in impedance converter. A root mean 
square (RMS) converter, gain selection unit, and filters are also always contained 
within the housing. Various signal processing and analysis methods are available to 
interpret sensor signals and extract feature information (Chen et al. 2008). 

Most AE sensors are to be attached to the machine tool surface. Figure 8 shows a 
Kistler AE sensor and an example of a measurement setup when it was mounted to a 
fixture to monitor machining process. There are also alternative methods of AE 
wave transmitting. The high-frequency and low-amplitude nature of AE means that 
signal transmission via a coupling fluid is possible. Figure 9a presents a coolant AE 
sensor developed by Nordmann with a frequency range from 113 kHz to 1 MHz. 
This signal transmission method has a distinct advantage for rotating tools such as 
in milling and drilling and measures signal very close to the tool or the workpiece. 
A rotating, wireless AE sensor RSA, comprising rotating acoustic emission sensor 
and upright measured value receiver, serves for the sound pickup of grinding 
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Fig. 9 (a) A coolant AE sensor and (b) wireless AE sensor (Courtesy of Nordmann) 


wheels and diamond rollers (Fig. 9b). It is suitable for applications where signals 
from rotating shafts or translatory components, such as slides, pallets, spindles, etc., 
have to be passed to a fixed receiver for analysis and monitoring. The smaller the 
distance between the receiver and transmitter, the stronger the measured values 
transmitted and the weaker the effects of possible malfunctions in the case of 
neighboring electromagnetic interference fields. 


Visual Image 

Due to the development of the image processing technology, various image sensors 
can be used to obtain information about the cutting tool condition as well as the 
machined parts. An image sensor converts an optical image into an electronic 
signal. The earlier studies employed analog sensors such as video camera tubes, 
which suffer from geometric distortions and image drift and are obsolete already. 
Nowadays, there are two types of image sensors that dominate imaging devices: 
CCD (charge-coupled device) and CMOS (complementary metal-oxide-semicon- 
ductor). A CCD image sensor is an analog device. When light strikes the chip, it is 
held as a small electrical charge in each photo sensor. The charges are converted to 
voltage one pixel at a time as they are read from the chip. A CCD captures light 
much like a piece of film, all at the same time. On the other end, a CMOS sensor 
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uses a “rolling shutter.” It captures light through capturing each pixel one by one. 
Each row of pixels (from top to bottom) is captured in sequential order, creating a 
rolling effect. Traditionally, CMOS sensors use less power than CCDs. In general, 
CCD sensors deliver higher image quality while CMOS sensors are more efficient 
and consumer friendly. 

The image sensors can be used in tool condition monitoring regarding tool wear. 
Preprocessing is applied to reduce noise and keep the detailed information, before 
the tool morphologies are recognized and measured by certain algorithms. In 
addition, they have been proposed as an alternative to the stylus profilometer 
among the noncontact techniques in terms of accuracy, speed, and flexibility 
(Kurada and Bradley 1997; Lanzetta 2001). 


Temperature 

In terms of the principles of temperature measurement, there are mainly resistance 
thermometry devices, thermocouples, and radiation pyrometers. Resistance ther- 
mometry devices depend on variations in electrical resistivity with temperature 
change. This type of sensor is used extensively in calibration laboratories because 
they are inherently simple, low cost, and highly accurate. The thermocouple (TC) is 
the most widely used temperature sensor, which consists of two dissimilar metals 
that produce a voltage in the vicinity of the point where the two metals are in 
contact. The voltage produced is dependent on the difference of temperature of the 
junction to other parts of those conductors. Thus, this property is used to measure 
the temperature. Finally, a radiation thermometer/pyrometer uses an optical system 
to collect the energy emitted by the target, followed by a detector which converts 
this energy to an electrical signal. An emissivity adjustment is used to match the 
thermometer calibration to the specific emitting characteristics of the target and an 
ambient temperature compensation circuit, to ensure that temperature variations 
inside the thermometer due to ambient conditions do not affect accuracy (Davies 
et al. 2007). 

The significance of assessing temperatures during machining operations and 
analyzing their effects on both the workpiece and the cutting edge of the tool has 
long been recognized. High temperatures at the tool-workpiece interface accelerate 
the flank wear mechanisms and promote plastic deformation on the machined 
surface. That leads to a significant thermal load of the subsurface which may induce 
phase transformation, generate surface alterations, and produce high tensile resid- 
ual stress values which have a negative effect on fatigue life of the machined parts 
(Le Coz et al. 2012). However, measuring temperature during machining is very 
challenging. It is not easy to choose a proper temperature sensor. According to 
(Ueda et al. 2001), the important factors, temperature range, sensor robustness, 
temperature field disturbance by the sensor, sensitivity and signal noise ratio, and 
response time, should be reminded, together with the following criteria: easiness of 
calibration, availability, cost, and size. 
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Fig. 10 A quartz strain 
transducer (Courtesy of PCB) 



Strain 

Strain sensors are relatively easy to retrofit and often work for indirect force 
measurement. As such, these devices can provide valuable insight into the behavior 
of mechanical systems or processes that generate an associated machinery reaction. 
Strain gauge is a sensing device formed by a thin strip made from a metallic high- 
resistivity material that rests on a pliable support. It changes resistance at their 
output terminals when stretched or compressed. When the conductive track is 
stretched, it becomes thinner and longer and the ohmic resistance increases. Strain 
gauges are typically bonded to the surface of a solid material to measure its minute 
dimensional changes when put into compression or tension. They are predomi- 
nantly used in full bridge circuits to permit the compensation of interferences, such 
as temperature compensation. Strain gauges and their underlying principles are 
often used in devices for measuring acceleration, pressure, tension, and force. There 
is also a quartz strain transducer incorporating piezoelectric quartz sensing crystals 
that respond to a longitudinal change in distance. This type of quartz strain sensors, 
as shown in Fig. 10, is used to monitor the dynamic response of crimping, stamping, 
punching, forming, and any other applications where it is crucial to maintain 
process control. 


Displacement 

Displacement sensors measure the distance an object moves, and they can 
also be used to measure dimensions such as object height and width. There are 
two types of displacement sensors: contact type that utilizes a dial gauge, 
differential transformer, etc. and noncontact type that utilizes a magnetic field, 
laser beam, etc. Inductive sensors using eddy current technology can be used to 
measure distance, displacement, or position of electrically conductive targets 
without contact. These objects may have ferromagnetic or non-ferromagnetic 
properties. A high-frequency magnetic field is generated by passing a 
high-frequency current through the sensor head coil, as shown in Fig. 11 
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Fig. 11 Principle of 
inductive (eddy current) 
position sensor (http://www. 
chenyang-ism.com/ 
EddyCurrentDistance.htm) 



(http://www.chenyang-ism.com/EddyCurrentDistance.htm. Accessed Dec 2012). 
When a metal target is present in the magnetic field, electromagnetic induction 
causes an eddy current perpendicular to the magnetic flux passage to flow on the 
surface of the target. This changes the impedance of the sensor head coil. Inductive 
displacement sensors measure the distance between the sensor head and target, 
based on this change in oscillation status. There are also many other sensor types. 
Capacitance sensors are sometimes used to detect the spindle shaft displacement 
due to cutting load. Ultrasonic sensors, similar to radar or sonar, are employed to 
generate high-frequency sound waves and calculate the time interval between 
sending the signal and receiving the echo to determine the distance to an object 
(Teti et al. 2010). 

The laser displacement sensor is also widely used in both industry and labora- 
tories. The CCD laser displacement sensor shown in Fig. 12 uses a triangulation 
measurement system. This laser sensor uses a CCD as the light-receiving element, 
detects the peak value of the light quantity distribution of the beam spot, and 
identifies this as the target position. Therefore, the CCD enables stable highly 
accurate displacement measurement, regardless of the light quantity distribution 
of the beam spot. 


Signal Conditioning 

Most sensors usually cannot be connected directly to a data acquisition device. 
Rather, they need preprocessing by a conditioner specific to the sensor. The outputs 
of sensing elements can be either voltage, current, charge, or frequency based. 
Signal conditioning circuitry is required to map the sensing element’s output signal 
into a range that the data acquisition circuitry can process. As a majority of sensors 
generate analog output signals, this circuitry is essentially analog rather than digital. 
Depending on the type of sensors employed, specific signal conditioning circuits 
are needed, such as charge amplifier, coupler, etc. Signal conditioning can include 
amplification, filtering, converting, range matching, isolation, and any other pro- 
cesses required to make sensor output suitable for processing after conditioning. 
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Fig. 12 CCD laser 
displacement sensor 
(Courtesy of KEYENCE) 



The key purpose is to provide distinct enhancements to both the performance and 
accuracy of data acquisition systems. 

The signals generated by sensors, whether charges, voltages, currents, or elec- 
trical properties, usually have weak amplitudes. In order to process the 
signal accurately, better match the analog-to -digital converter (ADC) range, thus 
increasing the measurement resolution and sensitivity, the signal needs to be 
amplified. 

Filters are used to reject unwanted noise within a certain frequency range. The 
signals generated by the sensor must be clean and band limited, i.e., the signals must 
be as free of noise as possible, and the frequency content of the signal must be 
limited to a certain range. Moreover, low-pass filtering is used to prevent aliasing 
from high-frequency signals. This can be done by using an antialiasing filter to 
attenuate signals above the Nyquist frequency. 

Other conditioning functions, such as level shifting or isolation, may be needed so 
that the sensor output can be converted to a form that can match the requirement of 
the ADC. Examples of commercial sensor conditioners are shown in Fig. 13, a 
charge amplifier for a piezoelectric dynamometer and a Piezotron coupler for an 
acoustic emission (AE) sensor. Due to high impedance of the piezoelectric sensor, 
the output signal of the sensor is connected directly to a buffer amplifier, which 
converts the charge signal from the sensor into a proportional voltage signal. 
High-pass and low-pass filters are sometimes combined to eliminate high-frequency 
noise components caused by electric sparks and useless low-frequency noise com- 
ponents, respectively. Figure 14 shows the function block of the AE-Piezotron 
coupler with built-in RMS converter and a limit switch for the processing of 
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Fig. 13 Examples of signal conditioners, (a) A charge amplifier for piezoelectric dynamometer 
and (b) a Piezotron coupler for acoustic emission sensor 


high-frequency sound emission signals, so the AE signals can be recorded or 
processed with the conventional, less expensive signal processing equipment. 


Data Acquisition 

Data acquisition (DAQ) is the process of sampling signals and converting the 
resulting samples into digital numeric values that can be manipulated by a com- 
puter. Figure 15 is a typical measurement chain used for sensing the real-world 
measurand, converting analog and digital information about real-world physical 
conditions into the language of computers, until the decision-making stage. 

The analog-to-digital conversion (ADC) process converts the input continuous 
physical quantity from a sensor to a digital number that represents the quantity’s 
amplitude, which can then be processed in a digital signal processor or microcom- 
puter. The conversion involves two main challenges: quantization and sample rate. 
Quantization is the procedure of constraining something from a continuous set of 
values to a discrete set. Measurement transducers or transmitters typically provide 
continuously varying signals between 0-10 V DC, ±5 V DC, 0-100 mV DC, and 
4-20 mA DC. Thermocouples and resistance temperature devices (RTDs) are other 
common low- voltage inputs. Quantization introduces a small amount of error, 
which is the difference between the original signal and the digitized signal. Quan- 
tization error is there because of the finite resolution of the digital representation of 
the signal. It is an unavoidable imperfection in all types of ADCs. The higher the 
ADC resolution is, the lower the quantization error is. 

The resolution of an ADC is represented in terms of bits, and the more the bits, 
the higher the resolution. The number of bits determines the number of divisions 
into which a full-scale input range can be divided to approximate an analog input 
voltage. For example, 8 bit resolution of a 0-10 V input signal means that the range 
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Fig. 14 Function blocks of the Piezotron coupler (Courtesy of Kistler) 


958 


H. Li and Y. Chen 


Fig. 15 Measurement chain 


Real world 



Decision making 


is divided into 2 8 = 256 steps. This means a size of 10 V/256 = 0.039 V for each 
step. In addition, a 10 V input is equal to the digital number 255, and a 0 V input 
corresponds to 0. Each 0.039 V increment or decrement in the input equals adding 
or subtracting 1 from the previous number. 

Apart from digitalizing the data, DAQ also quantize time which is parceled into 
discrete intervals (Fig. 16). The analog signal is continuous in time. An ADC often 
performs the conversions periodically; as a result, the analog signal is converted 
into a flow of digital values, i.e., a continuous-time and continuous-amplitude 
analog signal is converted to a discrete-time and discrete-amplitude digital signal. 
It is therefore required to define the rate at which new digital values are sampled 
from the analog signal. The rate of new values is called the sampling rate or 
sampling frequency of the converter. It defines the number of samples per unit of 
time (usually seconds) taken from a continuous signal. The unit for sampling rate is 
hertz, which indicates samples per second. The time interval between two succes- 
sive samples is the sampling period or sampling interval, which is in fact an inverse 
of the sampling frequency. 

To enable a perfect signal reconstruction after sampling, the sampling frequency 
should be high enough. The Nyquist sampling theorem is a fundamental theory that 
defines the necessary relationship between the highest frequency contained in a 
signal and the minimum required sampling rate. It stated that the sampling fre- 
quency should be at least twice the highest frequency contained in the signal being 
sampled. If a function x(t ) contains no frequencies higher than f c Hz and f s is the 
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Fig. 16 Signal sampling into discrete time intervals 


sampling frequency, then it is required that f s > 2f c . This theorem also implies a 
formula that when the band limit B is too high or there is no band limit, the Nyquist 
criterion is not satisfied, and then the reconstruction exhibits imperfections known 
as aliasing. If the sampling rate is not fast enough, the presence of totally nonex- 
istent frequencies may be indicated, as in Fig. 17. Low-pass or antialiasing filters 
can be used to limit the measured waveform’s frequency spectrum so that no 
detectable component equals or exceeds half of the sampling rate. 

There are a series of trade-offs to consider when choosing devices for ADC and 
data acquisition. Higher resolution or more bits means more accurate conversion 
but more expensive hardware. Similarly, slower sample rates mean cheaper A/D 
conversion, while still the Nyquist criterion must be satisfied. 


Signal Processing 

Signal processing deals with the analysis of signals so that the information 
contained in signals can be extracted, displayed, analyzed, interpreted, or converted 
to another type of signal that may be of use. The purpose of signal processing is for 
feature extractions which greatly reduce the dimension of the raw signal but at the 
same time maintain the relevant information of process condition in the extracted 
features. Various feature measurements are exploited to assess their capability in 
collecting meaningful information about process condition from sensory signals. 
Specifically, they are classified as the time domain, the frequency domain, and 
time-frequency domain analysis. In this section, only computer-based approaches 
for digital signal process are introduced. 
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Fig. 17 Aliasing caused by small sample rate 


Time Domain Analysis 

Analyzing the measured signals in the time domain provides one of the simpler 
detection and diagnosis approaches. Generally, when a signal is measured with a 
sensor and acquired by a computer, it is viewed in the time domain, in a format with 
the vertical axis as amplitude or voltage and the horizontal axis as time. For many 
signals, this is the most logical and intuitive way to view them. It can show how 
a signal changes over time through as simple as a visual checking or more 
sophisticated approaches like trending time domain statistical parameters. Figure 18 
shows the time domain waveforms of the measured cutting force signals in the X, 
Y, and Z directions for a milling operation. Enormous amount of information can 
be obtained through checking the time waveforms of the signal, such as the 
transients, amplitude modulation, tooth-passing frequency components, repetitive 
impacts, etc. 

However, the time domain signal is not so informative as it is very time 
consuming to look at the raw signal in a graphical format. Thus, signal features 
are necessary to be derived that can describe the signal adequately and maintain the 
relevant information about the process or tool conditions. Usually a number of 
statistical parameters are calculated for comparison and diagnosis. The most com- 
mon parameters include the peak, arithmetic mean, root mean square (RMS), 
variance (or standard deviation), skewness, kurtosis, signal power, peak-to-peak 
value, crest factor, etc., as listed in Table 1. 

Time series analysis is also used for signal processing. This method analyzes a 
sequence of data points, measured typically at successive time instants with uni- 
form time intervals, in order to extract meaningful statistics and other characteris- 
tics of the data. Three main time series analysis techniques are frequently used 
in machining monitoring: autoregressive (AR), moving average (MA), and 
autoregressive -moving average (ARMA). These methods can be employed to 
predict signal value at the next time instant, which can be further used for failure 
prediction. 
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Fig. 18 Cutting force signals for a milling operation 


Table 1 Common statistical parameters for time domain signal analysis 


Feature names 

Definitions 

Equations 

Arithmetic mean 
(M) 

Mean of amplitude values of raw data signal 

M-iy> 

i=i 

Root mean square 
(RMS) 

Root mean square of amplitude values 

RMS - J'n X > 2 

Variance (V) 

Variance of amplitude values 

y E," 

n — 1 

Skewness (Sk) 

Third central moment and a measure of the 
asymmetry of the raw data 

a - i 

Kurtosis (Ku) 

Fourth central moment and a measure of the 
peakedness of the raw data 

k - - j ” 

Power (P) 

Measured area under the rectified signal envelope 

' = i X> 2 

i=i 

Peak-to-peak 
amplitude (pp) 

Highest peak value minus the lowest peak value 
attained by a signal 

— 

Crest factor (CF) 

Peak amplitude of a waveform divided by the 
RMS value 

CF — peak 

^ RMS 
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Frequency Domain Analysis 

A frequency domain analysis can provide information with regard to signal com- 
ponents in certain frequency range and is regarded as the most useful method for 
signal processing. In many machining operations with a rotating spindle (with 
either a rotating workpiece or rotating tool), the frequency spectrum of measured 
signals, such as vibration or cutting forces, carries a great deal of information that 
can be used to monitor and diagnose the process and tool condition. As a sampled 
signal is always discrete, a discrete Fourier transform (DFT) is often used for 
signals composed of data sampled at given spaced intervals. A continuous signal 
may be reconstructed without information loss if the sampling frequency is greater 
than twice the highest frequency component in the signal (Nyquist frequency). 
A low-pass filter is usually applied to avoid aliasing. Direct application of the DFT 
is typically modified by windowing to force the signal to appear periodic and thus 
reduce leakage from frequency components. For example, a Hanning window can 
be applied to the raw data to prevent leakage. Spectra are averaged in order to 
reduce the variance of spectral estimates. 

For an N-point measured signal series x(n ) in the time domain, with a sampling 
period of T, the discrete Fourier transform is given as 

N 

x(k) = k = 1 , 2 , . . . ,n. (i) 

n= 1 

An inverse DFT can transform the resultant frequency spectrum back into the 
time domain, which is given by 

x(n) = n=l,2,...,N. (2) 

^ k= 1 

In practice, the DFT is calculated by using fast Fourier transform (FFT), which is 
an efficient algorithm that provides a fast and convenient means to compute DFT of 
the time domain signal with exactly the same result. FFT is available in many 
software packages such as MATLAB and NI measurement studio. In addition, the 
power spectral density (PSD) can be estimated using the mean square of the 
spectrum amplitude after FFT or through the transform of the autocorrelation 
theorem. It should be noted that peaks in the frequency domain are better to be 
detected using the PSD, which gives the energy at each frequency. To plot the 
frequency spectrum, the scale of the frequency axis should be determined based on 
the sampling rate f s = 1 IT and the total number of sampled data points N. 
The frequency resolution after FFT is / = 1 /(NT) = fJN. It is important to note 
that only N/2 independent spectrum values exist after Fourier transform. The 
second half of the frequency spectrum, from fJ2 to f s , actually represents the 
negative frequency components from —fJ2 to 0; thus, there is no meaning to 
visualize or analyze it. An example of the frequency spectrum amplitude for the 
cutting force in an end milling operation is shown in Fig. 19. 
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Fig. 19 Frequency spectrum amplitude for the cutting force in a milling 


Time-Frequency Domain Analysis 

Although FFT-based methods are widely used as a powerful tool for condition 
monitoring and fault diagnosis, one main limitation for the methods is that they are 
not suitable for nonstationary signals. A nonstationary signal means that its fre- 
quency changes over time. The sensor signals detected during machining are some- 
times nonstationary. FFT is unable to detect the frequency changes as it averages the 
frequency composition over the duration of the signal with fixed resolution of the 
entire frequency spectmm. To address this issue, a number of time -frequency domain 
analysis techniques have been developed, including the short-time Fourier transform 
(STFT), the Wigner-Ville distribution, and the wavelet transform. 

The short-time Fourier transform (STFT), also known as waterfall plot or 
spectrogram, takes a linear approach for a time-frequency representation. It uses a 
windowing function w[n] sliding along the time axis to characterize the change of 
frequency components at different time intervals, i.e., a series of the conventional 
Fourier transforms of finite length time signal are evaluated by applying a 
windowing function to the original time signal. Given a data set x[n ], the discrete 

STFT at time n and frequency co = ^ k is represented as 
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where w[n] is the analysis window, which is assumed to be nonzero only in a certain 
interval, and N is the frequency sampling factor, which determines the number of x 
[ft]. The size of w[n] greatly affects the time-frequency resolution of STFT. If the 
window size of w[n] in the time domain is long and correspondingly the bandwidth 
of W[co\ is narrow, such a window is suitable for sinusoidal or harmonic signals. 
Otherwise, it is good for fast time- varying signals. One major difficulty with STFT 
is that it cannot provide good time and frequency resolution simultaneously because 
it employs constant resolution at all frequencies, a finer frequency is achieved at the 
expense of time resolution, and vice versa. 

The Wigner-Ville distribution (WVD) is a time -frequency energy distribution, 
which is particularly useful for diagnosing short-time nonstationary events such as 
impacts and resonance excitation. The approach is based on the use of the autocor- 
relation function for calculating the power spectrum. The signal is compared to 
itself for all possible relative shifts, or lags. The WVD is always real valued. It 
preserves time and frequency shifts and satisfies the marginal properties. Due to the 
quadratic nature of the WVD, its sampling has to be done with care. Usually the 
time signal should be known for all time before the WVD can be computed. 

The wavelet transform (WT) has been developed to overcome the problem of 
STFT in which the window size is fixed when it slides along the time. The WT is 
similar to the STFT and the WVD in that it provides a time -frequency map of the 
signal being analyzed. The wavelet transform (WT) uses windows of different sizes 
for different frequencies: high frequencies are analyzed with narrower windows for 
better time resolution, while at low frequencies wider windows are used for better 
frequency resolution. Thus, the WT can extract more information in the time 
domain at different frequency bands, especially when a narrow window size is 
used to capture rapidly changing details. As a result, the WT studies high-frequency 
components with sharper time resolution than the low-frequency components. 

The wavelet transform uses wavelets of finite energy to analyze signals. Wave- 
lets are in fact localized waves. They have their energy concentrated in time or 
space and are suited to analysis of transient signals. Similar to the STFT in which 
the time signal to be analyzed is multiplied with a window function, in the wavelet 
transform, the signal is multiplied with a wavelet function, and then, the transform 
is computed for each segment generated. However, the width of the wavelet 
function changes with each spectral component. The continuous wavelet transform 
of a real signal x(t) is defined with respect to an analyzing wavelet which is 
complex in general, as 
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where the translation parameter x relates to the location of the wavelet function as it 
is shifted through the signal. It corresponds to the time information in the wavelet 
transform, s represents the wavelet scale, and y/* denotes the complex conjugate of 
y/. There is a large number of wavelet families that exist. 

The discrete wavelet transform (DWT) is based on sub-band coding which 
provides a fast computation of wavelet transform. It is easy to implement and 
reduces the computation time and resources required significantly. It is presented as 



where yr is a wavelet function. When the scaling parameter j is reduced, the wavelet 
is compressed, making it suitable to approximate rapidly changing details or higher 
frequency and vice versa. On the other hand, the parameter kl J indicates the 
location along the time axis. 

Multi-resolution analysis (MRA) is a mathematical construction that enables 
real-time computation of the DWT, which is shown below as 




OO 


J— 1 OO 




where <pj j is the scaling function, y/jj is the wavelet function, (o 0? /) are the scaling 

function coefficients, and (djj) are the corresponding wavelet function coefficients. 
They can be defined as 



It is noted here that the DWT makes expanding a function into corresponding 
scaling and wavelet coefficients very efficient. It decomposes the signal into 
approximations (the scaling coefficients) and details (the wavelet coefficients). 
The approximations carry low-frequency components of the signal. The details, 
on the other hand, retain high-frequency information of the signal. The wavelet 
transform is actually a subset of a far more versatile transform, the wavelet packet 
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transform which is also used for signal processing. This method decomposes both 
approximations and details and generates more frequency bands where more 
specific information can be derived. 


Artificial Intelligence Approaches 

An effective tool condition monitoring system not only collects signals but also 
makes decision and provides responses to some phenomena identified as bad 
cutting conditions. Recently, many artificial intelligent algorithms have been 
employed for decision making in machining process monitoring and tool fault 
detection, such as artificial neural networks (ANNs), fuzzy logic systems, and the 
hybridization of neuro-fuzzy systems. 

An artificial neural network (ANN) is a mathematical model inspired by 
biological neural networks made up of an interconnected group of artificial neurons. 
It processes input information using a connectionist approach to determine the 
activation of the neuron and output the results. In general, an ANN is a computa- 
tional model that consists of an interconnected group of artificial neurons. A neuron 
possesses information by its dynamic state response to external inputs using a 
connectionist approach to computation via learning algorithm and recall algorithm. 
The neurons are arranged in a series of layers. Multilayer feed-forward networks are 
the most common architecture. By a learning process, a neural network can always 
change its structure to better respond to the environment. There are several learning 
algorithms for training neural networks. Backpropagation has proven to be suc- 
cessful in many industrial applications and it is easily implemented. Neural net- 
works are always exploited to model complex relationships between inputs and 
outputs or to find patterns in data. 

Fuzzy logic deals with reasoning that is approximate rather than fixed and exact. 
It was first introduced as an effective method to describe uncertainties in terms of 
imprecision, vagueness, and imperfect knowledge. A fuzzy logic allows decision to 
be made with estimated values under incomplete or uncertain information. It is thus 
suitable for uncertain or approximate reasoning. A typical fuzzy control system 
consists of four conceptual components: knowledge base, fuzzification interface, 
inference engine, and defuzzification interface, as shown in Fig. 20. The knowledge 
base has two bases, a fuzzy control rule base and a data base. The data base is the 
declarative part of the knowledge base which defines objects of the sets and 
membership functions. Unlike the Boolean logic, objects of the sets are represented 
with their membership degrees to that set. The fuzzy control rule base is the 
procedural part of the knowledge base which contains information on how these 
objects can be used to infer new control actions. The inference engine employs 
linguistic terms resulting from the fuzzification interface and the if-then statement 
used to decide the output of the system. It is the central part of a fuzzy control 
system. The fuzzification interface is to translate each crisp value obtained from the 
input into a linguistic term and degree of membership function, and the 
defuzzification interface is to convert the fuzzy value into a crisp value. 
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Fig. 20 Block diagram of a fuzzy control system 


Neuro-fuzzy is actually combinations of artificial neural networks and fuzzy 
logic to synthesize their advantages. Neural networks have the possibility of dealing 
with large amounts of sensor data simultaneously using simple processing ele- 
ments, and fuzzy logic builds a structural framework that utilizes the low-level 
results. There are different methods for the combination. An approach to merge 
these two technologies by fuzzificating conventional neural network architectures 
with interconnection substituted with fuzzy relationship or fuzzified processing 
elements is given in (Li and Elbestawi 1996). The interconnection is represented 
by fuzzy membership grades. The hidden layer consists of the neurons that use 
fuzzy classification. 


Monitoring Strategies and Approaches 
Tool Wear Estimation 

Considerable research efforts have been concentrated on the field of tool condition 
monitoring to detect tool failures, such as tool wear and tool breakage. It has been 
reported that about 20 % of machine tool downtime was caused by tool failure, 
which results in problems in the workpiece dimensions and surface integrity (Liang 
et al. 2004). Direct monitoring systems for tool wear are not easily implemented 
due to the lack of ingenious measuring methods. Thus, indirect measurements are 
required for the estimation of cutting tool wear. A number of types of signals can be 
used for tool condition monitoring, including AE, vibration, cutting forces, and 
spindle power/current. 

Axinte and Gindy (2003) investigated the correlation between the condition of 
broaching tools and multisensor outputs from AE, vibration, cutting forces, and 
hydraulic pressure using time and frequency domain analysis. Kim et al. (2002) 
proposed an online detection of drill wear based on spindle motor power consump- 
tion during a drill process. In (Ibrahim Nur et al. 1995), wavelet transforms are 
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applied to the raw force signal to eliminate unnecessary details for tool 
failure detection in end milling. Li et al. (2006) carried out an experimental study 
of the tool wear propagation and cutting force variations in the end milling of 
Inconel 718 with coated carbide inserts, where the tool wear propagation was 
believed to be responsible for the gradual increase of the mean peak force in 
successive cutting passes. The development of a tool wear observer model for 
flank wear monitoring in the milling of Nickel-based alloys was introduced in 
(Li et al. 2009) to estimate the flank wear growth from the measured cutting force 
signals. 

Recently, some advanced techniques have been employed for tool wear detec- 
tion, including the use of infrared microscope (Arrazola et al. 2008), vision sensor 
(Zhang et al. 2012), and online laser measuring system (Gandarias et al. 2006). 
Wang et al. (2008) proposed a CBN tool wear estimator based on a generalized 
fully forward connected neural network, which is trained by the Kalman filter 
algorithm. Ozel and Karpat (2005) used a three-layer neural network (MLP) with 
a different training method, the Levenberg-Marquardt training method together with 
Bayesian regularization, to predict the surface roughness and tool wear in hard 
turning. 

In (Susanto and Chen 2003), fuzzy logic-based in-process tool wear monitor 
system in a face milling operation was investigated. The triangular fuzzy set was 
selected as the fuzzification basis with two independent fuzzy sets of feedrate and 
depth of cut. These membership functions were defined based on experimental 
observations and experimental runs. Such a system has gained a good accuracy in 
predicting tool wear. 

Chungchoo and Saini (2002) employed another online fuzzy neural network 
with cutting forces and RMS of AE signal as input for estimation of tool wear. It 
was divided into four parts: the first part was to classify tool wear by using fuzzy 
logic; in the next part, the output of the previous part was normalized and limited 
between 0 and 1; the third part estimated flank and crater wear using a modified 
least-square backpropagation neural network (MLSB); the final part adjusted the 
results of the third part by another fuzzy logic rules. 

There are still other approaches in tool condition monitoring. The least squares 
version of support vector machines (LS-SVM) was used for online tool wear moni- 
toring in turning (Salgado and Alonso 2007), while Wang et al. (Wang et al. 2002) 
proposed hidden Markov models (HMMs) for tool wear monitoring. Achiche 
et al. (2002) used a hybrid technique based on the fuzzy logic and genetic algorithm 
(GA). GA helped to automatically constmct the fuzzy knowledge base from the same 
set of experimental data without requiring any human expert and gave a great 
advantage in the short learning time. The method, called support vector regression 
(SVR), was also adopted to detect tool breakage in a milling process (Cho et al. 2005). 
S VR is a statistical learning theory which enables learning machines to generalize well 
to unseen data. This research showed that SVR performed well with a tight threshold 
value for tool breakage determination. However, it had to be trained through a 
relatively complicated tuning process of design parameters. 
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Tool Breakage Detection 

Tool breakage is one of the most critical fault conditions in a machining process. 
Detection of tool breakage is regarded as an important role in machining process 
monitoring to achieve intelligent machining. Since tool breakage is usually accom- 
panied with an abrupt change of the sensing signals, this feature can be employed 
for tool breakage detection. However, in some machining operations such as 
milling or drilling which is characterized as an interrupted cutting process, how 
to distinguish the tool breakage with the normal cut in and cut out phase with a 
robust real-time algorithm remains a challenging issue. 

Li et al. (2007) presented a recursive crest factor monitoring algorithm 
(RCFMA) for the tool breakage detection. Crest factor, also known as peak-to- 
average ratio, is the ratio of the peak value to RMS value of a waveform. It can be 
used for estimating the amount of impact. A high crest factor corresponds to a 
signal having a high peak voltage compared to the average signal level. In general, 
the greater the crest factor, the greater the energy contained in higher frequency 
harmonics. RCFMA is implemented in the time domain and suitable for both the 
point-by-point processing and block processing. The RMS value used is not from 
the current signal but from the preceding data block. In a block processing situation, 
the calculation equation for the so-called recursive crest factor Cf is 

Cf = ^~ ( 11 ) 

X RMS 

where x' RMS is the RMS value from the preceding data block. To account for the 
cutting-in phase where x' RMS may take the value of zero, a reference RMS value, 
RMSf -, is used for comparison with every new x RM s • Once the new x' RM s is less than 
RMSf -, it will take the value of RMSf in the calculation of Cf. In a point-by -point 
processing situation, the Cf is calculated once a new signal x t arrives and x RMS is the 
RMS value of the previous cutter revolution from the historic data block. 

Figure 21 shows a set of recorded signals in the time domain for a slot 
milling process in which the 4-flute end milling cutter was broken at about the 
1-1.2 s. The sensing signals include the cutting force components at the x, y, 
and z directions, the noise signal, and acceleration signals picked up on the 
quill and on the work fixture. It is clear from Fig. 4 that there were rapid and 
sharp signal changes for all the six monitoring channels. The signals from 
different sensors were at a different order of magnitude. The detection results 
of RCFMA under point-by-point processing mode and block processing mode 
are shown in Figs. 22 and 23, respectively. It can be seen that RCFMA outputs 
normalized monitoring results despite the diversity of the order of magnitude of 
the original signal. Under normal cutting condition, the recursive crest factor 
(RCF) takes a value of less than two for all the six channels. When tool 
breaking occurs, there is a sudden increasing of the RCF to a value larger 
than three. This characteristic of RCF enables a common tool breakage detec- 
tion criterion to be chosen. 
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Fig. 21 Time domain signals for a slot milling process, with the milling cutter broken at about the 
1-1.2 s 


Altintas (2000) introduced a tool breakage detection algorithm using the 
average resultant cutting forces per tooth period, Fa(m). When the tooth breakage 
and runout are not present and the cutter is not in a transient geometric zone, 
theoretically equal average cutting forces will be there for each tooth on a milling 
cutter. Thus, the first differences of the average cutting forces will be zero. 
Otherwise, the average cutting forces will reflect the changes in the chip load 
and be nonzero. A first-order adaptive time series filter can remove the slow 
varying dc trend caused by the changes in the workpiece geometry, and the 
runout of each tooth can be removed by comparing the tooth’s performance by 
itself one revolution before. The two filters are run recursively in parallel at every 
tooth period. When the cutting forces increase from a level of air machining at 
the beginning of cut, the maximum residuals of both filters are measured during 
the first five revolutions of spindle, in which it is assumed that the cutter is not 
broken. The breakage thresholds are selected by scaling the maximum residuals 
by user-defined factors, and they are used during the rest of machining with the 
cutter. A tool breakage event is assumed whenever both residuals exceed their 
thresholds, followed by a further transient assurance check in the following 
resolution. 
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Fig. 22 Tool breakage detection using RCFMA under point-by-point processing mode 

Chatter Detection 

Tool- workpiece vibration is a common detrimental phenomenon and a major 
obstruction in achieving higher productivity and better accuracy. Machine tool 
chatter is a self-excited vibration between the cutting tool and the workpiece at 
large metal removal rates, which affects the finish of the finished product, accuracy 
of the machining, and tool life. The cutting forces between the cutting tool and 
workpiece generate significant dynamic deflections of the tool structure and the part 
structure. When this interaction becomes unstable, chatter occurs. Regenerative 
chatter results in excessive machining forces and tool wear, tool failure, and scrap 
parts due to unacceptable surface finish. 

A common approach for chatter detection is to investigate the spectral density 
of a process signal and develop a threshold value that indicates chatter. A method 
for in-process monitoring of chatter in CNC turning was based on the analysis of 
the PSD of cutting forces measured (Liang et al. 2004). The ratio of the cumula- 
tive PSD for a certain frequency range corresponding to the states of cutting to 
that of the whole frequency range of each force component, coupled with properly 
selected threshold values, was calculated as the indicator to classify the cutting 
states. 
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Fig. 23 Tool breakage detection using RCFMA under block processing mode 


Li et al. (2007) proposed an approach for the detection of chatter occurrence in 
the frequency domain. Under normal cutting condition, the dominant frequency 
components are around the tooth-passing frequency, the spindle rotating frequency, 
and their harmonics. When chatter occurs, the dominant frequency usually shifts to 
near a dominant structural frequency. This phenomenon was used to detect chatter 
occurrence by analyzing the distribution of the power spectral density (PSD) of the 
acquired sensing signals. 

Figure 24 shows an example of the recorded sensing signals in the time 
domain for the whole cutting period of a slot milling process in which chatter 
occurred at about the 2nd second. A 4-flute end milling cutter was used and the 
cutter rotating speed was 1,600 rpm. The four signal channels include the cutting 
force components at the x, y, and z directions and acceleration signal picked up 
on the quill. Figure 25 shows the power spectral density of the signals taken 
before the onset of chatter. It is clear that the highest peaks for all the signals in 
the frequency domain are around 106.7 and 26.7 Hz, which are the tooth-passing 
frequency and the spindle frequency, respectively. The power spectral density of 
the signals taken after chatter occurred is shown in Fig. 26. Now, for all the four 
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Fig. 24 Measured signals in the time domain for a slot milling process with chatter 


signal channels, the dominant frequency component is shifted to a lower 
frequency at about 95.4 Hz. The frequency component at the tooth-passing 
frequency is still visible in Fig. 26, but obviously weak compared with the 
dominant component. 

Altintas (2000) presented a chatter detection approach by continuously mon- 
itoring the amplitude of the sound spectrum measured by a microphone. The 
maximum amplitude of the power spectrum that occurred at the chatter fre- 
quency was searched every 250 ms. A chatter threshold was selected which was 
well above the maximum amplitude observed in stable, chatter-free machining 
tests. The CNC system assumed that chatter was present whenever the measured 
sound spectrum amplitude exceeded the threshold during milling on the partic- 
ular machine tool setup. Yao et al. (2010) proposed a method for online chatter 
detection in boring. A two-dimensional feature vector, made up with standard 
deviation and the energy ratio of wavelet transform in the chatter-emerging 
frequency band, was used for pattern classification using a support vector 
machine (SVM). Griffin and Chen (2009) used genetic programming 
(GP) classification technique to investigate two grinding anomalies, grinding 
burn and grinding chatter. The demarcation between each anomaly was 
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Power spectral density of the signals in Fig. 24 before the onset of chatter 


identified from acoustic emission signals converted to the frequency-time 
domains using short-time Fourier transforms (STFT). An approach to detect 
chatter onset using multisensors was introduces in (Kuljanic et al. 2008). The 
signal characteristics from rotating dynamometer, accelerometers, acoustic 
emission, and electrical power sensors were condensed in both the time domain 
and the frequency domain by means of statistical methods. The results showed 
that multisensor systems composed of three or four sensors were the most 
promising solution for reliable and robust chatter identification, although the 
cost can be relatively high. 

Figure 27 shows a chatter detection system for milling operation based on 
artificial intelligent approaches (Kuljanic et al. 2009). The signals from two accel- 
erometers and an axial force sensor were first processed by wavelet decomposition 
to extract statistical parameter features. These features were then used to detect 
chatter by neural networks. The outputs of the neural networks for individual sensor 
signal were further combined in order to obtain a multisensor chatter indicator using 
a linear combination of sensor chatter indicators, neural network, fuzzy logic, and 
statistical inference. Evaluation of the performances of these strategies showed that 
it is possible to obtain efficient chatter detection in terms of both accuracy and 


23 Machining Process Monitoring 


975 


x 

LL 
■ ■ 

O 

CO 

CL 




N 200 

^ 150 

m 100 

50 


x 10“ 3 




Fig. 26 


Power spectral density of the signals in Fig. 24 after the onset of chatter 
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Fig. 27 Outline of a multisensor chatter detection system (Kuljanic et al. 2009) 


robustness against malfunctions, and the system is compatible with modem 
machine tool operation and automation. 


Surface Integrity and Chip 


Surface integrity is the surface condition of a workpiece after being machined. 
In terms of machining process monitoring, it mainly indicates the topography 
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characteristics rather than surface layer characteristics, which includes surface 
roughness, waviness, errors of form, and flaws. To monitor the surface integrity, 
both direct and indirect measurements are employed. An in-process direct mea- 
surement of surface roughness using fringe field capacitive (FFC) method during 
grinding was given in (Nowicki and Jarkiewicz 1998). This method enables a 
control of surface roughness (. Ra >0.1 pm), together with advantages not sensitive 
to materials, working liquids, and other disturbances. Nevertheless, direct surface 
integrity detections are less extensively used as it needs special apparatuses or the 
specially designed sensors which is always difficult to install or use. On the other 
hand, for indirect surface integrity measurements, vibrations, forces, and AE 
signals are the most widely used signal sources. 

In order to develop appropriate techniques for qualitative/quantitative evalua- 
tion of the machined surface integrity, time and frequency domain analyses of 
sensory signals in different machining operations are carried out to extract 
features. For example, features derived from AE signals using time-frequency 
domain analysis were correlated with the occurrence of workpiece surface anom- 
alies (e.g., laps, material drag) in multi-teeth milling applications by Marinescu 
and Axinte (2009). The surface quality, in terms of geometric accuracy, burr 
formation, chatter marks, and surface anomalies, has been detected in broaching, 
based on time and frequency domain analysis of AE, vibration, and cutting force 
signals (Axinte et al. 2004). Artificial intelligence approaches have been used for 
the surface integrity monitoring, such as using the artificial neural networks 
(ANNs) (Tsai et al. 1999), fuzzy logic systems (Latha and Senthilkumar 2010), 
and the neuro-fuzzy systems (Abburi and Dixit 2006). Huang and Chen (2003) 
developed an in-process neural network-based surface roughness prediction sys- 
tem. Other approaches such as genetic expression programming (GEP) algorithm 
(£olak et al. 2007) and support vector machines (Salgado et al. 2009) have also 
been used for surface integrity monitoring. 

Andreasen and Chiffre (1998) used threshold values of peak and RMS of cutting 
forces to automatically detect chip breaking in a CNC lathe. Chip breaking was 
detected if a peak or RMS had an amplitude exceeding the corresponding threshold. 
In (Kim and Ahn 2002), a monitoring method to detect the state of chip disposal in 
drilling based on neural network and spindle motor power sensing was proposed. 
Variance/mean, mean absolute deviation, gradient, and event count of spindle 
power signal were calculated as feature vectors and then presented to the neural 
network with an error-backpropagation algorithm to make a decision on the state of 
chip disposal. Experimental results showed that the proposed monitoring system 
could successfully recognize the state of chip disposal over a wide range of drilling 
conditions. Jemielniak et al. (2006) reported two methods respectively for chip 
form detection in turning in different labs. The first method used was the wavelet 
packet transform analysis. Several features of each packet were calculated, standard 
deviation, variance, moment of 3rd degree, moment of 4th degree, and energy, and 
sorted according to the observed chip forms to identify the features that presented 
separate value ranges for different chip forms. The other employed the neural 
network for pattern recognition. Cutting force signal specimens were first processed 
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to achieve their spectral estimation through a parametric method. Then these 
extracted features were presented to three-layer feed-forward backpropagation 
ANNs. 


Summary 

Machining processes have to deal with the deformation of the cutting tool, the 
machine tool, and the workpiece and the tool anomalies such as tool breakage and 
tool wear, which is caused by cutting force, thermal effect, and chatter vibration. 
In-process sensing and monitoring improve machining quality and efficiency in the 
drive towards automated and intelligent machining. Many machining process 
monitoring systems have been developed in the past two decades. The future 
enhancement of machining processes requires the monitoring systems to be robust, 
reconfigurable, reliable, intelligent, and inexpensive. This chapter has presented the 
fundamental techniques and recent development of machining process monitoring 
systems. It covers a wide variety of measurands and sensors such as motor current/ 
power sensor, force/torque sensor, acoustic emission sensor, vibration/acceleration 
sensor, temperature sensor, and optical/vision sensor that have been utilized to 
monitor machining process. Data acquisition techniques and important concepts 
like ADC, quantization, sampling rate, Nyquist sampling theorem, and aliasing are 
explained. Various signal processing techniques including time domain analysis, 
frequency domain analysis, time-frequency domain analysis, and artificial intelli- 
gence are introduced, together with detailed machining process monitoring strate- 
gies, approaches, and examples. An effective monitoring scheme relies on accurate 
interpretation of the sensor outputs and mapping the extracted signal features with 
the corresponding process characteristics. Due to the complex of the machining 
process, further development of robust signal processing algorithms and sophisti- 
cated feature extraction algorithms to recognize process conditions is needed that 
can be applied to manufacturing industries with acceptable cost. 
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Abstract 

This chapter is focused on coolant and lubrication in machining. The cutting 
fluids are known to aid the machining process by providing cooling action and 
lubrication. The functions, properties, classification, and guidelines for selection 
and application of the cutting fluids have been discussed in detail in this chapter. 
One of the key aspects of machining is the tribology of the tool-chip and tool- 
workpiece interfaces where the contact conditions are severe and the tempera- 
tures are very high. The contact stresses and the temperature estimates at these 
interfaces have been presented which could help in selecting an effective 
cooling/lubrication strategy during metal cutting. The recent advances in field 
of cutting fluid application in machining, such as dry machining, minimum 
quantity lubrication (MQL), and cryogenic machining, have also been covered. 


Introduction 

It is known that the machining operation induces very high plastic deformation in the 
shear zone of the metal being machined. The shear zone is termed as the primary 
deformation zone (PDZ). In addition, there is very high friction at the tool-chip 
interface which is also termed as the secondary deformation zone (SDZ). The cutting 
speeds are typically high during the metal cutting resulting in severe plastic defor- 
mation in the PDZ and high friction in the SDZ. The plastic work and the frictional 
losses get converted into heat which results very high temperatures at the tool-chip 
interface and in the shear zone. Figure 1 shows the areas where heat is generated 
which can be conducted to workpiece, chip, and tool or lost to the environment. 

The temperature may damage the tool and the machined surface of the work- 
piece if it crosses a certain limit. It results in increased tool wear, lower part 
accuracy due to thermal expansion of part, and subsurface damage in the work- 
piece. Consequently, it is desirable that the heat is not conducted to the workpiece 
or the tool. 

To prevent damage from high temperature, cutting fluids are used in various 
machining processes. A cutting fluid can reduce temperature by removing the heat 
from the heat generation zones. Additionally, it helps in reducing the friction at the 
tool-chip/tool- workpiece interface. Hence, majority of metal cutting operations 
performed use cutting fluid. It also helps in chip removal in high-speed machining. 
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Fig. 1 Regions of heat 
generation in metal cutting 


Tool-chip interface 



Use of cutting fluid may help in increasing the material removal rate (MRR) by 
increasing depth of cut and/or cutting speed and improve tool life, surface finish, 
and dimensional accuracy. A wide range of cutting fluids is available for machining 
based on the application. The cooling capacity of the cutting fluid depends upon the 
properties of the base fluid and the volume, whereas, the chip removal capacity is 
based on the coolant application method and the operation geometry. The lubricat- 
ing property of the cutting fluid is governed by chemical composition of the base 
fluid. 


Functions of Cutting Fluids 

The cutting fluid is the integral part of metal cutting processes; most of the 
machining operations are performed with cutting fluids. Fluid consumption, main- 
tenance, and disposal costs have significant contribution in the final cost of the 
manufactured product. Coolants and lubricants (known as cutting fluids) are used in 
metal cutting to reduce the friction between contacting surfaces of tool and work- 
piece which helps in escaping the excess heat. Some cutting operations do not need 
cutting fluid, especially, if the machining speed is low and hence the heat generation 
is negligible. However, most of the cutting operations end up producing significant 
heat and excessive rubbing which results in low tool life and poor surface finish. 
Cutting fluids have multiple functions. Some of key functions of the cutting fluids 
are as follows (ASME 1952): 

• To reduce the temperature of the cutting tool so that the mechanical strength and 
the chemical integrity is retained 

• To keep the temperature of the workpiece low to prevent thermal expansion/ 
damage and corrosion 

• To reduce energy consumption and tool wear by reducing friction at the tool- 
chip and tool-workpiece interfaces 

• Reduction of built-up edge in the tool and enhancement of tool life 

• To improve the finish and surface/subsurface integrity of the machined surface 

• To provide assistance in chip formation and flow 
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• Help in chip removal, especially, from narrow or deep machining, e.g., deep 
drilling, milling, etc. 

• To provide a corrosion-resistant layer on the freshly cut metal surface to prevent 
oxidation 

• To provide lubrication to the machine parts near the machining area 


Properties of the Cutting Fluid 

There are certain properties that the cutting fluids must possess for proper func- 
tionality. These are listed below (ASME 1952): 

• High heat absorption capacity to remove the heat from the tool and the work- 
piece at a satisfactory rate. 

• It should have good lubrication properties. The fluid film should be able to 
sustain high pressures present at the tool-chip and tool- workpiece interfaces. 

• The coolant should have a high flash point, i.e., it should be nonflammable. 

• The cutting fluid should not have solid precipitates at normal working 
temperatures. 

• High-temperature stability is required to prevent the oxidation of the coolant and 
sticky deposits on the machine of workpiece parts. 

• The components of the cutting fluid should not easily become rancid and no 
unpleasant odor should emanate when heated or used. 

• It should not be either toxic or prone to contamination and have a long useful 
life. No harm should be caused to the machinist due to the sustained exposure of 
cutting fluid. 

• It should be environment friendly to dispose and should be capable of being 
recycled. 

• A cutting fluid should be inert so that it does not react with the tool, workpiece, 
or machine parts. 

Recently the adverse health effects of the cutting fluids have been reported. The 
material safety data sheet (MSDS) lists all the fluids and its effect on the health. The 
National Institute for Occupational Safety and Health (NIOSH) develops documents 
on the prevalence of hazards, health risks, and adequate control methods (American 
National Standards Institute, American National Standard Technical Report 1997). 


Major Categories of Cutting Fluids 

The cutting fluids can be divided into three major categories (Cook 1966): 

• Water-based cutting fluids 

• Mineral-oil based 

• Synthetic 
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Most of the commercial cutting fluids are water based containing salts or soluble 
oils or mineral-oil based. Usually, the mineral-oil-based coolants contain sulfur, 
chlorine, or organic compound. The mineral-oil-based coolants are known to be 
effective in reducing the coefficient of friction during low-speed machining where 
the tool is repeatedly exposed to fluid, e.g., milling, grinding, etc. However, their 
use is limited by smoke and fire hazard to operations where the temperature levels 
are relatively low. 

The water-based fluids usually contain the same ingredients as the oil-based ones 
but in the form of an emulsion. Water-based fluids are used for high-speed (high- 
temperature) conditions where high flow is required. The cost of water-based 
cutting fluids is lower than that of oil based. Note that the specific heat of water 
is about twice that of any other cutting fluid; hence, the water-based fluids excel for 
cooling purposes. A detailed description of various types of cutting fluids used in 
different machining operations is presented in section “Classification of Cutting 
Fluids.” 

There is evidence that gases can have a substantial effect on the cutting process. 
Cooling the tool body by refrigerant is also effective but expensive and potentially 
hazardous for the environment. Other methods that are being explored are using 
aerosols for minimum quantity lubrication (MQL) and cryogenic coolants which 
may be helpful for certain machining/grinding operations. 

A process planar can select suitable cutting fluid depending on the application 
and the cutting fluid handling system present in the machine. The selection of the 
cutting fluids is mostly based on the experience as very little theoretical work exists. 
This chapter presents a discussion of tribology of cutting followed by properties and 
selection methodology of cutting fluids for various machining application. 
Advanced topics, such as MQL and cryogenic and green machining, have also 
been discussed. 

Tribology of the Tool-Chip and Tool-Workpiece Interface 

This section is focused on the tribology of the tool-chip and tool-workpiece 
interfaces, such as the nature and profile of contact stresses and temperature profile 
as a function of cutting parameters. The tool-chip and tool- workpiece interface are 
those contact areas that directly participate in the cutting process. The tool-chip and 
tool-workpiece interfaces shown in Fig. 2 are subjected to dissimilar contact 
conditions, and the contact geometry/pressures in both the cases are quite different. 
Despite the differences, there are certain similarities in the tribology of these 
interfaces, such as high contact pressures and temperatures and high sliding veloc- 
ities. However, the tool-chip interface plays a leading role that it is the location with 
high temperature and the site for the crater wear. Consequently, considerable 
portion of this section is dedicated to the tribological conditions at the tool-chip 
interface. 

Only 30-50 % of the energy required by the cutting system is spent for the useful 
work, i.e., for the chip formation. The rest is spent at the tool-chip and 
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Fig. 2 The tool-chip and 
tool- workpiece interfaces 
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tool-workpiece interfaces. It may be noted that most of the energy consumed in the 
cutting process is converted to heat. Interestingly, the temperature in the primary 
deformation zone (PDZ), where the major work of plastic deformation and chip 
formation occurs, is relatively low (normally 80-250 °C). In contrast, the maximum 
temperatures at the tool-chip interface can reach well above 1,000 °C. The main 
objective of this section is to understand the tribological conditions at the tool-chip 
and tool- workpiece interface. This knowledge can be used in selection of the 
process parameters, tool coatings, and appropriate coolants and their delivery 
mechanisms. 


Tool-Chip Interface 

During metal cutting, the cutting force acts mainly on the rake face, which is in 
contact with the chip, and, hence, the interface region is known as the tool-chip 
interface. It is of interest to determine the cutting force and understand the mechan- 
ics of chip formation to establish the tribological characteristics of the tool-chip 
interface . 

Friction Coefficient 

In Merchant’s analysis, the contact between the tool and the chip is assumed to be 
sliding contact, wherein the coefficient of friction does not vary (Merchant 1945). 
A lot of physical contact conditions are described by a constant coefficient of 
Coulomb friction // f , 



where N is the normal force acting at the considered interface and F is the frictional 
force at this interface. 

It is well established in the machining literature that contact between the two 
bodies is limited to only asperities (summits on the contacting surfaces). In reality, 
the total friction originates due to plastic deformation of interlocking asperities, 
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adhesion, and the shear stress in the fluid film. However, the stresses can be 
estimated by assuming that the forces are distributed over the total (apparent) 
contact area. Such an approximation is valid for machining where the actual and 
apparent contact areas are close due to high contact pressure (Zorev 1966). The 
coefficient of friction at the tool-chip interface, /q, can be estimated by 


flf 


T f 


a 


( 2 ) 


n 


where the mean normal stress at the interface is a 


n 


N/A c , the mean shear 


(frictional) stress at the interface is ry = F/A c , and A c is the area of tool-chip 
interface . 

The above discussion was for sliding friction at the interface. Under the extreme 
condition, the maximum friction that is achievable is the shear flow stress of the 
material. This is known as sticking friction, where there is no relative motion 
between the chip and the tool at the interface. Both these zones are shown in 
Fig. 3. For sticking friction, this shear strength should be equal to the flow stress 
in shear, k f , and the normal stress to the yield stress of the work material a y . Using 
the von Mises yield criterion, the coefficient of friction under sticking conditions is 



Therefore, the value of the friction coefficient defined by Eq. 2 should be 
considered as the limiting value so that if // f > 0.577, no relative motion can occur 
at the interface. 
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General Remarks on Tool-Chip Interface 

The results of the theoretical and experimental studies on the tribological condition 
at the tool-chip interface can be summarized as follows (Astakhov 2006): 

• Use of fixed coefficient of friction is not sufficient to characterize the sliding 
between the chip and tool and thus should not be used in metal cutting studies. 
Most of the commercial software packages use sliding friction which cannot be 
considered as adequate to model any real cutting process. In reality, the contact 
stresses at the tool-chip interface and their dependence on the parameters of the 
cutting process should be considered. 

• The chip formation process is of cyclic nature, and the normal and shear stresses 
as well as the shape of their distributions vary within each cycle of the chip 
formation. 

• The length of the tool-chip interface depends both on the uncut chip thickness 
and rake angle. The dependence of the contact length on the cutting speed is 
similar to the dependence of chip compression ratio (chip thickness/undeformed 
chip thickness) on the cutting speed. 

• The contact length is directly proportional to the uncut chip thickness for 
different work materials. Once on the most important tribological characteristics 
at the tool-chip interface is the ratio of the contact length to the uncut chip 
thickness referred as the P 0 -criterion. This criterion is not dependent on the 
changes in the properties of the work material. 

• The mean shear stress at the tool-chip interface is a function of the work material 
properties. It may be noted that the mean shear stress determines the temperature 
at the tool-chip contact, and it can be stated that this temperature is primarily a 
function of the cutting speed and the work material. 

• The mean normal stress at the tool-chip interface is relatively more sensitive 
than the shear stress to the cutting process parameters. This stress increases with 
an increase in the cutting speed and decreases with the rake angle. It depends on 
the P 0 -criterian in the same way as this criterion affects the state of stress in the 
deformation zone. 

• Among the tool material properties, the greatest influence on the contact condi- 
tion at the tool-chip interface is its thermal conductivity and adhesion properties 
as opposed to the elastic properties. Higher conductivity increases the mean 
contact temperature at the tool-chip interface. It also affects the temperature 
distribution over this interface. 

• The adhesion condition at the tool-chip interface affects the cutting process. The 
solubility of the tool in the workpiece material should be carefully considered in 
the selection of the tool material for a given application. At elevated tempera- 
tures, it could lead to accelerated tool wear. 

From the discussions above, it can be inferred that the tool-chip interface 
cooling is extremely important for the cutting process. The lubrication film from 
the cutting fluid reduces the coefficient of friction, thereby, reducing the cutting 
forces. In addition, it affects the crater wear by reducing the sticking zone which 
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facilitates the diffusion based crater wear. The rate of chemical diffusion from the 
tool to chip is lower at lower temperatures. 


Tool-Workpiece Interface 

The tool-workpiece interface is not as critical as the tool-chip interface; hence, it 
has been studied as comprehensively as the tool-chip interface in the literature. The 
key findings can be summarized as follows (Astakhov 2006): 

• The normal and shear stresses are highest near the tool-tip of the cutting edge as 
shown in Fig. 2. The magnitude of the stresses decreases over the contact length 
finally becoming zero at the end of the contact. 

• Adhesion takes place in the region adjacent to the cutting edge because the 
sticking friction is observed close to the tool-tip and the shear stresses are equal 
to the shear flow stress in that region. 

• Although the level of stress, temperature, and strength of adhesion bonds are 
much lower at the tool-workpiece interface compared to those at the tool-chip 
interface, the sliding velocity at the tool-workpiece interface is much higher. The 
increased sliding speed can result in high flank wear for ductile and difficult-to- 
machine materials. The cooling and lubrication of the flank (area of contact 
between tool and workpiece due to wear) is essential to keep the flank wear in 
control. 


Temperature at the Interface 

The heat produced due to plastic deformation and friction in machining results in 
very high temperatures (~ 1,000 °C) in the tool-chip interface. The high tempera- 
tures observed in metal cutting affect the flow stresses of the work material and, 
therefore, the chip formation. The temperature at the interface determines the mode 
of the tool wear and the type and delivery mechanism of the coolant. 

To address these issues, a lot of work has been reported in the literature. Trent 
and Wright reported (Trent and Wright 2000) that the consideration of heat 
generated in metal cutting is to explain the role of temperature in the rate of 
metal removal. Their study showed that there is no direct correlation between 
cutting force or power consumptions and the temperature near the cutting edge. 
Zorev (Zorev 1966) found out that temperature was not an important factor. Based 
on the energy balance in metal cutting, he reported that the maximum temperature 
at the end of the chip formation zone did not exceed 270 °C for plain and alloyed 
steels, while temperature-dependent changes in the mechanical properties do not 
kick in before 300 °C. Gorczyca (1987) showed the practical application of 
temperature in metal cutting. He operated at a temperature called the burnout 
temperature which is defined as the temperature at which the tool cannot efficiently 
perform the cutting process. Unfortunately, this property is not available in the 
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specifications for the tool materials. Milton Shaw (1984) presented a detailed 
description of various methods of temperature assessment in machining and 
emphasized the importance of the cutting temperature because of following 
reasons: 


• The shear plane temperature has an influence on the flow shear stress of the work 
material and also affects the temperature of the tool face and the flank. 

• The temperature on the tool face and flank is very important to crater wear and 
flank wear rates, respectively. 

• The temperature on the tool face also influences the size and stability of the built- 
up edge. 

• The higher workpiece temperature can affect the surface integrity. 


The average temperature at the tool-chip interface depends on the thermal 
conductivities of the tool and the workpiece along with the heat capacity of the 
workpiece. It can be approximated by the following formula (Astakhov 2006): 





0.225 7 , 0.15 
w K t 




where C e is a constant, k t is thermal conductivity of tool, k w is thermal conductivity 
of workpiece, C p is the specific heat, and p is the density of the work material. 

Because the temperature generated in machining is one of the major factors that 
determine tool life and that limit the cutting rate, its proper assessment, correlation 
with the characteristics, and parameters of the cutting system and control are of 
enormous significance in metal cutting theory and practice as well as in tool and 
coolant selection. 


Classification of Cutting Fluids 

The cutting fluid primary role is cooling, so a variety of fluids may be used for the 
purpose. However, broadly the cutting fluids can be divided into the following 
classes (Astakhov 2006; ASME 1952): 

• Air jet or induced draft 

• Aqueous solutions: plain water or water containing a small percentage of alkali 

• Emulsions of soluble oils 

• Straight cutting oils 

• Synthetic 

• Semisynthetic 

• Cryogenic fluids 

Figure 4 shows the share of different cutting fluids in the US market. The soluble 
oils are the most widely used followed by straight oils, semisynthetic, and synthetic. 
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Semi-synthetic 
19 % 


synthetic 

12 % 


Straight oils 
23 % 



Soluble oils 


46 % 


Fig. 4 Share of different kind of cutting fluids in the US market 

Air 

Air is not a cutting fluid in conventional sense but is often applied as suction or 
high-velocity jet via a nozzle with cutting operations such as turning or boring cast 
iron, surface grinding, and polishing. Its primary objective is to remove small chips 
and dust, but it does provide some cooling in the cutting zone. 


Aqueous Solutions 

Plain water or water containing traces of alkali acts as a suitable cutting fluid where 
cooling and chip removal are required. This aqueous solution can be used in 
grinding, drilling, sawing, light milling, turning, etc. The fluid is economical but 
can corrode the machine and work if not handled carefully. It may thin the lubricant 
necessary of bearings and other machine-tool wearing parts in the vicinity of the 
machining zone. Such aqueous solutions may contain up to 1.5 % by weight of an 
alkali, such as borax, sodium carbonate, or trisodium phosphate. 


Emulsions of Soluble Oils and Pastes 

The soluble oils typically comprise of mineral, vegetable, and rarely animal oils 
along with an emulsifier such as soap. The emulsifiers include the following: acid 
sludge; turkey red oil; diethylene glycol; saponified phenol; saponified naphthenic 
acids, used up to 15 %; or alcohol. Cutting pastes have been used in the past in 
which very high soap content in the saponified mineral oil gives it a grease-like 
consistency. These are mixed with water to form emulsions. The oil should be 
poured into the water. Emulsions of soluble oils or paste comprise of 1 part oil in 
5-100 parts of water, depending upon the machining operation. They are low-cost 
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cutting fluids and are used for practically all the cutting and grinding operations on 
all types of metals, except magnesium (ASME 1952). Water emulsifiable oils 
possess good cooling capacity, low viscosity, adequate wettability, nonflammable, 
nontoxic, easy to separate from small chips and wear particles using standard filters. 
The main disadvantages are insufficient lubrication, rancidity, misting, and low 
stability (components have different degradation levels) (Shaw 1984). The soluble- 
oil emulsions are used where the cooling requirements supersede the lubrication 
requirements. Emulsions are capable of forming stronger films and assuring greater 
protection against corrosion than water or alkali solutions. In addition, the soluble- 
oil emulsions produce lower fumes and provide better cooling capacity than the 
straight oils (discussed in the next section). Soluble oils are used extensively in 
grinding, turning, milling, drilling, sawing, and operations where cooling is the 
primary requirement. 

Although the most important functions of a cutting fluid is to cool the tool and 
workpiece, soluble oils cannot always outperform the straight cutting oils. In many 
operations the use of the most effective lubricant possible is of utmost consider- 
ation, i.e., when it is necessary not only to reduce the temperature and the thermal 
distortion but also to facilitate the chip formation. High-speed machining opera- 
tions require rapid heat dissipation which can be delivered by the soluble oils. 
However, the key factors limiting the use of emulsions might be their tendency to 
corrode the machine parts. 

The soluble oils should have following desirable qualities: 

• Capability of forming stable emulsions at room temperature with water. 

• It should have long shelf life and not deteriorate in storage. 

• It should produce sticky residue and clog the moving parts of the machine. 

• It should not foam readily. 

• It should have good wettability for the metal being machined. 

• It must not be toxic for the machinist. 

The main causes for problems encountered with emulsions are water chemistry 
and inversion of emulsion. The water chemistry governs the pH value and water 
hardness. The inversion of the emulsion is caused by a gradual loss of water through 
evaporation from the emulsion, thereby increasing the concentration of oil; as a 
result the emulsion changes from oil-in- water phase to the undesirable water-in-oil 
phase. The inverted emulsion is slimy and may possess bad odor. However, 
inversion seldom occurs when the emulsion contains ten parts or more of water 
for 1 part of oil. Hence, the percentage of water in soluble-oil emulsions should be 
checked periodically. Stable emulsions are governed by their alkalinity. If there is a 
discernible increase in the acidity, the oil separates from the water. If the separation 
of the oil from the emulsion is not clearly visible, it can be checked for acidity with 
blue litmus paper. The cause of acidity could be attributed to prior processing of the 
metal, such as pickling for cleaning the metal. 

Most soluble oils emulsify fairly easily with hard water. However, water soft- 
ening can be done by adding trisodium phosphate (<1.5 % by weight). Other 


24 Coolant and Lubrication in Machining 


993 


alkalies, such as soda ash and borax, can also be used in similar quantities. The 
water should be softened prior to adding the oil (ASME 1952). 


Straight and Compound Cutting Oils 

Straight cutting oils include mineral oils of required viscosity and straight fatty oils. 
Straight mineral oils are typically suited for applications where good lubrication is, 
but cooling is not the key requirement. Some grades of straight cutting oils have 
good penetrating and adhesive properties, thereby promoting their value as cutting 
fluids. The penetrating quality is important because it enables the oil to enter in tiny 
fissures in the outer surface of the chip, thus aiding its formation. 

Straight cutting oils consist of base oil to which one or more mineral oils are added 
to obtain desired viscosity. The two main categories of mineral oils are naphthenic 
mineral oils and paraffinic mineral oils. Straight cutting oils contain up to 80 % 
mineral oils. The other components of straight cutting oils are chlorinated paraffin, 
active sulfur carriers, inactive sulfur carriers, friction modifiers, tackiness modifiers, 
viscosity index modifiers, anti-weld additives, odorants, and polar additives. Straight 
cutting oils generally contain EP (extreme pressure) or anti-weld additives such as 
chlorine and sulfur. It is known that EP additives react with the metal surface under 
high contact pressure/temperature and improve the lubrication properties under 
extreme conditions (Shaw 1984). Colloidal graphite, a chemically neutral carbon, 
can be used as an additive to oils and emulsions. It has been found to be very effective 
for increasing the tool life, reducing the cutting forces, and improving the surface 
finish in continuous machining operations on steel (ASME 1952). 

Straight cutting oils are most often used undiluted; however, under certain condi- 
tions, they are diluted with mineral oils and kerosene, etc. to obtain the right viscosity 
for application as coolant. The advantages of straight cutting oils are good lubrication 
and antiseizure properties, corrosion protection, and stability. On the other hand, poor 
cooling, mist formation and emission of fumes at high cutting speeds, and high initial 
and disposal costs are the major impediments. Straight cutting oils with additives find 
application in heavy-duty machining and grinding operations where lubrication is of 
utmost importance. These are generally slow speed operations where the cut is 
extremely heavy. Straight cutting oils are used in slow machining operations, such 
as broaching, threading, gear hobbing, gear cutting, tapping, deep-hole drilling, and 
gear grinding. As mentioned previously, straight cutting oils are not effective in high- 
speed cutting operation due to limited heat dissipation capability. However, under 
certain conditions, the cooling action of the cutting fluid could be dependent on 
volume and temperature rather than relative cooling ability. 


Synthetic Fluids 

Synthetic cutting fluids are water-based primarily inorganic concentrate which 
form translucent solution in water. These fluids comprise synthetic water-soluble 
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materials which do not contain any mineral oil. These synthetic fluids often contain 
corrosion inhibitors, biocides, surfactants, and antifoaming agents. 

Advantages of synthetic cutting fluids are the following: cost-effectiveness, 
nontoxic, rapid heat dissipation, completely nonflammable and non-fuming, easy 
mixing, high rancidity resistance, good filtration with standard filters, and biode- 
gradable. In addition, the detergent properties help clean the machine. However, the 
major disadvantage is insufficient lubrication in severe operations which can lead to 
wear and/or sticking. It can also react with nonmetallic parts and leave dry powdery 
residue. Safe disposal of coolants is becoming an ever-increasing problem with the 
advent of the Resource Conservation and Recovery Act in America. Synthetic 
fluids pose fewer disposal problems than the emulsifiable oils. Consequently, they 
have become more popular because synthetics are easier to treat than emulsifiable 
oils before they can be disposed. As a result, synthetic fluids are products to watch 
for in the future. A very large fraction of the development work on cutting fluids is 
focused on improving the synthetic fluid technology. 


Semisynthetic Fluids 

These are essentially a combination of synthetic fluids and emulsifiable oil in water, 
which may contain up to 25 % of oil. When diluted with water, they form stable 
emulsion for very small droplet size. They contain the good properties of both oil 
emulsions and the synthetic fluid. The mineral oil improves the lubrication which is 
a problem with synthetic fluids by applying protective and lubricating film (Mariani 
1990). Like all cutting fluids, semisynthetics are required to lubricate, cool, and 
protect metal parts and machinery. A variety of generic chemistries are used to 
accomplish these objectives (Mariani 1990). 

To provide lubrication, the semisynthetic cutting fluid first and foremost 
employs mineral oil which is present in all semisynthetics. Not only does mineral 
oil help in the formation of emulsion for the fluid chemicals, it also provides 
lubrication, making it suitable for heavy-duty machining and grinding. 

In order to emulsify the lubricants into water, emulsifying chemicals are 
required. Typical emulsifiers used are nonionic surfactants such as polyethers, 
amine soaps, petroleum sulfonates, and amine condensates. 


Cryogenic Fluids (Liquid Nitrogen) 


Liquid nitrogen (having temperature — 196 °C) is used as a cutting fluid for cutting 
difficult-to-machine materials including high titanium and nickel-based alloys, 
where chip formation and chip breaking are known to cause significant problems 
(Wang and Rajurkar 2000; Zurechi et al. 1999; Hong et al. 1999; De Chiffre 1980). 
Liquid nitrogen is used to cool workpiece and the tool via various configurations 
which could be either channelized or flooded. A detailed description of cryogenic 
machining is given in section “Cryogenic Machining.” 
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Key Design Considerations During Cutting Fluid Applications 

There are certain design considerations that need to be taken into account for 
efficient cutting fluid application. The key factors to be considered are as follows: 
cutting fluid pressure, temperature, flow rate, wettability, nozzle design and loca- 
tion of the machining zone, and filtration. 


Cutting Fluid Pressure 

The effectiveness of cutting fluids depends primarily upon the method of their 
delivery into the cutting zone. These cutting fluids can be applied in various ways, 
namely, low-pressure flooding, high-pressure flooding, through-tool delivery and 
mist cooling. 

In low-pressure flood cooling systems, the cutting fluid is supplied through a 
nozzle over the work zone at the normal water supply pressure. The coolant may be 
delivered through either fixed or flexible piping, depending upon the machine. The 
coolant nozzles should be directed ahead of the cut and ensure that sufficient 
coolant is being supplied to cover the workpiece and the trailing edge of the cutter, 
especially in milling operations. Inadequate or disrupted coolant application in 
milling operations can lead to fatigue of the cutter due to thermal cycling 
(Stephenson and Agapiou 2010). If the coolant volume is sufficient, low-pressure 
flood application is effective in clearing chips and cooling the part to maintain 
dimensional tolerance; however, its effectiveness in lubrication is limited because 
of lack of penetration of the cooling fluid into the tool-chip interface. 

In high-pressure flood cooling systems, the coolant is directed through the 
nozzles to impinge ahead of the cutting tool at high pressure. The inlet nozzle 
pressure lies typically between 5 and 50 bar (Stephenson and Agapiou 2010). Even 
higher pressures are used in some grinding operations and in impingement chip 
breaking jet systems. The coolant is typically applied through a rigid piping, with 
nozzles mounted on a ring around the spindle nose in boring, milling, and drilling. It 
is mounted on the toolholder behind the insert in turning and on a rigid pipe in front 
of the wheel in grinding (Malkin 1989). High-pressure application provides 
improved lubrication due to increased penetration of cutting fluid at the tool-chip 
interface. In addition, the chip removal is better, but it generates mist which may 
present a health hazard to the operator if proper care is not taken. The coolant tends 
to become aerated and starts foaming. 

In through-tool coolant systems, coolant is supplied through the spindle to 
coolant passages in the tool under high pressure. A discharge pressure of 13 bar 
is typical with most machine tools; however, the discharge pressures vary between 
35 and 100 bar in drilling applications (Stephenson and Agapiou 2010). Through- 
tool coolant helps in chip disposal in deep-hole drilling and grinding operations. 
Specially designed pumps and leak-proof seals to prevent damage to the spindle 
bearings are required for through-tool cooling system. These are integrated with the 
machine itself as opposed to a separate module. However, the coolant must be 
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filtered properly to prevent the debris from congesting the coolant pipes in the 
recirculating system. Disadvantages of a high-pressure coolant system are 
increased cost of equipment and maintenance along with tendency to generate 
mists and foam. 

Small jet equipment is used to disperse soluble oil or synthetic oil-water 
emulsions as very fine droplets in pressurized air as the carrier medium. The air 
pressure is typically between 69 and 552 KPa. Water-miscible fluids are preferred 
due to health advantages and lower clogging tendency. Mist is better suited for high 
speed and low area of cut, such as end milling. The mist cooling mechanism is 
explained in detail in section “Cooling Action of Air-Cutting Fluid Mixture.” 


Flow Rates of the Coolant 

Regardless of the method used to apply coolant, sufficient volume must 
be supplied to provide adequate cooling and chip removal. Coolant volume is 
measured by the flow rate in liters per minute. As a first cut approximation, the 
coolant volume should be 5-10 1/min for each KW for general machining (Hoff 
2000) and 10-20 1/min/kW for grinding (Drozda and Wick 1983). Consequently, 
an adequate sump capacity is required to ensure proper filtering and minimize 
foaming due to aeration. As a general rule of thumb, the sump capacity should 
be 3-10 times the flow rate per minute (Hoff 2000). In order to ensure 
uninterrupted machining, it is generally recommended that the sump capacity are 
5 times the flow rate for steel machining, 7 times the flow rate for cast iron and 
aluminum machining, 10 times the flow rate for grinding, and 10-20 times the flow 
rate for high stock removal machining and grinding (Drozda and Wick 1983). 
A rough guideline from machining handbook [for flow requirements] is given in 
Table 1 . 


Thermal Properties 

Schallbroch et al. (1938) developed an empirical formula relating tool life and 
temperature of cutting tool, which is given by 

T0 n = K (5) 

where T is tool life in minutes; 6 is temperature at tool-chip interface, °C; n is an 
exponent whose value depends mainly on the tool form and material; and K is a 
constant. 

From the above relationship, it can be seen that a small reduction in tool 
temperature can increase the tool life, and the cutting fluid can play an important 
role by effectively removing heat from the tool-chip interface. Consequently, the 
desired thermal properties of cutting fluids are high specific high, high thermal 
conductivity and a high film coefficient. The higher values of these properties 
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Table 1 Typical coolant flow requirements (Machinability Data Center 1980) 


Operation 

Coolant flow rate 

Turning 

19 1/min/tool 

Screw machining (for 1-3 in. dia) 

132-227 1/min 

Milling (small to large cutters) 

19-227 1/min 

Drilling/reaming small to large drilling 

8-1 1 1/min (0.3-0.43 1/min x dia in mm) 

Gun drilling (external chip removal, 5-40 mm dia) 

8-189 1/min 

Gun drilling (internal chip removal, 8-60 mm dia) 

20-500 1/min 

Trepanning (external chip removal heads, 
50-200 mm) 

30-400 1/min 

Trepanning (internal chip removal heads, 
60-600 mm) 

400-2,200 1/min 

Honing (small to large) 

10-20 1/min 

Broaching (small to large) 

38 1/stroke (small) 

0.45 1/stroke x length of cut in mm 
(large) 

Centerless grinding (small to large) 

75-150 1/min 

Other grinding 

0.75 1/min/mm of wheel width 



Fig. 5 Surface tension and the contact angle for a drop on the surface 


would facilitate enhanced heat transfer which is necessary for a good coolant. For 
cooling ability, water has all the desirable properties but can be corrosive and does 
not possess good lubricating properties. 


Wettability 

In order to form a continuous film all over the contact area and avoid agglomeration 
at certain locations, the fluid should possess good wettability with the contacting 
surfaces. This means that the coolant should be able to wet the surface thoroughly. 
The contact angle 6 measures the tendency of the fluid to wet the surface. A contact 
angle of 180° means no wetting (hydrophobic condition), while a contact 
angle of 0° infers complete wetting (hydrophilic condition). This is schematically 
shown in Fig. 5. 
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The contact angle, 0, depends on the interfacial surface tension between solid 
and vapor, solid and liquid, and liquid and vapor denoted by y S v , Ysl, and y LV . The 
force balance along x-axis yields the following relationship: 

Ysv = Ysl + Ylv cos 0 ( 6 ) 

For complete wetting, 6 = 0; hence, relationship between the interfacial surface 
tensions is given by 

Ysv ~ Ysl > Ylv ( 7 ) 

The above analysis is true for a smooth surface. However, if the surface is rough 
instead of being smooth, the interfacial tension y SL and y S v moves through a greater 
distance by a factor K during the wetting. Note that K is a function of the surface 
roughness. The contact angle, 0 , for a rough surface is given by 

co & e = K{7sy ~ YsL) 

Ysv 

From the above analysis it is evident that the wetting tendency will increase with 
an increase in the surface roughness. To improve the spreading and wetting of the 
cutting fluid over a surface, the contact angle must be decreased for which wetting 
agents could potentially be employed. Most wetting agents function by decreasing 
the surface tension of the liquid (y LV ) and/or the interfacial tension (y SL ) which 
reduces the contact angle. However, this could adversely affect the other functions 
of the cutting fluid (Bhattacharya 2000). 



Nozzle Design 

The coolant system can be supplied internally to spindle nose in numerous config- 
urations shown in Fig. 6. 

The coolant either can be through the tool and the spindle shaft or can go through 
the tool via the periphery of the shaft. Note that the coolant supplied through the 
tool via the shaft should be filtered down to 5-10 jum size or it can result in 
clogging. Other methods are using swiveling jets placed either inside or outside 
the spindle housing. 


Filtration 

In high-throughput machines, the coolant requirement is 10 1/min, and in grinding it 
can be twice as much. Consequently, it is imperative that the coolant be 
recirculated. However, the cutting fluid in recirculating systems entraps and trans- 
ports a number of particulates, such as chips, airborne contaminants, hydraulic oil, 
machine guide way lubricant, residues left on the part from previous operation, etc. 
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CooJan! through the 'oof and canter of spindle shaft Coolant through the tool with holes 

on the perimeter of the shaft 



Coolant through the spindle housing Coolant through swiveling jets on the machine 

unit located around the spindle housing 


Fig. 6 


Nozzle designs used in the delivery of cutting fluids (Stephenson and Agapiou 2010) 


(Brandth 1994). These contaminants must be filtered before reuse for optimum 
coolant performance and avoid damage to the machined surface. A number of 
techniques are used to separate the contaminants from cutting fluids. 

Common separation methods comprise of settling tanks, centrifuges, cyclones, 
and magnetic separators (Drozda and Wick 1983; Brandth 1994). A settling tank is 
a large tank with two or more baffles as shown in Fig. 7. It may be noted that as the 
fluid initially moves under and then over, the successive baffles which forces the 
tramp oils and lighter impurities to rise to the surface and chips and other heavy 
particulate matter to settle in the bottom. This way the lighter portion can be 
skimmed from the top, and the heavier portion can be removed from the bottom. 
The effectiveness of the tank is a function of the settling time which is the volume 
of the tank divided by the inlet flow rate. If the settling time is too short, then there 
will not be enough time for all the impurities to settle out; this will be accompanied 
by an increased tendency to foam since coolant bubbles will not have enough time 
to burst and coolant temperature will be more difficult to control (Aronson 2001; 
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Fig. 7 Settling tank for 
filtration of cutting fluid 
(Stephenson and Agapiou 
2010 ) 



Malkin 1989; Urandoff and McKinley 2003). The settling times are typically 5 min 
for small systems (Drozda and Wick 1983), 7-10 min for general-purpose systems 
(Foltz 2003), and 10-15 min for large systems (Drozda and Wick 1983; Brandth 
1994). 

Centrifuges and cyclones separate debris from the coolant under the action of 
centrifugal forces (Drozda and Wick 1983; Brandth 1994). In a centrifuge, the 
contaminated coolant is supplied between a set of rotating bowls or cone-shaped 
disks. Chips and other heavy debris gravitate toward the center of the bowls or 
disks, and the oils with low specific gravity are pushed radially outward. The bowls 
or disks get filled with debris and would require cleaning at regular intervals. In a 
cyclone separator shown in Fig. 8, used coolant is supplied into a cone-shaped 
vessel. The rotary motion experienced by the fluid pushes the chips and debris 
outward, and, hence, the clean fluid emerges from the center of the device (Drozda 
and Wick 1983; Brandth 1994; Urandoff and McKinley 2003). Magnetic separator 
systems are very effective for ferrous chips but can also remove abrasive grains 
adhering to ferrous fine in grinding systems. 

Since the separator systems lack the capability to remove contaminants having 
specific gravities close to that of the coolant, filtering systems are required in 
conjunction with the separator systems. These filtration systems use either dispos- 
able or permanent porous media, and the extent of filtering achieved is governed by 
the size of the pore in the medium. 

The filtering medium in these systems may be made of paper, cotton, wool, 
synthetic fibers, or felted materials. These media are often pre-coated with fine 
particles such as cellulose fibers or diatomaceous earth (Stephenson and Agapiou 
2010). In a bag-based filtering system, the coolant is sequentially passed through 
two or more fabric bags with decreasing mesh sizes. The bags trap the chips which 
lie between these mesh sizes and, therefore, need to be changed periodically once 
they are full. Another mechanism is cartridge system which is similar to an 
automotive oil filter (Likens and Venner 2000). Flat-bed roll filters are commonly 
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Fig. 8 Cyclone separator for 
cutting fluid (Stephenson and 
Agapiou 2010) 



used in large recirculating systems. In this approach, the coolant is passed through a 
sheet of filter media. As filtering progresses and the medium becomes clogged, the 
fluid level in the tank rises triggering a float which rotates the roll to expose fresh 
media. Disposable media filters are typically driven by gravity, vacuum, or pres- 
sure. Pressurized systems operate at high pressures and are capable of producing the 
largest flow rates. The gravity and pressurized systems are used with flat-bed roll 
filters, whereas the pressurized systems use cartridge filters. The filtering systems 
can be either permanent with a wire mesh or permanent fabric as the filtering 
medium or disposable. Chips and other debris need to be periodically removed by 
which may be controlled by a timer or a float trigger. 

Cost is indeed an important factor for selecting the filtration system; however, 
the other important factors to be considered in choosing a filtering strategy are the 
following: level of filtration necessary, required system capacity, and the effect of 
the filter medium on the coolant (Drozda and Wick 1983). The level of filtration is 
denoted by the largest dimension of contaminant present in the coolant post 
filtration. Standard chip removal techniques and fluid filtration systems remove 
debris over 50 fim in size without any difficulty. Additional filtering stages are 
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required to remove finer debris (Richter 2003). A rule of thumb for general-purpose 
machining is that coolant should be filtered an order of magnitude lower than the 
tolerance band (Malkin 1989). This can lead to very stringent requirements for 
precision operations. 

The flow rates can be computed from the sump volume. The large systems 
typically pump at least three times the sump capacity in 24 h (Urandoff and 
McKinley 2003). Bag filters have flow rates between 1,000 and 2,000 l/min/nr of 
filter (Brandth 1994). Cartridge filters typically have a flow rate of 17 1/min/m of 
cartridge length (Likens and Venner 2000). Flat-bed systems deliver much higher 
capacities and therefore find application in large systems. 


Effect of Cutting Fluid on Machining Response 

The cutting fluid has significant role and has multiple effects, namely, cooling, 
lubrication, improvement of surface finish/integrity, and dimensional accuracy. All 
these effects will be discussed in detail in the following sections. 


Cooling Action 

The cooling action from the cutting fluid has the following effects (Astakhov 2006): 

• Reduces cutting temperature 

The cutting temperature is reduced by the coolants for cutting speeds up to 
150 m/min. If the cutting speed is increased further, it stabilizes the temperature 
as opposed to reducing it. It is known that the cutting temperature affects the tool 
wear as given in Eq. 5. Therefore, the cooling action of a cutting fluid extends the 
tool life if it brings the cutting temperature closer to the optimal cutting temper- 
ature. Experiments have shown that if the cutting temperature exceeds the 
optimal temperature at the tool-chip interface and diffusion wear takes place 
there, a 10-15 % decrease in the contact temperature leads to a two- to threefold 
reduction in the coefficient of diffusion between the tool and the work materials. 

• Improves the accuracy of machining 

By reducing the temperature of the tool and workpiece, better accuracy of 
machining can be achieved due to lower temperature. Low temperatures result 
in lower thermal expansion of the workpiece and tool. 

• Reduces the tool-chip contact length 

This is a negative aspect of cooling because the cutting forces are not normally 
reduced, and, therefore, higher contact stresses act at the tool-chip interface that 
may result in reduced tool life. 

• Increases thermal shocks in interrupted cuts 

Cooling during interrupted cutting such as milling can introduce thermal crack- 
ing on the cutting edge which could either grow causing a catastrophic failure of 
the cutting insert or result in poor quality of cut. 
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Mechanisms of Cooling During Application of Cutting Fluid 
Direct Cooling Action 

The main mechanism of cooling is due to forced convection. The rate of heat 
transfer is given by 

Q = A cjhfc ( 6 s f - 0 c f ) (9) 


where A cl is the heat transfer area of surface, h fc is the convection heat transfer 
coefficient of the process, and 6> sf and 0 cf are the temperatures of the surface being 
cooled and the cutting fluid, respectively. 

An expression for convection coefficient can be derived based on Nusselt 
number analysis (Astakhov 2006): 







where y cf is the specific weight ( p c f.g ) of the cutting fluid, g is the acceleration due 
to gravity, b is equivalent length, v c f is the kinematic viscosity, v c /is the velocity of 
the cutting fluid, C p _ c f is the specific heat, and k c f is the thermal conductivity. 

Although Eq. 10 can be used to compute the heat transfer coefficient, a more 
general term could be introduce called the cooling intensity, K h , given by 
(Astakhov 2006) 



06 5£ 067 £°. 33 
cf K cf ^ p-cf! cf 
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As seen, the velocity of the cutting fluid v c f affects its cooling ability almost as 
much as its thermal conductivity k c f and much more than its specific heat C p _ c /. 
Cooling intensity of water at 0.2 m/s and 20 °C, soluble oil of 10 %, and compressed 
air at 500 m/s at 20 °C have cooling intensities of 2,715, 1,646, and 1,360, 
respectively. Note that water has excellent cooling intensity and cooling intensity 
of every other fluid could be expressed as a percentage of cooling intensity of water. 

Substantial improvement of the cooling action of the cutting fluid is achieved by 
increasing its velocity, which has a dual effect. On one hand, it increases the 
convection heat transfer coefficient, and on the other hand, high-velocity jets of 
the cutting fluid blow a boundary layer formed on high-temperature surfaces. This 
explains the efficiency of the high-pressure cutting fluid supply which increases the 
velocity of the cutting fluid. Although it has been hypothesized that the high- 
pressure supply of the cutting fluid increases its penetration ability into the tool- 
chip and tool- workpiece interfaces (Mazurkiewicz et al. 1989; Entelis and 
Berlinder 1986), in reality it may not be true because the contact stresses are a 
much higher than the maximum pressure of the cutting fluid. It has been reported in 
the literature that neither the tool life increases appreciably nor the cutting forces 
reduce significantly in cutting with high-pressure coolants (Crafoord et al. 1999; 
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Fig. 9 Convection coefficient (h bv ) as a function of difference between surface and the surface 
temperature and saturation temperature (A 6 vp ) (Astakhov 2006) 

Reznikov and Reznikov 1990). It may be that the high pressure of the cutting fluid 
increases its flow velocity, which, in turn, significantly improves the cooling action 
of this fluid as explained before. 

Cooling Action Due to Evaporation 

To understand the cooling action of the cutting fluid when it evaporates due to the 
contact with hot surface in the machining zone, one should consider the process of 
heat exchange between a solid and a boiling liquid (Chiou et al. 2003). The boiling 
process comprises of formation, growth, and separation of bubbles. 

When the surface temperature is slightly higher than the saturation temperature 
of the fluid, the excess vapor pressure is unlikely to produce bubbles. The locally 
warmed liquid expands and convection current transports it to the liquid-vapor 
interface. The evaporation takes place there at small temperature differences with 
no bubble formation. As the surface temperature increases, nucleate boiling occurs. 
Hence, the excess of vapor pressure over local liquid pressure increases and 
eventually bubbles are formed. These bubbles initiate at nucleating points on the 
hot surface where miniature gas pockets, due to surface defects, form the nucleus 
for the formation of a bubble. Once the bubble is formed, it expands rapidly and the 
temperature of the vapor in the bubble is same as the saturation temperature. The 
buoyancy detaches the bubble from the surface and is replaced by the next bubble. 
Nucleate boiling is accompanied by rapid bubble formation and very high turbu- 
lence which yield very high heat transfer rates. However, the convection coefficient 
is a function of the temperature difference between the hot surface and the satura- 
tion temperature as shown in Fig. 9 on a logarithmic scale. The highest convection 
rates are obtained when the temperature difference is 10-20 °C. 
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Cooling Action of Air-Cutting Fluid Mixture (Mist) 

The cost of cutting fluid is approximately 16-17 % of the operational cost of 
machining. The total cost comprises of the cost associated with purchase, filtration, 
separation, and finally disposal. Due to potential health hazard, disposal costs of 
cutting fluids are higher than the initial cost of these fluids. The laws for occupa- 
tional hazard and hazardous waste disposal, which include cutting fluid usage, 
disposal, and worker protection, are getting stricter day by day. Consequently, the 
use of flood cooling in machining operations is becoming a costly affair, and 
minimum quantity lubrication (MQL) is gradually being considered as an econom- 
ical and environment-friendly option in place of machining operations with flood 
cooling. A detailed description of the MQL systems is given later in section 
“Advanced Topics.” However, the discussion on the cooling mechanism for mist 
is presented here in this section. 

The mist is a two-phase cooling medium having spherical droplets having 
diameter d dr that moves with velocity v air toward a hot surface (Chiou 
et al. 2003) as shown in Fig. 10. It is assumed that the concentration of droplets 
in the unit volume of the mixture is uniform. The temperature of the hot surface is 
0 hs and that of the mixture is 0 mx . The droplet deforms as soon as it meets the 
surface and its contact diameter increases. The diameter of contact is D ct = m ct d dr 
where m ct 1 . 

The thermal energy gained by the droplet during removal of heat from the 
surface initially heats it to the saturation temperature, 0 st , and then causes its 
evaporation. The heating to the saturation temperature can be represented by 
(Astakhov 2006) 


Fig. 10 Interaction of fluid 
droplets with the hot surface 
(Astakhov 2006) 
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where T ]h is the time for heating the bubble, p c f is the density of cutting fluid, and h cf 
is the convection coefficient during heating till saturation. 

The evaporation and boiling can be represented by 



where r 2h is the time required for vaporization of the bubble, h vb is the convection 
coefficient during vaporization, and L v is the latent heat of vaporization. 

Note that Eqs. 12 and 13 only capture the heat transfer because of fluid drops, the 
remaining area, where the drops do not fall, is cooled by air convection. The 

average heat transfer coefficient, h\- 2 includes heating till saturation and boiling 
and takes place over the area A hs _ dr of the total surface area A hs . On the rest of the 
surface (over the area A hs — A hs _ dr ), the heat transfer takes place with the air 
contained in the two-phase mixture. The heat transfer coefficient in this process is 
designated as h air . The cumulative heat transfer coefficient for the two-phase 
mixture is expressed as (Astakhov 2006) 



c—mix 


h\—2^hs—dr 






Cooling Action Due to Heat Pipe Embedded in the Tool 

Heat pipes are sometimes used to cool the cutting tools. Even though the theory of 
heat pipes has been reported in the literature (Faghri 1995; Rebinder 1979), there is 
no simple technique to calculate the cooling action of heat pipes so that they can be 
implemented in practice. The heat pipe shown in Fig. 1 1 comprises of a thick- walled 
hermetically sealed container (3) made up of a high thermal conductivity material. 
This container has specified quantity of volatile working fluid. One end (termed as the 
evaporator) of the container is embedded in the unit to be cooled (1), while the other 
end (often called as the condenser) contains a heat exchanger (radiator) (2) to release 
the thermal energy to the surrounding. Heat applied by a heat source at the evaporator 
region vaporizes the working fluid in that region. The vaporization creates a pressure 
difference that transports the vapor from the evaporator to the condenser where it 
condenses releasing the latent heat of vaporization via the heat exchanger. The 
condensed working liquid goes back to the evaporator. This is a good mechanism 
of transferring heat from a hot region to a cold region. The heat pipe transports large 
amount of heat with a small temperature difference. This technique can be applied 
effectively for cooling of cutting tools. 

The energy balance equation for internal surface of the container is given by 
Eq. 15 which assumes that the container is insulated: 
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Fig. 1 1 Schematic of 
vertical heat pipe (Astakhov 
2006) 



hbl^bl(@hs\ @st) hedged st @hs2 ) (15) 

where h b/ and h cd are the heat transfer coefficients in the boiling and condensation 
regions, respectively; 6 hsX and 0 hs2 are the average temperatures in the boiling and 
condensation zones, respectively; and A bi and A cd are the areas of boiling and 
condensation zones, respectively. 

The left-hand side of this equation is the amount of thermal energy that enters 
from the hot zone through the walls of the container via the evaporation of liquid in 
the “hot” zone, while the right side is that exchanged through the walls in the “cold” 
zone. The contact areas where the A h/ and A cd can be approximated by 
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Equation 18 can be used to determine the heat transfer coefficient in the 
vaporization (boiling) zone: 



if™ p 

where 0i = c bvTr is a factor that depends on the properties of 

b eq-M ywfPwfVwf) 

working liquid at the saturation temperature and the Prandtl number. 
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Penetration of the Cutting Fluid into the Contact Interface 
for Lubrication 

The mechanism of coolant penetration into the contact zone is not very well 
understood, and there is no conclusive evidence to prove that the cutting fluid 
penetrates the contact interfaces (tool-chip and tool- workpiece). Despite the lack of 
physical evidence, various hypotheses exist for the mechanism of penetration of the 
fluid, such as access through capillary network between chip and tool, access 
through the voids connected with built-up edge, and access to gap created by tool 
vibration. The contact pressures between tool and chip are very high, sometimes 
two orders of magnitude higher than the coolant supply pressure; consequently, the 
coolant is unlikely to penetrate the contact interface, especially, under sticking 
friction. Note that the gasket pressures of 17 MPa can ensure no leakage due to 
capillary; in comparison, the contact pressure in the tool-chip interface is much 
higher, so fluid transfer via capillary mechanism to the interface is highly unlikely. 
Some of the experiments carried out to test the penetration of cutting fluid used 
radioisotope T-3 mixed cutting fluid that was used in cutting AISI 1045 steel. The 
analysis after cutting showed little-to-no traces of radio isotopes in the interface. A 
small trace of the radioisotope was observed in the sliding zone, but there was no 
evidence of cutting fluid penetrating in the sticking zone (Astakhov 2006). 

As mentioned previously, the tool-chip interface is characterized by two distinct 
sticking and sliding zones between the tool and the workpiece at high normal 
pressure and temperature. The sticking contact is extreme case of friction. The 
maximum friction that can be encountered in a system is the shear flow stresses of 
the softer material which is the case in the sticking zone. In order to counter this 
extreme frictional resistance, cutting fluids need to be used as lubricants in certain 
machining operations. 

Shaw (1984) also experimentally observed that the cutting fluid does not lubri- 
cate at high speeds. They attributed this behavior to chips carrying the cutting fluid 
away too quickly for it to reach the cutting zone and form a fluid film at the 
interface. In addition, the time is too short for the fluid to chemically react with 
metal surfaces to form a solid-film lubricant or the extreme pressure (EP) additives 
present in the cutting fluid to kick in. Similar observation was reported by Cassin 
and Boothroyd (1965) at high cutting speeds. They suggested that lubrication 
occurs at low speeds by diffusion through the workpiece or that the extreme 
pressure additives within the fluid were able to react to form a boundary layer of 
solid-film lubricant at lower speeds. The primary functions of a cutting fluid are 
considered to be lubricating and cooling; lubrication is the dominant function for 
only low-speed machining operations, e.g., drilling and tapping. It is to be expected 
that a cutting fluid in these operations would reduce the friction at the tool-chip 
interface. However, in drilling and tapping, a significant amount of friction between 
chip and tool occurs in locations other than the rake and flank faces. Frictional 
resistance occurs when the chips attempt to evacuate through the flutes. The chips 
rub against the tool and the wall of the hole, clogging the flutes under certain 
conditions; increasing torque, thrust force, and tool temperature; and occasionally 
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deteriorating the quality of the drilled hole. In these cases, the presence of a cutting 
fluid can reduce the friction between the chips and tool flutes, enabling the smooth 
flow of chips from the hole and preventing chip clogging. Of course, the lubricating 
function of the fluid is a function of penetration of the fluid in the machining zone at 
the bottom of the hole. Furthermore, the geometry and mechanical properties of 
chips produced in drilling and tapping play an important role in the chip clogging 
phenomenon (Haan et al. 1997; Cao and Sutherland 2002). 

To summarize, it can be inferred that the lubrication phenomenon in cutting 
fluids occurs only for low cutting speeds such as found in machining operations like 
drilling and tapping. At higher speeds, the cutting fluid does not provide lubrication 
as it is unable to penetrate the high contact stresses at the tool-chip interface and the 
high speed carries the fluid away from the cutting zone. 


Selection of a Cutting Fluid 

The selection of a cutting fluid depends on many complex factors. The main factors 
of concern are machinability of the material, compatibility (metallurgical, chemi- 
cal, and human), and acceptability (fluid properties, reliability, and stability) 
(Machinability Data Center 1980). 


Machinability 

The selection of the type of cutting fluid for use should be based on the following 
considerations: 

• Type of machining operation 

• W orkpiece material 

• Tool material 

• Process parameters - cutting speed, feed, and depth of cut 

One of the most important factors in selecting a cutting fluid is the severity of the 
cutting operation itself. The various machining processes have different material 
removal mechanisms. The more challenging and severe the operations are, the 
greater the requirements from a cutting fluid are. Hence, the selection is closely tied 
to assessing the severity of the machining operation and then assigning the appro- 
priate cutting fluid. The ascending order of severity in different machining pro- 
cesses is as follows (Machinability Data Center 1980): 

• Grinding 

• Sawing 

• Turning, single-point tools 

• Planning and shaping 

• Milling 
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• Drilling 

• Reaming 

• High-speed, light-feed screw machining 

• Screw machining with form tools 

• B oring 

• Deep-hole drilling 

• Gear cutting 

• Threading 

• T apping 

• External broaching 

• Internal broaching 

These ratings are guidelines, since the variations in the tool geometry, workpiece 
material, and processing conditions will alter the severity of the machining 
operation. 

It may be noted that for heavy-duty machining operations, such as tapping or 
broaching, medium or heavy-duty cutting oils are recommended. The horizontal 
broaching of steel usually requires heavier-bodied with active agents than does 
vertical surface broaching under similar conditions. The heavier oil adheres to the 
horizontal broach better than a water-miscible fluid and active agent aids in efficient 
cutting. For vertical surface broaching of mild steels, emulsions or solutions may be 
used; however, cutting oil is typically used in these applications as well. 

A list of cutting oils in the order of increasing load-bearing capacity is as follows 
(Machinability Data Center 1980): 

• Straight mineral oil 

• Mineral oil with fatty additives 

• Mineral oil with chlorinated additives 

• Mineral oil with sulfurized fatty additives 

• Mineral oil with free sulfur and sulfurized or chlorinated compounds 

Slow speed operations require the lubricating properties of cutting oils. As a rule 
of thumb, cutting oils should be used at cutting speeds below 30 m/min. Cutting oils 
with EP lubricants are effective in a wide variety of machining operations on many 
materials up to speed of 60 m/min. If the speed increases, the penetration of cutting 
fluid in the tool-chip interface becomes difficult. Beyond a speed of 60 m/min, 
chemical action diminishes quickly and becomes virtually nonexistent if the cutting 
speed is increased to 120 m/min. 

Both tapping and threading involve small multiple cutting edges in continuous 
contact with the work throughout the cut. The geometry of the tools and the inherent 
nature of these operations restrict the flow and, therefore, the cooling effect of the 
cutting fluid, to the cutting edges, especially in tapping. 

Drilling can be a challenging operation when done on difficult-to-machine 
materials. Drilling speeds are generally slower than those used for other operations 
because the cutting edge is in continuous contact with the metal when cutting and, 
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therefore, fluid does not reach the cutting edges and they are devoid of beneficial 
cooling action of the cutting fluid. The preferred fluids for conventional drilling 
operations are emulsifiable oils and sulfurized or chlorinated mineral oils which 
could potentially undergo chemical activity to counter the high pressures. These 
fluids provide lubrication to prevent chatter and friction-generated heat while 
transferring the heat generated by chip formation. Oil-hole drills should be used 
as far as possible. 

There are some operations for which oils are specially formulated, e.g., honing 
requires the use of thinner paraffinic base oil. For many high-speed operations, such 
as grinding, turning, and milling with carbide tools, the coolant cannot penetrate the 
tool-chip interface, and consequently, the extreme pressure and anti-weld additives 
are rendered ineffective. Under these conditions, it is best to use water-miscible 
fluids for their excellent cooling capabilities. The gross cooling helps to prevent 
catastrophic tool failure from crater wear and reduce distortion in the workpiece. 

Certain emulsifiable oils and synthetic fluids are formulated especially for 
grinding operations and are used in concentrations of 1 part concentrate in 25-60 
parts water. An increase in the oil in the emulsion from 2.5 % to 10 % can improve 
the grinding ratio and surface finish and results in power saving. It may be noted 
that the grinding ratio is a measure of the volume of material removed per unit 
volume of wheel wear (Machinability Data Center 1980). 

Severe grinding operations, such as form, thread, and plunge grinding, where 
wheel form is a critical factor require the use of cutting oils or emulsifiable oils with 
EP additives. It should be noted that for the low stress grinding technique, the use of 
an oil-based fluid results in less surface distortion. Grinding oils provide better 
lubrication and reduce friction, thereby reducing the heat generated. As a result, 
heavier cuts can be taken and smoother surface finish is obtained. However, the oils 
can compromise the cooling efficiency, resulting in heat buildup in the workpiece 
and smoking/burning. Note that the oil vapors are also problematic. 

To summarize, the cutting and grinding fluids should be selected to provide the 
following: 

Greater lubrication 

• At relatively low speeds 

• On the more difficult-to-machine materials 

• For severe machining operations 

• For better surface finish 
Greater cooling 

• At relatively high speeds 

• On easy-to-machine materials 

• For less-demanding operations 

• Where heat accumulation occurs, specifically, in low thermal conductivity 
materials 

Types of cutting fluids recommended for various work materials and machining 
operations can be referred in ASME handbook. 
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Compatibility Considerations 

One big issue is compatibility of the cutting fluid with the workpiece and the 
machine tool. In addition, the water-miscible coolants should be compatible with 
the water as well. 

Compatibility with Workpiece Material 

The cutting fluid selected for use must be chemically compatible with the material 
being machined. The fluid should not react causing either work surface staining or 
corrosion. On the contrary, it should prevent corrosion of the workpiece during 
machining and handling post machining. Some of the main concerns are listed 
below: 

• Some copper alloys cannot be machined with highly sulfurized cutting oils since 
the active sulfur will stain the machined surface. For this reason, cutting fluids 
containing free sulfur should never be used in the machining of copper and its 
alloys due to their high reactivity. 

• Metals such as aluminum, magnesium, and zinc (also galvanized steel) are prone 
to attack by acids or alkalies, thus causing corrosion. Therefore, selection of a 
cutting fluid for aluminum is determined by the chemical composition of 
the alloy. Active sulfur or chlorine additives may cause staining of some 
aluminum alloys. There are some fluids, however, with proprietary additives 
that improve surface finish and help prevent staining in machining aluminum 
alloys. The use of highly alkaline grinding fluids on aluminum could potentially 
corroded parts. 

• Magnesium and its alloys, for example, should never be machined or ground 
with water-miscible fluids because it is flammable under effect in water. 

• Zinc and galvanized steel should not be machined with nitrite -containing fluids. 

• Oil-rich emulsions with a pH of approximately 8 have been found effective for 
machining these nonferrous metals. Special fluids available for machining cop- 
per, aluminum, and magnesium can be found in the Machining Handbook 
(Machinability Data Center 1980). 

• Ferrous metals corrode when attacked by acids and rust will form as a result. 
Alkalies form a protective layer on most ferrous metals to prevent corrosion. 
Consequently, slightly alkaline cutting fluids for ferrous are preferred to inhibit 
rusting of the machined components and the machine elements as well. 

• Fluid concentration should be monitored and controlled; otherwise, the rust- 
inhibiting properties could be compromised. Some rust inhibitors react with 
oxygen and acids which may be present in the air and water, respectively, and 
lose its efficacy. 

• Additional care should be exercised while selecting cutting fluids used for 
machining components which are designed for high contact pressure or high 
stresses in service or which are exposed to unusual/harsh environments. For such 
parts, cutting fluids should be used which will not make the part susceptible to 
stress corrosion attack in the event that there is a residual surface film of the 
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cutting fluid on the workpiece. At present, the only way of selecting such cutting 
fluids is to choose them on the basis of extensive laboratory tests and/or service 
experience. An alternative is to ensure thorough cleaning to eliminate any trace 
of the cutting fluid from the machined surface. 

• Cutting fluid selection can have a dramatic impact on the part quality and the 
surface integrity during grinding. Grinding is carried out with water-miscible 
fluids, straight oils, and dry. The use of cutting fluid governs the cooling action, 
quality of lubrication, and possibly some chemical action. Water-miscible fluids 
have greater cooling capability but provide lower lubrication than the straight 
oils, but they induce higher thermal shock on the workpiece. Dry grinding 
conditions are not usually compatible with low stress grinding or high surface 
integrity. It may be noted that lubrication is the most important function of the 
grinding fluid for surface integrity, specifically, for controlling the residual 
stresses. At times, the difference between the performance of emulsions and 
straight oils is not easily discernible at higher infeed rates or higher wheel 
speeds. The effect of infeed is more pronounced and tends to mask the effects 
of the fluids. As noted previously, grinding oils become increasingly effective at 
lower wheel speeds (below 20 m/s). Typically, the grinding oils are superior for 
high integrity grinding as they provide good lubrication. 

Compatibility with Water 

Water-miscible fluids must be compatible with the water in which they are mixed. 
Water quality is perhaps the most important single factor affecting fluid life. The 
water chemistry and its microbial load determine the cooling effectiveness and the 
life of the coolant. The best cooling action can be obtained from very fine emulsions 
are formed due to suspension of oil in water, but such emulsions are not possible if 
the water has high hardness or dissolved/suspended solids. Many things are affected 
by the hardness level or pH of the water, such as rate of biological growth, rust 
protection, evaporative effect, emulsion stability, and foaming. A high chloride or 
sulfate content can hamper the rust protection capability of the coolant formulation. 
Hard water (over 200 ppm) can break some emulsions, resulting in scum formation 
which leads to subpar performance. Soft water (under 100 ppm) often produces 
excessive foam, especially under conditions of high agitation (Machinability Data 
Center 1980). 

Compatibility with the Machine Tool 

The cutting fluid must also be compatible with the machine tool being used for 
machining. The use of water-miscible fluids with most machines can pose prob- 
lems. Usually, the drive mechanisms and the feed mechanisms are placed away 
from the cutting zone, with their own lubrications system and sealed appropriately 
against the seepage of water- containing cutting fluids into the lubrication system. 
With some of the more complex machine tools, however, like automatic and 
semiautomatic lathes and gear cutting and broaching machines, emulsions should 
be avoided because of the moving parts near the cutting zone. Under such condi- 
tions, the use of straight oil is desirable. 
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Compatibility with Operator Health 

Another important consideration in selecting cutting fluids is acceptability to the 
operator. Fluids designed with the operator in mind will be clean to use, pleasant 
and safe to breathe, nontoxic, and free from ingredients harsh to human skin. 
Bacterial growth in the coolant should be controllable before and during use should 
not result in dermatitis. Fluid clarity makes it easier for the operator to see the work 
while it is being machined. A pleasant-smelling, generally clean, “nonsticky” fluid 
helps to make an operators job more pleasant and productive. The main concerns 
are toxicity, dermatitis, respiratory disorder, microbial infections, and cancer. 

It is known that short- or long-term exposure to cutting fluids which contain 
biocides and fungicides can be harmful to the operator health. Toxic reactions may 
occur through accidental splashes, inhalation of mists, and transmission through 
skin cuts. Coolants suppliers must provide material safety data sheet (MSDS). 
These documentations provide information about the permissible levels of exposure 
and associated safety measures. Incidence of dermatitis has been observed between 
0.3 % and 1 % of the machinists in the USA (Stephenson and Agapiou 2010). 
Dermatitis can be prevented by using right concentration of biocide, using gloves, 
using low pH soap, and using hand creams. Growth of microbes in the coolant 
system can result in microbial infections; therefore, sumps should be monitored for 
microbial growth and not be allowed to stagnate for long periods of time. 

Respiratory disorders can stem from prolonged exposure to cutting fluid mists. 
These respiratory disorders include cough, asthma, bronchitis, and breathing prob- 
lems. These conditions persist and worsen with time, and then it is termed as 
occupational asthma. Another problem which is encountered is hypersensitive 
pneumonitis. This is seen in a small percentage of operators who become allergic 
to some component of the mist, and it gets aggravated with further exposure. 
Prolonged exposure to the allergic components can result in pulmonary fibrosis or 
build up in the lungs of the operator. Long-term exposure to cutting fluids, espe- 
cially, with straight mineral oils in grinding where the mist generation is more is 
also injurious to the health of the operator. Some of the older oils and carbon 
tetrachloride used in the past has now been classified as a human carcinogen by 
EPA and NIOSH. 

Economic Considerations 

It may be noted that the cost associated with coolants and disposal can be 16 % of 
the total machining cost. Consequently economic considerations are a must while 
selecting the cutting fluid. However, while calculating the cost, the cost/piece 
should be considered as opposed to the initial price of the cutting fluid. In the 
overall economic analysis, both the costs associated with the cutting fluid and 
costs affected by the cutting fluid, such as effect on tool, wheels, and downtime, 
should be accounted for. The costs associated with the cutting fluids are as follows: 
inventory, fluid changing cost, price differentials, maintenance, and disposal cost. 
The recycling of cutting fluids can decrease the disposal costs of the cutting fluids. 
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Advanced Topics 

The advanced topics in cooling and lubrication deal with dry machining, MQL, and 
cryogenic machining. 


Dry Machining 

Dry machining can provide cost advantage and machine-tool flexibility if big 
sumps are not required. Some processes can be easily carried out even without a 
cutting fluid, e.g., cast iron can be readily machined without a coolant if the tool is 
oriented properly for easy chip evacuation. The aluminum parts can be turned and 
milled with poly crystalline diamond (PCD) tools without any problems. Special 
low-friction tool coatings and machine architecture can help the dry machining 
further. 


Minimum Quantity Lubrication 

MQL machining is environmentally friendly and can give good results under 
normal machining conditions. However, these can be sometimes less effective 
when higher material removal rate and better surface quality/integrity finish are 
required. In these circumstances, near-dry machining operations utilizing very 
small amounts of cutting lubricants are very effective and are increasingly being 
used in a number of practical applications. An MQL process is performed with a 
transport medium, such as air, and a pump supplies the tool with the fluid in the 
form of droplets. Sometimes this concept of minimum quantity lubrication is also 
termed as near-dry lubrication or micro-lubrication. Minimum quantity lubrication 
(MQL) refers to the use of only a minute amount of cutting fluids typically between 
10 and 50 ml of cutting fluid per machine but it can be as high as 
500 ml/h. Generally, straight oils are used for MQL, but emulsions can be used if 
better cooling is required (Adler et al.; Khan and Mithu 2009). 

The concept of MQL has been suggested for over a decade as a means of 
addressing the issues of environmental concerns and occupational hazards associ- 
ated with the airborne cutting fluid particles on factory shop floors. The exposure to 
cutting fluids can result skin and respiratory ailments. The large quantities of 
cutting fluids used in flood cooling system have very high cost of disposal due to 
potential environmental hazard. As mentioned previously, the total cost of coolants 
in flood cooling system can be up to 17 % of the total machining costs. The cost of 
disposal/recycling can exceed the initial cost of the coolant under certain 
conditions. 

The effect of type of cutting fluid used in MQL machining has not been explored 
fully. Stabler et al. (2003) and Krahenbuhl (2005) suggested that certain types of 
fluids are not preferred for minimum quantity lubrication, such as water-mixed 
cooling lubricants and lubricants with EP additives. Some recent studies (Khan and 
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Mithu 2009; Stabler et al. 2003) have suggested that vegetable oil is a viable 
alternative from performance, cost, health, safety, and environment perspectives. 


Cryogenic Machining 

High-temperature alloys having high melting temperatures and good thermal creep 
properties are increasingly gaining importance as they find application in hot zone 
components of gas turbines. These superalloys can be divided into four major catego- 
ries: nickel-based alloys, cobalt-based alloys, iron-based alloys (e.g., high chromium 
stainless steel), titanium-based alloys and tungsten alloys. The ability to retain mechan- 
ical strength and chemical/metallurgical integrity at elevated temperatures renders 
these superalloys as ideal candidates for use in both rotating and stationary components 
in the hot end of jet engines. Conventional flood cooling methods are not very effective 
in terms of decreasing the cutting temperature and improving environmental sustain- 
ability. The conventional approach is gradually being replaced by dry and near-dry, and 
for these high-temperature alloys, cryogenic machining is a viable option. 

Cryogenic machining presents a method of cooling the cutting tool and/or part 
during the machining process. It involves delivering of cryogenic coolant, typically, 
liquid nitrogen (instead of an oil-based cutting fluid) to the local cutting region of 
the cutting tool, which is exposed to the highest temperature during the machining 
process, or to the part in order to change the material characteristics and improve 
machining performance (Pusavec et al. 2009). Liquid nitrogen cools the cutting 
zone and immediately evaporates and returns back to the atmosphere, leaving no 
residue to contaminate the part, chips, machine tool, or operator. Hence, it elimi- 
nates disposal costs related to the conventional straight oils or emulsions. The 
potential benefits of cryogenic machining are (Pusavec et al. 2009): 

• Sustainable machining methods (cleaner, safer, environment friendly, and 
nontoxic) 

• Increase of material removal rate without associated increase of the tool wear 

• Increase in tool life due to lower abrasion and chemical wear 

• Bulk machining of hard parts and difficult-to-machine alloys, which were 
previously possible only via expensive grinding operations 

• Improvement in surface quality/integrity 

It is expected that due to abovementioned advantages, the cryogenic machining 
will gain more uses in machining of high-temperature alloys. 


Summary 

One of the most important considerations during machining is use of the right 
cutting fluid based on the machining operation. The application technique and the 
flow requirement is again a function of the operation. The coolant flow rates and 
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delivery pressures can vary from 20 to 2,000 1/min and 15 to 100 bar. These 
variations necessitate special designs for nozzles and filters. The other important 
aspect is the selection of the cutting fluid which is based on following key 
considerations: machinability, compatibility, and cost-effectiveness. The machin- 
ability defines the functional requirement from the cutting fluid: either greater 
cooling or greater lubrication. The severity of the machining operation can be 
countered by chlorine-/sulfur-based additives in the mineral oil which enhances 
its load-bearing capacity. The compatibility conditions are crucial; it should be 
nonreactive and noncorrosive to the workpiece and the machine tool. The compat- 
ibility with respect to the operator health is of paramount importance as some of the 
cutting fluids can pose respiratory and skin health hazards which have to be avoided 
at any cost. Due to the stringent environmental laws, the cost of safe disposal of the 
cutting fluid has to be accounted for in the total machining cost. Due to these issues, 
active research is being carried out in the area of dry machining and minimum 
quantity lubrication with environmentally friendly vegetable oils as opposed to 
mineral or synthetic oils. Cryogenic machining with liquid nitrogen has also been 
found effective in machining aerospace alloys. 
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Abstract 

Fixed abrasive machining, which was considered a finishing operation involving 
low rates of removal, has evolved as a major competitor to cutting. As the 
primary fixed abrasive machining means, the borders between grinding and 
other operations such as superfinishing, lapping, polishing, and flat honing are 
no longer distinct. Machining with grinding wheels extends from high-removal- 
rate processes to the domains of ultrahigh accuracy and superfinishing. This 
chapter presents the fixed abrasive machining technology in fundamental and 
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application terms. The topics cover a range of fixed abrasive machining process 
with grinding wheels, grinding parameters, and application technology in grind- 
ing ductile and brittle materials. The aim is to present a unified approach to 
machining with grinding wheels that will be useful in solving new grinding 
problems of the future. It should be of value to engineers and technicians 
involved in solving problems in industry and to those doing research on machin- 
ing with fixed abrasive machining in universities and research organizations. 


Introduction of Fixed Abrasive Machining Process 

Fixed abrasive machining is the common collective name for machining processes 
which utilize the fixed hard abrasive particles as the cutting medium. In particular, 
grinding with fixed abrasive tools is the one of the most important means of fixed 
abrasive machining. The grinding process of shaping materials is probably the 
oldest in existence, dating from the time prehistoric man found that he could 
sharpen his tools by rubbing them against gritty rocks. Nowadays, grinding is a 
major manufacturing process which accounts for about 20-25 % of the total 
expenditures on machining operations in industrialized countries. 

In general, within the spectrum of machining process, the uniqueness of grinding 
is found in its cutting tool. Grinding wheels and tools are generally composed of two 
materials - tiny abrasive particles called grains or grits, which do the cutting, and a 
softer bonding agent to hold the countless abrasive grains together in a solid mass. 
Each abrasive grain is a potential microscopic cutting tool. The grinding process 
uses thousands of abrasive cutting points simultaneously and millions continually. 

Grinding is traditionally regarded as final machining process in the production of 
components requiring smooth surfaces and fine tolerances. There is no process 
which can compete with grinding for most precision machining operation, but the 
process is far from being confined to this type of work. More abrasive is actually 
consumed by heavy-duty grinding operations, where the objective is to remove 
materials as quickly and efficiently as possible with little concern for surface 
quality. Grinding is as essential for delicate precision slicing of silicon wafers for 
microelectronic circuits using paper-thin abrasive disks or saws only 20 pm thick, 
as it is for the heavy-duty conditioning and cleaning of billets and blooms in 

Q 

foundries and steel mills at removal rates of around 1,600 cm per min with 
220 kW machines (Malkin and Guo 2008). 

There are numerous types of grinding operations which vary according to the 
shape of the wheel and the kinematic motions of the workpiece and wheelhead. 
Some of the more common ones for machining flat and cylindrical surfaces are 
displayed in Fig. 1. More complex machines are used to generate other shapes. Any 
of these processes may be applied to fine finishing, to large-scale stock removal, or 
to a host of tasks between these extremes. 

Another area where grinding is virtually unchallenged is for machining of 
materials which, because of their extreme hardness or brittleness, cannot be effi- 
ciently shaped by other methods. In the production of hardened steel components, 
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Fig. 1 Illustration of some 
common grinding operations 
for machining flat and 
cylindrical surface. 

(a) Traverse surface grinding. 

(b) Plunge surface grinding. 

(c) Traverse cylindrical 
grinding, (d) Traverse 
internal grinding 



Traverse surface grinding 



Plunge surface grinding 



Traverse cylindrical grinding 



Traverse internal grinding 
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such as cutting tools and rolling bearing rings, grinding can be performed on either 
the annealed or the hardened steel, often with comparable ease, whereas other 
machining methods are usually restricted to the annealed material. The machining 
of nonmetallic brittle materials, including ceramic, cemented carbides, and glasses, 
is almost exclusively dependent on fixed abrasive processes. 

Basic Parameters and Tools of Fixed Abrasive Machining 

Grinding, as the typical machining process of the fixed abrasive machining, is often 
considered somewhat of a “black art” where wheel life and cycle times cannot be 
determined from standard tables and charts. Certainly, precision grinding, being a 
finishing process with chip formation at submicron dimensions occurring by extru- 
sion created at cutting edges with extreme negative rake angles, is prone to process 
variability such as chatter, system instability, coolant inconsistency, etc. Neverthe- 
less, with grinding equipment in a competent state of repair, performance can be 
controlled and predicted within an acceptable range. Importantly, rules and guide- 
lines are readily available to the end user to modify a process to allow for system 
changes. It is also essential to ensure surface quality of the parts produced. These 
objectives are balanced through an analysis of costs as described in subsequent 
chapters on economics and on centerless grinding. The importance of the grinding 
parameters presented below is to provide an understanding of how process adjust- 
ments change wheel performance, cycle time, and part quality. 


Process Parameters of Fixed Abrasive Process 


Uncut Chip Thickness or Grain Penetration Depth 

The starting point for any discussion on grinding parameters is “uncut chip thick- 
ness,” h cu , as this provides the basis for predictions of roughness, power, and wear 
(Shaw 1996). Uncut chip calculations are typically based on representations of the 
material removed in the grind process as a long, slender, triangular shape with a 
mean thickness, h cu . However, a more practical way of looking at this parameter is 
to think of h cu as representing the depth of abrasive grain penetration into the work 
material. In fact, this parameter is often termed the grain penetration depth. The 
magnitude of h cu may be calculated from the various standard parameters for 
grinding and the surface morphology of the wheel: 






where is wheel speed, v w is work speed, a e is depth of cut, d e is equivalent wheel 
diameter, C is active grain density, and r is grain cutting point shape factor. 
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Other useful measures of grain penetration include equivalent chip thickness 
h eq = a e 'vjv s . However, equivalent chip thickness takes no account of the spacing 
of the grains in the wheel surface. 

Wheel Speed 

Wheel speed, v 5 , is given in either meters/second (m/s) or surface feet per minute 
(sfpm). To convert the former to the latter, use a rule of thumb multiplication factor 
of approximately 200 (or 196.85 to be precise). 

Work Speed 

Work speed, v w , is a term most typically applied to cylindrical grinding; equivalent 
terms for surface grinding are either traverse speed or table speed. 

Depth of Cut 

Depth of cut, a p , is the depth of work material removed per revolution or table pass. 

Equivalent Wheel Diameter 

Equivalent wheel diameter, d e , is a parameter that takes into account the conformity 
of the wheel and the workpiece in cylindrical grinding and gives the equivalent 
wheel diameter for the same contact length in a surface grinding application (i.e., 
d e — » d s as d w — » oo). The plus sign is for external cylindrical grinding, while the 
negative sign is for internal cylindrical grinding: 

j ds x d w 

e ~d s ±d w [ j 

d w is workpiece part diameter and d s is wheel diameter. 

Active Grain Density 

Active grain density, C, is the number of active cutting points per unit area on the 
wheel surface. 

Grain Shape Factor 

Grain shape factor, r, is the ratio of chip width to chip thickness. In most discussions 
of precision grinding, the product Cr is considered as a single factor that can be 
somewhat affected by dress conditions but, under stable grinding conditions, that is, 
with a fixed or limited range of dress conditions, can be considered as a constant for 
a given wheel specification. 

There are several key parameters that research has shown to be directly depen- 
dent on h cu : 



v 


W 


1 


a 


v c Cr d 


0.85 


( 3 ) 
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Force per Grain 

Grain retention is directly related to the forces experienced by the grain, and these 
forces increase with uncut chip thickness. It can be seen that for a constant stock 
removal rate ( a e • v w ), forces are lower at large depth of cut and low table speed. 
Hence, a softer grade might be used for creep feed rather than for reciprocated 
surface grinding. A softer grade has a better self-sharpening action and reduces 
grinding forces. 

Wheel wear can accelerate as a wheel diameter gets smaller and force/grain 
increases. 

Specific Grinding Energy 

Specific grinding energy, e c (or u in older publications), is the energy that must be 
expended to remove a unit volume of workpiece material. The units are usually 
J/mm or in.lb/in. ; conversion from metric to English requires a multiplication 
factor of 1.45 x 10 5 . Analysis of the energy to create chips leads to the following 
relationship between e c and h cu \ 



where n = 1 for precision grinding. 

The relationship is logical insofar as it takes more energy to make smaller chips 
but is valid only so long as chip formation is the dominant source. Hahn and Malkin 
show that in many cases, especially in fine-grinding or low metal removal rates, 
significant energy is consumed by rubbing and ploughing. 

Specific Removal Rate 

Specific removal rate, Q' or Q' w , is defined as the metal removal rate of the 
workpiece per unit width of wheel contact, Q' = a p • v w . The units are either 

o o 

mm /mm/s or in. /in./min. To convert from the former to the latter requires a rule 
of thumb multiplication factor of approximately 0.1 (or 0.1075 to be precise). 

For very low values of Q’ , rubbing and ploughing dominate, but as Q’ increases 
so does the proportion of energy consumed in chip formation. More to the point, the 
energy consumed by rubbing and ploughing remains constant, thereby becoming a 
smaller proportion of the total energy consumed as stock removal rates increase 
(Heinzel and Bleil 2007), as illustrated in Fig. 2. 

Chip formation dominates in high-removal-rate precision applications such as 
camlobe grinding or peel grinding with vitrified CBN or rough grinding with plated 
CBN. Under these circumstances is a good predictor of performance: 



( 5 ) 
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Fig. 2 Specific grinding 
energy depending on the 
specific material removal rate 
(Heinzel and Bleil 2007) 
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Grinding Power 

Grinding power, P, can be estimated from the specific grinding energy, e c , using the 
equation 





where b w is the width of grind. 

Tangential Grinding Force 

Tangential grinding force, F t , may then be calculated from 

P e c 'Q' -b w 

Ft = — = 

V s V s 



Normal Grinding Force 

Normal grinding force, F n , is related to the tangential grinding force by the 
coefficient of grinding, a parameter defined in a similar way to friction coefficient. 
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Coefficient of Grinding 

Coefficient of grinding is where 


( 8 ) 

* n 

The value for // can vary from as little as 0.2 for low stock removal applications 
for grinding hard steels and ceramics to as high as 0.8 in very high stock removal 
applications such as peel grinding, or grinding soft steels or gray cast iron. Coolant 
can also have a major impact on the value as a result of the hydrodynamic pressure 
created by high wheel speeds. The effect is particularly noticeable with high- 
viscosity straight oils. Typical precision-grinding applications on steels have values 
of // in the range of 0.25-0.5. 

Since tangential force can be readily calculated from power but not from normal 
force, knowledge of // is particularly useful to calculate required system stiffness, 
work holding requirements, chuck stiffness, etc. For most precision production grind- 
ing processes with hardened steel or cast iron, it can be seen that // tends to a value of 
about 0.3. Note, however, that these numbers are for flat profile wheels in a straight 
plunge mode. If a profile is added to a wheel or the angle of approach is changed from 
90 () , then allowance must be made for increased normal forces and for side forces. 


Surface Roughness 

Surface roughness, not surprisingly, is closely related to uncut chip thickness: 







R t Roughness 

R t roughness is the SI parameter for maximum surface roughness, the maximum 
difference between peak height and valley depth within the sampling length. 
As a first approximation, R t is independent of depth of cut but is dependent on 

C • r, and d e . The relationship between surface roughness and specific 
grinding energy can also be readily obtained by direct substitution. 

R t is but one of several measures of surface roughness. Two other common 
roughness standards are R a roughness and R z roughness. 


v w , v 5 , 


R a Roughness 

R a roughness is the arithmetic average of all profile ordinates from a mean line 
within a sampling length after filtering out form deviations. 


R z Roughness 

R z roughness is the arithmetic average of maximum peak-to-valley readings over 
five adjacent individual samplings lengths. R t and R z values are much larger than R a 
roughness values for measurements from the same surface. 

Two other parameters related to surfaces, especially those used for rubbing 
contact, are defined as follows: 
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Short filter cutoff, for removing noise 
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Comparison rule, “max” for 100%, “16%” for 116% 
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Fig. 3 How to specify a surface finish on an engineering drawing 


Material or Bearing Ratio 

Material or bearing ratio, t p , is the proportion of bearing surface at a depth p below 
the highest peak. 

Peak Count 

Peak count, P c , is the number of local peaks that project through a given band height. 
t p is less for grinding than for other operations such as honing, although it can be 
improved to some extent by a two-stage rough-and-finish grind with wheels of very 
different grain size. P c can be controlled somewhat by adjusting dress parameters. 

Factors That Affect Roughness Measurements 

Relative values between different roughness systems will vary by up to 20 % 
depending on the metal-cutting process by which they were generated. Even 
when considering just grinding, the abrasive type can alter the ratio of R z to R a , 
CBN often giving a higher ratio to alumina. This difference also shows up 
cosmetically when looking visually at surfaces ground with alumina or CBN. 
When changing from lapping to fine grinding, the change in appearance of the 
finish can be dramatic, changing from a matt-pitted surface to a shiny but scratched 
surface, both of which have comparable surface roughness values. The type 
of grinding process will also affect the appearance in terms of the grind line 
pattern. 

Roughness Specifications on Drawings 

In the United States, surface finish is usually specified using the ASME Y14.36M 
standard (Fig. 3). The other common standard is International Organization for 
Standardization (ISO) 1302. 

Stock Removal Parameter 

A is defined as the ratio between stock removal rate and normal force: 





A is an indicator of the sharpness of the wheel but is limited by the fact that it must be 
defined for each wheel speed and removal rate. The second factor is decay constant t. 
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Decay Constant r 

When the infeed reaches its final feed point, the grinding force F will change with 
time t as the system relaxes according to the equation 


F = F 0 e~ ,/r 


( 11 ) 


F t and power are directly related; therefore, r can be determined from a log plot 
of the decay inpower during spark-out. After 3 t, virtually all grinding has ceased, 
preventing any improvement inpart tolerance, while roughness, as shown above, 
will not improve further. Consequently, spark-out times in internal grinding should 
be limited to no more than 3 t. 

G-Ratio 

G-ratio is used as the primary measure of wheel wear. This is defined as 

G-ratio = Volume of material ground per unit wheel width/Volume of wheel 
worn per unit wheel width 

G-ratio is dimensionless with values that can vary from < 1 for some soft alox 
creep feed vitrified wheels to as high as 100,000 for vitrified CBN wheels. G-ratio 
will fall linearly with increases in Q' accelerating to an exponential drop as the 
maximum metal removal rate for the wheel structure is reached. 

P-Ratio 

P-ratio is a closely related index that has started to be used as an alternative to 
G-ratio for plated superabrasive wheels. 

P-ratio = Volume of metal ground per unit area of wheel surface 
This allows for the fact that it is hard to define a wear depth on a plated wheel. 
P-ratio usually has the dimensions of (mm /mm ). For high-speed high stock 
removal applications in oil-cooled grinding crankshafts, for example, P-ratio values 
have reached 25,000 mm /mnr. Since the usable layer depth on a plated wheel is 
only at most about 0. 1 mm, a P value of 25,000 mm corresponds to a G-ratio greater 
than or equal to 250,000. 

Contact Length 

l c is the length of the grinding contact zone and is the length over which the heat 
input to the workpiece is spread. The contact length is approximately equal to the 
geometric contact length for rigid metal bond wheels. 

Geometric Contact Length 




( 12 ) 


Real Contact Length 

The real contact length is typically twice this value for more elastic vitrified wheels. 
Marinescu et al. (2004) show that 
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t = \AM <13) 

where 

,2 8 -Rj-F' n -de 

f % -E* 

gives the contribution to the contact length due to elastic deflection between the 
abrasive and the workpiece due to the normal grinding force. This deflection is 
increased for rough surfaces such as an abrasive wheel. A typical value for the 
roughness factor is R r ~ 5. The combined elastic modulus for the workpiece and 
abrasive materials is given by 




Grinding Temperatures 
Surface Temperature T 

Prediction of grinding temperatures and the avoidance of bum are critical to 
grinding quality. Numerous calculations modeling the partition of heat between 
the elements in the grind zone have been developed over the last 50 years. For 
example, the maximum temperature of the workpiece is usually based on an 
original paper by Jaeger (1942) and on the principles of moving heat sources 
described by Carslaw and Jaeger (1959). Heat partitioning is described in depth 
by Marinescu et al. (2004). The following simple version suffices to illustrate the 
key factors governing maximum surface temperature. 

Maximum Workpiece Surface Temperature 

5 

The maximum surface temperature depends on the grinding power (/vv s ), the 
grinding speeds, and material parameters: 



where the thermal parameters that affect grinding temperature are the C max factor; 
the transient thermal property, /? w ; and the workpiece partition ratio, R w . 


The C max Factor 

This is a constant that gives the maximum temperature. The value is approximately 
equal to 1 for conventional grinding. The value is reduced for deep grinding. Rowe 
and Jin (2001) give charts of C values for maximum temperature and for finish 
surface temperature. 
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The Transient Thermal Property ft w 

The transient thermal property of /i w of the workpiece material is given by 

P w = \Jk-p-c (17) 

where k = thermal conductivity, p = density, and c = heat capacity. 

Workpiece Partition Ratio R w 

Workpiece partition ratio R w is the proportion of the grinding energy that is 
conducted into the workpiece. The work partition ratio is a complex function of 
the wheel grain conductivity and sharpness and of the workpiece thermal property. 
Ignoring, for the present, coolant convection and convection by the grinding chips, 
R w approximates to R ws . Hahn (1962) modeled heat transfer between a sliding grain 
and a workpiece. It can be shown that 



k g is the thermal conductivity of the abrasive grain and is the contact radius of 
the grain. R ws is relatively insensitive to variations of r 0 . Typically, R ws for 
conventional grinding varies between 0.7 and 0.9 for vitrified wheels and between 
0.4 and 0.6 for CBN wheels. 


Effect of Grinding Variables on Temperature 

The temperature equation for conventional grinding can, therefore, be very approx- 
imately reduced for a given wheel/work/machine configuration to 



from which it follows that increasing the wheel speed, increasing the depth of cut, 
or increasing the number of active cutting edges (e.g., by dull dressing) will 
increase the surface temperatures. Further discussion of temperatures generated 
when grinding at very high wheel speeds is made in a later chapter. 


Heat Convection by Coolant and Chip 

A note of caution should be sounded for deep grinding where the long contact 
length allows substantial convective cooling from the grinding coolant. Also in 
high-rate grinding with low specific energy, the heat taken away by the grinding 
chips reduces maximum temperature very substantially. 

Allowance can be made for convective cooling by subtracting the heat taken 
away by the coolant and chips. Allowance for convective cooling is essential for 
creep grinding as shown by Andrew et al. (1985). It has also been found important 
for other high-efficiency deep-grinding processes as employed for drill flute grind- 
ing, crankshaft grinding, and cutoff grinding. If allowance is not made for convec- 
tive cooling, the temperatures are very greatly overestimated. 
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The maximum temperature equation modified to allow for convective cooling 
has the form 



max 


F,v s 


P'C'Tnm ' U e 'Vw 




c 




ws ^ max 


mp 

+F* 


f’h 



where T mp is a temperature approaching the melting point of the workpiece mate- 
rial. For steels, the material is very soft at 1,400 °C, and this temperature gives a 
reasonable estimate for the chip convection term. 

hf is the coolant convection coefficient that applies as long as the maximum 
temperature does not cause the fluid to burn out in the grinding zone. If burnout 
occurs, the convection coefficient is assumed to be zero. Burnout is a common 
condition in grinding but should be avoided in creep grinding and for low-stress 
grinding. Values estimated for convection coefficient when grinding with efficient 
fluid delivery are 290,000 W/m 2 .K for emulsions and 23,000 W/m 2 .K for oil. 


Control of Thermal Damage 

An increasingly popular approach to control thermal damage has been developed 
with literature examples of its application in industry by General Motors on cast 
iron, hardened steel, and Inconel to impose a limit on grinding temperatures. 
Malkin and Guo (2008) provides the maximum allowable specific grinding energy 
for a given maximum temperature rise as 


e c = A + C-T mm -(d l J 4 -a; 3 ' 4 (21) 

A and C are constants based on the thermal conductivity and diffusivity proper- 
ties of the workpiece and wheel. A series of tests are made for different values of a e , 
v w , and d e and the workpieces analyzed for bum. Plotting these on a graph of e c 
against d e x,4 a~ 3/4 v w ~ 1/2 establishes the slope CT max and intercept A. 

The method is illustrated schematically in Fig. 4. In an industrial situation, a power 
meter is used to monitor specific energy values. If the specific energy values exceed 
the threshold level for bum, it is necessary to take corrective action to the process. This 
can mean redressing the wheel or making some other process change such as reducing 
the depth of cut, increasing the work speed, or using a different grinding wheel. 


Tools for Fixed Abrasive Machining 
Grain Material 

During grinding, material removal is caused by the interaction of the grain material and 
the target material. The following demands are placed on the abrasive grain material: 

• A high degree of hardness and toughness in order to facilitate chip formation and 
to maintain cutting edge sharpness over a longer period 
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Fig. 4 Specific grinding 
energy at threshold of burn 
(Malkin and Guo 2007) 



• High thermal (change) resistance, so that the grain can withstand both the high 
machining temperatures and rapid temperature changes 

• Chemical resistance in order to avoid grain- weakening chemical reactions even 
at higher pressures and temperatures when interacting with the air, cooling 
lubricant, or workpiece material 

Furthermore, varying requirements are made of the grain material with respect 
to fracture behavior, depending on the grinding process at hand. Since no 
grain material can fulfill all these demands to the same extent, numerous 
natural and synthetic grain materials are utilized for diverse machining tasks, 
i.e., corundum (A1 2 0 3 ), silicon carbide (SiC), cubic boron nitride (CBN), and 
diamond. 

Bonds Material 

According to DIN 69 111, for all fixed abrasive tools, individual grains are 
bonded with each other and with the backing material. In practice, all components 
of the grinding wheel, with the exception of the abrasive grains, are comprised 
under the designation of “bond.” Bonds have the task of adhering to the grains until 
they are dulled by the grinding process. Then the bonds should release the grains so 
that succeeding, sharp grains can be applied. The holding function is only fulfilled 
if the bond material is strong enough and the bond web exhibits cross-sectional 
areas of sufficient size. In addition, the bond must have pores so that there 
is enough space for the chipped material and cooling lubricant. The essential 
properties of a grinding wheel can be adjusted by the composition of the 
bond, the volumetric amounts of the bond components, and the manufacturing 
process. 

Primarily, the differentiation is between synthetic resin and vitrified and metal 
bonds. These must be customized for the respective application and have numerous 
variations. 
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1. Resin Bonds 

The resin bond consists of a synthetic resin or synthetic resin combination with 
or without fillers. For particular grinding wheel types, applications, or different 
fabrication methods, there are many resins available. In the case of the dry mix 
method, phenol resol is used for the grain wetting as well as a phenol resin 
powder based on novolak hexamethylenetetramine. Despite the variety of syn- 
thetic resins available today, phenol resins or phenol plasters are still the most 
common bond components for grinding wheels bonded with synthetic resin. 
Grinding wheels manufactured with synthetic resin bonds are insensitive to 
impact or shock as well as lateral pressure. In addition, they are allowed for 
high rotational speeds and chipping volumes for abrasive cutters and rough 
grinders and permitted high surface qualities by means of a high elasticity in 
the case of polishing and smoothing disks. 

Consequently, the primary areas of use for grinding wheels bonded with 
synthetic resin are abrasive cutting and rough grinding. 

2. Vitrified Bonds 

Vitrified bonds are formed by mixtures from the natural silicates red and white 
clay, kaolin and feldspar, as well as quartz and, as an additive, frits. 

Frits are glassy, previously melted, and pulverized organic and inorganic 
mixes which serve as fluxing agents and give the vitrified bond certain proper- 
ties. Among other things, they help lower the firing temperature of the bond 
mixture. For grinding wheels, these mixtures are composed mostly of boron 
silicates or glass containing magnesium. Especially with frits containing boron, 
the propensity of borax, as in the hard soldering of metals, to form molten 
masses with metal oxides at low temperatures (these masses then congealing 
into glassy state after cooling) is evident. 

The usual components and their volumetric portions determine the strength of 
the bond bridges. All components are prepared, after checking the raw material, 
by crushing, milling, sifting, and air sifting. Mixture proportions are varied in 
accordance with the desired target properties. The bonds must be attuned to the 
respective machining conditions and the abrasive grain material in use. In view 
of this, one should distinguish according to the proportion of the glass phase in 
the bond mixture: 


• Fused bonds with a high amount of glass phase 

• Bonds with a medium amount of glass phase 

• Sintered bonds with a small amount of glass phase 

In fused bonds, grain bonding occurs by means of a glass phase with low 
viscosity. In sintered bonds, adhesive force is a result of rearrangement or solid 
reactions. The sintering and melting points of the bond mixture are essential, 
characteristic values for every bond. They determine the firing temperature and 
the process behavior of the abrasives. 

Moreover, the bond should be made to conform to the abrasive’s 


properties. Certain sintered corundum materials, for example, are attacked by 
alkaline silicates. Besides a diffusion of sodium and silicon in the interior of the 
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grain, crystal growth transpires when exposed to higher temperatures. This 
interferes with the self- sharpening properties of the abrasive grains. Thus, in 
the case of sintered corundum, low-alkaline fused bonds with the lowest possible 
melting point are used. 

Should high requirements be placed on the grinding component’s profile 
accuracy and dressing capability of the wheel, the hardness and strength of the 
bond can be enhanced by increasing the amount of glass phase. Thereby, next to 
an improvement in the bonding of the individual grit, a facilitation of the 
dressing process is achieved. 

In essence, the properties of ceramic bonds can be summarized as follows: 

• Brittle and thus comparatively sensitive to impact 

• High elasticity modulus 

• Temperature-resistant, yet sensitive to temperature change 

• Chemically resistant to oil and water 
3. Metallic Bonds 

Metallic bonds exist in multiple varieties. All metallic bonds exhibit, in contrast 
to grinding wheels bonded with resins or vitrified mixtures, an increased level of 
heat conductivity. The properties of metallic bonds can be summarized along 
these lines: 

• High resistance to wear 

• Difficult to dress, or, in the case of single-layered grinding wheels, not 
dressable 

• High heat conductivity 

• Increase in frictional heat 


Tool Structure and Designation 

Grinding components with bonded abrasives are comprised of straight, conical, and 
reduced grinding wheels, abrasive cutting disks, grinding wheels affixed to support 
disks, cup grinding wheels, bent grinding wheels, grinding segments, mounted 
points, honing stones, and oil stones (Fig. 5). Ignoring form and dimensions, all 
of these types exhibit the same basic structure made of abrasive material, bond, and 
pore space. 


Composition of Grinding Wheels 

The volumetric portion of grain V G of bond V B and of pores V P of the total volume 
of the grinding component V is described by the equation 


V = V G + V B + V P 



If the individual columns are referred to the total volume V of the grinding 
component, we obtain the so-called structural formula. According to this 
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Fig. 5 Abrasive tools for machining with geometrically undefined cutting edges 

F G + V B + V P = 100% (23) 

Furthermore, from the grain mass m G , the bond mass m B , and the density of the 
abrasive grit p G and the bond p B , we obtain the relation 

m = m G + m B = p G V G + p B V B (24) 

In relation to the total mass of the grinding component, we obtain 

m' G + m' B = 100% (25) 

Superabrasive Grinding Wheels 

The superabrasive grains cBN and diamond are a great deal more expensive than the 
conventional abrasives already mentioned. Thus, the constmction of superabrasive 
grinding wheels is also different from that of conventional ones. A further reason for 
the altered constmction has to do with the fact that such grinding wheels exhibit a 
higher wear resistance. Furthermore, in order to reach higher material removal rates, 
which is what makes such grinding wheels economical, much higher burst resistance 
values and grinding wheel speeds are necessary than in the use of conventional 
grinding wheels. Therefore, grinding wheels with diamond or cBN have bodies 
applied only with a thin abrasive coating. Basic requirements of the body material are: 

• High heat conductivity 

• High mechanical strength 

• Good vibration dampening 
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These property requirements are partially in contrast to each other and thus neces- 
sitate compromises. Not all combinations between the body and the grinding wheel 
layer are possible and technically sensible. Some frequently used body materials are: 

• Aluminum, steel, bronze 

• Synthetic resin with metallic or nonmetallic fillers 

• Fiber-reinforced synthetic resin 

• Ceramics 

1. Aluminum and Steel Bodies 

Metallic bodies are distinguished by their high strength. Theoretically, for a 
body design without a central hole and with an approximately constant tension 
curve across the cross section, rotational speeds of 500 m/s can be reached. The 
highest practically feasible rotational speed is 280 m/s. Since the ratio of elastic 
modulus to density for steel and aluminum as a function of their compositions 
differs only minimally, strain levels are nearly identical at any given rotational 
speed. 

2. Bronze Bodies 

These bodies are manufactured in both cast and sintered form. Of the two, 
sintered bronze reaches the highest burst resistance value. 

3. Synthetic Resin Bodies with Metallic or Nonmetallic Fillers 

Bodies made of synthetic resin do not reach the strengths of those of steel or 
aluminum. However, their improved damping properties are advantageous. 
Synthetic resin bodies filled with metal have better heat conductivity than 
those with nonmetal fillers. Copper and aluminum are examples of metallic 
fillers used in this context. 

4. Synthetic Resin Bodies with Reinforcing Fibers 

This type of body makes use of the strength of nonmetallic fibers (glass fibers, 
carbon fibers) in a plastic matrix. The small density of fibers in connection with 
their high strength makes it possible to use them at maximum rotational speeds. 

5. Ceramic Bodies 

These bodies correspond in their construction to conventional vitrified grinding 
wheels. The prerequisite for their use is a coordination of the bonds of the body 
and the grinding wheel layer with respect to their strength and strain. 

Wheel Topography 

The efficiency of the grinding operation is highly dependent on the grinding wheel 
surface topography (Oliveira et al. 1999). Some main parameters are grain distri- 
bution, wear flat areas, grain protrusion, and grit geometry. These properties can 
highly change the result of a grinding process. A classical challenge is how to 
control these parameters in an industrial environment. There is still no system 
available for industry able to control the topographic properties of a grinding 
wheel while grinding. 

Some researchers (Brinksmeier and Werner 1992; Nakayama 1973; Verkerk 
1977) analyzed the relation between the topographic characteristics of the wheel 
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Fig. 6 Methodology for 
mapping a grain wear flat area 

(Oliveira et al. 1999, 2009) 
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surface and its behavior on the grinding operation. In this case study, Oliveira 
et al. (1999) established a monitoring system for the wheel sharpness measurement 
based on the characteristics of a light beam reflected from the grinding wheel 
surface on the abrasive grains wear flat areas over the whole wheel peripheral 
surface (Fig. 6). In the proposed system, the light reflected from the top surfaces of 
the abrasive grains is converted to an electrical signal by a charge coupled device 
(CCD) sensor and processed by a microcomputer. 

The proposed mapping system for grinding wheels (MSGW) was able to acquire 
data with the wheel running at the cutting speed (30 m/s) and the measurement 
carried out on the grinding machine without stopping. The system was applied on a 
surface grinding operation where an A1 2 0 3 grinding wheel was used. The results 
showed that the system could be used to map the wear flats at the grinding 
wheel surface and also to analyze the wear phenomena efficiently. A good 
correlation between the average reflected light level and the normal force was 
also found. 


Wheel Performance 

Probably the best way for an end user to ensure a reliable and predictable process is 
to develop it with the machine tool builder, wheel maker, and other tooling 
suppliers at the time of the machine purchase using actual production parts. This 
then combines the best of the benefits from controlled laboratory testing with real 
components without production pressures, resulting in a baseline against which all 
future development work or process deterioration can be monitored. 

The number of grinding parameters that an end user needs to understand is 
actually quite limited. The key factors are generally associated with either wheel 
life, cycle time, or part quality. The purpose of this discussion is to define various 
parameters that relate to wheel life, cycle time, and part quality and to demonstrate 
how these parameters may be used to understand and improve the grinding process. 
In most cases, the author has avoided the derivations of the formulae, providing 
instead the final equation. 
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Wheel Life 

The statement that a process can be controlled “within an acceptable range” 
requires some definition. A recent study (Hitchiner and McSpadden 2005) investi- 
gated the process variability of various vitrified cubic boron nitride (CBN) pro- 
cesses as part of a larger program to develop improved wheel technology. They 
showed that under “ideal” conditions, repeatability of wheel life within ±15 % or 
better could be achieved. However, variability associated with just wheel grade 
from one wheel to another (±1 % porosity), all within the standard limits of a 
commercial specification, made the process less repeatable and increased the 
variability to ±25 %. In the field, for example, in a high-production internal- 
grinding operation with 20 machines, the average monthly wheel life was tightly 
maintained within ±5 %. However, these average values obscured an actual 
individual wheel life variability of ±100 %. Of these, wheels with very low or 
zero life were associated with setup problems while the large variability at the high 
end of wheel life was associated with machine-to-machine variables such as coolant 
pressure, spindle condition, or gauging errors. 

Wheel makers and machine tool builders are usually in the best position to make 
predictions as to wheel performance. Predictions are based on either laboratory tests 
or past experience on comparable applications. Laboratory tests tend to reproduce 
ideal conditions but can make little allowance for a deficiency in fixturing, coolant, 
etc. In fact, the author witnessed a situation where the laboratory results and the 
actual field wheel life differed by a factor of 40. The loss of wheel life in the field 
was caused by vibration from poor part clamping and wheel bond erosion from 
excessively high coolant pressure. Laboratory data were able to inform the end user 
that there was a major problem and provide evidence to search for the solution. 

In particular, the main shortcoming of grinding with plated CBN wheels is the 
transient behavior of its grinding performance. Unlike resin-bonded or vitrified 
CBN wheels where the grinding performance is restored by periodic dressing, 
single-layer plated CBN wheels cannot be dressed. Therefore, the grinding perfor- 
mance of plated CBN wheels varies significantly as the wheel wears down which 
imposes great challenges in cost-effectively utilizing the wheel and process control. 
The workpiece material is found to be one of the dominant factors affecting the 
wheel wear. For example, it was found that the grinding performance of plated 
CBN wheels deteriorates much faster when grinding a 100Cr6 steel than a GG40 
cast iron and nickel alloy B1900 than an AISI 52100 bearing steel. 


Redress Life 

In practice, the end user seeks to reduce cycle time for part production as a route to 
reducing costs and increasing production throughput. The number of parts produced 
per dress is critical for economic production. For parts produced in large batches, 
redress life can be given as the number of parts per dress n d . If redress has to take 
place for every part produced, the cost of grinding is greatly increased. Long redress 
life depends on having the correct grinding wheel for the grinding conditions and 
also on the dressing process. Dressing parameters are discussed further in the 
chapter on dressing. 
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Cycle Time 

Cycle time is usually defined as the average total time to grind a part. For a batch of 
n d parts produced in a total time t b , the cycle time is 


n b 

The cycle time, therefore, depends on the dressing time, as well as the grinding 
time, and the loading and unloading time. 



Fundamental Removal Mechanisms of Fixed Abrasive Machining 

In the case of grinding, the investigation of removal mechanisms is complicated due 
to many different factors. The first problem is posed by the specification of the tool. 
The abrasive grains are three-dimensional and statistically distributed in the volume 
of the grinding wheel. The geometry of the single cutting edges is complex. 
Moreover, there is a partially simultaneous engagement of the cutting edges 
involved in the process. The surface formation is the sum of these interdependent 
cutting edge engagements, which are distributed stochastically. Furthermore, the 
chip formation during grinding takes place within a range of a few microns. The 
small chip sizes make the observation even more difficult (Marinescu et al. 2007). 

In spite of the complexity, some statements can be made on the removal 
mechanisms, surface formation, and the wear behavior during grinding. Analogy 
tests and theoretical considerations on the basis of the results of physical and 
chemical investigations are used for this purpose. In the past few years, chip and 
surface formation have been modeled with the help of high-performance computers 
and enhanced simulation processes. 


Material Removal in Grinding of Ductile Material 

The grinding process can be distinguished into three phases, including mbbing, 
ploughing, and cutting (Chen and Brian Rowe 1996). When the grain engages with 
the workpiece in up-cut grinding, the grain slides without cutting on the workpiece 
surface due to the elastic deformation of the system. This is the rubbing phase. As the 
stress between the grain and workpiece is increased beyond the elastic limit, plastic 
deformation occurs. This is the ploughing phase. The workpiece material piles up to the 
front and to the sides of the grain to form a groove. A chip is formed when the workpiece 
material can no longer withstand the tearing stress. The chip formation stage is the 
cutting phase. From the point of view of the energy required to remove material, cutting 
is the most efficient phase. Rubbing and ploughing are inefficient, since the energy is 
wasted in deformation and friction with negligible contribution to material removal. 
Furthermore, a high temperature may result, producing an excessive rate of wheel wear, 
and the workpiece surface may suffer metallurgical damage (Fig. 7). 
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Fig. 7 Removal process 
during the machining of 
ductile materials (Chen and 
Brian Rowe 1996) 




Surface Formation During Grinding Brittle-Hard Materials 

The stressing of a ceramic surface with a cutting edge causes hydrostatic compres- 
sion stress around a core area in the subsurface of the workpiece (Shaw 1995). This 
leads to a plastic deformation of the material (Fig. 8a). If a certain boundary stress is 
exceeded, a radial crack develops below the plastically deformed zone (Fig. 8b), 
which expands with increasing stress (Fig. 8c). After discharge, the radial crack 
closes in the initial phase (Fig. 8d). During a further reduction of the stress state, 
axial stresses occur around the plastically deformed zone leading to lateral cracks 
below the surface (Fig. 8e). The lateral cracks grow with decreasing stress. This 
growth can continue up to the surface of the material after complete discharge 
(Fig. 8f), leading to the break-off of material particles, which form a slab around the 
indentation zone. 

Despite the low ductility, elastoplastic deformations occur as removal mecha- 
nisms alongside the brittle fracture during the grinding of brittle-hard materials. 
Thereby, material removal through brittle fracture is based on the induction of 
microcracks. Ductile behavior of brittle-hard materials during grinding can be 
derived from the presumption that, below a threshold of boundary chip thickness 
defined by a critical stress, the converted energy is insufficient for crack formation 
and the material is plastically deformed. It is supposed, however, that cracks that do 
not reach the surface are also formed under these conditions. 
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Fig. 8 Mechanisms of crack spread in case of punctual stress [(a), (b), (c)] and during unloading 
[(d), (e), (f)] (Shaw 1995) 


Thus, not only the amount of stress, defined by the uncut chip thickness, is 
responsible for the occurrence of plastic deformations but also the abovementioned 
hydrostatic compression stress below the cutting edge (Komanduri and 
Ramamohan 1994). The transition from mainly ductile material removal to brittle 
friction is decisively determined by uncut chip thickness at the single grain by grain 
shape and by material properties. Large cutting edge radii promote plastic material 
behavior and shift the boundary of the commencing brittle friction to larger 
engagement depths. As a consequence, Hertzian contact stresses occur below the 
cutting edge, which cause hydrostatic stress countering the formation of cracks. 
Roth (1995) investigated removal mechanisms during the grinding of aluminum 
oxide ceramics. He observed the influence of different grain sizes of the material as 
well as different diamond geometries on the material removal processes. 


Fixed Abrasive Machining of Various Materials 
Machinability in the Fixed Abrasive Machining Process 

Machinability is understood as a property of a material that allows for chip removal 
under given conditions. It thus describes the behavior of a material during chip 
forming. The machinability of a material must always be considered in conjunction 
with the machining method, the tool, and the machining parameters. 
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In comparison to machining with geometrically defined cutting edges, grinding 
manifests clear procedural differences that affect the machinability of the materials. 
During grinding, machining is achieved by means of a number of individual grain 
engagements. Together with the strongly negative tool orthogonal rake angle of the 
grain, there is, in contrast with geometrically defined chip removal, an increased 
amount of friction and deformation work, from which results a higher conversion of 
energy in the process. This in turn can lead to heavier thermal stress on the surface 
layer. Small depths of cut result from the geometrical process characteristics of 
grinding and the high cutting speeds. Thus, the grain size of the workpiece material 
as well as the size of inclusions (e.g., carbide) plays a role with respect to 
machinability. This problem should certainly be considered in process construction 
as a scaling effect. 

A multitude of individual grain contacts are made in the grinding process. Thus, 
besides the particular abrasive grains, the space between the grain and therefore the 
entire abrasive coating bond must be taken into account. The machinability of a 
material is determined by all the elements of the grinding system. All components, 
i.e., the grinding wheel (specification and preparation), process parameters, and 
cooling lubricants, have to be adjusted for the respective material and the machin- 
ing goals (component requirements, productivity, and quality). 

The workpiece surface layer created (thermal and mechanical influences) as well 
as chip formation can be called upon as criteria for the evaluation of machinability. 
Grinding forces, workpiece roughness, possible clogging of the grinding wheel, and 
macroscopic and microscopic grinding wheel wear can all occur during chip 
formation. Chip formation and surface layer properties thus limit potential material 
removal rates and workpiece roughness. 


Grinding Nickel-Based Superalloy 
Introduction of Nickel-Based Superalloy 

The specific properties of these materials, adapted to the respective field of appli- 
cation, are essentially contingent on chemical composition, possible cold forming, 
and the heat treatment method. Corresponding to their most important alloying 
elements, nickel-based alloys can be subdivided into the following main groups 
(Everhart 1971): 


Nickel-copper alloys 

Nickel-molybdenum alloys and nickel-chrome-molybdenum alloys 
Nickel-iron-chrome alloys 
Nickel-chrome-iron alloys 
Nickel-chrome-cobalt alloys 


Nickel-based alloys are highly temperature-resistant, resistant to corrosion, and 
also very tough. Thus, they are presently the more often exploited material for 
components that are exposed to nonuniform mechanical strains at working 
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temperatures up to 1,100 °C. Because of their high heat resistance, thermal fatigue 
resistance, and oxidation resistance in the high-temperature range of aviation tur- 
bines, their preferred areas of application are in gas turbine construction as well as in 
the construction of chemical instruments (Konig and Erinski 1987). In turbine 
construction, compressors and turbines of high power densities expose their com- 
ponents, especially turbine blades, to highly complex stress profiles. They are 
thermally strained with very high gas inflow temperatures and experience additional 
mechanical stress from centrifugal forces caused by rotation and have to withstand 
stress variations due to varying operational conditions. Depending on the request 
profile, nickel-based alloys of varying micro structures are used in this context. 

In general, nickel-based alloys are numbered among materials that are difficult 
to machine because of their mechanical, thermal, and chemical properties. Due to 
varying chemical composition and micro structure however, nickel-based alloys 
show fluctuations in machinability. On the whole, the high heat resistance and 
low heat conductivity of nickel-based alloys as well as the abrasive effect of 
carbides and intermetallic phases in abrasive machining lead to high thermal and 
mechanical stress on the tools (Li 1997). Due to their high ductility, nickel-based 
alloys can be assigned to the long-chipping material group. 

Grinding Behavior of Nickel-Based Superalloy 

1. Grinding Nickel-Based Materials with Conventional Abrasive Tools 

When machining nickel-based materials, mainly conventional grinding tools 
made of corundum are utilized. The low heat conductivity of the materials 
demands a grinding wheel specification suited to the machining case at hand, 
an appropriate conditioning of the grinding wheel as well as an optimal removal 
of heat by the cooling lubricant in order to determine thermal damage of the 
workpiece’s external layer. 

Thus, open-pored grinding tools with a granulation of about F60 are used. 
Preparation for use usually takes place in the CD (continuous dressing) method, 
in which grinding wheels maintain a state of optimal conditioning by means of 
continuous in-process dressing. At the same time, it should be taken into 
consideration that grinding wheel wear is specified by the dressing feed f Rd 
and the workpiece speed v w and thus by the grinding time. A higher dressing 
feed f rd leads to a high grinding wheel effective peak-to-valley height. In this 
way, an effective removal of heat from the machining zone is possible. Disad- 
vantageous in this however are workpiece roughness, which rises with the 
effective peak-to-valley height, and increasing tool wear. Sol-gel corundums 
are of no advantage in this context. An increase in the cutting speed generally 
leads to improved surface quality and smaller cutting forces. Since the dressing 
feed is moved according to grinding wheel rotation, an increased cutting speed 
leads to more tool wear. Moreover, thermal workpiece stress rises with higher 
cutting speeds. The customary range is v c = 20-35 m/s. 

Conventional grinding wheels are indeed relatively cheap in comparison with 
superabrasives; however, not only tool costs are to be considered but also setup 
costs necessitated by grinding wheel changing. 
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In the case of surface grinding with conventional tools, as a rule the creep 
(feed) grinding process is used, in which the stock allowance is generally 
machined in one stroke. The advantages of this methodological variant are less 
surface roughness and less tool wear in comparison with pendulum grinding. 
When grinding engine blade roots, depths of cut can reach 10 mm or more. 
Problems develop because chip volumes increase along with the depth of cut and 
because cooling lubricant addition becomes more difficult, which can lead to 
thermal stresses, especially in the case of complex geometries. Increasing the 
specific material removal rate Q' w generally results in workpiece roughness 
deterioration, increased thermal stress, and more tool wear. In CD grinding, 
wear can be compensated with an increased dressing feed. A common range for 
the specific material removal rate for conventional CD grinding processes is 

r\ 

about Q' w = 20 mm /mm s (Klocke 2009). In HSCD (high speed continuous 
dressing) grinding processes, specific material removal rates of Q' w =100 mm / 
mm-s can be realized. 

2. Grinding Nickel-Based Materials with Superabrasive Tools 

Besides conventional grinding tools, cBN grinding wheels with vitrified and 
galvanic bonds have also been tried and tested for machining nickel-based alloys. 
Depending on the bond type, usually water-mixed cooling lubricants are used for 
vitrified bonds and grinding oils for galvanic bonds. Since nickel-based alloys are 
considered long-chipping materials, for cBN grinding wheels, in order to remove 
the chips and add the cooling lubricant, high grinding wheel effective peak-to- 
valley heights must be adjusted by means of the dressing process. In the case of 
dressing with a forming roller, dressing speed quotients of q d = 0.5-0. 8 for low 
depths of dressing cut of a ed = 2-4 pm have proven favorable. For crude or simple 
operations, varying degrees of dressing penetration are set. Similar dressing 
speed quotients have been proven for profile rollers as well. In this case, radial 
dressing feed speeds were in the range of f rd = 0.5-0. 7 pm. 

The grinding depth of cut has a decisive influence on the chip removal 
process when machining nickel-based materials, as the contact length goes up 
with increasing depth of cut. In this way, supplying the contact zone with cooling 
lubricant and with this the removal of heat is made more difficult. With cBN 
grinding wheels, pendulum grinding operations with grinding depths of cut of a e 
< 250 pm tend to be more practical than deep-grinding processes. The critical 
material removal rate usually runs up from only Q' w = 5-10 mm /mm-s. By 
means of speed stroke grinding technology, with table feed rate v w of 200 m/min 
at small depths of cuts, specific material removal rates could be increased up to 
Q' w = 100 mm /mm-s. Cutting speeds in cBN grinding processes are as a rule 
located at v c > 100. 

Although these grinding conditions have been tried and tested for the machin- 
ing of many different nickel-based alloys, machinability still depends on the 
structural state and alloy composition. In comparison to the forging alloy 
INCONEL718, in the case of polycrystalline cast alloys like MAR-M247, 
generally higher grinding forces arise, resulting in increased grinding 
wheel wear. 
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The larger amounts of fortifying intermetallic y carbides can be made respon- 
sible for this. The highest tool service lives are generally found in monocrystal- 
line nickel-based alloys lacking grain boundaries. For cast alloys in a directional 
manner, chip removal behavior is contingent on the direction of solidification. 
Grinding transversely to the direction of solidification can lead to up to 5 times 
higher tool wear than grinding lengthwise in the direction of solidification. 


Grinding Titanium Alloy 
Introduction of Titanium Alloy 

Q 

Titanium and titanium alloys have a low density (p = 4.5 g/ cm ) and high tensile 
strength (R m = 900-1,400 N/mnr). They exhibit good heat resistance up to 
temperatures of 500 °C. In addition, they are resistant to many corrosive media. 
From these properties are derived the main applications of titanium material, these 
being in air and space travel and the chemical industry. More universal use is 
prevented by the price, which is several times higher than steels and aluminum 
alloy. Titanium materials are subdivided into four groups: 

• Pure titanium 

• a-Alloys 

• (a + P)-Alloys 

• P-Alloys 

Machining titanium materials is generally regarded as difficult. Its machinability 
is essentially determined by the material type (metallic or intermetallic titanium), as 
well as by the respective alloy composition and thermomechanical pretreatment. 

Grinding Behavior of Titanium Alloy 

In order to machine titanium materials economically, the physical properties of this 
material group must be carefully considered. Its strength is high, while its facture 
elongation is, with A5 = 5-15 %, low (for alloys). The elasticity modulus is about 
50 % lower than that of steel, while its heat conductivity is about 80 % lower. In 
addition, high temperatures arise in the contact zone. The heat generated can only 
be removed to a small degree with the chips. In titanium machining, a much larger 
amount of heat in comparison with steel machining drains off through the tool and 
the coolant. 

Grinding can lead to the formation of cracks, which influence the ultimate 
component in its function and service life (Aust and Niemann 1999). Moreover, 
the mechanical processing of TiAl can lead to hardening zones in areas near the 
surface, which are characterized by a microhardness of up to 800 HV 0.025 and a 
maximum thickness of 180 pm. These have a negative effect on the tool compo- 
nent’s service life. 

Titanium’s reactivity with oxygen, nitrogen, hydrogen, and carbon, together 
with the high temperature of the contact zone, increases grinding wheel wear. 
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In this case, two different types of wear can be distinguished. The first is grain 
wear, which is also called fatigue-adhesive wear. If the thermal and mechanical 
resistance of the bond is insufficient at the high temperatures of the contact zone, a 
second wear-type comes into play, bond wear. This is characterized by abrasive 
grain breakaway under excessive strain of the bond or thermal or chemical wear of 
the binder. In an experiment involving grinding the titanium alloy TiA16V4, bond 
wear was responsible for up to 80 % of the total grinding wheel wear. In this case, 
most of the wear was caused by grain breakaway and bond fracture. These effects, 
however, were decisive in the failure of the bond, since the grains were dulled by 
use and caused higher cutting forces. Silicon carbide and diamond grinding wheels 
have proven advantageous in the grinding of titanium alloys. In grinding experi- 
ments with alloy TiA16V4, two to three times higher grinding forces were mea- 
sured using corundum grinding wheels as opposed to silicon carbide at a constant 
specific material removal rate of Q' w = 3 mm /mm-s. This leads to more than 
doubled wear in the case of the corundum grinding wheel. The use of corundum 
grinding wheels is not to be recommended due to their low heat conductivity 
(Kumar 1990). 

In comparison to CD grinding methods, better surface qualities were realized 
with metallically or vitrified bonded diamond grinding wheels without continuous 
dressing (Aust and Niemann 1999). Pendulum grinding operations with small 
depths of cut and high table velocities tend to be better suited to grinding 
Ti-6A1-4V than deep-grinding processes due to improved thermal marginal 
conditions. The reaction of titanium chips with atmospheric oxygen and atomized 
grinding oil can lead to deflagration or to inflammation of oil spills in the machine. 

Compared with grinding the titanium alloy TiA16V4, machining titanium 
aluminides tended to have lower machining forces, lower tool wear, less grinding 
wheel clogging, and improved surface quality. Machining with formal and dimen- 
sional accuracy with conventional grinding wheel specifications is more realizable 
with titanium aluminides than with metallic titanium alloys. In this case, up to ten 
times higher grinding ratios can be achieved. 

In the grinding of titanium aluminides, grinding wheels with high effective peak- 
to-valley heights, obtainable by high positive dressing feed rate conditions and high 
depths of dressing cut, has proved effective. Machining titanium materials created 
chips with highly reactive surfaces, which lodge themselves in the chip spaces of 
the grinding wheel and cause clogging in the grinding wheel. Therefore, a large 
chip space is required. Due to this high tendency to clog the grinding wheel, 
cleaning nozzles should be used, which rinse the chips and impurities from the 
pore space of the grinding wheel during the process. 

When using vitrified bonded diamond grinding wheel to machine DP-titanium 
aluminides, grinding ratios of over 500 can be realized at a specific material 
removal rate of Q' w = 5 mm /mm-s and a grinding depth of cut of a e = 25 pm. In 
the case of pendulum grinding operations, higher specific material removal rates 
lead to uneconomically high wear and low tool service lives. 

By using speed stroke grinding technology, with which table speed 
of v w = 200 m/min can be realized, specific material removal rates of up to 
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Q' w = 70 mm /mm-s at grinding ratios of 200 could be reached in machining 
titanium aluminum with vitrified bonded diamond grinding wheels. Due to the 
high table feed speeds of this technology, low performance intensity leads to 
minimal thermal influence on the surface layer. Crack formation in surface layer 
of the workpiece can thus be prevented. Since high table velocity and high 
material removal rates create chips that are very thick, high cutting speeds 
should be selected. But because the cooling lubricant should be added at the 
rotational speed of the grinding wheel, a flexible choice of cutting speed is only 
possible within limits. Values of v c = 125-140 m/s have proven effective at a 
feed velocity of v kss « 130 m/s. A further increase in the grinding wheel’s 
circumferential speed leads to problems in cooling lubricant supply and an 
uneconomical increase in grinding wheel wear and crack formation in the 
material. 


Grinding Brittle Material 
Introduction of Brittle Material 

The designation “brittle,” often used in manufacturing, characterizes a certain 
material group according to their mechanical properties. High brittleness, i.e., low 
fracture resistance, and hardness represent a combination of material properties 
that, on the one hand, influences the range of uses of these materials but also 
determines their machinability and workability properties. 

Several factors influence the mechanical behavior of a material. Firstly, there is 
the atom arrangement of the solid body. This can have an amorphous or a crystal- 
line structure. In the case of amorphous structures, the atoms are arranged ran- 
domly. Glasses and many plastics and rubbers are amorphously structured. One 
speaks of a crystalline structure if the atoms form a regular three-dimensional 
lattice. Ceramics can exhibit both structures. The dominant atomic bond type is 
decisive for the inclination toward ductile or brittle material behavior. Covalent 
bonds lead to limited electron movement potential. For plastic forming processes, 
position changes are, however, extremely necessary. For this reason, large amounts 
of covalent bonds facilitate brittleness and hardness, while metallic bonds (ionic 
bonds) cause ductile material behavior. 

Grinding Behavior of Brittle Material 

The machinability of brittle materials significantly variables in contrast to metallic 
materials due to the characteristics described above. When machining brittle 
materials, as opposed to machining ductile materials, the assumption is that, with 
increasing penetration depths, material separation becomes dominated by the 
characteristic behavior of brittle materials, i.e., microcrack formation and resultant 
fragment breakaway. Fundamentally, it can be stated that, in case of a local loading 
of brittle materials (observing the microscopically small chip formation zone), the 
same action mechanisms are always prevalent - crack initiation and propagation 
and plastic material deformation. 
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1 . Grinding of Ceramic Material 

With respect to their material characterization, ceramics are subdivided into 
oxide, non-oxide, and silicate ceramics. In the case of oxide ceramics, aluminum 
oxide and zirconium oxide or zirconia (Zr0 2 ) represent the industrially most 
important materials. Oxide ceramics have mostly ionic bonds (>60 %), exhibit 
favorable sintering properties, and are disadvantageous compared to other 
ceramics with respect to their heat resistance. 

As opposed to metal machining, when processing ceramics, the process 
forces are higher as a rule, especially in the normal direction. It is imperative 
that these forces be absorbed by correspondingly rigid machines and spindle 
systems. Otherwise, excessively soft, resilient systems lead to decreased dimen- 
sional and formal accuracy in the functional surface. Furthermore, ceramic 
machining is more demanding on machine protection. Grinding sludges have a 
highly abrasive effect due to the hardness of the removed particles. There is a lot 
of research available pertaining to the use of diamond grinding wheels for 
grinding ceramics. Since diamond reacts sensitively to strong thermal loads, 
increased demands are placed on cooling lubricant supply. 

Concerning the type of bond, tools bonded with synthetic resin and metal are 
the most often used in ceramic grinding. Grinding wheels bonded with synthetic 
resin exhibit more wear during the process, but lower process forces, and 
therefore result as a rule in better surface quality and formal accuracy. The 
size of the diamond grains used varies between D7 and D252, whereby grain 
sizes between D91 and D181 are selected for most machining operations. Grain 
concentrations in ceramic machining are usually between C75 and Cl 00 
(Verlemann 1994). 

With respect to the maximum obtainable specific material removal rates, 

o 

values up to 50 mm /mm-s are cited for machining high-performance ceramics 
with external grinding. Higher specific material removal rates are realizable with 
other grinding kinematics, such as surface grinding, but these cannot be consid- 
ered representative methods. In ceramic chip removal, increasing machining 
performance is accompanied by concurrently increasing process forces. 

2. Grinding of Silicon 

Silicon is primarily used as a wafer material. The mechanical properties of 
silicon are informed by the anisotropic bonding forces prevalent in the mono- 
crystalline corpus. The wafers are separated from a silicon monocrystal (ingot) 
and undergo chip removal on planar surfaces. Established praxis when grinding 
the isolated wafers dictates a two-step process comprising a preprocessing and a 
post-processing stage. First, there is a rough grinding process, which is required 
to remove the wafer surface, which is quite faulty after separation, as well as to 
smooth out the grooves. For this process, a relatively coarse grain is often 
selected (D46 in a synthetic resin or ceramic bond) in order to realize a high 

Q 

material removal rate (Q w = 100-200 mm /s). In this case, brittle machining 
mechanisms are acceptable if the damage depth of the external zone is less than 
the depths of cut of the subsequent fine-grinding process. The state of the art for 
fine grinding is D6 grits bonded in synthetic resin. Synthetic resin bonds are 
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preferred to vitrified bonds. In fine grinding, low material removal rates of Q w = 

Q 

5-15 mm /s tend to be chosen. Using a ductile machining mechanism in this 
way, a surface quality of R a < 10 nm and external zone damages smaller than 
3 pm (Klocke et al. 2000) can be realized, lessening post-processing costs. 


Summary 

Fixed abrasive machining, which is carried out by means of more or less irregularly 
formed grains composed of hard substances brought into contact with the material, 
represents a key manufacturing engineering technology with high-efficiency secu- 
rity and machining quality. This chapter has provided some basic information on 
the fixed abrasive machining technology in fundamental terms, i.e., tools, machin- 
ing parameters, and application technology in machining ductile and brittle mate- 
rials. Limited to the length of the book, the industrial application information of 
fixed abrasive machining could not be introduced here. The readers could perhaps 
find the additional knowledge in the related literatures. 
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Abstract 

Loose abrasive machining is one of the processes, which contributes to improv- 
ing precision, such as surface roughness and form accuracy of manufactured 
components. To date, numerous process principles have been developed to 
materialize the loose abrasive machining process in different ways. These 
developments have been made in response to the changing needs of the com- 
mercial market and to the improvement of the quality of products. This chapter 
provides introductions of processes for ultrasmooth surface, complex geometry, 
and mass finishing as well as the latest discussions and findings about ultrasonic 
machining. 


Introduction 

Many types of machining process have been developed and utilized by researchers 
and engineers with objectives to improve devices’ performance, function, eco- 
nomic, compact, light-weight, and design. 

Loose abrasive machining is one of the processes, which contributes to improv- 
ing precision, such as surface roughness and form accuracy of manufactured 
components. 

Loose abrasive process is categorized as a constant pressure process. Since 
abrasive is not fixed, material removal depends on the amount of pressure that is 
set. On the other hand, fixed abrasive process such as grinding is categorized as a 
constant depth-of-cut process. The amount of material removal will therefore 
depend on how much the depth of cut is set. 

Preston’s equation is a well-known formula in explaining the relationship 
between material removal rate ( MRR ) during process relative to main parameters 
(Preston 1927). This is useful in the understanding of material removal mechanism 
under constant pressure process. According to Preston’s equation, the MRR is 
proportional to the product of the polishing pressure, p , and relative velocity 
between workpiece and polishing tool, v: 



k • p • v 


where h is the amount of material removal; t is the processing time; and k is the 
Preston coefficient. Since this equation is adaptable to most of the loose abrasive 
processes, it is useful to aid in the understanding and estimation of the processed 
results. 

In this chapter, processes for ultrasmooth surface, p complex geometry, and 
mass finishing are introduced respectively. Followed by a machining process, using 
ultrasonic is also introduced with detail information. 
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Process for an Ultrasmooth Surface 
Lapping and Polishing 

Lapping and polishing are loose abrasive finishing processes, which are among the 
commonly used methods for fabrication of extremely smooth surfaces, e.g., a 
critical requirement of optical lenses, bearings, dies and molds, silicon wafers, 
and other precision components (Nakazawa 2004; Venkatesh and Izman 2008). 
Figure 1 shows the schematic diagram of lapping and polishing. 

Lapping is applied to produce a smooth surface. The lapping uses a tool, called a 
lap, which has a reverse shape of the desired form. The lap rubs the loose abrasive 
which is generally suspended in fluid, against the workpiece surface, and replicates 
the tool form to the workpiece while smoothening the surface. In general, surface 
with R a = 0.04 to 0.01 pm are obtained as a result of lapping. 

Laps are generally made of cast iron, soft steel, brass, hardened steels, as well as 
glass. Copper and steel laps enable the lapping process to be accelerated, but cast 
iron laps retain their shape better and produce a smoother surface at the same time. 
Glass laps are capable to remove high levels of metal when used with fine-grain 
abrasives and produce an even better surface than cast iron laps. 

Lapping media that are commonly used are aluminum oxide, silicon carbide, 
emery, boron carbide, and pastes or compounds mixed with oil or bonding vehicle 
(gasoline, kerosene, vegetable oil, etc.) 

Polishing compared to lapping is applied to produce extremely smooth surface. 
The polishing process uses a tool, relatively soft tools, such as pads made of soft 
cloth or resin which is used to rub loose abrasive slurry against the workpiece 
surface thus improve the smoothness of workpiece. 


Chemical Mechanical Polishing (CMP) 

Chemical mechanical polishing (CMP) was introduced by IBM for the production 
of the 64 Mbit DRAM chip (Zantyea et al. 2004). This process is frequently used for 
the planarization of semiconductor wafers; CMP takes advantage of the synergetic 
effect of both mechanical and chemical forces in the polishing of wafers (Patrick 
et al. 1991; Zantyea et al. 2004; Matijevic and Babu 2008; Venkatesh et al. 1995). 
This is performed by applying a load force to the back of a wafer while it rests on a 
pad. Both the pad and wafer are then counter rotated while introducing slurry 
containing both abrasives and reactive chemicals underneath. 

A schematic diagram of CMP is shown in Fig. 2. A wafer containing the films, 
which is a workpiece, is mounted in a carrier and pressed facedown at a known 
pressure and rotated against a porous polyurethane pad mounted on a rotating table. 
Both the carrier and the platen are normally rotated in the same direction. When the 
rotational speed of the carrier and the platen are the same, the relative velocity of 
each point on the wafer with respect to the pad is the same, facilitating a uniform 
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Fig. 2 Schematic diagram of CMP 


material removal from across the entire wafer surface. After determining that the 
proper amount of material has been removed by using one of a variety of end point 
determination techniques, the wafer is removed from the carrier and washed. 

To obtain the better quality surface, this set of CMP process and cleaning 
processes are repeated for each level of metallization. It is useful to remember 
that in a typical CMP process, the thickness of the material removed is limited to 
about a micrometer, with a removal uniformity of perhaps a few tenths of a 
nanometer across a heterogeneous surface on a 300 mm diameter silicon wafer - 
a daunting and inherently complex task. 

A large number of parameters influence the outcome of the CMP process. These 
can be divided into two groups. The first group, which depends on the polishing tool 
configuration, includes the design of the wafer carrier and retainer ring, applied 
pressure and its distribution across the wafer, rotational speeds of the carrier and the 
platen, etc. The second group consists of those dictated by the consumables used 
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Fig. 3 Schematic diagram of EEM 

during the process, which include the characteristics of the pad like its hardness, 
modulus, porosity, surface roughness, grove design, conditioning, etc., and the 
large number of properties associated with the slurry. The latter contains abrasives 
(e.g., silica, alumina, ceria, zirconia, etc.) as well as several different chemical 
additives. The components of the slurry, i.e., the abrasive size, shape, method of 
preparation and concentration, as well as the chemical additives, and more impor- 
tantly the pH strongly influence the polishing process in terms of both removal rates 
and defects caused. 


Elastic Emission Machining (EEM) 

EEM is one of the atomic size machining methods (Mori et al. 1987; Mori 
et al. 1988). When two solid phase materials composed of different chemical 
elements come into contact with each other, different kinds of interactions are 
being generated at the interface. Therefore, if these solids are separated mechani- 
cally, chances that the atoms from one surface moving onto the other surface might 
occur. This kind of phenomenon is applied in machining and is known as EEM. 

Ultrafme powder particles consisting of diameters that are much smaller than 1 .0 
pm are homogeneously mixed with water. Making use of the flow of this mixture, 
powder particles are accelerated and transported onto the work surface with min- 
imal load (Fig. 3). When in contact with the work surface, surface atoms will be 
removed through the process mentioned above. Limitation of working area is 
within the contacting area which needs to be smaller than 10 nnT, and removal is 
possible only where mutual surface atoms are ideally binding. 

Furthermore, to remove the target atom, the interface has to possess the charac- 
teristics to decrease the binding energy between the atoms in the surface and second 
layers. As a result, to obtain a geometrically perfect surface, both machines area and 
the depth need to be of approximate atomic order. The interface characteristics have 
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shown that the removal of atoms from the work surface is semi- spontaneous, thus 
finished surfaces from the point of view of physical properties can be perfect. 


Process for Complex Geometries 

Limitation of the flexibility and motion of the tool configuration make lapping and 
polishing not practical for the treatment of complex surfaces, such as sharp comers, 
deep recesses, sharp projections, free-form surfaces, and interiors of complicated 
components. 

Components used in critical applications in the aerospace, biomedical, and 
semiconductor industries require highly finished surfaces to achieve their desired 
surface functions, and many of these are currently accomplished manually despite 
increasing production costs. In fact, some of these are virtually unreachable by 
conventional techniques, and the lack of finishing technology acts as an obstacle to 
the technology innovation. 


Abrasive Flow Machining (AFM) 

AFM Process 

Abrasive flow machining (AFM) was developed by Extrude Hone Corporation, 
USA, in 1960. Three types of AFM machines that have been reported in the literature 
are one-way AFM, two-way AFM, and orbital AFM. The most common used AFM 
is two-way AFM in which two vertically opposed cylinders extrude medium back 
and forth through passages formed by the workpiece and tooling (Fig. 4). 

AFM is used to deburr, radius, and polish difficult to reach surfaces by the 
extrusion of an abrasive-laden polymer medium with very special rheological 
properties through the workpiece repeatedly. It is widely used in finishing processes 
to finish intricate shapes and profiles (Loveless et al. 1994; Raju et al. 2005). The 
polymer abrasive medium used in this process possesses easy flowability, good self- 
deformability, and fine abrading capability. Layer thickness of the material 
removed is of the order of about 1-10 pm. The best surface finish that has been 
achieved is 50 nm and tolerances are =b 0.5 pm. In this process, tooling plays a 
tremendous role in the finishing of material; however, hardly any literature is 
available on this kind of process. In AFM, deburring, radiusing, and polishing are 
performed simultaneously in a single operation on various areas including conven- 
tionally inaccessible areas, and it can produce true round radii even on complex 
edges. 

AFM reduces surface roughness by 75-90 % on cast and machined surfaces. It 
can process dozens of holes or multiple passage parts simultaneously and achieve 
uniform results. For example, air cooling holes on a turbine disk and hundreds of 
holes in a combustion liner can be deburred and radiused in a single operation. 

AFM maintains flexibility while capable to perform jobs which require hours of 
highly skilled hand polishing in a few minutes; AFM produces uniform, repeatable, 
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and predictable results on an impressive range of finishing operations. An important 
feature which differentiates AFM from other finishing processes is that it is possible 
to control and define the intensity and location of abrasion through fixture design, 
medium selection, and process parameters. It has wide applications in many areas 
such as aerospace, dies and moulds, and automotive industries. 

Magneto-rheological Abrasive Flow Finishing (MRAFF) 

Magneto -rheological abrasive flow finishing (MRAFF) was developed by Jha 
et al. in 2004 as a new precision finishing process using MR fluid for complicated 
geometries (Jha et al. 2007). This process employs determinism and in-process 
controllability of the rheological behavior of an abrasive-laden medium to finish 
intricate shapes. The desired properties of the base MR fluid used in MRAFF are as 
follows: 

1 . The fluid should be thermally stable and should have a high boiling point 

2. It should be noncorrosive and nonreactive with the employed magnetic and 
abrasive particles 

The representative MR fluid slurry consists of carbonyl iron particles (6 pm in 
mean diameter: 20 vol.%), silicon carbide abrasive (mesh size #800, #1,000, 
#1,200, or #1,500: 20 vol.%), and organic medium (60 vol.%). 

In the MRAFF process, magnetically stiffened slug of MR polishing fluid is 
being extruded back and forth with a piston through or across the passage formed by 
the workpiece and fixture. Selective abrasion occurs only where the magnetic field 
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Fig. 5 Schematic diagram of the MRAFF 

is applied across the workpiece surface while keeping the other areas unaffected. 
The schematic diagram of the process is shown in Fig. 5. Looking into the 
rheological behavior of the polishing fluid, it changes from nearly Newtonian to 
Bingham plastic and back when entering, traversing, and exiting the finishing zone, 
respectively. The abrasive cutting edges, which are held by carbonyl iron chains, 
rub the workpiece and shear the peaks away from its surface. The bonding strength 
of the field-induced structure of the MR polishing fluid and the extrusion pressure 
applied through the piston determine the amount of material sheared from the 
workpiece surface peaks by the abrasive grains. 


Electromagnetic Field-Assisted Machining 
Magnetic Field-Assisted Finishing (MFAF) 

The combination of a magnetic field with the mechanical action of a magnetic tool 
against a workpiece gives rise to the magnetic field-assisted finishing (MAF) 
process (Ko et al. 2003; Yin and Shinmura 2004; Yamaguchi et al. 2007). The 
magnetic tools can be introduced into areas that are hard to reach by conventional 
technologies and by means of magnetic manipulation; they exhibit relative motion 
against the workpiece surface needed for finishing. This shows potential for over- 
coming problems associated with more conventional finishing processes. 

Figure 6 shows the schematic diagram of MAF. The magnetic abrasive (mag- 
netic tools) consists of iron particles and A1 2 0 3 abrasive grains. The composite 
ingot is produced from the thermite process using aluminum powder and iron oxide 
powder. Subsequently, the ingot is then mechanically crushed and sieved to form 
the finished magnetic abrasive. The A1 2 0 3 grains are contained both inside and 
outside of the resulting magnetic abrasive. 

In a magnetic field, ferrous particles (including magnetic abrasive) suspended by 
magnetic force are linked together along the lines of magnetic flux. When the 
magnetic flux flows unimpeded through the nonferrous workpiece material, it might 
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Fig. 6 Schematic diagram of MAF (a) Utility of magnetic abrasive blush, (b) Utility of magnetic 
transmission phenomenon 


influence the motion of a ferrous particle - even if the particle is not in direct 
contact with a magnetic pole - and the magnetic field can be controlled from 
outside. The ferrous particle chains connected by magnetic force allows a flexible 
configuration, and given this unique behavior of the ferrous particles, it enables the 
application of the finishing operation to easily accessible surfaces and also to areas 
that are hard to reach by means of conventional mechanical techniques. 

Float Polishing Using Magnetic Fluid 

Magnetic fluids (MFs) were developed by Papell in 1965 to magnetically control 
fuel flow for the Apollo project in the zero gravity conditions of space (Papell 
1965). They are found in applications such as seal components and are routinely 
used in voice coils, dampers, and rotary brakes. 

MFs are stable colloidal suspensions of permanently magnetized particles, such 
as magnetite. This stability comes from the Brownian motion which keeps the 
particles, which are about 10 nm in diameter, from settling under gravity, and at the 
same time, a surfactant covers each particle to create short-range steric repulsion 
between particles, which prevents particle agglomeration in the presence of 
nonuniform magnetic fields. 

Application of a MF to polishing and finishing processes was undertaken by 
Kurobe and Inamaka in 1983 (Kurobe et al. 1983). The resultant process, called 
magnetic field-assisted fine finishing, was developed as a new lapping technique for 
the controllable finishing of materials, such as semiconductors and ceramics in the 
electronics and precision machinery fields. Figure 7 shows the schematic diagram 
of magnetic float polishing. The MF engulfs the groove cut in the brass disk, and the 
polisher (a 1 mm thick rubber sheet) covers the magnetic fluid-filled groove. After 
which, the water-based polishing compound was supplied over the polisher. When 
DC voltage is applied, electromagnets placed above and below the disk create a 
magnetic field. As shown in Fig. 7, the polishing pressure is generated by the 
ferrous particles, which are attracted by the magnetic field, and pushes the polishing 
compound (via the polisher) against the work surface. The polishing action results 
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Fig. 7 Schematic diagram of 
magnetic float polishing 
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when the upper pole, which is connected to the workpiece, and the disk rotate in 
opposite directions. This method has been experimentally proven that flexible 
surface finishing is possible, and the surface roughness and stock removal rate 
can be controlled by changing the current to the electromagnet. 

Magneto-rheological Finishing (MRF) 

The initial discovery and development of magneto-rheological fluids (MR fluids) 
and devices is credited to Jacob Rabinow, who studied them at the US National 
Bureau of Standards (now named the National Institute of Standards and Technol- 
ogy (NIST)) in the 1940s (Rabinow 1948). MR fluids typically consist of micron- 
sized, magnetically polarizable particles dispersed in a carrier medium; some 
common media are mineral oil or silicone oil. Particle chains form when a magnetic 
field is applied. As a result, the viscosity of the MR fluids apparently increases 
(Kurobe et al. 1983). 

The MRF process was firstly introduced commercially in 1998, and the manu- 
facture of precision optics has since changed dramatically (Tricard et al. 2003). The 
developed process makes use of polishing slurry based on MR fluid, which can be 
mixed, pumped, and conditioned in their liquid state; however, in the presence of an 
applied magnetic field, it causes a viscosity change to a semisolid state creating a 
stable and conformable polishing tool. A typical composition of an MR fluid is 
36 % carbonyl iron, 6 % abrasive (cerium oxide), 3 % stabilizer, and 55 % water. 

Figure 8 shows the schematic diagram of the MRF processing principle. A 
workpiece is fixed at some distance from a moving surface, in order to ensure 
that the workpiece surface and the moving surface form a converging gap. In the 
area around the gap, a nonuniform magnetic field is generated when an electro- 
magnet is placed below the moving surface. The MR fluid is delivered to the 
moving surface just above the electromagnetic poles and then pressed against the 
surface by the magnetic field gradient making the fluid a Bingham plastic before it 
enters the gap. Thereafter, the shear flow of plastic MR fluid flows through the gap, 
resulting in the development of high stresses in the interface zone and thus, as a 
result, material removal occurs over a portion of the workpiece surface. 
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Fig. 8 Schematic diagram of the MRF processing principle 


The process can make significant improvements to the surface roughness, and 
flatness has since been adopted by major manufacturers of precision optics. In the 
aspect of both form accuracy and micro-roughness, the MRF process has demon- 
strated the ability to produce optical surfaces to tight tolerances. The surfaces, 
including aspheres, can be made with materials ranging from glass/glass ceramics 
(including fused silica, ULE, and Zerodur) to single-crystalline materials (including 
silicone and calcium fluoride) or poly crystalline materials (including SiC). 

Internal Surface Finishing Using Magneto-rheological Fluid-Based 
Slurry 

A new type of slurry - referred to as MRF-based slurry - was developed in 2006 by 
Yamaguchi and Sato (Sato et al. 2007) and is especially appropriate for the internal 
finishing process of piping systems in micro- and nanotechnological industrial 
devices. The major feature of the MRF-based slurry is that the abrasives are smaller 
than the iron particles so that the iron particles can trap as many abrasive particles as 
possible. 

The processing principle is illustrated in Fig. 9. The MRF-based slurry is 
introduced inside the work and is attracted to the finishing area by the field 
generated by magnetic poles (e.g., electromagnetic coils or permanent magnets). 
When the work is rotated and oscillated along its axis, there is relative motion 
between the abrasives and the work surface, and the entire inner workpiece surface 
can be finished. Under typical finishing conditions, the MRF-based slurry experi- 
ences a magnetic force, a centrifugal force, a friction force against the work surface, 
and the force of gravity; these combine and result in the dynamic behavior shown in 
Fig. 9b. This behavior continuously displaces the cutting edges of the abrasives in 
the MRF-based slurry, which improves the finishing efficiency. 
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Schematic diagram of processing principle of internal finishing using MRF-based slurry 


Magnetic Compound Fluid Finishing (MCF) 

Magnetic compound fluid (MCF), which basically consists of MF and MR fluid, 
was developed by Shimada et al. in 2001 as an intelligent fluid (Shimada 
et al. 2008). In the same conditions, the apparent viscosity of MF in a magnetic 
field is lower than that of MR fluid. However, the stability of the particle distribu- 
tion in MF is better than in MR fluid. Therefore, by changing the mixing ratio of MF 
and MR fluid, the apparent viscosity and the particle distribution stability in MCF 
can be altered. This characteristic must be the greatest advantage of MCF. The 
structure of MCF is made up of chain- shaped magnetic clusters consisting of 
magnetic particles of different sizes. Magnetic particles of MF (a few nm in 
diameter) surround the magnetic particles of MR fluid (a few pm in diameter), 
and long clusters of magnetic particles are formed in the process. 

Research into the application of MCF to float polishing has been conducted from 
2002 (Shimada et al. 2008; Sato et al. 2010). Figure 10 illustrates schematic diagram 
of MCF float polishing and chain-shaped clusters consisting of MCF mixed with 
abrasive. As the MCF is subjected to a magnetic field, chain- shaped magnetic 
clusters composed of nanometer-sized magnetic particles and micrometer- sized 
magnetic particles are thus created along the lines of magnetic flux. The dimensions 
of the clusters depend on the composition of the MCF, the strength of the magnetic 
field, and the method by which the field is applied. The nonmagnetic abrasive 
particles are trapped by the clusters or distributed between clusters, and to increase 
the viscosity, alpha cellulose fiber has been used and interspersed with the clusters. 
When the magnet is moved, the clusters and abrasive exhibit relative motion against 
the work surface, and material removal by the micro -cutting action of the abrasive 
particles can be observed. This process can be diversely applied to free-form metal 
surfaces, ceramics, glasses, resin surfaces, inner surfaces of tubes, etc. 
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Fig. 10 Schematic diagram of MCF float polishing 


Process for Mass Finishing 

Deburring and surface finishing are necessary processes for almost all metal and 
many plastic parts. Mass finishing is the term most commonly associated with 
abrasive tumbling operation. Typically, large number of parts are placed in the 
deburring machine, and they are deburred together at the same time. Equipment 
operation can be performed as a batch process or a continuous flow process with the 
use of several different styles of machines. 


Barrel Finishing 

Conventional rotary barrel tumbling is the original mass finishing technique. 
Ancient Chinese and Egyptians used tumbling barrels with natural stones as 
media to achieve smooth finishes on weapons and jewelry (Gillespie 2006). 
Figure 1 1 shows the schematic diagram of barrel finishing. 

Barrel finishing is now a vastly improved process compared to the ancient 
tumbling operations, but this process is slow in general. Given that barrel finishing 
is a versatile means of edge and surface conditioning, equipment costs are cheaper 
and operation is simple; therefore, there are still applications as this is the most 
economical process. 

Barrel finishing is a low-pressure abrading process generally performed by the 
controlled sliding and rolling action of workpieces, media, and compounds. In a 
rotary or tumbling barrel, the upper layer of the workload has a sliding movement. 
While the barrel rotates, the load moves upward in the barrel to a turnover point. 
The force of gravity overcomes the tendency of the mass to stick together, and then 
the upper layer slides toward the bottom of the barrel. The barrel is normally loaded 
to approximately 60 % of capacity with a mixture of workpieces, media, and 
compounds. 

Higher load levels will be better for some workpiece such as large or heavy 
components. If the load level is increased above 60 %, this decreases the length of 
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Fig. 11 Schematic diagram 
of barrel finishing 
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slide, thus the probability of the workpieces contacting each other will be reduced. 
However, while the force of workpiece contact is being reduced, cycles will have to 
be increased. 

Increase in barrel rotation increases the steepness of angle of slide and thus 
accelerates the action. However, faster barrel rotation increases the tumbling action, 
which in turn increases the likelihood of damaging the workpieces and reducing the 
quality of edge and surface conditions. 


Vibratory Finishing 

Vibratory finishing is now the most favored type of mass finishing, right next to 
hand deburring, the most common surface conditioning method used by industries 
(Gillespie 2006). This versatile process is commonly used for cleaning, deburring, 
deflashing, descaling, edge and corner radiusing, surface finishing, and stress 
relieving. 

Figure 12 shows the schematic diagram of vibratory finishing. The first tub-type 
vibratory finishing machine was introduced commercially in 1957, and the bowl- 
type about 5 years later. Workpieces are loaded into the open top of a container 
holding the media, compound, and water. 

Vibratory finishing has been widely employed for final surface finishing of 
products, given credit to the capability to finish with consistency and with consid- 
erably lower manufacturing cost. 

Workpieces of wide variety of sizes and shapes are processed and are applicable 
to all metals and many nonmetallic materials. The advantage of conducting the 
process in large quantities in batch or continuous process setups without handling or 
fixturing helps to minimize costs. 
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Fig. 12 Schematic diagram 
of vibratory finishing 
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Drag Finishing 

Drag finishing is the process of dragging parts through a bed of media (Gillespie 
2006). The parts can be handled in single or fixture in groups. The motion of the 
parts through the media can be in a straight line, an oval-shaped racetrack system, 
or some planetary motion. Figure 13 shows a schematic diagram of drag finishing. 

The processes have two variations: first, the parts are fixed on nonrotating fixture 
and dragged through the media, and second, the parts are placed on spindles that 
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rotate while being dragged through the media. The first approach is used in Japan, 
while the second approach is a common design in the USA. 

The advantage of the process is the non-impingement of the parts on each other 
and large parts can be processed. In drag finishing, parts are inserted into a tank of 
moving abrasive media and compound and moved in the opposite direction of the 
media. Similarly as other loose abrasive processes, the abrasive media and com- 
pound rub against the parts and its edges. 

Heavy burrs can be removed, however, only with noticeable stock loss and long 
cycle time. This process does not work well on edges with deep internal features and 
impregnates minute particles of abrasive into the work surface which can result in poor 
brazed, soldered, or welded joints and increase the probability of plating failures. 


Ultrasonic Machining 

Overview of the Ultrasonic Machining 

The potential of the sound waves with high frequency (about 70 kHz) for machining 
was first observed by R. W. Wood and A. L. Loomis in 1927 (McGeough 1988). 
Subsequently, the first patent on using ultrasound for machining was granted to 
L. Balamuth in 1945 (Balamuth 1945). In the investigation on the ultrasonic 
grinding of abrasive powders, the surface of a container filled with abrasive slurry 
was found to disintegrate in the areas close to the vibrating tip of an ultrasonic 
transducer. In addition, the shape of the generated cavity was exactly the same as 
that of the transducer tip (McGeough 1988). Consequently, the capabilities of this 
technique for machining applications were quickly recognized by industrial users. 
The production of ultrasonic machine tools then started in the early 1950s by 
mounting the first USM tools on the bodies of drilling and milling machines. By 
1960, various USM tools were commercialized and utilized by manufacturers in 
regular production. Various terms have been coined for the USM such as ultrasonic 
drilling, slurry drilling, and ultrasonic abrasive machining. However, it is more 
commonly referred as ultrasonic impact grinding (USIG), ultrasonic-assisted lap- 
ping (USAL), and ultrasonic machining (USM). 

Basic Elements and Working Principles of USM 

Figure 14 shows the basic elements of an ultrasonic machining system. High- 
frequency electrical signal produced by an ultrasonic power generator is converted 
into mechanical vibration via a transducer-booster combination. The booster may 
increase, decrease, or retain the vibration amplitude received from the transducer. 
The mechanical vibration is then transmitted to a horn-tool assembly (known as 
sonotrode) which acts as an energy-focusing device. This results in vibration of the 
tool in the axial direction, typically at a frequency and amplitude ranging from 20 to 
40 kHz and 5 to 50 pm, respectively. The power employed in the USM process 
usually ranges from 50 to 3,000 W, and it may reach up to 4 kW in some USM 
systems (Thoe et al. 1998). 
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Fig. 14 Basic elements of 
the USM system: 

1 transducer, 2 booster, 

3 horn, 4 tool, 5 workpiece, 

6 fixture, 7 slurry pump, 

8 slurry tank, 9 slurry nozzle 
(Weller 1984) 



Figure 15 illustrates the working principle of the USM process. As illustrated, 
abrasive slurry which consists of a powder dispersed in a liquid medium (water or 
oil) is fed into the machining zone between the tool and workpiece. Abrasive 
materials such as boron carbide, boron nitride, aluminum oxide, silicon carbide, 
and poly crystalline diamond with particle size of 50-300 pm are commonly used. A 
downward controlled force (also known as static load or machining force) is applied 
on the vibrating tool to maintain a certain gap distance between the tool and 
workpiece (known as machining gap). The vibrating tool causes the abrasive 
particles within the machining zone to impact the workpiece surface causing 
indentation, microcracks, and eventually material removal. This process continues 
leading to the formation of a cavity whose geometry is similar to that of the tool tip. 

Commonly used materials for booster and hom are Monel, titanium alloys (e.g., 
TiAlV64), stainless steel, and aluminum alloys (e.g., AlCuMg2) (McGeough 
1988). Titanium and aluminum alloys have high fatigue strengths, and hence they 
can be used for high vibration amplitudes of up to 40 pm at 20 kHz vibration 
frequency. The tools used in USM process are usually made of materials with high 
wear resistance and fatigue strength properties (Thoe et al. 1998). Tool materials 
such as tungsten carbide, silver steel, pure tungsten, and copper are commonly used 
for machining of the materials with low fracture toughness such as glass, while 
chromium-nickel steel is recommended for machining of the materials with higher 
toughness such as sintered carbides (McGeough 1988). 
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Fig. 15 Working principle 
ofUSM (Zhang et al. 2005; 
Rajurkar et al. 2006) 



Process Parameters and Performance Measures 

Ultrasonic machining is a free abrasive machining process where abrasive parti- 
cles are in interaction with both the tool and workpiece, thereby causing some 
levels of intricacy in the process, which is further influenced by USM process 
parameters. These parameters are generally controllable input factors affecting 
the machining conditions. The machining conditions in turn determine the out- 
come of the machining process assessed by performance measures. The process 
parameters are presented in Fig. 16 using a cause-and-effect diagram. 

Performance measures in USM process include surface integrity, material 
removal rate (MRR), and tool wear rate (TWR). Surface integrity, as a crucial 
performance measure in components made of hard and brittle materials, comprises 
surface roughness, subsurface damage, and heat-affected zone (HAZ). The material 
removal in USM process involves crack initiation and propagation followed by chip 
breakage resulting in a rather coarse surface with shallow pits and voids and with 
subsurface microcracks which are generally not acceptable for delicate structural 
components. In contrast, HAZ is of lesser concern in USM since the process is 
considered as a nonthermal process which generates negligible amount of the heat. 
Thus, the machined surface is generally free from thermally damaged zone or 
residual stress (Thoe et al. 1998). 

MRR is considered as a primary process measure which reflects the efficiency of 
the machining operation. Besides the volumetric machining rate, drilling speed 
which is the penetration speed of the tool into the workpiece may also be used to 
measure the MRR in USM process. MRR might not be constant throughout the 
machining process due to the variation in the machining zone conditions such as 
distribution and speed of the free moving particles, indentation force introduced to 
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Fig. 16 Cause-and-effect diagram for USM 


the abrasive particles, intensity of the cavitation bubble collapse, interparticle 
collisions, and flushing of the accumulated debris. 

Tool wear in the USM process consists of longitudinal wear (W L ), side wear 
(W D ), and cavitation wear (Adi than 1974). The length and weight of the tool 
decrease as a result of the tool wear, thereby causing the tool to vibrate out of 
the resonant frequency which results in a mismatch with the output frequency of 
the generator. This leads to reducing amplitude and subsequently a decreased 
MRR. Furthermore, tool profile may change as a result of tool wear, thereby 
affecting the dimensional accuracy of the machined features. TWR in USM is 
influenced by parameters such as particles’ type and size, workpiece and tool 
material, tool size, and rate and method of slurry delivery to the machining gap 
(Adithan 1974). 

Capabilities and Applications of USM 

USM processes are capable of machining a wide range of workpiece materials 
regardless of their electrical or chemical characteristics. Materials can be categorized 
into three groups in view of their machinability in USM process. Type I materials such 
as glasses are very brittle and hence machined easily by the USM process. The 
material is removed as a result of the propagation and integration of the minute cracks 
originated from the interactions of abrasive particles with surface material. Type II 
materials, like hardened steels, exhibit some plastic deformation before fracture, and 
they can be machined by USM process but with a lower efficiency and MRR as 
compared to Type I materials. Type III materials such as copper and soft steel are 
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ductile with a high fracture toughness, and hence they are unsuitable for machining by 

USM technique. The applications of the USM process include: 

• Various machining operations such as drilling, die sinking, and contour machin- 
ing of features with circular and noncircular geometries as well as machining of 
the complex shapes and 3-D contours 

• Machining of ceramics, glasses, silicons, germaniums, quartzes, sapphires, fer- 
rites, optical fibers, and sintered carbides (Type I) as well as hardened stainless 
steels, hard carbon alloys, and nickel-titanium alloys (Type II) 

• Application of USM in micromachining (i.e., termed micro-USM) for microhole 
drilling, slot machining, and -D microcavities milling 

The advantages of the USM process are in the following: 

• USM is an efficient and cost-effective technique for precision machining of hard, 
brittle, and fragile materials especially in medium or small quantities. 

• USM is an environment-friendly process, suitable for machining both conduc- 
tive and nonconductive materials. 

• Since actual machining is carried out by abrasive particles, tool materials used 
can be softer than that of the workpiece material. 

• Unlike other thermal-induced processes, USM does not create heat-affected 
zones, which may induce residual stresses in the machined surface. 


Micro-Ultrasonic Machining (Micro-USM) 

Micro-USMtechnique was introduced by Masusawa’s group in the mid-1990s (Sun 
et al. 1996a). Since then, a number of research works have been carried out to 
enhance this process in both technological and fundamental aspects. The capabil- 
ities of micro-USM technique have been demonstrated for various micromachining 
operations such as drilling of microholes, machining of straight and spiral grooves, 
machining of complex- shaped (3-D) micro structures, and generating of 
microfeatures in MEMS components (Medis and Henderson 2005). These machin- 
ing operations have been carried out by implementing two modes of micro-USM, 
namely, die-sinking mode and contouring (scanning) mode. 

In die-sinking mode, micro-tool is fabricated with a face containing the patterns 
similar to features required in workpiece. During the machining, micro-tool is 
driven toward the abrasive slurry and workpiece, whereby the patterns can be 
transferred from the tool onto the workpiece. This mode of micro-USM is capable 
of generating either a single complex micro structure or a pattern of simple 
microfeatures in one pass of the tool penetration toward the workpiece. In 
contouring mode, typically a micro-tool with simple shape such as a cylindrical 
rod is used. The desired microfeature can be machined by guiding the micro-tool 
along a predetermined tool path. Figure 17 depicts the microfeatures machined on 
different materials using die-sinking and contouring mode of micro-USM. 
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Fig. 1 7 Machining of microfeatures using die-sinking and contouring mode micro-USM (Medis 
and Henderson 2005; Boy et al. 2010) (a) An array of pillars in PZT with diameter of 280 pm and 
depth of 6,000 pm. (b) Micro channels with serpentine pattern on silicon 


Since the inception of micro-USM in the mid 1996s, there have been some 
developments in the machine system design and configuration including tool 
preparation and tooling system, tool-workpiece interface monitoring, force mea- 
surement and monitoring, workpiece holding and hom design. These advancements 
have resulted in improvements to process efficiency and machined surface quality 
in micro-USM. 

Comparison of USM and Micro-USM 

The process conditions under which material removal takes place in USM and 
micro-USM are distinctively different due to the diverged process parameter 
magnitudes employed for micromachining. In order to satisfy the micromachining 
requirements, the mechanical stress causing crack initiation, propagation, and 
eventually fracture of the workpiece material is applied to a very small area or 
volume of the workpiece to reduce the unit removal and to attain low surface 
roughness (Masuzawa 2000). This is typically achieved by downscaling the particle 
size with magnitudes of a few hundreds of micrometers used in USM to micrometer 
or even submicrometer range suitable for micromachining (Egashira and Masuzawa 
1999). Another factor that contributes to reduction of the unit removal for micro- 
USM applications is introducing smaller energies into the slurry medium and 
consequently to each single particle in the machining zone as compared to that of 
USM. This leads to a reduced indentation depth and eventually a smaller removal 
volume per particle impingement in the process. This requirement is achieved by 
lowering the vibration amplitude to the range of a few micrometers (Zhang 
et al. 2005). Machining conditions and magnitudes of the process parameters in 
USM and micro-USM are presented in Table 1. 

Further to above conditions desired to satisfy micromachining requirements, and 
from the standpoint of the feature size magnitudes in micromachining, a tool 
with dimensions in the micrometer range is required in micro-USM process. 
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Table 1 Process parameters and machining conditions in USM and micro-USM (Thoe 
et al. 1998; Zhang et al. 2005; Hu 2007) 


Process parameters and 
conditions 

USM 

Micro-USM 

Vibration frequency 

Usually > 20 kHz 

Usually > 20 kHz 

Vibration amplitude 

Tens of micrometers 
(5-50 pm) 

Within micrometers (0.2-5 pm) 

Abrasive particle Size 

Tens of micrometers 
(50-300 pm) 

Within micrometers (0.2-5 pm) 

Machining load 

0.1-30 N 

Within 100 mN 

Tool size 

Usually > 1 mm 

Within 500 pm 

Vibrated component 

Tool 

Tool or workpiece 

Slurry delivery and flushing 
method 

Jet flow, suction, or 
combined method 

Slurry bath and tool rotation 


This requirement confines using methods such as jet flow and/or suction flushing 
which are normally employed in USM, and instead it favours the use of tool rotation 
to facilitate the slurry circulation and debris removal in the machining zone 
(Egashira and Masuzawa 1999). The flushing conditions in USM are affected by 
flow rate and flow pressure while the flushing conditions in micro-USM are 
controlled by the centrifugal forces resulted from the tool rotational speed. There- 
fore, these factors give rise to variations in conditions of the tool-workpiece 
interface between USM and micro-USM. 

Nevertheless, since micro-USM is an adoption of the conventional USM, both 
processes still share the same principle for machining. Similar to USM, the 
material removal in micro-USM is performed by mechanical action of the abrasive 
particles driven directly or indirectly by either vibrated micro-tool or workpiece as 
well as by cavitation in the slurry fluid causing the particle indentation, microcrack 
initiation, propagation, and fracture breakage in the impacted material (Egashira 
and Masuzawa 1999). This implies that certain process performance results are 
common for the two processes. Material removal is implemented by multiple- 
particle indentations and impacts, which cause the machined surface to be covered 
by numerous minute craters with randomly distributed positions. This effectively 
limits the minimum surface roughness achievable in the process. Also, the 
machined workpiece surface may contain a layer with many microcracks which 
is not acceptable for the majority of microstructures and microcomponents 
(Brinksmeier et al. 1998). Although these surface quality issues are encountered 
in both USM and micro-USM, their influence is more significant in micro-USM 
due to relatively smaller size difference between surface defects and machined 
microfeatures. Moreover, the surface roughness should be reduced in proportion to 
the feature size as an important factor for micromachining. The ability to control 
and predict the micro-USM process performance is also more crucial due to 
miniature tools, small particles, and features size involved in the process. 
In addition, there is a need to improve the accuracy and reliability of the 
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micro-USM system, which is considered as an important factor to perform 
micromachining. Thus, improvements to the micro-USM technique require 
advancements that provide better understanding of material removal mechanisms 
and conditions, better control and prediction of process performance, as well as 
better precision and reliability of the machining system. 

Configuration of the Machine System 

Two major configurations have been proposed for micro-USM system, namely, 
tool vibration and workpiece vibration methods. In the former method, the tool is 
attached to ultrasonic horn and it is oscillated in a vertical direction, while 
workpiece is fixed onto the XY stage (Sun et al. 1996b). Also, a rotational 
movement is applied to the horn and attached tool through a spindle system. 
While workpiece clamping is simple in this type of micro-USM system, 
implementing the vibration unit into the tooling system to some extent limits the 
rotational accuracy of the spindle system which is undesirable from the standpoint 
of the requirements for a high-precision micromachining process (Masuzawa 
2000 ). 

In the workpiece vibration method, the oscillation of the tool is removed; thus 
tooling system is only responsible for guiding and rotating the micro-tool. Applying 
the vibration to workpiece rather than tool is an attractive method due to its 
simplicity in the design and operation of the spindle system, precision of the tooling 
system, effective agitation of the slurry, and efficient delivery of the particles into 
the machining zone (Egashira and Masuzawa 1999). Furthermore, unlike the micro- 
USM with tool vibration method, tool wear has no influence on the vibration 
amplitude of tool tip in workpiece vibration method. Despite these advantages, 
issues such as reliable workpiece clamping and consistent transmission of the 
vibrations from horn to the workpiece need to be addressed to improve the process 
performance. 

Tool Preparation and Tooling System 

Miniaturization of the tool imposes some constraints on implementation of the 
micro-USM process, particularly in the fabrication of tiny tools and configuration 
of tooling system (Zhang et al. 2005). Mounting and aligning of the micro-tools 
are key issues, which have a direct impact on process productivity and part 
accuracy. Various tooling strategies and tool preparation techniques such as 
on-machine tool preparation using wire electrical discharge grinding (WEDG) 
and ultrasonic-assisted grinding, utilizing the sintered diamond tools, and fabri- 
cation of the multi-tools have been proposed to enhance the capability of the 
micro-USM process. Also, rotation of the micro-tool has been reported to reduce 
the form error and to improve the machining rate (Egashira and Masuzawa 1999). 
However, tool rotation has the adverse effect of enlargement of the microhole 
diameter mainly due to the spindle runout. While improvements are made to 
rotational and linear movements of the micro-tool, the flexibility of the tooling 
system also should be considered to accommodate the tool preparation 
techniques. 
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Monitoring and Control of the Machining Force 

During material removal in micro-USM process, the micro-tool progresses further 
toward the workpiece through abrasive slurry to maintain a certain machining gap 
and thus a stable machining condition. While a machining gap larger than an 
optimum level would cause the particles to lose their effective kinetic energy before 
striking the workpiece surface, using a very small machining gap might result in 
suppressing the abrasive particles by the micro-tool and hence leaving them with 
insufficient energy for impacting the workpiece surface. In both conditions, the 
material removal rate would be very low or even negligible. Furthermore, after the 
onset of the machining process, the interface boundary between tool and workpiece 
as well as the conditions of machining zone vary continuously, and hence it is 
difficult to maintain the machining quality (Zhang et al. 2005). The control of the 
machining gap in micro-USM is made possible through precision measurement and 
control of the static load (machining force) as the feedback in the process. 

While in USM, static load and tool size are in the range of a few kilogram-force 
and larger than 1 mm, respectively; in micro-USM process the static load is in order 
of a few gram-force and tool size ranges within 500 pm (Zhang et al. 2005). 
Besides, mismatched tool- workpiece interactions not only adversely affect the 
stability of the micro-USM process, but they also lead to the breakage of micro- 
tools. About 60 % of the micro-tool breakages have been reported to occur due to 
mishandling of the initial engagement of tool-workpiece interaction (Zhang 
et al. 2006). Therefore, downscaling of the process parameters from USM to 
micro-USM necessitates the development of a reliable force sensing and monitor- 
ing system with high precision and rapid response in order to enhance the stability 
of the micromachining process and to avoid tool breakages. 

Different strategies have been proposed to control the machining gap and contact 
force between micro-tool and abrasive slurry in micro-USM process. Acoustic 
emission (AE) technique was proposed and implemented to monitor the tool- 
workpiece interaction in order to reduce the probability of the tool breakage in 
the process (Moronuki and Brinksmeier 2002). However, the AE output signal has 
been reported to be influenced by the workpiece position on the machine table as 
well as the intrinsic characteristics of the ultrasonic generator used in the process, 
thereby causing an incorrect assessment on actual state of the tool-workpiece 
interaction and hence incomplete machining (Zhang et al. 2006). 

Other methods such as using electronic balance and dynamometer also have been 
proposed for measuring the machining force in micro-USM. A digital balance with 
resolution of 10 mg and response time of 10 ms was proposed in (Sun et al. 1996a). 
Also, an electronic balance with a minimum index of 1 mg was installed in a micro- 
USM experimental setup acting as the sensor for feedback control (Egashira and 
Masuzawa 1999). However, low-frequency response and small sampling rate were 
identified as major limitations of this method. In another study, a precision force 
actuation and measurement system with the capability of sensing and control of 
forces in the low mN range was developed (Hoover and Kremer 2007). Neverthe- 
less, the force measurement in this system only provides the desired accuracy when 
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employed for measurements in the horizontal direction rather than vertical one. 
Thus, it may not be suitable for ultrasonic machining process. 

Workpiece Holding Method 

The machining accuracy is directly affected by the workpiece holdingmechanism in 
micro-USM with workpiece vibration method. In such a setup, the workpiece must 
be attached firmly to the ultrasonic horn, which otherwise mechanical vibrations 
would not be transmitted to the workpiece consistently, causing unstable machining 
conditions and excessive heat generation in the horn-workpiece interface. 

The use of double- sided adhesives to attach the workpiece to the transducer or 
horn has been attempted by researchers (Hu 2007). However, this method has two 
main drawbacks. The removal of workpiece after machining is difficult, especially 
in the case of thin and fragile workpiece materials. Also, the adhesive tape itself 
may absorb the ultrasonic power and alter the acoustic characteristics of the original 
mechanical vibration supplied into the ultrasonic horn. Therefore, a reliable 
clamping system is deemed necessary to attain a uniform transmission of the 
mechanical vibrations from ultrasonic hom to the workpiece. 


Development of a Micro-USM System with New Techniques 
for Workpiece Clamping and Static Force Control 

A new micro-USM system was designed and fabricated with the purpose of 
improving the functionality and flexibility of the machine system (Zarepour 
et al. 2011). Effective and well-proved techniques such as applying the vibration 
to workpiece are incorporated in developed system. Figure 18 illustrates the 
schematic diagram of the developed micro-USM system which consists of five 
main subsystems, namely, generation and transmission of ultrasonic vibrations, 
tooling system, workpiece holding, measurement and control of static load, and 
slurry delivery system. 

The tool rotation has been adopted to reduce form errors and to provide flexi- 
bility in the tooling system. The whole tooling system is mounted on a three-axis 
stage. The micro-tool motion is controlled with a minimum incremental motion of 
0.03 pm in z-axis and 10 pm in both x-axis and y-axis. An integrated piezo-motor 
controller/driver with computer interface is used in z-axis, enabling a highly 
reliable tool positioning with 30 nm sensitivity. This configuration provides an 
accurate control over the infeed of the tool in the direction of the workpiece 
oscillation in order to maintain a preset machining gap. 


Workpiece Clamping System 

The task of holding the workpiece against the ultrasonic hom hampers the efficient 
use of the micro-USM with workpiece vibration method. The vibration frequency 
of the horn is 50 kHz in the developed system. As such, attachment of the 
workpiece to the hom is a critical stage in the process, which may affect the 
stability and accuracy of the machining process. The micro-tool used has a diameter 
up to 300 pm, which is susceptible to breakage in the event of tool collision. 
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Fig. 18 Schematic of the developed micro-USM system 


An aluminum booster together with a full-wave titanium horn, which has a 
recess to accommodate the workpiece, is used to transmit the vibration from 
ultrasonic transducer to the workpiece. Nominal vibration amplitude of the system 
ranges between 0.8 and 5.5 pm using a combination of the reverse booster and horn 
with the same gain ratio of 0.5:1 and adjustment of the output power of the 
ultrasonic generator. As shown Fig. 19, a vacuum chuck is incorporated to the 
ultrasonic horn for workpiece clamping. It is important to introduce the vacuum 
tube to the ultrasonic hom at the position of the nodal point where the amplitude of 
vibration is close to zero. The vacuum tube is connected to the vacuum pump 
through a liquid separator with filter. The proposed method enables a rapid 
clamping and unclamping of the workpiece without introducing crack to thin and 
fragile workpieces. 

Measurement and Control of the Static Load 

In the developed system, the static load between micro-tool and vibrated workpiece 
is measured and controlled using a precision force sensor integrated with the tooling 
system as illustrated in Fig. 20. The proposed design has a merit in that the sensor 
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Fig. 20 Configuration of the force measurement system 


can be mounted on tooling system without hampering the rotation of the micro-tool. 
Furthermore, utilizing the sensor in the tool side eliminates the measurement errors 
arising from dead weight of the ultrasonic stack and fixture as well as the noise and 
vibration from ultrasonic hom. The sampling frequency of the sensor is up to 1 kHz 
which can be utilized in both tension and compression; hence, it is capable of high- 
speed measurement of the variation in the machining force. The static load can be 
constantly monitored by the force sensor in order to maintain the preset machining 
gap through controlling the infeed tool motion. The system also has a built-in 
overload protection feature. 
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Table 2 Mechanisms of the material removal in ultrasonic machining 


Mechanism of 

material 

removal 

Description 

References 

Direct 

hammering 

Material removal occurs through direct localized 
hammering of abrasive particles by tool against the 
workpiece surface. Only the larger particles take 
part in the mechanism either because they rest on 
the workpiece, thus receiving a blow from the tool 
face, or because they descend in contact with the 
tool face to hammer the work. This mechanism 
may result in crushing of the abrasive particles 

Thoe et al. (1998) 

Free impact 

Material removal is performed by the impact of 
free moving abrasive particles on the workpiece. 
This mechanism generally takes place when the 
particles with a size smaller than the gap distance 
fly across the machining zone and strike the 
workpiece surface at random locations 

Soundararajan and 
Radhakrishnan 

(1986) 

Cavitation 

erosion 

Erosion of the surface occurs due to cavitation 
effects within the abrasive slurry. This 
phenomenon is considered as a form of wear 
whereby the damage is produced by bubbles 
collapsing near the surfaces of both workpiece and 
tool 

Soundararajan and 
Radhakrishnan 

(1986) 

Chemical action 

This mechanism is associated with the fluid 
employed for the slurry and involves some 
chemical reaction taking place within slurry 
mainly due to micro-explosion resulted from 
cavitation in the slurry fluid 

Thoe et al. (1998) 


Material Removal Mechanisms and Modes in USM and Micro-USM 

The mechanisms of material removal in USM process can be categorized into four 
types, namely, direct hammering, free impact, cavitation erosion, and chemical 
action. Detailed descriptions of these mechanisms are presented in Table 2. One 
material removal mechanism or a combination of mechanisms will dominate the 
material removal process depending on the process parameters (Thoe et al. 1998). 
While direct hammering and free impact are primarily responsible for material 
removal in USM, cavitation erosion and chemical action are of secondary signif- 
icance with the majority of the workpiece materials such as glass, ceramics, and 
hardened steel (Thoe et al. 1998). Cavitation erosion has been reported to play an 
important part in machining of the porous materials (Weller 1984). 

The effect of these mechanisms results in material removal from the workpiece 
either by crack formation and fracture in brittle mode or by cutting and shearing in 
ductile mode. In addition, during the USM process, there might be a situation where 
the material at transient surface of the workpiece is not completely removed but 
only displaced upon impingement of the abrasive particles. 
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When material is impacted by a hard angular particle at high speeds or under large 
contact forces, plastic deformation occurs in the contact zone due to high compres- 
sive and shear stresses, and a radial crack is formed. After impact, the plastic 
deformation results in large tensile stresses leading to initiation of lateral cracks 
and eventually material removal (Wensink and Elwenspoek 2002). When the speed 
of particles or the amount of force acting on them during the process is below the 
threshold values required for crack initiation in the brittle material, particles may 
cause only plastic deformation to the surface, which brings about material removal 
by ploughing and cutting in a ductile mode instead of a brittle one. This change in 
erosion mode from brittle fracture-dominated behavior to plastically dominated 
behavior is called brittle-ductile transition (Wensink and Elwenspoek 2002). The 
material removal mode, whether it occurs in brittle or ductile mode, and transition 
between these modes have a direct effect on both surface integrity and material 
removal rate associated with process quality and productivity, respectively. 

Although the underlying machining principle of micro-USM is similar to that of 
conventional USM (Zhang et al. 2005), downscaling of the process parameters such 
as tool size, vibration amplitude, machining load, and particles size so as to 
minimize the contact zone of abrasive particles with tool and workpiece to the 
microscale range may affect the machining condition. Also, the nature of material 
removal in micro-USM, whether the material is removed through ductile deforma- 
tion or crack generation followed by brittle fracture, may differ from that in USM 
due to the influence of miniaturized tool-abrasive-workpiece interaction intensity 
(Zhang et al. 2006). For instance, the accumulation of workpiece debris and crushed 
abrasives in a small machining gap and difficulty in removing them from the 
machining zone due to susceptibility of the micro-tool to deformation and breakage 
will affect the manner by which material is removed from the workpiece surface 
(Yu et al. 2006). This also could lead to further complexity in tool-abrasive- 
workpiece interactions and consequently more intricate material removal mecha- 
nisms in micro-USM. Research works concerning the material removal modes in 
micro-USM are very limited in the literature, while no studies have been reported 
regarding the investigation on dominant material removal mechanisms that con- 
tribute to the machining process. 

Intricacies Involved in Study of Material Removal Mechanisms in USM 
and Micro-USM 

Characteristics of the machined surface are dependent on the manner by which the 
material is removed from the workpiece. The predominant mechanism of material 
removal has a direct influence on material removal rate and machined surface 
quality. The study of different conditions in tool-abrasive- workpiece interaction 
within the slurry fluid is complex due to the following issues: 

• Ultrasonic machining is a free abrasive machining process. That is, abrasive 
particles move randomly and freely within the machining zone during the 
material removal process. Thus, the location of each particle is changing by 
instant displacement and rotation within the slurry. 
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• Generally, the shape of the particles is irregular and with a distributed size over 
the machining zone. That is, the real particles engaged in machining operation 
do not have exactly the same size, but they follow a nominal average size. 
Besides, abrasive particles are not ideally rigid and incompressible; thus, they 
break into smaller pieces due to collision with the workpiece and tool or between 
themselves resulting in variation of the abrasive size and distribution across the 
machining gap. 

• The distribution of the abrasive particles in the affinity of the machined surface 
and tool may become uneven as a result of slurry flow and agitation due to 
micro-explosion of the bubbles inside the slurry fluid. Also, there is a likelihood 
that the conditions of the machining gap become abnormal due to embedding of 
the particles into the workpiece or tool surface (Yu et al. 2006). These factors 
may lead to a situation whereby the mechanical vibration would not be trans- 
mitted to the slurry fluid and abrasive particles effectively. 

An eclectic of these factors introduces micro-USM as a stochastic process. 
Further, as discussed earlier in this chapter, the effective material removal mech- 
anisms may be different in the case of micro-USM as compared to that of conven- 
tional USM due to more complexity of the tool-abrasive -workpiece interaction in 
the machining zone. 

Investigation on Material Removal Mechanisms in Micro-USM 

Experimental observations by authors (the results to be published in a separate 
paper) demonstrated that the material removal in micro-USM is primarily caused 
by the erosive wear associated with ultrasonic vibration rather than by abrasive wear 
resulting from micro-tool rotation during the process. The erosive wear in micro- 
USM mainly takes place via three material removal mechanisms similar to those of 
conventional USM as described earlier in this chapter. These mechanisms include 
pure cavitation due to bubble implosion inside the slurry liquid, direct hammering of 
abrasive particles on the workpiece surface by micro-tool, and impact of the free 
moving abrasive particles. While the role of pure cavitation has been reported to be 
insignificant in USM, direct hammering and free particle impact are regarded as 
main removal mechanisms. Also, free particle impact has been confirmed to account 
for only about 22 % of total material removal in USM (Khairy 1990). Thus, direct 
hammering has been identified as the primary cause of the material removal in 
conventional USM. The relative contribution of the removal mechanisms might be 
different in micro-USM as compared to that of conventional USM due to the diverse 
removal conditions inside the machining gap for two processes. As such, a basic 
study is conducted to identify the contribution of the different mechanisms to 
material removal in micro-USM. These mechanisms are depicted in Fig. 21. 

Material Removal by Pure Cavitation 

To identify the role of pure cavitation in material removal, a set of machining 
experiments were conducted by using deionized water and without any abrasive 
particles. Tungsten rods with a diameter of 300 pm and monocrystalline silicon 
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Fig. 21 Material removal mechanisms involved in micro-USM process: (a) pure cavitation; 
(b) direct hammering; (c) free particle impact 


with thickness of 525 pm were used as micro-tool and workpiece, respectively. The 
vibration frequency of 50 kHz and vibration amplitude of 2 pm were used for all 
experiments. A machining duration of 5 min was maintained for all experimental 
runs, and each run was repeated three times. The force introduced by ultrasonic 
vibration was recorded using the precision force sensor with interface software. 
This force value corresponds to the adjusted gap distance, and it is used as the 
feedback to maintain the gap distance during the process. Figure 22 shows the 
profile of a typical microhole with maximum depth of 6.8 pm obtained at gap 
distance of 2 pm. 

Material Removal by Particle Direct Hammering 

Another set of experiments was planned and conducted to investigate the role of 
direct hammering mechanism in material removal in the micro-USM process. PCD 
particles with nominal size of 3 pm mixed with DI water were utilized as abrasive 
slurry. Vibration amplitude was set at 2 pm, and small machining gaps in the range 
of 0.2-2 pm were applied. This is to ensure that abrasive particles can be pushed 
and impacted directly by end face of the micro-tool onto the workpiece surface, that 
is, the condition required for direct hammering mechanism to take place in micro- 
USM process. Other machining conditions were similar to that described for the 
experiments in the previous experiment. 

The depth of microholes was measured, and maximum machined depth of 
28.4 pm was obtained when machining gap was adjusted at 0.2 pm. The profile of 
a typical microhole machined under these process conditions is depicted in Fig. 23. 

Material Removal by Free Particle Impact 

Machining experiments were conducted to investigate the effect of free 
particle impact in the material removal process. PCD particles with nominal size 
of 3 pm mixed with DI water were utilized as abrasive slurry. Vibration amplitude 
was set at 2 pm, and machining gap was adjusted in the range of 5-200 pm. 
Since the machining gap was set to be larger than the particles size, direct 
hammering of the particles by tool face was unlikely to occur in the process. 
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Fig. 22 Material removal resulted from cavitation only (without applying abrasive particles); 
vibration frequency = 50 kHz, vibration amplitude = 2 pm, gap distance = 2 pm, machining 
time = 5 min 

Therefore, material removal was brought about primarily by the impact of the free 
moving particles striking the workpiece surface. 

The depth of microholes machined at various gap distances was measured and 
analyzed. As shown in Fig. 24, a microhole with a depth of approximately 218.2 pm 
was obtained, which corresponds to the microhole machined at 30 pm gap distance. 

Contribution of Different Material Removal Mechanisms in Micro-USM 

A comparison of the three removal mechanisms in micro-USM with respect to their 
contributions to material removal is presented in Fig. 25. Average machining depth 
was calculated for three machining runs at which the maximum depth was achieved 
under different mechanisms in the process. While pure cavitation produced a 
shallow microhole with an average depth of only 6.8 pm during 5 min of the 
machining process, direct hammering and free particle impact produced microholes 
with an average depth of 28.4 pm and 218.2 pm, respectively, over the same 
machining time. Accordingly, the depth of the microholes produced under the 
free particle impact mechanism is approximately 32 times and 8 times higher 
than that of pure cavitation and direct hammering, respectively. This could be 
attributed to high velocity of the free moving particles inside the machining gap 
which are subjected to ultrasound in micro-USM. Results of this study evidence that 
unlike conventional USM, free particle impact is the main contributor to material 
removal in micro-USM. 
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Fig. 23 Material removal resulted from direct hammering of the particles by micro-tool; vibration 
frequency = 50 kHz, vibration amplitude = 2 pm, gap distance = 0.2 pm, abrasive particles = 
PCD, particles size = 3 pm, machining time = 5 min 


Predictive Modeling of Ductile and Brittle Removal Modes in Micro- 
USM 


The results of the investigation on abrasive particle erosion of some brittle 
materials have demonstrated that the ductile-brittle transition may occur as the 
size of particles is reduced (Hutchings 1992). Since the size of abrasive particles 
used in micro-USM is smaller than that of conventional USM by a factor of 10 or 
more, the likelihood of such a ductile-brittle transition is higher in micro-USM as 
compared to that of conventional USM. Being a loose abrasive process, the 
material removal mode in micro-USM can be influenced by setting the process 
parameters including particle size at desired levels in order to enhance the process 
performance. 

Investigations on material removal characteristics have been carried out based 
on the morphology of surfaces, processed by multiple-particle impingements. 
Unlike the multiple-particle impact, the method of single-particle impact has a 
potential to provide more basic and explicit information about material removal 
modes and mechanisms. Investigation on craters produced by single-particle 
impact is an attractive approach with the capability of providing practical solutions 
to overcome the ambiguity around the issue of determining the material removal 
modes in abrasive-based micromachining processes. Therefore, the study of single 
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Fig. 24 Material removal resulted from free particle impact; vibration frequency = 50 kHz, 
vibration amplitude = 2.0 pm, gap distance = 5 pm to 200 pm, abrasive particles = PCD, 
particles size = 3 pm, machining time = 5 min 
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Fig. 25 Comparison of different removal mechanisms with regard to their role in micro-USM 




26 Loose Abrasive Machining 


1085 


\ Li terial properties 


Pa il icle properties 

E . K fC , H 


■ ftp 


± 




kinetic energy for radial 
and I at era! cracks : , U j 




Estimate kinetic 
of a panicle in the 
machining zone: U p 



Criteria for brittle 
and ductile modes 
of material removal 


1 

I 

I 

I 





f 



l Itrasomc cliarac ter i sties 

/. " 


Consta ut c oeffic ien t s 



Brittle mode 
M aleri al removal 


Ductile mode 
Material removal 
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abrasive particle impingement may open new avenues toward a more fundamental 
and objective approach to investigate the brittle and ductile machining modes and 
transition between them in micro-USM process. 


Approach to Development of the Predictive Model 

Figure 26 outlines a model for prediction of material removal mode in micro-USM 
process. The model consists of three parts: estimation of the apparent threshold 
kinetic energy for radial and lateral fracture in workpiece material, estimation of the 
kinetic energy of an impinging particle inside the machining zone, and criteria for 
determining the material removal mode. The analysis to estimate the threshold 
kinetic energy is performed based on the well-established indentation fracture 
theory for microcracks initiation by hard angular particles in work materials. The 
application of this theory to predict the thresholds for ductile and brittle transitions 
in other processes such as particle erosion have been described by Hutchings 
(Hutchings 1992). 

The model estimates the kinetic energy of a single impinging abrasive particle 
driven by the ultrasound in the process. The primacy on the type of material 
removal mode lies at the foundation of comparative principle between the apparent 
threshold kinetic energy of the workpiece material and estimated kinetic energy of 
the impacting particle based on the input parameters of micro-USM process. 


Summary 

The major discussions and findings presented are summarized in the following: 

• The development of a micro-USM system has been reported, which incorporates 
the workpiece vibration method. New techniques for the force measurement and 
workpiece clamping in micro-USM were introduced. 

• A study was performed on the role of various mechanisms contributing to the 
material removal in the process. Results demonstrated that the depth of 
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microholes machined by free particle impact mechanism is larger than that 
of microholes machined under pure cavitation and direct hammering mecha- 
nisms by 32 times and 8 times, respectively. Therefore, the free particle 
impact was found to play a major role in the material removal in micro- 
USM process. 

• A new model was developed to predict material removal mode in micro-USM 
with respect to material properties of the workpiece and particles as well as 
ultrasonic characteristics of the system. In the model development, the required 
conditions for ductile and brittle removal in micro-USM were discussed. The 
kinetic energy of a single impinging abrasive particle introduced by the ultra- 
sound was estimated in the model. The type of material removal mode was 
determined based on the comparison between the apparent threshold kinetic 
energy of the workpiece material and estimated kinetic energy of the impacting 
particle. Experiments were conducted on quartz material to validate the capa- 
bility of the proposed model. Modeling results and experimental data showed 
that the proposed model is able to predict the material removal modes reasonably 
well in the micro-USM process. 

• Process models were proposed to predict the material removal rate for brittle and 
ductile removal modes in micro-USM. The models were based on the indenta- 
tion fracture theory and formation of the plastic indentation zone as well as 
lateral cracks during the interaction between sharp particles with workpiece 
surface. Machining experiments were carried out using the developed micro- 
USM system in order to validate the models. MRR values predicted by the 
proposed models showed a reasonable agreement with the experimental trends 
of the MRR for quartz material. However, it was observed that the models 
consistently overestimate the measured MRR values within the machining 
conditions carried out in the work. The deviation between theoretical and 
experimental results may be due to conditions such as overlapping of the craters, 
size and shape distribution of particles, and interparticle collision within the 
machining zone. 


Summary 

To date, numerous process principles have been developed to materialize the loose 
abrasive machining process in different ways. These developments have been made 
in response to the changing needs of the commercial market and to the improve- 
ment of the quality of products. 

Loose abrasive machining processes are also used for deburring, edge finishing, 
and surface integrities. Moreover, process modeling and simulation of the loose 
abrasive machining are being studied intently. Although this chapter cannot cover 
all research and development related to loose abrasive machining, the authors hope 
that this chapter will be of assistance in understanding loose abrasive machining 
and in supporting future technology innovation. 
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Abstract 

As a result of the current trend toward products miniaturization, there is a 
demand for development in micro -manufacturing technologies in order to have 
better quality products, and cheaper, more efficient, and effective processes. 
Miniaturized products and components in the range from a few hundred micro- 
meters to a few micrometers size are becoming common and widely used in 
daily human life. The micro-products and micro-components are used in many 
industries especially related with micro-electromechanical, aerospace, medical, 
environment, biomedical and biochemical industries, and also in the field of 
chemistry. Many manufacturing methods have been developed to produce these 
micro-sized products, namely micro electro mechanical system (MEMS) based 
processes such as dry etching, lithography, electroplating, ultraviolet - 
lithographie galvanoformung abformung ( UV-LiGA ), non-conventional based 
micro -machining such as micro- electron discharge machining (EDM), and 
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mechanical micro-machining. This chapter discusses mechanical micro-machin- 
ing especially challenges related with this process. Some main challenges are 
discussed in this chapter such as size effect, micro structures of the materials, 
surface quality, burr formation, and micro-tools performance. Some solutions 
are offered in order to solve these challenges. 


Introduction 


Various researchers have looked to alternative processes such as mechanical micro- 
machining because of its advantages. The mechanical micro -machining process can 
increase the efficiency, reduce cost, save energy, produce high aspect ratio, have 
wider range of materials, have flexibility, and simplify the production of micro- 
parts and micro-components. 

A mechanical micro-machining process such as micro-milling is more conve- 
nient than a MEMS based process to produce high aspect ratio structures with 
complex geometry (Bissacco et al. 2005). Furthermore, MEMS based processes are 
usually limited to silicon while mechanical micro-machining can be applied to a 
wider range of materials (Kussul et al. 1996). Meanwhile, lithography can be 
applied only to metals that can be easily electroplated (Hupert et al. 2007; Becker 
et al. 1986). Mechanical micro -machining is a flexible process that can also be 
applied to silicon based substrate materials. Rusnaldy and Kim (2008) have 
conducted micro-milling experiments on silicon in order to obtain ductile regime 
machining. Morgan et al. (2004) micro-machined glass and soda-lime glass mate- 
rials using polycrystalline diamond (PCD) tools made by micro-EDM. Micro- 
milling has also been used to produce 3D features on polycarbonate materials 
(Lee et al. 2006). 

Micro-milling can be used to produce micro-features because it is simpler and 
less time consuming. Hyuk-Jin and Sung-Hoon (2007) have shown, by cost esti- 
mations, that the micro-milling process produces cheaper aluminum embossing 
mold injection molding than lithography based processes. Nevertheless, besides its 
advantages micro-milling is limited in its ability to create sharp interior comers 
because of the finite radius of the milling tool (Friedrich and Kikkeri 1995). 

Recently, mechanical micro-machining especially micro-milling is applied in 
the production of embossing mold used for hot embossing process. By using the 
micro-milling process, the micro -features in the embossing mold are produced 
directly by removing the materials to form the positive features on the workpiece; 
an additional finishing process might be needed to make the features smoother. 
Guber et al. (2004) produced a brass embossing mold with channel width and depth 
of 50 pm each and sidewall roughness of about 200 nm using micro milling. Hupert 
et al. (2007) also developed embossing mold for hot embossing using micro milling 
on a 6.3 mm thick brass plate, with tool diameter as small as 50 pm (Fig. 1). The 
mold made by micro-milling requires fewer steps and the result is comparable to 
that developed with LiGA (Hupert et al. 2007). Mecomber et al. (2005) also made 
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Fig. 1 SEM of brass embossing mold produced using 200 pm diameter micro-milling tool (a, b) 
and the replicated embossed into PMMA (c, d) (Hupert et al. 2007) 


low cost micro-embossing molds from aluminum, milled with a 51 pm diameter 
high density micro-grain carbon tool using conventional CNC milling. Moreover 
Bissaco et al. (2005) applied milling tool diameter of 200 pm to produce microin- 
jection molds made from hardened steel. 

The cutting phenomenon of micro-machining is different than conventional 
macro machining, in which the conventional metal cutting principles cannot be 
directly applied. There is no general definition which exists for the micro- 
machining process. Liu et al. (2004) defined the micro -machining process occurs 
when the cutting edge radius becomes comparable to the depth of cut. According to 
Chae et al. (2006) micro-machining is the machining process to create features that 
range from tens of micrometers to a few millimeters in size. In addition, Min 
et al. (2006) defined micro-machining as machining with a tool whose dimension 
is on the order of the average grain size of the workpiece material and/or the specific 
feature being generated or machining with a tool whose dimension is small enough 
to lose isotropic homogeneity with respect to the workpiece material. While Uriarte 
et al. (2007) categorized micro-milling as the process with feed per tooth less than 
1 pm, depth of cut in the range of 2-15 pm, spindle rotation speeds higher than 
50,000 rpm, and tool diameter less than 0.3 mm. 

Moreover, the micro -machining process will be more efficient when it is 
applied in the desktop rather than computer numerical control (CNC) machine. 
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By using micro-machining desktop, the consumption of energy saved can even be 
as large as three magnitudes when compared to CNC (Liow 2009). Most of the 
energy used when producing micro-parts in conventional CNC is to move the 
tables and to rotate the spindle at the highest rpm to compensate for the small 
chip sizes. 

There are some aspects that must be considered during the micro-machining 
process such as size effect, multiphase microstructure of materials, surface quality, 
burrs, and micro-tools (Liu et al. 2004; Chae et al. 2006). These aspects are related 
to each other and simultaneously influence the process performance and quality of 
the machined workpiece. These aspects are discussed below. 

Size Effect 

Size effect in cutting process occurs when the uncut chip thickness (depth of cut) is 
on the same order as that of the cutting edge radius. This phenomenon generally 
occurs when cutting at the micro scale or cutting using a blunt tool. One of the 
significant influences of size effect is the increasing of energy dissipation. Lucca 
et al. (1991) observed that energy dissipation increases with the decreasing uncut 
chip thickness when orthogonal cutting oxygen free high conductivity (OFHC) 
copper using a single point diamond. Afterwards Lucca et al. (1993) also presented 
that the specific cutting energy increases as the depth of cut decreases for sharp tool 
(new) and blunt tool (worn). They argued that rake angle has significant effect on 
the specific cutting energy when the cutting edge radius is in the same order of 
depth of cut. In this condition, the rake angle becomes negative due to the radius 
shape at the tip of the tool. 

Consequently, as the cutting edge radius increases, the cutting force also 
increases due to the occurrence of two mechanisms in the process which are 
shearing (chip removal) and ploughing (elastic deformation) (Liu et al. 2006). 
Increasing the cutting force and the dominant effects of ploughing could generate 
the tool wear faster (Uriarte et al. 2007) due to the increase of heat from chip-tool 
rake friction and excessive tool deflection (Lucca et al. 1991; Liu et al. 2006). 
Ploughing, adhesion, and asperity deformation are believed to be the main effects of 
the variation of the coefficient of friction along the tool-chip contact interface in 
micro-cutting ( Venkatachalam and Liang 2007). 

In contrast, the increase of cutting edge radius and negative rake angle are 
favorable in the micro-cutting of brittle materials such as silicon. The negative 
rake angle produces high hydrostatic pressure that enables plastic deformation to 
occur in front of the cutting edge resulting in the ductile cutting mode (Yan 
et al. 2009). Negative rake angle and small depth of cut are the conditions needed 
to achieve ductile cutting mode where beyond this point brittle cutting conditions 
occurred resulting in more damage on the surface and sub-surface. In ductile cutting 
conditions, the amorphous phase will be produced in the subsurface (Minowa and 
Sumino 1992), and the amorphous layer thickness increases in proportion to depth 
of cut (Yan et al. 2009). 
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Fig. 2 Relation of minimum chip thickness with cutting edge radius (Chae et al. 2006) 

Ploughing occurs when the chip thickness (h) is below the minimum chip 
thickness (h m ) or when cutting using large edge radius and small uncut chip 
thickness, as shown in Fig. 2. Aside from yielding higher forces, a significant 
amount of ploughing also generates more burrs. 

The cause of ploughing and size effect in micro-machining is still not well 
understood. Waldorf et al. (1998) proposed a slip-line model by considering the 
ploughing effect to predict the forces in orthogonal cutting. Liu et al. (2006) 
developed a model for predicting the minimum chip thickness values that occur 
for ploughing, especially for 1040 steel and aluminum alloys A16082-T6. They 
found that the normalized minimum chip thickness (ratio of the minimum chip 
thickness to the cutting edge radius) was increased as the cutting velocity and tool 
edge radius increased when machining carbon steels due to the more dominant 
effect of thermal softening than the strain hardening for carbon steels. The normal- 
ized minimum chip thickness was found to almost stay constant over a range of 
cutting velocities and tool edge radii when micro-machining A16082-T6. 

Joshi and Melkote (2004) developed a strain gradient plasticity model in the 
primary deformation zone to explain the size effect. They proposed a model to 
calculate the specific energy. Lai et al. (2008) argued that material strengthening 
behavior is the main cause of the size effect in micro scale cutting. The strain 
gradient plasticity model is applied to model the material strengthening behaviors. 
Liu and Melkote (2006) also observed the size effect using strain gradient plasticity 
model and found that the material strengthens due to the strain gradient as the uncut 
chip thickness reduces to a few microns. Subbiah and Melkote (2006) observed the 
constant cutting force occurred at high rake angles (>70°). High rake angles 
minimized the energy in shear and friction due to the minimum shear in the chip. 
They found that there was a constant force component that will not change with a 
change in uncut chip thickness. 

Aramcharoen and Mativenga (2009) investigated the size effect when micro- 
milling HI 3 hardened tool steel. They found that the specific cutting force increases 
non-linearly when the ratio of depth of cut to the cutting edge radius is less than 
1 (Fig. 3). This phenomenon occurred due to the size effect where the ploughing 
and elastic deformations are dominant. The specific cutting force is calculated by 
dividing the feed force by product of chip load and axial depth of cut. The ratio of 
the depth of cut to the cutting edge radius for different materials is mainly affected 
by the microstructure, hardness, material phase, and tool geometry. In certain cases 
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Fig. 3 The specific cutting force in different ratio of uncut chip thickness (Aramcharoen and 
Mativenga 2009) 


of micro-machining, the cutting edge of the tool can also be assumed to be perfectly 
sharp when the minimum chip thickness is 40 % of the cutting edge radius for 
aluminum (Bourne et al. 2011). The use of diamond tool can minimize the size 
effect or cutting edge radius effect. However the diamond cannot be used to cut 
ferrous materials due to the excessive wear of the tool. 

Several works have also been conducted to investigate the size effect in micro- 
cutting using commercial finite element software. Weber et al. (2007) found that the 
non-linear increase of the specific cutting force with decreasing depth of cut is 
strongly influenced by the rate dependent material model when investigating the 
contribution of rounded cutting edge to the size effect in cutting and the character- 
istics of machined surface. It is observed that the increase of cutting speed is 
followed by the increase of specific cutting force and the depth of plastic zone 
increases almost linearly with cutting edge radius. 


Multiphase Microstructure of Materials 

Another important phenomenon that distinguishes micro-cutting from conventional 
macro-cutting is the assumption of a non-homogeneous workpiece as the depth of 
cut is on the same order as the size of the grains (micro structure) and the cutting 
edge radius (Chae et al. 2006). In the conventional cutting process the depth of cut 
is relatively larger than the grain size; hence the workpiece is considered homoge- 
neous and the cutting edge is sharp. In contrast, in micro-cutting the depth of cut is 
relatively comparable with the grain size and cutting edge radius; hence the 


27 Mechanical Micro-machining 


1095 



Fig. 4 Schematic view of relation between grain size and depth of cut in orthogonal cutting for 
macro (left) and micro (right) (Bissacco et al. 2005) 


workpiece materials cannot be considered as homogeneous and the cutting edge 
radius has a value. Figure 4 shows workpiece has homogeneous and isotropic 
properties in macro-cutting while in micro scale the grain size is relatively compa- 
rable with the depth of cut. 

In micro-cutting, the grain size influences the surface generation or surface 
integrity of the machined workpiece more significantly than in conventional cutting 
(Wang et al. 2008; Vogler et al. 2004; Simoneau et al. 2006a). The micro structure 
of workpiece effects the cutting forces and the chip formation more prominently if 
the materials consist of several phases with different hardness (Dornfeld 
et al. 2006). The forces fluctuate at different materials phase or state due to the 
different plastic deformation behavior. In addition, micro-cutting has a dominant 
vibration problem compared to conventional macro-cutting due to the multiphase 
micro structure (Vogler et al. 2004). Vogler et al. (2004) found that multiphase 
materials and size effect influence the mechanism of chip formation and surface 
generation especially in the bottom of the slot along the feed direction in the micro 
end-milling process. 

The cutting force in the micro-cutting process is also affected by the grain 
boundary of poly crystalline materials (Furukawa and Moronuki 1988). Furukawa 
and Moronuki (1988) observed that the mean values of cutting forces are almost the 
same but the dynamic components of the forces are different under the same cutting 
conditions for grain size of 160 and 620 pm when micro-cutting aluminum alloys. 
The dynamic variation of forces at grain boundaries of polycrystalline materials is 
affected mainly by different properties of each grain and different grain orienta- 
tions. In addition to the grain effect, the crystallographic orientations and active 
dislocation slips systems also have an effect on the surface quality and edge 
conditions especially burr formation (Min et al. 2006). Min et al. (2006) observed 
that the cutting mechanism, variation in the surface finish, and edge condition such 
as burrs are influenced by the crystallographic orientation and active dislocation 
slip systems when machining single crystal and polycrystalline oxygen free high 
conductivity (OFHC) copper using fly-cutting operation. It is argued that for 
poly crystalline materials, ductile plasticity and chip formation in machining are 
usually accomplished with the movement of dislocations along the preferred slip 
planes in the shear zone ahead of the tool. Moreover, there are certain 
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Fig. 5 SEM images of the defects in the machined surface of steel material. The dimpled surface 
in (a) shows examples of prows (P) and microvoids (V), microcrack (C) is shown in (b), (c), and 
(d) show dimples on the machined surface. Cutting direction is indicated by the large arrows 
(Simoneau et al. 2006b) 


crystallographic orientations more favorable for relatively easy dislocation glide 
and chip formation in single-crystal materials. 

In the micro-cutting of steel, Schimdt et al. (2002) suggested that the steel needs 
to be heat treated in order to have the finest and uniformly distributed carbides so 
that constant cutting conditions can be achieved in the micro-cutting process. In 
addition, the challenge in the micro-machining of steel is also due to the presence of 
pearlite and ferrite phases. These phases affect surface quality and forces. The 
micro-cutting of AISI 1045 steel that consists of pearlite and ferrite phases pro- 
duced prows, micro-voids, micro-cracks, and surface dimples on the machined 
surface (Simoneau et al. 2006b) (Fig. 5). In addition, surface dimples also occur 
at a “hard to soft” grain boundary. 


Surface Quality 


Producing good quality, appropriate surface finish, and geometry are important for 
the machined workpiece. The surface finish or surface texture based on ASME 
(1985) is defined as geometrical irregularities of solid materials surface while 
surface roughness is defined as the finer irregularities of the surface texture, usually 
resulting from the inherent action of the production process, such as feed marks 
produced during machining. The surface roughness is commonly indicated by 
parameters such as average roughness (. R a ) or root mean square roughness ( R q ) 
and calculated by Eqs. 1 and 2, respectively. 




1 

L 


r* 





Y(x)\dx 


( 1 ) 
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where : 

L is the sampling length 

Y (x) is the ordinate of the profile curve 

Surface finish and surface integrity of the machined workpiece affect wear, 
fatigue resistance, corrosion resistance, crack initiation and propagation, creep 
life, and friction during the component use. Surface roughness measurement 
can also be used to predict the criterion of tool wear in micro-milling 
(Hongtao et al. 2008). Cutting velocity and material removal volume have a 
significant effect on tool wear, which in the end will affect surface roughness. 
Geometry of tool, cutting parameters (spindle speed, feed rate, and depth of 
cut), irregularities in cutting operation (tool wear and lubricant), and material 
properties of tool and workpiece are several factors influencing the surface 
quality especially in the micro-milling process (Dimov et al. 2004). The tool 
wear especially affects the wall surface quality of the slot (Schmidt and 
Tritschler 2004). 

By applying low feed rate, better surface finish can be achieved but it gives lower 
material removal rate (Miao et al. 2007). It is observed that the surface roughness 
increased as the tool edge radius increased in micro-end milling (Vogler 
et al. 2004). In the micro-milling of copper materials, feed rate has a significant 
influence on surface roughness and burr height, spindle speed has a slight influence 
on the surface roughness and burr formation, whereas axial depth of cut has no 
significant influence on the surface roughness (Huo and Cheng 2010). 

In micro-milling a slot, the side wall where the tool edge finished the cut (down- 
milling) has better surface quality and dimensional accuracy than the other wall 
where the cutting edge enters into the cut (up-milling) (Min et al. 2008). The 
up-milling process produces uneven and rough surfaces especially in the wall due 
to the elastic deformation and recovery. Min et al. (2008) observed this phenomena 
when micro-milling annealed austenitic stainless steel 304 and aluminum 6061- 
T651 1 with the grain size varied in between 5 and 25 pm using two-fluted WC-Co- 
Carbide (8 % cobalt) micro-end-mills with 254 pm diameter. 

In order to produce low surface roughness in micro-cutting of steel using 
tungsten carbide tool, Schmidt et al. (2002) and Weule et al. (2001) recommended 
using high cutting velocities and hard, homogeneous material (Fig. 6). As can be 
seen in Fig. 6, in the soft material states (T450), surface roughness increases 
significantly at lower cutting velocities. In the case of micro-milling harder AISI 
Hll hardness 52 HRC, the up-milling has the largest roughness when the tool is 
new and turned to have better surface finish when reaching the stabilized cutting 
edge conditions. In contrast, in the micro-milling of soft AISI Hll of 42 HRC 
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Fig. 6 Influence of cutting velocity and material state on the surface roughness in the micro- 
milling of steel (Weule et al. 2001) 


hardness, the lowest surface roughness was found in the middle of the milling trace 
(Schmidt and Tritschler 2004). 

The quality of the micro-milled surface is also affected by the micro structure of 
the workpieces (Wang et al. 2008). In macro-size milling, the workpiece material is 
regarded as isotropic hence the surface roughness is affected mainly by the milling 
parameters (Wang et al. 2008) whereas in micro-milling, the grain sizes of the 
workpiece are normally between 1 and 100 pm, and the tools’ tip edge radii are less 
than 5 pm. When the tool passes through the workpiece along the length of the tool 
edge, there are only a countable number of grains being cut. 


Burr Formation 

Another aspect of surface quality that is normally discussed and seen in practice is 
burr formation. Burr is defined as the material plastic deformation produced at 
workpiece edges as a result of the machining or shearing process (Gillespie 1999). 
The existence of burrs on the workpiece may cause problems related to the 
dimensional accuracy, surface finish, ease of parts assembly, and furthermore 
increase the cost and time production for deburring (Chern 2006). Schaller 
et al. (1999) suggested that proper selection of machining parameters, quality and 
sharpness of cutting edge, cutting fluid and also material properties of the work- 
piece are several factors affecting the burr dimensions. In general, the burr dimen- 
sions especially those produced in micro-machining are difficult to quantify mainly 
due to the limitation of metrology equipment. One method that is commonly used to 
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Fig. 7 The burrs location in 
the slot-milling process 
(Gillespie 1976) 




measure burr is the stylus method. However this method is only suitable to measure 
burr heights (Aurich et al. 2009). 

According to Gillespie (1999) there are six physical processes which form burrs: 
lateral flow of material which occurs whenever a solid is compressed, bending of 
material, tearing of chip from workpiece, redeposition of material, incomplete 
cut-off, and flow of material into cracks. Hashimura et al. (1999) categorized 
burrs in face milling according to burr locations, burr shapes, and burr formation 
mechanisms. Exit burr is defined as the burr attached to the surface machined by the 
minor edge of the tool, side burr is the burr attached to the transition surface 
machined by the major edge, and a top burr is defined as a burr attached to the 
top surface of the workpiece. While Chern (Chern 2006) classified five types of 
burrs in face milling operation: knife type, wave type, curl type, edge breakout, and 
secondary burr. Whereas in the micro-milling process, Chern et al. (2007) catego- 
rized burrs as primary burr, needle-like burr, feathery burr, and minor burr. The 
primary burr, feathery burr, and needle-like burr are mainly produced on the 
up-milling side of the micro-slot wall. In general, the burrs in the micro-milling 
process are formed when the cutting edge leaves the surface being cut. 

As feature and/or part sizes become smaller, burr problems are more difficult to 
solve because while the burr size also becomes small, the burr to feature size ratio 
increases. Most prominent burrs in micro-milling occur on the top edge of the side 
walls of the slots, called top burrs (Fig. 7). In contrast to the surface quality of the 
side walls in the micro-slots, up-milling has smaller burrs than down milling. In 
the up-milling side the burr is a Poisson burr formed only by side bulging action. On 
the down milling side, the top burr is formed by the action of the chip material 
tearing away as it flows as well as side bulging deformation; hence the down milling 
burrs tend to be larger (Saptaji et al. 2012). 

The ductility or brittleness as well as the strain hardening behavior of materials 
have important contributions to the burr size (Gillespie 1979). Burr formation in 
ductile materials is related to the high elastic-plastic deformation that occurs in the 
machining, larger and more burrs are likely to be formed with increasing ductility 
(Shafer 1975). In the micro-grooving of aluminum alloys such as A15083-H116 
using a single-crystal diamond tool, top burr mainly occurs ahead of a tool and is 
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caused by expansion of material compressed after starting to flow around a tool 
rather than becoming part of a chip. While exit burr formation is formed when a thin 
membrane of material forms ahead of a tool and splits into two side segments and 
one bottom segment as the tool exits a workpiece (Bourne et al. 2011). 

The burr formation in brittle or hard materials is mainly affected by the tool wear 
(Weule et al. 2001). A broken tool has a larger negative rake angle, increases 
ploughing, and in the end increases exit burr dimensions significantly (Liang 
et al. 2009). In the micro-milling of hard materials such as steel, the down-milling 
side has more burrs (Schmidt and Tritschler 2004). Schmidt and Tritschler (2004) 
observed that the harder materials state has fewer burrs, however the harder 
material (AISI Hll, 52 HRC) can have more burrs than the softer material (AISI 
Hll, 42 HRC) especially due to stronger tool wear progress. 

In general, there are two approaches to overcome the burrs. One way is to reduce 
the burr during the machining process by changing the process parameters and 
another is to remove the burr (deburring) after the machining. Many researchers 
have discussed the burr reduction through different machining strategies. It is found 
that by decreasing the depth of cut, the burr height is decreased (Chern 2006; Lee 
and Dornfeld 2005). Weule et al. (2001) experimentally showed that high cutting 
velocities led to less burr formation in the micro-milling of steel. Coated tools can 
also reduce the burr size when micro-milling hardened steel (Aramcharoen 
et al. 2008). 

Lekkala et al. (2011) argued that in the micro-milling of aluminum A12124 and 
stainless steel SS-304 using a solid carbide end milling cutter with diameter 300 and 
400 pm, the speed has a less significant effect on the burr thickness and height 
whereas tool diameter, depth of cut, number of flutes, and the interaction between 
feed rate and number flutes have significant effect on the burr height. In addition, 
they observed that up milling produces more side exit burrs in micro-milling of 
aluminum. 

Deburring is defined as the removal of minute amounts of material from edges 
after major part features have been produced (Gillespie 1979). The deburring 
process is difficult to apply on micro-features produced by micro-milling; the 
process must be carefully conducted to avoid damage to the small features. Incor- 
rect selection of deburring techniques or parameters may also introduce dimen- 
sional errors, damage surface finish, and residual stresses. The additional deburring 
process and edge finishing in micro-scale and ultra precision machining may also be 
too expensive or impractical to implement (Min et al. 2006). 

Burrs in brass are removed by applying cyanoacrylate and subsequently cut with 
diamond milling (Fig. 8) (Schaller et al. 1999). Whereas for stainless steel, burrs 
can be removed using electrochemical polishing (Schaller et al. 1999). Other 
methods such as powder blasting using white fused alumina also removes the 
burrs when micro-milling brass (Yun et al. 2008). A similar method is also applied 
by Horsch et al. (2006) for quenched and tempered tool steel. Micro-peening and 
ultrasonic wet peening are also used for deburring and improving the surface finish 
(Horsch et al. 2006). The top burrs on the micro-slots of aluminum alloys, copper, 
and stainless steel can be reduced using additional micro-EDM (Jeong et al. 2009). 
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Fig. 8 SEM photo of micro 
features before burr removal 
( upper right ) and after burr 
removal ( lower left) by 
diamond milling in the 
micro-features filled with 
cyanoacrylate in brass 
(Schaller et al. 1999) 



b u rrs 


Burr formation can also be minimized if the material is restricted to deform in 
force direction due to workpiece geometry (Shafer 1975). Park and Dornfeld (2000) 
developed finite element models to study the exit burr formation of various work- 
piece exit edge angles of 60°, 80°, 90°, 100°, and 120° for the case of orthogonal 
cutting. It is observed that the use of larger tool rake angles (Park and Domfeld 
2000) can also reduce the burrs. The top burrs formed in micro -milling can be 
reduced by increasing the side edge angle of the micro-slots where the cutting edge 
will exit (Saptaji et al. 2012). This method can provide better resistance to the 
plastic deformation closer to the side edges. In addition, the use of a tapered tool is a 
possible technique to avoid top burrs in micro-milling where the higher taper angle 
produces smaller burrs (Saptaji et al. 2012). The top burr height can be decreased by 
making the tool edge radius smaller and the rake angle more positive (Liang 

et al. 2009). 


Micro-tools 

In the micro-cutting process especially micro-milling, the downscaling of the 
cutting mechanism from macro to micro-scale does not also automatically mean 
down scaling the tool shape without any modification (Li et al. 2011). The tool 
shape needs to be modified especially due to the different cutting parameters 
between macro and micro-scale machining. The tools used for micro-milling 
need to have high stiffness, high strength at the cutting edge corner, simple 
geometry, run out compensation, etc. (Li et al. 2011). Fang et al. (2003) argued 
that the tool shape such as the semi circle-based (D-type) shape for end-mills has 
higher rigidity than that of the two-flute (commercial type) end-mills, and produced 
better surface quality than that of the triangle-based (A-type) end-mills (Fig. 9). 

In the milling process, as the tool diameter becomes smaller, the spindle speed 
has to be increased in order to compensate the material removal rate as shown 
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Fig. 9 Various types of micro-milling tool shapes, (a) Two-flute end-mills, (b) A-type end-mills 
with a straight body, (c) D-type end-mills with a straight body, (d) A-type end-mills with a tapered 
body and (e) D-type end-mills with a tapered body (Fang et al. 2003) 


in Eq. 3. As a consequence of the small tool diameter and high spindle speed, the 
tool experiences more vibration and deflection which in the end influences the 
surface finish, generates error in the accuracy of the geometrical feature produced 
by the process, and increases tool wear (Uriarte et al. 2007). The vibration occurs 
because of the spindle run out and reduced tool rigidity as a consequence of 
reducing the tool diameter. Vibration can also occur due to the ploughing induced 
by the rapid increase of the thrust forces, due to the transition from shearing- 
dominated cutting to ploughing-dominated cutting. 


n.D.s 

1,000 



where : 

v = cutting speed (m/min) 
n — circular constant (3.14) 

D = diameter of the milling tool (mm) 
s = spindle speed (rpm) 

The vibration of the tool during the cutting process produces intermittent 
impacts on the tool, which are strong enough to cause discrete chip formation, 
tool wear, or even tool breakage (Miao et al. 2007; Malekian et al. 2009). The 
discrete chip formation generally occurs at low feed rate or low axial depth of cut in 
which the ratio of the depth of cut to the cutting edge radius is less than the 
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Fig. 10 Tool stiffness problems: (a) Tool run-out, (b) Tool setting error, (c) Tool deflection 
(Ryu et al. 2006) 
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minimum depth of cut or uncut chip thickness. Chem et al. (2007) suggested that 
the amount of tool engagement must be kept to a minimum in order to avoid the 
undesirable premature tool failure problem in micro-cutting process. 

The tool stiffness, such as tool run-out, tool setting error, tool tilting, and tool 
deflection, also affects the generation of the surface finish (Chern et al. 2007; Ryu 
et al. 2006). Tool run-out consists of radial run-out and axial run-out (Fig. 10a). 
Meanwhile, tool setting error can be defined as eccentricity and tilting between tool 
and axis and spindle axis (Fig. 10b). The tool deflection effect is commonly caused 
by high cutting forces (Fig. 10c). 

In order to perform the micro-cutting process successfully, the micro-tools need to 
be precise and strong. Focused ion beam (FIB) technique can be used to produce 
micro-tools such as tungsten carbide used for micro-milling or micro -grooving 
(Schmidt and Tritschler 2004; Adams et al. 2000). This method has been shown 
to successfully produce micro-grooving tools with tool edge radius of about 0.4 pm 
(Adams et al. 2000). In addition, wire electro -discharge grinding (WEDG) can 
be used to produce a tungsten carbide micro-milling tool with diameter of about 
100 pm (Chem et al. 2007). The accuracy of the machined workpiece produced by 
micro-milling is also influenced by the specification of the machine tool. The 
machine tool is required to have high motion accuracy, stiffness, thermal stability, 
precise spindle bearings, and high resolution of linear and rotary motions (Domfeld 
et al. 2006). 


Summary 

In this chapter, a brief explanation of the mechanical micro-machining process and 
its challenges has been presented and discussed. The demands for micro -features 
and micro-components in many industries are increasing with the challenge being 
especially to manufacture these products using an optimum process. Mechanical 
micro-machining is one of the alternative processes due to its advantages. However 
challenges related with this process such as size effect, micro structures of materials, 
surface quality, burrs, and also tool performance need to be studied and solved so 
that the micro-machining process can have better performance in the production of 
micro -features. Several solutions are offered to encounter these challenges: 

1 . The size effect can be minimized by using diamond tools; however it may not be 
applied for ferrous materials. 

2. The effect of multiphase micro structures of materials can be minimized by 
preparing finer and homogenous grains in the materials. 

3. Good surface quality can be achieved by using optimum cutting parameters and 
modifying the micro structure of the workpiece. 

4. The burr problem can be avoided by optimizing cutting parameters and using an 
additional deburring process. 

5 . The tool modifications can be applied in order to have better performance of the 
micro-tools. 
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The solutions of one issue must not cause other issues to appear since the issues 
are related to each other and simultaneously influence the process performance and 
quality of the machined workpiece. More alternative solutions need to be investi- 
gated in order to optimize the mechanical micro-machining process. 


References 

Adams DP, Vasile MJ, Krishnan ASM (2000) Microgrooving and microthreading tools for 
fabricating curvilinear features. Precis Eng 24(4):347-356 
ANSI/ASME B46.1 (1985) Surface texture (surface roughness, waviness and lay). ASME, 
New York 

Aramcharoen A, Mativenga PT (2009) Size effect and tool geometry in micromilling of tool steel. 
Precis Eng 33(4):402-407 

Aramcharoen A et al (2008) Evaluation and selection of hard coatings for micro milling of 
hardened tool steel. Int J Mach Tools Manuf 48(14): 1578-84 
Aurich JC et al (2009) Burrs-analysis, control and removal. CIRP Ann Manuf Technol 58(2):5 19-542 
Becker EW et al (1986) Fabrication of microstructures with high aspect ratios and great structural 
heights by synchrotron radiation lithography, galvanoforming, and plastic moulding (LIGA 
process). Microelectron Eng 4( 1):35— 56 

Bissacco G, Hansen HN, De Chiffre L (2005) Micromilling of hardened tool steel for mould 
making applications. J Mater Process Technol 167(2-3): 20 1-207 
Bourne KA, Kapoor SG, DeVor RE (2011) Study of the mechanics of the micro-groove cutting 
process. In: Proceedings of the ASME 2011 international manufacturing science and engineer- 
ing conference, MSEC2011 

Chae J, Park SS, Freiheit T (2006) Investigation of micro-cutting operations. Int J Mach Tool 
Manuf 46:313-332 

Chern G-L (2006) Experimental observation and analysis of burr formation mechanisms in face 
milling of aluminum alloys. Int J Mach Tools Manuf 46(12-13): 15 17-25 
Chern G-L et al (2007) Study on burr formation in micro-machining using micro-tools fabricated 
by micro-EDM. Precis Eng 3 1(2): 122-129 

Dimov S et al (2004) Micromilling strategies: optimization issues. Proc Inst Mech Eng Part B Eng 
Manuf 218(7):73 1-736 

Dornfeld D, Min S, Takeuchi Y (2006) Recent advances in mechanical micromachining. CIRP 
Ann Manuf Technol 55(2):745-768 

Fang FZ et al (2003) Tool geometry study in micromachining. J Micromech Microeng 13(5):726 
Friedrich C, Kikkeri B (1995) Rapid fabrication of molds by mechanical micromilling: process 
development. In: Microlithography and metrology in micromachining, Austin, TX. Society of 
Photo-Optical Instrumentation Engineers, Bellingham 
Furukawa Y, Moronuki N (1988) Effect of material properties on ultra precise cutting processes. 
CIRP Ann Manuf Technol 37(1): 1 13-1 16 

Gillespie LK (1976) Burr formation and properties. Deburring capabilities and limitations. Society 
of Manufacturing Engineers (SME), Dearborn 
Gillespie LK (1979) Deburring precision miniature parts. Precis Eng 1(4): 189-198 
Gillespie LK (1999) Deburring and edge finishing handbook. SME, Dearborn 
Guber AE et al (2004) Microfluidic lab-on-a-chip systems based on polymers - fabrication and 
application. Chern Eng J 101( 1— 3):447— 53 

Hashimura M, Hassamontr J, Dornfeld DA (1999) Effect of in-plane exit angle and rake angles on 
burr height and thickness in face milling operation. J Manuf Sci Eng 121(1): 13—19 
Hongtao L et al (2008) Modelling and experimental analysis of the effects of tool wear, minimum 
chip thickness and micro tool geometry on the surface roughness in micro-end-milling. 
J Micromech Microeng 18(2):025006 (12 pp) 



1106 


K. Saptaji 


Horsch C, Schulze V, Lohe D (2006) Deburring and surface conditioning of micro milled 
structures by micro peening and ultrasonic wet peening. Microsyst Technol 12(7):69 1-696 
Huo D, Cheng K (2010) Experimental investigation on micromilling of oxygen- free, high-con- 
ductivity copper using tungsten carbide, chemistry vapour deposition, and single-crystal 
diamond micro tools. Proce Inst Mech Eng Part B J Eng Manuf 224(6):995-1003 
Hupert ML et al (2007) Evaluation of micromilled metal mold masters for the replication of 
microchip electrophoresis devices. Microfluid Nanofluid 3(1): 1-11 
Hyuk-Jin K, Sung-Hoon A (2007) Fabrication and characterization of microparts by mechanical 
micromachining: precision and cost estimation. Proc Inst Mech Eng B-J Eng 221(B2):231-40 
Jeong YH et al (2009) Deburring microfeatures using micro-EDM. J Mater Process Technol 

209(14):5399-5406 

Joshi SS, Melkote SN (2004) An explanation for the size-effect in machining using strain gradient 
plasticity. J Manuf Sci Eng 126(4):679-684 

Kussul EM et al (1996) Micromechanical engineering: a basis for the low-cost manufacturing of 
mechanical microdevices using microequipment. J Micromech Microeng 6(4) :4 10-425 
Lai X et al (2008) Modelling and analysis of micro scale milling considering size effect, micro 
cutter edge radius and minimum chip thickness. Int J Mach Tool Manuf 48(1): 1-14 
Lee K, Dornfeld DA (2005) Micro-burr formation and minimization through process control. 
Precis Eng 29(2):246-252 

Lee JH, Park SR, Yang SH (2006) Machining a Micro/Meso scale structure using a minaturized 
machine tool by using a conventional cutting process. J Manuf Sci Eng 128(3):820-825 
Lekkala R et al (2011) Characterization and modeling of burr formation in micro-end milling. 
Precis Eng 35 (4): 625-637 

Li P et al (2011) Design of micro square endmills for hard milling applications. Int J Adv Manuf 
Technol 57(9-12):859-870 

Liang YC et al (2009) Modeling and experimental analysis of microburr formation considering tool 
edge radius and tool-tip breakage in microend milling. J Vac Sci Technol B 27(3): 153 1-1535 
Liow JL (2009) Mechanical micromachining: a sustainable micro-device manufacturing 
approach? J Cleaner Prod 17 (7): 662-667 

Liu K, Melkote SN (2006) Material strengthening mechanisms and their contribution to size effect 
in micro-cutting. J Manuf Sci Eng 128(3):730-738 
Liu X et al (2004) The mechanics of machining at the microscale: assessment of the current state of 
the science. J Manuf Sci Eng, Trans ASME 126(4): 666-67 8 
Liu X, DeVor RE, Kapoor SG (2006) An analytical model for the prediction of minimum chip 
thickness in micromachining. J Manuf Sci Eng 128(2):474-481 
Lucca DA, Rhorer RL, Komanduri R (1991) Energy dissipation in the ultraprecision machining of 
copper. CIRP Ann Manuf Technol 40(l):69-72 

Lucca DA, Seo YW, Komanduri R (1993) Effect of tool edge geometry on energy dissipation in 
ultraprecision machining. CIRP Ann Manuf Technol 42(l):83-86 
Malekian M, Park SS, Jun MBG (2009) Tool wear monitoring of micro-milling operations. 
J Mater Process Technol 209(10):4903-4914 

Mecomber JS, Hurd D, Limbach PA (2005) Enhanced machining of micron-scale features in 
microchip molding masters by CNC milling. Int J Mach Tool Manuf 45(12-13): 1542-1550 
Miao JC et al (2007) Review of dynamic issues in micro-end-milling. Int J Adv Manuf Technol 
3 1(9- 10): 897-904 

Min S et al (2006) Surface and edge quality variation in precision machining of single crystal and 
polycrystalline materials. Proc Inst Mech Eng Part B J Eng Manuf 220(4):479-487 
Min S et al (2008) A study on initial contact detection for precision micro-mold and surface 
generation of vertical side walls in micromachining. CIRP Ann Manuf Technol 57(1): 109-1 12 
Minowa K, Sumino K (1992) Stress-induced amorphization of a silicon crystal by mechanical 
scratching. Phys Rev Lett 69(2):320-2 

Morgan CJ, Vallance RR, Marsh ER (2004) Micro machining glass with polycrystalline diamond 
tools shaped by micro electro discharge machining. J Micromech Microeng 14(12): 1687 



27 Mechanical Micro-machining 


1107 


Park IW, Domfeld DA (2000) A study of burr formation processes using the finite element 
method: part II-The influences of exit angle, rake angle, and backup material on burr formation 
processes. J Eng Mat Technol 122(2):229-237 

Rusnaldy TK, Kim H (2008) An experimental study on microcutting of silicon using a 
micromilling machine. Int J Adv Manuf Technol 39(1):85— 9 1 
Ryu SH, Choi DK, Chu CN (2006) Roughness and texture generation on end milled surfaces. Int J 
Mach Tool Manuf 46(3-4) :404-4 12 

Saptaji K, Subbiah S, Dhupia JS (2012) Effect of side edge angle and effective rake angle on top 
burrs in micro-milling. Precis Eng 36(3):444-450 
Schaller T et al (1999) Microstructure grooves with a width of less than 50 pm cut with ground 
hard metal micro end mills. Precis Eng 23(4):229-235 
Schmidt J, Tritschler H (2004) Micro cutting of steel. Microsys Technol 10(3): 167-174 
Schmidt J et al (2002) Requirements of an industrially applicable microcutting process for steel 
micro-structures. Microsyst Technol 8(6):402-408 
Shafer F (1975) Entgraten. Krausskopf-Verlog, Mainz 

Simoneau A, Ng E, Elbestawi MA (2006a) Chip formation during microscale cutting of a medium 
carbon steel. Int J Mach Tools Manuf 46(5):467-81 
Simoneau A, Ng E, Elbestawi MA (2006b) Surface defects during microcutting. Int J Mach Tool 
Manuf 46(12-13): 1378-1387 

Subbiah S, Melkote SN (2006) The constant force component due to material separation and its 
contribution to the size effect in specific cutting energy. J Manuf Sci Eng 128(3):8 1 1—8 15 
Uriarte L et al (2007) Error budget and stiffness chain assessment in a micromilling machine 
equipped with tools less than 0.3 mm in diameter. Precis Eng 3 1(1): 1-12 
Venkatachalam S, Liang SY (2007) Effects of ploughing forces and friction coefficient in 
microscale machining. J Manuf Sci Eng Trans ASME 129(2):274-280 
Vogler MP, DeVor RE, Kapoor SG (2004) On the modeling and analysis of machining perfor- 
mance in micro-endmilling, part i: Surface generation. J Manuf Sci Eng Trans ASME 
126(4):685-694 

Waldorf DJ, DeVor RE, Kapoor SG (1998) A slip-line field for ploughing during orthogonal 
cutting. J Manuf Sci Eng 120(4): 693-699 

Wang JS et al (2008) Surface generation analysis in micro end-milling considering the influences 
of grain. Microsyst Technol 14(7):937-942 

Weber M et al (2007) Investigation of size-effects in machining with geometrically defined cutting 
edges. Mach Sci Technol 1 1(4):447-473 

Weule H, Huntrup V, Tritschle H (2001) Micro-cutting of steel to meet new requirements in 
miniaturization. CIRP Ann Manuf Technol 50(l):61-64 
Yan J et al (2009) Fundamental investigation of subsurface damage in single crystalline silicon 
caused by diamond machining. Precis Eng 33(4):378-386 
Yun D, Seo T, Park D (2008) Fabrication of biochips with micro fluidic channels by micro 
end-milling and powder blasting. Sensors 8(2): 1308-1320 



Hybrid Machining Processes 

Murali Meenakshi Sundaram 


Contents 

Introduction 1110 

Abrasive-Based Hybrid Machining Processes 1113 

Working Principle of Abrasive Electrical Discharge Machining 1113 

Effect of Hybrid Interaction on Work Surface in AEDM Process 1114 

Working Principle of Abrasive Electrochemical Machining 1114 

Effect of Hybrid Interaction on Work Surface in the AECM Process 1115 

Thermal-Based Hybrid Machining Processes 1116 

Working Principle 1116 

Effect of Hybrid Interaction on Work Surface 1116 

Electro-Based Hybrid Machining Processes 1117 

Electrochemical Discharge Machining (ECDM) 1117 

Vibration- Assisted Processes 1119 

Vibration- Assisted Electrical Discharge Machining 1119 

Working Principle 1119 

Effect of Hybrid Interaction 1120 

Vibration- Assisted Micro-ECM 1121 

Working Principle 1121 

Effect of Hybrid Interaction 1122 

Process Capabilities 1122 

Summary 1123 

References 1124 


M.M. Sundaram (ISI) 

Department of Mechanical and Materials Engineering, University of Cincinnati, Cincinnati, 
OH, USA 

e-mail: murali.sundaram@uc.edu 


© Springer-Verlag London 2015 

A.Y.C. Nee (ed.), Handbook of Manufacturing Engineering and Technology , 
DOI 10.1007/978-1-4471-4670-4 15 


1109 


1110 


M.M. Sundaram 


Abstract 

Advanced engineering materials, including inter alia fiber-reinforced compos- 
ites, super alloys, and ceramics, offer superior thermal, physical, chemical, and 
mechanical properties in the form of better strength, higher weight-to-volume 
ratio, improved corrosion, and wear resistance, to name a few. These properties 
have permitted the design of products with better properties, but they have also 
made these advanced materials difficult to machine by conventional machining 
processes thus making them unsuitable and uneconomical. Complex 3-D forms, 
tight tolerances, acute surface finishes, and stringent design constraints have 
necessitated research of new machining methods capable of processing the 
difficult-to-machine advanced materials economically and accurately. The 
basic idea behind a hybrid machining process is the synergistic combination of 
constituent machining processes in order to overcome their individual short- 
comings and achieve effective material removal. This chapter has summarized 
some important aspects of the literature on hybrid machining processes. Machin- 
ing processes in general are categorized into mechanical, thermal, chemical, and 
electrochemical processes based on their dominant material removal mecha- 
nism. Working principles and mechanisms material removal of existing hybrid 
processes and their capabilities have been discussed. It is noted that the complex 
physicochemical, electrical, thermal, and mechanical interactions associated 
with hybrid machining processes are yet to be fully understood, and there exists 
a knowledge gap due to the unresolved issues. 

Introduction 

The unprecedented challenges faced by today’s manufacturing industry are due to 
the ever-increasing demands from three distinctly different requirements, namely, 
(1) advanced material machining, (2) miniaturization of components, and (3) com- 
plex shape machining. Added to these are the routine strive for increased produc- 
tivity and improved quality. Advanced materials, owing to their exceptional 
thermal, chemical, and mechanical properties (such as improved strength, heat 
resistance, wear resistance, and corrosion resistance), offer enormous economic 
benefits to manufacturing industries through improved product performance and 
product design. However, their augmented mechanical properties render traditional 
machining processes unsuitable or uneconomical to machine these materials. For 
example, during the traditional grinding of polycrystalline diamond, the tool wear 
rate is 50-200 times higher than the work material removal rate. The cost incurred 
in the machining structural ceramics (such as silicon nitride) often exceeds 50 % of 
the total production costs in the engine industry (Kozak and Rajurkar 2000). 
Innovative manufacturing techniques or modifications of existing methods are 
needed to tackle such situations. 

Apart from the use of advanced materials, the ongoing trend of miniaturization is 
another challenge posed to manufacturing industry. Miniaturization offers several 
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advantages. Micro products and components occupy less space and consume less 
energy and material. They can be cheaper. Downsizing of the cutting tools is 
imperative to machine these components by traditional processes such as drilling 
and milling which inevitably require the use of tools harder and stronger than the 
workpiece. However, downsizing structurally weakens the tool and makes it vul- 
nerable to rapid tool wear and sudden breakages. Re-sharpening of these tiny tools 
is challenging if not impossible. It is preferable to opt for nontraditional/hybrid 
micromachining processes to machine micromechanical components with tight 
tolerances and fine surface quality. In addition to the advanced materials and 
miniaturization, stringent design requirements also cause major problems in 
manufacturing industry. More and more complex shapes (such as an aerofoil 
section of a turbine blade, complex cavities in dies and molds, noncircular, small, 
and curved holes) and low-rigidity structure are often needed. Traditional machin- 
ing is ineffective in machining these parts. Hybrid machining processes often have 
the potential to meet these challenges. 

According to the International Academy for Production Engineering (CIRP), 
hybrid machining process (HMP) is a process based on the simultaneous and 
controlled interaction of process mechanisms and/or energy sources/tools having 
a significant effect on process performance (Lauwers 2011). It implies that the 
constituent processes in a hybrid process interact in the same machining zone at the 
same time. In general, a hybrid manufacturing process quintessentially has the “1 + 
1 = 3” effect as stated in (Schuh et al. 2009) and possesses one or more of the 
following advantages: 

1. Enhancing the existing capability (e.g., higher material removal rate) 

2. Introducing new capability (e.g., EDM of insulating ceramics) 

3. Alleviating the prevailing problems (e.g., electrochemical discharge machining 

not only reduces the surface damage of EDM but also provides more accurate 

shapes than ECM) 

Typical hybrid machining processes and their constituents are shown in Table 1. 
Feasibility of processes marked “?” in the Table 1 has not yet been adequately 
explored. 

Hybrid machining processes may broadly be classified into two groups. The first 
group consists of processes wherein the constituent machining methods work 
synergistically to remove the work material. The second group involves processes 
wherein controlled application of different process mechanisms yields a combined 
effect that is traditionally achieved by separate techniques. For example, in case of 
grind hardening, the heat generated during grinding is used to achieve controlled 
hardening of the workpiece along with material removal. The first group can be 
further subdivided into “assisted processes” and “mixed processes.” Assisted pro- 
cesses involve a primary process responsible for the material removal, while the 
secondary process only assists it. Mixed processes involve several material removal 
mechanisms originating from the constituent processes, or even new mechanisms 


Table 1 Hybrid machining processes (Rajurkar et al. 2006; Komanduri et al. 1997; Mognol et al. 2006; Knights 2005; Wang et al. 2003; Radhakrishnan 
et al. 2003; Kuo et al. 2003; Sugioka et al. 2002; Hur et al. 2002; Kurita et al. 2001; Kozak and Oczos 2001; Aspinwall et al. 2001; Mediliyegedara et al. 2005; 
Sommer 2000; Kim and Choi 1995; Yeo et al. 2004; Reyntjens and Puers 2001; McGeough 2002; Dabrowski et al. 2006) 
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altogether. In this book chapter, an energy-based approach is used to classify and 
discuss various hybrid machining processes. 


Abrasive-Based Hybrid Machining Processes 

The three most commonly used abrasive-based hybrid machining processes are 
(a) abrasive electrical discharge machining (AEDM), (b) abrasive electrochemical 
machining (AECM), and (c) abrasive electrochemical discharge machining 
(AECDM). For these process, the primary tooling may be in the form of 
(i) metallic electrodes deposited with abrasive grit, (ii) metallic electrodes and 
free abrasive grit, or (iii) segmented metallic electrodes. 


Working Principle of Abrasive Electrical Discharge Machining 

Abrasive electrical discharge machining (AEDM) offers the synergistic effect of 
electro-erosion process and micro-cutting process for increased material removal. 
In this process, an abrasive tool is separated from the workpiece which is immersed 
in a dielectric medium. Application of electric voltage across the tool and work- 
piece results in a series of current discharges or sparks, which erodes the work 
material. This spark erosion process is assisted by the abrasive grit that impacts the 
workpiece causing mechanical dislodgement of material. Adjustable process 
parameters for the AEDM process include the pulse-on time, duty factor, mean 
current, voltage, and tool rotation speed. 

Another variation of AEDM process involves addition of abrasive grains in 
the dielectric medium, instead of the use of an abrasive tool. The presence of 
abrasive grains reduces the capacitance in the discharge gap by increasing the 
gap size. This results in higher dispersion of sparks, which is useful in machining 
larger workpiece areas. Material removal from the surface is more when the 
abrasive slurry concentration is high than when the concentration is low. Smaller 
grit size of abrasives mixes more uniformly in the slurry and hence a better 
surface finish is obtained (Yan et al. 2002). Electrical discharge machining 
(EDM) with point grinding has been performed on a high-speed machining 
center in the work of Aspinwall et al. (2001) for processing nickel-based 
superalloys. Widespread application of this technique requires development of 
a single chemical that can serve as cutting fluid for abrasive tool as well as 
dielectric for the EDM. Liu et al. have proposed micro-EDM combined with 
high-frequency dither grinding, which employs electrode as lapping tool to 
achieve form precision in micromachining (Liu et al. 2006). Dither grinding 
was also reported to remove discharge craters on the work surface after micro- 
EDM, leading to better surface roughness. Wang et al. have demonstrated the 
possibility of dressing diamond abrasive grinding wheel using EDM, which can 
be used for in-process refurbishing of worn tools during abrasive EDM process 
(Wang et al. 1996). 
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Fig. 1 Interactions in machining zone for the AECM process (Kozak and Oczos 2001) 

Effect of Hybrid Interaction on Work Surface in AEDM Process 

Productivity of AEDM process is mainly influenced by the hydrodynamic condi- 
tions of the dielectric flow. Studies have reported twofold productivity increase in 
machining Al-SiC composite by the AEDM process, with five times higher MRR, 
as compared to the EDM process (Kozak and Oczos 2001). 


Working Principle of Abrasive Electrochemical Machining 

Machining zone for abrasive electrochemical machining (AECM) process as illus- 
trated in Fig. 1 is composed of two regions, each with a distinct material removal 
mechanism, viz., the electrochemical zone , where anodic dissolution prevails, and a 
grinding + electrochemical zone , where abrasive cutting predominates. In order to 
exploit the synergism of abrasive cutting with electrochemical work dissolution, the 
tool feed rate cannot be decreased below a certain threshold value; otherwise the 
smallest gap between the tool and the work surface will be greater than the average 
protrusion height of abrasive grains deposited on the tool. Further, factors such as 
electric field and electrolyte flow within the gap as also hydrodynamic effects in the 
anode vicinity determine the extent of abrasive action in the AECM process. 
Abrasive grinding electrochemical machining using resin-bonded wheels has 
been reported; however, the deficiencies of the process like wheel wear still remain 
a challenge (Curtis et al. 2009). 

Alternatively, abrasives like silicon carbide are suspended freely in the electro- 
lyte in the vicinity of the workpiece. These abrasives along with a wire cathode are 
responsible for slicing silicon wafers with better production rate, less cost, and good 
surface integrity (Wang et al. 2011). Electrochemical grinding has been used to 
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Fig. 2 Effect of current 
density on micro-hardness of 
3H13 steel in the AECM 
process (Kozak 1996) 



i[A/cm 2 ] 


machine small holes with sharp edges. The process involves coating the tool 
electrode with abrasives and rotating it at high speed. Initially, material is removed 
through the action of ECM and then the holes are ground for better finish through 
contact machining (Zhu et al. 2011). 


Effect of Hybrid Interaction on Work Surface in the AECM Process 

In the machining of composite work materials by the AECM process, constituent 
phases may undergo unequal electrochemical dissolution leading to weakening of 
interphase bonding. For instance, in AECM of WC-Co sintered carbides, the cobalt 
matrix surrounding the carbide precipitates undergoes more dissolution due to 
higher electrochemical machining coefficient K v . This reduces coherence of the 
carbide and matrix phases after machining. In addition, hydrogen diffusion into the 
work material may cause changes in residual stress and increase material brittle- 
ness. The plot in Fig. 2 indicates that the micro-hardness of 3H13 steel decreases 
during dissolution at lower current densities (Kozak 1996). Reduction of micro- 
hardness of work surface, in turn, reduces tangential and normal forces exerted on 
individual abrasive grains. This leads to considerable savings in machining costs 
due to reduced wear of expensive diamond abrasive grit. 

During the AECM process, electrochemical heterogeneity of work material 
increases due to changes in chemical potential occurring due to escapes of 
dislocations. Such material heterogeneity proves advantageous when machining 
metal-matrix composites, since it reduces the stress levels that are otherwise 
produced due to abrasive action. Curtis et al. have investigated the machining of 
difficult-to-cut materials by incorporating grinding tools in the electrochemical 
machining (ECM) setup (Curtis et al. 2009). Electroplated abrasive tools with 
CBN superabrasives were reported to yield good machining performance along 
with lesser grit loss. 
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Thermal-Based Hybrid Machining Processes 

Thermal energy-based HMPs commonly use energy sources such as laser, plasma 
beam, or electron beam for material removal from such workpieces as hardened 
steel. 


Working Principle 

The thermal energy source in the form of laser, plasma beam, or electron beam is 
directed toward the machining zone ahead of the cutting tool. The concentration of 
heat causes softening of the surface layer of work material thereby resulting in a 
phase change due to melting or vaporization. Therefore, ductile deformation occurs 
rather than brittle deformation during cutting. It is known that the resistance offered 
by the workpiece to material removal by mechanical action is a function of the 
workpiece temperature below its melting point, generated in the machining zone. 
Stress- strain curves and fracture properties derived from tensile tests indicate 
decrease in strength and increase in ductility with rise in temperature. This explains 
the consequent reduction in machining forces during contact of the cutting tool. In 
general, for thermal-based hybrid machining processes, the use of thermal energy 
increases the machinability of the work material for the subsequent mechanical 
removal of material by cutting tool. 

In laser-assisted ECM, a laser beam is focused on an area exposed to the 
electrolyte jet which dissolves a specific region improving precision and surface 
roughness. This process also reports a higher material removal rate due to temper- 
ature increase in the region targeted by the laser beam (Pajak et al. 2006). The 
machining precision decreases with increasing depth but can be improved by 
increasing the speed of the electrolyte jet (Stephen and Vollertsen 2010). It has 
been reported that the local temperature rise does not cause any thermal damage and 
so the machined surface is free of stress (De Silva et al. 2011). This process has the 
capability to generate complex 3-D patterns like micro-stents (Kasashima and 
Kurita 2012). 


Effect of Hybrid Interaction on Work Surface 

While thermal-based hybrid machining processes increase work machinability, 
they also lead to undesirable phenomenon such as subsurface cracks, formation 
of striations, and heat-affected zone on the workpiece. The heat-affected zone, in 
particular, is characterized by cracks, the density of which increases with rise in 
temperature. Compared to traditional cutting process, the stress penetration depth in 
laser-assisted machining process is found to be much smaller. The surface integrity 
of the workpiece is found to improve with uniform values of stress and hardness 
over the machined surface. This is attributed to the deeper and more uniform 
plastically deformed surface layer as compared to conventional cutting. Further, 
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investigation of machined workpieces reveals the absence of smeared material 
which is indicative of favorable residual compressive stresses. Some studies have 
reported better surface finish with more heat application for hard materials. How- 
ever, the heat-affected zone also enlarges at higher temperatures. Therefore, an 
engineering compromise needs to be reached between surface quality and heat- 
affected zone for thermal-based HMPs. 


Electro-Based Hybrid Machining Processes 
Electrochemical Discharge Machining (ECDM) 

Most hybrid machining processes are restricted to a certain range of materials. Few 
among them are capable of machining ceramics like glass. Electrochemical dis- 
charge machining, shown schematically in Fig. 3, is a unique process that uses the 
effects of bubble formations caused by electrochemistry and electrical discharges 
between the tool and the electrolyte. The product obtained as a result of this 
combination is a technology capable of machining a wide variety of materials 
including brittle ceramics like glass. The ability to machine complex features 
with high accuracy is an important forte (Cao et al. 2009; Zheng et al. 2007). 

Working Principle 

The process requires a DC power supply between the electrodes. However, in order 
to obtain a better surface finish, a pulsed DC power supply is recommended (Cook 
et al. 1973). The electrodes are dipped a few millimeters inside an electrolyte and 
kept at a distance from each other. The tool electrode has a considerably smaller 
surface area than the counter electrode. The system follows a characteristic curve 


Tool-electrode (-) 


Work-piece 


Work-piece 

holder 



Voltage Supply 



Counter electrode 

(+) 


Electrolyte 



Fig. 3 ECDM schematic 
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Fig. 4 Voltage-current characteristics (Wiithrich and Fascio 2005) 


shown in Fig. 4 with the increase in voltage. As the voltage increases, hydrogen 
bubbles are formed at the smaller electrode (tool). The tool is used as the cathode to 
prevent anodic dissolution. The current increases with the increase in voltage from 
point A to B. At this stage the current reaches a limiting point and ceases to increase 
with the voltage (point B-C). The production of hydrogen bubbles intensifies. With 
further increase in voltage, at a certain point, namely, critical point, sparks occur at 
the tool electrode and the current drops. These sparks provide the required energy to 
remove material. The values of current and voltage at the critical point are critical 
voltage and critical current. As the formation of hydrogen bubbles intensifies, at the 
critical point, the bubbles coalesce in a manner so as to separate the tool from the 
electrolyte thus causing the sparks. Region C-D signifies unstable sparking with 
high variation in the current values and should be avoided to improve reproduc- 
ibility. With further increase in voltage, the current stabilizes along with the sparks 
(point D-E). 

Effect of Hybrid Interaction 

The material removal rate increases with increase in voltage due to the increase in 
thermal energy of sparks (Basak and Ghosh 1996, 1997). Chemical interaction of 
the process can be understood by investigating the effect of concentration of 
electrolyte on the material removal. The material removal rate increases to a certain 
extent with the increase in concentration (Bhattacharyya et al. 1999; Jain 
et al. 1991). As increase in temperature aids in the chemical activity, the material 
removal also increases with increase in temperature of electrolyte (Cook et al. 1973; 
Jain and Chak 2000; Jain et al. 2002). Tool wear increases with concentration of 
electrolyte and voltage (Tandon et al. 1990). 
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Fig. 5 ECDM of complex features on glass (Cao et al. 2009; Zheng et al. 2007) 


Process Capabilities 

Complex features have been fabricated using ECDM on glass. Some of these 
features are demonstrated in Fig. 5. A wall thickness of 15 pm has been achieved. 

Fabrication of high aspect ratio (11) micro-holes on glass (Fig. 6) has opened the 
door toward a plethora of new applications in the engineering and medical fields 
(Jui et al. 2013). 


Vibration-Assisted Processes 

Ultrasonic vibration is used along with various conventional and nonconventional 
micromachining processes like turning, drilling, grinding, EDM, ECM, ECDM, and 
even nanoscale machining. Making use of ultrasonic vibrations in machining has 
the following advantages (Brehl and Dow 2008): 

• Eower tool wear due to lower forces on the tool 

• Improved surface finish and form accuracy 

• Greater depth of cut for ductile mode machining of brittle materials 

• Eesser tendency to form burrs 

Vibration-Assisted Electrical Discharge Machining 
Working Principle 

Electro-based machining techniques, such as electrical discharge machining, 
employ a series of discrete electrical discharges between an electrode (cathode) 
and the workpiece (anode) immersed in a dielectric fluid. These electric sparks 
generate a plasma channel between the electrodes with temperature as high as 
10,000 °C, which causes melting of the material at the electrode surfaces. In 
micro-EDM one of the important issues is flushing. It becomes very difficult to 
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Fig. 6 Entrance and exit diameters of micro-hole on a 1.2 mm thick borosilicate glass plate 


flush out the debris from features of such small dimensions made by micro-EDM. 
In order to hybridize the electrical discharge machining (EDM) process, ultrasonic 
vibrations are applied to the cathode, which accelerates dielectric circulation and 
reduces machining times. Direct material removal is due to the cavitation phenom- 
enon where ultrasonic bubbles are formed during the low-pressure portion of the 
wave cycle (Sundaram et al. 2008). 


Effect of Hybrid Interaction 

In vibration-assisted EDM, either the tool or the workpiece can be vibrated. The 
workpiece can be vibrated either parallel to the direction of tool feed or perpendic- 
ular to the direction of tool feed. The pressure variations in the spark gap due to 
vibrating cathode surface lead to more efficient electric discharges. This results in 
greater material removal. The quality of machined surface is also improved in that 
micro -cracking and affected layer are reduced (Srivastava and Pandey 2012). 
Similarly, ultrasonic vibrations can also be applied to the workpiece (Shabgard 
et al. 2009) in the electrical discharge machining process. This was reported to 
improve the process stability by reducing the number of inactive current pulses. Use 
of ultrasonic vibrations has been found to be responsible for producing spheroidized 
debris particles in the EDM process, as against elliptic particles produced in their 
absence (Murti and Philip 1987). Thus, ultrasonically assisted EDM process is a 
potential method for producing fine metallic powders with oxide-free surfaces, 
which have varied industrial uses. Important process variables are the frequency 
of vibration and amplitude of vibration. A significant decrease in machining time as 
shown representatively in Fig. 7 is observed when ultrasonic vibration is used in 
EDM (Endo et al. 2008; Huang et al. 2003; Tong et al. 2008). A 70-80 % reduction 
in machining time has been reported (Lee 2005). 
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Fig. 7 Machining time of vibration-assisted machining (perpendicular vibration) (Endo 
et al. 2008 ) 


Vibration-Assisted Micro-ECM 

Micro-ECM in spite of having great advantages over other machining processes has 
some drawbacks. Maintaining the electrolyte flow as the micro-hole gets deeper 
becomes difficult. Therefore, the machining rate decreases as the depth of the hole 
increases. Also at low current densities, a passivation membrane is formed at the 
anode (workpiece). This cannot be removed by the ECM process easily and this 
affects machining. Ultrasonic vibration-assisted EDM improves the machining rate 
by countering the abovementioned problems. This process involves vibrating the 
tool electrode to agitate the abrasives suspended in the electrolyte for a good surface 
finish. A study of the geometry and type of the electrode which gives a well- 
polished surface is reported and the effect of ultrasonic energy is acknowledged 
(Pa 2007). This energy is also responsible for the removal of debris from the 
machining zone and creation of optimal hydrodynamic conditions affecting the 
surface layer (Skoczypiec 2011). The use of magnetic as well as ultrasonic energy 
has been reported to remove dregs out of the electrode gap. The process gives a very 
good surface finish in smaller time (Pa 2009). 


Working Principle 

The working principle is similar to that of vibration-assisted micro-EDM. 
A piezoelectric crystal is used to vibrate the work material or the tool. The ultrasonic 
pressure waves aid the machining process. Vibrations in the narrow machining gap 
with the electrolyte can be viewed as a micro-cell, and the higher the frequency of 
vibration, the more the pressure on the micro-cell (Yang et al. 2009). 
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Effect of Hybrid Interaction 

As shown in Fig. 8a, the machining time can be reduced by about 81 % using 
ultrasonic vibrations when compared to the conventional ECM process. Also higher 
precision and tighter tolerances were obtained by this process as illustrated in 
Fig. 8b (Yang et al. 2009). 


Process Capabilities 


The potential of hybrid machining processes in the manufacture of various mechanical 
components, electronic devices, and microscale parts has been evaluated in different 
studies. The table below enumerates few recent hybrid process capabilities achieved. 


HMP 

Feature 

Dimension 

Other 

Reference 

Point grinding + EDM 

3-D 

surface 

texture 

NA 

1 pm Ra 

Aspinwall 
et al. (2001) 

Ultrasonic abrasive 
micromachining with 
thermoplastic tooling 

3-D 

cavity 

NA 

0.15 pm Ra 

Curodeau 
et al. (2008) 

Nanosecond pulsed laser + 
micro-EDM 

Hole 

125 pm 


Kim 

et al. (2010) 

Nanosecond pulsed laser + 
micro-EDM 

Groove 

75 pm width x 
100 pm deep 


Kim 

et al. (2010) 

Turning + micro-EDM 

Shaft 

050 pm; AR = 100 


Lim 

et al. (2002) 

Plasma-assisted turning 

2-D 

surface 

area 


2.35 pm 

Wang 

et al. (2003) 


a 




e nl ranee (left) and <J>7yum exit (right) on 
305SS malarial: thickness 3 00 tun without vibration 



(f>76 jjm en I ranee {left} and ^73 p,m exit (right) on 
305SS material; thickness 300pm with vibration 



on precision 


Fig. 8 ECM drilling results with and without vibration using a semi-cylindrical tool (Yang et al. 2009) 
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Summary 

This chapter has summarized some important aspects of the literature on hybrid 
machining processes to understand the mechanisms of existing hybrid processes 
and their capabilities. Machining processes in general are categorized into mechan- 
ical, thermal, chemical, and electrochemical processes based on their dominant 
material removal mechanism. Accordingly machining occurs when the energy 
supplied is sufficient to cause plastic deformation, phase change, or dissolution. 
When there is scope for the dominant mechanism to get overwhelming or even 
subdued assistance from one or more of the other mechanisms, it is possible to 
develop hybrid system. This is highlighted in the following examples (Kozak and 
Rajurkar 2000): 

1. Thermal assistance: The temperature rise helps to accelerate the shear failure 
in mechanical cutting and kinetic reactions in chemical/electrochemical 
interactions. 

2. Mechanical assistance: The mechanical removal of thin layers of oxides and 
other compounds intensifies the anodic dissolution. Cavitation and agitation 
generated by ultrasonic vibrations enhance the performance of EDM, ECM, 
and LBM processes. 

3. Chemical/electrochemical assistance: Change of dislocation density and 
increase of dislocation velocity due to surface potential reduction caused by 
chemical reactions facilitate plastic deformation. Generation of gas bubbles in 
electrochemical reaction facilitates electrical discharges. Oxidation increases the 
radiation absorptivity of metals and decreases the radiation density thresholds 
for melting and vaporization during laser machining. 

Material removal rate (MRR) in a hybrid process H formed by the combination 
of two constituent processes A and B can be expressed by QH= QA+QB+QAB 
where QA is the MRR due to A, QB is MRR due to B, and QAB is their synergetic 
effect which can be determined from the experimental results. The significance of 
QAB will help to identify economically viable hybrid machining process. Theoret- 
ical models and energy equations are available in literature for most of the pro- 
cesses. Knowing the significance of QAB will be useful in the development of 
global model for the hybrid process. It is common to have two or three constituents 
as shown in Table 1. Hybrid process with as many as four constituents such as 
magnetic abrasive-assisted electrochemical machining has also been developed. 
However, the complex physicochemical, electrical, thermal, and mechanical inter- 
actions associated with hybrid machining processes are yet to be fully understood. 
The feasibility of processes marked “?” in Table 1 and the reasons for the incom- 
patibility/absence are not clear. Several issues as listed below remain unresolved. 
For example: 

1. Is there any technical or economic limit on number of constituents in hybrid 
machining process? 
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2. Does the number of constituents change the interacting mechanism/efficiency of 
individual constituents? If yes, then why? How? 

3. How to identify the compatibility (or incompatibility) of constituent processes to 
reap their synergetic effects? 

Potential for investigation lies further in mathematical modeling of different 
hybrid processes for evaluating effects of various input parameters on machining 
quality. This will further enhance industrial applications of hybrid technologies. 
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Abstract 

Machining is a controlled material removal process and finds its application in a 
variety of industrial sectors such as automobile, aerospace, and defense. Similar 
to many other manufacturing processes, machining bears significant environ- 
mental impacts in terms of energy /re source consumption, airborne emissions, 
wastewater discharge, and solid wastes along with occupational health risks. 
Most of these issues are due to the use of cutting fluids, which are traditionally 
formulated with petroleum-derived compounds with high ecotoxicity and low 
biodegradability. Exposure to these chemicals, along with growth of microor- 
ganisms and biocides used for microbial control, could lead to respiratory 
irritation, asthma, pneumonia, dermatitis, and even cancer. To address these 
concerns, extensive effort has been put forth to (1) extend the cutting fluid life 
span by removing particulates, free oils, and other contaminants via separation 
and filtration, (2) reformulate traditional petroleum-based fluids with vegetable 
oils and bio-based ingredients for lower toxicity and higher biodegradability, 
and (3) reduce or even eliminate the reliance on cutting fluids during machining 
through dry machining and minimum quantity lubrication (MQL) techniques. 
Apart from these technology developments, machining process parameters can 
be optimized for reduced environmental impacts, especially energy consumption 
and carbon footprint. Process optimization approaches require the development 
of models and equations to correlate process parameters with process inputs and 
outputs. Given the current status in the field, opportunities exist in designing new 
bio-based, microfiltration-compatible formulations using industrial by-products, 
optimizing minimum MQL system configuration, advancing cutting tool insert 
materials and lubricants for MQL, and developing high-energy efficiency 
machine tools. 


Introduction 

Since the 1972 Stockholm UN summit on the human environment, the global com- 
munity has embarked on a journey toward sustainable development objectives in order 
to solve global environmental problems such as ecological system degradation, 
environmental pollution, and resource depletion. Manufacturing activities (including 
a variety of machining operations) are critical to modern economies, while at the same 
time they are among the major sources that cause these environmental problems 
(Haapala et al. 2013). Due to an increased demand for consumer goods from a growing 
world population with an improving average of quality of life, manufacturing will 
remain a key component of modern economies. Worldwide manufacturing enterprises 
are facing mounting pressure from governments and the general public to reduce their 
environmental footprints. In the past decades, significant research and development 
efforts have been made to “green” manufacturing operations. 

Machining processes are among the most important manufacturing activities, 
widely used in the automotive, aerospace, and defense industries. Similar to 
other major manufacturing activities, machining processes carry significant 


29 Environmentally Friendly Machining 


1129 


environmental impacts and occupational health risks. New technologies and man- 
agement practices developed to address these issues are commonly referred to as 
environmentally friendly machining, green machining, environmentally conscious 
machining, and environmentally benign machining. In this chapter, the term envi- 
ronmentally friendly machining is used. The chapter is organized as follows: 
section “Environmental Impacts of Machining” discusses the environmental 
impacts of machining processes and methods commonly used to evaluate environ- 
mental performance of machining operations; sections “Minimizing Environmental 
Impacts of Cutting Fluids” and “Minimum Quantity Lubrication and Dry Machin- 
ing” present an overview of efforts on developing new processes to reduce the 
environmental impacts associated with the use of metalworking fluids; section 
“Optimizing Machining Processes for Reduced Environmental Impacts” covers 
efforts on process parameter selection and optimization to minimize energy con- 
sumption and environmental footprints; section “Future Development” comments 
on future development in the area of environmentally friendly machining; and 
section “Summary” summarizes the chapter. 


Environmental Impacts of Machining 

Figure 1 illustrates the major material and energy flows involved in a typical 
machining process. In addition to the workpiece, which is converted to an inter- 
mediate or finished part after machining, electricity and cutting tools are consumed. 
Cutting fluids may also be used in some operations to provide lubrication and 
cooling. To maintain proper operation of the machine tools, hydraulic oil and 
lubricants are needed. Since machining is a material removal process, chips and 
swarf are produced. Depending on the process, dusts, mists, and volatile organic 
compounds (VOCs) may also be generated. If cutting fluids are used, used fluids 
have to be treated and disposed. In the following sections, the environmental 
impacts due to these material and energy flows will be discussed. 


Energy and Resource Consumption 

Rising energy cost and global warming concerns have triggered growing interests 
on efficient use of energy in a variety of manufacturing sectors. A modern machin- 
ing center usually has a power rating in the range of several kilowatts to several tens 
of kilowatts. In addition to the main spindle drive, a machining center has several 
other energy-consuming components such as the feed drive, the coolant system, the 
computer/control system, and other auxiliary equipment. The power demand of a 
machining center could change significantly over time, depending on the workpiece 
being machined, operating parameters, and the status of the machine. Figure 2 
shows the power profile of a typical machining center over a 10-min time period 
during which one workpiece is loaded, machined, and unloaded. Three levels of 
power consumption are defined, i.e., basic, idle, and cutting. 
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Fig. 2 Power profile of a typical machining process 


Before a workpiece can be machined, it must be loaded and then properly 
positioned and secured to the fixture. At this time, the operator turns on some 
auxiliary equipment, such as way lube system, lighting, controller/computer, oil 
pump, fan, and servo motors. Power demand during this process is at a basic level 
and typically in the range of 0.5-2 kW depending on the size of the machine. The 
duration of the process depends on the size, weight, and complexity of the work- 
piece, as well as the type of fixture. In addition, machining centers may be left at 
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Table 1 Formulations of typical water-based cutting fluids (Zhao 2005) (compositions shown are 
for concentrates, which will be diluted to 1:10-1:20 before use) 


Cutting fluid type 

Function 

Ingredient 

% w/w 

Synthetic cutting fluid 

Carrier 

Water 

70 

Corrosion inhibitor 

Amine carboxylate 

10 

pH buffer 

Triethanolamine 

5 

Biocide/fungicide 

Pyridinethione 

2 

EP lubricant 

Phosphate ester 

4 

Boundary lubricant 

PEG ester 

5 

Boundary lubricant 

Sulfated castor oil 

6 

Semisynthetic cutting fluid 

Carrier 

Water 

55.5 

Oil 

Naphthenic oil 

15 

Surfactant 

Alkanolamine 

15 

Surfactant 

Sulfonate base 

5 

Biocide/fungicide 

Triazine/pyridinethione 

2 

Corrosion inhibitor 

Amine borate 

6 

Coupler 

Butyl carbitol 

1.5 

Soluble oil cutting fluid 

Oil 

Naphthenic hydrotreated oil 

68 

Emulsifier 

Sulfonate emulsifier oil base 

17 

EP lubricant 

Chlorinated olefin 

5 

Boundary lubricant 

Synthetic ester 

5 

Corrosion inhibitor 

Alkanolamine 

3 

Biocide/fungicide 

Triazine/pyridinethione 

2 


this state while waiting for the next batch of workpieces. After the workpiece is 
secured, the operator then turns on the main spindle along with the automatic tool 
changer and cutting fluid pump. The rapid traverse drives are also turned on in order 
to allow movement of the tool between features, to and from the workpiece. The 
power demand during this process is at an idle level and typically in the range of 
2-10 kW. Next the actual cutting of a workpiece starts, and the power demand is at 
the level of cutting. Tool change and repositioning may occur during subsequent 
cutting, during which the power level is changed back to idle. Finally after the 
desired geometry is achieved, the part is unloaded. In reality, the electricity 
consumed due to actual material removal may only represent a small portion of 
total consumption by a machining center (Gutowski et al. 2005). This opens up 
opportunities to schedule machining operations in order to reduce energy consumed 
during nonproductive phases. 

Cutting fluids are used in many machining processes to extend tool life, improve 
surface finish, and carry away chips from cutting zone. Traditionally cutting fluids 
are either straight oils or water based (i.e., water solutions or oil-in- water emul- 
sions) consisting of components such as mineral oil, surfactants, corrosion inhibi- 
tor, pH buffers, lubricant additives, chelating agents, and biocides. Table 1 shows 
typical formulations of three common types of water-based cutting fluids, i.e., 
synthetic, semisynthetic, and soluble oil (Zhao 2005). While in use, cutting fluids 
suffer from performance deterioration due to contaminants and have to be replaced 
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every 3-6 months in most cases. Manufacturers currently consume over two billion 
liters of cutting fluids each year worldwide, which represents a significant con- 
sumption of natural resource including petroleum and freshwater (Glenn and Van 
2004). For common cutting fluid systems, the daily water makeup in the range of 
2-10 % is needed to replenish loss due to evaporation and carryout (Iowa Waste 

Q 

Reduction Center 1996; Bierma and Waterstraat 2004). Overall per mm of mate- 
rial removed, up to 0.3 liter of water is consumed, depending on cutting fluid 
management practice (Environmental Technology Best Practice Programme 
1999). Besides cutting fluids, hydraulic fluids and lubricant oils are consumed by 
the machines, but the amount is much smaller. 

Another major material input to the machining process is cutting tools/inserts. 
Modern cutting tools/inserts usually have an extended tool life so the actual 
consumption per workpiece could be small. However, the manufacturing associated 
with these tools/inserts can be energy intensive and carries significant environmen- 
tal burdens (Xiong et al. 2008; Zhao et al. 2010). Common tool/insert materials 
include steel, carbides, cements, ceramics, and synthetic diamond. Materials used 
to manufacture cutting tools/inserts include tungsten, boron, tantalum, niobium, 
and titanium, while cobalt and nickel are commonly used as binders. Some of the 
metals have low natural abundance, and supply may be constrained in the near 
future. For example, it is estimated that there is less than 50 years left of tantalum 
resources (USGS 2013). But manufacturing of cutting inserts usually accounts for a 
very small percentage of the total demand for these metals. 


Wastewater and Water Pollution 

While in use, cutting fluids are circulated between storage, maintenance, and the 
machining operation, and some fluids and ingredients are lost due to evaporation, 
leaks, spills, mist, and residue on chips and workpieces. To maintain performance, 
usually fresh fluids or specific ingredients are added regularly. In addition to fluid or 
ingredient loss, contaminants will accumulate, which include oils and lubricants 
from leaking hydraulic systems of machine tools and particulates from the sur- 
rounding environment. Microorganisms including bacteria and fungi may also 
grow in cutting fluids as they provide a balanced blend of water and organic 
nutrients. Even with maintenance measures, cutting fluids only have quite a limited 
service life, and the waste cutting fluids have to be properly treated before disposal. 

The waste cutting fluids carry many substances that pose environmental risks. 
These include heavy metals from the cutting tool and the workpiece (e.g., cobalt, 
cadmium, chromium, and lead) and cutting fluid ingredients such as corrosion 
inhibitors, surfactants, biocides, chlorinated fatty acids, and chelating agents. In 
addition, oil, fat, and greases in the fluids lead to significant oxygen depletion and 
nutrient loading. The US EPA’s Metal Products and Machinery Rule requires oil 
and grease concentration in treated effluent to be below 17 mg/1, while used cutting 
fluids may easily have oil and grease levels above 2,000 mg/1. To make it even more 
complicated, waste cutting fluids may contain wastes and contaminants from other 
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processes and operations at large manufacturing facilities. These could include 
floor cleaners, phosphate wastes, stamping and drawing compounds, machine 
lubricants, and die casting lubricants, to name a few. It is critical that the treatment 
system of waste cutting fluids is robust and flexible. Usually a standard wastewater 
treatment solution will not meet the regulation requirements, and to date there is no 
perfect method of wastewater treatment for this complex and varying mixture of 
contaminants. The selection of the treatment method varies depending on the 
facility’s operation and location. In the European Union (EU), waste cutting fluids 
are classified as hazardous wastes and have to be treated accordingly (Burke and 
Gaines 2006). 


Airborne Emissions 


Airborne emissions from machining processes include mists, particulates, and 
volatile organic compounds (VOCs). When cutting fluids are used in machining, 
the fluid spray strikes the workpiece, cutting tool, or fixture surfaces and then breaks 
off as small droplets, which are carried away by air flows. In addition, some of the 
fluids will temporarily attach to the rotating surfaces and then be released as 
droplets due to centrifugal force. Droplets can also form from the vapor of cutting 
fluid ingredients as a significant amount of heat is generated at the cutting zone. 
Usually the size of droplets formed due to vapor condensation is smaller than those 
formed due to impact and centrifugation. The mist generated during machining 
usually does not have the same chemistry as the cutting fluids. It may contain some 
chemicals presented in cutting fluids along with contaminants, including heavy 
metals and microorganisms. During machining, mist concentration in the range of 

o 

1-10 mg/m is common (Jean et al. 2006). 

In addition to chemical composition, the size of the droplets also has significant 
effects on environmental and health impacts. When water-based cutting fluids are 
used, the size of most droplets generated is in the range of 0.1-30 pm (Jean 
et al. 2006). Large droplets (i.e., several micrometers in size) settle down quickly 
and are easily removed, while small droplets (submicron) can stay in the air for an 
extended time and be difficult to capture. When inhaled they can deeply penetrate 
the lungs in the alveoli, leading to allergenic and irrigative reactions, as well as 
carcinogenic effects. In addition to inhalation, cutting fluid mists can enter the 
human body through ingestion and skin absorption, which may also trigger allergic 
reactions and lead to dermatitis (Mathias 2006; John et al. 2006). 

When cutting fluids are not used during machining (i.e., dry machining), dust/ 
particulate emissions are common, and the amount of dust generated is dependent 
on the speed, feed, and depth of cut. It is generally observed that airborne emissions 
are much lower during dry machining than in the case of wet machining (Sutherland 
et al. 2007). However, particulates with diameters less than 2.5 pm have raised 
significant environmental and health concerns in recent years. 

During cutting fluids use, VOCs generated are typically minimal due to the low 
vapor pressure of oils and other organic compounds. However, some biocides, such 
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as triazines and oxazolidines, may release formaldehyde, which has been classified 
as a known/probable human carcinogen. Moreover, mists and VOCs from a facility 
using cutting fluids may contribute to photochemical smog formation, especially 
when meteorological conditions favor photochemical reactions. 


Solid Waste 

Since machining is a material removal process, solid wastes are inevitable 
by-products, usually in the form of chips and swarf. During grinding operations, 
the residue also contains particulates worn off from the grinding wheels. More 
restrictive landfilling policies and increasing commodity prices have led to signif- 
icant advancements in zero-waste machining. In many cases, the chips and swarf 
are collected and recycled for material recovery. Solid wastes from machining 
processes also include used cutting tools/inserts and grinding wheels. It is a 
common practice now that these are returned to the vendors and manufacturers 
for remanufacturing and repurposing. 


Environmental Performance Evaluation 

Environmental Management System (EMS) as specified in ISO 14001/14004 has 
been adopted by many manufacturers, especially large companies to improve their 
environmental performance. Companies with an EMS in place usually have 
environment-related data and information collected, including those related to 
machining operations. However, since a machining process could impact environ- 
ment and occupational health in many different ways (e.g., energy/resource con- 
sumption, water pollution, air emission, and health risks), a quantitative assessment 
method is needed to evaluate the overall environmental impacts and potential trade- 
offs of a specific process. 

Life cycle assessment (LCA) has emerged as the most common method for 
environmental impact evaluation of manufactured goods and manufacturing pro- 
cesses. As defined in ISO 14040, LCA addresses the environmental aspects and 
potential environmental impacts, e.g., resource use and environmental consequences 
of releases, over a product’s life cycle from raw material acquisition through produc- 
tion (cradle-to-gate), use, end-of-life recovery, and disposal (cradle -to-grave). LCA 
avoids shifting environmental consequences from one life cycle stage to another, from 
one geographic area to another, and from one environmental medium to another. For 
example, during machining in addition to electricity consumed all materials inputs 
also contribute to GHG emissions and global warming, since energy and raw materials 
are needed to produce them. Ideally, any decision to improve the environmental 
performance of a machining process should be supported by LCA. 

A formal LCA consists of four components as specified in ISO 14040 series 
standards: goal definition and scoping, inventory analysis, impact assessment, and 
interpretation. It is during inventory analysis that inputs (e.g., energy, water, and 
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materials) and outputs (e.g., air emissions, solid waste, and wastewater) are identified 
and quantified. Collection of such data is time and resource intensive, which has led to 
the development of life cycle inventory databases for common materials and pro- 
cesses. The most comprehensive inventory database available to date is Ecoinvent by 
Swiss Centre for Life Cycle Inventories, which is mainly based on EU scenarios and 
data. Datasets in these databases are treated as black boxes, with no correlation 
between inputs, outputs, and process conditions. As a result, manufacturing processes 
are usually the weakest aspect of life cycle databases and exploration of “what-if ’ 
scenarios, e.g., process changes or technology updates, remains difficult. 

Recently, an approach that aims at developing the next-generation life cycle 
inventory databases for environmental impact assessment of manufacturing pro- 
cesses has been proposed by researchers in the EU and the USA (Sutherland 
et al. 2007; Kellens et al. 2012a, b). The new databases are expected to have 
some desired characteristics, such as transparency, engineering quality, and the 
ability to reflect changes when new information is secured. With the built-in models 
and correlations, a user can generate a life cycle inventory with minimum efforts if 
basic process information is known. For example, when developing an inventory 
for a drilling operation, one can simply collect and supply data such as workpiece 
material properties, feed rate, cutting speed, drill diameter, drilling time, coolant 
properties, and setup time to the model, which will calculate all the material and 
energy inputs and outputs. Initial development has largely been focused on machin- 
ing processes, e.g., turning, milling, grinding, and drilling. 


Minimizing Environmental Impacts of Cutting Fluids 


From discussion above, it can be seen that most environmental concerns of machin- 
ing processes are due to the use of cutting fluids. As a result, most of the research 
and development effort on environmentally friendly machining have been focusing 
on eliminating or minimizing the use of conventional cutting fluids or replacing 
them with environmentally friendly alternatives. In other words, the approaches 
toward the environmentally friendly machining can be largely divided into the 
following two categories: 


• Fluid-based approach: This approach includes the techniques in which the 
environmental impacts are reduced by improving the life span of the cutting 
fluid or replacing it with a biodegradable fluid. This approach does not require 
any change in cutting tool and/or the basic machine tool. Some peripheral 
rearrangements are required for recycling and reusing the cutting fluids. This 
fluid-based approach is the focus of this section. 

• System-based approach: This approach aims to totally eliminate or minimize the 
use of the cutting fluids. The fundamental functions of the cutting fluids, i.e., 
cooling, lubricating, and flushing of chips, are achieved by changing the cutting 
tool and/or the machine tools. This will be covered in section “Minimum 
Quantity Lubrication and Dry Machining.” 
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Cutting Fluid Recycling and Reuse 

As discussed earlier, a variety of contaminants will enter cutting fluids during use. 
These contaminants will deteriorate machining performance and eventually lead to 
the disposal of cutting fluids. Removal of these contaminants brings cutting fluids 
back to as-new conditions thus extend their usable life. This will not only reduce 
environmental impacts associated with cutting fluid usage but also minimize health 
risks to workers exposed to cutting fluids. Among all the contaminants present in 
cutting fluids, current cutting fluid recycling and reuse practice mainly deals with 
particulates and free oils. This is usually done through some means of separation or 
filtration (Brandt 2006). The design and selection of the recycling and reuse system 
depend on many factors, e.g., material being machined, type of machining process, 
chip/swarf shape, amount of material removed, production rates, machine horse- 
power, fluid type, fluid application rate, and space available for installing such 
system. 

Particles in cutting fluids could affect surface finish and tool life. The acceptable 
level of particulates (concentration and shape/size) depends on machining pro- 
cesses and workpiece material. When machining metals, usually particulates with 
concentration below 10-30 ppm and mean diameter 15-30 pm are deemed accept- 
able. Large particulates and chips can be easily removed using an H-chain, chain 
and flight, push bar (harpoon), and flume system (Brandt 2006). Removing fine 
particulates is a bit more difficult and usually separation or filtration is needed. 
Setting tanks, foam separators, centrifugal separators, and magnetic separators are 
all examples of separation systems. 

When using filtration systems for fine particulate removal, one can choose either 
disposable media or permanent media. Disposable media are more common due to 
minimum maintenance, and these include bags, cartridges, rolled goods, and 
chopped paper. These media usually have pore size in the range of 30-100 pm. 
However, particulates of smaller diameter can also be removed by the system since 
as filtration process proceeds a cake layer builds up on the media. This cake layer is 
able to capture smaller particulates, but at the cost of reduced permeation flow rates. 
The media will need to be replaced when the filtration speed becomes low. When 
permanent media such as wire mesh, woven fabric belts, or wedge wire screens are 
used, a similar process is used. When the filtration rate drops to a certain level, 
indexing is performed to remove the cake from the media by applying an air or 
cutting fluid blow or scrubbing by a blade. This allows the supporting media to be 
reused for the next cycle. Generally, permanent media have large pore sizes. Before 
a cake layer is developed, the filtered fluid may have a relatively lower cleanliness. 
Therefore, in order to maintain the cleanliness of recycled fluids, it is desired that 
the cake layer be established as early and maintained for as long a period as 
possible. 

Some of the free oils entering cutting fluids could be loosely emulsified by 
surfactants that are already presented in the fluids. Different from particulates, a 
small amount of free oils can be beneficial as they improve lubricity. However, for 
optimal fluid performance and life in most cases, the level of free oil and loosely 
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emulsified oil needs to be controlled to a level of 0.5 % or less (Brandt 2006). 
Commonly found oil removal devices are skimmers (disk, rope, or belt), coalescers, 
and centrifuges. Filters and oil sorbent materials are also commonly used. The free 
oil removed is then collected for disposal or reuse. 

It should be noted that the separation and filtration systems discussed above 
cannot remove microorganisms (which have a size between 0.2 and 2 pm) from 
the fluids. Recently membrane microfiltration has been explored as a means for 
contaminant removal (Rajagopalan et al. 2004; Skerlos and Zhao 2003). The 
advantage of using microfiltration is that it can not only remove microorganisms 
but can also remove particles and free oils. Membrane microfiltration is typically 
performed with filtration tangential to the channels of bulk fluid flow, usually 
referred to as crossflow mode. This operation mode minimizes the plug of 
membrane pores, and cake buildup on the surface is controlled by the shear 
stress from bulk flow. Both help to maintain the filtration rate at high levels. The 
membranes are usually made of ceramics and polymeric materials and can be 
cleaned and reused for long periods of time. Lab-scale and pilot-scale experi- 
ments suggest that membrane filtration processes are able to restore cutting 
fluids to a “good as new” condition. It eliminates the use of troublesome bio- 
cides for biological control thus adds both environmental and occupational 
health benefits. 


Bio-Based Cutting Fluids 

Traditionally cutting fluids are formulated with oil and compounds derived mostly 
from petroleum. Many of these ingredients are associated with high human/eco- 
logical toxicity and low biodegradability, which present occupational health risks to 
workers and complicate end-of-life disposal. Improper disposal of waste cutting 
fluids could result in surface water and groundwater contamination, air pollution, 
and soil pollution. To address these issues, in the past two decades, efforts have 
been made to formulate cutting fluids with ingredients derived from biological 
resources (Shashidhara and Jayaram 2010; Lawal et al. 2012). 

Vegetable oils that have been used to formulate cutting fluids include canola oil, 
soybean oil, palm oil, sunflower oil, and jatropha oil. The principal technical 
limitation of vegetable oil lubricants, their low oxidative stability, has been 
addressed by genetic alternation, chemical modification, and use of various addi- 
tives. Since vegetable oils have different molecular structure compared with 
petroleum oil (i.e., triglycerides vs. alkanes), formulating vegetable oil-based 
cutting fluids likely requires different surfactant packages. It has been observed 
that in order to minimize surfactant consumption, a combination of two surfac- 
tants, one nonionic and one water-soluble cosurfactant (either nonionic or 
anionic), is preferred over a single surfactant. In addition, to achieve good stability 
and longevity, the nonionic surfactant should have (1) a carbon-tail length similar 
to that of the fatty acids in the oil and (2) a head group that is not excessively small 
or large (Zhao et al. 2006). The performance of bio-based fluids has been tested for 
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many machining processes, including turning, drilling, tapping, and grinding. In 
addition to vegetable oils, esters derived from them as well as animal fat and 
used cooking oils have also been tested. In addition to low toxicity and high 
biodegradability, bio-based cutting fluids have demonstrated comparable or 
even improved lubrication performance when compared with petroleum-based 
formulations in a variety of machining processes (Lawal et al. 2012; Abdalla and 
Patel 2006). 


Minimum Quantity Lubrication and Dry Machining 
Dry Machining 

The machining operation without the use of a cutting fluid is designated as “dry 
machining.” This means that the cost and environmental issues associated with 
cutting fluid procurement, management, and disposal are eliminated. Also, the 
machine tool without a subsystem for coolant results in significant reduction in 
capital investment. However, it should be noted that dry machining technology is 
far from simply performing machining without cutting fluids. All the primary 
functions of the cutting fluids in conventional machining like heat removal, chip 
removal, and lubrication need to be accomplished by alternative means in dry 
machining. 

In the absence of cutting fluids, heat is removed through alteration in the cutting 
tool design, the cutting tool material, and the machine tool. Variants in cutting tool 
designs like internal cooling by passing the vaporized liquid or cryogenically 
cooled coolant through a conduit inside the tool, undercooling by channeling the 
coolant under the insert, and thermoelectric cooling have been investigated. How- 
ever, most of the development in the dry cutting technology is achieved through 
improving the properties of the tool material followed by alteration in machine tool 
design and operation as given in the following sections. 

Cutting Tool Material 

The selection of an appropriate cutting tool material is the fundamental requirement 
for dry machining (Weinert et al. 2004). The tool material should be more heat 
resistive and less heat generative. The form stability of the cutting tool, which 
influences the surface finish of the component and tool wear, is affected by the heat 
generated during machining. The conventional cutting materials such as high-speed 
steel, cemented carbides, cobalt alloys do not support the optimum performance in 
dry cutting. Even the hardest cutting material available, i.e., polycrystalline dia- 
mond (PCD), can only support temperature up to 600 °C, after which graphitization 
occurs (Byrne et al. 2003). In place of bare tools, the coated tools are more 
appropriate for dry cutting. The coating provides beneficial effects like improved 
thermal insulation and self-lubrication. Many commercially available coating 
materials, as shown in Table 2, provide the tool the hardness and the thermal 
stability essential for dry machining. 
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Table 2 Coating materials for tools used in dry cutting 


Coating material 

Microhardness HV 

Max. service temp. (°C) 

AlCrN 

3,000-3,300 

1,100 

AlTiN 

3,000-3,800 

900-1,000 

Diamond-like carbon 

2,500 

350 

Nanocrystalline diamond 

10,000 

600 

TiAIN 

3,200-3,300 

800-1,000 

TiAIN + WC/C 

3,000 

800 

TiAlCN 

3,200 

650 

TiAIN + AlCrN 

3,300 

1,100 

TiCN 

3,000 

400-450 

TiN 

2,300, 2,400 

595-600 

TiN + TiAIN 

3,000-3,300 

800 



20 

jim 

16 

14 

12 

10 
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Cutti n g d e pt h a n : 


160 mm 
2750 m/min 

0.64 mm /tooth 
3 mm 
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TiN +■ MoS 2 
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Fig. 3 Flank wear in dry milling with different coatings; material AlSilO Mg-wa, CL cooling 
lubricant (Klocke and Eisenblatter 1997) 


The monolayer coatings are often subject to early delamination or crack. There- 
fore, for better tool life, multilayer coating is preferred, which combines beneficial 
properties of different coating materials, e.g., heat resistance of TiAIN, toughness 
of TiCN (titanium carbonitride), and good adhesion characteristics of TiN. The 
coating can also partially fulfill the lubricating function of cutting fluids. The 
specially formulated coating of a soft material (e.g., MoS 2 ) on a hard material 
(e.g., TiAIN) acts as a self-lubricating coating in dry machining. Figure 3 shows the 
flank wear in face milling with different types of coatings in dry and wet conditions. 
It can be seen that the self-lubricating coating of TiN + MoS 2 offers better wear 
resistance than all the others except dry cutting with diamond (Klocke and 
Eisenblatter 1997). The soft coating in the self-lubricating coating helps in reducing 
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the friction between the cutting tool and the workpiece, which in turn reduces 
cutting forces and heat generation, in turn leading to less tool wear. 

Along with the coating material, the substrate that is responsible for the toughness 
and strength of the tool also plays an important role in dry cutting. One of the recent 
developments indicates that the use of cutting tools made of layered composite 
ceramics with nanoscale multilayered coatings in dry cutting can offer more tool life 
than using cutting fluids (Grigoriev et al. 2012). The carbide layer in the composite 
provides the toughness and strength of tool material, and the ceramic layer provides 
resistance of a tool material to oxidation, corrosion, and high-temperature weaken- 
ing at elevated temperatures. 

Machine Tool Design and Operation 

Cutting fluids in the conventional machining operations flush the chips away from the 
cutting zone. The chip removal in dry machining is accomplished by special pro- 
visions in the design of the machine tool. For processes like milling, turning, 
broaching, sawing, etc., where the cutting edge remains visible, the chip removal is 
simpler than processes like drilling, reaming, tapping, etc. where the cutting edge is 
not visible and chips remain in contact with the workpiece for longer duration. In case 
of the former, the necessary design features include the smallest possible work area 
and shortest possible distance between chip buildup and chip disposal. The chips can 
be removed using conveyor (e.g., in case of cast iron) or suction pump (e.g., in case of 
aluminum alloys) (Grsesik 2008). Processes like drilling and reaming require special 
arrangements for chip removal as listed below (Landgraf 2001): 

1. Compressed air for blowing off the chip after unloading. 

2. Trunnion or tilt table for tilting the part form vertical (during loading) to 
horizontal (during machining); the horizontal position helps evacuate chips. 

3. Vertical inversion fixture for holding the workpiece from above to prevent the 
chips from falling onto the workholder. 

4. Liquid (water with corrosion inhibitors) flushing at the end of a cycle. 

5. Vacuum shroud surrounding the tool that suctions out chips from the 
machining area. 

The dry machining should be carried at the maximum prescribed speed to avoid 
premature tool wear-out. It has been demonstrated that dry machining of difficult- 
to-machine materials, like Inconel 718, under the optimized cutting speed leads to 
potentially acceptable surface quality without severe microstructure alteration 
(Devillez et al. 2011). The residual stresses are of the same order than those 
obtained in wet conditions. Contrary to the conventional belief that lower speeds 
help in maintaining the tool life, dry machining at high speed controls heat 
dissipation to the tool and improves tool life because the fast moving chips remove 
the heat from the cutting zone. With the development of high-speed dry machining 
centers, the materials and processes (e.g., milling of austenitic stainless steel) which 
were considered in the recent past challenging can outperform wet machining in 
both productivity and tool life. 
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Cryogenic Machining and Gas-Cooled Machining 

Cryogenic machining refers to the cooling of tool- workpiece interface using liquid 
nitrogen. The cryogenic cooling is an environmentally friendly option as the liquid 
nitrogen evaporates after cutting without leaving any detrimental emission. Chips 
generated from this technique do not have any residual oil on them; thus, they can 
be easily recycled. Moreover, nitrogen is an inert gas and does not chemically react 
with the tool or the workpiece. The cryogenic cooling has been recommended as 
one of the most favorable method for material cutting operations due to consider- 
able improvement in tool life and surface finish (Yildiz and Nalbant 2008). The 
cryogenic cooling can be applied in different ways as shown in Fig. 4. 

The tool-chip interface temperature can be considerably reduced by different 
cryogenic cooling techniques compared to the dry cutting, as seen in Fig. 5. In almost 
all cases, the temperature remains far below the level when cutting fluids are used 
(Shane et al. 2001). The low temperature in cryogenic cooling makes the workpiece 
material harder. In turn, the cutting forces with cryogenic cutting are, in general, 
higher compared to that observed with dry cutting. At higher cutting speeds, the 
cooling effect is not significant because the liquid nitrogen cannot fully penetrate the 
tool tip. This results in elevated tool-chip interface temperatures (Fig. 5) and in turn 
decreased cutting forces and friction coefficients as seen in Fig. 6. 

The impact of cryogenic cooling on the tool and workpiece is quite different from 
the other cooling methods. The conventional cutting fluids primarily flush out the 
chips after breaking, while cooling at cryogenic level promotes brittle fracture of the 
chip. The lower temperature and inert environment provide form and chemical 
stability respectively to the tool. The cooling of the workpiece and the tool can also 
be done using other mediums like air, water, steam and vapor, and refrigerated C0 2 . 

In addition to liquid nitrogen, refrigerated compressed air can also be used as a 
cooling medium in machining and grinding operations. A considerable improvement 
in the tool life with reasonable surface finish can be obtained using the refrigerated 
compressed air in machining of difficult-to-machine materials. A beneficial change in 
chip morphology, i.e., curly to straight, is possible with the use of the refrigerated 
compressed air (Su et al. 2007). The cooling temperature may have a binding effect on 
the machining performance. In case of Ti-6A1-4V, it is observed that the cutting force 
shows considerable reduction up to —15 °C, and surface roughness better than dry, 
wet, and mist cooling can be obtained (Yuan et al. 2011). In grinding operations, the 
compressed cool air does not affect the surface hardness. However, the surface and 
in-depth residual stress decrease if compared with the case of using coolant. Cold 
water jets can also be used as a cutting fluid. It has been observed that heat transfer 
efficiency and in turn the tool temperature reduces significantly (An et al. 2011). 


Minimum Quantity Lubrication 

Minimum quantity lubrication (MQL), also known as near-dry machining (NDM), 
is the technique in which a minimum amount of cutting oil (typically with a flow 
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Fig. 4 Different methods of cryogenic cooling (Yildiz and Nalbant 2008; Hong and Ding 2001; 
Hong et al. 1999) 


rate of 50-500 ml/h), mixed with gaseous substances like air or C0 2 , is injected to 
the cutting zone. Some of the researchers have also attempted water in place of air. 
The oil and water mixture provides good cooling and lubrication as well (Itoigawa 
et al. 2006). The water droplet that is covered with the oil provides the cooling 
effect. When the water droplet hits the surface of the tool or the workpiece, the oil 
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Fig. 5 Tool-chip interface 
temperature in cryogenic 
cooling and dry cutting (Hong 
and Ding 2001) 
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Fig. 6 Cutting forces and friction coefficient in cryogenic machining (material Ti-6A1-4V, 
adapted from Shane et al. (2001)) 


spreads on the surface providing the lubricating effect. To date, a considerable 
amount of research has been focused on MQL drilling, turning, milling, and 
grinding, e.g., (Astakhov 2008; Lawal et al. 2013; Sanchez et al. 2010). The 
MQL machining with gaseous medium, as described below, is more common in 
industrial practices. 

Air-Based MQL 

The cooling and lubricating capabilities of aerosol in MQL cutting are considerably 
low compared with the traditional cutting fluids. However, cutting forces actually 
decrease during MQL operations. In traditional cutting operations, the compressive 
force applied by the tool closes the micro-cracks before the actual cutting operation 
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Table 3 Characteristics and main applications of MQL lubricants (Weinert et al. 2004) 


Synthetic esters 

Fatty alcohols 

Chemically modified vegetable oils 

Long-chained alcohols made from natural raw 
materials from mineral oils 


Good biodegradability 


Low level of hazard to water 


Toxicologically harmless 


High flash and boiling point with low 
viscosity 

Low flash and boiling point, comparatively high 
viscosity 

Very good lubrication properties 

Poor lubrication properties 

Good corrosion resistance 

Better heat removal due to evaporation heat 

Inferior cooling properties 

Few residuals 

Vaporizes with residuals 

Primarily heat removal 

Primarily reduction in friction 

High surface qualities are demanded 

Adhesive workpiece material (buildup 
edge, apparent chips) 

Low-cutting speeds and high specific 
area load 

Lubrication of supporting and/or guiding 
rails 


initiates. The oil drops in the aerosol prevalent during the MQL operation occupy 
the micro-cracks in the workpiece and prevents crack healing. This helps in cutting 
the material with lesser forces. Fatty alcohols and synthetic esters (chemically 
modified vegetable oil) are most commonly used as lubricants in MQL applications 
(Itoigawa et al. 2006). The characteristic and main applications of MQL media are 
given in Table 3. 

There are four commonly used methods of injecting aerosol as shown in Fig. 7. 
The aerosol can be supplied externally or through the tool itself. The aerosol 
preparation can take place within the atomizer or the ejector nozzle. In the case 
of the atomizer, compressed air is supplied to an oil tank where the oil atomizes and 
the resulting aerosol is supplied to the cutting zone. With the ejector nozzle system, 
the air and oil are supplied through two coaxial nozzles. The mixing takes place in 
the chamber or at the tip of the nozzle. 

Carbon Dioxide-Based MQL 

When oil is used as lubricant in air-based MQL, only very limited cooling can 
be achieved. This presents a significant challenge when machining materials 
with low thermal conductivity, e.g., titanium alloy. A new type of MQL 
technology has been developed to address this issue, which is based on rapidly 
expanding sprays of supercritical carbon dioxide (Clarens et al. 2006; Supekar 
et al. 2012). Carbon dioxide-based MQL also improves lubrication. Carbon 
dioxide at a supercritical state (T c = 31.1 °C and P c = 72.8 atm) can dissolve 
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Fig. 7 Different MQL techniques: (a) external aerosol supply, (b) through tool aerosol supply 
with atomizer, and (c) through tool aerosol supply with ejector nozzle (Astakhov 2008) 


many lubricating oils. When the supercritical carbon dioxide with dissolved oil 
expands out of a nozzle, a uniform oil coating is formed on the surface of the 
workpiece and tool. Due to the high pressure, deeper penetration into the cutting 
zone is also achieved. In a carbon dioxide-based MQL system, the process starts 
by compressing carbon dioxide (usually stored in a tank) to the desired pressure, 
which could be up to 350 atm if enhanced cooling and lubrication are needed. 
The carbon dioxide travels through a high-pressure vessel containing lubricating 
oils. The vessel has an outlet which is controlled by a valve, allowing super- 
critical fluid phase carbon dioxide and oil to be delivered through a nozzle 
aimed at the cutting zone. Performance tests have been done for tapping and 
turning of titanium alloy and compacted graphite iron. The results suggest that 
carbon dioxide-based MQL can reduce energy and lubricant consumption while 
extending tool life (Supekar et al. 2012). 
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Optimizing Machining Processes for Reduced Environmental 
Impacts 

Traditionally, machining processes are usually optimized in order to minimize 
production cost. Given the increasing concern on environmental issues, especially 
energy consumption and the associated carbon footprints, it is possible to minimize 
environmental impacts of machining process by selecting machining parameters. 
For a typical machining process, the environmental impacts are due to energy and 
resource consumption, wastewater, airborne emissions, and solid wastes. In order to 
curb these impacts, models have to be developed to relate the input conditions to the 
specific environmental impact categories. The focus of this section is on minimiz- 
ing energy consumption of machining process. Here, the energy consumption is 
defined to include both direct and indirect consumption (i.e., the embedded energy 
with material consumption). For indirect energy consumption, the scope is limited 
to cutting tool inserts only as there is no model available which could reasonably 
well quantify the consumption of cutting fluids. 


Modeling of Material and Energy Consumption 
Cutting Tool Inserts 

The consumption of tool inserts can be calculated as 

trn / -I \ 

m t = — m too i (1) 

where t m is the machining time and T is the tool life, which can be estimated using 
extended Taylor’s tool life equation: 


V c T n f m d k = C (2) 

where V c is the cutting speed, /is the feed, d is the depth of cut, and n , m, k and C are 
the empirical constants determined by experiments. In case of the MQL machining, 
the effect of the lubricant mist can be incorporated in the tool life equation by 
(Marksberry and Jawahir 2008) 



where 7/ is the reference tool life, V R is the reference cutting speed, n c is the coating 
effect factor, m represents the type of machining operation with m = 1 for turning, 
and nj , n 2 , and k are the empirical constants. The n mist is defined as follows: 
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Table 4 Representative 
values of ^ mist (Marksberry 
and Jawahir 2008) 


MWF type 

Trade name 

^mist 

Straight oil 

Coolube 2210 

0.078 

Soluble, water miscible 

Cimperial 1011 

0.060 

Semisynthetic, water miscible 

TRIM SC235 

0.055 

Synthetic, water miscible 

Hocut 763 

0.065 


Nozzle position: rake face, MWF volumetric flow rate =180 ml/h, 
tool coating effect factor n c \ —0.56, k = 0.32, m = 1 for turning, 
ni = 0.23, n 2 = 0.642, depth of cut = 3.2 mm, cutting 
speed = 4 m/s, feed = 0.3 mm/rev, material removal rate: 
11,520 mm 3 /s 


n 


logV'd - log Vc 2 


mist 


log (Gf, W, V,Mzx,M zy 


lo gT 



where G Fy Wt Ny M zx, mzy is the new modified tool life value in MQL, empirically 
derived while varying lubricant type, volumetric flow rate, and nozzle(s) position. 
Representative values of n mist under the identical experimental condition for dif- 
ferent type of oils are given in Table 4: 

Process Energy 

As discussed earlier, energy consumed during machining operations can be broken 
into three parts, i.e., basic energy, idle energy, and cutting energy. The total energy 
consumption is given by the following relation: 

E — P it i T Pbtb T Pmtm (5) 

where P b , P h and P m are power level at basic, idle, and cutting state, respectively, 
and t b , t h and t m are time duration of each machine state. 

In practical conditions the cutting power depends upon the cutting tool, the 
machine, the workpiece material, and the geometry. The power required for remov- 
ing a unit volume per unit time is defined as the specific power consumption (Ps). 
The power demand during machining (P m ) and the specific power consumption are 
related as follows: 


P m = P s MRR (6) 

where MRR is the material removal rate, i.e., volume of material removed per unit 
time. The material removal rate in common cutting operations is given as follows: 


MRR 


turning 


n{D 2 -Di 2 ) ndD ar 

L f r N = ~^f r N 


4 


2 


MRR drilling 


n D d 2 


frN 



4 


( 8 ) 
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MRR slab milling = dwf r ZN (9) 

MRRfaQQ milling — dwVf (10) 

where D and Z), are the outside and inside diameter of the workpiece, respectively, 
D a is the average diameter of the workpiece, N is the spindle rpm ,/ r is the feed per 
revolution, D d is the drill diameter, and Z is the number of teeth, w is the width of 
cut, and v t is the table velocity. 

The idle time incorporates the handling time (t/ r ) and the machining time and is 
given by the following relation 


U — th tm ( 11 ) 

The handling time refers to the time spent in the tool movement without 
performing the cutting operation. During this duration, the tool movement, spindle, 
coolant pump, tool changer, etc. consume energy. 

The basic time incorporates the idle time (£,-) and the loading/unloading (f //M ) time 
and is given by the following relation: 


tb — U + ti/ u — th + t m + tj/ u 



The loading/unloading time is the time spent in preparation for cutting operation 
without any tool movement. 


Environmental Objective Functions and Constraints 

The machining process generates various environmental impacts, which can be 
minimized individually or combined. In the latter case, the optimization problem 
becomes a multiple objective one. In addition, it may be desired to consider trade- 
offs between economic performance and environmental performance. Here, two 
closely correlated environmental objectives are considered, i.e., energy footprint 
and carbon footprint. 


Energy Footprint 

A generalized relation for the energy footprint (E) can be given by the following 
relation (Rajemi et al. 2010): 



P it i T P b^b T P ndm 3T 




where y E is the embedded energy of tool insert. In reality, the energy consumed at 
idle state can be ignored since modern machining centers usually have the 
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capability for rapid transverse. If P 0 denotes the power demand due to auxiliary 
modules, t s is the machine setup time, and t c is the tool change time, Eq. 13 becomes 

E f = P 0 t s + (P 0 + P s MRR ) t m + P 0 t c (y) + y E (y) (14) 

Carbon Footprint 

The carbon footprint (C) of machining operation is estimated on the basis of carbon 
intensity : 


C = e, (Pots + (Po + P s MRR ) t m + P 0 t c Q ) + e, (y) (15) 

where s e is the carbon intensity of electricity and s t is the carbon footprint of one 
tool insert. 

Constraints 

During optimization, the process parameters are allowed to vary such that the 
optimal value of the environmental objective function can be obtained. However, 
the process parameters are subject to constraints from production economics and 
machine tool specifications. Some of the typical constraints are listed below: 

• Makespan: The makespan, i.e., the timely delivery, is an important constraint for 
optimization of the environmental objective function. A limit on the maximum 
makespan is essential to keep the process economically feasible. 

• Range of applicable process parameter: The range of the allowable process 
parameters (cutting speed, feed, and depth of cut) depends on the machine tool 
specifications and materials of tool and the workpiece. 

• Surface roughness: The surface roughness of the produced part depends upon the 
process parameters, cutting fluids, and materials of tool and the workpiece. For a given 
combination, the surface roughness poses a constraint on the process parameters. 

• Machine power: For a given material and operation specific power remains 
constant. Thus, the machine tool due to its limited power cannot remove the 
material at a rate more than a fixed value. This limitation restricts the use of the 
full range of the process parameters for a given workpiece geometry. 


Future Development 

Most of the environmental and occupational health issues associated with machin- 
ing are due to the use of cutting fluids. In the past decades, extensive efforts have 
been made to address them. This chapter provides an overview of these efforts. 
Roughly speaking, these efforts could be put into two categories, i.e., fluid-based 
approaches and system-based approaches. For fluid-based approaches, efforts have 
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been made to extend usable life of cutting fluids and design cutting fluids formu- 
lations using bio-based ingredients. For system-based approaches, efforts have been 
focused on reducing or eliminating the use of cutting fluids. However, it should be 
noted that many of these approaches will increase production cost and are not 
competitive in many applications. As a result, traditional machining technologies 
are expected to remain to be widely used in decades to come, while continuing 
efforts will be made to advance environmentally friendly technologies to make 
them more competitive. 

Almost all research on developing bio-based cutting fluids has been focusing on 
replacing petroleum oil with vegetable oil and synthetic esters, which may compete 
with other applications such as food supply and renewable fuels. However, it has 
been found that bio-based oils and esters are not absolutely necessary to achieve 
this and it is possible to use by-products in excess supply or even wastes for this 
purpose. Recently, researchers in Germany developed a new type of bio-based 
cutting fluids, which are based on water and glycerol (Winter et al. 2012). Rapid 
expansion of biodiesel production has led to an excess supply of glycerol. The use 
of glycerol as one main ingredient in cutting fluids offers a promising solution to 
this problem while at the same time eliminating the need of petroleum oil. Grinding 
inner cylindrical rings made of carbon alloy steel was conducted using cubic boron 
nitride and an aluminum oxide wheels. The tests suggested that glycerol fluids can 
deliver comparable and partially better performance with regard to the measured 
machining forces, the tool wear, and the workpiece surface roughness. In addition, 
no adverse reaction or decomposition products were found on finished workpiece, 
and no VOCs were detected when glycerol-based fluids were used. Moreover, it is 
reported that the glycerol-based cutting fluids are highly biostatic, thus eliminating 
the need of biocides for microorganism control (Winter et al. 2012). 

Using membrane microfiltration for contaminant control has been demonstrated 
at pilot scale for both synthetic and semisynthetic cutting fluids. The primary 
limitation to wider application of membrane filtration technology is its sensitivity 
to cutting fluid formulation design. Many of the current cutting fluids have low 
compatibility with microfiltration; thus, significant membrane fouling and ingredi- 
ent loss occur during filtration. Future development of bio-based formulation 
should take micro filtration compatibility into consideration by paying particular 
attention to the selection of oils and surfactants as well as their concentrations. 

Dry machining and MQL machining significantly reduce environmental foot- 
prints of machining process as the use of cutting fluids is greatly reduced or 
eliminated. The primary obstacle for widespread adoption of these technologies is 
cost and the lack of technical expertise. Most of MQL systems currently in use are 
retrofitted to existing machine tools, thus optimal machining performance is gen- 
erally not achieved. It is desirable to configure MQL systems such that lubricants 
are delivered more effectively to the primary cutting edge; thus, the amount of 
lubricants needed can be minimized while maintaining sufficient lubrication and 
cooling. Although many MQL configurations are developed, it is not clear to what 
degree these configurations are close to optimal settings. In addition, commonly 
used MQL lubricants cannot deliver high lubrication and cooling capacity at the 
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same time, which calls for the development of new lubricants. Research is also 
required in order to develop tool insert materials that can tolerate high temperature 
and/or provide self-lubricating. 

It should be noted that most machine tools manufactured have cost, productivity, 
and precision as main targets. Adopting environmentally friendly machining 
requires the development of machine tools with high-energy efficiency and low 
environmental impacts. Currently, Workgroup 12 of the ISO/TC 39 is working on 
the development of standards for environmental evaluation of machine tools. Very 
likely an energy efficiency standard will soon be available, and manufacturers 
producing high-efficiency machine tools will gain some market advantages. 

Machining processes interact with the environment through material and energy 
flows on both input and output sides. New machining technologies likely have their 
own environmental burdens, and life cycle assessment has to be carried out to 
determine their true environmental benefits (Clarens et al. 2008). Ideally, life cycle 
assessment should be integrated into process development to provide timely feed- 
backs. Therefore, a critical need exists to develop a comprehensive life cycle 
inventory database with broad coverage on machining processes. 

As world population and the average quality of life both increase, environmental 
sustainability has become a big challenge faced by our society. Manufacturing has a 
significant impact on the environment, especially in developing countries. There- 
fore, greening manufacturing activities, including machining processes, plays an 
important role in achieving important sustainable development goals. During the 
past decade, lots of research has been carried out in the field of environmentally 
friendly machining, but the industrial penetration seems to be limited for many of 
these new technologies. There are barriers that have to be overcome to implement 
the developed green technologies in machine shops. At the same time, the research 
community should continue its efforts on developing new machining technologies 
with improved economic and environmental performance. 


Summary 

Machining represents one of the most important manufacturing processes in mod- 
ern industry. However, a variety of environmental issues have been identified to be 
associated with machining processes. These issues include energy/re source con- 
sumption, airborne emissions, wastewater discharge, solid wastes, and occupational 
health risks. The ubiquitous use of cutting fluids in machining operations is the 
primary cause of most of the issues. As a result, in the past two decades research and 
development efforts on environmentally friendly machining have been largely 
focusing on reducing environmental impacts due to the use of cutting fluids. In 
this chapter, a comprehensive review was provided on these efforts. Separation and 
filtration technologies have been developed and utilized to extend the life span of 
cutting fluids by removing particulates, free oils, and other contaminants. New, 
bio-based formulations have been designed to replace petroleum-based fluids, 
which deliver lower toxicity and higher biodegradability without sacrificing 
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machining performance. The development of new tool materials and coating also 
makes it possible to reduce or even eliminate the reliance on cutting fluids during 
machining. In dry machining and minimum quantity lubrication (MQL) machining, 
effective cooling is achieved by delivering small amount of liquid nitrogen, air, 
water, vapor, or refrigerated C0 2 . In addition to these techniques, machining 
process can be optimized for reduced environmental impacts, especially energy 
consumption and carbon footprint. For many of the new techniques, limited indus- 
trial penetration has been achieved, mainly due to economic and knowledge 
barriers, an issue that has to be addressed by the research community and the 
machine tool industry collaboratively. 
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Abstract 

Simulation of machining using finite element analysis is widely practiced now, 
using special purpose as well as general purpose commercial software packages. 
While simulations cannot be guaranteed to quantitatively reproduce all the 
outputs of actual machining operations, appropriately designed simulations can 
easily predict trends correctly and lead to improved understanding of particular 
features of a process that is being studied. This has led to many valuable insights 
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about different machining processes. Various challenges remain to be overcome 
before simulation based design of machining processes can be routinely used by 
industry. 


Introduction 

Advancements in computing power have led to a golden age of simulation where 
any system, natural or human, real or virtual, is fair game. Realistic simulations of 
complex systems are typically developed with the objective of improved under- 
standing leading to improved design and operation of these systems. The realism of 
simulations of natural systems can be verified by comparing predictions against real 
behavior, both in magnitude and trends. Finite element models that simulate the 
multiple physical processes involved in machining have been used to better under- 
stand various aspects of machining, and usually model the trends very well. 
However, only in very few reported instances have finite element simulations of 
machining simultaneously matched all of the easily measurable outputs of interest, 
such as cutting and thrust forces, tool temperature, contact length, chip thickness, 
and chip curl (Usui and Shirakashi 1982). This is due mostly to the complexity of 
material and friction models, the large amount of computational time required for 
even 2D simulations, and the requirement of a large number of simulations for 
mapping the response function between inputs and outputs for this highly nonlinear 
process. Additional issues preventing perfect matching of inputs and outputs 
include (i) the lack of accurate and comprehensive experimental data for all the 
outputs of interest under some conditions, (ii) inaccuracies in some of the measure- 
ments, such as tool temperatures, due to the partial 3D character of even orthogonal 
machining experiments, (iii) paucity of flow stress data at high strain, strain rate and 
temperature, (iv) uncertainties in analytical models that may be used to guide the 
search for improved model parameters, (v) the high computational complexity of 
simulation of dynamic thermo-mechanical phenomena at multiple length and time 
scales, especially in 3D, (vi) inhomogeneity of material response at small length 
scales comparable to the grain size, (vii) uncertainties in tool wear models, etc. Still, 
given that a finite element model is appropriately constructed and analyzed for the 
purpose of understanding some fundamental aspects of the process, progress can be 
made by detailed study of the distribution of the output variables in simulation 
results, and by analyzing trends in simulation results with respect to cutting 
conditions. 

This chapter highlights how, via appropriate modeling assumptions and tech- 
niques, simulations can be used for understanding fundamental aspects of the 
machining process. For instance, simulations have been used to understand that 
creation of free surfaces while machining ductile materials does not require frac- 
ture. In contrast, simulation of the machining of fiber-reinforced polymers using 
different models for failure of the fibers, matrix, and their interface is used to 
understand the different modes of failure that occur as the fiber orientation angle 
changes. Simulations also have helped understand shear band formation in 
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machining heterogeneous materials and the clear preference for the deformation to 
occur along a mode of deformation, continuous or cyclic, that requires the lowest 
energy within the constraints imposed by equilibrium, constitutive response, and 
compatibility. Simulations can be used to compare different available models for 
the heat partition in machining and improve upon them. Simulations also reveal that 
orthogonal cutting can be considered to be a 2D machining process to study forces, 
strain fields, etc., but not for temperature fields. Due to the transition from plane 
strain to plane stress, the temperature along the side face is significantly different 
from that along the midplane, and it requires 3D simulations to try to match the 2D 
fields measured on the side faces of tools. 

Improvements in the accuracy with which the constitutive model represents the 
strain, strain rate, and temperature-dependent flow stress and in the model for 
the friction stress along the contact interface are important for understanding the 
observed deformation and other effects in machining. The interplay between the 
primary and secondary shear zones plays a key role in determining the nominal 
shear plane angle in case of continuous chip formation and the transition from 
continuous chip formation to adiabatic shear banding. Understanding the effect of 
cutting conditions as well as coatings and coolants, on friction, temperature distri- 
bution, tool wear, residual stresses, burr formation, etc., are practically important 
outstanding problems, but the need for 3D simulations to accurately capture 
important geometric effects makes it highly challenging to run enough of these 
simulations to verify the predictions and fine-tune inputs and achieve a reasonable 
level of certainty in the inferences. For this reason, the present utility of simulations 
of machining lies more in improving understanding, predicting trends, and provid- 
ing guidance for improvement than in providing exact numbers that solve indus- 
trially relevant problems. 


Orthogonal Cutting Versus 3D Cutting 

Cutting or machining is the removal of material from a softer workpiece by a hard 
cutting tool by the movement of a cutting edge through the work material. 
Depending on the material being machined and the cutting conditions, most of 
the deformation is concentrated in one, two, or three deformation zones called the 
primary, secondary, and tertiary shear zones. All real cutting processes are 3D 
deformation operations, wherein at least some portions of the material deform in all 
three dimensions, and all of the material cannot be considered to be deforming in 
plane stress or plane strain (Fig. 1). In addition, the cutting edge is typically not a 
single straight line, resulting in a complex 3D deformation pattern. Orthogonal 
cutting refers to a few types of cutting operations in which a material is removed by 
a single straight cutting edge that moves in a direction perpendicular to itself, 
creating a planar machined surface that is parallel to a free surface of the work 
piece being machined, as shown in Figure 1. Under these restrictions, most of the 
material being cut by the middle of the cutting edge (i.e., other than the regions 
close to the ends of the segment of the cutting edge that is engaged in cutting) can 
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Fig. 1 (a) Initial work, chip and tool geometry of 3D ALE FEA model showing the boundary 
conditions and (b) Steady state deformation geometry, showing the reduction of chip thickness 
near the side faces of the workpiece, where the deformation changes from plane strain to plane 
stress. £ = constraint on material £ = constraint on mesh 

be assumed to undergo 2D plane strain deformation, as seen from the variation of 
chip thickness observable in the chip outflow cross section in Fig. lb. Processes 
such as planing or shaping of a sheet, end turning of a tube, and reducing the OD of 
a disc, in which the cutting edge is wider than the thickness of the workpiece, moves 
through the material such that the direction of relative motion is perpendicular to 
the cutting edge, cuts the entire thickness of the workpiece in one pass, and cuts at a 
uniform feed (which also causes the depth of cut to be constant), can be called 
orthogonal cutting operations. All of the mechanics and physics of machining that 
need to be simulated remain the same between orthogonal and 3D machining 





30 Simulation in Machining 


1159 



Fig. 2 Description of deformation of a domain and the discretization of the domain and its 
deformation. The relationship between displacement and strain (and between their variations) is 
also shown. Vectors are indicated using boldface type when an index is omitted. Subscript indices 
indicate components, and the special subscript n indicates the node a quantity is associated with. 
Repeated indices imply summation over all coordinate directions, or over all nodes, and a comma 
in the subscript indicates differentiation with respect to the direction(s) indicated by the subse- 
quent index (indices) 

operations. 3D simulations are more complex and need at least 10 x more compu- 
tational time and resources than 2D simulations. To this day, 3D simulations are 
prohibitively expensive from the point of view of using these to develop and 
validate a simulation capability for new materials and processes. However, 
researchers in academia and in industry are beginning to use 3D simulations to 
study 3D operations like drilling, milling, tapping, hobbing, etc. (Guo and Domfeld 
2000; Rotella et al. 2013; Arrazola et al. 2013) where the 3D nature of chip 
formation and heat flow have to be modeled correctly for gaining insights required 
for understanding burr formation, improving the design of tools, etc. 


Introduction to Finite Element Analysis of Machining 

The finite element method (FEM) is a method of approximating variables that are 
complicated functions of spatial dimensions using simpler piecewise functions each 
valid within a small finite element. Figure 2 shows a domain of interest V, and some 
regions of it discretized into finite elements. A finite element is typically a convex 
polyhedron with faces, edges and vertices (called nodes) constituting its bound- 
aries. The distribution of a variable within a finite element, at a given time, is 
obtained by interpolating the nodal values at that time. 

For isoparametric elements, the original (material) locations of the n nodes X n 
prescribe the original geometry of the element X(|) = X u N n (f;) [Note that the 
equations are written using indicial notation, Belytschko et al. (2000). While 
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other indices vary over spatial dimensions, subscript index n identifies nodal 
quantities and varies over the number of nodes of an element or the whole structure, 
depending on the context, and e varies over the number of elements. Boldface 
symbols represent vectors/tensors.] as a mapping from a parent element 
(a geometrically simple instance of the type of element) via interpolation functions 
N n in the same manner as the nodal values of a variable (displacements u n in Fig. 1) 
govern its spatial variation over the element w(|, t) = u n (t)N n (%). The current spatial 
location of the material at original location Z(|) is given by x(%, t) = u(i *, t) + X(|) = 
x n (t)N n (Jz). The interpolation functions N n (t;), also called shape functions or influ- 
ence coefficients, express the relative influence of each of the n nodes of an element 
on the variables at each point within it as a function of its local coordinates £ but can 
also be expressed in terms of the material coordinates X as N n (t;(X)). Note that the 
influence coefficients at a point £ are the same for all the variables (displacements, 
velocities, accelerations, etc.) at a node. In coupled thermomechanical analysis, the 
nodal variables also include temperature. 

If external loads and/or boundary conditions are applied over some points, lines, 
areas, or volumes of the domain of interest, the discretization of the domain is 
accomplished in a manner as to simplify the description of these; for instance, if 
concentrated loads are applied at some points, nodes would be situated at those 
points. If this is precluded, for instance, by the location of these loads moving with 
respect to the material over time, the loads may be approximated or the mesh would 
be modified over time to follow the loads, and the flow of material across elements 
would be tracked. 

The interpolation functions used for approximating the spatial distribution of the 
variables within each finite element are typically polynomials. The interpolation 
functions are typically parameterized in terms of the nodal values of these variables 
as mentioned above. For a given set of nodal values (i.e., the parameters), the 
polynomial is fully defined over the boundaries as well as the interior of the element 
(i.e., all the degrees of freedom of the polynomial are constrained). From the order 
of the polynomial, the order of continuity along the boundaries can be inferred. 
Conversely, the order of the polynomial chosen for approximating the variable 
depends on the continuity (smoothness) requirements across element boundaries, 
and the number of parameters, i.e., degrees of freedom, will increase with the order 
of the polynomial - the number of nodes and/or the number of variables defined at 
the nodes will increase (slope, curvature, etc., may be added). The continuity 
requirements are enforced by recognizing that the values of the governing variables 
at a node are the same for each of the elements sharing the node, during assembly of 
the equations governing the behavior of the individual elements into a system of 
equations representing the behavior of the entire domain of interest. 


Mechanical Deformation 

The fundamental idea behind FEM is that the nodal values that characterize the 
spatial variation of unknown quantities can be easily arrived at by expressing the 
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principles that govern the behavior of the domain in a “lumped” form, i.e., as an 
equivalent integral principle, that is satisfied over each finite element. Typically, the 
principles are expressed as partial differential equations (PDEs) valid at every point 
within the domain, called the “strong” form. They can also be expressed as an 
equivalent integral form, which is called the “weak” form because satisfaction of 
the integral form for a set of test functions does not necessarily imply that the PDE 
is satisfied everywhere within the element. For purely mechanical deformation, the 
momentum equation (or the equation of motion, Eq. 1) that needs to be satisfied at 
every point within the domain is the strong form, and the principle of virtual work 
(Eq. 2) is a weak form. Belytschko et al. (2000, Sect. 2.3) show the derivation of this 
weak form from Eq. 1 by integrating the product of the equilibrium equation with 
an arbitrary variation in the displacement, the test function. Key steps include 
integration by parts of the stress gradient term to reduce the order of the derivatives 
and recognizing that the inner product of stress and variation in displacement 
gradient is the same as the inner product of the stress and variation in strain, due 
to the symmetry of the stress tensor. 


°ii,j + Pi - P*i = 0 


( 1 ) 
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Equation 1 states that the inertial force due to the rate of change of momentum 
equals the sum of the body force and force due to the stress gradient. Equation 2 
states that the sum of the change in internal energy and the change in kinetic energy 
of the structure, due to an arbitrary variation in displacement, should equal the sum 
of the external work done by distributed traction vectors ({ r } = t ? -), distributed body 
forces ({/?}= /?,), and the point load vectors at the n nodes (P m ). 

The finite difference method enforces the partial differential Eq. 1 at points on a 
uniformly spaced grid, which imposes some limitations on the geometries and 
boundary conditions. On the other hand, FEM satisfies the principle of virtual 
work on an element-by-element basis for arbitrary geometries and boundary con- 
ditions, while approximately satisfying the equation of motion. 

As shown in Fig. 2, the displacement and strain within the element can be 
expressed as functions of the nodal displacements. The strain measure shown, e^-, 
is the Lagrangian infinitesimal strain tensor, which is the symmetric part of Ujj, the 
displacement gradient with respect to the original configuration (or material refer- 
ence frame). The variation in strain can be expressed as the symmetric part of 
the gradient of the variation in displacement <5 M/. Since the shape functions are the 
source of the spatial gradient of displacement, the displacement gradient u f j is a 
simple product of the nodal displacements with the gradient of the shape functions, 

Uu = u ir p nj , where B 


dNn (£) _ dN n (0 ag* 


nj 


dX, 


dt k dX- 


For 3D problems, the six independent components of the stress are 

'T 

typically written as a vector {< 7 } = [o n , o 2 2 , o 33 , o 2 3 , <r 31 ] (in Voigt form). 
For an element with n nodes, if the nodal displacements are recast as a vector 
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{ U] = [U i n , U 2n , U 3n ] , of length 3 n, the shape function matrix [TV] for the element 
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of size 3 x In. The gradient of the shape functions \B\ 
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of size 6x3 n that when multiplied with the nodal displacements gives the strain 

T 

vector {s} = [e n , s 2 2 , £ 33 , y 12 , Y 31 ] • Note that y z y = e z/ - + e,- z - is used in the strain 

'T 

vector so that { 8s } { o } gives the increment of work done per unit volume. As noted 
above, gradients are usually calculated with respect to the local coordinates 
and transformed to the material reference frame using the inverse of the Jacobian 

Q X . 

Jjk = Integrating the terms of the virtual work equation over the volume of each 

element and summing over all elements, the equations to be satisfied for arbitrary 
variation in nodal displacements are obtained as shown (using matrix notation) in 

Eq. 3. 



Where { U) is the global displacement vector, {P} is the global load vector, and 
the superscript prefix e indicates element-specific quantities or degrees of freedom. 

The stresses are related to the strains via constitutive equations, e.g., linear 
elasticity where e {o} = [D] e {s} = [D][B] e { U}, where [D] is the elasticity matrix. 
For nonlinear constitutive models, e.g., plasticity, the stress needs to be 
tracked separately and updated to correspond to each increment in the solution 
process (see below). Since the vectors of nodal displacements, accelerations, and 
variations in displacement are constants, they can be factored out of the integrals. 
Distributed body and traction forces are appropriately apportioned to the nodes by 
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the shape functions acting as weighting functions, resulting in the corresponding 
nodal force vectors e {b) and e { t } . 



The first integral within the brackets gives rise to the element stiffness matrix 
e [K] that when multiplied by the displacement vector gives the internal force vector 
e {f} mt required to effect the deformation corresponding to the displacement e {U). 
The second integral gives the element mass matrix e [M] that when multiplied by the 
acceleration vector gives the inertial force required to cause the acceleration. The 
integrations are performed numerically, typically using Gauss quadrature that 
evaluates the integral as a weighted sum of the values at the functions at one or 
more integration points within the element. The number of integration points and 
the weights depends upon the total order of the polynomials in the integrand. 
Reduced integration refers to numerical integration using a smaller number of 
integration points than required for accurate integration of the polynomial to 
speed up the calculations as well as to mitigate overly stiff response (locking of 
fully integrated elements, Chap. 8 of Belytschko et al. (2000)). However, this 
causes the strain to be incorrectly calculated as being zero for some modes of 
deformation (zero-energy modes of deformation, also called hourglass modes 
because this is the shape of these modes for quadrilateral elements) and should be 
complemented by hourglass control techniques that assign a penalty force to 
mitigate these. If significant amount of hourglass energy is reported, it indicates 
the need to refine the mesh to correctly account for the deformation energy in 
regions of high strain gradients. 



For an element with n degrees of freedom, equation 5 gives rise to n terms of the 
traction and body force vectors, and to nxn terms of the stiffness and mass matrices 
that represent the interactions between the n degrees of freedom. Terms 
corresponding to a degree of freedom at a node will figure in the equations for 
each of the elements that the node is part of. The process of collating the element 
displacement vectors into a global displacement vector {!/} is called assembly. 
During assembly, the traction force and body force terms corresponding to a degree 
of freedom, arising from the different elements that a node is part of, are summed to 
get the global body and traction forces along that degree of freedom. Similarly, 
elements that share boundaries give rise to different element stiffness and mass 
matrices, each containing terms representing interaction between the same pairs of 
degrees of freedom, that need to be summed together to obtain the global stiffness 
and mass matrices. The end result of assembly is the following global equation. 
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{SU} T [*]{[/} + [M\{U} - {b} - {r} - {P} 





The quantity within square brackets is the residual force vector. Recognizing 
that its inner product with arbitrary variation in nodal displacements will be zero 
only if each of the terms is zero, the system equations are written as 


\K}{U} + [M\{U} - {b} - {t} - {p} = 0 

[K\{U} + [M]{U} = {f} 

Equation 7 above is a finite element semi-discretization, because though the 
spatial dependence has been discretized through the use of finite elements, the time 
dependence is still in the form of a differential equation. Note that If viscous forces 
had also been included in the equation of motion, there would be an additional term 
dependent on the nodal velocities. The initial and boundary conditions need to be 
applied, and this system of second order ordinary differential equation (ODEs) needs 
to be solved. 

For quasistatic loading, such as when analyzing the cutting of a perfectly plastic 
material for the purpose of developing slip-line field models, the stiffness force 
dominates and the system equation can be written as [K] { U } = {/} . If rigid body 
translations and rotations are not constrained, structures are capable of rigid-body 
motions requiring zero internal energy. Since these are eigenmode s with zero 
eigenvalue (finite displacement requiring zero force), and the determinant of a matrix 
is the product of its eigenvalues, the full stiffness matrix is singular or rank deficient. 
As a degree of freedom is constrained, the force terms corresponding to it can be 
moved to the external force term on the right side, eliminating the columns of the 
stiffness matrix corresponding to that degree of freedom. The row can also be 
eliminated as it is only useful to compute the reaction force once the other displace- 
ments are known. When a sufficient number of these degrees of freedom are 
constrained, the stiffness matrix becomes non-singular and the quasistatic problem 
can also be solved (the dynamic problem can always be solved). 


Explicit and Implicit Time Integration 

The two main methods of solution are (i) direct integration methods and (ii) modal 
methods. Direct integration methods use finite difference to solve Eq. 7 and track 
the nodal displacements over time. There are two classes of finite difference 
approximations to / +1 {£/} in terms of the displacements ,■{£/}, velocities, and 
accelerations at successive instants of time, g. 

Explicit methods: i+1 {U} = g(i{U}, ^{U}, . . .) (8) 

Implicit methods: i+l {U} =*( i+ i{*/}, ,{[/}, . . .) (9) 
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The main difference between implicit and explicit methods is that in explicit 
methods, the displacement , +1 \U) at the next instant of time i+l t depends only 
upon the displacement, velocity, and acceleration at the current and previous 
instants and can be simply calculated if these are known. This is achieved by 

using the system equations at time g to calculate the velocity 7 - +1 / 2 {t/} at i+ i /2 t 
and the displacement /+1 { U) at i+l t, as shown below. 



Note that the displacement i+ \ [U] is an extrapolation; if the velocities and 
accelerations continue to remain close to what they are at the current time, then 
the displacement at the next instance of time can be approximated well by enforcing 
the equation of motion at the current time. Thus, the time step size A t = O+i t — g) 
needs to be small compared to the time period of even the highest frequency 
oscillations in the system for the estimated displacement ,+i{t/} to not begin to 
diverge from the actual displacement. Specifically, At < 2 /co max , which is equiva- 
lent to At < L/c where L is the shortest side of the smallest element and c is the 
velocity of sound (dilatational wave speed) in the element. For this reason, the 
smallest element side controls the time step size, and very small or highly distorted 
elements result in large solution times. For modeling the deformation around the 
cutting edge accurately, for instance, to accurately model residual stresses, subsur- 
face delamination, etc., very small elements are required causing the time steps to 
be small and the simulation time to be large. Subcycling, whereby the nodal forces 
due to each of the elements is updated at different multiples of the critical time step 
of the smallest element is sometimes used, but is not very common. As an aside, an 
effective way to verify that a consistent set of units is being used for displacement, 

loads, stresses, etc., is to verify that the speed of dilatational waves (V c = \JEjp, 
where E is the elastic modulus and p is the density) is obtained correctly from the 
values of E and p. 

Note that when using explicit integration, even a purely quasistatic problem has to 
be solved as a dynamic problem where loads ,{/} and prescribed accelerations are 
smooth (do not have jumps) and inertial forces are negligible. If the loads are 
suddenly applied, the accelerations become too high and lead to spurious results 
(especially when mass scaling, discussed next, is used). In a case where an approx- 
imately quasistatic problem is being solved, i.e., where the inertial force is much 
smaller than the stiffness force (or the kinetic energy is much smaller than the internal 
energy), increasing the density of the material, known as mass scaling, is an effective 
strategy to decrease the speed of sound, increase the time step size, and speed up the 
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simulation. The limit to mass scaling should be chosen such that the change in kinetic 
energy from step to step (throughout a simulation) is always a negligible fraction of 
the change in internal energy to ensure that stored kinetic energy has negligible effect 
on the deformation. In most software packages, it is easier to track the total kinetic 
energy and ensure that it is a small fraction of the total internal energy. However, for 
large structures with small regions of active deformation (such as a small primary 
shear zone (PSZ) at one end of a large highly deformed chip within which the internal 
energy is high), local inertial effects may go undetected if the total energy criterion, 
rather than the incremental change criterion, is used. 

By repeatedly solving the above equation for small increments in time, the 
displacements at any instant of time can be obtained. Notice that solving the 
above equation is trivial if [M] is a diagonal matrix; otherwise, this method is 
computationally infeasible. For this reason, the mass matrix is “lumped” into a 
diagonal matrix by moving (summing) all the masses in each row into the diagonal 
term, zeroing out the other terms. In fact, mass and internal force corresponding to 
each degree of freedom can be readily calculated, obviating the need to formulate 
these matrices. Thus, very large problems can be handled easily. 

In implicit methods the displacements /+1 { U) at the next instant of time i+x t also 
depend upon the velocity j + \{U} and acceleration i+ \{U} at i+l t 9 in addition to 
the displacement, velocity, and acceleration at the current and previous 
instants. This occurs when the system equations at time i+x t are used to calculate 
displacement i+x {U}, as shown below 


/+iM 


i{U} + At 


/+1 {*/}+,•{£} 

2 


l 




Solving the above equations for /+ i {£/} and /+ i {£/} 



Using the equation of motion at i+l t, 




i {U} + At i {u}+^ 









The last term within the square brackets on the right is again the acceleration 
/+ i{U}, and the last two terms together estimate the change in velocity between 

time ^ and i+x t, due to the average acceleration j+x / 2 {uy Since the displacement at 
the next instant of time i+x t is obtained by enforcing the equation of motion at that 
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time, implicit methods are unconditionally stable. However, iteration through 
various trial values of , +1 {t/} is needed when [M], [ K ], or {/} depend on /+1 {U}, 
as occurs for structures with nonlinear response, contact, etc. In fact, for many 
constitutive models, the stress cannot be directly related to the displacement, but 
only to the current stress and strain increment. The increment of internal force is 
proportional to the incremental displacement via a tangent stiffness matrix. One 


form of the tangent stiffness matrix is calculated as 


relates the increment of stress to increment of strain as 



T 


; B} 1 [C][B]dV , where [C] 


6\ = [C][e] (Belytschko 
et al. 2000, Sect. 6.4). The process of updating the stress and tangent stiffness 
matrix typically requires iteration using predictor-corrector schemes to ensure that 
the constraints imposed on the stresses and strain increments by the constitutive 
model are satisfied. 

Plasticity, contact, temperature effects, geometry evolution, etc., make machin- 
ing a highly nonlinear problem. Still, the time step size can be 20-100 times the 
critical time step size in explicit methods, limited only by the need to track the 
evolution of [M], [K], and {/}. However, notice that matrix inversion is required for 
solving Eq. 11, because [K] has non-diagonal terms. This makes implicit methods 
sensitive to problem size, and solution time increases as the third power of the 
number of degrees of freedom. While local mesh refinement, condensation of linear 
regions of the structure into superelements, etc., are effective strategies to keep the 
problem size manageable, if the time step size also decreases with local mesh 
refinement (for instance with contact or localized deformation) the solution time 
can increase even more. Moreover, convergence difficulties in the presence of very 
high gradients in strain and highly coupled nonlinear interactions (such as in 
adiabatic shear banding) can cause the solution to fail. Some general purpose 
software packages are prone to this, and overcoming convergence difficulties is 
sometimes a frustrating experience. 


Lagrangian Simulations 

It may also be noted that if only the increment of displacement is required to 
compute the change in internal force vector, the reference configuration can either 
remain fixed (total Lagrangian formulation) or be updated periodically (updated 
Lagrangian formulation). Usually, the choice is dictated by whether the material 
constitutive response is easier to describe with respect to the original or updated 
configurations. Depending on this choice, the strain measure changes and additional 
terms will be needed in the first term of Eq. 4 to convert the increment in 
displacement gradient to an increment of strain. The stress tensor used has to be 
work conjugate to the strain measure, and the component of stress rate that is related 
to material deformation by the constitutive model has to be independent of rigid 
body rotations (Belytschko et al. 2000). 

When there are large gradients in strain (e.g., localized deformations near the 
cutting edge), a Lagrangian analysis with the same mesh leads to large distortions in 


1168 


V. Madhavan 


the elements, which causes convergence difficulties for implicit schemes and small 
time steps for explicit schemes. For this reason, the mesh will have to be periodically 
regenerated and/or smoothed for simulation of cutting of materials that are ductile. 
Complete regeneration of the mesh can be done by seeding nodes in regions of high 
gradients and using a mesh generation scheme such as Delaunay triangulation (Sekhon 
and Chenot 1993; Marusich and Ortiz 1995) or by specifying required element sizes 
along boundary and/or interior points and generating a “paver” mesh (Madhavan 
et al. 1996). The state of the material (stress, plastic strain, temperature, and other 
history variables) is mapped from the old mesh to the new mesh via an interpolation 
step, also called as rezoning (which is trivial for places where new nodes are placed at 
integration points of the old mesh). Using this periodic remeshing and rezoning 
approach, Madhavan et al. (1996) showed that machining of ductile materials can be 
simulated without invoking any failure or separation criterion that separates the chip 
from the workpiece, thereby demonstrating that the deformation around the cutting 
edge in machining is very similar to the deformation around the tip of a wedge indenter. 
When coupled with good remeshing algorithms, this is the most efficient way to 
simulate machining under conditions that lead to periodic deformation patterns due 
to shear banding or cracking and of materials with complex constitutive models. 

To avoid the complexity of developing a good remeshing algorithm, and the 
computational cost of rezoning, a simpler approach adopted often is to use a parting 
line along which the chip material is separated from the workpiece. Separation is 
initiated when a criterion based on plastic strain, damage, distance to the cutting 
edge, etc., is satisfied (Strenkowski and Carroll 1985; Shih et al. 1990; Zhang and 
Bagchi 1994; Guo and Dornfeld 2000). A parting line can be modeled using interface 
elements that tie coincident nodes together till they separate. The portion of the 
workpiece that forms the chip and the portion that forms the machined surface are 
modeled as separate regions that have two sets of coincident nodes (located in 
identical locations) along their common boundary. The interface elements initially 
tie or constrain the coincident nodes to move together. When the separation criterion 
is satisfied, the interface constraints are removed and the nodes become free to move 
along different paths. A variant of this technique models a thin row of elements 
between the two portions of the workpiece that form the machined surface and the 
chip, that get deleted one by one, when the element deletion criterion is satisfied. The 
problem with the parting line approach is that the separation criterion is arbitrary, 
which leads to uncertainty in machining forces, machined surface stresses, interface 
temperature, etc. (Huang and Black 1996; Zhang 1999). Additionally, the separation 
criterion does not solve problems caused by localized high strains in other regions, 
such as the secondary shear zone and along shear bands. 

Despite these shortcomings, the parting line approach may be the most appro- 
priate method for some situations, for instance, to study the chip formation of brittle 
materials or heterogeneous materials. Chuzoy et al. (2003) modeled the spatial 
distribution of phases within a workpiece of ductile cast iron, used different 
mechanical properties for the different phases, and simulated machining. Since 
ductile iron has limited ductility, they utilized an element deletion approach to 
model chip separation. They reported good matches between experimental and 
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simulation results for the chip morphology and cutting forces. An interesting 
observation that was reproduced by their simulation was that the cutting force for 
machining ductile iron is less than that for machining either ferrite or pearlite under 
the same conditions. Obikawa and Usui (1996) modeled the machining of Ti64 with 
an element failure criterion and interpreted their results as indicating that the 
sawtooth chip formation at conventional cutting speeds is initiated by cracking, 
while adiabatic shear banding may be the controlling mechanism at high speeds. 

Simoneau et al. (2006, 2007) carried out a comprehensive study to understand 
the interrelationship between material micro structure and homogeneity of defor- 
mation. They found that the lower strength and higher ductility of the ferrite phase 
caused the deformation to be increasingly localized within the ferrite phase as the 
depth of cut decreases, leading to the formation of “quasi-shear extrusion” chips. 
The type of analysis used by them was a Lagrangian analysis using ABAQUS/ 
Explicit, along with element deletion to accomplish separation of the chip from the 
material. Given the ability of the Lagrangian simulation to preserve sharp material 
transitions if no rezoning or advection occurs, but the need to separate the chip, this 
is probably the most efficient solution procedure that satisfies the intended purpose 
of studying chip formation. 


ALE Simulations 


In an arbitrary-Lagrangian-Eulerian (ALE) formulation, arbitrary mesh motion is 
modeled using mesh displacements cp(X , t) (with respect to the material) as 
additional nodal variables, and the continuity, momentum and energy equations 
are modified to include these variables [18,5]. The convective velocity c = v — v, is 
the velocity of the material with respect to the mesh at point cp in the mesh reference 
frame, i.e., the difference between the velocity of the material (v = dx(X(q > ), t)/dt) 
and that of the mesh ( v = dcp(X[cp\t) / dt). The material rate of change of each 
variable is expressed as the sum of its rate of change in the mesh frame (i.e. at the 
current reference point corresponding to the material point), and a convective term 
arising from the component of the gradient of the variable in the direction of the 
convective velocity. For instance, 


dT(X((p),t ) dT((p,i) 


dt 


dt 


<p 


, dT(<p,t) 
d(p 


0 



The convective term gives rise to additional terms in the weak form, that lead to 
additional terms in the finite element equations. After specification of the mesh 
velocities the material velocities can be solved for. However, for convection 
dominated flows, spurious oscillations occur in the solution unless artificial diffu- 
sion is added to the system (see Chap. 7 of [5]). This is typically accomplished 
using the streamline up winding technique, which adds extra diffusion only in the 
direction of the flow, so that there is no artificial diffusion in directions normal to 
the flow, and the solution still converges to the correct one. 
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Different regions of the boundaries have different constraints on the mesh 
velocity - Eulerian boundaries through which material flows into or out of the 
mesh, and Lagrangian boundaries where the components of mesh and material 
velocity perpendicular to the mesh are the same. In the interior of the domain, the 
mesh velocity is prescribed based on different smoothing strategies to remove 
distortions, and the need to refine the mesh in regions of high solution gradients, 
geometric curvature, etc. 

When explicit integration is used, the coupled equations that describe both mesh 
motion and material displacement are usually solved in an approximate manner 
using a split operator approach. The mechanical deformation (displacement) is first 
solved for with the mesh moving with the material in the usual Lagrangian 
approach. This is followed by a mesh motion step (also called advection step) in 
which there is no deformation of the material, but the nodes are moved to smooth 
the mesh and motion of material (and of history variables such as strain, heat, 
damage, etc.) across element boundaries is calculated. For normal cutting speeds, it 
may be necessary to have an advection step only once every hundred steps of 
Lagrangian mechanical deformation. While the advection step is similar in purpose 
to the rezoning step described above, mesh motions at each update are limited to be 
a fraction of the element size, and simple algorithms can be used to calculate the 
transport across the elements and update these history dependent variables. How- 
ever, special considerations apply to the advection of discontinuous variables such 
as stresses and strain. The slightly dissipative nature of these algorithms can inhibit 
localization of deformation and lead to inaccurate results, especially when the mesh 
is not refined enough or aligned with the direction of maximum shear stress. 

Pantale et al. (1996) and Touratier (1999) developed a capability to carry out 3D 
arbitrary Lagrangian-Eulerian (ALE) analysis of oblique machining. Pednekar 
et al. (2004) used the ALE analysis capability within ABAQUS/Explicit to study 
the transition from plane strain to plane stress in orthogonal machining. They found 
that while the material along the free sides of the workpiece was deforming in plane 
stress, much of the material in the middle deformed in plane strain. It was noted that 
the strain rate in the PSZ did not change appreciably from the middle to the side 
faces because of the kinematic constraints on the deformation in the PSZ, imposed 
by the requirement that chip material outside the PSZ and SSZ travel as a rigid 
body. However, it was noted that the temperature along the side faces was signif- 
icantly different from the temperature along the midplane, due to lower strain 
within the material and lower heat generation along the sides, on account of plane 
stress deformation (Fig. 1). 


Eulerian Simulations 

In a fully Eulerian formulation, the nodes are fixed in space and velocities are the 
nodal unknowns that are directly solved for. Since velocities are the nodal variables, 
strain rates are readily computed and the FEA equations are derived from the 
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principle of virtual power. A natural simplifying assumption is to consider the 
material to be viscoplastic, wherein the stress is proportional to the strain rate 
through a viscosity term. Carroll and Strenkowski (1988) present a detailed deri- 
vation of Eulerian finite element equations and discuss the analogy between the 
equations for linear elasticity and viscoplastic deformation. In fact, it should be 
possible to use a boundary adjustment procedure along with an incompressible 
linear elastic material model and selectively reduced integration elements 
(Strenkowski and Moon 1990) to derive the flow fields for viscoplastic flow using 
elastic analysis and interpreting the displacements as velocities. However, the 
Eulerian formulation cannot be easily used for complex constitutive models 
where many variables are associated with material points and need to be tracked 
with the flow, where the geometry (of the chip) is not known a priori and evolves 
with time, and especially for cutting conditions that result in periodic fluctuations 
such as adiabatic shear bands. 

Usui and Shirakashi (1982) developed a type of analysis that they called 
“iteration convergence method” which combined the features of Lagrangian and 
Eulerian analyses in a unique and very interesting way. A finite element model was 
created with an assumed shear angle and chip shape. Mechanical FEA of the tool 
movement into the workpiece was carried out for a small cutting distance, within 
which the plastic zone and cutting force stabilized. The incremental displacements 
were treated as velocities, and the flow lines (streamlines) were calculated. Inte- 
gration of the strain rate along the streamlines gave the strain distribution. The heat 
source due to plastic dissipation, conduction, and convection of heat due to material 
flow were taken into account in a thermal FEA which computed the steady-state 
temperature field. The strain, strain rate, and temperature obtained were used to 
update the flow stress of the material. This sequence of steps was repeated till the 
flow-stress distribution and flow lines converged. This is probably the most efficient 
way to analyze steady-state cutting under conditions that lead to continuous chips, 
while taking into account complex constitutive models. The solution can be 
expected to be accurate, except for any inaccuracy in the assumed location of the 
flow separation point where the workpiece separates into the chip and the machined 
surface. 


Thermomechanical Coupling 

In a coupled thermomechanical simulation of a domain of interest, the 
governing spatial variables that need to be tracked over time include 
displacements and temperature. In addition to the equation of motion, the equation 
of conservation of energy also needs to be satisfied. For a stationary solid domain, 
conduction is the dominant mode of heat transport inside the domain. 
The differential equation governing the nonsteady (transient) temperature field 
within an isotropic conductive material of thermal conductivity k , specific heat C, 
and density p is 
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Equation 13 states that the divergence of the heat flux, i.e., the net heat flux 
leaving unit volume, is equal to the rate of heat generation within that unit volume 
less the rate of increase in heat energy of the unit volume due to increase in 
temperature. This equation is applicable when the medium is stationary with 
respect to the heat sources, as in Lagrangian FEA where the material and the heat 
sources move together as dictated by the mesh. As seen above for mechanical 
deformation, this pde is converted into a variational form that holds for arbitrary 
variations in temperature, and this variational form is used to discretize the domain 
and obtain a set of ordinary differential equations that constitute the finite element 
formulation of the conservation of energy. 

Whether or not there are any external heat sources, changes in temperature due 
to mechanical deformation (thermoelasticity, plastic dissipation, etc.) need to be 
computed. The heat generated by plastic deformation is taken to be q = /? a : e 
where /?, called the Tay lor-Quinney parameter, is the fraction of plastic power that 
is converted into heat, which lies between 0.9 and 1.0 for the large strains encoun- 
tered in machining. 

A simple approach to approximately model thermomechanical coupling is to 
assume that the elements are adiabatic (i.e., zero out the conduction term in Eq. 13) 
and to compute the change in temperature due to energy dissipated and its effect on 
the flow stress within the constitutive model (in the stress update process). When 
conduction is nontrivial, as for micromachining and the normal speed machining of 
steels and other hard-to-machine materials, coupled thermomechanical analysis is 
required. For simulation of cutting under conditions that produce continuous chips, 
steady-state flow and temperature fields can be easily obtained using a Eulerian 
analysis when history effects are negligible. 

For Lagrangian or ALE simulations of machining, when explicit integration is 
adopted, the most commonly used procedure is to use the split operator approach to 
weakly couple the mechanical and thermal steps. Alternate mechanical and thermal 
calculations of the finite element equations are carried out, keeping the temperature 
constant during the mechanical step and the displacements constant during the 
thermal step. The maximum thermal time step size is given by the following 
equation and can restrict the time step size to be smaller than that required for the 
mechanical calculations, when the element is very small or the thermal diffusivity 
a = K/pC is very high. 


= A/ 2 /2a (14) 

As noted above, the mesh has to move with respect to the material and either 
rezoning or advection has to be carried out to map the history variables onto the new 
mesh. For thermomechanical analysis, the temperature also has to be mapped to the 
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new mesh. This step leads to a finite convection of heat through the mesh, entrained 
with the flow of material. Thus, conduction is handled during the heat transfer step, 
while convection due to material flow is handled during the mesh motion 
(advection) step. 

It has not been possible to carry out Lagrangian or ALE simulations of machin- 
ing for long enough times for the thermal fields in the tool to be accurately modeled, 
even using one or more intervening steady-state thermal simulations (Yen 
et al. 2004; Deshpande and Madhavan 2012). Scaling the specific heat of the tool 
(Deshpande and Madhavan 2012), in regions away from the interface, is an 
effective way to simulate the temperature field in the cutting tool accurately. 
Without this, tool temperature cannot be accurately predicted by Lagrangian or 
ALE analyses, which can lead to error in forces, wear rate, residual stresses, etc. 

Under conditions where adiabatic shear bands are manifested, the specific heat 
should not be scaled within a small region adjacent to the chip-tool interface to 
accurately model the periodic changes in temperature that accompany the cyclic 
changes in deformation. Deshpande and Madhavan (2012) showed that the depth of 
this region into the tool (from the chip-tool interface) can be estimated by modi- 
fying Eq. 14 to obtain the distance over which significant thermal oscillations are 
experienced in response to an oscillatory heat flux of period At, as 



where, is the thermal diffusivity, co = In! At is the angular frequency 
corresponding to time period At, and A is the wavelength of thermal waves. Carslaw 
and Jaegar (1959) show that in response to a periodic boundary condition of 
frequency co applied on a planar surface of a medium with diffusivity a, planar 

temperature waves of the same frequency, and wavelength A = In^la/co are 
established. 


Contact Interactions 

Both mechanical and thermal contact between the tool and the workpiece have to be 
simulated. The rake, flank, the cutting edge of the tool, and the corresponding faces 
of the workpiece, namely, the contact face of the chip and the machined surface, are 
the faces across which contact is enforced. Contact boundaries are defined using 
element edges in 2D and faces in 3D. There are two main methods for enforcing 
contact, namely, kinematic (or constraint based) contact and penalty contact. 
Constraints result in additional variables (Lagrange multipliers) being added to 
the degrees of freedom, leading to augmentation of the stiffness matrix with 
corresponding additional terms. Penalty contact is enforced by adding additional 
correction terms to the force vector that are proportional to the penetration of the 
slave (workpiece) nodes into the master (tool) surface. 
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Displacements tangential to the contact interface are also tracked and lead to the 
addition of frictional force terms corresponding to the friction model assumed for the 
chip-tool interface. When there is relative slip at the contact interface, Coulomb friction 
models the tangential (i.e., shear) stress as being proportional to the normal (i.e., contact) 
stress via a coefficient of friction. Sticking friction considers the tangential stress to be 
constant when there is slip. Several variants of these models are also implemented, such 
as shear-limited Coulomb friction to avoid excessive element distortion and softened 
contact to smooth out sudden changes in friction forces that can lead to convergence 
issues in implicit time stepping and to stress oscillations in explicit time stepping. Noise 
in the contact stresses is usually present in explicit analysis, especially for penalty 
contact, even when a damping force term is added. This superimposed noise makes it 
difficult to simulate sticking friction accurately and the results have to be scmtinized to 
verify that the actual friction stress is correctly modeled. 

Experimental forces indicate a high average coefficient of friction and results 
from split-tool technique (Kato et al. 1972; Childs and Mahdi 1989), photoelasticity 
(Bagchi and Wright 1987), etc., and indicate a nearly constant shear stress for the 
first one-half or more of the contact length from the cutting edge, followed by a 
nearly linear decrease to zero shear stress at the end of chip-tool contact where the 
chip curls out of contact with the tool. Due to the high normal stress at the contact, if 
an experimentally indicated average friction coefficient is used without imposing a 
shear stress limit artificially, it will be found that a layer of chip material gets stuck 
to the tool and internal shear within the chip limits the friction coefficient to the flow 
stress of the chip material at the temperature, strain, and strain rate corresponding to 
those along the interface. This typically reproduces the experimentally observed 
friction stress distribution and is the most physically accurate way to model friction. 
However, this requires effective remeshing or ALE techniques to avoid excessive 
mesh distortion. In addition, since internal shear within the chip causes the effective 
friction coefficient to be lower than that used within the Coulomb model, the 
friction coefficient used for the Coulomb model will have to be chosen to be a 
higher value than found in experiments. Iterations will be required, either manually 
or using a friction subroutine (Deshpande 2006), till the experimental friction 
coefficient can be matched. Ozel (2006) has studied the effect of different friction 
models using a commercial FEA software package. The accuracy of implementa- 
tion of friction under sticking friction conditions varies across packages and users 
would be well advised to verify from the results that the time averaged friction 
stress corresponds to that expected for the average normal stress observed. 

In addition to the mechanical interactions, thermal contact also has to be 
modeled. Due to the high contact pressure and clean surfaces at the chip-tool 
interface, an appropriate model of thermal contact is to assume that thermal 
resistance of the contact is negligible (i.e., gap conductance is very high), so that 
the temperature of corresponding nodes of the chip and the tool, across the contact, 
is nearly the same. Imposing this as the boundary condition on the tool tip, and 
appropriate boundary conditions on the far side of the tool (constant temperature or 
convective boundary conditions), the total heat flow into the tool, the heat partition 
coefficient, and the tool temperature fields can be obtained. 
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Constitutive Models 


Constitutive models describing the mechanical behavior of materials at large 
strains, high strain rates, and high temperatures are required for simulation of 
machining. In addition, as the material heats up to high temperatures while still 
within the secondary and tertiary shear zones, phase changes could occur while the 
material is deforming. This would require constitutive models to track the material 
history (time at temperature, for instance). Material models of such complexity are 
very hard to use because of the extensive experimental work required to develop the 
material model coefficients for each material temper (Maekawa et al. 1983). 
A much simpler material model, the Johnson-Cook (JC) material model shown in 
Eq. 16 below, is the most commonly used material model. The effective stress 
(or flow stress) is written as a product of a power-law strain-hardening term, a 
logarithmic strain rate-hardening term, and a power-law thermal-softening term 
(which is often approximated to be linear, with m = 1). Other constitutive models 
used can often be approximated quite closely by this model. 
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The five parameter JC flow stress model is sometimes supplemented by a 
multiplicative damage term that models the strength degradation due to damage 
evolution using five additional parameters. 

It is usually not possible to locate model parameters for materials of interest in 
the same temper, grain size, etc., as used in cutting experiments. In addition, 
material parameters commonly found in literature are obtained from standard 
tests for mechanical properties such as the split Hopkinson pressure bar test 
(SHPB), which are not capable of mimicking the high strains, strain rates, and 
temperatures, especially in the secondary shear zone. Typically, the strain the 
material is subject to in SHPB tests is small, and the models derived overestimate 
the work hardening at the large strains that occur in machining. For instance, using 
the Johnson-Cook (JC) material model constants available in the literature for AISI 
4340 at 43 HRc, the cutting forces obtained from ALE FEA have been found to be 
almost three times the experimental values. The thrust forces were about twice the 
actual values, and the average strain rate along the primary shear plane was found to 
be one-half the experimental value obtained from digital image correlation of high- 
speed images (Deshpande et al. 2010). 

Development of material models that permit simulation results to match exper- 
imental observations, not just for forces but also for strain rates, temperature, chip 
curl, etc., is a topic of current research. A few research groups have carried out 
extensive experimental testing to measure the mechanical strength of materials 
under conditions similar to those in machining (Usui and Shirakashi 1982; 
Maekawa et al. 1983; Childs 1997). It is not often the case that data under 
appropriate conditions is available (Oyane et al. 1967), which can be used directly 
(by Oxley and co-workers in this case, to develop their generalized model for 
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steels), motivating some groups to continue to make efforts in this direction 
(Jaspers and Dautzenberg 2002; Mates et al. 2008; Bums et al. 2011; Wang 
et al. 2013). Such experiments are highly complex to (i) achieve heating rates and 
temperatures similar to those in machining, (ii) build up to high strains in multiple 
steps each of which impose only small strain increments in order to keep adiabatic 
heating small, (iii) recover the specimen without multiple impacts so the strain 
increment is well known, and (iv) cool the specimen rapidly after each test so that 
changes in micro structure are minimized. Since an extensive amount of these 
complex experiments is required to develop a comprehensive model for even one 
material, there is an understandable reluctance to embark on such an effort. 

Many researchers in the machining area have begun using machining tests to 
develop material models suitable for modeling of machining (Shatla et al. 2000; 
Ozel and Zeren 2006; Shi et al. 2010; Liu et al. 2013; Deshpande et al. 2010). 
However, in machining, the evolutions of strain, strain rate, and temperature are 
coupled to one another, and it is also not possible to minimize temperature increase 
by having small strain increments. In addition, since it is not possible to control the 
hydrostatic stress, damage within materials of limited ductility will be different in 
regions with different hydrostatic stress, and material damage needs to be quantified 
by separate tests (Li and Hou 2014). Unless the evolution of damage is also tracked, 
the effect of change in edge radius, rake angle, etc., will not be reproduced correctly 
by the simulation. Chandrasekaran et al. (2005) used milling tests to refine material 
properties obtained from SHPB tests and found this to work well for continuous 
chip formation but encountered issues with extending this to shear-banded chip 
formation, even after making changes to the model to account for this. Guo 
et al. (2006) fit the JC as well as the Bammann-Chiesa-Johnson (BCJ) material 
model to high-temperature compression test data for AISI 52100 steel and found 
that the chip morphology of shear-banded chips is predicted correctly by the BCJ 
material model. Uncertainties in friction model, nonuniqueness of model parame- 
ters, etc., also add to the uncertainties in material models developed using machin- 
ing tests. Shrot and Baker (2010) found, with a simplified FEA model of machining, 
that multiple sets of constitutive model parameters produce identical results for 
cutting forces and chip curl. 

The enormous amount of physical testing and validation required to verify that 
the material properties, contact conditions, etc., are accurate representations of 
reality makes this affordable only in the context of materials such as Ti-6Al-4 V 
where the economic importance of the material and the low productivity of 
machining operations together have contributed to supporting a large body of 
research on this topic. One would hope that a method for efficient determination 
of material models suitable for machining simulations is a fortuitous by-product of 
this research. 

For simulation of machining of composites (Arola and Ramulu 1997; Venu 
Gopala Rao et al. 2007; Calzada et al. 2012), fracture of brittle materials, such 
as carbon fibers, and of materials with limited ductility will have to be modeled 
accurately to model the cutting forces and the delamination due to cutting 
forces. While the matrix always exhibits at least some ductility, the fibers 
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fracture while undergoing a small amount of mostly elastic strain. For these 
simulations modeling the fracture of the constituents as well as the interfaces 
controls the accuracy of the results, and this is best done using a Lagrangian 
analysis with element and/or interface deletion. Interestingly, in contrast with 
the orthogonal cutting of ductile metals which is modeled assuming plane strain, 
simulation of the orthogonal cutting of brittle materials such as fiber-reinforced 
polymers is typically carried out assuming plane stress deformation. This is 
because fracture occurs during the initial deformation occurring along the 
primary deformation zone, leading to powdery chips of different geometric 
character depending on the fiber orientation angle. This powder does not interact 
with the rake face in any significant manner, and the role of friction at the rake 
face is insignificant. 2D plane stress elements are typically used to model 
alternating lamellae of fibers and matrix, and damage models are used to 
model tensile failure in the fiber, plastic damage in the matrix, and mode I 
and mode II failure along the interface. 3D simulations are required to improve 
the simulations, for instance, to correctly model the fracture of fiber bundles, 
and to model delamination between plys. Abrasive tool wear along the flank 
face limits tool life, and leads to machined surface damage, burning etc. 
Incipient simulations need to be supplanted by long duration simulations 
coupled with tool wear models to capture these effects. 


Experimental Validation of Simulations 

It would be of great benefit to develop a database of detailed experimental mea- 
surements of machining outputs for some cutting conditions that can be used by 
users to validate the accuracy of simulation results. High-speed photography and 
DIC can be used to track material through the primary shear zone and measure the 
strain and strain rate fields. 3D simulations show that the strain and strain rate fields 
observed on the side faces, under plane stress, are reliable representations of the 
deformation within the primary shear zone in the interior that occurs in plane strain. 
Dynamic force measurements can be resolved along the paths of maximum 
observed instantaneous strain rates to estimate the flow stress under which the 
deformation is occurring. While the great advantage claimed for orthogonal cutting 
that the process can be idealized as a 2D deformation process holds true for the 
strain fields, it is not true for temperature fields, due to the plane strain to plane 
stress transition that occurs near the side faces of the material being cut (Pednekar 
et al. 2004). Measurements of temperature distribution along the rake face, either 
using thermocouples (Usui et al. 1984; Li et al. 2013) or through transparent cutting 
tools (Menon and Madhavan 2014), can be easily compared to simulation results to 
assess the accuracy of the simulations. In the absence of such data, users currently 
tend to not rely upon the values of simulation results, but only the trends. Many 
users also develop heuristics, for instance, for the maximum tool temperature that 
can be permitted for a given combination of tool and work materials and still yield 
acceptable tool life. 
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Need for Simulation of Machining 

Simulation is important for understanding the response of complex systems. For 
instance, for many large-scale discrete systems with nonlinear response, delay 
terms, etc., a control systems approach with transfer functions describing the 
many connected interactions is the best way to model the system, following 
which it can be analyzed either in the time or frequency domains. Such analysis 
and understanding helped suppress the role of chatter as a cause of problems in 
machining and led to the development of high-speed machining of aluminum 
alloys. Similar improvements in productivity and quality for hard- to -machine 
alloys, composites, etc., is the main motivation behind current machining research. 

The physics of machining is dominated by the highly nonlinear, coupled 
thermomechanical, high strain continuum response of the material being cut at 
high strain rate and temperature. Additional layers of complexity arise from the 
unique conditions at the contact between the tool and the workpiece, consideration 
of microstructural effects and their evolution with strain, change in geometry due to 
tool wear, and due to the coupling of these “sub structural” effects with the dynamic 
physical and thermal response of the rest of the system that influences effects such 
as adiabatic shear banding and chatter. These complex interactions require the use 
of finite element analysis-based simulation. Simulation is an important tool that is 
being applied to studying a wide range of issues, as well documented by Arrazola 
et al. (2013). 

Once it is decided that machining simulations are required, the user has to decide 
upon what the input parameters for the simulation would be and what outputs would 
be of interest. This includes considerations such as whether the mechanical and 
thermal properties of work and tool materials under the conditions in machining 
are known, the extent of the experimental data that is available for comparison and 
refinement of simulations, whether a 2D or 3D simulation needs to be carried out, 
whether the coolant needs to be modeled, etc. Given the sophistication of specialized 
packages and the flexibility of general purpose FEA packages, it is now quite rare for 
users to develop their own capability for simulation of machining. Depending on the 
constraints in terms of available software, computational hardware, and the user’s 
capability for development and validation of simulations, most users of simulation of 
machining either choose commercially available software dedicated for machining 
analysis or use commercially available general purpose packages, with and without 
user programmed subroutines for customization. An advantage of using commercial 
software packages for simulation of machining is that they include various built-in 
material models for a range of workpiece and tool materials. This enables users to 
become productive and get results quickly, typically after some initial validation by 
comparison to experiments to gain confidence in the results. However, the proprie- 
tary nature of inputs and limitations in capabilities may cause users to adopt general 
purpose software packages. Given that no single software, analysis method, or 
approach used thus far is perfect and efficient for all problems, and given the rapid 
pace of advance in FEA software, it is often possible to improve upon prior work in 
terms of accuracy, efficiency, level of detail, etc. 
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The type of finite element simulation influences the quality and accuracy of 
different results of interest. For this reason, the choice of the type of simulation to 
be used should be appropriate to the phenomena of interest. For instance, to study 
the cutting force during the machining of brittle materials such as composites, 
where the chip-tool interface friction does not significantly influence the deforma- 
tion and fracture around the cutting edge and where thermal effects are not 
significant, just a small amount of cutting motion and incipient chip formation 
needs to be modeled. However, the heterogeneity of the material needs to be 
modeled in detail for the simulation to be useful for studying the effect of fiber 
orientation angle, depth of subsurface damage, etc. In contrast, during the machin- 
ing of ductile materials, friction along the chip-tool interface controls the shear 
angle, contact length, and forces, and so an analysis has to simulate the process for 
sufficient time to allow the tool temperature to approach steady state for it to predict 
the outputs mentioned above accurately. However, the material can typically be 
modeled as a homogeneous continuum and it is often the case that the flow stress 
within the material undergoing deformation is nearly uniform. For cutting condi- 
tions where continuous chips are produced, i.e., the deformation rate tends to a 
steady-state pattern, the most efficient type of simulation can change depending on 
whether forces or residual stresses are of interest, from an Eulerian analysis (with 
boundary nodes adapting to track the chip geometry) to an ALE analysis. While 
explicit time integration is robust in the face of nonlinearities due to changes in 
geometry, contact, and material properties, implicit integration typically provides 
results with less noise in cases where there are no convergence difficulties. In case 
the process exhibits periodic oscillations such as shear banding, highly localized 
deformations need to be captured, and the clear choice is Lagrangian analysis with 
adaptive remeshing and rezoning. With any of these techniques, it is relatively easy 
to achieve mechanical steady state (i.e., of the rate of deformation). However, it is 
impractical to simulate the process for a long enough time that the temperature 
distribution in the tool approaches steady state. Reducing the specific heat of the 
tool in regions not subject to thermal oscillations is an efficient method to rapidly 
achieve steady- state temperature distribution within the tool (Deshpande and 
Madhavan 2012). 

Tool wear, machined surface integrity, and cutting insert development are some 
of the main uses that motivate further understanding of the process, and most users 
typically strive to achieve their goals by a combined program of experiments and 
simulations (Yen et al. 2004; Zanger and Schulze 2013). Tool wear is still handled 
mostly empirically in industries because of the variety of tools and processes used 
and the inability to develop generally applicable models for it. Tool wear is 
influenced by the stresses and temperatures within the tool and the composition 
and micro structural features of the alloy being machined that slides over the tool. 
Since it is not easy to measure the stresses and temperatures within the deformation 
zones under commercial cutting conditions, development of physically based tool 
wear models is a big challenge. It requires the user to undertake highly demanding 
experiments (split-tool technique, embedded thermocouples, photoelasticity, infra- 
red thermography, etc.), for some specific and narrowly defined process conditions 
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(under which stochastic or chaotic effects as well as tool wear are minimized), and 
try to broaden the utility of the results by recourse to simulations that can be used to 
develop simplified models of the distribution of temperatures, stresses, etc., which 
can then be used to model tool wear. Very simple analytical models of the 
machining process that neglect temperature effects cannot predict even the forces 
for sharp tools correctly. Oxley’s model that includes some of the coupling between 
the strain, strain rate, and temperature fields is the best available model for 
attempting to predict the forces, stresses, temperature, and chip thickness. How- 
ever, it still assumes a sharp cutting edge, continuous chips without any curl, and is 
based on equilibrium without energy minimization. Given this, the best hope for 
developing tool wear models, currently, is to base them on models for the variation 
of stresses and temperatures that are developed based on the outputs of simulations 
of machining. Eventually, if these simulation outputs can be used to develop 
improved analytical models, then those models can also be extended to predict 
tool wear and tool life. 

The requirements of a simulation model that would be useful to predict tool wear 
include that it should be a coupled thermomechanical model that models material 
properties, contact properties, and boundary conditions accurately, models the flow 
around the cutting edge correctly, and models the temperature distribution within 
the tool at timescales of the order of the life of the tool. Furthermore, it should use 
the predicted distribution of stress, temperature, and sliding velocity on the tool 
faces as inputs for a model for the local tool wear rate for the particular tool and 
work material combination and model the changing tool geometry within an 
extended cutting simulation to track the evolution of tool wear and its effect on 
the stresses and temperatures (Yen et al. 2004; Zanger and Schulze 2013). The 
effect of cutting fluids on tool wear can be studied by extending these simulations to 
include the effect of coolants in a physically realistic manner, such as with fluid- 
structure interaction (Mohd Hadzley et al. 2013). The coolant modeling will also 
have to be validated to predict temperatures, contact length, etc., accurately, in 
order for the wear models developed to be useful. Currently, a particular tool wear 
study can be carried out much easier experimentally as compared to the simulation 
mentioned above. The potential for increased understanding that may in turn lead to 
improvements in tooling and process conditions is the main motivation behind 
machining simulations. In addition to contextual understanding of the mechanisms, 
significant additional value can be realized by encapsulating the observed interac- 
tions between the various phenomena in the PSZ, SSZ, and TSZ into a transfer 
function-based dynamic model of the system. 


Summary 

Simulation is a key research tool that can help improve our understanding of the 
mechanics and physics of machining, and lead to productivity improvements and 
cost savings. Different types of finite element analysis can be used, depending upon 
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the process, materials used, and objectives. The examples provided illustrate the 
typical range of applicability of the different types of simulations and the insights 
that can be obtained from them. Further advances can be expected to continue to 
occur, for instance, in more accurate constitutive models, newer element formula- 
tions (e.g., elements capable of representing slip discontinuities), and improved 
analysis procedures (constrained Eulerian Lagrangian, smooth particle hydrody- 
namics, discrete element method, element-free formulations, etc.). Experimental 
validation of simulations is important for routine use of simulations by industry, 
which can lead to significant improvements in machining productivity. 
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Abstract 

Virtual machining simulates NC code to discover errors, without a time con- 
suming trial run or online debugging on real machine tool. Since machining is a 
material removal process that will deform the workpiece geometry with cutting, 
the traditional rigid geometrical model could not be used to describe the 
in-process status of workpiece, which changes shape continually. The evolution 
of deformable workpiece model from the 2D sections to 3D representations 
revolutionized not only the machining industry, but also pioneered the digital 
manufacturing age with virtual manufacturing. This chapter traces back the 
history of CNC simulation, analysis of the different CNC machining models, 
tested with application examples, and lists different CNC verification industry 
applications for the last 30 years. Working towards a vision of pervasive 
modelling and simulation, a unified voxel-based in-process geometry model 
for multiple-machining and 3D printing simulations is discussed with industrial 
applications of composite material plating simulation. The virtual machine 
tool, which includes material removal animation and machine kinetic move- 
ment, can be controlled with a virtual CNC control panel and equipped with 
virtual jigs and inspection tools, such as dial indicator and wiggler, for 
immersive training of a young machinist. Towards a competitive sustainable 
manufacturing future, pervasive applications of virtual machining are not only 
technologically possible, but also make business sense, in this high material and 
energy cost world. 


Introduction 

Virtual manufacturing is a new and emerging concept to integrate different areas of 
manufacturing by using computer technology for creation and execution of virtual 
models. Virtual manufacturing is defined as a computer-based system, which 
consists of evolving models of manufacturing systems and processes, and is 
exercised to enhance one or more attributes of the real system. Manufacturing as 
a whole is a very complex system consisting of various interacting, interrelated, and 
interdependent subsystems and processes. Virtual machining, a small building 
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block in the comprehensive virtual manufacturing system invented in the 1960s, 
pioneered virtual manufacturing with material removal process visualization long 
before its coronation in the 1990s. 

Machining had been a low productivity manual operation until the invention of 
numerical control (NC) in the 1950s, when the hand wheels and levers were 
replaced by punch tapes control, similar to telegraphs at that time. These early 
servomechanisms were rapidly augmented with computers since the 1960s, the 
computer numerical control (CNC) machine tools have revolutionized the machin- 
ing process and radically changed the manufacturing industry. Complex 3D shapes 
are relatively as easy to cut as the plane face, and manual polishing works have been 
dramatically reduced. 

In 1958 MIT published its report on the economics of NC. They concluded that 
the tools were competitive with human operators, but simply moved the time from 
the machining to the creation of the tapes. NC programming became a bottle neck in 
machining. Automatically programmed tool (APT) language was developed to 
generate instructions for NC control during the late 1950s and early 1960s. It was 
widely used into the 1970s and is still a standard nowadays. Since APT was created 
before graphical user interfaces (GUI) and computer graphics (CG) were available, 
it relies on text to specify the geometry and process. Again, this is a highly skilled 
manual script writing that slowed NC machining, especially for high volume low 
mix parts, where a lot of new NC programs were needed. Computer aided 
manufacturing (CAM) was developed to quicken and automate this process, 
where the shape could be defined by computer aided design (CAD) software. The 
CAM/CAD system have proliferated CNC machine tools since the 1980s. 

An NC program has thousands of lines of tool movement instructions that may 
contain errors. Following these instructions, a CNC machine tool will move blindly, 
without any check on gauging, overcut or cutting force. It is impossible to verify 
code manually, thus virtual machining can virtually trial run NC code in a com- 
puter, to verify NC code and replace trial cut that is time consuming and dangerous. 

Virtual machining, visualization of material removal of various machining 
processes, is a geometrical modelling process that realistically simulates the setting 
up and running of an actual machining operation. First, the user specifies the stock 
from which the part will be cut, either by entering dimensions into the software or 
importing a CAD model. Then, after parsing NC code with the selected cutter, NC 
toolpath backplot can display tooltip trace against design model, highlighting 
grammar error and errant movement. NC simulation automatically simulates the 
motion of the tool removing material from the stock. The programmer can watch 
the material removal process and see details of how each cut changes the shape of 
the part, which is a deformable in-process geometrical model. This eliminates 
having to try to imagine how cuts from the current operation will affect subsequent 
operations, which will help to plan for the next operation. NC analysis will compare 
this in-process model against the target design and display the remaining stock with 
a coloured map and report. Since the NC programming error was haunting machin- 
ist from the beginning, this was also called NC verification, a short name for 
machine tool numerical control code verification. 
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Since machining is a material removal process that will deform the workpiece 
geometry with cutting, the conventional CAD geometrical model cannot be used to 
describe the in-process status of workpiece which changes shape continually. The 
evolution of deformable workpiece model from the 2D sections to 3D representations 
revolutionized not only the machining industry, but also pioneered digital manufactur- 
ing age with virtual manufacturing. Various in-process geometrical models and their 
applications are discussed in this chapter. Virtual machine tool, which includes 
material removal animation and machine kinetic movement, can be controlled with 
a virtual CNC control panel and equipped with virtual jigs and inspection tools, such 
as dial indicator and wiggler, for immersive training of a young machinist. 

Recently it has expanded from geometrical modelling into process modelling, 
including the machine dynamic and FEM cutting simulation, with in-process 
geometrical model as its foundation. In geometric modelling, cutter-workpiece 
engagements are extracted to support force prediction in process modelling. In 
process modelling, the physics of the machining process, such as cutting forces, 
torque and power, are predicted by integrating the laws of the metal cutting process. 
Based on these predictions, process parameters can be optimized for productivity. 
Methodologies in geometric modelling for cutter-workpiece engagement extrac- 
tions require a large number of calculations, however, the robustness and compu- 
tational stability of these approaches is a significant challenge, which will be 
covered in another two chapters of this handbook. 


Virtual Machining Industry Landscape 

Virtual machining has been a vivid academic research theme of virtual manufactur- 
ing since the 1990s, together with the low cost PC, internet, virtual reality (VR) and 
OpenGL. VR has been widely explored as a means of virtual machining with many 
academic research prototypes. JAVD 3D is used to distribute simulation image over 
the internet. STEP NC simulation is also a part of STEP research. As there are many 
review and survey papers on virtual machining academic research available online 
(AbdulKadir et al. 2011), this chapter focuses on industrial applications which are 
commercially available and technologically stable to engineers and machinists. 

Virtual machining is so important that the CAD/CAM developers were eager to 
develop integrated CNC simulation into their system, with great success on native 
CAD machine tool kinetics simulation, internal NC toolpath backplot can be 
displayed as different doted-solid colour lines, and partial success of a 2D 
in-process workpiece model, which is always deformable since the geometry 
changes with every cutter move. Different from CAD machine frame and toolpath, 
the 3D in-process workpiece could not be modelled with conventional CAD B-rep 
solid modeller, such as ACIS or ParaS olid. Initially CAD/CAM geometry model 
was tested to model machining process but failed, since the in-process geometry of 
workpiece is deforming but conventional CAD model is static. How to develop an 
in-process geometrical model (IPM) that could simulate the deforming workpiece 
has been a research challenge, which has attracted great academic research interests 
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since the 1980s. There is a vivid research theme of a smart machining system (SMS), 
which builds IPM into a brain of intelligent machining for CNC machine tool. 

Professor Donald Esterling pioneered NC verification, leading the way as a 
variety of OEM partners incorporated this technology into their products. 
N-See™ (Predator VIRTUAL CNC™) was the first volumetric based solid model 
NC verification program, raising the bar for speed and accuracy. While on the 
engineering faculty of George Washington University, he initiated a manufacturing 
program in the mid-1980s funded by a $2 M grant from IBM. Esterling has received 
research funding from NASA, NIST, NSF, NATO, US Air Force and the US Army 
Research Office. He has been awarded several prestigious and highly competitive 
Small Business Innovative Research (SBIR) grants. He has a successful track 
record of moving research projects from the lab to commercial applications. 

While he worked with McDonnell Douglas Corporation, Occidental Petroleum 
and MCS as a CAD/CAM consultant, Jon Prun accumulated abundant experience 
of computer software, computer graphics, mathematics and digital control technol- 
ogy. Having realized that there was a great need of digital control simulation 
technology by manufacturing, he established a company named CGTech in 1988 
to develop a suite of digital control simulation software VERICUT™. One after 
another, CGTech partnered with Unigraphics, Dassault Systems and PTC to get 
support from these renowned CAD/CAM companies. VERICUT™ was first run in 
UNIX system computers based on Sun workstation, and then upgraded to PCs and 
other workstations such as HP, IBM and DEC. Nowadays CGTech has many 
branches worldwide, and almost all the customers are running VERICUT on PCs. 

There was a time when CAD/CAM components were developed and marketed 
as an independent software module, such as ACIS, which was started as object 
oriented CAM components. Machine Works™, which is a spinoff from LightWork 
UK, pioneered the market for embedded simulation in the mid-1990s when NC 
simulation was a “nice-to-have”, by offering the first true history based solid model 
simulation, which has been patented worldwide. Machine Works ’ simulation solu- 
tion allowed CAM software producers to provide integrated simulation as a core 
functionality in their applications. Embedded simulation soon became a “must- 
have” for mainstream CAM applications. 

However, embedded NC simulation could only discover internal NC toolpath 
errors, which is in APT format. This internal NC toolpath is post processed into 
machine control data (MCD), such as G/M code. There are much more errors after 
post processing and human editing, where the independent machine control code 
verification package is still a must. 

Furthermore, the history based virtual machining will slow down along with the 
cutting history, since the method records toolpath into voxel cells so it can zoom 
without losing resolution. Recently Machine Works claimed they have removed 
history from their model and sped up the simulation greatly. 

Module Works™ from Module Works provides a complete CAD/CAM compo- 
nent solution with high performance toolpath simulation and NC verification. The 
toolpath simulation component supports milling, turning and mill/turn applications 
with full machine simulation, stock removal verification and toolpath analysis. 
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Module Works™ toolpath simulation will identify problem areas such as potential 
collisions, gouges or over travel and allow correction prior to NC code generation. 
It is viewed by many as an essential aspect of the CAM process. NC simulation 
library provides state of the art technology and is well proven and in use with many 
of the leading CAM software solutions around the world today. The simulation 
component can be quickly implemented using the easy to use API. ModuleWorks™ 
provide full kinematic machine simulation with comprehensive collision and axis 
limits checking. A full kinematic machine builder supports mills, lathes, mill/turn, 
robots and CMM machines with support for an unlimited number of axes. NC 
simulation and toolpath verification component also provides fast, high accuracy 
verification of stock removal for mill, turn and combined mill/turn applications. NC 
simulation also offers a full range of toolpath analysis tools for many critical 
aspects of toolpath behaviour such as segment length/type, feedrate and height 
allowing toolpath to be refined for optimum finish and quality. NC simulation tools 
are independent of the toolpath generation CAD/CAM components and can be used 
with any toolpath or backplot NC code. 

SIMNC from BinarySpaces produces 3D simulation of complex multi-axis 
machine tools, including collision detection, material removal and other 
non-cutting processes. The SIMNC Core API, the foundation of the entire product 
line, is built using the latest software architecture, which optimizes its use of 
memory and CPU power - allowing the product to run on-line on a machine tool 
control or off-line on a stand-alone PC. This modem architecture supports parallel 
processing that maximizes graphics performance on 64-bit and/or multi-core com- 
puters. SIMNC includes a machine tool builder to aid in defining the computer 
representation of the machine tool. The SIMNC control emulator allows the 
simulation engine to run off-line directly from the end user’s G-code part programs, 
while the SIMNC part set-up aids the end-user in defining the cutting tools and 
fixtures related to each individual CNC program. 


Kinetic Simulation of Machine Tool 

In advanced virtual machining systems, it is possible to include other elements such 
as machine frame, fixtures, clamps and tool holders. These are required for collision 
detection. In some packages like VERICUT and NCSIMUL, the entire machine 
also can be simulated to visualize the kinematics; this will be particularly very 
useful during 5-axis machining, where the collision of tool holder and fixture may 
destroy machine spindle. 


Machine Tool Model 

A machine tooling system is made of rigid parts (frame, jig, fixture and toolholder), 
wearable cutter, and deformable workpiece, which is continuously changing and 
has to be modelled with voxel based in- process model (IPM). Voxel based IPMs 
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Fig. 1 Virtual machine in X axis, XZ axis and XYZ axis 

are deformable and expensive in terms of computer memory, in line with the size of 
the workpiece. A machine tool frame, including jig and fixture, could be 10 m long 
and wide, so it is too expensive to model them in voxel model. During the machine 
simulation, only the parts interference is checked with a simple and quick collision 
check algorithm, without an expensive material removal calculation. 

Generally, the geometrical representation scheme for machine tool is the same as 
conventional CAD, so it is possible to simulate the entire machine motions with 
conventional CAD/CAM. Building the machine frame geometrical models, kinetic 
constrain and control are daunting tasks that are usually done by CAD/CAM 
developers. However, the geometrical representation scheme for material removal 
is different from conventional CAD, so it is not possible to simulate the material 
removal process with conventional CAD/CAM. The material removal process is 
simulated outside CAD/CAM with third party applications such as VERICUT. 
Furthermore, conventional CAD/CAM systems have their own interactive graphi- 
cal user interface (GUI) for quick creating and editing of a geometrical design 
model and drawing. It is difficult to customize this GUI to manually operate a 
virtual CNC machine tool, which functions more like a computer game. 

The simulation of material removal on the workpiece and simulation of machine 
kinematics can be done in different sessions (CAD/CAM for machine motion and 
third party applications for material removal) or simultaneously in a single third 
party application session, which is more realistic and good for machinist training. 

Since machine tool frame and jigs-fixtures are rigid static geometry, they could 
be modelled with conventional CAD and exported as triangular polyhedral mesh by 
stereo lithography (SLA) format with file extension of STL, which is quite similar 
to VRML format. Figure 1 shows the frame model built up of a 3 axis milling 
machine, which starts with X axis, Z axis and ends with Y axis. 

A comprehensive machine tool system also includes the opaque machine cover, 
transparent glass windows, movable doors, operateable handle/lever/pedal, mea- 
surement indicators, and virtual CNC control panel, which is critical for operational 
training. Figures 2 and 3 depict two models of virtual machine tools: CNC mill and 
CNC lathe, that are manually CNC operateable as a real machine. More than a real 
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Fig. 3 Virtual CNC lathe 
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Fig. 4 Reverse post processing flowchart 

machine tool, the virtual machine tool can be zoomed from different angles in 
multiple viewports highlighted with translucent colours for enhanced learning 
effects that are more like an educational computer game. 

Conventional CAD/CAM system is suitable for machine tool simulation with 
internal NC toolpath, which is cutter location (CL) data that is generated inside the 
system, usually in APT format that is an industry standard. However, the post 
processing converts APT into machine control data (MCD) with geometry trans- 
formation and CNC controller specification, a lot of things could go wrong at this 
stage. For example, table plus table and table plus head 5 -axis configurations are 
totally different. Even with the same configuration, FANUC and Siemens need 
different codes for compensation. So there is a need to simulate MCD code such as 
GE FANUC G/M code, Heidenhain G/M code or Heidenhain conversational code, 
so the machine tool will move according to real situation. Reverse post processing 
will translate MCD back into APT format. 


Reverse Post-processing MCD back to APT Toolpath 

The reverse post-processor reads, analyses NC program and translates it to internal 
NC toolpath in APT format, as described in flowchart Fig. 4. It supports structured 
programs, variables, cycles and macro calls for a wide choice of commercially 
available NC controllers. 

Analysis and parsing MCD data, such as G/M code, especially for a manually 
written program, is critical to avoid time-consuming debugging on machine tool 
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control panel, when the machine is running but without production. For most CAM 
generated G/M code, the grammar error is not a concern anymore, so it is possible 
to bypass grammar check and achieve faster NC toolpath backplot. However, 
parsing is a critical step for CNC training, where a new trainee may write strange 
code and try to run it on machine tool. 

CNC control panel is equipped with a keypad for text input. Virtual control panel 
must parse this manual input and simulate the action with warning signals. The 
CNC controller is a fully functional high level computer language interpreter, so is 
the virtual CNC controller. There are variables, formulas, subprogram and mathe- 
matic functions within a machine control code script. 

Automatically programmed tool (APT) was a high-level computer programming 
language used to generate instructions (MCD) for NC machine tool before CAD/CAM 
revolution. Now CAM software replaced APT for toolpath generation but still kept it 
as internal CL data format to express internal NC toolpath in ASCII text. Most CAM 
systems can save the internal NC toolpath as APT format and use a third party post 
processer (such as ICAM) to generate the machine tool specific MCD, such as 
FANUC 16M G/M code. Since APT is the internal NC toolpath format there is no 
reverse processing step in reading APT text data file into the internal NC toolpath. 

Most virtual machining systems accept cutter path in the form of NC code specific 
to the CNC machine or in the form of a generic format APT. If the input is NC code 
specific to a CNC machine tool, one would additionally require a machine tool data 
file that provides its process, kinematic and syntactic details and cutter data file. An 
NC reverse processor synthesises them to generate the corresponding CL file. Note 
that all the other algorithms further use only APT CL data file. 

This process requires that the reverse post processer is properly configured for 
the particular syntax of the NC language and the particulars of the CNC control. 
These configurations are machine specific. For example, the Fanuc 10A file con- 
tains a complete definition of the syntax and conventions used with this control. 
With multi-axis machines this process also requires that the machine definition is 
properly configured for the particular style of multi-axis machine. 

The reverse post-processor configuration file defines the relationship between G 
& M codes in the NC program and the associated functionality. This is the same 
process that takes place in the control itself where each NC code is interpreted 
before taking effect. G & M codes are identified through fixed patterns. The 
reverse-post will identify patterns by comparing NC program contents with the 
reverse post’s pattern definitions. When a pattern is identified, a specific function- 
ality is associated to it, and then the actual output tool motion or NC simulation is 
generated and expressed with APT format. 


Toolpath Backplot 

APT and MCD are text based high computer language scripts that are difficult to 
visualize against the part geometry, therefore, the obvious first idea was to couple 
the NC to a plotter that would trace on paper the trajectory of the cutting tool. 
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Fig. 5 Toolpath data structure 


The drawing would immediately reveal an eventual mistake. Nowadays the screen 
replaced the plotter but this NC toolpath preview function is still called back- 
plotting, the oldest and most popular NC code verification. Only after the simula- 
tion shows the program to be devoid of gross mistakes, the real machine can be 
used. Toolpath backplot follow the cutter tip movement and display as doted 
(GO fast move) or solid (G1 cutting) lines with different colours, which could be 
used to different operations. Good backplot functions could highlight current 
position, operation and cutter information. Cutter move animation is also a vivid 
simulation of cutter movement. Modern backplot toolkit works together with NC 
code text editor, so the NC code editing is visualized instantly, and doubles as an 
NC code learning tool for students. 

The traces of tooltip are imagery lines in space but are useful in visualizing tool 
movement, so these traces are also modelled as toolpath, even though in reality 
there are no such lines. The cutter and tool holder can move forward and backward 
with tip on the toolpath, this vivid animation can be used for NC programming and 
visual gauging check. 

The classic double link list data structure is used for the toolpath model that is 
modelled in Fig. 5. The double pointer enables the cutter to move forward or 
backward without looping through every node. To delete and add one node is 
easy compared with an array data structure. This data structure also works for 
extended Z buffer and extended Z map, where both need to delete and modify 
elements in real time. 
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Material Removal Simulation of Cutting 

UNIX based CAD Graphics Workstation introduced a separate graphics card and 
low cost large CRT display, where the depth and colour buffers are used to store 
screen pixels. The painter’s algorithm is used to draw animation pictures on screen. 
Virtual machining pioneers extended this depth buffer with multi value so an 
animated cutter can move through a screen and paint a “negative” trace, or erase 
something, just like cutting the stock from workpiece. This trick is also called 
extended Z buffer method that is view angle dependent. The extended Z buffer 
method runs very fast since it is directly updating graphic memory without view 
transformation which is a time consuming algorithm inside the CPU. The extended 
Z buffer picture is pixel perfect with vivid details, since every pixel is refreshed 
with a cutter colour. Some virtual machining systems are still animating material 
removal with this technique, with another more precise in-process model as data- 
base. However, directly rendering in-process model is a more popular approach, 
where the workpiece can be rotated and zoomed during cutting animation. 

This method projects the workpiece and cutter onto the display screen and gets 
an array of link lists of an element that has a near Z value and a far value, a stick 
with the size of pixel. The array correlates to the size of display window. The cutter 
image also has near and far Z values that will be used to compare with the 
workpiece image. The cutter moves through this array of sticks and cuts through 
them in Fig. 6. 

If the cutter and the element never touch, there is no change in the extended Z 
buffer in Fig. 7. The frame buffer is updated with this background image; this is to 
erase the previous cutter image at the previous position. The cutter image is painted 
on the screen with current position to show a cutter movement. Exactly a movie 
trick, the user sees a smooth move of cutter along the toolpath. However, the 
developer usually only updates a small area around the previous cutter position to 
speed up display speed. Some even used Boolean operation for colour buffer. These 
algorithms work for an old time computer that is slow and expensive. How to show 
a smooth cutter movement has been a research theme for a long time. 

If the cutter is cutting an inside element that is hidden from the user as shown in 
Fig. 8, the extended Z buffer modifies the inside element with the cutter colour. The 
cutter image is partially blocked and trimmed by the outside element so the user can 
see the cutter plunge into the material. 

If the cutter is cutting on the near end of an outside element, the user can see the 
material removal process. If the cutter is cutting on the far end of an outside 
element, the user can see the cutter plunging into the workpiece. If the cutter is 
cutting both the near and far end of the element as in Fig. 9, this element will be 
removed and the user can see the previous hidden inside element, which becomes 
the outside element at this moment. 

The swept volume of the cutter is critical for material removal animation, where 
the swept volume is subtracted from workpiece model continuously. This is a 
famous computational geometry quest so there are many academic research 
works on how to generate this swept volume for 2 axis, 3 axis and 5 axis milling, 


31 Virtual Machining 


1197 



Fig. 6 Extended Z buffer cutting animation 


Fig. 7 Air cutting cutter 
movement animation 



where the cutter position moves and cutter angle swings. For example, an end mill 
linear move in the XY plane can sweep through one box plus a half cylinder, a 
Boolean operation between this swept volume and workpiece model subtracts this 
swept volume from workpiece which deforms the workpiece model, which will 
change shape with cutting, so it is a deformable geometry model. 
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Fig. 8 Cutting inside 
element 



Fig. 9 Remove outside 
element and expose inside 
element 



The extended Z buffer algorithm is good for animation but it is view angle 
dependent. Once a simulation starts, the view angle and zoom factor cannot be 
changed, otherwise the simulation has to start over from the beginning. The 
extended Z buffer in Fig. 10 can be saved as a geometry model and measured 
against the design model. However, it is precise only in the view direction, so it is 
necessary to simulate in different angles for more reliable results. 


Workpiece In-process Model 

The stock, the machining allowance, is the material to be machined. The workpiece 
is the target machined part with stock material on surface, which may be a forged or 
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Fig. 10 Extended Z buffer 
in-process model 



cast part with the machining allowance. The initial workpiece is a block produced 
from forging, casting or rolling process. The geometry of the workpiece will change 
after each machining operation. This evolving geometry of the workpiece is defined 
as an intermediate or in-process model. 

The deformable in-process model (IPM) represents the state of the workpiece at 
each step in the machining process. It is a 3D geometrical construct that reflects the 
results of the machining operations. This model allows the user to visually verify 
that the machining operations have been defined accurately and that their sequence 
is correct. It can be automatically re-generated when there are changes in the 
product design, machining parameters or sequence of the operations. 

The in-process model is a must for the next step cutting plan. The in-process 
model is only a conceptual model for most of the commercial CAM systems, there 
is no in-process model that can be output and stored in a database. In a traditional 
NC programming environment, a significant amount of time is spent trying to 
visualize the in-process model through various process stages. The in-process or 
evolving model is used in subsequent setups and provides immediate feedback on 
the progress being made. Being able to view in-process geometry, while creating 
toolpath and process plan, greatly reduces the chances of error in both setup and 
machining. It also helps in designing fixtures, positioning clamps and so on. 


Host CAD B-rep In-process Model 

The B-rep is a typical CAD geometry model. The shape of a part is represented in a 
point-edge-face schema. The previous studies showed that NC cutting result could 
not be modelled in B-rep because of the complexity of the cutting model. The first 
choice of IPM should naturally be the geometry model B-rep used in commercial 
CAD system. The benefits of using the same geometry model for CAD as the IPM 
are obvious. The CAD geometry model is matured and available through CAD 
development kit, so there is little need to develop a new geometry model kernel. 
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Fig. 11 2.5D in process models 


Sharing a common geometry model with CAD, the IPM facilitates seamless 
integration of CAD-CAPP-CAM. 

An automatic forging design and manufacture system was developed by the 
authors in 1986, in which pre-form forging IPMs were the same as the CAD system 
CV/MUDUSA running on VAX-11/750 computer (Liu et al. 1992; Jerard 
et al. 1989; Stifter 1995). However, the creation of pre-form forging IPMs took 
days of calculation and often failed due to Boolean operation failure. 

With a great deal of research efforts in the last two decades, the B-rep geometry 
model has been improved significantly in terms of Boolean operation stability, but 
the B-rep based IPMs are still limited to 2.5-axis milling (Fig. 11). Park reported a 
prismatic IPM generation method that employed a polygon extrusion algorithm to 
sweep a ball-nose cutter (Park et al. 2003). 


Host CAD Section In-process Model 

Since the integrated B-rep IPMs cannot be created inside a CAD geometry model, a 
new, ad-hoc cross-section-wire-frame based approach was proposed in a forging 
die CAD/CAM system (Liu et al. 1991). The aim was to use a series of paralleled 
cross-section drawings to represent 3D shapes. Figure 12 depicts the cross section 
representation of a link rod 3D shape. 
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Fig. 12 Cross sections of a link rod head 3D shape 


The cross section IPM is widely used in many commercial CAD/CAM systems. 
I-DEAS from SDRC uses water level cross-section as an IPM for generative 
machining. In a traditional NC programming environment, a significant amount 
of time is spent trying to visualize the in-process stock as it goes through various 
process stages. With I-DEAS, the wireframe section in-process stock model can be 
created for downstream applications such as toolpath generation, process planning, 
fixture designing and clamp positioning. 

A part can be sectioned along the Z, X and Y axis that is shown in Fig. 13. The 
Z-axis section is usually called water level section. For 3 -axis milling, the water 
level section could have many loops, causing complications in the set operation 
between sections. X and Y sections are single half loops and the Z value is unique 
for every point, thus simplifying the set operation considerably. 

A working system for using IPM in pre-forging design is described in (Liu 
et al. 1992). A drawing sheet with part sections is first created using the BACIS 
command language of CV/MEDUSA CAD system. Since there are many sections in 
a drawing sheet, each section of wire-frame is assigned to a different layer according 
to its Y distance, and a certain number of sections can be looped through layers. Then 
each cutter section is moved to its cutter location and compared with the part sections. 
The overlap between the cutter section and the part section will be removed from the 
part section. A real milling IPM is obtained from the collection of the result sections. 

The display of sections is provided by line segments and can be confusing when 
there are too many lines, i.e. there is a need to render the IPM as a realistic 3D 
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Fig. 13 Section 
representation 



Fig. 14 Regulated section 
representation 



image. In order to calculate the surface normal required for rendering, the section 
wire frame is divided along the X direction by the same step as that for Y direction. 
A so-called regulated section is formed to facilitate the calculation of surface 
normal and interpolation of points between the sections. A given node in one 
section is linked to a node in the next section. A node’s normal can be calculated 
from the four neighbouring nodes. Figure 14 shows the regulated section represen- 
tation, which in fact is called Z map. 
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Z Map Based In-process Geometrical Model 

The regulated section can also be used to accelerate set operation between cutter 
section and part section. Calculation of intersections and trimming between two 
sections are time consuming and the re-ordering of the line segments requires more 
computing time. This can be improved with the regulated sections, where the line 
segments are indexed by both cutter section and part section. Only the line segments 
with the same index are compared and trimmed, there is no need to trim two line 
segments. If all the line segments fall on the regulated nodes, there is no need to 
trim two line segments. The set operation can be simplified to the comparison of 
two Z values, which is very fast and stable. Hence, the Z map representation of IPM 
emerges (Jerard et al. 1989). 


Classic Z Map 

If all the section line segments fall on the nodes, the object surface can be 
represented by the Z values of the nodes. A map of Z values represents the object 
geometry. In computer language terms, the Z map can be expressed as a 
two-dimension array Z[i, j], where i represents the index in X direction and j 
represents the index in Y direction. The XY position of the Z map can be calculated 
by i or j times grid size. 

The best analogy for a Z map is a needle bed, where needles are uniformly 
distributed over the XY plane of Fig. 15. The tip of every needle touches the object 
surface that it represents. A milling simulation can be seen as the tool cutting 
through the needle bed. These needles can be described in mathematical terms as 
Z-axis aligned vectors, passing through grid points on the XY plane. A Z map 
representation can be used effectively for surfaces that are visible looking “down- 
wards” on the XY plane. Since 3-axis milling parts are composed of surfaces visible 
from the Z direction, they can be expressed effectively by the Z map representation. 
With a Z map representation, the machining process can be simulated by cutting the 
Z map vectors with the cutter. 

Figure 16 shows an example of a 3 -axis milling simulation system that was 
developed by the first author in 1990. The system used DOS extender for Z map 
and SVGA for Z map rendering. The GUI and NC toolpath wireframe display was 
coded with high C graphics library. The GUI and mouse control developments were a 
very hard job and this was not resolved until the arrival of Windows 95 and OpenGL. 

The vectors in a Z map have direction and length and are infinitely thin without 
volume. The top of each vector, where the Z map and object meet, is just a point 
having no shape. Only at this point the Z map and the object meet with each other. 
Z map models cannot provide accurate object geometry outside these points. There 
are many ways of interpolating the geometry between grid points in order to render 
a Z map model, for example, forming a triangle from three neighbouring Z values. 

It is obvious that the XY resolution of the Z map grid determines the precision of 
a Z map model. A finer grid has greater precision but requires increased memory. 
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Fig. 15 Needle bed sample 
of classic Z map model 




Fig. 16 Z map milling simulation 


For a part of 1 m * 1 m, the size of the Z map is 1 ,000 x 1 ,000 if the precision is 
1 mm, but it increases to 2,000 x 2,000 if the precision is 0.5 mm. Reducing the 
model size and achieving suitable precision becomes a critical issue in a Z map. 

One of the solutions is to balance Z precision and XY precision. An integer array 
is used to replace the more common floating array of a Z map, which reduces the 
Z map size by half. At the same time, this improves the Boolean operation speed 
because the comparison of integers is much faster than the comparison of floats. 
The memory requirement of a Z map is halved again by compressing the Z map file 
section by section, similar to image compression. 
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Fig. 17 Classic Z map model 
with vertical walls 



Because of the simplicity of its data structure and fast computation time, the Z map 
model is used by most commercial CAM software (Jerard et al. 1989; Stifter 1995; 
Maenga et al. 2003). However, a Z map cannot approximate vertical wall very well 
since it always has a slope as shown in Fig. 17. This is not a problem for forging die 
design since there are always draft angles in forging parts, but it is a serious problem 
for milling parts since profiling nearly always creates vertical walls. 


Extended Z Map 

Since the precision of the Z map is determined mainly by XY resolution along the 
vertical walls, increasing the resolution along these walls while reducing memory is 
a key issue. Fortuitously, one important feature of 3 -axis milling can be leveraged. 
Viewing from the top, the vertical walls only cover a small percentage of the 
Z direction projection, so it should be possible to use finer resolution along the 
vertical walls while maintaining a rough resolution in the planar area. This was 
the initial idea for an extended Z map; at least one grid on a Z map is segregated into 
sub-cells. Only grids corresponding to intricate features on the surface of an object 
are assigned sub-cells to improve the representation of object features. Figure 18a 
illustrates the plan view of the Z map grid with sub-cells 52 the front sectional view, 
while Fig. 18b shows the sectional view. 

The size of the grid can be reduced through using sub-cells, but the precision of 
the XY dimension is still limited by the size of sub-cells. For a sub-cell of 0.1 mm, 
the best precision is 0.1 mm in XY plane. There is a need to represent XY 
dimensions precisely. Instead of using vectors in the sub-cells, the sticks in the 
sub-cells that have volumes and surface geometry are used. A B-rep surface model 
can be represented precisely using a map of B-rep sticks in Fig. 19. 
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Fig. 18 Extend Z map with 
sub-cells along vertical walls 








Fig. 19 Stick method 


Milling simulation with stick method involves Boolean operation between cutter 
and stick. Figure 20 shows different stick shapes after cutting. The experiments 
with B-rep stick model are very slow and a huge B-rep model is created. To 
simplify stick and Boolean operation, a polygon is used instead of real surface in 
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Fig. 20 Different shapes of stick elements after cutting 


a stick cell. The data structure of a polygon is much simpler than that of a B-rep 
which needs a group of complicated pointers to maintain a double wing data 
structure. 

The real world objects are not always uniform in the XY plane and can be any 
shape. Nodes are used to enhance sub-cell precision in object face representations. 
For example, one edge of the sub-cell may have two overlapping nodes to represent 
a vertical face. The nodes of a sub-cell may not be uniformly distributed over XY 
plane. Figure 21 depicts an exploded plan view of a portion of the Z map grid with 
nodes 54 and illustrates how stick method represents a circular hole and vertical 
walls. 

Z map has height value that is only suitable for 3-axis machining, where 
everything can be viewed from the top. Machine components usually need six 
sides machining, either with a rotate table or 5-axis control, which may result in 
hollow portions in some areas, which cannot be seen from the top. If one ray is 
tracing through the hollowed object, there may be more than two intersections. 
Instead of one height value, Z map could be extended to multiple values as in 
Fig. 22. Extended Z buffer is a special multiple value Z map aligning with screen 
orientation. 

Figure 23 shows shop floor examples of extended Z map IPM based NC 
simulation and verification that was developed in Singapore Institute of 
Manufacturing Technology (Liu 2005) and implemented in precision engineering 
industry for a decade. The detailed description of the extended Z map IPM can be 
found in two patents (Liu et al. 2002). 
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Fig. 21 Sticks to approximate vertical wall 




Fig. 22 Extended Z map to 
multiple values 



Voxel Based In-process Geometrical Model 

Over the last three decades academic research explored many variations of deform- 
able volumetric model, such as discrete vector, graft tree, octree or hierarchical 
space decomposition and ray tracing method. These inspiring research works 
contributed to the main stream volumetric in-process model study, which starts 
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Fig. 23 Extended Z map based milling simulation 


with extended Z buffer material removal animation, enriched with Z map stick, and 
ends with extended voxel model. The virtual machining industry learn, enhance, 
and merge these techniques into their hybrid in-process models, which may use 
extended Z buffer for material removal animation, stick for 3 axis mill, voxel for 
3-5 axis mill or turn-mill and swept volume for optimization. However, virtual 
machining industries seldom publish their internal data structures and algorithms 
except for a few patents, only disclosing certain techniques that could be easily 
identified by the export data and user interface. 


Octree Hierarchical Space Decomposition 

Instead of representing the blank as a collection of sticks in 3 axis NC simulation, it is 
possible to represent it as a collection of cubes or spheres or any such cell of the same 
size. This is called uniform space decomposition (USD). However, this is a very 
expensive way of representing solids so it is limited to medical imaging application. 
Assuming that a bit is required to denote a cell, for representing a workpiece of size 
1 m with a resolution of 1 pm, more than 1,000 x 1,000 x 1,000 GB is required. 
Obviously this is not practicable. Therefore, methods to represent an object as a 
collection of cells of varying sizes were developed, such as hierarchical space 
decomposition (HSD) or octree representations in Fig. 24. 
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Fig. 24 Octree hierarchical space decomposition 


An octree is a tree data structure in which each internal node has exactly eight 
children. Octree is a HSD representation in which an object is represented by a set 
of bigger cubes with subdivisions of eight smaller cubes. This reduces the memory 
requirement considerably. Each cube is one-eighth of its parent cube in size and is 
called an octant. All the octants can be visualized as the nodes of a tree in which 
every node has eight branches. An octant can be completely inside or outside the 
solid, when there is no need to further divide they become leaf nodes. Only 
boundary octants are further subdivided into eight octants. This subdivision con- 
tinues till the size of the sub octant equals the required resolution. The total number 
of octants to be stored in an octree is much less than that of USD representation, 
because the boundary octants take part in the subdivision. In practice, the 
non-boundary octants memory can be reduced with a compression algorithm. It 
was found that in the case of an octree the number of octants needed is nearly 
proportional to the surface area of the object. All octree computations are based on 
integer arithmetic, which means that the analysis algorithms are fast. Octree 
algorithms are readily parallel-processes by definition. Memory required by octree 
representation is independent of the number of primitives and operations. For a 
given resolution, memory required depends only on the surface area of the object. 
Boolean operations and rendering display in isometric view are trivially simple 
since these operations require only tree traversal with simple exchange of terms. 
The user is free to choose any desired accuracy (at the cost of speed and memory). 
Coarse modelling is a facility unique to HSD. A coarse model of a solid can be 
produced and processed quickly to get an order of magnitude estimate of the results. 
If these are found favourable, a more accurate refined model can be produced. 

However, Octree is an approximate representation and memory requirement 
increases exponentially with increase in resolution. Instead of using subdivisions 
of the boundary octants, many researchers proposed new ways to precisely describe 
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Fig. 25 Cubic stick as one 
level of voxel model 



the boundary surface geometry. The boundary octant is renamed cell since there is 
no sub octant anymore. 

Graft tree added two extra nodes on each edge so a few triangles could be formed 
to approximate any polygon mesh. If a mesh node falls inside this cell, an extra 
node will be recorded. A surface is subdivided by cells into small pieces and 
recorded into the cell. 

One inspiring invention is the so called machining history based method. Instead 
of recording surface into octant, this method records the neighbouring CNC 
toolpath and the linked cutter into the cell. The neighbouring toolpath is the piece 
of toolpath that most likely will cut into the cell. Any zoom or rotate of the 
workpiece will trigger a re-calculation of cell geometry and generate a more 
detailed extended Z buffer image on screen. This is good for small NC programs. 
However, the history of machining grows with the NC code, which could be 
millions of lines of text. 


Voxel as Multiple Layers of Cubic Stick 

The term voxel represents a volume element in space decomposition geometrical 
model schema, just like the term pixel denotes a picture element in raster graphics. 
Extended Z map with stick method could be considered as a simplified and 
extended one layer voxel model as in Fig. 25. 

Figure 26 depicts an example of the voxel model, which could be considered as a 
many unit height stick element stacking together and the memory requirements are 
enormous. There is a need to store the voxel array in compressed form and use 
algorithms that will operate directly on the compressed data, especially when the 
material is homogenous, where internal voxel could be represented by boundary 
voxel extension. 

It is possible to convert the voxel array into some other more compact repre- 
sentation and reconvert them into voxel when required. Voxelization is the process 
of converting a 3D object into a voxel model. Figure 27 shows a test voxelization 
example. 

A voxel-based system should be able to update the display at interactive rates. 
Current graphics rendering systems cannot provide a level of rendering perfor- 
mance on voxel models that is comparable to their polygon-rendering performance. 
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Fig. 26 Voxel method 
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Fig. 27 Experiment voxelization 


Parallel algorithms and hardware support for volume rendering are the focus of 
current research efforts. Only boundary voxel is rendered by a patented colour list, 
which effectively avoids expensive ray-casting of huge internal voxels. The ren- 
dering of a voxel model is easily achieved by rendering a points cloud. However, 
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Fig. 28 Rendering model with voxel display 

internal voxel display is not possible with this method and needs more study. 
Figure 28 shows the rendering of voxel model with voxel display. 

Further analysing the voxel model, it is believed that the voxel-based volume 
modelling is a very promising approach to the unified IPM for multiple machining 
and layered manufacturing simulations. As a natural clone of the layer manufacturing 
3D printing technology (Chandru et al. 1995), the voxel model of an object and the 
object fabricated using a 3D printing closely resemble each other since both are made 
of layers of small cells. Furthermore, voxel based models permit the designer to 
analyse the 3D printing object and modify it at the voxel level leading to the design of 
custom composites of arbitrary topology. In this paper a simplified voxel-based IPM 
is proposed to unite the new 3D printing and traditional machining simulation. 

The voxel representation also simplifies the computation of regularized Boolean 
set operations and of material removal volumes. By using the material removal rate 
measured by the number of removed voxels, the feedrate can be adjusted adaptively 
to increase machining productivity. 


Unified In-process Model of Multi Machining and 3D Printing 

During the novel combined 3D printing and multi machining, such as shape 
deposition manufacturing, a 3D printing part needs to be inserted with an electronic 
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Fig. 29 Framework of unified IPM for multi machining and 3D printing 


device and milled to a certain shape. The unified 3D printing -machining 
simulation displays the machining process in which the initial 3D printing 
generated workpiece is incrementally converted into the finished part. The voxel 
representation is used to model efficiently the state of the IPM, which is generated 
by successively subtracting tool swept volumes from the workpiece (Donggo 
et al. 2000). 

Figure 29 illustrates the framework of the unified voxel-based IPM for 3D 
printing and multi machining. The voxel based 3D printing simulation can be 
achieved by the voxelization of the road shapes, which are similar to a pipe along 
the 3D printing toolpath. Boolean addition between the road shape voxel and the 
base voxel is fast and stable, independent of the model shape, which is a critical 
issue with B-rep. One layer of road shapes would make a B-rep based solid 
modeller very slow, since B-rep Boolean operation is dependent on model shape. 

Furthermore, proposed unified IPM is a natural voxel mesh model (Nakashima 
et al. 2002) for so called image based CAE analysis and this further unified CAD, 
CAM and CAE. 

Current 3D CAD involves only shape data, which consequently poses certain 
difficulties in process modelling and simulations aimed at predicting the perfor- 
mance of final products. Kase introduced voxel CAD, which stores physical 
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Fig. 30 VERICUT composite simulation 


attributes together with 3D shape data (Kase et al. 2003). Voxel CAD allows the 
sharing of data by different simulations and flexible manufacturing methods. 

There are other approaches on unified model of manufacturing processes but 
none of them could achieve the uniformity that voxel model could offer. Voxel 
model could be used in NC toolpath generation-simulation-optimization, shape 
design optimization, forming process simulation, and many other manufacturing 
applications. This will result in a unified volumetric geometry model for all design 
and manufacturing processes that would erase the data exchange barrier and CAE 
re-meshing problem. 

CGTech started from NC verification software and then to NC optimization and 
simulation software. During the first 15 years, CGTech has concentrated on remov- 
ing material, and recently it started working on adding material. Since most 
aircrafts now need carbon fibre, the Boeing 787 program asked CGTech to develop 
the manufacturing and simulation software of composite. So now after 10 years, 
composite manufacturing and simulation is one of its core businesses. For the fibre 
composite, VERICUT can not only simulate, but also can do the fibre placement 
program. Therefore, composite manufacturing and simulation has become a new 
growth for virtual machining. 

VERICUT composite simulation (Fig. 30) reads CAD models and NC programs, 
either from VCP or other composite layup path-generation applications, and sim- 
ulates the sequence of NC programs on a virtual machine. Material is applied to the 
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layup form via NC program instructions in a virtual CNC simulation environment. 
The simulated material applied to the form can be measured and inspected 
to ensure the NC program follows manufacturing standards and requirements. 
A report showing simulation results and statistical information can be automatically 
created. 

A Virtual Machining System Example 

Practice is the best way to learn. QuickCNC from Singapore Institute of 
Manufacturing Technology is taken as an example to demonstrate the functionality 
of virtual machining and its process flow. The system has been successfully applied 
in industry and training schools for many years to promote virtual machining 
technology. 


Virtual Machining Process Flow 

The graphic user interface (GUI) of QuickCNC is depicted in Fig. 31. The multiple 
windows can be viewed from different angles, zoom factors and detail levels. For 
example, total toolpath and current toolpath can be separately displayed without 
workpiece or against design part, with cutter or holder. The view details are easily 
controlled with NC toolpath toolbar buttons and hot keys. 

The right dialog bar controls toolpath and simulation. The top slide bar interac- 
tively controls simulation speed in run time, the user can slow down cutting 
animation to watch a certain operation or get a result without animation. The cutter 
can move along the toolpath with NC toolpath dialog bar. The current operation NC 
file name and location, cycle time, lowest Z value, cutter information are updated 
instantly. The progress bar on top of this dialog bar will show the percentage of 
completion and warns user with orange colour and highlights errors with red colour. 
The error log will show the kind of error, either too deep, too much, rapid move 
collision, holder gauging or overcut, with statistic number. The user can search for 
error block without reading through the G code text file, which could be millions of 
lines long. The user can also move cutter to any node of the toolpath and get current 
NC block position, G code, feed and speed, compensation, etc. instantly. The user 
also can display only current Z level toolpath and move up/down for water level 
high speed cutting. 

The dialog bar on the left can analyse workpiece against design model with 
colour map and cross section, which are dynamically sliding along XYZ axis with 
two slide bar control. Colour map range can be modified with instant remaining 
stock display. The user can pick any point on the stock and know which operation, 
which cutter and which block of the NC code cut the location. 

Pan and rotation of the view follow Windows convention of right mouse button 
and left mouse button. Dynamic zoom uses centre wheel function. There are four 
fixed view angle buttons for quick action. 
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Fig. 31 QuickCNC GUI 

The virtual machining process flow is depicted in Fig. 27 and summarized as 
below: 

1 . The raw material model is created based on the design part model and selected 
stock. 

2. The tool path model is created based on machine control code and selected 
cutting tools (cutters). Tool list can be automatically extracted from APT cutter 
data, or G/M code, where the comment line could be customized to contain 
cutter information. 

3. Quick display toolpath for identification of geometry errors. 

4. Quick simulation or slow cutting animation. 

5. After simulation, the workpiece model can be saved and refined with amazing 
detail. The saved in-process workpiece model can be reopened as the raw 
material for the next operation. 

The details of the process flow in Fig. 32 and related working principles are 
described in the following subsections. 


Automatic Creating Raw Material Model 

A raw material model can be interactively defined as a box or cylinder as in Fig. 33 
or generated from casting or forging model as in Fig. 34 or design a part model as 
in Fig. 35. 
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Fig. 32 Virtual machining process flow 


In Fig. 33, the origin of work coordinate system (WCS) has to be selected, 
usually at the top centre or corner, since it is easy to measure with a touch probe. 
A model resolution - the size of voxel cell, has to be specified as well. The system 
usually gives a default value in line with the size of the part, mostly between 1 mm 
and 0.1 mm. However, this is not the simulation precision, which is usually less 
than 1 pm. 

Machining a part from a block could be a great waste if the part and the raw 
material differ a lot, sometimes half of the raw material has to be machined and 
becomes waste. In order to reduce waste and achieve faster production with near net 
shape machining, the raw material could be forged or cast into the final part shape 
with a few millimetres of machining allowance. 

The forging and casting parts, designed with conventional CAD tools, can be 
exported to the virtual machining system through a stereo lithography (STL) file 
which is a triangle polygon mesh in text or binary format. 

As shown in Fig. 34, a raw material model is generated from a casting or forging 
model, where the geometry may be a complex surface. The origin of work coordi- 
nate system (WCS) usually follows the part origin. 

The so called design part is the target geometry of machining, where the raw 
material stock model is the original shape of workpiece. The box envelop of design 
part can be automatically extracted to generate a raw material block for quick CNC 
simulation. However even in shop floor practice it is difficult to get an exact block 
of the design part. 
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Fig. 33 Define a raw 
material stock 




Automatic Load NC File and Reverse Post into Toolpath 

Machine control data (MCD), such as G/M code, are reverse posted into internal 
NC toolpath, usually in APT CL data format. The APT CL data file of commercial 
CAM system can be directly read in without the reverse processing step. The 
grammar errors, such as missing key words, could be discovered at this stage. 
Some geometrical errors, such as centre of the circle not-aligned or two blocks 
overlapping could be highlighted with colours in toolpath display. 

The APT CL data file contains cutter definition, such as cutter diameter/radius/ 
length/angle, so the cutter information will be automatically loaded without 
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human selection. However, there is no official cutter definition in G/M NC file, 
manual cutter selection is a boring task and introduces another possible 
human error in NC verification. QuickCNC automated the cutter selection by 
three steps: 

1. QuickCNC builds a cutter table with company specified cutter names. 

2. Customize commercial CAM system to export cutter definition on top of NC file 
inside comment lines: (HTC50R4.5 process R). 

3. QuickCNC parse comment lines to search for cutter definition. 

A well defined cutter table can standardize the tool room operation and man- 
agement. Holders and special cutters can be defined as well. Figures 36 and 37, 
respectively, illustrate the cutter definition and tool list, while Fig. 38 depicts 
automatic cutter search using cutter name. A new cutter will be created in the 
cutter table if there is no cutter match. 

The subprogram can be automatically loaded from the main program in Fig. 39. 
QuickCNC will look into the subprogram files and find the linked program number 
at the head of the file. 

Three steps for fully automatic simulation are to automatically create raw 
material, open all files in the same folder, search for cutter and subprogram. Now 
a machinist can complete a quick simulation with just three mouse clicks to load 
STL part file to create raw stock, reverse post processing a whole folder of G code 
files into toolpath and quick simulates machining operation and automatic verify 
NC program. 
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Fig. 39 Automatic search for sub program 


Quick Toolpath Display 

Workpiece, cutter and toolpath can be interactively viewed with zoom and rotate. 
The cutter moves along toolpath with a current position in the current operation that 
is highlighted as the current toolpath in red. Figure 40 displays all toolpaths in one 
viewport, which could be overlapping and confusing. A part could be machined 
with multiple operations, such as drilling, roughing, semi finishing and finishing 
milling, which are in a planed order of different NC toolpath. A colour scheme is 
used to distinguish NC toolpath with different colours and highlight the current 
operation with red. 

Figure 41 displays only current toolpath clearly against part model. For current 
toolpath, every node is highlighted with a white dot. The cutter and holder could be 
shown with solid colour or wireframe, even with a line or white dot, so the toolpath 
would not be blocked by cutter shadow. 

Figure 42 highlights NC program errors, such as too deep cut, full width cut, 
cutter fast move into material, too much cut, cut into machine table, holder gauging, 
overcut and overlapping blocks. 

In the real world, CNC machine tool follows MCD control step by step. In virtual 
machining, it is possible to move forward and backward for easy check up. 
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Fig. 42 Toolpath error display 


Figure 43 shows the toolpath control buttons that can be used to move cutter 
forward or backward along the toolpath, where the current NC block information 
is updated immediately, so the user can check and verify interactively. 


Cutting Simulation 

Quick simulation or slow cutting animation can start and switch at ease, as shown in 
Fig. 44. The animation speed could be interactively adjusted with a slide bar that is 
on the top right of the dialog bar. The workpiece can be rotated and zoomed at any 
moment of simulation. 

Upon the completion of the simulation, the initial raw material is machined into 
final shape, an in-process model is shown in Fig. 45, and a log file is automatically 
generated. The file records file name, cutter number and name, diameter and radius, 
length, minimum length and all types of error. 

• C:\QuickCNC\training_example.nc 

• Cuttter Number = 67 

• Cutter name = D 1 2 

• Cutter diameter = 12.000 

• Corner radius = 0.000 

• Error Type 1 Too deep cut = 0 








1226 


P. Liu and C.-F. Zhu 


r - OAQyfckSHflfrJEnLjTiHk'IS-nc [QuIdcCNC 1 1 


-Cvttn hC&pfeirfi VruJrto-. £HM Wk*S** 

^ e C- ;£ if-- > H| = :&| ii 



- if x 



■lift' % 



3| ®ji£%i© |g5l*HQ j 

next toolpath 


r* 


4 


55 



next error block 


S.B 



ttt mane Ewtf 

& 'JjwTt ■? r^s4fU»^ 


ip 




w 




r $ 7 ifr 3 




Back and next block 
hotkey <b> <n> 


next level 


Current block info 


jywi te M wr i 


l^d* 


t*X* 


t-htdc 


i 


A hvd ifi 


QTc-J- L D 
_ OToUhO 
GTdd -0 
Q Total- D 
OTc-rf-D 

JToUkO 

0 Td* 0 
OTdJ-O 


11M 


SO 


riOT 

*2 m m eii 

24*32 I 

Tf I JlOW J 

Z 4 EG t 

Voi 4«Q ft 

Def- MHM 

!S 


■m? 

omo 


Fig. 43 Quick move cutter forward and backward 
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Fig. 45 Completed simulation 


• Error Type 2 Full width cut = 0 

• Error Type 3 Rapid GO cut = 0 

• Error Type 4 Too much cut = 0 

• Error Type 5 Plunge into table = 0 

• Error Type 6 Holder Gouging = 0 

• Error Type 7 Overcut part face = 0 

• Error Type 8 Minimum distance = 0 

• Cutter length = 100.000 mm 

• Cutter length can be reduced =12 

• Cutter length should be longer than = 88.000 

• Volume of the remaining stock = 71,267 

After simulation, the workpiece in-process model can be saved and retrieved 
later for the next step of operation. 


Refined Workpiece In-process Model Display 

The workpiece in-process model is displayed in a rough mode for quick interactive 
viewing, such as zoom and rotates with easy mouse control. Figure 46 shows the 
refined workpiece display with amazing detail, such as the remaining stock and 
scallop height, with a colour map that could be customized by user. 
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Fig. 46 Zoom to refined details 


The mouse cross can be used as a probe to measure XYZ position on workpiece 
surface. The cutter name and operation for this position can be displayed instantly 
on the workpiece dialog bar, as shown in Fig. 47. 

Cross sections of workpiece could be viewed with or without the design part 
model. The section position could be controlled with a slide bar for dynamic effects. 
Figure 48 sectioned the workpiece in-process model along X axis against design 
model, with the display control dialog box. 

QuickCNC can display CNC errors in graphics as shown in Fig. 49, where 
conventional log file could be lengthy and difficult to read. Search for error is 
easy and quick with two buttons. 

Figure 50 uses colour map of the remaining stock to visualize the left over from 
previous machining operations. User can define the range of interested area. This 
technique can also be used to show spark gaps in EDM machining. 


Virtual Training of Machinist and CNC Programmer 

As a cutting edge technology of modern manufacturing industry, CNC machining 
produces essential inputs for virtually all types of manufacturing products for 
different applications, including injection mold, sheet metal die, casting die, jigs 
and fixtures and other special tools. CNC technology has been widely used in 
computer-aided manufacturing (CAM), high speed machining (HSM) and ultra 
precision machining (UPM). The extensive use of CNC significantly improves 
the productivity of precision engineering but has caused a shortage of skilled 
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Fig. 47 Measure workpiece in-process model against part design 
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Fig. 48 Section view against STL model 
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Fig. 50 Colour map of the remaining stock 

technicians or machinists, especially in the knowledge intensive areas such as 
HSM and UPM. Training of skilled machinists is therefore a crucial yet challenging 
job. A qualified HSM machinist should have good knowledge of machining, 
understand the operation of the machine tools, and be able to do planning for 
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machining process. Traditionally, trainees acquire their operating skills in several 
years through observation and reference to the operation manual. After which, they 
would leam to operate machines for themselves under the guidance of experienced 
operators. The acquisition and maintenance of real CNC machines, the consump- 
tions of real materials in machining, and the set-up and maintenance for workshops, 
all contribute substantially to the high cost of conventional CNC training. Cost- 
effective and safe CNC training is thus highly desired. 

An apprentice will get to know a conventional mill by handling it under 
controlled conditions, by machining initially simple parts, always being careful to 
keep the tool far away from the faceplate. Accidents happen. An extra turn of the 
lever and the tool may hit the machine table. Even a broken cutter and a scratched 
faceplate in a learning mill is not much of a loss, a CNC machine tool costs several 
times more and is more prone to serious accidents. A wrong line of code may punch 
the main spindle towards the machine table, provoking a horrendous collision 
causing serious losses. Students could be traumatized by the crash and lose interest 
in this trade, which is facing an increasing problem of manpower shortage. 

With computers becoming more common, the obvious follow up development is 
software that can simulate the entire process, dispensing with the real life machine 
tool altogether. The challenge of moving from a manual machine tool to a CNC 
version resides at the programming side, not in handling the machine. Since both 
PC and CNC control panels use touch screen, it makes little difference on whether 
the programming is for a machine simulated in PC or a real life CNC machine 
control panel, which itself is a computer, so CNC programming training can be 
naturally replaced with virtual machining. While CNC training using real CNC 
machines is necessary, the use of virtual reality (VR) technology to support CNC 
training has been a popular topic in recent years (Avgoustinov 2000). 

Simulation of the entire machining processes for CNC training is significant 
given its lower cost and risk-free nature. The drastic decrease of the cost of 
computer, coupled with the worldwide price increase in material and machine 
tools means that virtual CNC training using computerized modelling and simulation 
is a cost-effective and sustainable approach to technical and professional education 
in manufacturing applications. The virtual CNC training system is developed for 
simulation of multiple machining processes. It is particularly important in the 
training of knowledge-intensive high speed and ultra precision machining. Com- 
pared with conventional on-site manual training or e-learning, the virtual CNC 
training system greatly increases learning efficiency and effectiveness of trainees, 
and improves cost saving in terms of machine and material uses. 

Virtual manufacturing is the use of a desktop virtual reality system for the 
computer-aided design of components and manufacturing processes. Virtual reality 
is a computer technology that enables users to view or ‘immerse’ themselves in an 
alternate world. Immersion and man-machine interaction is the core of VR tech- 
nology. VR technology has obvious applications in education and training where 
potentially dangerous tasks such as flying or surgery are carried out and also has 
been used for many different applications in a variety of industries. This work 
provided some insight into reconstructing of virtual machining centre by using PC 
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platform and realized the machining centre navigation and man-machine interac- 
tive operation. In this virtual environment, users can operate the machining centre 
and complete a product machining process. Through this virtual platform, users can 
obtain knowledge about the structure of machining centre and get familiar with the 
complex operation of machining centre before they have the opportunity of manip- 
ulating the real machining tool, which is desirable for practical operation. 


Current Status of Virtual CNC Training 

Compared to NC simulation applications which are expensive and mature, the 
virtual CNC training system is still primitive. The NC simulator developers are 
not actively trying to provide a training system because the training software 
market is logically smaller than production software. More importantly, NC simu- 
lator developers need to revamp the graphics engine or geometry kernel to suit 
education game use. None of the leading NC simulators has any CNC training 
capability. This leaves the development of virtual CNC training system to machine 
tool vendors and schools who do not have expertise to develop a good graphics 
engine (Garcia-Plaza et al. 2011). 

The CNC control vendors developed their own training system. For example, 
Siemens developed SinuTrain, which is CNC training software. It runs on PC and is 
suitable for training purposes and self-study as it is for writing programs and 
simulation. It serves for writing and simulating NC programs on a PC, based on the 
DIN 66025 programming language as well as the products ShopMill, ShopTum and 
ManualTum + and language commands for SINUMERIK® 810D, 840D and 840Di 
controls, all are Siemens products. Programs written with this software can be used on 
real machines. A prerequisite is that the SinuTrain software is adapted to the 
SINUMERIK control on which the program is to be executed. This adaptation must 
be carried out by specially qualified personnel, e.g. from Siemens. It is important to 
stick to Siemens and the machine-tool manufacturer’s instmctions when adapting the 
software. No liability is accepted by Siemens if these requirements are not adhered. 

The vender specific virtual CNC training systems have very good GUIs which 
have been customized to the vender’s own CNC controller; some even have a touch 
tablet that simulates operation panel. However, the cutting simulation is rough and 
primitive, despite the sound and chip flying animation. 

VRML can be used as an inexpensive means for simulation of one of the most 
interesting but also most time and resource consuming areas of computer aided 
manufacturing (CAM) - machining of complex parts. 

There has been much research and many publications on virtual CNC training in 
the last 10 years. Most of the published graphics engines are based on VRML and 
Java 3D. They explored internet based CNC training and remote NC simulation 
etc., which are futuristic but not practical at this moment. The computer hardware is 
very cheap now that there is no need to run a training system over the internet. 
Remote graphics over the internet is not necessary. Some of them use flash movies 
such as micro media to do animation, which could only be used for pre-fixed scenes. 
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Fig. 51 System architecture of a virtual training simulator 

In VRML, the realization of dynamic material removal during a machining 
process remains a problem (Garcia-Plaza et al. 2011). Some commercial software 
such as Deneb’s virtual NC can export a VRML animation to describe a machining 
process. Nevertheless, during the cutting process, the geometry of a workpiece 
remains unchanged. The reason is that VRML does not support set operations 
among geometric objects such as union, intersection and difference. This makes 
it difficult to simulate the change in geometry of a workpiece under cutting. 

In layman’s term, the current virtual CNC training systems are educational 
games that lack the realistic feeling of machining, which is quite different from 
realistic simulators, such as the flying simulators that are used to train pilots. 

The next generation of virtual CNC training is to provide knowledge intensive 
CNC training, for the future skilled machinist of precision engineering (PE) industry 
through pervasive physics-geometrical modelling and simulation of multiple machin- 
ing processes, especially high speed and ultra precision machining. To realize this 
vision, a new in-process model (IPM) that is deformable and precise is needed. 


Virtual CNC Machine Centre 

The proposed new in-process model has been used in the virtual CNC training 
system developed at Singapore Institute of Manufacturing Technology (SIMTech) 
for training of CAM programmer and CNC machinists. The block diagram is given 
in Fig. 51. Architecture of other virtual machining systems also will be similar to 
this; they will differ only in the representation scheme. 

Virtual CNC machine and control panel were developed on Microsoft Windows 
platform, with OpenGL as the graphics engine. The system architecture follows 
Microsoft Application Framework, with a modeless dialog box as the blue print for 
control panel. 

The CNC control panel in Fig. 52 is different from normal Microsoft Windows 
Dialog in terms of user experience, since CNC control panels were developed 



1234 


P. Liu and C.-F. Zhu 


GUI 


v] Virtual CUC Train Ing Lab 


ra* rdii sicMfc 


i irri'c 


I-Jj? j$] *fr;*.*|*. 




CNC CMtr^lir P-anr-l 


SP STA 


fct 

4CUTTEftO»EVi ; 
t4S C-4D CM Ci17 CJa ; 

mo cs* mo ziooe , 

MO Jffl TO S SOW. 

M3 Z» Hfl-i UH ; 

Ml Z-J F1M, 

Ml KU Mi X-.3S TO ; 

ms je-3a> to m - 

MC3 .K* -4£C DQQ Y« HflWfl Z^-100 0® 
ACT F 100 RJMftJi 3 0 TO 

* £DfT 


* I I 



M# 


— v\ VS 

3 1*5 .'Sff 


ffc.lCvu.lB O'. 


\\ 


+ JO* 

'■ 

i 


PlB.i Jhp 

OH DIP D1 *11 


PJBI 

KlW 



NC life | 



HUI 


•0 


BiHi 


PACE 


rtw 




CAh 


MA 


RAffi 


rvuut 

* ■114-1 IK? 

EftWJCU 

km 

X V 1 


LiffiZ 1 IM WW! PEED 4UI 


**«»** 
_i" s- L 


"KSf 


O O O sKj 


o.aoo 


Machine 


Cummt TnoJpwh 

Praatt Op*ii HC iile 

| setup | |*.prc| iw?Kt«>| 

Cyi^TIrna | 

Lnwnrmas? Z 

Tool 

Dl fif 


Cun urtl Black 


nw 


c-biick 


AJCrvel 


u£ 


NO 


fnrwiml-> 


down V 


S 


Volume 

Dspih 

Widlh % Oinim 


7 


0.00] 

■ 






v \ 


lOOSMumlti 


CNC Controller Panel 


set slock 


sbi 


■stack 


stuck 


rendpf 


Section 
move naw 
1 1; n i next 

CMM 

X 300.000 
Y 200.000 
Z 0 000 

Rx 123456789 

% ts&mm 

Rz 123456789 
Op T23456?00 

Cutler 
Hal 

D 20.000 
R 0,000 


Fig. 52 Graphics user interface of a virtual training simulator 


before the PC age, with CRT display and hard buttons. In order to simulate the 
traditional CNC control panel, conventional modeless dialog has to be customized 
with special graphics features, even buttons were drawn from the bitmap image. How 
to turn the knobs is another problem, the mouse centre wheel was employed to rotate. 

The machine frame, door knob, spindle, workpiece, tool change button, probe 
indicator and work table are shown in Fig. 53. All the tools on the machine are 
operable with mouse buttons and centre wheel. 

Virtual measurement on machine tool can be realized with virtual vise, dial 
indicator, probe, wiggler and tool pre- setter in Fig. 54. The virtual instruments 
reflect the manual operation in the same way as the real one; the indicator needles 
rotate just like a real watch; the wiggler vibrates with a shadow. The user can 
manually turn the instrument with mouse clicks and centre wheel. 

With these virtual tooling components, a trainee can learn how to clamp work- 
piece on the machine table virtually with a predefined operation procedure, as 
shown in Fig. 55. 

The trainees can use the system to simulate the milling process and save the 
“machined” model for other downstream machining processes. Figure 50 demon- 
strates the simulation of the remaining stock and the scallop height. In addition, 
they can control the simulation speed to see the details at any angle on the current 
situation of the machining, which is difficult if not impossible in the real machine 
based training. 
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Fig. 53 Machine frame and fixtures of a virtual training simulator 



Fig. 54 Virtual vise, dial indicator, probe, wiggler and tool pre-setter 


A set of different machining samples has been provided to demonstrate how the 
generated tool-path works with cutter under various cutting parameters with the aid 
of the virtual controller. Trainees can learn different setups in a short time using 
virtual simulator on PC, which significantly shortens the learning curve compared 
to the traditional training in a workshop. 

While it is dangerous to show the effect of a wrong setup or NC code on the shop 
floor, the virtual simulator has graphics and sound developed to synthesize various 
effects. In particular, the virtual CNC training system can simulate an accident 
using a graphic and sound effect when a trainee breaks a leg of a work-piece during 
the virtual machining. Among several benefits safety and material cost reduction 
are the direct and major gains from virtual simulation. 
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Fig. 55 Virtual CNC simulates the clamping of workpiece 


Virtual CNC Control Panel 


The virtual CNC control panel is a virtual copy of an actual machine control panel. 
The virtual CNC panel is integrated with a simulated machine tool. The simulated 
machine responds to the programs, commands and inputs of the virtual control 
panel in the same manner as a real machine. The CNC emulator and machine tool 
simulator allow anyone to learn actual CNC automation at any time and in any 
place. The new Windows touch screen serves as a good control panel interface. For 
modern LCD touch screen control panel, touch screen PC is a natural clone so the 
emulation is perfect with the virtual key pad. 

The virtual display emulates real CRT display with traditional style of text. The 
display content will change according to different control modes, such as actual 
position, all positions, WCS table, compensation, etc., as shown in Fig. 56. The 
virtual CRT display and machine movement is synchronized without delay. 

After the trainee measures the workpiece position on machine table, the data can 
be easily input to the virtual control panel by virtual key pad. The data can be 
retrieved in a later session. 


Virtual Manual Machining Operation 

Virtual CNC training system starts as a power off machine tool. The trainee needs 
to power on the machine and release the red emergency stop button. Then the 
returning to home operation has to be completed for three axes. Without this step, 
the next operation will not be accurate and precise. 
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Fig. 56 Virtual CNC control panel display 


The door has to be opened before any operation on machine table. The tool table 
can store the fixtures. The trainee can pull the tooling between machine table and 
tool table. 

Virtual manual operation is a great challenge for a low cost personal computer. 
The trainee must be able to move workpiece, tighten screw, insert a parallel bar, 
pull a shim through a gap and turn knobs all the time. The mouse and touch screen 
are main interactive devices of the PC. After many trial and errors, the mouse wheel 
is used to tighten screws and turn knobs, the double clicks on workpiece are 
simulated as hammer to shift workpiece for positioning, and the touch screen is 
used as control panel key pad, so far this is the most realistic approach. 

The different cursors are displayed for easy use. There will be a screw cursor 
when the mouse cross is near a screw. A hammer cursor will appear at the boundary 
of workpiece and indicate a minor position shift can be achieved with a blow. 

A machinist can manually operate virtual CNC training system to mount work- 
piece on machine table, clamp it with fixture or vise, align with axis and setup 
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machining origin. Virtual vise and fixtures must be locked with a screw; otherwise 
the workpiece position will shift during cutting simulation. The workpiece shift is 
animated with vibration and sound. 

All the cutters have to be pre-set to the correct length and record this tool length 
value through the virtual control panel. Virtual dial indicator, touch probe and 
cutter pre-setter function as real ones with two degrees of needles for display. The 
real time interference checks will feed realistic values on the display. 

Virtual shim could be employed to check gap between the cutter tip and 
workpiece. The trainees use mouse to drag the selected shim through the gap and 
see the difference. If the selected shim is thicker than the gap, the shim will not pass 
through the gap. Using different shim and jogging cutter, the trainees can calculate 
the correct cutter length and workpiece position. 

Virtual wiggler is an even more interesting instrument to align workpiece. It will 
stop vibrating only if its outside diameter properly aligns with workpiece walls. 
Graphics animation of wiggler is amplified for easy observation. 


Safe Training of Machine Operation in a Classroom 

The precision engineering Worker Skill Qualification (PE WSQ) Specialist Diploma 
is a joint initiative by SIMTech and the Singapore Workforce Development Agency 
(WDA) to provide hands-on training to equip future PE professionals in cutting-edge 
precision machining processing technologies. This program is conducted through a 
series of lectures, laboratory demonstrations and project attachments in selected 
industrial applications. As most of the training organizations have limited numbers 
of CNC machine tools and CNC trainer available, they can install the virtual CNC 
training system on their PCs to conduct hands-on training. In this program, the virtual 
training laboratory is designed for 40 students to learn CNC. A high speed machining 
course is conducted for the WSQ trainees to learn machining using the system. Twelve 
sets of HSM examples are created allowing trainees to learn different machining 
techniques and strategies, one of them is shown in Fig. 57. Using virtual CNC training 
can effectively reduce CNC learning curve from typical weeks long to just one night. 
Trainees can do self-learning using the same software on their own PCs. 

With the financial support from Local Enterprise and Association Development 
Programme (LEAD), SPETA has deployed the virtual CNC training system in their 
classroom, as shown in Fig. 58. As one of the several critical areas they identified to 
enhance the capabilities of the PE companies, training of CNC machinists has been 
paid great attention. In partner with SIMTech, SPETA uses the system to train CNC 
machinists - somewhat like the flight simulators to train pilots, which will signif- 
icantly trim the training hours on the actual machine. 

The system is also used in Institute for Technical Education (ITE) for training 
computer numeric control machinists. Significantly reducing the hours and machine 
resources required, the virtual CNC training system enables trainees to practice 
more with various machining requirements within the same allocated training time. 
With this additional preparation, trainees would have a shorter learning cycle when 
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Fig. 57 Virtual CNC machining example for WSQ course 
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Fig. 58 Virtual CNC training in practice 





Fig. 59 Virtual CNC in MTA and science festival 

they start working with the companies. Virtual CNC training has been featured in 
local TV, radio, and all newspapers. Figure 59 highlighted virtual CNC training in 
Metal Asia (MTA) and Singapore Science Festival. 


Profiting from Virtual Machining 

Virtual machining is not a “nice to have” feature that focuses on trade show 
demonstrations. There are practical industrial applications which need virtual 
machining to complete, which include code parsing, toolpath backplot, trial cut 
replacement, adaptive speed-feedrate optimization and virtual training. 
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Just like all virtual manufacturing technologies, virtual machining has been ready 
for pervasive industry implementation for a long time. The only barrier was the high 
computing cost of UNIX graphics workstations, such as Silicon Graphics, which 
were beyond the reach of small machining workshops. With the recent worldwide 
price surge in materials, energy and machine tool, pervasive virtual machining is not 
only technically possible but also makes business sense, since computing cost is 
almost zero. 


First Part Right 

An NC program has thousands of lines of tool movement instructions that may 
contain errors. Following these instructions, CNC machine tool will move blindly, 
without any check on gauging, overcut or cutting force. It is not possible to verify 
code manually, so the NC verification software was developed during the 1980s. 

NC simulation features full 3D, solid model, shaded simulation of entire NC 
machine tools and material removal. This visualization tool enables programmers 
and machinists alike to preview exactly what will happen on the shop floor and check 
for collisions. Many use NC simulation for electronic shop floor documentation. 

NC verification detects problems in the NC tool path program. It is a powerful 
visual inspection tool, which highlights fast feed errors, gouges and potential 
crashes/collisions. Programmers can detect and correct problems before prove- 
out. With NC verification you can virtually eliminate NC program mistakes, 
greatly reduce the time spent on prove-outs, and make the move to “lights-out” 
machining. The NC simulation program is smart enough to detect problems such 
as fast feed errors, gouges and collisions that could potentially scrap the part, 
break the cutter or crash the machine. Any error discovered by simulating 
software allows the programmer to immediately identify the offending NC 
program record by mouse-clicking on the error. The problem can therefore be 
fixed during the NC coding phase so as to insure an error-free code when it 
reaches the shop floor. 

NC analysis identifies the tool path record responsible for an error. You can 
quickly verify the dimensional accuracy of the entire part with a full array of 3D 
measurement tools. NC analysis compares the simulated part to the design model 
so you can be sure the machined part will match the design intent. NC analysis 
performs constant gouge checking. Analysis of the “as-cut” part delves deeper 
into the verification process. Is the resulting cut part dimensionally accurate? 
Does it match the final desired part shape? NC verification software enables the 
user to zoom in on suspect areas for in-depth inspection. The part can be rotated 
and cross-sectioned at any angle to check areas that would otherwise be impos- 
sible to see, such as the intersection of drilled holes. Detailed measurement tools 
enable the user to verify dimensions such as wall and floor thickness, hole 
diameters, corner radii, scallop heights, depth, gaps, distances, angles, volumes, 
etc. NC simulation software such as VERICUT® from CGTech also provides the 
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ability to automatically compare the as-cut part with the original design. The 
AUTO-Diff module can embed the CAD design model inside the stock, automat- 
ically comparing the design to the in-process workpiece in order to reveal any 
discrepancies such as gouges or excess material not removed by the machining 
processes. 

NC errors could destroy work pieces, even damage machine tool. One NC error 
could make the workpiece a waste and take days to rework and eat into profit. In 
small batch production, there is no time for trial and error. Especially for high speed 
machining (HSM), the fast moving and expensive cutter is very easily broken. The 
dynamic machining load will greatly affect cutter life, geometry accuracy and 
surface finishing. 

The challenges also come from the huge tool path of HSM. A million lines of NC 
code are common practice in today’s shop floor. The traditional NC verification is 
so slow that even HSM itself is faster than verification. The size of the program 
combined with a high feed rate makes it almost impossible to run test simulations 
prior to cutting metal. 

NC verification cannot rely on CPU of faster processing cycles as single-core 
silicon reaches the limit of heat dissipation and power consumption. A processor 
containing multiple cores, leveraging its ability to execute multiple tasks, offers a 
higher level of computing power and functionality than the current generation 
single-core processor. As this new technology comes to market, software compa- 
nies are examining how software will adapt. The current NC simulation models are 
not optimized for multi-core computing as some software only run 1 2 % faster on a 
dual processor workstation. How to split the NC simulation between dual-core and 
graphics card is a new R&D challenge. 

Singapore Institute of Manufacturing Technology (SIMTech) has developed 
a more efficient approach based on a patented geometry representation. The 
system starts with a solid model of the machined part and quickly simulates 
and optimizes machining processes. NC code could be selectively reverse 
post processed into 3D tool path graphics display and interactively viewed, edited 
and optimized. The user can highlight or hide operation, tool path or layer. 
The user can also display and edit a certain layer of toolpath. Tool paths and 
cutting results can be viewed from any viewpoint and checked automatically. The 
machined part and the design part are compared for the remaining stock and over 
cut. Error-free tool paths are created, eliminating the need for a time-consuming 
test cut. 

Based on this patented technique, SIMTech developed several practical appli- 
cations for mould manufacturers. These include QuickSeeNC, QuickCNC and 
Parting Adviser, which provide “What You See is What You Cut” functionality 
for shop floor machine operators and mould designers. The technology is suitable 
for machine tool NC tool path simulation, verification and optimization in the 
precision engineering, automotive, aerospace and electronics industries. 
QuickCNC has been adopted by several local die and mould makers for its speed 
and simplicity. 
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Pervasive Virtual Applications in whole Process Chains 

Moving beyond the NC programming department, virtual machining could be used 
pervasively all over whole process chains, such as part design, tool design, process 
planning and scheduling, tool data management, material, setup, production and 
quality control, as shown in Fig. 60. 

Engineers often need a method of getting a model of the as-manufactured part back 
into the CAD system for a variety of reasons. It could be that the required CAD model 
does not exist but legacy NC program data to create it does. Frequently the as-cut part 
contains features (fillets, blends, etc.) not present in the original CAD design and an 
accurate and complete model is needed for finite element modelling or environmental 
simulation or further engineering analysis. Often, simulating pseudo NC paths is the 
fastest and simplest way to create complex offset surface shapes. 

Whatever the reason, NC simulation can create either surface or solid model 
representations of the simulated machined part. The exported model can be either a 
surface or solid b-rep model with geometric shapes (cylinder, cone, plane, torus 
sweep, etc.) that represent machined features such as drilled holes, pocket corners 
and walls, filleted blends and other common manufacturing features. Very small 
machined features such as scallops created by ball-end mill contouring of complex 
shapes can be collected together into large surface patches representing the nominal 
feature intended by the machining operation. 


In-Process Geometry for Manufacturing Engineering 

In addition to design engineering ’ s need for an as -manufactured CAD model, other 
manufacturing engineering and planning functions could use the information. 

It is difficult to imagine, plan and design all the resources required for subse- 
quent operations (NC programs, fixtures, custom cutting tools, inspection tools, 
work handling devices, transfer methods, etc.) without an accurate representation of 
the initial material state left by the previous operation. The accuracy, efficiency and 
“correctness” of each operation depend on the NC programmer, tool designer and 
process planner knowing the material’s initial geometric shape. Until now, the only 
way to create an in-process CAD model was with expensive, labour-intensive, 
error-prone and inaccurate methods. However, exporting a CAD model of the 
in-process or as-machined solid model created automatically from the verification 
step makes it possible to avoid these time-consuming activities. Users can create the 
CAD model at any stage in the machining process. 


Simulation for Process Planning, Scheduling, Production and QA 

In order for process planners to do their job effectively, it is cmcial to know accurate 
machining times. This can be easily obtained by simulating the NC program. Times 
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are calculated for every step in the machining process including the amount of time it 
takes to change tools, pallets or other miscellaneous machine actions. This informa- 
tion can be essential to keeping the production floor operating to its full capacity. 

Additionally, the process planners can use an in-process geometry model to create 
robust inspection instructions in very little time. Typically, a manufacturing engineer, 
NC programmer or process planner manually creates these instructions to tell the 
machine operator what to measure and how to document the results. Without an 
in-process model of the part, manual methods are very tedious and prone to mistakes. 
The highly-customizable inspection instructions can be created automatically. This 
helps to establish a formal but easy and efficient method to create the necessary 
documentation. The software outputs the inspection instructions based on the dimen- 
sions of the simulated cut stock (as-cut semi-finished wall thickness = .15", for 
example). The accurate in-process geometry is required to automatically generate 
this type of document, and is only available by simulating the NC program. 


Documentation for Workshop 

The latest NC simulation system includes powerful tools for creating custom 
reports, tailored for a specific user/department/company’s needs, containing useful 
process information generated during the simulation. The automatically generated 
documents can be used for shop floor or in-process documentation, NC program- 
ming documentation or to capture valuable process information generated during 
the simulation session. Produced in standard HTML or PDF format, the report 
layout is highly customizable, including the ability to specify page design, fonts, 
graphics, tables, pictures, statistics and user-defined information critical to 
documenting the CNC machining process. 

Simulating CAM output to view basic workpiece material removal is no longer 
enough in today’s competitive global marketplace. It is critical to be operating as 
efficiently as possible; modern simulation and optimization software has become a 
key tool to minimize the cost and time of production while maintaining or increas- 
ing product quality. It has evolved into an important process that protects and frees 
up CNC machines, helps to eliminate scraped parts, and creates in-process infor- 
mation that can be utilized throughout the manufacturing enterprise. 


Adaptive Machining Optimization 

CNC optimization automatically determines the best feed rate for each segment of 
the tool path based on the machining conditions and amount of material removed. 
Optimizing NC feed rates greatly reduces the time it takes to machine parts and 
improves the quality of surface finish. 

After CNC verification and achieving error-free machining, the in-process 
model could be used to achieve faster machining, which is based on the calculated 
material removal rate. 
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Fig. 61 NC optimization 

The machining model, simulation and verification processes ensure that the NC 
programs sent to the shop are both accurate and efficient. To create the most 
efficient machining processes possible, optimization software can determine the 
best feed rates to use for each cutting operation. Achieving the best feed rates for 
each cut in an NC program has always been a goal for NC programmers but has 
traditionally been a very difficult task plagued by a number of problems. First, 
trying to imagine the cutter contact and cutting conditions or each cut in a large NC 
program is virtually impossible. Manually inserting different feed rates for each 
changing condition is not practical. An incorrect feed rate estimate can break the 
cutting tool, damage the fixture or scrap the part. 

Typically either a single conservative feed rate is used for an entire machining 
sequence, as shown in the left side of Fig. 61, or a higher (i.e., “high speed”) feed 
rate is used but with a very conservative machining strategy. Both methods attempt 
to ensure that the cutter is not overloaded, but at the expense of very inefficient 
machining. Both of these strategies result in too slow cutting speeds or too light 
removal rates that waste time, increase costs and prematurely wear cutters. 

To address this issue, a knowledge-based machining package essentially adds 
intelligence to the cutter. During the simulation, the in-process geometrical model 
knows the exact depth, width and angle of each cut because the software also knows 
the exact shape of the in-process material at every instant of the machining 
sequence. It knows exactly how much material is removed by each cut segment, 
and the exact shape of the cutter contact with the material. With this unique 
knowledge set, it determines the best feed rate for each cutting condition encoun- 
tered, taking into account volume of material removed, chip load, and machine 
acceleration and deceleration requirements, as shown in the right side of Fig. 61. If 
desired, the software can also divide cuts into smaller segments and vary the feed 
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rates as needed in order to maintain a consistent chip load or volume removal rate. It 
then creates a new NC program, with the same trajectory as the original, but with 
improved feed rates. 

Summary and Looking to the Future 

Virtual machining simulates NC code to discover errors, without time consuming 
trial runs or online debugging on real machine tool. Working towards a vision of 
pervasive modelling and simulation, various deformable in-process geometry 
models from the 2D sections to 3D representations, from Z map to unified voxel- 
based are discussed. A practical system developed based on the deformable 
in-process geometry model is taken as an example to demonstrate the application 
of virtual machining for NC verification. Virtual machine tool with a virtual CNC 
control panel and virtual jigs and inspection tools is introduced for training purpose. 

There was a time when the computer was expensive and software was difficult to 
use, but virtual machining was still running with profit for high cost aerospace 
machining. Today the computing cost is almost zero compared to material and 
machine centre, so it is time for pervasive virtual machining application in every 
sector. An easy to use and low cost virtual machining system will find a wide market. 

Looking forwards, sustainable machining is a great challenge. Towards smart 
and competitive sustainable machining, CNC model and simulation will be used to 
optimize the machining process, where the raw material could be saved through 
first part correct technology, the energy could be saved through cutting speed 
optimization, and used parts could be saved by remanufacturing. 

The simulation of chip formation using the finite-element-method (FEM) pred- 
icates the cutting force and chip thickness, thus saving time in the subsequent 
machining trials. State of the art today is the individual simulation of the machining 
process and the machine performance. In real machining processes however, these 
parameters are inter-dependent and influence one another heavily. Integrated sim- 
ulation, whereby the process-machine interaction is simulated, is therefore a further 
key technology for sustainable production in the future. 

Tool chatter is the barrier for higher material removal rate and can damage 
machine tool spindle. A certain combination of depth of cut and speed can incur self 
excited vibration of tooling system and generate cutter marks on the machined 
surface. Dynamic machining model and simulation can predicate best cutting speed 
and depth combination that will cut faster without chattering. 

The simulation of machining operations offers the potential to fulfil the ecolog- 
ical, social and economic requirements of sustainability. For example, the adjust- 
ment of the suitable feed rate in the milling of complex geometries from difficult to 
machine materials may be optimized through simulation and thus reduce the 
machining time by up to 40 %. The resultant reduction in consumed resources 
allows a saving of both costs and energy. 

The machining stock is the volume difference between the designed part geom- 
etry and raw material geometry. Reducing the machining stock can save raw 
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material. The minimum machining stock could be achieved through near net shape 
forming of the raw material, such as casting, forging and welding. 

The most material and energy are wasted in the manufacturing processes. These 
wastes can be saved through re-machining of the damaged component, where the 
damage can be repaired by welding or thermal spray. For a long time in the 
aerospace industry, overhaul of jet engines has been a profitable business world- 
wide. Nowadays even the automotive industry has started to re -manufacture many 
components, especially engines. However, re-machining is a great challenge for 
CNC machining since the damaged component geometry is warped. Virtual 
machining can simulate this warped component and generate 3D printing and the 
following cleanup toolpath. 
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Abstract 

Tolerance charting is one of the important technologies used in process planning to 
develop the mean sizes and tolerances of the working dimensions for a new 
manufacturing process, or to analyze a set of existing dimensions and tolerances 
to determine if the component can be made to meet the blueprint specification. 
In this paper a computer-aided angular tolerance charting system is implemented, 
which is mainly based on the mathematical models of the algebraic method for both 
square shouldered and angular features. A surface changing algorithm is applied to 
calculate the working dimensions. The process tolerance allocation is carried out 
with a discrete interval cost-tolerance model and a multi-choice knapsack model. 
With a genetic algorithm, an optimal or a near-optimal solution of tolerance 
allocation is achieved consequently. The implementation of such a computer- 
aided angular tolerance charting system is presented from the viewpoint of software 
engineering. The data structures for blueprint information, operation information, 
and tolerance charts are designed. The functions of the angular tolerance charting 
system are described. The prototype of the system is analyzed in detail. 

Introduction 

According to the Academic Press Dictionary of Science and Technology, a toler- 
ance chart is defined as “a graphical representation of a sequence of operations to 
which a part must be manufactured with identifying specified dimensions indicating 
tolerance limits” (Morris 1992). It is a visual graphic tool used to develop the mean 
sizes and tolerances of the working dimensions for a new manufacturing process, or 
to analyze a set of existing dimensions and tolerances to determine if the component 
can be made to meet the blueprint specification. It is an effective means to deal with 
the tolerance stack-up problem in process planning for part machining. 

Engineering drawings have traditionally been used for design documentation and 
for effective communication between engineers from different departments. During 
the design of mechanical components and assemblies, tolerances are specified in 
conjunction with part geometry, material type, and other technical specifications. 

Process planning is a function by which the drawing specifications are 
transformed into manufacturing instructions. The process engineers interpret the 
meaning of the blueprint drawing and design the manufacturing routings by decid- 
ing which manufacturing process and machines should be used to perform the 
various operations necessary to produce a component and the sequence. 

The cost of production increases geometrically for uniform incremental tight- 
ening of tolerances. One way to minimize production costs is to ensure that no 
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tolerance is held tighter than required by the product design drawing or by the 
nature of the manufacturing process concepts covering sequencing, choice of 
location surfaces, dimensioning schemes on the cuts, work-holding principles, etc. 

The widespread and growing use of NC machining has reduced the extent of the 
tolerance stack-up control problem by allowing cuts to be machined as shown on 
the blueprint dimensioning schemes, by eliminating manual control of machine 
decisions affecting the cuts, and by reducing the number of location surface 
changes and the attendant fixture required by non-NC machining. In general, it 
has also improved size control and control of geometric characteristics of part 
feature. However, not all tolerance stack-ups are eliminated by using NC machine. 
NC machines are often combined with conventional machines to form a group of 
machines to make a part. 

Many simple parts can be made without the help of tolerance charting. However, 
in aerospace, automotive, and other industries, many parts are very expensive to 
manufacture. Scrap and rework of these parts result in large economic loss, 
especially in the just-in-time (JIT) manufacturing environment. “Doing it right 
the first time” is obviously very important. Because of the complexity of the part, 
the tightness of the tolerance, the large number of operations needed to produce the 
part, and the high frequency of changing machining datum, process tolerances for 
the part are difficult to be controlled, and the scraps are usually inevitable. In 
production planning for quantity runs, however, the part cannot always be 
machined dimensionally as shown on the drawing, so datum surfaces must be set 
up by the production engineer based on a selection of locating surfaces for fixture 
and on cutting tool design layout decisions. Whether a part is made completely to 
print in one operation or routed over a series of machine, some of which may be NC, 
the process engineer must be capable of recognizing that a tolerance stack-up 
situation has been created which will affect the tolerance assigned to the machine 
cuts. When tolerance stack-up problems must be handled, the easiest, quickest, and 
most foolproof way is by the use of the tolerance chart. The tolerance charting 
technique can systematically control the process tolerances in process planning and 
eliminate the scraps in manufacturing. The sequence of manufacturing processes 
can be visualized with a tolerance chart. Whether a part can be machined to selected 
tolerance can be predetermined graphically by constructing a tolerance chart. 
Process planners can take advantages of the tolerance charting tools in determining 
the working dimensions and in allocating optimal tolerances to corresponding 
working dimensions. Particularly for complex parts like splined shafts, fine gears, 
airplane engine shafts, and/or similar parts, tolerance charts are invaluable, practi- 
cally indispensable for economical production. This is the reason why tolerance 
charts are widely used in manufacturing engineering, particularly in precision 
manufacturing engineering. 

Industries that utilize tolerance charting vary from large corporations 
manufacturing aircraft engines, off-road vehicles, and automobiles to NC-based 
machine shops. In any manufacturing company three separate groups are constantly 
concerned with the problems of determining the magnitude of the tolerances to be 
assigned to working dimensions, how tolerances build up, and ultimately, what is to 
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be done about those manufacturing situations where in the shop did not hold the 
prescribed tolerances. 

Product designers are the first group of users, who have the responsibility of 
issuing designs that incorporate the maximum possible working tolerances com- 
patible with the functional requirements. 

Manufacturing engineers are the second group of users, who have the responsi- 
bility of issuing operation sheets that prescribe the best and most economical 
sequence capable of producing the part. 

Quality control engineers are the third group of users, who have the responsi- 
bility of measuring quality, evaluating quality, and acting as the organizational 
catalyst when tolerances are not held to print and a coordinated group effort is 
required to determine when this happened. 

In current CE (concurrent engineering) environment, tolerancing is a common 
issue concerned by all human being involved in the product project. The design 
engineers’ job will fuse with the production engineers’ job. The utility of tolerance 
charts may cover all departments of the whole plant and even multi-plants. Toler- 
ance charts become one kind of enterprise resources shared among engineers from 
different departments. They can be stored in a PDM (product data management) 
system along with the geometric model, operation routing, NC files, etc. In modern 
manufacturing industry, the user groups of tolerance charts may cover ever wider 
than before. 

Tolerance charting is one of the effective ways used by process planners to 
increase the productivity and reduce the cost by optimal designing the dimensions 
and their tolerances to reduce the scraps and reworks. Process engineers using 
tolerance charts try to guarantee that all flaws will be purged from the initial 
process-planning decisions responsible for the tolerance stack-ups. Once this has 
been done correctly, the resulting process tolerances can be optimized. The final 
process will be cost effective and free of all high cost tolerances. 

This paper introduces the development of a software package named computer- 
aided angular tolerance charting (CAATC). The mathematical models are used to 
implement the angular tolerance charting with computer programs. 


Characteristics of CAATC 

CAATC is a Windows® application that can create tolerance charts electronically. 
It is a computerized graphic method of presenting the working dimensions of a part 
with angular features at the stages of their manufacture process. It can visually 
ensure the dimensions and tolerances to meet the specifications in the blueprint 
according to the principle of tolerance stack-ups. 

CAATC is a promising tool for process planners. It will free the process planners 
from the labor-concentrated jobs. The tedious works, such as constructing the 
schematics for blueprint dimensions and stock removals, calculating working 
dimensions, and identifying dimensional chains, are accomplished by the computer. 
The process engineers are only needed to prepare some necessary data to input into 
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the computer and do some selection for machining cuts, the dimensional chains are 
identified automatically, and the process tolerances are allocated optimally. 

CAATC is a computer-aided 2D tolerance charting system, more capable than 
those simple ID tolerance charting systems. The distinctive characteristics of the 
system are summarized as follows: 


Standardization 

With the development of computer-aided tolerance charting systems, the forms of 
tolerance charts vary from one to another. A standard template for tolerance charts 
is constructed as a basis in this system. At the same time, the users can personalize 
the tolerance charts with their preferences by selecting different options. So, a 
universal tolerance chart template is developed for different users. A standard 
tolerance chart template has been developed. 


Interactiveness 

A major failure from many tolerance optimization programs is that the operation 
sequence must be planned prior to running the programs and cannot be modified 
during the charting process. Interactive tolerance charting is a process -planning tool, 
not simply a process-checking tool. An interface is developed to input the tentative 
process sequence or to transfer the tentative process sequence from a CAPP program. 
Most importantly, the tentative process sequence can be modified in need if there are 
unachievable tolerances or dimensions. So it is capable of modifying the current 
process plan and generating alternative operation sequences, as well as of checking 
the processes for each sequence. In short, interactive charting is flexible and efficient. 


Integration 

The CAATC program is designed to be able to integrate with CAD/CAPP/CAM 
systems. This capability is essential in today’s concurrent engineering environment. 
The part sketch is drawn interactively. It can also be transferred from an existing 
CAD file. So the process planners will not have to redraw the part or re-input the 
design specifications. What they must do is select the dimensions and tolerances 
involved in the tolerance charts. The system outputs are also easy to be recognized 
by other systems. 


Systematization 

Various computer-aided techniques are available for tolerance charting. This raises 
a crucial issue as to how computer-aided tolerance charting should be performed. 
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With the algebraic method, the tolerance charts are able to represent the operational 
sequence graphically. The tolerance charts are extremely flexible to deal with 
angular features. The algebraic method is much more systematic and easier to 
implement than other mathematical models. It considers the whole process proce- 
dure instead of the single-dimensional chain. Both the forward and the reverse 
dimensional chains are easily obtained. Thereafter, the working dimensions are 
determined with the reverse dimensional chains, and the tolerance allocation is 
optimized with the forward dimensional chains. 


Uniqueness 

The CAATC system can deal with the tolerance charts not only for square shoul- 
dered parts but also for angular featured parts. It determines the dimensional chains 
in both the x-axis and the y-axis. The CAATC system can handle 2D tolerance 
charts, while others cannot. This is unique. 


Uniformity 

The data structures, the surface model, the algebraic method, and the trigonometric 
formulas are uniform in spite of the square shouldered or angular features. 


Mathematical Models Inside of the CAATC System 

In order to deal with tolerance analysis and tolerance synthesis with a tolerance 
charting, many mathematical models had been developed in the last 30 years. 
Several computer-aided tolerance charting systems are used to perform efficiently 
the calculations for dimensional planning and to adjust economically and reason- 
ably the working tolerances for each machining operation in the process. 

Early computer-aided tolerance charting method just computerized the manual 
tolerance charting approaches. In order to develop a systematic, efficient method 
for tolerance charting, it is necessary to establish some mathematical models for 
tolerance charts. In 1988, Tang and Davies introduced a matrix-tree-chain approach 
for computer-aided tolerance charting (Tang and Davies 1988). By performing a 
series of operations on the matrix, the dimensional chains and other relationship 
between a dimension and other related dimensions were found out. This mathe- 
matical model recalled the interest on tolerance charting among researchers over 
the world. After that, many mathematical models representing the tolerance charts 
had been developed. Irani et al. proposed a graph representation of the machining 
sequence in tolerance charts in 1989 (Irani et al. 1989). The blueprint schematics 
and the stock removal schematics were all represented with the graph representa- 
tion. Their graph-theoretic algorithm eliminated the need to maintain balance 
dimensions, as was done in the conventional scheme. Why brew et al. introduced 
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a graph-theoretic approach to tolerance charting in 1990 (Whybrew et al. 1990). 
Two approaches to find the route from a cut face to the original surface were 
presented, that is, a rooted tree diagram and a tabulation method. A. Y. C. Nee and 
A. S. Kumar presented a rule-based system for angular tolerance charting in 1992 
(Nee and Kumar 1992). In the following year, B. K. A. Ngoi and C. K. Chua 
developed a matrix method of deriving the dimensions and tolerances during 
tolerance charting (Ngoi and Chua 1993). P. Ji presented a tree-theoretic represen- 
tation for tolerance chart (Ji 1993a). Tang G. R. et al. presented a list approach to 
the tolerance chart (Tang et al. 1993). The year of 1993 gained the most popular 
interest in computer-aided tolerance charting. 

As to the tolerance allocation problems, the heuristic or mathematical optimi- 
zation approaches were often used. Chase et al. (1990) presented an approach to 
tolerance allocation for mechanical assembly based on the discrete cost-tolerance 
relationship. Three methods, that is, exhaustive search, univariate search, and 
sequential quadratic programming, were developed for selecting the most econom- 
ical manufacturing process for each dimension from a set of alternative processes. 
Dong and Soom (1990) used an expert system to automatically generate the 
optimization objective functions and to associate each design tolerance with an 
appropriate production cost-precision mode. The general formulation of optimal 
tolerance design was introduced for multiple related dimensional chains. Lee and 
Woo (1989) formulated the tolerance allocation for assembly as a discrete optimi- 
zation problem. For an optimum selection of tolerance from a given discrete model 
involving various manufacturing processes, a branch and bound algorithm was 
developed to ensure optimum selection. Gadallah and El Maraghy (1994) devel- 
oped an algorithm using the combinatorial nature of orthogonal arrays to assign 
dimensional tolerances and to select corresponding process. Kopardekar and Anand 
(1995) presented a neural network-based approach for the tolerance allocation 
problem considering process capabilities and mean shifts. Lin et al. (1997) studied 
the assembly tolerance allocation problem based on the Monte Carlo simulation. Ji 
et al. (2000) proposed a fuzzy comprehensive evaluation approach to evaluate the 
machinability of a part and developed a genetic algorithm to optimize the model. 
Chen and Fischer (2000) also used the genetic algorithm method in dealing with 
tolerance allocation problems for assembly. Ming and Mak (2001) used the genetic 
algorithm to generate the optimal tolerance for each manufacturing process and 
adopted the Hopfield neural network to select the manufacturing operations. 

He developed an optimization program to determine the optimal set of dimen- 
sions and tolerances by combining three cost-based objective functions (He 1991). 
For concurrent optimization of both design and manufacturing tolerances, 
Zhang et al. (1992) presented a simulated annealing method. Al-ansary and Deiab 
(1997) formulated a nonlinear multi-variable optimization model solved with a 
genetic algorithm. Jeang and Hun (2000) presented a method to determine the 
process means and tolerances simultaneously and to adjust them sequentially 
by considering process drift or deterioration. Li et al. (2000) used the genetic 
algorithm method to find the optimal machining datum set and machining 
tolerances simultaneously. 
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In fact, tolerance charting is the most effective tool for process tolerance 
allocation. In early days, process tolerance allocation was accomplished with the 
trial and error method with the aids of tolerance charts (Ahluwalia and Karolin 
1984). Initial tolerances were assigned to the machining cuts. The tolerance stack- 
up was then calculated to check whether the blueprint requirements were met or 
not. Obviously the optimal solutions were not so often obtained with such a 
heuristic method. In the 1990s, some advanced tolerance allocation approaches 
were developed. A linear programming model was developed for tolerance 
balancing (Ngoi 1992). Another direct linear programming model was developed 
on considering the blueprint dimensions and the process capabilities (Ji 1993b). 
A gradient search procedure was proposed by formulating the tolerance allocation 
as an optimization problem to minimize the cost under the constraints of design, 
machining, and process capabilities (Nurre and Vedati 1998). A feature-based 
tolerance chart was used in alternative tolerance allocation for machining parts 
(Tseng and Temg 1999). 

Because most models of tolerance allocation problems were formulated with the 
manufacturing cost as the objective functions, the cost-tolerance relationship 
became one of the most important issues for the tolerance allocation problems. 
Various functions had been proposed to describe the cost-tolerance relationship 
and various optimization methods had been applied (Chase et al. 1990; Wu 
et al. 1988). Dong et al. (1994) developed several new cost-tolerance models and 
a hybrid-model tolerance optimization formulation. These cost-tolerance models 
were all formulated as continuous curves based on the collected empirical 
cost-tolerance data. In practice, the empirical cost-tolerance data should be directly 
obtained from machine shops and recorded in the original form of discrete points. In 
fact, the cost-tolerance model can be derived as a discrete interval model based on 
the tolerance grades that can be achieved with a certain machining cuts. 

Here in the CAATC system, the mathematical model of algebraic method is 
employed to calculate the working dimensions and to verify the tolerance stack-ups. 
A surface model is employed to calculate the surface change after a machine cut is 
carried on the surface. With the cost-tolerance relationship, a multi-choice knap- 
sack model is formulated for the tolerance allocation problem. The genetic algo- 
rithm is used to solve the knapsack problem. Finally, the tolerances are optimally 
allocated to each work dimensions. And it is easy to check the tolerance stack-up. 
Figure 1 shows the relationships between the different mathematical models. 


Design of the CAATC System 

The general overview of the CAATC system is shown in Fig. 2. Three main classes 
of information are required to input into the system to build a tolerance chart. They 
are blueprint information, operation sequence information, and stock removal 
information. Blueprint information includes the part sketch and the dimensions. 
The part sketch is drawn through defining the points and the surfaces describing the 
part. The dimensions are inputted manually based on the part sketch. Besides, they 
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Fig. 1 The relationship between the mathematical models 



Fig. 2 General overview of the CAATC system 
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can also be transferred from a CAD file. Operation information is the tentative 
operation sequence that used to manufacture the part. It includes operation number, 
operation type, machined surface, and reference surface. It is often designed manu- 
ally. It can also be transferred from a CAPP system. Stock information includes the 
stock removals and process capability of each machining cut. It is the manufacturing 
resource information including machine types, minimum and maximum stock 
removals, and multiple tolerances appropriate to a specific material for machine 
cuts. If the information is retrieved from an existing CAD or CAPP files, it should 
be transferred to the data format that can be recognized by the CAATC system. 

The output of the tolerance charting system consists of the working dimensions, 
the updated operation sequence, the stock removals, and the resultant dimensions. 
The working dimensions include the mean working dimensions and their toler- 
ances. The updated operation sequence is the output information only if the 
operation sequence is modified in the tolerance charting system. The optimized 
stock removals and their tolerances are used to check whether the machine cut can 
be accomplished or not. The resultant dimensions are indicated if the resultant 
dimensions are different from the blueprint dimensions although they are still 
proper to manufacture the part. 


Data Structures in the CAATC System 

The CAATC system is developed with Visual C++ on the Windows 2000/XP 
platform. According to the information involved in the system, several data struc- 
tures are designed to describe the blueprint information, the operation information, 
and the tolerance charts. 


Data Structures for Blueprint Information 

Blueprint information is the basic data required to the CAATC system. It is the 
design information describing a part. It usually includes the part shape, the dimen- 
sions, the tolerances, and the surface finish information, etc. However, the surface 
finish information is not considered in the CAATC system. The part shape is 
described with the basic geometric information, such as points and surfaces. 
A part sketch is often shown at the top of a tolerance chart in aid of analyzing the 
blueprint information. The presentation of the part sketch is different from the part 
drawing in a CAD system. It does not need to be scaled. In 2D space, the part sketch 
can be defined with the points and the surfaces that determine the delimitations of 
the part. So, a list of points and a list of surfaces should be defined. The surface is 
described with a line segment and its normal direction in this system. The line 
segment is expressed with two points, and the normal direction is described with an 
angle, which is measured from the positive x-axis to the normal of the surface. 
Here, the two points describing the surface have two facets: one is the real position 
of the surface in the part coordinate system and the other is the descriptive values 
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used to draw the surface in the sketch. So, both the position values should be included 
in the definition of the points. So, the class of the points is given as follows, 
class CPoints: : public CObject 

{ 

CString m_strName ; 
int m_nDrawX; 
int m_nDrawY; 
int m_nRealX; 
int m_nRealY; 

} 

And the surface class is defined as follows: 
class CSurface: : public CObject 

{ 

CString m_strBeginPoint ; 

CString m_strEndPoint ; 
float m_f Angle 

} 

Here, Examples of the points and surfaces files are presented for the part shown 
in Fig. 3. 

Example . ptr 
PI 10 20 
P2 10 120 
P3 60 120 
P4 10 60 
P5 120 20 
P6 120 90 
P7 120 60 

And the surface file Example. sur is shown below: 

Example . sur 
A PI P5 270 
B P4 P7 270 
C P2 P3 90 
D P3 P6 78 
E PI P2 180 
F P5 P6 0 
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An ideal CAD system should not only support the geometry and topology of the 
part in question but also its tolerance specification and functional description 
(Mullapudi and Gavankar 1994). In addition to the geometric data, the topology 
of the part sketch is also required by the CAATC system. It can also be extracted 
from the CAD file. However, the topology of the part needed in the tolerance chart 
is different from that in the CAD file. It only records the adjacent relationship 
between the surfaces. It can be generated when the surface information is inputted. 

Another blueprint information, that is, the dimension, is also obtained from the 
blueprint drawing. The blueprint dimensions are all length dimensions, because the 
angle information is described with the normal direction of the surface in the part 
sketch. Each blueprint dimension includes the following information: number, 
begin surface, end surface, dimension, and tolerance. Because the resultant dimen- 
sions are the final dimensions of the part, they are considered as one of the 
properties of the blueprint dimensions. In addition, each blueprint dimension is 
produced with several machining cuts in the process sequence. The link of the cuts 
forms a dimensional chain. So the blueprint dimension also has a chain property. 
Thus the data structure of the blueprint dimension is designed as follows: 
class CBlueprint : : public CObject 

{ 

int m__nNo ; 

CString m_strFrom; 

CString m_strTo; 
float m_nBlueprintMean; 
float m_nBlueprintTol ; 
float m_nResultantMean; 
float m_nResultantTol ; 

CString m_strChain; 

} 

Data Structure for Process Operation Information 

The tentative operation sequence is the main body of a tolerance chart. The 
operation sequence is the backbone that links up all machining cuts to manufacture 
the part. Each operation has some static attributes, such as the sequence number, the 
operation name, the machined surface, and the datum surface. The working dimen- 
sions and their tolerances are the dynamic attributes of the operation. The stock 
removals and their tolerances also belong to certain operations. For those second 
machining cuts, the stock removal is the closing link of the dimensional chain. The 
dimensional chain is one of the attributes of the operation. In the tolerance chart, 
each row of the chart represents an operation. So, the data structure of the operation 
is designed as follows: 

class COperation : : public CObject 

{ 

int m nSeqNo; 
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CString m_strName ; 

CString m_strMachiniedSurf ace ; 

CString m_strDatumSurf ace ; 
float m_nWorkingDimensionMean; 
float m_nWorkingDimensionTol ; 
float m_nStockRemovalMean; 
float m_nStockRemovalTol ; 

CString m_strChain; 

} 

Each operation may consist of several machining cuts. So it should be trans- 
ferred from an initial tentative operation sequence to a formal machining cut 
sequence, that is, the operation sequence shows all the machining cuts in the 
process plan. 


Data Structure for a Tolerance Chart 

The data structure of a tolerance chart should include all information involved. A 
tolerance chart is made up of four main components. First, a list of points is used to 
describe the delimitation of the part. Second, a list of surfaces (include square 
shouldered and angular featured surfaces) is used to describe the outline of the part. 
Third, a list of operations is used to describe the sequence of machining cuts, which 
form a tentative operation sequence that the part will be machined with. Each 
machine cut in the operation sequence is extended to have the working dimension 
and the stock removal dimension in the tolerance chart. Fourth, a list of blueprint 
dimensions is used to describe the blueprint dimensions to be controlled in the 
tolerance chart. These objects are necessary for constructing the tolerance chart. 
The manipulation of the tolerance chart is accomplished through the manipulations 
on these object lists. The interrelationship between the three lists reflects the 
complexity of the tolerance charting for angular features. 

The class of tolerance chart is defined as follows: 

class CToleranceChartDoc : public CDocument 

{ 

CTypedPtrList<CObList , CBluePrint*> m_blueprintList ; 

CTypedPtrList<CObList , COperation*> m_opeat ionList ; 

CTypedPtrList<CObList , CSurface*> m_surf aceList ; 

CTypedPtrList<CObList , Cpoints*> m_pointsList ; 

} ; 


Functional Modules of the CAATC System 

The mathematical models developed in the authors’ previous works (Xue and Ji 
2002, 2004, 2005) are used to help the process planner to determine the working 
dimensions and to allocate the process tolerances in the CAATC system. The data 
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structures of the classes involved have been introduced in last section. How these 
classes interact will be revealed in this section. 

The CAATC system is designed based on the software engineering. The data 
flow diagram of the CAATC system is shown in Fig. 4. Rectangles represent data 
processes, arrows represent data flows, and parallelograms represent data elements. 
Rhombuses indicate alternative selections. Trapezoids indicate manual inputs. 
Documents and screen shows are also included in the data flow diagram. 

Based on the data flow of the CAATC system, the interfaces between 
different modules are defined. However, the transfers of the files from CAD and 
CAPP systems are not performed in this book. All necessary information is 
inputted manually. Based on the functional requirements, seven functional processing 
modules and one auxiliary module of the CAATC system are developed as follows. 


Part Sketch Drawing Module 

The part sketch can be either obtained from a CAD file or inputted manually. The 
part sketch is represented with points and surfaces and their topological relation- 
ships. The points are inputted first. Based on the point’s information, the surfaces 
are defined with two points and an angle between the surface normal and the x-axial 
direction so that the surfaces are fully represented in 2D space. The characteristic 
values are derived from the points in the part sketch. The lines representing the 
characteristic values are drawn to the bottom of the chart. All points and surfaces 
are visually illustrated. The topological relationship of the part is generated auto- 
matically when the surface information is inputted. 


Blueprint Dimension Module 

Blueprint dimensions determine the final characteristic values of the surfaces in 
fact. The blueprint dimensions are the distances between any two characteristic 
values in the tolerance chart. In this module, each blueprint dimension and 
its tolerance are inputted manually. The corresponding characteristic values 
are designated at the same time. By setting the origin of the coordinate system, 
two additional conditions are attached, which are X = 0 and Y = 0. The relation- 
ships between the blueprint dimensions and the characteristic values are established 
in a matrix mode. By use of the Gaussian eliminating method, the initial charac- 
teristic values are obtained. The blueprint information is stored in computer. 


Operation Sequence Module 

An operation sequence can be either obtained from a CAPP system or manually 
inputted. The information of the operation sequence includes sequence numbers, 
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operation numbers, machining cut type, machined surfaces, and the characteristic 
value measured from and the characteristic value measured to. The machined 
surfaces are selected from the surface list generated in the part sketch module. 
The initial operation sequence has no working dimension and no stock removal. 
The information will be generated and updated in the procedure of manipulating the 
tolerance chart. When inputting the operation sequence information, the character- 
istic values of all surfaces are updated at each machining cut backward from the last 
machining cut to the first one. The changes are calculated with two unique trigo- 
nometric formulas as the Eq. (1). The tentative operation sequence information is 
also stored in the computer. 

For Eq. 1, the normal depth of cut is a; by deriving backward, before the 
machining cut, x will be decreased with Ax, and y will be increased with Ay. 


f 

< 



Ax = 

Ay = 


—a sin (fi) 
sin ( a — p) 

a cos (fi) 
sin ( a — P) 



Angles a and p must be reckoned from the positive x-axis to the normal of the 
surface with the right-hand rule. The two formulas are applicable to both the 
internal and external angular features and for the square shouldered features and 
even for the “V”-shaped angular features. Details can refer to author’s work (Xue 

and Ji 2005). 

Here, in order to calculate the surface change during the charting procedure, an 
algorithm is developed for calculating the surface changes due to the machining cut 
made on the surface based on Eq. E 


Algorithm SCC (Surface Change Calculation) Algorithm:. Input: List “A,” a list 
containing N characteristic values, machined surface P (xa, ya, xb, yb, angle 1), its 
adjacent surface Q (xb, yb, xc, yc, angle2), and the stock removal on machined 
surface “S.” 

Output: List “B,” the list containing n characteristic values, in which the 
changed characteristic values are updated. 


Algorithm Function SCC (A, P(xa, ya, xb, yb, anglel) , Q(xb, yb, 
xc, yc, angle2) , S) 

{ 

DelX = - S * sin (angle2 ) / sin (anglel - angle2 ) ; 

DelY = S * cos (angle2 ) / sin (anglel - angle2 ) ; 

For (1 = 0, I + +, I < N ) 

{ 

if ( A [ I ] . x = = P . xb ) ; 

{ 

B [I] ,x = A[I] .x + DelX: 
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} 

else if (A[I].y = = P.yb) 

{ 

B [I] . y = A [I] .y + DelY; 

} 

else 

B [I] = A[I] ; 

} 

Return B ; 

} 


Working Dimension Module 

The working dimensions are determined with the blueprint dimensions and the stock 
removals. The working dimensions are obtained from the absolute values by 
subtracting the characteristic values of datum surfaces from the characteristic values 
of machined surfaces in the operation sequence. The result is a set of linear equations. 
These linear equations can be neatly formulated into a matrix. The elements of some 
columns are all zeroes in this matrix, and the stock removals associated to these 
columns are “solid” stock removals. After eliminating these columns and their 
corresponding stock removals, the remained matrix is the reverse dimensional chain 
matrix. Each row of the matrix represents a reverse dimensional chain. 


Dimensional Chain Module 

The forward dimensional chain matrix is the inverse of the reverse dimensional 
chain matrix. The reverse dimensional chains are represented with a square matrix. 
With the inverse matrix function, it is easy to obtain the forward dimensional 
chains. In this module, the main activity is the matrix manipulation. The forward 
chains are illustrated in the chain column in the tolerance chart. 


Process Tolerance Allocation Module 

Based on the multi-choice of the tolerance grades that each machining cut may 
have, the process tolerance allocation is formulated as a multi-choice knapsack 
model. The objective function is to minimize the total manufacturing cost. The 
constraints include tolerances stack-ups under the blueprint and stock removal 
limits, the process capabilities, and the maximum and minimum stock removals. 
This multi-choice knapsack model is solved using a genetic algorithm. If no 
feasible solution of tolerance allocation exists, the report will show which con- 
straint is violated. If the model has a feasible solution, an optimal or near-optimal 
solution is reported. 
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Output Module 

The final goal of CAATC is to produce a complete tolerance chart. If the tentative 
operation sequence is economic to manufacture the part, it can be considered that 
the tolerance chart has been successfully constructed. Thereafter, the tolerance 
chart can be printed as a final document. 


Auxiliary Input Module 

Some auxiliary information is needed to input into the computer. The information 
of the tolerance chart layout is manually inputted. The information of the machining 
cuts, such as the machining capability, minimum and maximum stock removals, the 
process tolerance grades, the tolerance values, and the manufacturing costs, are 
stored in the computer. The information is necessary for the tolerance allocation 
module. 

Graphic User Interface of CAATC 

CAATC is an independent program that can run on a stand-alone personal com- 
puter. It can also be integrated with CAD/CAM/CAE/CAPP software applications 
by developing some interfaces between the systems. CAATC can be launched at the 
“Start” menu in Windows 2000/XP environment. 


Guide to Using CAATC 

The overall procedure for using CAATC can be simply described as follows: 

1 . Start a new tolerance chart, the system will open a blank tolerance chart. 

2. Input the part sketch step by step. The part sketch can be defined interactively. 
First, define the points one by one or import the points from a file. Second, 
define the surfaces or import the surfaces from a file based on the point’s 
information. The topology of the part is automatically generated when input- 
ting the surfaces. 

3. Derive the characteristic values. 

4. Define the blueprint dimensions from a file. 

5. Define the operation sequence from a file. This file can be edited line by line to 
modify the process operations. 

6. Determine the “solid” stock removals. Load stock removal values to the 
remaining machine cuts from a file. 

7. Derive the working dimensions. The reverse dimensional chains are formulated 
automatically. 
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Fig. 5 Workspace of the tolerance charting system 


8. Allocate process tolerances to the machining cuts optimally with the genetic 
algorithm. 

9. If a problem exists, interactively edit the process operations, the working 
dimensions, and the stock removals as necessary. 

10. If it is all right, then save the angular tolerance chart into a file. 

1 1 . Print out the complete tolerance chart. 


Graphic User Interfaces 


When the menu item “New” is clicked, a blank tolerance chart is created. At this 
time, the part sketch and the operation sequence are all blank, as shown in Fig. 5. 

Click on the Define menu, the part sketch can be inputted interactively and 
shown at the top of the chart. The part sketch input dialog has two phases, one is for 
point input and the other is for surface input, as shown in Figs. 6 and 7, respectively. 
If the information of a part has been prepared, it is suggested to retrieve the part 
information directly. The “import points” and the “import surfaces” menu items are 
used to input the points and the surfaces in groups. 

The blueprint dimension can be inputted with the dialog shown in Fig. 8. 
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Fig. 6 Points input dialog 
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Fig. 7 Surfaces input dialog 


In order to input the operation sequence, click on the menu of input operation 
single; the dialog for inputting the operation information is shown in Fig. 9. 

The user interface can be activated via the menu items, as shown in Fig. 10 or the 
system tool bar, as shown in Fig. 11. 

Once all inputting entries are complete, the resultant tolerance stack-ups are 
calculated, and the resultant lower and upper dimensions are compared with the 
design targets. If a design specification is not achievable, the relevant design 
specification dimension and all the operations that contribute to its resultant toler- 
ance stack-ups are highlighted in red color on the computer screen. It is not possible 
to progress to the next stage in the charting procedure until the process plan has 
been modified to satisfy the design specification. 

Stock removal allowances are checked next in a similar manner. If an operation 
has a tolerance stack-up greater than the allowance, the mean dimension and the 
stack-up tolerance are highlighted in the red color on the computer screen. There 
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Fig. 8 Blueprint dimension input dialog 
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Operation Name 
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Cancel 


Fig. 9 Operation sequence input dialog 

are two options available for the user to choose during checking. First the user can 
set the stock removal allowances equal to the tolerance stack-up. This feature is 
sometimes used to generate initial allowances quickly for comparison with a good 
machining practice. If this option is chosen, allowances do not need to be entered 
during input of the operations. Second, the user can override the checking for any 
individual operation and accept a tolerance greater than an allowance. This is useful 
for grinding and similar operations. If this option is chosen, the allowance for that 
operation is highlighted in red to remind the user that the checking has been 
overridden. 
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After all the information needed in an angular tolerance chart is checked, with 
the algebraic method, the dimensional chains can be shown automatically. Working 
dimensions cannot be calculated until the process plan has been verified with regard 
to the design and stock removal allowance constraints. Resultant dimensions can be 
specified at this stage. The program will re-compute resultant dimensions’ upper 
and lower limits automatically and check these dimensions against the design 
specifications. Finally, a hard copy of the completed chart can be printed, if 
required. 


Summary 

This CAATC software package can perform the most tasks of the tolerance charting 
with modern advanced algorithms and methods. With the input by users, CAATC 
can automate the design tasks that would take hours, days, or even weeks if 
performed manually. CAATC looks simple, but very complex in fact. CAATC 
gives users friendly interfaces and aids the process planner to construct the toler- 
ance chart, and uses the algebraic method to determine the dimensional chains and 
working dimensions and to implement the tolerance allocation with the genetic 
algorithm. CAATC needs to deal with much information about the machining cuts, 
work dimensions, process sequences, stock removals, resultant dimensions, and so 
on. The information is very complex. So, it is hard to construct an excellent data 
structure to describe the angular tolerance charting. These tasks must be dealt with 
one by one. After the CAATC system becomes more experienced, a more simpli- 
fied data structure should be constructed. The algebraic method used in determining 
the working dimensions is very effective. Some of the conditions are required to 
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input directly by the user interactively. Using CAATC as a tool of developing the 
tolerance charts, the time can be reduced sharply. 

However, how to integrate the CAATC software with existing CAD, CAPP, or 
CAM software packages is another pending problem to be solved. 
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Nanomanufacturing utilizes value-added processes to control materials structures, 
components, devices, and systems at the nanoscale for reproducible, commercial- 
scale production. It is the essential bridge between the discoveries of nanoscience 
and real-world nanotechnology products. In recent years, ion beam technology has 
found wide applications in nano precision or nanoscale manufacturing. Ion beam 
nanomanufacturing (IBNM) technologies mainly consist of four approaches: 
ion-induced imaging, ion milling removal, ion-induced deposition, and ion implan- 
tation or modification, which are usually performed under a stream of accelerated 
ions by electrical means. Versatile materials can be processed by IBNM, such as 
diamond, metals, thin films, polymers, and semiconductors. 

Based on the size of an ion beam source, IBNM technologies can be categorized 
into two main groups: focused ion beam (FIB) technology and broad beam 
technology. FIB has become one of the most important nanofabrication technolo- 
gies. It has been widely used to fabricate nano-devices and nano -components such 
as nano- optic devices, TEM sample preparation, and micro tools with nanometric 
cutting edges. On the other hand, broad beam technology has been widely applied 
to nano-accuracy manufacturing. For example, one typical broad beam technology 
can fabricate ultra-smooth surfaces with a surface roughness of 0.2 nm in rms by 
low-energy argon ions. 

This book will comprehensively introduce the challenges encountered in IBNM 
as well as its recent developments. The content is mainly focused on the 
working principles, technologies, and applications of ion beam manufacturing. 
Chapter 34, “► State-of-the-Art for Nanomanufacturing Using Ion Beam Technology” 
overviews the development and characteristics ion beam nanomanufacturing. 
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Chapter 35, “► Ion Beam Instruments Used for Nanomanufacturing” illustrates typical 
FIB system components, newly developed FIB instruments, and broad beam instru- 
ments for IB etching/deposition/implantation systems. 

Some newly developed nanomanufacturing technologies involving the broad 
beam approach have been introduced within the following three chapters. 
Chapter 36, “►Ion Beam Figuring Technology” discusses Ion Beam Figuring 
(IBF) developments and its challenges, including high gradient optical surface figur- 
ing, thermal effects, and super smooth surface figuring. Chapter 41, “► Plasma-Based 
Nanomanufacturing Under Atmospheric Pressure” discusses some efficient finishing 
techniques, such as plasma-based nanomanufacturing under atmospheric pressure. 
It is capable of generating a smooth surface without introducing subsurface damage. 
Moreover, how to fabricate complex ultra- smooth surfaces on brittle materials with 
high precision and high efficiency has been an important topic in nanomanufacturing. 
A novel process known as Nano Machining of Ion Implanted Materials (NilM) 
employing ion implantation surface modification for manufacturing brittle monocrys- 
talline materials is comprehensively discussed in Chapter 38, “► Nanometric Cutting 
of Crystal Surfaces Modified by Ion Implantation.” 

The remaining chapters explore the development of FIB techniques and their 
applications. Chapter 37, “►Focused Ion Beam Nanofabrication Technology” 
shows the developments involved in the nanofabrication using focused ion 
beams. Chapter 39, “► Micro Tools Fabrication by Focused Ion Beam technology” 
introduces micromachining tool sharpening with nanometer cutting edges and 
complex configurations developed by FIB nanofabrication. In chapter 40, 
“► Nano-gap Electrodes Developed using Focused Ion Beam Technology,” several 
techniques based on maskless FIB etching to fabricate nanogap electrodes for 
measuring electron transport in nanometer-scaled objects are discussed in details. 

I would like to acknowledge the authors of each chapter for their great 
contributions. Special thanks to Changzhi Gu and Wuxia Li from the Institute of 
Physics CAS, Shengyi Li and Xuhui Xie from National University of Defense 
Technology, Kazuya Yamamura and Yasuhisa Sano from the Osaka University, 
and Takashi Nagase from Osaka Prefecture University. Gratitude would also be 
kindly given to the MNMT members for their efforts, who are Yunhui Chen, Wei 
Wu, Haifeng Gao, Xuecen Shen, Tingting Gao, Feng Lin, Kang Li, and Ting Liu. 
I am particularly grateful to Zongwei Xu for conducting most of the coordination 
works and research investigations. Without his dedication, it was not possible to 
successfully complete this book in time. 

It is wished that this book will stimulate and promote further research and 
development and applications of ion beam nanomanufacturing. The book can be 
used as a text or reference book both for undergraduate students, graduate students, 
and researchers in areas related to nanomanufacturing. 
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Abstract 

Ion-beam manufacturing is developing toward nanoaccuracy and nanoscale. In 
this regard, the concept and working principle of ion-beam manufacturing in 
nanoaccuracy and nanoscale are presented in this chapter. The key techniques 
for ion-beam manufacturing are discussed with an emphasis on their capabilities 
in the fabrication of micro-/nano-features. The corresponding typical applica- 
tions involved in ion-beam manufacturing are provided. The recent develop- 
ments in ion-beam-related instruments are given as well. Finally, the future 
trends for ion-beam manufacturing are predicted. 


Introduction 

Global trends toward miniaturization have opened versatile new fields in science 
and technology. The rapid developed nanotechnology has evolved the field of 
manufacturing, namely, nanomanufacturing. Nanomanufacturing mainly focused 
on value-added processes to control materials structures, components, devices, and 
systems at nanoaccuracy and nanoscale. 

Nanomanufacturing can be classified into two main areas: firstly, nanoscale 
manufacturing which is characterized by a feature size in the range of 0.1-100 nm 
and, secondly, nanoaccuracy manufacturing which produces nanometer and even 
sub-nanometer surface quality, while the components developed usually in a large 
scale. Recently, ion-beam technologies have become increasing popular tools for the 
development of various types of nanoaccuracy or nanoscale structures. 

Nanomanufacturing has attracted more and more attention from the scientists, 
engineers, research organizations, and governments in various countries. The 
International Society for Nanomanufacturing (ISNM) was initiated in 2006 and 
launched later on by a group of leading scientists and engineers from all over the 
world in order to serve the fast-growing community. The nanomanufacturing 
programs have been founded in many countries, such as the Fundamentals of 
Nanomanufacturing founded by the National Natural Science Foundation of 

China in 2009. 

As one of the most important manufacturing approaches, the ion-beam tech- 
nology has been proved to meet versatile demands of today’s micro/nano science 
and technology. Ion-beam machining (IBM) is an important nonconventional 
manufacturing technology, using a stream of accelerated ions by electric means 
in a vacuum to remove, add, or modify atoms on the substrate. Based on the size 
of the beam source, ion-beam instruments can be categorized into two broad 
groups, the broad-ion-beam tools and the focused-ion-beam tools. The ion-beam 
technology can be further classified to three major techniques, such as ion 
etching, ion implantation, and gas-assisted ion deposition, which remove sub- 
strate materials, change materials properties, or deposit materials on a substrate 
by direct impingement of the ion beam on the target with or without chemical 
reactions (Yao 2007). 
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Fundamentals of Ion-Beam Fabrication 


Ion-Beam Machining Principle 


The ion-beam manufacturing technology has many advantages over other energetic 
particle beam technologies in applications (Yao 2007). Table 1 shows the quanti- 
tative comparison of FIB ion and SEM electron. Comparing with electrons, ions are 
much heavier so that they are capable of transferring patterns on versatile materials 
with much greater energy density, and much less proximity effects of backscattered 
energetic particles, as shown in Fig. 1. Therefore, the ion-beam technology has 
become not only a popular approach widely used in semiconductor industry for 
failure analysis and mask repairing but also a powerful tool in developing func- 
tional nanostructures and nanodevices with high accuracy and high quality. 

Focused ion beam (FIB) is normally used as the direct writing process by 
directly hitting the target substrate for the defining structures fabrication without 
additional mask requirements. When an energetic ion bombards the target surface, a 
variety of ion-target interactions would occur, as illustrated in Fig. 2. A fraction of 
the inclined ions are backscattered from the substrate surface, while the others are 
slowed down into the solid substrate. The collision of ions with substrate particles 
induces secondary processes, such as substrate particle sputtering, defect formation, 
electron excitation and emission, etc. Radiation and thermal-induced diffusion 
result in various phenomena of amorphization, mixing of constituent elements, 
phase transformation, crystallization, and so on. To understand the various phe- 
nomena, the processes of ion sputtering, ion implantation, and ion-induced depo- 
sition should be studied in advance, which are often interrelated in a complicated 
way. The working principles of ion etching, ion implantation, and gas-assisted ion 
deposition are discussed below. 


FIB Etching 

FIB etching, often called as FIB milling, is a process that combines amorphization, 
physical sputtering, and material redeposition (Ali et al. 2010). At the beginning 

i c o 

with an increase in the ion dose of the incident ions to the order of 10 ions/cm , 
amorphization or swelling effect would be dominant in the FIB irradiation area 
(Frey et al. 2003), resulting in the irradiation area higher than the substrate with 
several nanometers, as shown in Fig. 3a, d (Xu et al. 2012). With the ion dose 
further increase larger than the order of 10 ions/cirr, sputtering process would be 
dominant in the interactions, which is the most important interaction for the 
ion-beam milling. During the sputtering process, the substrate material removal 
would happen because the ion bombardment and collision cascade transfer suffi- 
cient energy from the accelerating ions to the substrate material, as shown in 
Fig. 3b, e. 

Sputtering yield is defined as the number of atoms ejected from the target surface 
per incident ion, which has been used to measure the efficiency of the sputtering 
process. Generally, heavier ion sources or lower binding -energy target materials 
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Table 1 Quantitative comparison of FIB ion and SEM electron (Courtesy of FEI Co.) 




FIB 

SEM 

Ratio 

Particle 

Type 

Ga + ion 

Electron 


Elementary charge 

1 

-1 


Particle size 

0.2 nm 

0.00001 nm 

20,000 

Mass 

1.2 x 10 25 kg 

9.1 x 1“ 31 kg 

130,000 

Velocity at 30 kV 

2.8 x 10 5 m/s 

1.0 x 10 8 m/s 

0.0025 

Momentum at 30 kv 

3.4 x 10 20 kgm/s 

9.1 x 10~ 23 kgm/s 

370 

Beam 

Size 

nm range 

nm range 


Energy 

Up to 30 kV 

Up to 30 kV 


Current 

pA to nA range 

pA to pA range 


Penetration 

depth 

In polymer at 30 kv 

60 nm 

12,000 nm 

0.005 

In polymer at 2 kv 

12 nm 

100 nm 

0.12 

In iron at 30 kv 

20 nm 

1,800 nm 

0.11 

In iron at 2 kv 

4 nm 

25 nm 

0.16 

Average signal 
per 100 
particles at 
20 kv 

Secondary electron 

100-200 

50-75 

1.33-4.0 

Backscattered 

electron 

0 

30-50 

0 

Substrate atom 

Several hundreds 

0 

Infinite 

X-ray 

0 

0.7 

0 



Fig. 1 Comparison of the penetration depths of various energetic particles commonly used in 
micro-/nanomanufacturing (Gierak 2009) 


can produce higher FIB sputter yields. Figure 4 shows the energy dependence of 
sputtering yield for Au and Si target substrates by three types of ions (Tseng 2004). 
The optimized FIB ion energy with good sputtering efficiency is in the range of 
10-100 keV, and usually the commercial FIB ion energy is chosen up to 30 keV. 
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Fig. 2 Illustration of a 
collision cascade generated 
by a 30 keV Ga + ion incident 
on a crystal lattice (Volkert 
and Minor 2007) 
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For energy higher than the order of 100 keV, the ions can easily penetrate into and 
be trapped inside the substrate where the sputtering yield would either level off or 
decrease and implantation would be dominated. 


Ion Implantation and Gas-Assisted Ion Deposition 

When energetic ions collide with solid substrate surface, parts of the ions would be 
finally deposited into the substrate surface after cascading collision, forming 
ion-beam implantation layer. Ion implantation is a high-technology approach 
using highly energetic ions beam to modify surface’s structure and chemistry of 
materials without affecting the component dimensions or bulk material properties. 
The ion implantation techniques have been a key process in the currently 
established Si-based micro-/nanomanufacturing technologies resulting from its 
well-known advantages, such as low-temperature processing, accurate control of 
distribution and concentration of dopants, etc. (Yao 2007; Fig. 5). 

Transmission electron microscopy (TEM) and cross-sectional FIB/SEM high- 
resolution observations were widely applied in the microscopical study of ion 
implantation. The sample defects induced by the focused-ion-beam implantation 
(FIB I) may range from amorphous layer to the formation of defect agglomerates 
and even intermetallic phases (Joachim et al. 2007). If the ion dose is larger than a 
threshold value of 1.4 x 10 ion/cnT, self-assembled nanoparticles with a diameter 
of 10-15 nm were formed and evenly distributed in the FIBI layer for the Ga ions into 
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Fig. 3 Swelling effect and sputtering effect during the ion-beam fabrication, (a) Swell effect 
would happen after low-dose FIB processing on substrate with crystal structure (Frey et al. 2003), 
(b) material sputtering or removal would be dominant under high-dose FIB (Reyntjens and Puers 
2001), (c) substrate volume change for 30 keV gallium ions on silicon dependent on ion dose (Frey 
et al. 2003), (d, e) star structures fabricated on the monocrystalline silicon by FIBDW. (d) Star 
with 0.75 nm swelling step with 6.7 x 10 14 ion dose; (e) star with 13 nm depth with 2.5 x 10 17 
ion dose. The scale of Z axis in (d, e) is 1.5 nm and 100 nm, respectively (Xu et al. 2012) 


silicon (Xu et al. 2009). FIBI has been used in the fabrication of nanostructures, where 
the FIBI layers can be used as the etching mask for subsequent wet etching (Qian 
et al. 2008), reactive ion etching (Coyne and Zachariasse 2008), and on-site FIB 
gas-assisted etching. FIB implantation has been used for irradiating polymeric resist 
materials to increase their resistance to subsequent physical etching (Arshak 
et al. 2004). Combining the FIBI and FIB gas-assisted etching, nano-electrodes and 
sinusoidal micro stmctures can be fabricated (Xu et al. 2009). 

FIB gas-assisted deposition process can be employed to realize the localized 
maskless deposition of both metal and insulator materials, as shown in Fig. 6. 
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Fig. 4 Sputtering yield of Au and Si target substrates varied with different ion energy by three 
types of ions at normal incidence (Tseng 2004) 
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Fig. 5 Micro-/nanostructures fabricated by FIB implantation and subsequent FIB XeF 2 
gas-assisted etching (FIB-GAE) (Xu et al. 2009) 

During the deposition process, precursor gas is first sprayed and then adsorbed 
locally on the sample surface using a fine nozzle, as shown in Fig. 6a. Then the 
injecting ion beam decomposes the adsorbed precursor gases. The volatile reaction 
products desorb from the surface and are removed by the FIB vacuum system, 
while the desired reaction products would deposit on the surface and form a thin 
film layer. 
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Fig. 6 Ion-beam-induced deposidon process, (a) Schematic diagram showing FIB-induced depo- 
sition process (Sarvesh et al. 2008). (b) Pt-deposited protection layer by FIBID for the cross- 
sectional TEM specimen preparation 


The incident ion beam decomposes the precursor gas on the surface and sputters 
or mills the materials on the surface. Therefore, the total deposition yield (T dep ), the 
number of atoms deposited per incident ion, can be expressed as a function of the 
decomposition yield (T decomp ) of the precursor gas and the sputtering yield (T sputt ) 
on the surface: 

f^dep = Udecomp f^sputt (1) 

The decomposition yield is proportional to the number of adsorbed molecules on 
the surface and the cross section for molecular dissociation. For the success of 
deposition, many beam parameters need to be adjusted properly. In general, the 
deposition rate is a combination of local gas flux or local pressure, ion-beam 
current, pattern size, dwell time per pixel, beam overlap, and raster refresh time 

(Yao 2007). 

Usually, the deposited thin film is not fully pure material, which would contain 
organic contaminants and Ga ions from the FIB (Reyntjens and Puers 2001). TEM 
images of the free-space wiring and the pillar showed that the structure developed 
by FIBID consists of amorphous carbon containing a Ga core in the wiring. The 
center position of the Ga core is located in the center of the pillar and below the 
center of the free-space wiring, which are corresponded to the ion range 
(Hemandezramirez et al. 2006; Fig. 7). 

Different kinds of materials can be deposited by altering the precursor gas source 
such as diamond-like carbon, tungsten, platinum (Fang et al. 2009), and silicon 
oxide (Utke et al. 2012). Compared with the electron-beam-induced deposition 
(EBID), the speed of ion-beam-induced deposition is much higher than that of 
electron beam due to the mass difference between the electron and ion. Therefore, 
in order to lessen the substrate damage and increase the deposition speed as well, a 
thin layer of Pt deposition is induced by EBID in advance of FIBID during the TEM 
sample preparation. 


34 State-of-the-Art for Nanomanufacturing Using Ion Beam Technology 


1287 



Fig. 7 Sn0 2 nanowire with a diameter of D = 140 nm and a length of L = 15.9 pm contacted 
with ion- and electron-assisted platinum deposition (a) Enlarged image of the area indicated by the 
red arrow in (b) (Hernandezramirez et al. 2006) 
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Fig. 8 SRIM and MD simulations of ion-sample interactions, (a) SRIM results of the moving and 
stopped recoiling atoms of the Si substrate under the Ga ions implantation. The green elements 
show the Si atoms’ movement and the red lines the Ga ions’ trajectory, (b) MD results of the Si 
(100) sample after 425 Ga + impacts, Si atoms are colored by the distance displaced from their 
initial position (Michael et al. 2008) 


Simulation of Ion-Sample Interactions 

A software package based on the Monte Carlo method, called SRIM (Stopping and 
Range of Ion in Matter), has been widely used for predicting the ion-target 
interactions for many different ions over a wide range of energies (10 eV-2 GeV 
per atomic mass unit). SRIM provides the distribution of the ions and the kinetic 
phenomena associated with the ion’s energy loss, including target damage, phonon 
production, ionization, ion reflection, implantation, and sputtering. It is indicated 
that the SRIM simulations become converged when the ion number is larger than 
500 (Tseng 2004). Figure 8a shows the SRIM Monte Carlo simulation results of 
30 keV Ga + ions interacting with the silicon substrate. 
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Besides the widely adopted SRIM simulation, molecular dynamics (MD) simu- 
lation has recently been conducted to study the working principle of ion-beam 
machining (Caturla et al. 1996), as Fig. 8b illustrated. Versatile atoms or ions have 
been simulated with incident energies from 1 keV up to 30 keV, including B, As, 
Ar + , Ga + , etc. The single crystal silicon is often chosen as the substrate material 
during the IBM MD simulation. As Ga + is widely used in the focused ion-beam 
nanofabrication, MD simulations have simulated the Ga + interactions with Si in 
recent years (Michael et al. 2008). Comparing with SRIM results, MD simulation 
can give more specific information for the ion-sample interactions, such as phase 
transformation during ion implantation, thermal annealing effects on the 
amorphous-to-crystal transformation, etc. The most important step for an MD 
simulation is the selection of the potential that describes the interactions among 
the atoms, which would predominantly determine the simulation performance. 


Ion-Beam Fabrication Techniques 

The key issue in the IBM technology is to operate an ion beam with proper process 
parameters such as ion-beam size, shape, and dwell time for removing a specified 
volume of material from a predefined localized area (Tseng 2004). 


Bitmap Patterning Method 

The FIB patterning system allows importing a bitmap file as a pattern, for which the 
dwell time for each pixel of the ion-beam location is normalized to the color 
contrast of the bitmap. A bitmap file must be saved as 24 bit bitmap files. Each 
pixel consists of a red, green, and blue component (RGB). The green component 
determines whether the beam is blank or not. When drawing a bitmap, it is 
recommended to use black (0, 0, 0) for none milling points and blue for milling 
points. The dwell time corresponding to the pixels is linearly interpolated on the 
basis of the blue component value between zero and the user-defined value. As a 
result, the ion beam at the white position has longer dwell time than the one in the 
black position. Figure 9 shows the bitmap of a Chinese poem and its milled pattern 
fabricated by FIB direct writing (FIBDW). 

The FIB milling process is performed using a precise pixel-by-pixel movement, 
as shown in Fig. 10. The time that the beam remains on a given target pixel is called 
the dwell time. The distance between the centers of the two adjacent pixels is called 
pixel spacing. Pixel spacing should be small enough to allow a proper overlapping 
between the adjacent pixels so that a smooth uniform profile can be fabricated using 
FIB milling. 

To mill a smooth profile with a constant material removal rate, the ion intensity 
rate or ion flux with respect to the scanning direction has to be uniform. To achieve 
this, the pixel spacing must be small enough to allow a proper overlap between 
adjacent pixels so that a smooth uniform profile can be fabricated (Tseng 2004). 
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Fig. 9 Nano Chinese poem a b 

fabricated by FIBDM. (a) 

Bitmap of poem, (b) poem 
nanofabricated by FIB (Fang 
et al. 2010) 




Fig. 10 Schematic diagram 
of FIB milling with pixel 
overlapping. Scanning 
procedures with arrows 
indicates the scanning 
direction of serpentine scan 
(Fang et al. 2010) 
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The ion distribution of a FIB can be approximated by a Gaussian distribution. The 
scanning ion flux becomes steady and unwavering when the normalized pixel 
spacing (pja) is smaller than 1.5, where a is the standard deviation of the Gaussian 
distribution. As for a Gaussian distribution beam, d f is equal to 2.35a, and ion 
intensity rate would be uniform if the pjd f is smaller than 0.637 (Tseng 2004). 


Optimization of Amorphization and Redeposition 

The majority of sputtered atoms (neutrals and ions) ejecting from the solid surface 
would change from a solid phase to a gas phase. Because the ejected atoms are not 
in their thermodynamic equilibrium state, they tend to condense back into the solid 
phase by collision with any solid surface nearby (Yao 2007). Usually, the atoms 
ejected from the bottom of the milled structures have a certain possibility to collide 
with the fabricated sidewalls and be redeposited, which would degrade the struc- 
ture’s sidewall slope (Ding et al. 2008), as shown in Fig. 11. 

Amorphization and redeposition would usually degrade the fabricated 
nanostructures’ dimensional accuracy and performance, which should be effec- 
tively minimized during the nanofabrication. Decreasing the FIB energy can effec- 
tively reduce the amorphization layer thickness for the structures after FIB 
nanofabrication, as shown in Fig. 12. Several strategies have been explored to 
effectively minimize and overcome the redeposition degradation, including care- 
fully controlling the FIB dwell time (Utke et al. 2012) or the FIB scanning pathway 
(Fang et al. 2010), choosing FIB gas-assisted etching (Yao 2007), etc. When FIB 
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Fig. 11 Redeposition effect during the FIB fabrication, (a) Redeposition effect during FIB 
nanofabrication, (b) Redeposition comparison for FIB milling of full edges and one opening 
edge on diamond substrate with an ion current of 2.2 nA at an ion incident angle of 0° (Ding 
et al. 2008) 



Fig. 12 Amorphization effect in FIB fabrication and high-aspect-ratio holes produced, (a) TEM 
sample preparation and thickness of the amorphous layer (Pastewka et al. 2009); (c, d ) TEM 
images of the interface between protective platinum coating and the silicon single crystal, polished 
with 2 and 5 kV beams, respectively; (b) high-aspect-ratio microhole on nickel beryllium alloy 
produced by FIB bitmap patterning (Ali et al. 2010) 


dwell time is set too long, that means FIB is moving slowly from one pixel to the 
nearby one. The redeposition effect would be a serious problem during the FIBDW 
process. For a small FIB dwell time (fast repeated scans), some fractions of the Ga 
ions are used to sputter away most of the redeposited material formed in the last 
loop. FIB etching could be effectively used for drilling a microhole with 8-10 
aspect ratio. Microhole with aspect ratio of 13.8 was produced by FIB etching on 
nickel beryllium, as shown in Fig. 12b (Ali et al. 2010). 
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Material Effect in FIB Nanofabrication 

Different types of substrate materials could be adopted in IBM in order to meet 
versatile function-oriented requirements. Single crystal diamond material is the best 
choice for the development of cutting tool with sharp edge; Au or Ag thin film is 
widely adopted in the nano-optics research resulting from their good dielectric 
properties. Many investigations have been focused on how to fabricate 
nanostructures on different substrate material with high accuracy and consistency. 

If the target material is a crystalline material where atoms are orderly arranged, 
incident ions may penetrate much deeper along the low indexed axis of crystalline. 
The penetration depth can be several times of that for amorphous target. Such 
phenomenon is called ion channeling effect. Ion channeling can occur when the 
angle of incidence of the ion beam, y/, with respect to a plane or an axis of the 
crystal is less than a certain critical angle, y / c , which can be approximately defined 
in Eq. 2 (Kempshall et al. 2001): 



3A 2 Z\Z2 

Ed 2 




where Z 2 is the atomic number of the target material, is the atomic number of ion, 

E is the energy of the incident ion, d is the distance between atom positions along 
the index direction, and A is the Thomas-Fermi screening length (Kempshall 
et al. 2001). If y/ > y / c , the ion has a too large transverse energy (vertical to the 
ion injection direction) to be trapped in the transverse potential (to be channeled). 
On the contrary, if y/ < yr c , the ion would not have enough transverse energy to 
overcome the potential barrier and would be trapped in the transverse potential 
well. Under this condition, a channeled particle would be guided along a plane or an 
axis of the crystal, as shown in Fig. 13. 

The ion channeling contrast shows a clear change of milling properties in Cu 
(Kempshall et al. 2001) and diamond (Picard et al. 2003), as shown in Fig. 13. 
When the ion channeling contrast across the grain boundary is not uniform, 
differential FIB sputtering yield as well as trench wall slope from redeposition 
would come out. The main reason for the sputtering yield variation is that the 
channeled ions undergo mostly electronic energy losses as opposed to nuclear 
energy losses and are able to penetrate deeper into the crystal lattice (Kempshall 
et al. 2001). To minimize the channeling effects during the IBM, fabrication 
techniques should be carefully controlled, including gas-assisted etching (Yao 
2007), incidence angle adjustment of the ion beam (Kempshall et al. 2001), and 
overlap control (Xu et al. 2012). 

FIB patterning forming nanowire pattern on thin film substrate can be applied to 
nano-optics, thermo-nanosensors, interconnects, etc. A series of experiments of Au 
nanowires fabricated by FIB on 50 nm-thickness thin film substrate show that 
structural instabilities would happen for the nanowires with a width less than 
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Fig. 13 Channeling effect during the FIB nanofabrication, (a) Schematic diagram of channeling 
effect, (b) Channeling effect for the FIB fabrication on the single crystal diamond substrate 
(Xu et al. 2010) 
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Fig. 14 Cross section of the Si x N y membrane with FIB-etched gold nanowire; (b-d) are the FIB 
fabrication results of a gold nanowire with width of 200 nm, 50 nm, and 20 nm, respectively (Naik 
et al. 2011) 


50 nm, as shown in Fig. 14 (Naik et al. 2011). The nanostructure form accuracy 
fabricated by IBM on thin film substrate can be effectively improved by adding a 
several nm thick Cr or Ni underlayer to enhance the surface adhesion, by increasing 
the Au thin film thickness (Lin et al. 201 1). A novel method of FIB nanofabrication 
in advance with thin film coating was proposed by which nanogaps with a width 
less than 20 nm of Au film can be fabricated uniformly, as shown in Fig. 15 (Gao 
et al. 2012). 

Researches show that nanostructures with a width much narrower than the FIB 
beam diameter can be fabricated by precisely terminating the etching step (Nagase 
et al. 2005), and film pre-coating method (Menard and Ramsey 2011). Maskless 
nanogap electrode has been fabricated by Ga FIB direct writing technique. The 
etching steps are monitored in situ by measuring a current fed through the substrate 
films and terminated electronically by a preset current (Nagase et al. 2005). 
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Fig. 15 Au film nanostructures developed by FIB pre-milling followed with Au thermal evapo- 
ration. (a) Schematic images of the process of FIB pre-milling followed with Au thermal 
evaporation, (b, c) nanostructures fabricated by FIB on Si substrate before and after Au thin 
film coating (Gao et al. 2012) 
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Fig. 16 SEM images of nanostructures less than 10 nm fabricated by the FIB. (a, b) 5 nm-wide 
nanogap formed by FIB (Nagase et al. 2005). (c) Top-view SEM image of a sub-5 nm-wide 
nanochannel milled into a quartz substrate through a 300 nm Cr layer (Menard and Ramsey 2011) 


Figure 16b shows the 5 nm-wide nanogap formed by FIB with a beam size of 12 nm 
on double-layer films consisting of 10-30 nm-thick Au top conducting layer and 
1-2 nm-thick Ti bottom adhesion layer. 


FIB Fabrication of 3D Nanostructures 

IBM has been widely applied to fabricate complex surface structures. The 
nanofabrication of 3D complex structures in micro-/nanoscale has been investi- 
gated with FIB, including micro hemispherical, parabolic, and sinusoidal features; 
nanoscale helix structure; micro Fresnel lens (Xu et al. 2010); nano-net for 
bio-experiments (Reo et al. 2005), etc. 

Figure 17 shows the TRIM simulation and experimental results of the dependence 
of the sputtering yield on the ion incident angle. Two types of ion, Ga and As, in 
sputtering of Au and Si substrates are considered. For all cases considered, increasing 
the ion-beam incidence angle would increase the sputtering yield until it reaches its 
maximum near 80°; then it decreases very rapidly to zero as the incident angle 
approaches 90° (Tseng 2004). Because the amount of sputtering is dominated by 
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Fig. 1 7 Angular dependence of sputtering yield in IBM. (a) Angular dependence of sputtering 
yield of 30 keV As and Ga ions on Au and Si substrates (Tseng 2004), (b, c) TRIM results of 30 kV 
Ga ions interaction with carbon substrate under different IB incidence angle of 0° and 80°, 
respectively 


surface collision cascades, as the angle of collision between the ions and target atoms 
increases from normal incidence, the possibility of the target atoms escaping from the 
surface during the collision cascades increases and eventually results in an increased 
sputter yield, as shown from the TRIM simulation results Fig. 17. 

Because ion incident angle would change gradually during the curve structures 
fabrication, the angular dependence of sputtering yield gives a big task for achiev- 
ing high form accuracy of the 3D micro -/nanostructures developed by IBM, as 
illustrated in Fig. 18. In order to fabricate 3D profile with a high accuracy, practical 
methods can vary the ion dose per pixel with a suitable dwell time (Adams and 
Vasile 2006), two-dimensional slicing by a slice approach (Fu and Bryan 2004), etc. 
A chirp measurement standard has been developed by FIBDW for verifying the 
overall functional performance of surface texture-measuring instruments (Fujii 
et al. 2011), as shown in Fig. 18. It has been used to evaluate the performance of 
laser scanning microscope with the response of varying slope angles up to 45°. 

To fabricate complex surfaces, investigations have been conducted to develop 
high-accuracy and high-performance microcutting tools by FIB fabrication 
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Fig. 18 Curve structures fabricated by FIB. (a) Illustration of ion incident angle would change 
gradually during the curve structures fabrication (Adams and Vasile 2006); (b) microscope image 
of the fabricated chirp sample surface; (c) comparison of profiles between designs measured with 
AFM (Fujii et al. 2011) 

(Picard et al. 2003; Ding et al. 2009; Xu et al. 2010). Various functional complex 
surfaces and structures can be fabricated based on the microcutting tools developed. 
The FIB would produce sharp edges on the side of facets furthest from the ion 
source, while the facet edge closest to the ion source is rounded because of the 
FIB’s Gaussian intensity distribution. Micro-tools with nanometric cutting edges 
and complicated shapes could be fabricated by controlling the tool facet’s orienta- 
tion relative to the FIB and FIB bitmap patterning method. The tool edge radius of 
15-30 nm is achieved for the nanoremoval of the work materials. High- 
performance micro-diffractive optical elements (DOE) have been fabricated by 
ultraprecision machining using hemispherical microcutting tools shaped by FIB, as 
shown in Fig. 19 (Xu et al. 2010). Experiments show that the proposed method can be 
a highly efficient way in fabricating micro -DOE with a nanoscale surface finish. 

Complex three-dimensional (3D) micro-/nanostructures have been successfully 
fabricated by FIBID. Arbitrary 3D structures such as wineglasses, bellows, and 
coils can be fabricated. Micro- wineglass with an external diameter of 2.75 pm and a 
height of 12 pm has been fabricated by FIBID on human hair. Nano-net structure 
was fabricated with FIB-CVD using C 14 H 10 as precursor gas source, as shown in 
Fig. 20. Polystyrene microspheres with a diameter of 2 pm have been captured 
using the nano-net under the optical microscope observation. The nano-net and 
other nano-tools fabricated by FIB-CVD would be potentially powerful tools for 
operations and analysis in the bio-experiments (Reo et al. 2005). 



1296 


F. Fang and Z. Xu 



Fig. 19 Diffractive optical elements fabricated by hemispherical microcutting tools, (a) Hemi- 
spherical micro tool, (b) micro Fresnel optical component, (c) morphology of the micro Fresnel 
lens, (d) diffraction rings (Xu et al. 2010) 

Although 3D nanofabrication has been established by FIB ID using a hydrocar- 
bon precursor, the FIBID method still has many limitations. Therefore, a novel 
method of producing 3D nanostructures was proposed based on FIB stress- 
introducing technique (FIB -SIT) (Xia et al. 2006). In the FIB stress-introducing 
method, a damaged layer is formed in the milled area near the thin film substrate 
material surface, where the tensile stress and strain in the damaged layer are 
employed to fabricate 3D nanostructures. Figure 21 shows the FIB stress- 
introducing process and 3D nanohelix developed by the FIB -SIT technology. 
With the advantages of high flexibility and controllability, the FIB -SIT is a prom- 
ising technique to fabricate various 3D nanostructures and implement 3D assembly, 
which might find broad applications in the biomedical sensing systems and nano- 
electromechanical systems. 


Surface Polishing and Finishing Technologies with Nanoaccuracy 

The classical process chain for the development of precision optical component 
consists of three steps. Firstly, the generating process includes precision cutting, 
grinding, or milling, which is called optical surface forming (shaping). Secondly, 
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Fig. 20 3D nano-tools developed by FIBID. (a) Concept of 3D nano-tools for bio-experiments, 
(b, c) nano-net before and after capture microspheres (Reo et al. 2005) 
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Fig. 21 Nanohelix fabricated by FIB stress-introducing process, (a) Schematic of the FIB stress- 
introducing process. Cantilever curling happens at the part where FIB damages the surface layer, 
(b) 3D nanohelix structure fabricated by FIB stress-introducing process with 750 nm diameter and 
1,980 nm pitch; the substrate is Si 3 N 4 thin film with 165 nm thickness (Courtesy of Prof. Wu and 
Dr. Xu) 


the polishing process is used to remove surface roughness, subsurface damage, and 
surface error. While the widely adopted chemical mechanical polishing (CMP) 
process is difficult or time-consuming to reach the specified surface accuracy 
resulting from tool wear and force loading of workpiece, etc. Finally, the surface 
finishing process, which is a necessarily additional step to deterministically correct 
the surface contour to its desired accuracy. In this process, the typical machining 
methods include ion-beam figuring (IBF), magnetorheological finishing (MRF), 
and so on. In this book, advanced surface polishing and finishing technologies of 
IBF and atmospheric-pressure plasma manufacturing would comprehensively 
introduce in chapters ► Ion Beam Figuring Technology and ► Plasma-Based 
Nanomanufacturing Under Atmospheric Pressure, respectively. 
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Fig. 22 Flow of ion-beam figuring {IBF) process (Courtesy of XH Xie and SY Li) 

The basic flow of ion-beam figuring (IBF) process is shown in Fig. 22. Prior to 
any figuring, the material removal function must be determined for the process that 
it is the base of deterministic figuring process. The material removal function 
(or called beam function) provides a depth removal rate distribution as a function 
of radial distance from the ion-beam center. 

In the IBF process, the material removed and its distribution are represented by 
the convolution of the removal function and the dwell time 


R (x, y) = B (x, y) * T(x, y) (3) 

where R(x, y ) is the material to be removed, B(x , y) is the material removal function, 
and the T(x, y) is the dwell time. When the material to be removed and material 
removal function of ion beam are known, the dwell time can be obtained by 
de-convoluting the Eq. 3. The way to gain this function and the IBF techniques’ 
details will be discussed in the chapter ► Ion Beam Figuring Technology. 

Atmospheric-pressure plasma manufacturing technologies are very promising to 
fabricate optical components and substrates for electronic devices with high form 
accuracy and high efficiency, including numerically controlled plasma chemical 
vaporization machining (NC-PCVM), numerically controlled sacrificial oxidation 
(NC-SO), and plasma-assisted polishing. 

NC-PCVM technology can be applied in the thickness correction of SOI 
and quartz crystal wafers by numerically controlled atmospheric-pressure plasma 
etching (Mori et al. 2000), which can obtain thickness uniformity with nanometer- 
level accuracy without introducing subsurface electronic defects. Thickness 
correction by numerically controlled plasma chemical vaporization machining 
can be performed in accordance with the following procedures: (1) The thickness 
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Fig. 23 Schematic of 
experimental setup for 
plasma-assisted dry polishing 
(Yamamura et al. 2011) 



distribution of the specimen is measured and the thickness error distribution is 
calculated by subtracting the target thickness from the measured thickness. 
(2) Because the removal or oxidation volume is proportional to the dwelling time 
of the plasma, the distribution of the scanning speed of the XY table is determined 
by the deconvolution of the thickness error distribution and the removal function 
that is a removal spot obtained in unit time, as discussed in chapter ► Plasma-Based 
Nanomanufacturing Under Atmospheric Pressure. (3) Numerically controlled 
thickness correction is performed by raster scanning, as shown in Fig. 23, to 
deterministically obtain a wafer of uniform thickness (Yamamura et al. 2008). In 
this NC-PCVM correction process, the reproducibility of the removal function are 
superior to those of the mechanical machining process, because there is no wear of 
the electrode and the process gases are uniformly supplied to the plasma region. 
Furthermore, the removal function in NC-PCVM is insensitive to external distur- 
bances, such as the vibration and thermal deformation of the machine and specimen 
due to the noncontact removal mechanism. 

A novel dry polishing technique named plasma-assisted polishing (PAP) is 
employed to combine the atmospheric-pressure plasma process with fixed abrasive 
polishing (Yamamura et al. 2011). In this technique, the irradiation of reactive 
plasma modifies the surface of a hard material to form a soft layer, and the 
subsequent polishing using a soft abrasive preferentially removes the soft layer, 
as shown in the Fig. 23. In contrast to chemical mechanical polishing (CMP), PAP 
is a dry process without the use of slurry and/or chemicals. Therefore, PAP 
technique is promising as an environmentally benign finishing process for difficult- 
to-machine materials. 

The methodologies and experimental results for functional materials by 
atmospheric-pressure plasma manufacturing technologies are comprehensively 
described in the chapter ► Plasma-Based Nanomanufacturing Under Atmospheric 
Pressure. 
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Typical Applications of IBM 

Applications in Semiconductor Sector and Material Studies 

FIB direct writing technology was firstly applied in semiconductor sector and material 
studies, including photomask repairing (Yao 2007), IC chip failure analysis (Utke 
et al. 2012), TEM sample preparation (Fang et al. 2011; Joachim et al. 2007), etc. 
Based on the capabilities of FIB milling and FIB -induced deposition Matsui (2006), 
FIB technologies have become one of the most important analysis methods applied in 
s emiconductor sector and material studies, as shown in Figs. 24 and 25. 


Functional Nanoscale Device Fabrication 
Functional Nano-optics Device 

Nano-optics studies the behavior of light on the nanometer scale. A wide variety of 
novel optical properties would appear in nano-optics, such as strongly enhanced 
light transmission, beyond-diffraction limit, etc. There is a rapid development in 
nano-optics based on surface plasmon polaritons (SPPs) in recent years. Surface 
plasmon (SP) is a surface electromagnetic wave existing in the metal and dielectric 
interface, which has a large wave vector. Surface plasmon polaritons can be excited 
by the nanostructures on metal film such as Ag and Au, providing wave vector that 
offsets the mismatch value between SPs and light. When light irradiates the rough 
surface, the SPs can be excited to a higher level and then be coupled with the light. 

The resolution of conventional optical system is governed by the light diffraction 
limit. Great effects of how to overcome the diffraction limit have been extensively 
made. Recently, nano-optics investigations on surface plasmons have been 
extended to circumvent the diffraction limit as they have shorter wavelengths 
(Pan et al. 2011). Due to its high resolution and high flexibility, FIB fabrication 
method has been widely adopted in the fabrication of nano-optical elements. 

Luo firstly proposed a novel nanolithography technique (Luo 2004): sub- 100 nm 
lines were patterned photolithographic ally using surface plasmon polariton inter- 
ference in the optical near field excited by a wavelength of 436 nm. The metallic 
mask pattern on the illuminated side collects light through SPP coupling, and 
SPP on the exit side of metallic mask redistributes the light into nanoscale 
spatial distribution, which can be used to fabricate nanostructures, as shown in 
Fig. 26. Zhang’s group proposed a novel multistage scheme that is capable of 
efficiently compressing the optical energy at deep sub-wavelength scales through 
the progressive coupling of propagating surface plasmons and localized surface 
plasmons (Liang et al. 2011). Combining this scheme with air bearing surface 
technology, a plasmonic lithography with 22 nm half-pitch resolution with scan- 
ning speed of up to 10 m/s is demonstrated as shown in Fig. 27. The low-cost 
scheme has the potential of higher throughput than current photolithography, and it 
would potentially open a new approach toward the next-generation semiconductor 
manufacturing. 
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Fig. 24 IC chip modification 
by FIB deposition and milling 
(Courtesy of FEI Co.) 





Fig. 25 TEM sample preparation using FIBDW. Structural changes of the surface layer, (a) SEM 
observation and (b) TEM observation of the ion modified layer (Fang et al. 2011) 


As the core elements in the nano-optics investigations mentioned above, the 
photomask nanostructures and plasmonic less on metal film of Cr or Ag should be 
fabricated with high fabrication accuracy and high uniformity. Comparing with the 
ordinary photomask fabrication methods using the electron-beam or the laser 
lithography process, FIBDW method has advantages of combining direct writing 
with high resolution in generating nano-photomasks. 

In surface plasmonic structure-based photolithography systems, the reported 
experimental results were obtained at near field with about tens of nanometer gap 
between the plasmonic lens and substrate surface (Fang et al. 2005), and the gap is 
very difficult to precisely control in practical application. Therefore, if the effective 
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Fig. 26 Nano-photomask used in nanolithography and developed by FIBDW. (a) Schematic 
representation of surface plasmon polaritonic lithography, (b) simulation of the near-held lithog- 
raphy by surface plasmon polariton with symmetric mask (Luo 2004), (c) SEM picture of the 
100 nm lithography results with 300 nm pitch mask, (d, e) 40 nm and 32 nm line width mask for 
near-held lithography developed by FIBDW (Fang et al. 2010) 
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Fig. 27 Plasmonic lens with beyond-diffraction limit developed by FIBDW. (a) A SEM image of 
hexagonally packed multistage plasmonic lens (MPL) array. The enlarge SEM picture on the 
right-top of (a) shows a MPL consisting of a dumbbell-shaped aperture, a set of ring couplers (two 
inner rings) and a ring rehector (the outer ring), fabricated on a 60 nm thick Cr him. (b) A 
plasmonic hying head uses advanced air bearing surface technology to maintain the gap between 
the lenses and the substrate at 10 nm at a linear scanning speed of 10 m/s. (c) AFM image of 
closely packed dots at 22 nm half pitch on the thermal resist (Liang et al. 2011) 


working distance between the plasmonic lens and substrate surface could be 
extended to several microns and even longer, this would make the practical control 
and operation process much easier. Plasmonic lens with extraordinarily elongated 
depth of focus (DOF) has the unique features in practical applications, such as 
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Fig. 28 Images of the plasmonic structures fabricated by FIB direct milling technique, (a) SEM 
image of the plasmonic lens, (b) 3D SNOM result of the plasmonic lens (Liu et al. 2011) 


nanoscale imaging and lithography (Fu et al. 2010). 3D image of cells or molecular 
is possible to be obtained using plasmonic lens in bio-imaging systems, such as 
confocal optical microscope. 

In order to realize superfocusing function, plasmonic lenses of metallic 
subwavelength structures with chirped (variant periods) slits and nanopinholes 
were put forward. Fu group has proposed a novel plasmonic lens with a micro- 
zone plate-like (PMZP) structure in which a thin him of Ag is sandwiched between 
air and glass, as shown in Fig. 28 (Liu et al. 2011). However, the ring-based 
structures have higher sidelobes. To suppress the sidelobes and further improve 
the focusing quality, a circular and elliptical hole-based plasmonic lens were put 
forth (Fu and Zhou 2010). 

Surface-Enhanced Raman Scattering (SERS) 

Raman scattering can be used to identify the target molecules in chemical and 
biological systems because every molecule has its unique Raman spectmm. 
Unfortunately, the Raman scattering cross section and signal intensity are very low. 
Therefore, the applications in bio-sensing had been limited until the 1970s, when 
researchers indicated that the magnitude of Raman scattering signal can be largely 
enhanced by roughened noble metal surface (Fleischmann et al. 1974). This phenom- 
enon was named as surface -enhanced Raman scattering (SERS). SERS has attracted 
considerable attention since it can accurately identify extreme small sample volumes 
and even single molecules. Recently, focused-ion-beam direct writing (FIBDW) has 
found applications in the SERS substrate’s development because of its high accuracy 
and uniformity, which are beneficial to the SERS research and applications. 

It is widely accepted that most of the SERS gain originates from the molecules 
situated at nanoscopic “hot spots,” typically formed at the junctions between nano- 
sized particles, sharp protrusions, or holes because these are the regions where 
the electromagnetic held intensity can be enhanced greatly. SERS has a close 
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Fig. 29 SERS substrate by FIB nanofabrication combined with Au film coating, (a, b) SEM 
images of elliptical nanostructures coated with different Au thicknesses of 10 nm and 20 nm, 
respectively, (c) SERS spectra of 10 -6 mol/L R6G for the substrates coated with different film 
thicknesses (Gao et al. 2012) 

relationship with the typical configuration of the nanostructures developed, such as 
their size, shape, and spacing Wang et al. (2011). Tremendous electromagnetic 
enhancement (EE) is associated with the excitation of localized surface plasma 
resonance (LSPR) that is confined to the metal nanostructures. Investigations show 
that sharpening the nanostructure’s tip and decreasing the nanostructures’ spacing 
during FIB nanofabrication would have a great contribution to the electric enhance- 
ment (Min et al. 2008; Lin et al. 2011). Min fabricated a double-hole SERS 
structure which provides two sharp apexes with a radius of curvature of ~20 nm 
at the tip (Min et al. 2008). Lin proposed four configurations of SERS substrate. 
Due to the availability of multiple edges and small curvature, hexagon-like micro-/ 
nanostructures induced a higher electromagnetic mechanism (EM) and better SERS 
performance. It is found that the SERS performance would be exponential increase 
as the nanostructures’ spacing decrease from 40 nm to 22 nm. The “hot spot” effect 
was greatly reduced as the spacing is larger than 40 nm. 

SERS substrates can be developed by patterning nanostructures on Si substrates 
using FIBDW technology following with precise thermal evaporation of gold film 
on the substrate (Gao et al. 2012). This method shows a great potential for the 
higher performance SERS substrates development, which can further reduce the 
spacing among hot spots and achieve smaller nanogap less than 20 nm, as shown in 
Fig. 29. The effect of SERS on the substrate was systematic investigated by 
optimizing the processing parameters and the gold film thickness. The gold film 
thickness’ variation shows great effects on the SERS performance; for the elliptic 
nanostructures on Si substrates, coating with approximate 15 nm-thickness gold 
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Fig. 30 Concentric arcs SERS substrate developed by FIB. (a, b) Scanning election microscopy 
images of the SERS substrate containing eight concentric arcs, (c) near-held scanning optical 
microscopy (NSOM) images of the SPP distribution in samples containing eight arcs for incident 
light with wavelength of 647 nm polarized parallel to the symmetry axis (Bahns et al. 2007) 


film has exhibited an optimization performance. Dramatic performance degradation 
was found as the gold film thickness further increased, which most probably 
resulted from changes and degradation of the nanostructures’ “hot spots” of ellip- 
tical tip and spacing (Gao et al. 2012). 

Besides LSPR having a great contribution to SERS performance, some studies 
found that surface plasmon polaritons (SPPs) could also contribute to the Raman 
enhancement signal (Bahns et al. 2007). Argonne National Laboratory developed a 
concentric arcs nanostructure by FIB nanofabrication and obtained high electric 
enhancement at the center due to the focusing of propagating surface plasmons on 
flat metal surfaces, as showed in Fig. 30. It is found that increasing the arc number 
would improve the SERS performance accordingly. The results show that focusing 
of propagating surface plasmons on flat metal surfaces may be an alternative for 
achieving enhanced Raman scattering. 


Nano-holes and Nanochannels for Bio-X 

Nanopores or nanochannels fabricated by IBM have been used for the rapid electric 
detection and characterization of single biopolymer molecules (Chih et al. 2006; 
Menard and Ramsey 2011; Norman and Theodore 2012). A focused ion-beam 
sculpting or scanning technique was used to engineer nanopores precisely (Chih 
et al. 2006). Based on this technology, it can tune pores of various sizes and 
irregular shape to symmetric and shrunk nanopores simply by scanning the mem- 
brane surface with FIB . 
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Fig. 31 Nano-hole applications for Bio-X. (a) Electron microscopy of nanopores with different 
diameters and methods (Chiu and Christopoulos 2012). (b) Resistive-pulse technique based on 
Bio-X nanopores for detecting and characterizing the binding of antibodies to virus particles. 
Detection of virus particles before the addition of antibodies: single virions passing through the 
conical pore cause a transient reduction in current (resistive pulse) as shown by the spikes in 
the current trace. The dotted line represents the mean of a Gaussian curve fit to the distribution of 
the peak amplitudes of the events, and the average current passing through the pore for all 
experiments was 140 nA. (c) Detection of virus particles after addition of antibodies: binding of 
antibodies to the virus increases the volume of the particle leading to an increase in the peak 
amplitude (Uram et al. 2006) 


Based on the developed nanopores or submicrometer pores, resistive-pulse sensing 
methods have been proposed for the detection, characterization, and quantification of 
the antibodies binding to virus particles. Single virions passing through the conical 
pore cause a transient reduction in current (resistive pulse) in the current trace, while 
binding of antibodies to the virus would increase the volume of the particle, leading to 
a sensitive increase in the peak current amplitude, as shown in Fig. 31. 

The nanochannel devices fabricated by IBNM can be applied in single-molecule 
studies of DNA transport, where fluorescently stained double-stranded DNA 
molecules are electrophoretically driven through FIB -milled nanochannels of 
various sizes (Menard and Ramsey 2011). Figure 32c shows individual frames 
from a recorded time series of the transport of a molecule of }i-phage DNA 
through a 50 pm-long nanochannel with a 50 nm x 50 nm cross-sectional area. 
DNA molecules are transported smoothly through the nanochannels as small as 
15 nm x 15 nm with little evidence of adhesion. 


Functional Nanoaccuracy Device Fabrication 

Single crystal silicon’s brittle nature at ambient temperature prevents it from 
producing complex features and optical quality surfaces. Ultraprecision cutting 
with diamond tools has been suggested as a replacement fabrication method, 
which can make the production of silicon components with nanometric surface 
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Fig. 32 Nanochannel devices applied in single-molecule studies of DNA transport, (a, b) are 
nanochannels milled in Si substrates ^35 nm in width and depth before and after a thin A1 layer 
removal, respectively, (c) Series of frames recorded every 3 ms during the translocation of a 
single molecule of fluorescently stained ^-phage DNA through a 50 nm channel (Menard and 
Ramsey 2011) 

finish and submicron level geometry accuracy possible by ductile cutting under 
certain parameters. However, the short tool life and surface fractures are still the 
serious problems to be solved in achieving nanometric surfaces of silicon compo- 
nents (Fang and Venkatesh 1998; Fang and Chen 2000; Fang et al. 2005, 2007). 

A novel method of ion implantation surface modification (NilM) for cutting 
single crystal silicon was proposed in 201 1 (Fang et al. 2011). It was proposed to 
modify the mechanical properties of the substrate’s surface layer using ion 
implantation method, which provides a possibility to reduce surface fractures, 
prolong cutting tool life, and increase the machining efficiency during the cutting 
process. The Si substrate’s implantation using F ion with 10.0 MeV energy 

14 o 

and 1 x 10 ion/cnr ion dose can produce a modification layer in several 
microns depth, as shown in Fig. 33. The studies show that the modified silicon 
performs a deeper ductile-brittle transition point of 924 nm than the 236 nm of 
normal silicon by taper cutting experiments. Nanometric surface with a surface 
roughness of about 0.861 nm in Ra can be achieved by ultraprecision 
cutting of the modified Si substrate. The surface roughness kept almost constant 
at a low level during the 6.5 km cutting is achieved. The feed marks of cutting on 
the machined surface are clearly displayed, and there are no apparent fractures 
and micro cracks, which demonstrate a ductile material removal mode. The 
experimental results prove that the method is viable to fabricate complex 
silicon surface geometry with nanoaccuracy and prolong tool life, as shown in 

Figs. 33 and 34. 

Silicon carbide (SiC) is a promising next-generation semiconductor power 
device material for high-power and high-temperature applications, resulting from 
its excellent properties, including excellent thermal conductivity, wide energy band 
gap, high saturated electron drift velocity, and good chemical stability. To apply 
SiC as a material for high-performance electronic devices, an atomically smooth 
and damage-free surface is essential. However, because of its hardness and chem- 
ical inertness, SiC is one of the most difficult-to-machine materials, where diamond 
abrasives are generally used in the slicing and lapping processes of SiC. These 
mechanical removal processes inevitably introduce micro scratches and subsurface 
damage to the workpiece surface, which deteriorate surface integrity. 
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Fig. 33 Fundamentals of ion implantation surface modification for Si substrate, (a) Implantation 
results of cross-sectional SEM image and TEM image of the modified layer and the electron 
diffraction pattern, (b) Load-displacement curves during nanoindentation for Si before and after 
ion implantations, (c) 3-dimensional MD model system of ion implantation and nanometric 
cutting, (d) RDF results of the lattice transformation during ion implantation from MD (Fang 
etal. 2011) 


A 4H-SiC (0001) surface was processed by plasma-assisted polishing (PAP) 
using a Ce0 2 abrasive, and an atomically smooth surface structure without lattice 
strain was obtained (Yamamura et al. 2011). The irradiation of water vapor plasma 
reduced the hardness of the SiC surface by about one order of magnitude, which 
enabled to obtain an atomically smooth surface with a step and terrace structure 
without introducing scratches or crystallographical subsurface damage by using a 
soft ceria abrasive, as shown in Fig. 35. 


IBM Instruments 

Ion-beam systems have become increasingly popular due to the capability to 
develop various types of nanostructures. Based on the size of the beam source, 
ion-beam instruments can be catalogued into two broad groups of broad ion beam 
and focused ion beam. The restrictions encountered during the IBM applications 
would promote the ion-beam instrument development. 
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Fig. 34 Ion implantation surface modification (NilM) method for brittle material cutting, (a, b) 
are the AFM measurement of the ductile-brittle transition depth after NilM. (c) Fabricated GaP 
micro-pyramids, (d) Mirrorlike surface: aspheric lens (Fang et al. 2011) 



Fig. 35 Microscopic interferometer images of processed 4H-SiC wafer, (a) The as-received 
wafer (4.41 nm P-V, 0.62 nm rms), (b) the surface polished by ceria abrasive (5.49 nm P-V, 
0.57 nm rms), (c) the surface processed by PAP (1.89 nm P-V, 0.28 nm rms) (Yamamura 
et al. 2011) 


For example, the FIB machining efficiency is generally lower than other 
fabrication methods. How to increase the FIB efficiency without degrading its 
resolution is a key topic for the research and development in FIB instruments. In 
order to improve the FIB fabrication efficiency, the maximum FIB current has 
been increased from 20-65 nA in the new developed commercial FIB systems. 
Moreover, a projection focused-ion-multibeam (PROFIB) system has been pro- 
posed using multi-ion beam which is emitted from the multi-ion source to 
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enhance the productivity of FIB technology (Hans Loeschner et al. 2002). 
The typical and newly developed IBM instruments, including FIB/SEM 
dual-beam system, SEM/FIB/Ar triple-beam system, and helium ion microscope, 
would be comprehensively introduced in chapter ► Ion Beam Instruments Used 
for Nanomanufacturing. 

Dual-beam focused-ion-beam system and some further development for the 
function extension are briefly introduced, for instance. 


Advantages of Dual-Beam Focused-lon-Beam System 

The most dramatic improvement to the focused-ion-beam system was the 
addition of a secondary column, an electron column, to create the dual-beam 
FIB/SEM instrument. This dual-beam system, having both ion and electron beam 
focused in coincidence on the sample, has enabled simultaneous milling and 
imaging of samples. These advantages are very beneficial for the IBM applica- 
tions. With a Slice and View software, and similar programs offered by FIB 
manufacturers, automated and sequential milling of samples dramatically reduces 
the user and instrument time while enabling a three-dimensional investigation of 
the sample (Grandfield and Engqvist 2012). The benefit from the dual-beam 
system, three-dimensional structural analysis, has been demonstrated on 
biomaterial tissue interfaces (Giannuzzi et al. 2007) and block copolymer 
(Mitsuro et al. 2007), as shown in Fig. 36. Three-dimensional (3D) internal 
structure of a poly (styrene-block-isoprene) block copolymer can be observed 
at sub-micrometer levels. The size of the reconstructed image by FIB/SEM was 
6.0 x 6.0 x 4.0 pnT, which was greater than the transmission electron 
microtomography data by a factor of about 40. The results indicate that 3D 
reconstruction using FIB/SEM technique is quite useful for direct 3D observa- 
tions for different applications. 


Function Extension of FIB Instruments 

A nanomechanical testing setup was established to conduct a quantitative in 
situ nanomechanical test (Gianola et al. 2011), where specimen manipulation, 
transfer, and alignment were performed using a nanomanipulator, independently 
controlled positioners, and focused ion beam, as illustrated in Fig. 37. 
Gripping of specimens was achieved using electron-beam-assisted Pt deposition. 
A transducer based on a three-plate capacitor system was used for high-fidelity 
force and displacement measurements. Local strain measurements are obtained 
using digital image correlation of electron images taken during testing. 
The nanomaterials ’ size effects on mechanical behavior and the challenges of 
nanomechanical testing in vacuum environments can be studied in details based 
on the nanomechanical testing setup. 
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Fig. 36 Three-dimensional structural analysis based on the FIB/SEM dual-beam system, 
(a) Schematic diagram of the dual-beam system, (b) SEM image of the cross section of poly 
(styrene-block-isoprene) (SI) block copolymer, (c) 3D reconstruction of the SI block copolymer 
(Mitsuro et al. 2007). (d, e) shows the three-dimensional reconstructions of the TiUnite and bone 
interface using Slice and View FIB/SEM. (d) Three orthogonal sections of backscattered electron 
scanning electron microscopy images through the reconstructed volume of FIB -milled serial sec- 
tions. Units are provided in pm. (e) Reconstructed volume of 68 SEM images (Giannuzzi et al. 2007) 
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Fig. 37 Nanomechanical testing setup, (a) Image of the assembly in the vacuum chamber of the 
dual-beam FIB/SEM. The transducer, manipulator, and nanopositioners are arranged in a config- 
uration that allows for sufficient range of motion and modular installation of the components, 
(b) Schematic of the setup highlighting the DOF of motion available from the nanopositioning 
and microscope stages. The nanomanipulator is not shown in (b) (Gianola et al. 2011) 
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Summary 

Ion-beam nanofabrication technologies have been widely used in micro-/nano- 
manufacturing, with unique advantages of high fabrication resolution and high form 
accuracy, high flexibility, as well as maskless processing. IBM technologies have a 
significant impact in many key research areas, such as nanotechnology, micro-/nano- 
optics, surface engineering, and biotechnology. With further developments in the 
high-performance IBM system, IBM methods would be a vital candidate to become 
the mainstream approach in the micro-/nano technology. 
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Abstract 

The understanding of the physics and mechanism and the ability of imaging, 
manipulation as well as fabrication of devices and systems at nanometer scale is 
undoubtedly the frontier of today’s science and technology. The momentum of 
current research and industrial progress calls for continuous development of new 
state-of-the-art tools. The interdisciplinary fields of material science, physics, 
biology, mechanics, and electronics clearly perpetually seek superpower facilities 
that persist integrated functions to conduct operation as well as to see the nano- 
world. The ion beam (IB) technology is one of them that can meet the demands of 
today’s science and technology. In this chapter, we present a basic introduction of 
the ion beam instruments, which are classified into two catalogs that based on 
focused ion beams and broad beams. In particular, the ion sources, ion optics, and 
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some most advanced focused ion beam facilities are discussed followed by the 
brief introduction of broad beam instruments for nanofabrication, including the ion 
beam etching, deposition, and implantation systems. 


Introduction of Ion Beam Instruments 

Ion beam technology has many advantages over other energetic particle beams in 
nanofabrication, to mention a few: (i) ions are much heavier than electrons and are 
element in nature, thus can be used to strike materials with much higher energy 
density though relatively short wavelengths for direct patterning with greater through- 
put; (ii) the proximity effect inherited by photon/electron beam writing can be well 
eliminated when using ion beam; and (iii) ions can be used to tune or modify the 
structures and properties of the targets due to the element- specific nature. 

In the last few decades, ion beam systems have become increasingly popular due 
to the ability to form various types of nanostructures. Techniques of ion beam 
manufacturing are effects based on the bombardments of the substrate surfaces with 
energetically charged particles to realize the processes of material sputtering, 
deposition, implantation, and property modification. Based on the size of the 
beam source, IB facilities can be catalogued into two broad groups, the broad 
beam tools and the focused ion beam tools. Ion sources are developed based on 
different materials for different applications, such as gaseous ions of H, He, Ar, N, 
Xe, Kr, and O which are developed for nanofabrication, surface finishing (metal, 
polymer, and ceramic surfaces), and properties (electrical, mechanical, and optical 
properties) modification; metallic ions of Ag, Fe, Mg, Y, Zr, and Cr which are used 
for gemstone coloring; ions of B, P, As, Co, Cu, N, and metallic ions which are 
employed for semiconductor doping. Ion sources can also be classified as liquid 
metal ion source (LMIS) and gas field ion source (GFIS). The most commonly used 
LMIS is Ga+, and other metal ion sources, including elemental ion source (e.g., Al, 
As, Au, B, Be, Bi, Cs, and Cu) and alloy ion sources (e.g., AuSi, AuSiB, PdAs, 
PdAsB, NiB, and NiAs), have also developed and are commercially available. 


Focused Ion Beam Instruments 

A potential type of tool that has successfully met the requirements and promises 
to continue to fulfill the future more advanced demands in nanotechnology is the 
focused ion beam (FIB) systems. In terms of their applications, FIBs can be classified 
as FIB lithography, implantation, etching, deposition, repairing, imaging, and ana- 
lyzing systems. Based on the energy of the ions, FIBs can be grouped as high-energy 
(>100 keV), medium-energy (10-100 keV), and low-energy (<10 keV) systems. 

Besides the unsurpassed capabilities to directly remove a wide range of materials 
in a site-specific area in nanometer scale, a single ion beam system can also be 
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upgraded to be DualBeam or CrossBeam systems to take the imaging advantage of 
a scanning electron microscope and to be a triple beam system to remove the 
contamination or damage from the energetic ions during processing, or integrated 
with a gas injection system (GIS) for enhanced etching or material deposition, or 
installed with manipulators for in situ characterization and manipulation, or 
installed with a fixed beam moving stage (FBMS) for advanced nano-patterning. 
Thus in terms of the configuration, FIB systems can be further catalogued as single- 
beam single-column FIB, dual-beam single-column FIB-SEM systems, dual-beam 
dual-column FIB-SEM systems, and triple-beam systems. 

It is well known that FIBs are an integral part of material science and the semicon- 
ductor industry due to their day-to-day tremendous applications, which include the 
transmission electron microscopy specimen preparation, in situ cross sectioning and 
analysis of a fabricated device, gas-assisted milling and deposition, mask repair, 
micromachining, scanning ion microscopy, and ion beam lithography. However, 
inevitable contamination due to the use of liquid metal ions is the major drawback of 
FIB machines; thus, a noble gas ion source is very much desired (Cui 2005; Yao 2010). 


Components of a Typical Focused Ion Beam System 

Fundamentally, a focused ion beam system produces and directs a stream of high- 
energy ionized atoms of a relatively massive element, focusing them onto the 
sample by electrostatic lens and apertures. The major components of an FIB are 
the ion source, ion column (suppressor, extractor, apertures, alignment octopole, 
condense lens, objectives and deflection system), signal detector, vacuum chamber, 
and sample stage. Here we will only focus on the ion sources and ion optics. 

Liquid Metal Ion Source (LMIS) 

On a focused ion beam system, the generating source of the beam is known as the 
ion source; it is also referred to as an ion gun. Gallium (Ga + ) ions are the most 
widely used ion source in FIB systems due to a number of advantages it persists: 
firstly, it has a low melting point (29.76 Q C) which could minimize the interdiffusion 
with the holding needle substrate; secondly, it has a medium-heavy mass that 
allows milling of heavier elements though not too heavy to cause obvious damage; 
thirdly, its low volatility at the melting point could conserve the usage of metal 
and yield a long source life of about 400 pA-h/mg; fourthly, Ga does not require an 
E x B mass separator in the optic column since it has relatively low vapor pressure 
which allows it to be used in its pure form instead of an alloy; and finally, Ga can be 
easily distinguished from other elements, which could somehow minimize the 
interference during sample analysis (Yao 2010). 

The conceptual and cross-sectional diagrams of a liquid metal ion source are 
shown in Fig. la, b respectively. In the most popular FIB systems, a reservoir of 
heavy metal atoms, typically gallium, due to the aforementioned advantages it has, 
is heated to near evaporation; it then flows and wet the heat-resistant tungsten tip 
which has a sharp tip. The Ga atoms thereafter flow to the very end of the tip of the 
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Fig. 1 (a) Ion source conceptual diagram; (b) A cross-sectional diagram of a LMIS 


needles, drawn by concentric annular electrodes around the tip. In that an electrical 
field on the order of 10 10 V/m is applied between the sharp, metallic tip and the 
pulling electrode, causing the energetic ions in that region immediately above the 
tip to accelerate towards the extractor. A “Taylor cone” shape of ions with an apex 
about only 5 nm in diameter is formed in the region where the electrostatic and 
surface tension forces are balanced. Since the cone tip is so tiny that Ga from the 
needle tip can be easily and efficiently ionized by field evaporation at the end of the 
Taylor cone, which is then accelerated by the applied accelerating potential down to 
the ion column to collide with the sample. 

Generally, to obtain a stable beam with reduced energy spread, low emission 
currents of about 1-3 pA are recommended though an LMIS system could supply a 

o o 

current density on the order as high as 10 A/cnr. Low ion beam could 
avoid unnecessary energy spread, prolong source lifetime, as well as reduce the 
possibility of dimer, trimer, and droplet formation. The current emitted from the tip 
is called the extraction current; it is regulated by both the suppressor and extractor. 
The suppressor works with the extractor to maintain a constant beam current by 
applying an electrical field, which provides an alternative approach to change the 
extractor current without changing the extraction voltage. This is of great impor- 
tance since changing the extraction voltage would result in spatial displacement of 
the Taylor cone and may cause obvious beam shift. Meanwhile, adjusting the 
suppressor is useful to remove the gradual downward shift in extraction current 
since in LMIS the surrounding electrical field causes electrons to collide with 
particle or other materials in the vacuum and contaminate the ion source. 

Ions to form an ion beam source are produced by field evaporation. Ions are 
produced in the LMIS when field evaporation process takes place. In theory, the 
energy required to generate an ion in free space is given by: 

Qo = H a +I n — (1) 

Here H a denoted the heat of atomic desorption, l n is the ionization energy to 
produce an n- fold ion, and O is the work function of the field emitter. The term 
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nO refers to the released energy when n electrons return to the metal. Thus for a 
field with strength of E , which lowers the potential barrier for field ionization by 

Q O I/O 

(2 e E) , the energy required for evaporation of liquid Ga is 

Qe = Qo- (« Ve) 1/2 (2) 

Alternatively, the atoms in the Taylor cone will ionize when 

1-0 = eEx c (3) 

Where I is the ionization potential near a metal with work function O in the 
presence of an electrical field E , is the critical distance from the tungsten 
tip required for ion production, which is about 0.2 nm at a field with strength of 
10 10 V/m (Yao 2010). 

Apparently, the total current in the beam leaving the extraction aperture is only a 
fraction of that produced by the source. Since the velocity vectors of a very large 
amount of the particles are not along the direction of the beam, the majority of the 
ions are blocked by the aperture. Thus it has the limitation of brightness. A brighter 
beam means better imaging and milling characteristics though may have poorer 
uniformity and precision in focusing compared to a beam passing through a finer 
beam aperture. LMIS also suffers the problem of uniformity. Ions emitted from the 
LMIS tend to follow a Gaussian energy distribution, possibly asymmetric, which 
could lead to chromatic aberration. The ion source also experiences an additional 
degree of energy spread due to mutual repulsion between the charges just beyond 
the source. The ions repel each other, imparting a small but significant random 
velocity that changes the overall energy profile, which could cause further chro- 
matic aberration and forming the fundamental limit for the focusing ability of FIB 
systems. 

Gas Field Ion Source (GFIS) 

The commercial gallium liquid metal ion source has served as the ion source of 
choice for the past 30 years. While it is very reliable and stable in operation, it has 
relatively high energy spread and large virtual source size that limit their current 

o 

density to approximately 5 A/cnT and with some recognized shortcomings arising 
from the probe size, electrical contamination, and optical opacity. To overcome 
some of the drawbacks, ion species that produce inert noble gas ions have been 
developed, not only to avoid the Ga contamination; in addition, such sources also 
have low energy spread and a small virtual source, which can result in higher beam 
currents at smaller beam diameters compared with LMIS. Soon after its develop- 
ment in 1955, the GFIS development efforts were hampered by issues related to the 
difficulty to implement (ultrahigh vacuum and cryogenic temperatures) as well as 
source lifetime and the short- and long-term temporal stability (Edinger et al. 1997; 
Kalbitzer 2004). In very recent years, the helium GFIS performance has been vastly 
advanced - permitting the development of the helium ion microscope (HIM), which 
will be discussed in more details at a later stage. 
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Fig. 2 (a) Image of the atoms at the end of the source tip emitting helium ions and (b) the 
schematic drawing of the gas field ion source configuration 

In a GFIS, ion emission is confined to a small protrusion near the apex of a single 
crystal tungsten emitter based on the so-called super tip as shown in Fig. 2a, leading to 
exceptionally high angular ion current densities of 20-30 mA/sr for gases such as 
helium, hydrogen, and neon. After ion generation, a gimbal mount is used to align the 
ion beam with the focusing optics since the emission site is randomly oriented with 
respect to the emitter axis in the super tip (Edinger et al. 1997; Kalbitzer 2004). To 
obviate the using of the gimbal arrangement, Edinger et al. very recently developed 
another approach to investigate standard thermal field methods to produce emitters 
with ion emission confined to a small region along the emitter axis (Edinger 
et al. 1997; Kalbitzer 2004). In their approach, the super tip is a small protrusion on 
a large emitter substrate, basically a small cluster of atoms at the apex. 

Figure 2b shows a schematic drawing of the GFIS component proposed by 
Edinger et al. (1997). It consists of the gas delivery system, the emitter assembly, 
the cooling assembly, and the vacuum chamber. The ion source assembly is cooled 
with liquid nitrogen/helium in containers mounted onto a diagnostic column. The 
gas delivery system consists of a turbo pumped gas manifold, an additional titanium 
sublimation pump for gas purification. Immediately before the gas enters the 
emitter compartment, it is precooled by flowing through stainless steel filter 
which is submerged in liquid nitrogen. During operation, the ion pump is valved 
off and the emitter compartment is differentially pumped by the turbo pump 
through the extraction aperture. The main vacuum chamber is equipped with a 
residual gas analyzer and pumped with a combination of turbo pump/ion pump. The 
emitter assembly consists of a tungsten single crystal wire (W< 1 1 1 > or W< 100>) 
welded onto a tungsten filament for emitter heating. An e-type thermocouple is spot 
welded onto the filament for in situ temperature monitoring. The emitter is enclosed 
in a cylindrical compartment with an extraction aperture at the bottom for a 
relatively high pressure at the emitter region. The imaging gas is leaked into the 
emitter compartment through several small slits around the edge of the aperture and 
in direct line of sight of the emitter. The emitter assembly is enclosed in a double 
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cold shield arrangement, with the outer Cu shield attached to the outer reservoir and 
the inner stainless steel shield to the inner reservoir of a vacuum insulated, two 
vessel Dewars. Electrical and thermal connection to the emitter assembly is pro- 
vided through two Cu wires and two thermocouple wires attached to an inner 
reservoir. The temperature of the emitter is typically 18-21 K (Edinger et al. 1997). 

Field emission is the process that uses high electrical field to ionize the gaseous 
atoms or molecules. The ionization time required for an atom is given by 

t = l(T 16 exp(0.68V) 2/3 /£ (4) 

Here V is the potential barrier of the atom and E is the electrical field required for field 
ionization. It can be seen that a higher electrical field results in shorter ionization time. 
For normal field ion emitter, sharp tips with diameter in the order of several microns 
are used in order to produce strong field for gaseous atoms ionization. The ion current 
emitted from an emitter with a tip radius of r can be given by 

aE 2 4jiPr 2 

1 ~ 2 KT V2jc mKT 

Here P is the gas pressure, T is the absolute temperature, a is the ionization 
coefficiency, K is the Boltzmann constant, and m is the relative mass of the atom. 
With proper lenses, fine beam with high brightness can be produced. 

The world’s first scanning helium ion microscope is Carl Zeiss ORION Plus 
Helium Ion Microscope, which employs a GISF source, can be used to see samples 
with sub-nanometer resolution, even for low-Z materials, like carbon nanotubes, with 
resolution and surface information unavailable from a typical SEM. It also has the 
advantages of being able to see material contrast that was never seen before; for sharp, 
clear, and bright imaging since it has a higher, more varied secondary electron yield; 
and observing insulating samples where charging effects are minimized by imaging 
with the unique backscattered ion mode (Edinger et al. 1997; Kalbitzer 2004). 

Ion Optics 

After a beam of charged particles is generated from the ion source, it must be 
focused using a series of lenses to a desired spot size. The lens used for ions is 
almost identical to a lens for light. Ion lens functions very much like light optics 
though the construction is quite different. Instead of using a particular or a set of 
complex materials with a certain shape to manipulate the path of a light, ion optics 
using electrostatic force to change the path of the ions. A schematic diagram of a 
typical ion column is shown in Fig. 3. The ion optics also has similar parameters as 
the light optics and electron optics, such as the focal length, refractive index, 
chromatic aberration, spherical aberration, and astigmatism. An additional specific 
to ion beam is the interparticle repulsion related source size. 

Different from electron beams, which consist of fast moving, low-mass particles 
and generally focused by only magnetic fields, charged ions are much heavier and 
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required stronger fields for focusing due to the higher energy they have. Thus in 
contrast to the magnetic lens used in an electron beam system, electrostatic lens are 
almost exclusively used in the ion beam systems. The reason for this is from the fact 
the when a charged particle placed in an electrical field E , the force it encountered is 
F e = qE , which is independent of its velocity. Whereas, the force exerted by a 
magnetic field B, F = qV x B, is directly related on the velocity. A particle 
accelerated by a potential drop of AV will gain a speed of (2qAV/m) l/2 (m is the 
mass of the particle). Thus for a higher mass of ions, their velocity is about 0.0028 
of that of an electron accelerated by the same potential, while their momentum is 
370 times higher. It requires an unpractically large magnetic optics to provide 
enough focusing power for an ion beam. 

Electrostatic lens, however, can be made extremely small and are capable of 
producing much faster response for beam deflections. A diagram of a three- 
electrode electrostatic lens in an ion beam column is depicted in Fig. 4a. It operates 
in a way that positively charged particles enter the lens from the top and encounter 
an electrical field formed by the larger voltage difference between the top two 
electrodes. The ions follow the field lines, moving towards the optical axis and gain 
a boost in velocity by the increasing negative field. As the ions pass the second 
middle electrodes, they are pulled downwards. When they are closer to the axis, 
more momentum is gained; here the change in direction is less than from the first 
electrodes. Upon passing the second field, the beam is again condensed though 
is decelerated by the increasing positive field to near their original velocity. 
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Fig. 4 (a) Schematic of an electrostatic lens that consists of three pairs of electrodes in an ion 
beam column and (b) the beam path of a GFIS ion column in an FIB system 


The charged particle beam then exists the column and continuously been narrowed 
towards its focal point. A typical LMIS column that is mostly used in current FIB 
columns is shown in Fig. 1.3, and a typical diagram of a newly developed GFIS 
optical column is shown in Fig. 4b. 

Most ion columns have more than one lens, and imperfection or fundamental 
limit in one lens could be multiplied and cause a significant loss in resolution. 
Usually apertures are inserted along the beam zone axis to reduce the aberrations in 
the electrostatic lenses of ion columns. Also, it is critical to operate the lenses in the 
so-called paraxial mode in that the angles of the trajectory of ions with respect to the 
lens axis are extremely small. There are three basic groups of aberrations, which are 
spherical aberration, chromatic aberration, and astigmatism. Spherical aberration is 
the nonlinear dependence of beam deflection on radius within a lens. Since in a real 
lens, close to the substance of the lens, fields experience a degree of fringing and 
edge effect, causing the beam to deflect at an angle deviated from the desired focus. 
As a result, the parallel vectors traveling through the lens may be focused at 
different positions, and the focal length varies with radius instead of remaining as 
a constant. The phenomenon can be reduced by careful lens design or inserting an 
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aperture in the beam path. The diameter of the beam resulting from a spherical 
aberration can be given as 





Here C s is the spherical aberration coefficient and a is the aperture angle. 

Chromatic aberration is the main limitation on the focusing ability of ion optics. 
According to Lorentz force law, the focal length of a particle is different when there 
is a small difference in velocity owing to a difference in initial energy, since the lens 
relies on the interaction between fields, charges, and velocities. Therefore, the 
spread in energies translates to a spread in focus. The chromatic aberration is the 
fundamental property of the source that represents the most serious practical 
limitation on the performance of the ion beam systems. With an energy spread of 
A E, the new focused beam diameter becomes 


A E 

d c C c ot 

E 

where C c is the chromatic aberration coefficient of the lens. 

Astigmatism is an effect of asymmetry in the focusing field, whereby the cross 
section of the beam in a given plane is not circular but ellipsoid, increasing the 
beam diameter to 



d A = Af A a (8) 

The beam can be reshaped, and the astigmatism can be corrected by a set of 
magnetic coils near the beam exit with deliberately applied asymmetric fields. 
Chromatic and spherical aberrations are properties of the lenses and ion sources, 
which can only be corrected by apertures and additional lenses with opposing 
aberrations. The final beam diameter resulting from the combination of these 
aberrations can be described in the quadrature equation: 

(ftp = JqM 2 + d^ + dP s + cP c (9) 

Here d 0 is the initial source beam diameter. In reality, as the half-angle of the 
virtual source arriving at the final focal spot diminishes to less than 1 mrad, the 
term d$M dominates in the quadrature equation. Thus the fundamental limit of the ion 
optics is the minimum beam diameter of the virtual source and the energy spread 
resulting from mutual repulsion and space charge (Cui 2005; Yao 2010). 

Detection of Electron and Ion Signals 

The secondary electrons (SE) are mostly used for high-resolution focused ion beam 
imaging, though backscattered electrons (BSE) and secondary ions (SI) also can be 
used for analysis. For SE, two main types of detectors, the multichannel plate and 
electron multiplier, are used. A multichannel plate can be directly mounted above 
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the sample which offers negligible topography information. However, the Everhart- 
Thornley electron multiplier detector, also known as a scintillator-PMT 
(photomultiplier tube) detector, is most widely used today. It consists of a collection 
grid and screen that located to the side of the sample stage for SE collection. 
Captured SE cause the scintillator to emit photons; then a light pipe extends from 
the scintillator and internally reflects the photons to the PMT, where the signals are 

o 

amplified by a factor as high as 10 and output as electron signal. More secondary 
electrons can be collected from the areas that are above the surface though less in 
concaved area; thus, contrast is created and can be used to interpret the surface 
morphology. SE signal can be increased by titling the substrate to face the detector 
for more effective collection for contrast-based topography analysis. Backscattered 
electrons can also be detected by a scintillator-PMT if the collection gird is 
negatively biased instead of positively, therefore repelling lower-energy SE but 
attracting BSE. 

Secondary ions can be detected and utilized for imaging with advantages in 
some aspects. In particular, SI images reflect more details in topography and atomic 
number difference of the substrate due to the channeling effect. Information about 
the grain size and crystal orientation can also be obtained using an FIB, which 
provides invaluable capability for material science. Generally, there are two 
methods for SI detection, the microprobe mode and the microscope mode. For the 
microprobe mode, the primary ion beam is raster scanned, while the SI signal is 
synchronously detected. Different from the SE detection, the detector gird is biased 
to a high negative voltage to repel both the SE and BSE but to attract positive ions, 
which are subsequently amplified regardless of the ion mass. On the other hand, the 
microscope mode, which is more sophisticated, is a mass-resolvable ion imaging 
that is equipped with secondary ion mass spectrometer (SIMS), allowing elemental 
analysis of the sample combined with ion imaging. In this mode, the most com- 
monly used detector is the microchannel plate connected to a phosphor screen with 
the image being captured by a sensitive CCD. 


Focused Ion Beam Facilities 

There are many types and models of FIBs which have their own particular appli- 
cation concerns though they share the similar constructions and working principle. 
A typical single-beam FIB system includes five major parts; they are the ion source, 
the ion optics, the beam scanning component, the sample stage, and the signal 
detectors. To meet the requirements and promises to continue to fulfill the future, 
more advanced demands in nanotechnology, extended functionalities, and 
advanced performances are introduced to the single-beam FIB systems with 
advanced quality of the ion source. Here the FEI dual-beam system (Helios 
NanoLab™ 600i), SII NanoTechnology Inc SEM/FIB/Ar Triple Beam System 
(SII SMI3000TB Series), and Carl Zeiss Helium Ion Microscope (ORION 
PlusHIM) are selected as examples for system components, specifications, and 
typical applications discussion. 
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Fig. 5 A schematic diagram illustrates the major components in a dual-beam FIB/SEM system 


Dual-Beam Systems 

The two-beam systems provide advantages in nanoscale imaging, analyzing, and 
fabrication compared with a single-beam system. SEM images have relatively low 
contrast, but with higher resolution without obvious damage to the specimen, though 
FIB images have more contrast with inevitable contamination. By combination of the 
two beams, comprehensive imaging and characterization can be performed. Also 
reconstruction of three-dimensional stmcture can be simplified using a two-beam 
system to interpolate two-dimensional SEM and FIB images. Using the milling 
feature of the ion beam, 3D chemistry mapping of a sample can be achieved with 
the ion-assisted SIMS of layers that have been exposed to the ion beam. 

The complementary nature of the negatively charged electrons and the positively 
charged ions also eliminates the charging problem found in a single-beam FIB. 
Except the better and more extensive data can be collected with a two-beam system, 
the other advantages include (i) SEM allows the real-time monitoring of the FIB 
operation with guaranteed higher levels nanofabrication and positioning accuracy; 
(ii) more options are provided to deposit materials with desired properties, e.g., to 
deposit metallic layer with better conductivity using ion beam while dielectric layers 
with better insulating properties using the electron beam; (iii) fabrication of nanoholes 
with diameter that is beyond the FIB capability, for instance, after ion milling, using 
electron beam-induced deposition to reduce the diameter by 50 % or more (http:// 
www.fei.com/products/dualbeams/helios-nanolab.aspx). A schematic diagram shows 
the major components in a dual-beam FIB/SEM system that is presented in Fig. 5. 
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Fig. 6 (a) Image of the FEI Helios NanoLab™ 600i and (b) a schematic of the two-beam FEI 
SEM-FIB system at the position where the ion beam is normal incident to the substrate and the 
electron and ion beam are co-focused 

Helios NanoLab™ 600i is an advanced DualBeam™ provided by FEI for 
ultrahigh resolution imaging, analysis, and fabrication at the nanoscale. It offers 
the advances in electron beam imaging, ion beam patterning, and a range of features 
to make milling, imaging, analysis, and sample preparation down to nanoscale. 
Figure 6a shows the images of the Helios NanoLab™ 600i system. It consists of the 
ion optics, the electron optics, the vacuum chamber, the sample stage, the computer 
control system, and the gas injection system. 

The electron beam and the ion beam are placed in a fixed position with an angle 
of 52° between the two beams for the best performance (as shown in Fig. 6b). The 
two beams are co-focused at the coincidence point, an optimized working distance, 
where when changing the tilt angle of the stage, the deviation of the position of the 
SEM image and FIB images on the screen are minimized. 

The innovative electron column forms outstanding high-resolution imaging 
performance, which features constant power lenses for higher thermal stability, 
electrostatic scanning for higher detection linearity and speed, and column designed 
for pin sharp imaging. The imaging performance has also improved with the 
through-the-lens detector (TLD), set for high collection efficiency of SE and 
on-axis BSE, complemented by FEI’s latest developed retractable solid state 
backscatter detector and the multisegment scanning transmission electron micro- 
scope (STEM) detector, for low kV second electron/backscattered electron 
(SE/BSE) and bright field/dark field/high angle annular dark field (BF/DF/ 
HAADF) imaging, respectively, as well as the optional high-performance detector 
for optimized FIB-SE and SI imaging. The ion column of such a system provides 
unrivaled fast, precise, and reliable milling, patterning, and ion imaging. 
The exceptional low-voltage performance is proven to produce very good quality 
thin samples for high-resolution STEM and atom probe microscopy. Except the 
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excellent ion imaging resolution, it also delivers a tighter beam and a more accurate 
scan profile for extremely precise ion milling with its integrated differential 
pumping and time-of-flight correction. Complex structures at the nanoscale fabri- 
cation are also possible with extensive range of gas precursors. Furthermore, with a 
new generation of automated software, unattended sample preparation or 3D 
characterization and analysis can be done with the robust, precise FIB slicing, 
combined with a high-precision piezo stage and superb SEM performance. 

The specifications of these systems feature that the electron source Schottky 
thermal field emitter can last over 1 year; the ion source gallium liquid metal has a 
lifetime of about 1,000 h. For the SEM part, the beam voltage is in the range of 
350 V-30 kV and that for FIB is in the range of 500 V-30 kV. The STEM has a 
resolution of 0.8 nm; at optimized working distance, the resolution for SEM is 0.9 
nm, when operating at 15 kV, it is 1.4 nm at 1 kV. FIB resolution at the coincident 
WD is 5 nm at 30 kV. The Stage is 5 axis motorized, it can be piezo driven with a 
travel distance of 150 mm in both X and Y axes. Whole wafers up to 200 mm in 
diameter as well as small pieces of substrate, packaged parts, and TEM grids can all 
be hosted. User interface Windows® GUI are with integrated SEM, FIB, GIS, and 
simultaneous patterning and imaging mode, providing great flexibility in operation 
and controlling (http://www.fei.com/products/dualbeams/helios-nanolab.aspx). 

SEM/FIB/Ar Triple Beam System 

With the further miniaturization in nanoscale semiconductor devices, the demand 
for high-quality, high-resolution observation using a transmission electron micro- 
scope (TEM) and a scanning transmission electron microscope (STEM) has 
increased, resulting in the need for compatible high-quality TEM sample prepara- 
tion technology that ensures minimal sample damage. While processing using low 
accelerating voltage argon (Ar) ion beam is known as a solution that allows 
preventing of such damage, there have been unresolved challenges in observing 
the sample through an Ar ion beam alone. It suffers the reliability in end point 
detection and working efficiency in ultrathin TEM sample preparation. To meet 
these requirements, SII NanoTechnology Inc, in December 2005, released the 
world’s first commercially available triple beam system SMI3000TB Series, 
which enables high-definition TEM sample preparation to be performed efficiently 
with a single system. An outer view of the hybrid SMI3000TB Series triple beam 
system is shown in Fig. 7. 

The SMI3000TB Series was developed based on the concept that integrated a 
newly developed Ar ion beam column with a dual-beam system. It is ideal for high- 
quality TEM sample preparation since the system enables positioning at specific 
locations as well as end point detection during Ar ion milling. By accurately emitting 
a low- acceleration Ar ion beam of 1 kV or less at the processing location of a sample 
arranged at the FIB and EB coincidence point (beam intersecting point), the system 
achieves accurate and low-damage sample finishing. TEM sample preparation pro- 
cess from high-precision FIB processing to low-damage finishing using a 
low-acceleration Ar ion beam can be in situ observed with the SEM, resulting in 
the improvement of reliability and efficiency in TEM sample preparation. 
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Fig. 7 (a) The outer viewer of the SMI3000TB SEM-FIB-Ar triple beam system, (b) the side 
view, and (c) the top view of the relative position of the electron beam, ion beam, and the Ar gun 


Based on the high-performance automation platform of SIINT’s SMI3000 
series, recently, the new XVision 300 TB products are in the market; these 
facilities are equipped with the SIINT’s latest FIB column and Gemini column 
of Carl Zeiss, achieving highly precise processing with less damage compared 
with conventional models. The SEM/FIB/AR triple beam system has the features 
of real-time imaging at high-resolution, full wafer, and photo-mask applications, 
automated TEM sample preparation, failure analysis, process qualification, and 
mask inspection and modification. The XVision300TB could hold sample with 
diameter up to 300 mm; it has 5 -axis motorized eucentric tilting stage. For the 
FIB part, the acceleration voltage is in the range of 1 ~ 30 kV (5 kV step) and the 
SE observation resolution is 4 nm @ 30 kV; for SEM part, the accelerating 
voltage is in the range of 1 ~ 30 kV and the resolution at 5 kV is 3 nm; for the 
argon ion beam, the accelerating voltage is in the range of 0.5 ~ 1 kV with a 
maximum beam current of 10 nA at 1 kV. Four channels of gas supply systems are 
attached for material deposition and selective/enhanced etching (http://www. 
siint.com/en/news/news_2005_ ll_09_a.html). 

Helium Ion Microscope (Carl Zeiss ORION Plus HIM) 

Field ion microscopy can provide images of solid surfaces with atomic resolution. 
For this purpose, fine metal tips together with an imaging gas, preferably helium, 
are operated at temperature as low as 20 K or lower; the atomic structure near the 
tip apex is resolved. The objective of a gas field ion source is to produce intense 
focused beam with an adequately fine ion beam for local modification of solid 
surfaces. The selected projectile species depends on the particular application. 
H + and He + predominantly produce electronic excitations in solids, while heavy 
ions like Xe + efficiently generate atomic displacements. The wide spectrum 
of applications ranges from photoresist exposure for production of integrated 
circuits to material amorphization for generation of optical patterns. With the 
advent of the GFIS super-tip, GFIS with a single emitting center, a few nanome- 
ters in size, has been shown to exhibit excellent source properties. Specific 
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brightness values up to 10 ~ A/cnr sr eV have been achieved; as a consequence, 
high-intensity ion beams with image spots in the low-nanometer range have 
been obtained. 


The secret to the amazing resolving power of the helium ion beam starts with the 
super source tip. Individual atoms are stripped away from the source until an atomic 
pyramid is created with just three atoms at the very end of the source tip - a 
configuration called the “trimer.” This repeatable process can be accomplished in 
situ. Once the trimer is formed, the tip is maintained under high vacuum and 
cryogenic temperatures with helium gas flowing over it. A high voltage is applied 
to the needle to produce an extremely high electrical field at its apex. The helium 
gas is attracted to the energized tip where it is ionized. With ionization happening in 
the vicinity of a single atom, the resulting ion beam appears to be emanating from a 
region that is less than an angstrom in size. This produces an extremely bright beam 
that can be focused to an extraordinarily small probe size. 

Furthermore, helium ions are about 8,000 times heavier than electrons. A helium 


ion beam exhibits very little diffraction when passed through an aperture or across 
an edge. Diffraction is a significant problem for a SEM where the diffraction effect 
limits its ultimate spot size. The helium ion beam has a DeBroglie wavelength that 
is approximately 300 times smaller than an electron beam resulting in much less 
diffraction. Since the helium ion beam is not affected by diffraction, it can be 
focused to a much finer spot size, enabling sub-nanometer resolution. The helium 
ion source has a very long lifetime due to the fact that the source tip is always kept 
at a positive potential. The only things attracted to the tip are electrons, which do 
not cause any ill effects. Positively charged ions are repelled from the tip, and other 
gases will be ionized before they have a chance to strike the source tip. For this 
reason, the source lifetime for the helium ion source is well over 1,000 h. The 


number of detected secondary electrons is used to determine the gray level of that 
particular pixel. Since the number of detected secondary electrons varies with 
material composition and shape, the images provide excellent topographic and 
compositional information. 

Carl Zeiss ORION Plus Helium Ion Microscope moves beyond sub-nanometer 
imaging to advance the fabrication of nano-electronic and nanophotonic devices. 
Unlike scanning electron microscopes, which image and measure by focusing 
electrons into a beam, the ORION uses helium ions which can be focused into a 


smaller probe size. This provides a smaller sample interaction compared to elec- 
trons and allows for significantly greater image resolution, depth of focus, and 
material contrast. The ORION helium ion beam is reported to be the brightest 
illumination source that has been created so far. This is due to the fact that it 


originates from a region less than one angstrom in size. The use of helium ions 
instead of electrons or other alternatives also provides other benefits, including a 
high secondary electron yield, and less sample damage. The images provide 
detailed information on the samples’ topographic, material, crystallographic, and 
electrical properties. 

Scientists are currently investigating new uses for helium ion microscopy in 
nano -patterning. Since it is not limited by the proximity effect of conventional 
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e-beam lithography, helium ion technology seems to be an up-and-coming alterna- 
tive, especially when it comes to printing ever finer features. For over 2 years, the 
ORION Plus Helium Ion Microscope from Carl Zeiss has been known for its 
excellence in imaging uncoated insulating samples and soft materials with 
sub-nanometer resolution (resolution specification <0.35 nm). Now, with a grow- 
ing installed base of ORION instruments in academic and industrial research and 
development, microscopists are extending the diversity of applications to include 
nanofabrication. Researchers have patterned 5 nm dots on a 14 nm pitch, while 
other teams have demonstrated well-delineated L-bar lines in HSQ resist. This new 
regime of pattern fidelity is enabled by the large depth of held of the microscope 
and by the short range of the ion-initiated secondary electrons in typical resist 
materials. Helium-ion-dehned patterns in HSQ can easily be obtained without the 
usual narrow process window constraints often associated with conventional optical 
and e-beam lithography (http://www.microscopy-analysis.com/news/carl-zeiss- 
advances-nanofabrication-research-helium-ion-microscope). 

Ion Beam Nanolithography Instruments 

In recent years, intense efforts have been conducted internationally to find the best 
option for sub- 100 nm lithography. With 193 nm water immersion lithography, 
there seems to be the possibility to push device production to the 65 nm level and 
possibly even below, by implementing sophisticated resolution enhancement tech- 
niques. Beyond these photolithographic limits, mask-less lithography becomes an 
important option for low and medium volume device fabrication. For devices with 
sub-50 nm features, such as nanoelectromechanical systems, nanophotonics, nano- 
magnetics, molecular nanotechnology devices, and novel and even revolutionary 
fabrication methods will turn out to be mandatory. Traditional lithography is only a 
fraction of a large variety of nano structuring possibilities. Ideally, a new paradigm 
is required, to overcome the interference limitations inherent in optical lithography. 

Ion beam lithography is the practice of using energetic ions to irradiate a surface 
in order to create very small structures such as integrated circuits or other 
nanostructures. Ion beam lithography offers higher resolution patterning than UV, 
X-ray, or electron beam lithography because the ions have a smaller wavelength 
than the UV light, X-rays, and e-beam and undergo almost no diffraction or 
scattering. There is also a reduced potential radiation effect to sensitive underlying 
structures compared to X-ray and e-beam lithography. It has been found to be useful 
for transferring high-fidelity patterns on substrates, especially three-dimensional 
surfaces (Parikh et al. 2008). Meanwhile, since ions are massive compared to 
electrons, thus the penetration and projection trajectories are much shorter than 
those of electrons. In other words, the energy can be transferred to the irradiated 
resist in a short time; thus, it has much higher sensitivity, which leads to one to two 
orders of quicker exposure speed. There is almost no proximity effect that exists in 
ion beam lithography since the traveling distance of ions in the resist as well as in 
the substrate is short. Moreover, the negligible proximity effect means that the 
feature size of the exposure pattern is very little affected by the exposure dose, 
which means ion beam lithography could greatly increase the yield rate and reduce 
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Fig. 8 The schematics of ion beam lithography modes: (a) scanning ion beam lithography, (b) 
masked ion beam lithography, and (c) projection ion beam lithography 


the cost. Most importantly, it can be used to direct write without using any resist. 
However, it has the disadvantages, such as it may cause damage of the substrate and 
the exposure speed is limited by the operational scanning speed and the speed of the 
blanking plate. 

According to the exposure modes, ion beam lithography can be classified into 
(a) scanning ion beam lithography, (b) masked ion beam lithography, and 
(c) projection ion beam lithography. The first one is also called the mask-less ion 
beam lithography (MLIBL). It is similar to the electron beam lithography. In the 
MLIBL systems, ions emitted from the ion source is firstly accelerated, condensed, 
and focused by the ion optics; then the fine beam is scanned across the sample surface 
and follows the user-designed pattern by the deflection coils and beam blanking 
plates, which are further controlled by computer. Masked ion beam lithography is a 
technique where a broad beam of energetic ions floods a stencil mask and transmitted 
beamlets transfer the mask pattern to resist on a substrate. In the masked ion beam 
lithography and projection ion beam lithography systems, parallel beams are formed 
and exposure is performed with a prefabricated mask. IBL has a depth-of-field up to 
20,000 times larger than the minimum feature size; the parallel exposure method also 
offers high-throughput potential; thus, it is very attractive for prototyping and 
manufacturing nanoelectromechanical systems over the steep topography of 
micromachined silicon wafers (Cui 2005; Yao 2010; Loeschner et al. 2002). 

The ion projection focused ion beam direct writing is a promising technique to 
achieve high-throughput fabrication at nanometer scale. IMS Nanofabrication GmbH, 
based on the former 4x, 5x and 10 x reduction ion projection systems, developed a 
projection focused ion multi-beam tool (PROFIB), which has a 200 x ion-optical 
reduction, offering interesting applications for micro and nano technology. The 
principles of the PROFIB system are schematically shown in Fig. 8c. Argon gas is 
fed to a multi-cusp ion source coupled with a proper extraction system and E x B 
mass filter. An electrostatic condenser optical system generates a homogeneous broad 
ion beam, illuminating a stencil mask membrane which constitutes an objective plane 
for the following electrostatic reduction optics. As the stencil mask is part of the 
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electrostatic reduction optical system, a diverging electrostatic lens effect can be 
achieved which greatly reduces the field curvature near the substrate plane. There is a 
cross-over within the multi-electrode optics, which is enlarged using aberration, thus 
minimizing stochastic Coulomb interactions. 

The ion source extraction optics is properly shaped so that the virtual ion source 
has a diameter of -10 pm. Therefore a point in the stencil mask is irradiated from 
ion beam cone of very small angle which leads to a numerical aperture (NA) of the 
optics as small as about 10 -5 rad. Whereas a projection system is operated with 
100 keV electrons at this small, NA would have a diffraction-limited resolution of 
several 100 nm; the operation with ions has decisive advantages. In the same system 
using 50 keV Ar+ ions, the diffraction- limited resolution is about 1 nm. Therefore 
optics of an ion projection lithography system can be based on extremely small NA 
which is the key to achieving large exposure fields. 

A concept for a 200 x reduction projection focused ion multi-beam tool devel- 
oped by IMS nanofabrication. There is a difference between the Micro-PROFIB 
and Nano-PROFIB though the basic principles are the same. Micro-PROFIB is 
operated in the maximum current mode, whereas Nano-PROFIB is operated in the 
maximum resolution mode. The PROFIBs have potential applications in the fields 
of micro- and nano-electronics, MEMS and NEMS, functionalized nano-materials, 
sensor and actuator systems, and biomedical nanotechnology. 

Using an ion-optical system with 200 x reduction offers the possibility to achieve 
high ion beam current densities at the substrate to be modified, with respect to 
topographical, morphological, chemical, or electronic contrast. To achieve 1 pm 
resolution at writing currents typically required for MEMS fabrication (-1 pA 
mode), 200 pm openings in a stencil mask plate (e.g., 50 pm stainless steel plate) 
are used, which can be simply fabricated with, e.g., micro-erosion techniques. On the 
other side, the same optical setup is able to resolve <10 nm features (-10 nA mode) 
with stencil mask openings of -1 pm at comparably lower current through the 
column. Masks with 1 pm openings can be fabricated with silicon technology as 
used in any conventional semiconductor fabline. Due to the high reduction factor, 
mask damage is effectively reduced (40,000 x). To extend stencil mask lifetime, 
protection coatings are used to further reduce the sputter and implantation damage. 

Concerning the Nano-PROFIB, the aim is to obtain sub- 10 nm resolution 

o 

within a 20 pnT exposure field. This means that there will be the possibility to 
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expose, e.g., one million 10 nm dots in less than 0.5 s for dose of 10 ions/cnT in a 
single shot. The pattern will be distortion-free, due to the use of a pre-contoured 
stencil mask. In particular the combination of top-down structuring with selective 
bottom-up self-assembly techniques is the key to implement novel patterning for a 
large variety of emerging nanotechnology products in the field of nano-optics, 
nano -magnetics, etc. 

Stepping and scanning exposure modes are realized. Implementing multipole 
lens electrodes in the column, electrostatic step exposure (ESE) techniques will add 
to the versatility for rapid prototyping. This enables the realization of arbitrary 
patterns with one single stencil mask, thus eliminating the need of complementary 
stencil mask overlay. ESE is achieved by fourfold exposure, through electrostatic 
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Fig. 9 (a) Schematics of the mask for electrostatic step exposure (ESE) and (b) the exposure 
sequence: 1st (red), 2nd (green), 3rd (yellow), and 4th (purple) 
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Schematics of ion projection focused ion beam system for direct nanofabrication 


shifts of the image with ~0.1 nm precision (Fig. 9). The machining strategy for a 
certain application will be adjusted to the particular structures to be transferred. 

PROFIB can also be used to perform on projection direct structuring, which offers 
a plurality of interesting applications in micro and nanotechnology, including ion 
beam erosion, ion beam assisted deposition, implantation, and other ion beam mod- 
ifications of matter (Fig. 10). As the total beam current acting parallel on the substrate 
is targeted to 1 pA/1 pm resolution and 10 nA/<10 nm resolution, respectively, the 
potential throughput of PROFIB is 2-3 orders higher than for present (single -beam) 
FIB technology. This high productivity (despite the relatively low source brightness 
compared to a liquid metal ion source) (Scipioni et al. 2000)) is based on the novel 
large-field 200 x reduction optical system, which allows efficient use of the current 
density extracted from a multi-cusp ion source, resulting in an enhanced productivity 
on an exposure area of up to 100 pm image field size (Loeschner et al. 2002). 
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Broad Beam Instruments for Nanofabrication 

Broad ion beam systems have been widely used in semiconductor industrials and 
research establishments with applications for ion-induced etching, deposition, and 
implantation. Using precision-controlled streams of different types of ionized gases 
(ion beams), removal of material coating a wafer (etching or ion milling) or deposit 
more material upon it (deposition) can be realized much more efficiently than the 
focused ion beam techniques. By using a combination of ion beam etching and 
deposition, devices with feature size in deep submicron or nanometer scale features 
can be achieved (Liu 1995). 


Ion Beam Etching Systems 

The etching by the approach of dry ion milling, reactive ion beam etching, or 
chemical-assisted ion beam etching has advantages over the wet chemical process. 
In simple terms ion beam milling can be viewed as an atomic sand blaster. In the ion 
beam etching process, ion particles are accelerated and bombard the surface of the 
target work while it is mounted on a rotating sample stage inside a vacuum 
chamber. The target work is typically a wafer, substrate, or element that requires 
material removal by atomic sandblasting. 

As with any etching process, a selectively applied layer of protection material 
(photoresist, metal or dielectric, etc.) is applied to the substrate prior to its intro- 
duction into the ion miller. The protection layer, protects the underlying material 
during the etching process, which may be up to four hours or longer, depending 
upon the amount of material to be removed and the etch rate of the materials. 
Everything that is exposed to the collimated diameter of the ion beam etches during 
the process cycle. Since the etch rate of the protection layer is lower than that of the 
material that is being etched, so pattern can be transferred to the substrate when the 
process is complete. Different types of photoresist can be used depending on 
the type of metal and the amount of material to be removed. 

The impinging ions strike the target materials while they rotate. This ensures 
uniform removal of waste material resulting in straight side walls in all features 
with zero undercutting. This leads to a perfectly repeatable circuit time after time. 
The precision and attendant repeatability is ultimately the key strength of the 
wide collimated ion beam milling process. Other methods of etching or cutting 
such as the chemical process or laser simply do not deliver the same level of precision 
that an ion beam etch can. Furthermore, some materials such as platinum cannot 
be etched effectively using a chemical process. The ion beam milling process comes 
as close as possible to a universal etching solution. Actually, the impact area on the 

nn o 

substrate of one impinged individual ion is about 10 ~ cm , and the impact time 
is about 10 s. Usually, the atomic binding energy of a certain material is around 
1-3 eV; thus, the ion milling is the atomic scale removal of layers of materials. The 
vertical nature of the incident ions offers the high anisotropy in the milled couture 
of the pattern. 
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Fig. 1 1 The schematic diagram shows a simplified view of a typical ion beam mill 


A typical simple view of an ion beam miller is shown in Fig. 11. Argon ions 
contained within plasma formed by an electrical discharge are accelerated by a pair 
of optically aligned grids. The highly collimated beam is focused on a tilted work 
plate that rotates during the milling operation. A neutralization filament prevents 
the buildup of positive charge on the work plate. As noted in the picture, the work 
plate is cooled and rotates so as to ensure even uniformity of the ion beam 
bombardment. The work plate can be angled to address specific requirements, but 
it usually sits at an 8° to 10° to the ion beam. 

One of the widely provided ion beam systems is the Oxford IonFab 300 + (the 
image of the system and the attached SIMS. It has dual RF sources for ion beam 
etch and deposition. The etch process is done using Ar ion milling. The secondary 
ion generated during the etch process is monitored in situ using a Hiden SIMS 
(Secondary Ion Mass Spectrometer) probe. As a result, accurate end point control of 
the etching process is possible. After etch, the device can be encapsulated by 
alumina deposition without breaking vacuum. 


Ion Beam Deposition Systems 

Ion Beam Deposition (IBD) is a process of applying materials to a target through 
the application of an ion beam. An ion beam deposition apparatus typically consists 
of an ion source, ion optics, and the deposition target. Optionally a mass analyzer 
can be incorporated. IBD uses an energetic, broad beam ion source carefully 
focused on a grounded metallic or dielectric sputtering target. Material sputtered 
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from the target deposits on a nearby substrate to create a film. Most applications 
also use a second ion source - ion assist source, or IAD - directed at the substrate to 
deliver energetic noble or reactive ions at the surface of the growing film. IAD is 
particularly desirable when growing metal oxide and metal nitride films as it 
improves the stability, density, dielectric, and optical properties of the films. The 
ion sources are “gridded” ion sources of the Kaufman type and are typically 
neutralized with an independent electron source. Many applications use an assem- 
bly of multiple targets that can be indexed into position to create multilayer thin 
him devices. IBD processing yields excellent control and repeatability of him 
thickness and properties. Process pressures in IBD systems are ~10 -4 Torr. 
Hence, there is very little scattering of either ions delivered by the ion sources or 
material sputtered from the target of the surface. Compared to sputter deposition 
using magnetron or diode systems, sputter deposition by IBD is highly directional 
and more energetic. In combination with a substrate hxture that rotates and changes 
angle, IBD systems deliver a broad range of control over sidewall coatings, trench 
filling, and lift-off prohles. 

In the ion source, source materials in the form of a gas, an evaporated solid, or a 
solution (liquid) are ionized using. For atomic ion IBD, electron ionization, held 
ionization or cathodic arc sources are employed. The later one is used particularly 
for carbon ion deposition. Molecular ion beam deposition employs electrospray 
ionization or MALDI sources. The ions are accelerated, focused, or dehected using 
high voltages or magnetic helds. Optional deceleration at the substrate can be 
employed to dehne the deposition energy. This energy usually ranges from a few 
eV up to a few keV. At low energy molecular ion beams are deposited intact (soft 
landing). At a high deposition energy, a molecular ion fragments, or atomic ions 
penetrate further into the material, a process known as ion implantation. Ion optics 
such as radio frequency quadrupoles can be mass selective. In IBD they are used to 
select a single or a range of ion species for deposition in order to avoid contami- 
nation. In particular for organic materials, this process is often monitored by a mass 
spectrometer. The ion beam current, which is a quantitative measure for the 
deposited amount of material, can be monitored during the deposition process. 
Switching of the selected mass range can be used to dehne a stoichiometry. 

The famous ion beam deposition suppliers include the 4 Wave Ltd, and Veeco 
(Veeco’s SPECTOR® Loadlock IBD System NEXUS LDD-IBD System, 
SPECTOR Large Area PlanetarylBD System and SPECTOR IBD System) systems 
can be used to deposit dielectric, metal nitrides, metal oxides, metals, and other 
materials such as polysilicon, amorphous silicon, diamond-like carbon, SiC, and 
SiGe. Co-deposition and multi-targets are also available. 

The 4W-LANS laboratory alloy and nanolayer sputtering system addresses the 
needs of researchers in demanding, cross-disciplinary areas of dielectric, metallic, 
magnetic, superconducting, and semiconducting thin him development. The system 
provides previously unobtainable degrees of layer thickness control, interface 
control, alloy composition control, and material hexibility in a small, cost-effective 
package. A new sputtering technology called Biased Target Deposition (BTD) 
pioneered by 4 Wave employees is at the heart of this revolutionary new product 
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for thin film materials development. BTD is a hybrid between Ion Beam Deposition 
(IBD) and conventional sputter deposition which combines the best of each tech- 
nique. BTD is uniquely suited to demanding applications requiring atomically 
engineered thin films and interfaces as it offers a large range of process pressures, 
control of adatom energies, and excellent uniformity and repeatability. The features 
include that it uses low-energy plasma source with high output; samples can be 
hosted to 6 x A " -biased target carousel with independent target biasing for alloy 
composition control; it has very good processing uniformity (<3 % nonuniformity 
over 75 mm) and can be employed to do substrate plasma cleaning, etching, 
oxidation, and nitridation. The sample holder is water cooled and can be rotated 
during processing with a shuttered stage; it can also be heated up to 600 °C (http:// 
w w w .4 waveinc . com/dslans .html) . 


Ion Beam Implantation Systems 

Ion implantation is a material engineering process by which ions of a material are 
accelerated in an electrical field and impacted into a solid; it is a special case of 
particle radiation. This process is used to change the physical, chemical, or electri- 
cal properties of the solid. Ion implantation equipment typically consists of an ion 
source (where ions of the desired element are produced), an accelerator (where the 
ions are electrostatically accelerated to a high energy), and a target chamber (where 
the ions impinge on a target that the material to be implanted). 

Each ion is typically a single atom or molecule, and thus the actual amount of 
material implanted in the target is the integral overtime of the ion current, which is 
called the dose. Typical ion energies are in the range of 10-500 keV. Energies in the 
range 1-10 keV can be used, but result in a penetration of only a few nanometers or 
less. Energies lower than this result in very little damage to the target and fall under 
the designation ion beam deposition. Higher energies can also be used: accelerators 
capable of 5 MeV are common. However, there is often great structural damage to 
the target. 

The energy of the ions, as well as the ion species, and the composition of the 
target determine the depth of penetration of the ions in the solid. Under typical 
circumstances, the average penetration depth ion ranges is between 10 nm and 
1 pm; thus, ion implantation is especially useful in cases where the chemical or 
structural change is desired to be near the surface of the target. Ions gradually lose 
their energy as they travel through the solid, both from occasional collisions with 
target atoms and from a mild drag from overlap of electron orbitals. 

The applications of ion implantation are quite broad. One of the most common 
applications is doping a semiconductor by ion implantation. Dopant ions such as 
boron, phosphorus, or arsenic are generally created from a gas source, so that the 
purity of the source can be very high. Ion implantation can also be used to prepare 
silicon on insulator (SOI) substrates. One prominent method for preparing SOI 
substrates from conventional silicon substrates is the SIMOX (Separation by 
IMplantation of OXygen) process, wherein a buried high-dose oxygen implant is 
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converted to silicon oxide by a high-temperature annealing process. On the other 
hand, Mesotaxy, the growth of a crystallographically matching phase underneath 
the surface of the host crystal (compare to epitaxy, which is the growth of the 
matching phase on the surface of a substrate), can be achieved by ion implantation. 
In this process, ions are implanted at a high enough energy and dose into a material 
to create a layer of a second phase, and the temperature is controlled so that the 
crystal structure of the target is not destroyed. The crystal orientation of the layer 
can be engineered to match that of the target, even though the exact crystal structure 
and lattice constant may be very different. Other applications include metal 
finishing and ion beam mixing. Ion beam mixing is the process of mixing up 
atoms of different elements at an interface, which aims at achieving graded 
interfaces or strengthening adhesion between layers of immiscible materials. 
However, the problems with ion implantation are mainly the crystallographic 
damage, amorphization, sputtering, and ion channeling. 


Summary and Future Perspectives 

In short, after the fast development and innovation over the past a few decades, ion 
beam technologies and facilities, based on both focused and broad beams, have 
been intensively used and recognized. Various types of configurations, with unique 
and superior performance and functionalities, have been created one after another. 
As for the focused ion beam systems, the developing started from a single-beam to 
dual-beam and then to multi-beam systems to meet the requirements to see and 
operate at nanoscale with improved resolution, controllability, precision, flexibility, 
and functionality. 

However, to keep up the breathtaking development of nano science and nano- 
technology in recent years, controlled and reproducible fabrication of nanostruc- 
tured materials will constitute one of the main industrial challenges for the next 
10 years. To overcome severe limitations of existing nanofabrication techniques, 
ultimate usable resolution of an FIB system still requires improvement to allow 
quite simple, direct, clean, and reproducible nanofabrication close or below the 
10 nm level. Thus the directions of FIB systems developing and innovation may lie 
to (i) perfection of the ion column to achieve higher resolution with higher intensity 
ion beam source though less energy dispersion and developing more types ion 
sources for broader applications; (ii) controlling the number and position of ions in 
the ion beam to prompt the developing of quantum computing, single atom devices, 
single photon emitters, and single atom probes that based on spins; (iii) increasing 
of the ion beam scanning speed, mainly to explore high-performance beam blanker 
and reflector; (iv) developing of high and efficient ion projection focused ion beam 
system, which has the potential to be the industrial predominant nanofabrication 
technology; (v) improvement of material growth capability to provide controllable 
substrate temperature, advanced pattern generator and more gas precursor types, 
and controllable gas molecule distribution for efficient, repeatable, uniform, and 
tunable material growth in large scale; and (vi) expanding of the functionalities, for 
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instance, to integrate micro-/nano manipulator with force feeding-back sensor, 
argon or oxygen ion guns, cryostat, 3D pattern generator and analyzer, for a 
combined super-machine with imaging, fabrication, manipulation, and characteri- 
zation capabilities in one. 
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Abstract 

In deterministic figuring process, it is critical to guarantee high stability of the 
removal function as well as the accuracy of the dwell-time solution, which 
directly influences the convergence of the figuring process. As an ultraprecision 
optical machining technique, ion beam figuring (IBF) has unique features, such 
as a highly controllable, stable, and noncontact material removal process, atomic 
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scale material removal capability, etc., well to satisfy this requirement. Cur- 
rently, IBF is widely used to machine ultraprecision optical elements which is 
used in lithography, space observation, and so on. This chapter has three sections 
to describe the IBF technology. Some important research results, summaries, 
and applications come from our research group. The fundamental theory of IBF 
is introduced firstly, which includes its principles, its distinctive performances 
and advantages, the current status and future of IBF, etc. The main content of this 
chapter is to discuss the key technology of IBF, such as material removal 
function modeling, contouring algorithm, analysis of correcting ability, opti- 
mum material removal of IBF, realization of IBF technique, and so on. In the 
third section, the challenges of IBF technical development and its new applica- 
tions are also discussed in detail. They are (1) high-gradient optical surface 
figuring by IBF, (2) high thermal expansion and crystal optics figuring by IBF, 
and (3) supersmooth surface figuring and micro-roughness evolution. Finally, 
some conclusions and suggestions are summed. 


Introduction 

The classical process chain for the fabrication of precision optical component 
consists of three steps. (1) The generating process includes grinding and milling, 
which is used to generate a precision optical surface with micrometer range of 
contour accuracy but still with microscopic roughness. This process is so-called 
optical surface forming (shaping). (2) The polishing process is used to remove 
roughness, subsurface damage, and surface error under corrective improvement of 
the shape accuracy by speed/pitch polishing. After this process, the surface usually 
has been smoothed to optical quality, but it is difficult or time-consuming to reach 
the specified surface accuracy since tool wear, edge roll-off effect, force loading of 
workpiece, etc. (3). The surface finishing process is a necessarily additional step to 
deterministically correct the surface contour to its desired accuracy. The main role 
of this process is to further improve the surface accuracy. In this process, the typical 
machining methods include CCOS (computer-controlled optical surfacing), MRF 
(magnetorheological finishing), IBF (ion beam figuring), and so on. The ion beam 
figuring (IBF) has the best machining precision. The characteristic features of each 
process step are listed in Table 1 (Braunecker et al. 2007). 


Fundamental Theory of IBF 
Principles of IBF 

Ion beam figuring (IBF) is a technique used for removing material from a surface by 
transferring kinetic energy from impinging neutral particles which is a nonabrasive 
technique for fine correcting the contour of precision optics. This figuring process 
utilizes a Kaufman-type ion source where plasma is generated in a discharge 
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Table 1 Overview characteristics of different processes (typical in production) 


Process 

Shape 

deviation 

(nm)PV 

Surface 

roughness 

(nm)RMS 

Advantage 

Limits 

Grinding 

1,000 

50-1,000 

Fast generating process 

Subsurface 

damage 

Speed/ 

pitch 

polishing 

300 

0.2-0. 5 

Very low surface 
roughness, fast polishing 
process 

Correction of 
local surface 
deviation 

CCOS 

30 

0.5 

30 years experience 

Tool wear, edge 
roll-off 

MRF 

10 

0.3 

No edge roll-off, no tool 
wear, low damaged 
surface 

Center artifact for 
y-cp tool path 

IBF 

5 

0.2 

No edge roll-off, no tool 
wear, no damaged surface 

Low removal rate, 
vacuum 


magnetic screen accelerate 

column cathode anode grid grjd 


water cooled 
shell 


cathode 

power 



anode 

power 


screen grid 
power 


accelerate 
grid power 


neutralizer 

power 


ion beam 


Fig. 1 Scheme of Kaufman ion source 


chamber by controlled electric potential (Kaufman et al. 1977). Its sketch graphic is 
shown in Fig. 1, where charged concave-type grids extract and accelerate ions from 
the chamber. The accelerated ions form a directional ion beam with Gaussian-type 
distribution. A neutralizer outside the grids supplies electrons to the directional ion 
beam to necessarily neutralize the beam to prevent charging optical component and 
to avoid bending the beam by extraneous electromagnetic fields from a charged 
workpiece or from other source. When the ion beam hits the optical component, 
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Fig. 2 Scheme of the IBF process 


the amount and distribution of material sputtered is a function of the material of 
optical component, the incidence energy, distance, and angle of ion beam (Sigmund 
1973; Bradley and Harper 1988). 

In IBF process, a non-varied ion beam energy maintains a constant sputtering rate 
(or material removal rate) and profile which is very important for optical deterministic 
figuring method to gain a constant beam removal function. Then, this temporally and 
spatially stable ion beam is held perpendicular to the optical surface at a fixed distance 
with ion source controlled by a 5 -axis CNC system shown in Fig. 2 (Ion Beam 
Finishing Technology for High-Precision Optics Production). The optical determin- 
istic figuring method described here is assumed a constant beam removal, so that the 
process can be represented by a convolution operation based on the CCOS (computer- 
controlled optical surfacing) theory. If not a constant beam removal, its corrections 
would be required to model the process which will be discussed in the section “High- 
Gradient Optical Surface Figuring by IBF.” 


The Typical Features and Its Purpose of IBF 

Using ion beam to figure optical component, it has the following features and 

advantages: 

1 . High figuring precision. Based on the physical sputter effect, the material removal 
rate is in molecular or atomic level, so IBF figuring precision can be maintained in 
nanometer or sub-nanometer precision in the condition of ion beam stability. 

2. Highly predicable and stable. Compared with traditional abrasive optical figur- 
ing methods, where chemical interaction (hydrated layer) and mechanical inter- 
action (scratching) remove the material of optical component, IBF is figuring 
with ions, such as argon, which is like sandblasting. Its sputter rate can be 
accurately calculated by the physical law of elastic and inelastic scattering. 
Since no a chemical interaction, ion beam parameters that can be easily held 
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in constant are the main reasons for the stability of IBF process. These features 
make the figuring process highly predictable and stable. That allows the figuring 
process to rapidly converge to the desired specifications and save significant 
time and cost. 

3. Noncontact figuring. The noncontact nature eliminates the problems of tool 
wear, edge roll-off effect, loading force on the workpiece, and surface/subsur- 
face mechanic damage generated by mechanical interaction in the conventional 
grinding and polishing methods. This feature is very useful for figuring very thin 
and lightweight optical component. 

4. Good material removal function. The two main advantages of material removal 
function are Gaussian distribution and constant profile. The Gaussian distribu- 
tion removal function is an ideal function for figuring process which can 
simplify the dwell-time calculation and improve figuring precision. 

5. No or minimized support structure print effect. The so-called support structure 
print effect is the honeycombed support structure of the back side that is visible 
on the front optical surface for lightweight structure optical component, such as 
lightweight SiC optics. This is troublesome problem for traditional figuring 
process because of the loading force impact. 

In addition, the ion beam figuring process is a clean figuring process which 
avoids the influences of polishing liquid and polishing abrasive in the conventional 
figuring process which usually lead to generate redeposition layer and insert 
polishing abrasives. IBF is an excellent complement to conventional figuring to 
gain very high optical surface and subsurface quality. However, it has some 
shortcomings, such as working in a vacuum chamber; component heating because 
the ions hit the optical surface with several hundreds to thousands eV, some of them 
being stopped by absorption which heats the workpiece; difficulty of improving 
surface roughness because of the ion “sandblasting” effect at the atomic level 
(recent work suggests that it could even be used to improve the roughness); slow 
material removal rate with normal values from tens to several hundreds nanometer 
per minute; etc. 


Description of IBF Process 

The basic flow of IBF process is shown in Fig. 3. Prior to any figuring, the material 
removal function (shown in Fig. 3) must be determined for the process as it is the 
base of deterministic figuring process. About how to gain this function, its detail 
will be discussed in the section “Removal Function Modeling and Analyzing of 
IBF.” The material removal function (or called beam function), analogous to a point 
spread function, provides a depth removal rate distribution as a function of radial 
distance from the ion beam center. 

The ion beam figuring process begins with measuring the contour of the optical 
component by interferometer, such as ZYGO series of interferometer, which results 
in an x-y array map of relative surface height values. Then to gain a removal map, 
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Fig. 3 Flow of IBF process 

the difference between the measured surface contour and the goal surface contour, it 
describes the material to be removed. Based on the removal function and removal 
map, a dwell-time calculation is the third step in this flow. In CAM step, its one aim is 
to make the ion beam moving routine and its velocities to realize dwell function 
according to the machine motion performance, such as axial maximal velocity and 
acceleration. The other aim is to automatically generate control codes or NC codes to 
the CNC system when the controller of an IBF machine is a standard CNC system. 
For figuring, the optical component is placed in the vacuum chamber, and the ion 
beam raster scans over the surface of optical component according to the dwell time 
and the moving routine. In the IBF process, the material removed and its distribution 
are represented by the convolution of the removal function and the dwell time: 

R(x,y)=B(x,y)*T(x,y) (1) 

where R(x , y) is the material to be removed, B(x , y) is the material removal function, 
and T(x, y ) is the dwell time. If the material to be removed and a material removal 
function of ion beam were known, the dwell time could be solved by deconvoluting 
Eq. 1. This is the fundamental of IBF process. 

There are some key problems that would be paid more attention to realize IBF and 
gain good figuring results. They are listed as (1) how to select suitable sizes and beam 
parameters of ion beam that is very important to control the low-, middle-, and high- 
spatial-frequency errors of surface contour; (2) how to correctly evaluate the material 
removal function; (3) how to reasonably process the measuring contour data to make 
ion beam move smoothly as soon as possible to decrease the high-spatial-frequency 
errors generated; and (4) how to plan suitable ion beam moving routine including the 
suitable velocity, acceleration, pitch of raster scanning, and so on. Those four 
problems will be discussed in detail in the section “The Key Technology of IBF.” 
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Current Status and Future of IBF 

The earliest work is done by Meinel et al. (1965), which is the first report to apply 
ion beam to polish optical glass. Since the over high beam energy generated by ion 
source, the polished glass surface was seriously damaged. The subsequent success- 
ful work on IBF should firstly thank Kaufman who invented a new low-energy ion 
source, so-called Kaufman ion source. Early work on IBF was re-performed using 
Kaufman ion source by Gale at the end of 1970s (Gale 1978). This work was deeply 
expanded at the University of New Mexico in the USA by S. R. Wilson et al. (1987, 

1989) . They did many initial figuring experiments on fused silica, Zerodur, and 
copper optical component with 2.54 cm Kaufman ion source. Their representative 
result is to figure a 30 cm fused silica optics from contour precision 0.4 IX RMS to 
0.042X RMS (X = 632.8 nm) in one figuring cycle with 5.5 h. Lynn N. Allen 
et al. originally developed IBF system at the Eastman Kodak Company 
in 1988 and became operational in 1990 (Allen and Keim 1989; Allen and Roming 

1990) . This IBF system is designated for figuring large optics with up to 
2.5 m x 2.5 m x 0.6 m of sizes. There are about 65 segments of 10 m Keck 
primary mirror was successfully realized their final figuring with 15 months. And 14 
segments of them were measured after final figuring. Their average figuring accuracy 
was improved from 0.347 pm to 0.062 pm, and the maximal one-cycle error 
convergence ratio is 17.5 and the average value 5.6. The highly efficient figuring 
capability of IBF was successfully shown in Kodak Company which opened a new 
era for optical figuring technology (Allen and Roming 1991; Allen and John 1991). 

Another representative work is the new Precision Ion Machining System (PIMS) 
research facility at NASA’s Marshall Space Flight Center at the beginning of 1990 
(Drueding 1995), which is focused on the figuring of small optics using 3 cm ion 
source. Since the ratio of ion beam to the size of small optics is greater, figuring a 
smaller optics is more difficult. Currently, the small optics figuring by IBF is also an 
interesting and valuable research work in optical figuring (Fawcett 1994; Shanbhag 
et al. 2000). 

Besides the above work, there many IBF systems were setup at the end of 1990s, 
such as CSL lab in Belgium (Took et al. 1999), IOM & NTG in Germany (Fruit 
et al. 1999; Schindler et al. 2000), INAF-OAB in Italy (Ghigo et al. 2001), and so 
on. Cannon Co. Ltd in Japan set up its IBF system for EUVAL in 2004 (Ando 
et al. 2004). NUDT (National University of Defense Technology) in China set up a 
series of IBF system in 2006, 2010, and 2011 (Lin et al. 2007; Yuan et al. 2011). 
One of these IBF systems is listed in Fig. 4. In addition, INAF-OAB in Italy and 
REOSC in France set up large IBF systems for 1.7 m- and 2 m-diameter space 
optics fabrication, respectively (Ghigo et al. 2009; Roland 2010). And also a large 
IBF for figuring 1.6 m diameter optics is building in the NUDT of China. 

Currently, IBF method is universally used to fabricate ultrahigh-precision opti- 
cal component, such as optics of DUV and EUV lithography, large space optics that 
many of them are the stitched primary optics whose segments are required no or 
very small error of edge roll-off effect. The interesting and focus research contents 
mainly include (1) figuring supersmooth optics, in which their key problems are 
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Fig. 4 One of IBF systems in NUDT 


how to control the middle- and high-spatial-frequency errors and how to hold or 
improve the surface roughness; (2) dwell-time calculation and figuring technique; 
(3) the heat effect control that is very serious for high thermo expand material and 
crystalline, such as BK7, BK9, CaF 2 , etc.; and (4) the optical material fabrication 
properties. The surface properties of the optical component influence the effective- 
ness of the process, and on the other hand, not all of the optical material can be 
figured by IBF. 


The Key Technology of IBF 

The computer-controlled optical surfacing (CCOS) method is a deterministic opti- 
cal fabricating method based on the known figuring process model. Its ideal figuring 
process is based on the following several assumptions: 

1 . Material removal is linear and proportional to the dwell time. 

2. Respective material removal rate is constant over the entire surface. 

3. Material removal function is non-variant with time. 

4. Material removal function is insensitive to position on the optical surface. 

Analyzing these assumptions, the core is to gain a desired non-variant material 
removal function or to know its varied property. For conventional deterministic 
figuring processes, it is very difficult to know a precise material removal function 
because of some of the physical constraints, such as optic geometry and construc- 
tion, aspheric departure, tool wear, edge roll-off effect, and loading force on optical 
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Fig. 5 A typical material removal function of IBF 

component. These constraints limit the fabricated error convergence rate and the 
final fabricated contour accuracy. With IBF, it is nearly independent of the above 
constraints since the “figuring tool” is a noncontact and controllable compliant tool 
(Shengyi and Xuhui 2010). It is can be said that the IBF is reasonably valid to these 
assumptions in a wide range of material applications which is proved by many 
actual applications. It is acknowledged as the most deterministic optical figuring 
method. 


Removal Function Modeling and Analyzing of IBF 

In IBF, a typical material removal function (or called beam removal function, BRF) 
b(x , y) shown in Fig. 5 is defined as the material removal profile or “footprint” on 
optical surface generated by the projected ion beam per unit time. In most cases, the 
removal function distribution is a circular Gaussian shape, which is dependent of 
the ion beam size and intensity, the bombarded material. When these are deter- 
mined, the removal function can be described as (Lin et al. 2007) 

x 2 +y 2 

b(x,y) = A • e . (2) 

where A is the peak removal rate and o is the Gaussian parameter. It can be also 
expressed as 



where B is the volumetric removal rate; it can be calculated by 
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Fig. 6 Evaluating parameters of Gaussian-type removal function 




o 1 

B is in unit of pm • mnT • min . It means the volumetric removal of the BRF, and 


B = 2jio 2 A. (5) 

There are two kinds of method to determine a BRF. One is an empirical method by 
optical or electro -probing. The electro-probing method is used to measure the ion beam 
current distribution by Faraday cup to get a BRF profile; its drawback is not to contain 
the correlation with the actual removed material, but it is a good method to regularly 
monitor the stability of BRF in the actual figuring process. Compared with this method, 
that of the optical probing is more efficient because it directly uses the ion beam to 
locate and bombard an optical surface in a given time to generate a “footprint.” The 
BRF mathematical expression is obtained from the real interferometric evaluations to 
this “footprint.” The other is mathematical method based on the ion sputtering theory. 

With the optical probing method, the key problem is to correctively evaluate the 
parameters of Gaussian-type BRFs: A , a (or J), and B , as shown in Fig. 6a. There 
are two ways to do this. The common way is to hit 4-8 “footprints” on an optical 
planar surface and then to evaluate the BRF Gaussian-type function with the 
average of these “footprints.” Another way is to scan a trench on planar optical 
surface with travel speed v (mm • min -1 ) and hold the ion source parameters 
constant; the transverse material removal profile along the trench can be approxi- 
mated as a Gaussian function r s (y ) as shown in Fig. 6b: 

r s (y) = A s e~ (6) 

where it is assumed that the ion beam scanning direction is x and its transverse 
direction is y. o s is the standard deviation; it equals to o\ 



< 7 . 


( 7 ) 


36 Ion Beam Figuring Technology 


The peak removal rate A s is 


1353 



Therefore, the BRF key parameters A and a can be estimated by the trench 
transverse profile parameters A s and o s in Eqs. 6-9. 

The volumetric removal rate B can also be estimated as 


B = vV2no s A s . (10) 

However, the estimated B is less believable because of the estimated errors of A s and 
o s . So, a better method to estimate the volumetric removal rate B is to define r v ( y) as 



The both sides integral form of Eq. 1 1 is 


r* 






Then, B can be estimated by 






Here, r s (y ) is the interferometric data of the trench transverse profile. It is proved 
that it is more precious to calculate B with Eq. 13 than that of Eq. 10. The other 
advantage with this method is that the uncertainty of RBF can be evaluated at the 
same time by analyzing the scatters on the width and depth of the profile of trench. 

The mathematical method to model the material removal function is based on 
the Sigmund sputtering theory (Sigmund 1973). According to this theory, the 
energy deposition of incident ions follows a Gaussian distribution as shown in 
Fig. 7. The energy deposition at point O as the result of cascade collision when ions 
hit the surface at point A and travel along the z-axis can be written as 



where (x, y, z) is a coordinate system with origin at the energy deposition center and 
the z-axis along the projected path, e is the total energy deposited, a is the average 
incidence depth of the ions, and a and // are the Gaussian distribution parameters 
along and transverse to the beam direction, respectively. 
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Fig. 7 The sketch map of energy deposition 


It is known that the material removal rate and profile relate to the parameters of 
the IBF process. Besides these, they also are influenced by the curvature of optical 
surface and the beam incidence angle which would be paid attention in the fine 
figuring process. With Sigmund sputtering theory, Bradley and Harper deduced the 
etching rate on an arbitrary surface z = h{x , y) (Bradley and Harper 1988). When a 
uniform ion beam bombards a surface at incidence angle </>, its etching rate is 


v(<p,c u c 2 ) = (f /n)Y 0 ((p)[ cos (/) - Ty((f>)ci -T 2 (<p)c 2 ] (15) 

where Y 0 {(t)) is the sputtering yield for a flat surface 



where / is the ion flux, n is the amount of atoms at unit volume in the amorphous 
solid, <fi is the incidence angle, and p is the proportional constant relating the power 
deposited at the bombarded point to the rate of erosion. The other coefficients are 
defined as 






sin (f) cos <p. 


,A 2 

3 + — I cos cp, 


B 


1 
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Fig. 8 Ions hitting an 
optical surface at the 
incidence angle 4 > 


A 


Ion Beam 



From Eq. 17, it can be seen that the coefficients c 1 and c 2 depend on the curvature of 
the surface and the coefficients (</>), r 2 (0), and Y 0 ((/)) depend on the incident 
angle <p. Therefore, the relationship of etching rate function with incidence angle 
and bombarded surface curvature can be obtained using Eqs. 15-17. 

Although Eq. 15 is the etching rate of uniform ion flux deduced in the micro area, 
it is also practicable in macro area with ion beam diameter in millimeter or 
centimeter scale since the local point flux could be considered identical. In Fig. 8, 
an ion beam with flux / bombards the surface at point O by the incidence angle </>; 
the distribution of flux is approximately f(x cos <fi, y) in workpiece reference frame 
(x, y, z) according to the projection theory. Thus the theoretical model of the beam 
removal function is 


BRF(^,d,c 2 ) = {\ / n)Y 0 {4>)[cos <f> - T\(4>)c x - T 2 (4>)c 2 \f{x cos <p,y) (18) 

where cj and c 2 are 





For most of the surfaces (especially spherical surfaces), the unit of a is A which 
is not comparable in magnitude with r 0x and r 0y . Hence cj and c 2 are infinitesimals, 
but r and E 2 (</>) are finite coefficients, so Eq. 18 can be approximated by 

BRF(0,ci,c 2 ) = (l / n)Y o(tp) cos (pf(x cos (p,y). (19) 

Equation 19 shows that the contribution of the curvature of surface to the beam 
removal function can be neglected. It can be concluded that the ion beam removal 
function varies with the varying of ion beam incidence angle and its footprint on the 
oxy plane transforms from circle to ellipse with the increasing of its incidence 
angle. In the following of this section simulation and experiment on the BRF will 
be done. 
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Fig. 9 The simulation result of Ar ions vertically incident on Si0 2 


Assuming that 8,000 vertically incident Ar + with the energy of 1,200 eV were 
simulated to bombard fused silica by the SRIM program (http www srim org), then 
the parameters of ion energy distribution in Eq. 14 can be obtained. In Fig. 9, the 

o 

simulation result shows that the average incidence depth of ions is a = 48 A and 

o o 

Gaussian distributing parameters are o = 20 A and p = 13 A. 

From Eqs. 16 and 19, it is known that a theoretical beam removal function with 
arbitrary incident angle is difficult to be gained since the coefficients n, /and p in 
these equations are unobtainable. But a beam removal function with vertical 
incident angle is usually easily obtained by experiment. It is enlightened that to 
an arbitrary incident angle theoretical beam removal function, a valid way is to 
estimate it by that of vertical incident angle case which is unnecessarily known all 
above coefficients. 

To gain a theoretical beam removal function model with arbitrary incident angle 
by this experimental method, the normalized peak etching rate is introduced: 



where p ^ is the peak etching rate at incident angle (ft and p 0 is the peak etching rate 
at vertical incident angle which can be estimated by experiment. Assuming that the 
ion beam is Gaussian distribution in figuring process, the beam removal function 
with arbitrary incident angle would be written as 
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Fig. 10 The footprints of removal functions at different incidence angles 


BRF(</>) = k,j,p 0 exp - C ° S ^ 2 +>>2 j . (21) 

Figure 10 shows the simulation result of the beam removal function with 
different incident angles with Eq. 21. It is apparently shown that the incident 
angle influences the footprint shape of the beam removal function, and their 
footprints on the oxy plane change from circle to ellipse and expanse their active 
zone with the increasing incident angles. However the length of semi-minor axis 
a is invariable, and the semi-major axis b is varying with a / cos </>. 

Figure 1 1 also shows that the normalized peak etching rates depend strongly on 
the incident angle. It can be seen that the etching rate increases firstly and then 
decreases with the increasing incident angle. At about 70°, it reaches the maximum 
etching rate which is about 4.5 times than that at 0°. 

To testify this theoretical model by experiment, an experimental equipment is 
set up in Fig. 12. In order to obtain removal functions with different incident 
angles, the experiment was done on six small fused silica samples (the size is 
10 mm x 10 mm x 15 mm) with a 2 mm diameter ion beam. In this experiment, 
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Fig. 11 The normalized peak etching rates vary with various incidence angles 


Fig. 1 2 The fixing method of 
the small fused silica samples 



the samples are fixed at angles from 10° to 60° with 10° interval relative to the 
experimental planar board. The ion beam respectively bombards the surfaces of 
these samples along the vertical direction of experimental planar board. Every 
sample is bombarded 3 minuts, and the process parameters were set according to 
the modeling simulation. By this experiment, there are six footprints gained to 
estimate their beam removal functions. The one of footprints is shown in Fig. 13. 
All removal functions of this experiment are shown in Fig. 14, which validate the 
result of the theoretical analysis that the etching rates and footprints vary with 
incident angles. 
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Fig. 13 The removal footprint on a fused silica sample 



Fig. 14 Experimental removal functions depending on incidence angles 


With Gauss-Newton method to fit these experimental beam removal functions, 
their beam removal functions are obtained as shown in Fig. 15. For example, the 
fitted model at 40° is shown in Fig. 16a, and its fitted residual error is acceptable 
(Fig. 16b). The footprint eigenvalues of these removal functions are estimated 
by 6a method in mathematical statistics, which are shown in Fig. 17a with 
the semi-minor axis a and the semi-major axis b. In Fig. 17b, c, they show that 
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Fig. 15 The fitting result of the removal function 


the eigenvalues and the etching rates of this experiment are close to the theoretical 
values. These results indicate that the length of semi-minor axis is approximately 
invariable and the semi-major axis is inversely proportional to the cosine of the 
incident angle. 

From the above analysis, the following conclusions can be gained: 

1 . In IBF, the beam removal function varies with the ion beam incident angle. 

2. To figure optical surface, the best way is that the ion beam bombards the surface 
along its normal direction, which can well constantly hold the etching rate and 
footprint of beam removal function. 

3. For spherical and aspherical optical surface figuring, the good suggestion is that 
the ion source hold and motion mechanism is designed 5 axes motion mecha- 
nism to hold the ion beam moved along the normal direction of optical surface. 
For a 3 -axis motion mechanism, the beam removal function must be real-time 
correct to compensate etching rate and footprint varying with the ion beam 
incident angle. 


Contouring Algorithm for IBF 

The figuring process is represented in Eq. 1. A significant step in the process is the 
calculation of dwell-time function that controls the correction of optical surface 
error. It is known that the ion beam figuring process is a convolution process and the 
calculation of dwell-time function is a deconvolution process. 

To obtain good dwell-time function, there are three important problems that 
should be paid more attention which are (1) calculation algorithm, which calculates 
the dwell-time function distribution that will yield a desired material removal 
distribution when a specific beam removal function is applied; (2) the data process 
of optical surface edge, which influences the calculated dwell-time precision at the 
neighbor of surface edge; and (3) determining or predicting the optimal removed 
material amount or residual error result in every step of iteration, which is very 
important to gain the efficient dwell-time function and decrease the times of 
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Fig. 16 The removal function at 40° corrected with Gauss-Newton fitting method 



iteration and is very useful to control the middle- and high-spatial-frequency errors. 
About the third problem, it will be discussed in the next section. 

About the algorithms to calculate the dwell time, they can be summed up to four 
kinds: (1) Fourier transform method (the early work was done by Wilson et al.) 
(1987); (2) iterative method for finding a solution to the dwell-time function (Allen 
and Roming 1990); (3) algebra method, such as wavelet algorithm (Shanbhag 
et al. 2000) and Bayesian algorithm (Changjun et al. 2009); and (4) matrix-based 
method, such as in Carnal et al. (1992) and Zhou lin et al. (2007). 
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Fig. 17 The definition of the eigenvalues and the result of experiment: (a) the eigenvalues of the 
removal functions, (b) the changing rule of the footprint eigenvalues with various incidence 
angles, (c) the changing rule of etching rates with various incidence angles 


Rewrite Eq. 1 as 


R(x,y) =B(x,y) *T(x,y ) 




— y')T(x ' , y')dx' dy' . 



Here, T is the dwell-time function. Assuming that the IBF scanning is a raster 
routine as shown in Fig. 2, there are two figuring techniques to realize the dwell 
time in the actual IBF: (1) position mode, where T represents the time that the ion 
beam spends at a location on the optical component per unit area, and (2) velocity 
mode, where T represents the ion beam raster scanning speed of a strip unit width. 

In the position mode, the dwell-time function can either be broken into areas 
or interpreted as a time. So, the amount of removed material is proportional to 
the amount of time the ion beam must be positioned in the area which is equal to 
the integral of the dwell-time function over the area. For example, if the total 
optical component is broken into a square grid as shown in Fig. 18, the ion beam 
is centered at each square an mount of time approximately equal to the value of 
the dwell-time function at the center times the area of square, which discretizes 
Eq. 22 as 
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Fig. 18 The optical 
component broken into 
square grids 



R(x,y) = ^2J2 B {x-Xi,y - yjjThci^AAxjAyj. (23) 

i = 0 7=0 

This discretization of the removal function suggests that the figuring process can 
also be discretized in a similar way. Thereby the IBF process is represented as a 
discrete two-dimensional (2D) convolution. In actual figuring process, the optical 
component surface profile and the material removal function are provided as an x-y 
grid array from the profile interferometer, so the position mode process can be 
exactly realized with a square discrete convolution by maintaining the same 
discretization as the profile interferometer. 

Alternatively, in the velocity mode, the optical component is broken in strips, 
where each strip has a width and a velocity function associated with it. The velocity 
function is the scanning speed that the ion beam moves along the strip and is equal 
to the inverse of the dwell-time function integrated over the strip width W : 




T{x,y)ds. 



Then Eq. 22 can be approximated as 





The IBF process is represented as a discrete one-dimensional (ID) convolution. 
Each one-dimensional strip has a raster interval. For example, if the optical 
component is broken into strips parallel to the x-axis, the raster interval width is 
Ay which is the y separation between the boundaries of the strips 
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V(x,y) 


l/T(x,yi)Ay. 


(26) 


In this mode, the calculation of dwell time is more complicated than that of 
position mode since the precious material removed is related to the velocity and 
acceleration performance of the machine tool. This mode provides a partial 
discretization that has some advantages over the position mode. For the position 
mode, the ion beam remains on while it traverses from one grid point to another, 
which may cause unwanted material removed if the position time is not negligible 
compared to the dwell time in this grid area. In the velocity mode, the strip scanning 
is continuous which can avoid the unwanted material removed. In most cases, the 
velocity mode is chosen since the machine tool moves smoother which can decrease 
the generated middle or high-spatial-frequency error in the actual figuring process. 

In the following of this section, an example is given about how to realize dwell 
time based on the Bayesian algorithm (Changjun et al. 2009). 

Assuming that the optical surface error E(x , y) and dwell-time function T(x, y ) 
are both random, the solution to dwell-time function T(x, y ) to maximize the 
posterior probability function P(T|E) of the dwell-time function according to 
the maximum likelihood method (Molina et al. 2001; Fang and Xiao 1998). With 
the Bayesian principle, the relation among the posterior probability function 
P(T\E), the prior probability function P(T), and the probability function P(E\T) 
of the simulated removal error E if T were the true dwell time is 


P(T\E) =P(E\T)P(T)/P(E). 


(27) 


In the IBF process, the ion beam density is the result of statistical average. 
Assume that it is of Poisson distribution and P(E T) follows the Poisson distribution 


with parameter B (g) T. In this case, maximizing Eq. 27 could be transferred into the 
following minimization problem: 


min/i (T) (28) 

where the performance function Jj(T) is 




(B <g)T — Elog(B (g) T))dxdy. 



With calculus of variation, the optimization condition for Eq. 28 can be 
deduced as 



With a multiplicative algorithm, the Bayesian-based iterative algorithm can be 
deduced from Eq. 30: 
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Fig. 19 Scheme of edge extension 



Equation 31 is the generalized form of the traditional Richard son-Lucy algo- 
rithm (Lucy 1974; Richardson 1972) with respect to the nonnormalized removal 
function. This algorithm has an interesting property of nonnegativity: if the first 
estimate T 0 is nonnegative, none of the further estimate will be negative. In dwell- 
time iteration, the initial value is usually the offset nonnegative surface error. With 
this property, the nonnegativity demand of dwell time is satisfied. 

In the beginning of this section, the data process of optical surface edge has been 
simply discussed. About how to solve this problem, it will be discussed. Since the 
optical shape usually is circular or non-regular, the discretized matrix of surface 
error function would not be filled completely, which may make the property of the 
points at the edge of the optical surface differ from that of the points in its inner. 
This difference can induce an algorithm edge effect which affects the convergence 
and calculated accuracy of algorithm at the edge of surface. So, it must be the edge 
extension which can weaken or even eliminate this effect. For example, in a circular 
optical component as shown in Fig. 19, the diameter of the component is D w , and 
the radius of the removal function is B t . The width of the extended rectangular area 
as shown in Fig. 19 is ( D w + 2B t ). With a Gaussian algorithm, the data of any point 
/ in the extended area of the surface error can be expressed as 



E( f) =£(p)exp 


(32) 
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where a is the Gaussian extension parameter, generally o > Bt/3. With the 
extended matrix above, only four fast Fourier transform computations are needed 
in one iteration of Bayesian-based algorithm in Eq. 31. 


Analysis of Correcting Ability of IBF 

In ion beam figuring, the ability to “correct” the surface errors of specific spatial 
wavelength or frequency depends on the ratio of the ion beam diameter d to the 
error wavelength A, i.e., d/A. The evalute of this correcting ability is the material 
removal efficiency e, which is defined as the ratio of the volume of desired material 
removal to the volume of the predicted (or real) material removal. The Kodak had 
estimated that the ratio of the beam diameter to the error wavelength must be < 0.5 
to achieve 90 % of material removal efficiency (Allen and Keim 1989). It is obvious 
that the ion beam diameter is a key parameter in ion beam figuring process. 
In order to find an optimal ion beam diameter, Kodak used four ion beam 
diameters of 2.5, 5.1, 10.2, and 12.7 cm in their simulations to correct the presented 
error of wavelength of 10.5 and 9.5 cm. They found that the best choice is 
the 5.1 cm diameter which indicated an excellent correcting ability, while the 
total dwell time was favorable. Besides Kodak, the IOM in Germany also 
researched the correcting ability of different ion beam diameter. They used a 
smaller diameter down to 0.5 mm to meet the demanding requirements for 
correcting the long spatial wavelength part of the so-called mid-spatial-frequency 
roughness (MSFR) down to the sub-nanometer RMS level (Haensel et al. 2006). 
Further work has been done by Lin Zhou et al. in China (Lin et al. 2008, 2009). 
Assume that Eq. 25 is normalized as 


R(x) 


B(x — x')T(x')dx ' . 


(33) 


And assume that the optical surface has the spatial frequency error R 


A 


Ra = S a [ sin (2 n- \ + 1 


(34) 


where A is the wavelength of the surface error and is the amplitude of the error. 
The circular Gaussian-shaped ion beam removal function is 


B(x) 


1 


X 


V2 


e 2 « 2 . 


(35) 


no 


Combining Eqs. 33-35, the dwell-time function can be calculated as 
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Since the dwell-time function distribution must be nonnegative, the actual dwell- 
time function distribution is 



Based on Eqs. 33 and 37, the actual material removed can be calculated as 


R a {x) = R A + sdA* 


1 


(38) 


Equation 38 shows that in any ion beam figuring process, there are always extra 
materials removed. The extra removal material y(x) is 


r(x) = R a (x) - R A {x) = S A (V("i) 2 - l) (39) 

where d = 6o is the ion beam diameter. Equation 39 shows that the extra removal 
material y(x) is independent of x; therefore, y(x) can be shortly written as y. 

The material removal efficiency £ can be described as 


< 5/1 -di.(d\ 2 

£ = e 18 w . 

Equation 40 shows that the material removal efficiency £ is a negative exponen- 
tial function of d// 1. For a known spatial frequency error with wavelength X, the 
more the beam diameter is, the less the material removal efficiency. For a given 
beam diameter, the smaller the wavelength is, the less the material removal 
efficiency. In this sense, the smaller ion beam diameter is always a best choice in 
ion beam figuring. However smaller ion beam diameter always result in more 
process time, which usually decreases the process reliability. To balance the 
material removal efficiency and process stability, a suitable ion beam diameter 
should be chosen. 

Simulating the material removal efficiency 8 at different conditions of d/X 
and comparing the results with the theoretical results achieved by Eq. 40, based 
on the result shown in Fig. 20, it can be seen that the simulation-predicted 
material removal efficiencies are in satisfactory agreement with the theoretical 
ones. From Fig. 20, it can be seen that the material removal efficiency is 87 % 
when d/X is 0.5. This value approximately corresponds to the Allen’s estimated 
material removal efficiency, 90 % (Allen and Keim 1989), which is acceptable in 
ion beam figuring process. When the ratio of d/X is up to 1, the e rapidly decreases 
to 58 % which is unacceptable, and it extremely degrades to only 11 % when the 
ratio is up to 2. 

According to theoretical analysis and simulation results, it is recommended that 
the ratio of d/X should be less than 0.5 in order to obtain acceptable material 
removal efficiency in ion beam figuring process. 
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Fig. 20 Comparison of 
theoretical material removal 
efficiencies with the 
simulation ones 



d/X 


The Optimum Material Removal of IBF 

It has been known that if the less desired material removal is specified, the 
calculated dwell function is consequently small for a given initial surface figure 
error, which means a short IBF process time, but as a result, the actual post- 
machined residual surface figure error is great. If a larger desired material removal 
is specified, although the resulted residual figure error is smaller, the calculated 
dwell function is greater, which means a longer process time. 

Therefore, in order to balance the process time, which is determined by the dwell 
function and the resulted residual error, the desired material removal specified to 
the contour algorithms should be considered and optimized. In this section, it will 
be discussed how to determine an optimum material removal. Firstly the conven- 
tional method to determine the removal is discussed in Lin Zhou et al. (2010). 

Since the real removal in an IBF process is always nonnegative, the specified 
removal should be nonnegative too. However, the data of a surface figure error from 
metrology usually contain negative elements. Therefore, in order to obtain a 
nonnegative removal, the conventional method is that the error data is simply offset 
to be nonnegative: 


R = E ~ m i n f E) (41) 

where R is the desired material removal and E is the surface figure error and min(E) 
is the minimum of E. This method is illustrated in Fig. 21. Due to the original 
surface figure error from metrology that inevitably contains noises, E should be 
smoothed to reduce the influence of noises on the magnitude of min(E). With a 
smoothed figure error, the removal determined by Eq. 41 is reasonable. 

However, for ultraprecision optics, in order to keep more details or higher- 
spatial-frequency information about the figure error, the original figure error from 
metrology is often used instead of the smoothed figure error. For original errors, 
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Fig. 21 Illustration of conventional method to determine specified removal, (a) Error profile from 
metrology; (b) determined specified removal R (R> 0) 


Fig. 22 Calculated result of 
error profile with edge fall, 
(a) Error profile from 
metrology; (b) determined 
specified removal R (R>0) 



Fig. 23 Calculated result of 
error profile with pits, (a) 
Error profile from metrology; 
(b) determined specified 
removal R (R>0) 




since they contain more noises, and more edge fall and pits, as shown in Figs. 22 
and 23, the material removal determined by Eq. 41 are not reasonable and tend to be 
greater, consequently causing longer IBF process time. 

It is concluded that, in the conventional method, the material removal is the sum 
of the figure error and an invariable uniform material removal. To avoid the 
drawback of the conventional method, Lin Zhou et al. proposed an optimum 
material removal method which determines the material removal flexibly (Lin 
Zhou et al. 2010). In this method, an adjustable uniform removal U is introduced 
to substitute the invariable [— min(Zi)]. The formula is 

(£+£/, E+U>0 

\0, E+U< 0' 

In Eq. 42, the adjustable uniform removal U can be broken into two parts, i.e., 
U = ye, where e is the RMS value of the figure error E and y is an adjustable 
parameter to be used to control the magnitude of the material removal. Since 
e describes the mean deviations of the figure error, it contains the main information 
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Fig. 24 Typical figure- 
prediction curve (process 
time vs. RMS value of 
residual figure error) 



Residual error (rms) /nm 


of the figure error. Therefore, the adjustable parameter y is just a simple factor, and 
for different figure error, the optimum y values experientially tend to be in the same 
range from 1 to 4. 

Since the iterative dwell-time function algorithm is simple and has met with 
considerable success in practice, this problem is discussed by one iterative algo- 
rithm expressed as (Drueding et al. 1995) 


T n + 1 — T n + £E n , E n — R — B * T n (43) 

where T n and E n are the dwell time and the residual figure error after n computation 
iterations, respectively, f is the relaxation factor, and B is the beam removal 
function. The initial T 0 is often set to a proportion of the specified removal, usually 
T 0 = R/B 0 , where B 0 is the integration of B. After several computation iterations, 
the final dwell time and the residual error can be obtained. 

For different y value, the material removal is differently determined by Eq. 42, 
and consequently the results, which are calculated in Eq. 43, including dwell time 
and residual figure error, are different too. 

A y curve in a coordinate system of process time (the sum of all dwell time, total 
dwell time) vs. RMS value of the residual error is shown in Fig. 24, which is called a 
figuring prediction curve which is usually a monotonically decreasing curve. In this 
curve, it can be seen that a larger material removal causes a smaller residual error, 
but consumes a longer process time, and a smaller material removal causes a shorter 
process time, but induce a larger residual error. This implies it is difficult to 
determine an optimum material removal for both small residual figure error and 
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short process time. However, fortunately, a figuring prediction curve is often in the 
shape of the letter “L.” This property is useful to determine the optimum material 
removal. The optimum material removal should be located at the comer of the 
letter “L,” since for removals smaller than this point, the residual error 
decreases rapidly while the process time increases slowly and, for removals 
greater than this point, the process time increases rapidly without much decrease 
in residual error. 

In addition, the figuring prediction curve can be used to predict the process 
time and the residual figure error of an IBF process. Since the figuring prediction 
curve is in the shape of the letter “L,” there exist two significant RMS, the comer’s 
RMS e c and the smallest RMS e L \ both are illustrated in Fig. 24. For an initial 
optical surface, if the desired residual error is represented by e D RMS, then an IBF 
process can be classified into one of the following three cases according to the 
magnitude of e D \ 

1 . eo < ei 

In this case, the figuring prediction curve indicates that the desired accuracy 
cannot be obtained. This implies that the initial figure error is too great to be 
removed. Therefore the optical component should be returned to the 
prepolishing process for a smaller initial error. 

2. Cl < eo < ec 

In this case, although the figure -prediction curve indicates that the desired 
accuracy can be obtained, the process time is too long to be practicable. 
Therefore, the optical component should also be returned to the prepolishing 
process for a smaller initial error. 

3 . eo > ec 

In this case, the figuring prediction curve indicates that the desired accuracy 
can be obtained in a practicable process time and the optimum removal at the 
comer is preferred. Moreover, in this case, if the desired RMS e D is significantly 
greater than the comer’s RMS c c , to reduce the process time, a smaller removal 
may be chosen instead of the corner’s removal. 

Although the property of the “L” shape about the figure -prediction curve is 
found out in the iterative contour algorithm, it also exists in other contour algo- 
rithms, such as the matrix algebraic algorithm. Therefore, this method to determine 
the optimum material removal can be used in other contour algorithms. 

A suggestion should be given that a smaller material removal is advisable. 
According to simulations, a process with larger removal usually induces a smaller 
residual figure error. However, in a real process, the resulted residual figure error 
from a process with larger removal is often significantly greater than the prediction. 
In addition, some experiments have indicated that a process with more removal will 
likely degrade surface roughness. Therefore, for an IBF process, a smaller material 
removal is advisable. 
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Realization of IBF Technique 

This section describes the operational steps to realize IBF technique. The procedure 
used in the process is outlined below: 

1 . Calculating the removal map 

At the beginning of process, an interferometric map of the optical compo- 
nent’s surface height data (or called surface map) is loaded in an array. For 
aspherical optical surface, this surface map should be calibrated according to the 
curvature of aspherical surface since the distortion of measure data project. The 
removal map is equal to the surface map minus the component’s desired surface 
map. This removal map generally needs to be filtered by Zernike polynomial to 
avoid influence of the measure noise or high- spatial-frequency error. With this 
way, it makes the IBF machine move smoothly. Further work in this step is the 
removal map data extension along the component edge. 

2. Selecting beam removal function 

It is the most important step to select a suitable beam removal function in 
an IBF process. It has been known that there are two key parameters for a 
Gaussian beam removal function, which are beam diameter d evaluated by the 
full width at half maximum (FWHM) or 6a and the peak material removal rate 
(or volume removal rate), which is mainly controlled by the ion beam voltage 
and current. About how to gain a suitable beam diameter, it will be discussed 
below. 

There are two methods to change the diameter of ion beam. The direct method is 
to decrease the scale of screen grid of the ion source shown in Fig. 25a. By this way, 
there is a ratio between the diameter of screen grid and the diameter of ion source 
inner chamber. This ratio is about 1:4. For example, in a 90 mm ion source, the 
minimal diameter of screen grid is about 22.5 mm. Based on the ratio, the better 
practical selectable range of screen grid is 25-90 mm for 90 mm ion source, 
15-50 mm for 50 mm ion source, 10-30 mm for 30 mm ion source, and so on. In 
these ranges, it can gain satisfied material removal rate universally. Otherwise, its 
peak material removal rate and volume material removal rate are rapidly reduced as 
the diameter of ion beam decreases. For example, when the diameter of ion beam 
decreases from 25 mm to 15 mm for 90 mm ion source, its peak material removal 
rate is rapidly reduced from 0.13 pm/min to 0.025 pm/min, and its volume material 

Q Q Q Q 

removal rate is also reduced from 16.6 x 10 mm /min to 1.67 x 10 mm /min 
from 0.13 pm/min to 0.025 pm/min (Xie Xuhui et al. 2009). Because the material 
removal rate decreases rapidly as the diameter of screen grid decreases, it is difficult 
to gain very small ion beam by this method. Currently, with this method, the 
smallest stable ion beam is about 4-5 mm(FWHM) by 3 cm ion beam in IBF. To 
gain more small ion beam, another method is to put an ion diaphragm (or mask) 
before the ion beam outlet of the ion source which passes only part of the ion beam 
through the ion diaphragm as shown in Fig. 25b. Compared with the direct method, 
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Fig. 25 The methods to gain the suitable diameter of ion beam 


this method is a secondary way which can gain about 0.5-4 mm Gaussian-type ion 
beam (FWHM) with good peak material removal rate. For example, by this way, it 
is able to gain 1.7 mm diameter (FWHM) of ion beam with a 2 mm diameter of 
ion diaphragm with peak material removal rate about 200 nm/min (5 cm ion source 
with 40 mA ion beam current, 1,000 eV ion beam voltage). And the ion 
beam longtime stability is better than 2 %/3 h. So, it is a conclusion that ion 
diaphragm is an efficient method to gain more small ion beam which has a 
good performance to improve IBF figuring ability. Said to this, there is a problem 
would be paid attention that a match relationship between the diameter of screen 
grid and the diameter of ion diaphragm is also existed to get full Gaussian-type 
material removal function. 

Based on the above discussion, the following conclusions can be drawn: 

1 . Different beam removal function can be gained by selecting suitable screen grid 
and ion beam power parameters. 

2. Ion diaphragm is a good secondary method to improve IBF figuring ability. 

3. In IBF process, it would select different screen grid and ion diaphragm to control 
full spatial frequency figuring error. 


3. Calculating and Realizing the dwell-time function 

About how to calculate dwell-time function, it is has been discussed in detail in 
the above section. Here, it is discussed how to realize the dwell time which may be 
seen in the continuation of the section “Contouring Algorithm for IBF.” 

Continuing the Bayesian-based algorithm discussion in the section “The Typical 
Features and Its Purpose of IBF,” assuming that the desired dwell-time function is 
T(x, y), the discrete intervals are Sx , Sy along the x and y direction, respectively. 
In the figuring process, the ion beam scans continuously in the x direction and 
raster moves in the y direction. Omitting the transient (acceleration and deceleration) 
of the translation system, the velocity distribution in the x direction can be 
computed as 
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Fig. 26 Diagram for analysis 
of realization error with 
approximated velocity 



V = — = 1 (44) 

T(x,y)S x S y T(x,y)S y ' K ’ 

The variation of the dwell time along the x direction introduces the transient and 
induces a realization error of dwell time with velocity given by Eq. 44. As shown in 
Fig. 26, the relative realization error of dwell time is 


St _ (d TsAx,y )) 2 _ S x (d x T ) 2 

TS x S y 2 aT\x,y)S x S 2 y 2 aT A S 2 y 



where a is the acceleration of the translation system. Equation 45 indicates that the 
relative realization error is inversely proportional to the acceleration a and to the 
square of intermittent increment Sy and is proportional to ( d x T ) 2 . 

Smoothing Eq. 45, it indicates that it can reduce the realization error. In order to 
smooth dwell time and to de-noise, total variation norm / 2 (E) is introduced, which 
is gradient based: 


MT)=fi 


VTldxdy 





where p is a weight factor. Adding Eq. 29 to Eq. 46, then the minimization problem 
is transferred as 


mm (/ 1 +/ 2 ). 


(47) 


With calculus of variation and multiplicative algorithm, the modified Bayesian- 
based algorithm for dwell time can be deduced as 



In the iterative process with Eq. 48, a small weight factor p cannot smooth the 
dwell time and cannot filter the noise. On the other hand, a large eight factor can 
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Fig. 27 The raster scanning 
of IBF 




make dwell time so smooth as to reduce the precision of dwell-time density 
function. There is a trade-off. 

Another key problem in the calculation of dwell time is the raster pitch 
(interval). In the raster scanning of ion beam figuring as shown in Fig. 27, the 
x direction is the scanning direction and the y direction is the raster direction. In the 
actual figuring process, the “tool trace” can be seen as raster pitch increasing as 
shown in Fig. 28, which will generate the middle- or high-spatial-frequency resid- 
ual surface errors on the optical surface. In Fig. 28, the ion beam removal function 
has 5 mm diameter. 

How is a suitable raster pitch to control its figuring residual errors selected? 
Answering this question, the figuring precision and its efficiency (or figuring time) 
are needed in actual process firstly. According to the Nyquist sample theorem, the 
pitch spatial frequency should be at least twice larger than the spatial cutoff 
frequency of the removal function. Assuming the axial symmetric removal Gauss- 
ian function has W width (6a), its cutoff frequency will be 



So the raster pitch should satisfy 





As known in Eq. 49, the small ion beam needs small pitch to satisfy the sample 
theorem. But in the actual process, an optimized pitch should be considered to 
match the precision and efficiency comprehensively. By computer simulation of the 
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Fig. 28 Simulated residual machining errors for different raster pitches, (a) 5 mm raster pitch, 
(b) 2 mm raster pitch, (c) 1 mm raster pitch 


Fig. 29 Simulation of pitch 
vs. its machining error for 
4>100 mm planar optics 



pitches and its related figuring errors from small to large pitches, a curve of the 
pitch vs. its figuring error can be drawn. An example shown in Fig. 29, the “dot” 
curve is for 5 mm diameter of ion beam and “square” one is for 10.6 mm diameter 
of ion beam. It can be seen that each curve has one turning point - 1 mm pitch for 
5 mm diameter of ion beam and 1.8 mm pitch for 10.6 mm diameter of ion beam, 
respectively. The turning point means that the change of relative figuring error (the 
ratio of actual figuring error to its desired error) is slower below this point, while 
faster above it. For example, of the “dot” curve in Fig. 29, when the pitches are 
smaller than 1 mm, their relative figuring errors are smaller than 10 %. When they 
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are larger than 1 mm, their relative figuring errors are increased rapidly. So the 
result that the 1 mm pitch would be selected as the raster pitch for 5 mm diameter of 
ion beam is obtained, which makes good trade-off between the relative figuring 
error and the figuring efficiency. With the above analysis, it is known that a curve of 
pitch vs. its figuring error can be set up to help in selecting the optimal pitch (Xuhui 
et al. 2011). 


The Challenges of IBF Technical Development 
High-Gradient Optical Surface Figuring by IBF 

In the above section, the measure data project distortion has been discussed. For 
high-gradient optical surface, this problem must be considered seriously when the 
surface error E{x , y, z ) is projected from three-dimensional (3D) Cartesian frame to 
2D frame, resulting in E{x , y) for the deconvolution operation. Furthermore, in 
order to maintain a constant removal function at every dwell point, the ion beam is 
usually held perpendicular to the surfaces. This means at least five axes and larger 
workspace is required. If wanting to figure high-slope surfaces with a linear three- 
axis machine, which is definitely economical and more reliable compared with a 
five-axis system, some problems should be solved first. Thomas Haensel 
et al. (2008) and Dai Yifan et al. (2010) have discussed to figure strongly curved 
surfaces with a linear three-axis system in IBF. 

In the section “Removal Function Modeling and Analyzing of IBF,” the chang- 
ing rule of the removal function has been discussed with the incident angle based on 
theoretical and experimental investigation of the removal characteristics, which is 
used to figure high-gradient optical surface. 

As shown in Fig. 30, a Gaussian ion beam bombards a high-slope surface 
z! = h(x ', /) parallel to the optical axis. The actual removal rate at point B r (x',y') 
in the action region is R eij when the beam dwells at point A f (x f , /) with a removal 
function R(x , y). Since the dwell time is calculated in 2D plane (OXY), the nominal 
removal rate at B(x, y) is R ^ , which is obviously unequal to the nominal one R ei j. 
The normalized removal rate is 



where 6 is the incident angle. R 0l j and R,/ are defined as 




So the actual removal rate becomes R ei j = K ij R l j. By compensating the change of 
coefficient K ip according to the removal characteristics in IBF process, the original 
program developed for figuring flat surfaces is still applicable to high-slope 
surfaces. 
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Fig. 30 Ion beam figuring of 
a high-slope surface along the 
optical axis 
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Substituting the normalized removal rate K t j into Eq. 19, which is the theoretical 
model of the removal function, it can be seen that the function at arbitrary incident 
angle can be gained when the removal function with perpendicular incident is 
obtained through experiments: 

R(x,y) = K 0 R(/(x cos 0,y). (52) 

Based on Eq. 52, the curve of normalized peak removal rate is shown in Fig. 31 
with the energy dispersion parameters determined. It can be verified through 
experiments with the ion beam scanning linearly and etching a sample along one 
of its generatrix. For example, the target distance is fixed and the constant scanning 
velocity is 1 mm/min. The aperture of the target surface is 21.3 mm, and the radius 
of curvature is 16 mm, which indicates the maximal incident angle is 41.7°. The 
actual material removal is shown in Fig. 32. For the purpose of comparison, 
the experimental curve of the removal rate is drawn in Fig. 31. It is consistent 
with the theoretical curve. Therefore it is reasonable to apply the theoretical model 
instead of a series of experiments to get the removal rate for high-slope samples of 
various apertures and various curvatures (Dai et al. 2010). A figuring experiment is 
done to testify this method with 5.7 mm (6a) diameter ion beam on a linear 
three-axis IBF machine. 

Figure 33a shows the original surface error map before figuring (101.9 nm PV 
and 13.1 nm RMS). According to the theoretical analysis and experiment, the 
distribution of the normalized removal rate K iy - is given in Fig. 33b. 

The first iteration aims to confirm the positioning precision and remove surface 
protuberances. It takes 4.2 min to reduce the figure error to 99.8 nm PV and 8.5 nm 
RMS with RMS convergence ratio 1.55. The residual error map shown in Fig. 34a 


36 Ion Beam Figuring Technology 


1379 


Fig. 31 Theoretical and 
experimental curves of the 
removal rate 
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Fig. 32 The removal result 
by linearly scanning a 
spherical surface 
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indicates that the local protuberances at central and marginal regions have been 
removed, and the RMS error is reduced evidently. In this iteration, the ion beam 
positioning problem is met. Xuhui et al. (2011) have discussed how to solve it. Then 
in the next iteration, it takes only 4.8 min to reduce the figure error to 44.3 nm PV 
and 5.9 nm RMS, with RMS convergence ratio 1.44, comparable to that in flat 
surface figuring. The final figure error map is shown in Fig. 34b. 

The total time consumed is 9 min, and the total convergence ratio reaches 2.24. 
Moreover, the figure error at the marginal region is successfully corrected without 
edge effect. This experiment proves the proposed method is excellent for figuring of 
high-slope surfaces. 

Based on figure error compensation, the influence of varying removal function 
and projection distortion on the dwell-time solution is reduced when figuring a 
high-slope surface. Hence the five-axis figuring machine can be replaced by a three- 
axis one with smaller workspace. And the program for figuring flat surfaces still 
works in high-slope cases. The limitation is that the maximal incident angle is 
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Fig. 33 The surface error, (a) The original figure error, (b) The distribution of normalized peak 
removal rate, (c) The compensated figure error 
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required within about 60°. For higher-slope surfaces, the uniform gridding partition 
of surface error in 2D plane would lead to loss of local details including some high- 
frequency errors. To solve this problem, it will be with a smaller ion beam and 
subregion stitching method. 


High Thermal Expansion and Crystal Optics Figuring by IBF 

Unlike conventional methods, ion beam figuring process must be in the vacuum 
environment and generates high surface temperature on optical component because 
some part of the energy which is not transferred to component atom momentum 
heats the component. This section mainly talks about the thermal effects on optical 
component in the IBF process. 

To know the temperature effect, the direct method is to measure the surface 
temperature by thermo-sensor, such as thermocouples, infrared camera, etc. 
(Gailly et al. 1999; Xuhui et al. 2012). The aim to discuss the temperature effect 
is to solve its influence to the optical component, especially to the high thermal 
expansion and crystal optics. The better method to solve this problem is to 
construct a thermal model to estimate the surface temperature (Xuhui et al. 2012; 
Nelson 2010). 

For the thermal modeling of component, something should be known about the 
ion beam, such as ion beam power density distribution, ion beam power reached to 
component, absorbed energy (heat) by component, and so on. Only known to these 
problems, a useful thermal model may be set up to correctly estimate the compo- 
nent temperature and its distribution. 

In ion beam, ion (A r +) proportion is about 75-90 %; the other 25-10 % is the 
neutron- atom. When the ion beam runs from the outlet of ion source to the surface 
of component, its power will be lose partly because of resonance charge exchanging 
between the ions and neutron- atoms. So, the ion beam power P' that reached to the 
surface of the component is 


P' = tjIU = r\P (53) 

where / is the ion beam electro -current, U is its voltage, P is the ion beam power 
generated by ion source, and r\ is the correcting factor of ion beam. Based on the ion 
beam correcting theory (Meinel et al. 1965), this correcting factor mainly lies on the 
section area of resonance charge exchange and ion velocity 

kTd 

a/2 P o^rce 

where e , E h and M t are the ion charge, ion energy, and ion mass, respectively; a and 
b are the ion constants; k is the Boltzmann constant; P 0 is the pressure of vacuum; 
d is the distance of ion source outlet to the surface of component; and <t rce is the 
section of resonance charge exchange: 
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0RCE 







The ion velocity is 



Therefore, the ion beam power P f that reached to the surface of component can 
be gained theoretically by Eqs. 53-56. 

The power that reached to the surface component is divided into two parts. One 
is so-called sputter power which is transferred to component atom momentum to 
make the atom escape the surface of component. The other is the main part 
absorbed by the component transferred heat to make the component surface tem- 
perature increase. 

The absorbed heat may be calculated according to the rate of the component 
temperature increased: 


P i = mc ,, t- (57) 

where is the rate of the component temperature increased and m and c p are the 

component mass and thermal capacity, respectively. It is therefore evident that the 
absorbed power would be estimated if it gained the rate of the component temper- 
ature increased. Another method to estimate the absorbed power is simulation based 
on the Monte Carlo method in the SRIM software which is an ion sputter simulation 
software (Shengyi and Xuhui 2010). With this method, it is also able to simulate the 
absorbed power P 1 : 



where Q is the absorbed energy and Qi is the sputter energy simulated. These two 
methods can validate each other to guarantee the model efficiency. 

When the ion beam bombards the surface of component, a temperature gradient 
field is formed which may generate thermal stress. Here, a heat transfer model is 
constructed to analyze the temperature gradient distribution. Assuming that the 
radiated energy of ion source would be ignored, the component heat source is only 
ion beam which is a Gaussian function distribution as described in Fig. 5. The 
absorbed power may be described as 



D 



where r is the distance from the center of the Gaussian function as shown in Fig. 35. 
Equation 59 shows that the absorbed power is also a Gaussian distribution which 
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Fig. 35 Ion beam bombards 
component configuration 




Ion beam 

source 




may generate nonuniformed thermal stress on the component. It is important to 
construct a suitable heat transfer model to analyze the thermal stress of component 
which is the key step to analyze and control the component thermal stress magni- 
tude and the stress distribution. 

As shown in Fig. 35, assuming that the ion beam bombards the center of the 
component along Z direction, the component heat transfer equation based on the 
Fourier heat transfer formulated in the cylinder coordinate frame is 

dT r d 2 T 1 dT d 2 T 

^ p dt dr 2 + r dr dz 2 

where p , c p and k are the material density, thermal capacity, and thermal conduc- 
tivity, respectively. T is the interior temperature field distribution which is the 

_A 

function of radial coordinate r, axial coordinate z, and heated time t. Ae is the 
component area power density of ion beam. 

Using the above temperature model, it can be analyzed and modeled that the 
temperature field and its temperature stress field of actual optical component 
figuring by ion beam. The experimental parameters are ion beam voltage 700 eV 
and its electro -current 60 mA, the diameter of ion beam 10 mm with ion diaphragm, 

y 

and the vacuum 2.7 x 10 “ Pa. The BK7 experimental component is 40 mm 
with thickness 10 mm. The reason to select a small component is that it is 
convenient to simulate the machining process. The actual component surface 
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Fig. 36 Optical component machining By IBF 


contour is shown in Fig. 36a, and its figured scanning is raster with 2 mm raster 
pitch shown in Fig. 36b. 

Based on the data of actual surface contour, a simulation is done to get the 
temperature field and its temperature stress field by the above temperature model. 
The simulated results are shown in Fig. 37. In this figure, it can be seen that the 
machined component temperature field and its corresponding stress field are not 
uniform. It is known that the high-temperature gradient field on these materials will 
generate large thermal stress in the component, which apparently distorts its 
surface, and when the thermal stress is larger than the material mechanic stress 
limit of component, the component generates crack or break. The reasons to 
generate thermal stress are the rapid change of the component’s temperature and 
high-temperature gradient. So, the troublesome and important problem is to select a 
suitable machining technique to control the temperature change rate and make 
temperature distribution uniformity in the IBF process. 

A suitable machining technique method to make the component’s temperature 
more even is to select a reasonable figuring method which can reduce the temper- 
ature gradient distribution, such as by designing a low-pass spatial frequency 
filter to process the component surface data measured by Zygo interferometer to 
only hold the component’s low-frequency part and its middle- and high-frequency 
parts to be filtered. By this way, computer simulation results about the related 
temperature field and its thermal stress field shown in Fig. 38. Comparing Figs. 37 
and 38, it is seen that the filtered IBF can gain the more even temperature field and 
thermal stress distribution. A final simulated result shows that the component 
maximum temperature is decreased about 10 % and its thermal stress is decreased 
about 22 %. 
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Fig. 37 Temperature field and its thermal stress field by IBF 
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Fig. 38 Temperature field and its thermal stress field by filtered IBF 
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Fig. 39 Filtered ion beam figuring flow 


Based on the above analysis, an ion beam figuring flow is set up for high thermal 
expansion and crystal optics as shown in Fig. 39. 


Supersmooth Surface Figuring and Micro-roughness Evolution 

The supersmooth surfaces of optical component is very precisely figured plane, 
spherical, and aspherical surfaces with the accuracies in depth down to the 
sub-nanometer level over the entire spatial wavelength range. The kinds of optics 
are very important for advanced DUVL, EUVL, and synchrotron. According to the 
Marechal condition, the wave front of a diffraction-limited imaging system must 
achieve a deviation of A/14 in the exit pupil (k = operating wavelength). However, 
requirements for a lithographic system are even more demanding. For nowadays 
193 nm and 13.5 nm systems, the expected residual wave front error of each optical 
element amounts to at least A/20 or below: 0.20-0.28 nm RMS and 0.11-0.20 nm 
RMS, respectively (Bruning 2007). Similar or even smaller values hold for spatial 
frequencies higher than 1/1 mm as outlined elsewhere. The roughness of EUVL 
substrates is described with two different areas of spatial frequency: 

MSFR = mid-spatial-frequency roughness between 1/mm and 1/pm and 
HSFR = high-spatial-frequency roughness ranging from 1/pm to 50/pm. 

It is proved that the ion beam figuring is a very useful tool to figure supersmooth 
optical surface. For example, supersmooth optical surface figure by IBF in Carl Zeiss, 
serial sphere with a diameter of 178 mm, rms = 0. 13 nm, serial asphere with a diameter 
of 260 mm, rms = 0.19 nm (Weiser 2009). In China, our research group has also 
figured supersmooth optical surface shown in Fig. 40 using our designed IBF machine. 
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Fig. 40 Ion beam figuring results in NUDT 


With ion beam to figure optical component, the micro-roughness is a regardful 
problem. Ion beam sputtering or ion beam erosion of surfaces can generate a 
diversity of surface topographies. Typically, during ion beam sputtering, the sur- 
face of the solid is far from equilibrium and a variety of atomistic surface processes 
and mechanisms become effective. It is the complex interplay of these processes 
that either tends to roughen (e.g., by curvature-dependent sputtering) or smoothen 
(e.g., by surface diffusion or viscous flow of surface atoms) the surface, which, 
finally, can result in the spontaneous formation of patterns (Frost et al. 2009). 


Summary 

In this chapter, it is known that IBF has five features and advantages: (1) high 
figuring precision, (2) highly predicable or stable, (3) noncontact figuring, (4) good 
material removal function, and (5) no or minimized support structure print effect. 
These make it to be an ideal machining method to figure ultraprecision optical 
component. Likely the conventional CCOS method, material removal function 
modeling, contouring algorithm, and realization of IBF process are its key 
technologies. 

However, it has some shortcomings, such as working in vacuum chamber, 
component heating, difficulty of improving surface roughness because of the ion 
“sandblasting” effect at the atomic level, slow material removal rate, etc. 

So, using IBF to machining optical component, the following problems would be 
deeply thought about: 

1 . The component material characteristics, if it is suitable to machining by IBF. 

2. As one of the controllable compliant tools, it is well to adapt different asphere 
shape to improve its contour accuracy, but it is difficult to control its middle- and 
high- spatial-frequency error. 

3. The ion “sandblasting” effect at the atomic level may make the surface rough- 
ness worse, so the ion beam parameter, total machining time or machining cycle, 
total removed material volume, etc., would be suitably selected. 
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Abstract 

Focused ion beam has become an increasingly popular tool for the manufacturing 
of various types of micro-/nanostructures and devices for different applications. 
In this chapter, the recent developments of the FIB technologies in the micro/nano 
manufacturing are presented in details. FIB technologies mainly involve four 
main approaches: imaging, milling, ion-induced deposition, and implantation. 
The working principle and key techniques underlying the four approaches are 
introduced with an emphasis on their abilities in micro-/nanofabrications. The 
application fields involving using the FIB micro -/nanofabrication technologies are 
also presented, such as micro optical elements, plasmonic lenses, etc. Concluding 
remarks and outlook of the future research on the FIB technologies in micro/nano 
manufacturing are provided at the end of the chapter. 


Introduction 

Focused ion beam (FIB) technology has unique advantages in comparison with other 
micro/nano manufacturing technologies such as high resolution and flexibility, 
maskless processing, and rapid prototyping (Reyntjens and Puers 2001; Volkert 
and Minor 2007). FIB can be considered as a compound platform combining a 
cutting tool with a beam diameter of tens nanometers and with a nanometer’s imaging 
resolution microscope. Uniquely, FIB is capable of milling substrate materials and 
adding materials at predefined locations with a high resolution. Versatile materials, 
such as diamond, metal, thin film, polymer, and glass can be processed by FIB. FIB 
technology has become one of the key approaches in micro-/nanofabrication for 
various applications, including, but not limited to, nano-optics, MEMS, nano- 
manufacturing, and nanotechnology. 

In FIB systems, the ion beam ionized and emitted from liquid-metal ion sources 
(LMIS) is accelerated, collimated, focused, and finally injected into the target with 
energies from several to dozens of kilo-electron volts. Then, a cascade collision 
occurs, in which energy and momentum are transferred from the incident ion to the 
target particles (atomic nucleus or electrons), and these collisions result in a series 
of effects introduced in the following. 

The most important physical effects of energetic ions on the substrate mainly 
include displacement of atoms and ion deposition in the sample (induced damage 
and ion implantation), electron emission (an effect realizing imaging), sputtering of 
neutral and ionized substrate atoms (an effect realizing substrate milling), and 
chemical interactions of breaking chemical bonds and thereby dissociating mole- 
cules (this effect is exploited in the ion-induced deposition), as shown in Fig. 1 
(Reyntjens and Puers 2001; Volkert and Minor 2007). The FIB four core techniques 
of imaging, direct writing, induced deposition, and implantation would be intro- 
duced in the following sections. 

Stopping and Range of Ions in Matter (SRIM), a group of computer programs 
which is employed to compute the interaction of ions with matter (Ziegler 2004), 
has been widely adopted in the FIB process investigations. As a core 
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Fig. 1 Principle of ion beam machining, (a) Illustration of a collision cascade generated by a 
30 keV Ga ion incident on a crystal lattice (Volkert and Minor 2007). Working principle of FIB (b) 
milling and (c) induced deposition (Reyntjens and Puers 2001) 


program of SRIM, the Transport of Ions in Matter (TRIM) program involves a 
Monte Carlo calculation which follows the ion into the target with about 
10 eV ~2 GeV ion energy, making detailed calculations of the energy transferred 
to every target atom collision. TRIM can calculate both the ions’ final 3D 
distribution and also all the kinetic phenomena associated with the ion’s energy 
loss: target damage, sputtering, ionization, and phonon production. Figure 2 
shows the TRIM Monte Carlo simulation results of 30 keV Ga ions interacting 
with the silicon substrate. 


FIB Imaging 

FIB imaging is mainly based on detecting the ion-induced secondary electrons 
(ISEs). Generally, 1-10 electrons with energies below 10 eV are generated by one 
injecting 5-30 keV gallium ion (Volkert and Minor 2007). Although ion beams are 
not as finely focused as electron beams, the ISE generation contrast mechanisms in 
FIB imaging are different from the one for SE generation in SEM imaging. The 
different contrast mechanisms for ISE formation are illustrated in Fig. 3, where 
crystal orientation, atomic mass, and surface geometry would influence the ISE 
generations (Volkert and Minor 2007). 
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Fig. 2 TRIM Monte Carlo simulation results of number of 100 Ga ions with 30 keV interacting 
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Fig. 3 Illustration showing the influence of (a) and (b) crystal orientation, (c) atomic mass, and 
(d) surface geometry on FIB Ga + collision cascades and ion-induced secondary electron (ISE) 
image contrast formation. Similar concepts influence the FIB sputtering yields. The orange atoms 
in (c) are more massive than the yellow atoms in (a), (b), and (d) (Volkert and Minor 2007) 


The SE and ISE images of the brass cross-section images show contrast due to 
material differences and surface topography. However, the ISE imaging shows 
stronger channeling contrast from crystals than the SE imaging. Because crystal 
contrast changes with the ion beam incidence angle, where the FIB penetration 
depth’s variation would influence the secondary electron emission rate, the contrast 
due to crystal orientation in ISE image can be easily distinguished while SE image 
cannot (Volkert and Minor 2007; Kempshall et al. 2001). FIB imaging can offer 
complementary information about a sample surface, which can be effectively 
applied in the IC chips’ analysis, alloy material science, etc. Figure 4 shows the 
SE and ISE images of a German silver substrate. 
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Fig. 4 Comparison of the secondary electron images of German silver material acquired by 
electron beam and ion beam, (a) Secondary electron (SE) and (b) ion-induced secondary electron 
(ISE) images of German silver 


FIB Sputtering 

Bitmap Patterning Method and Features 

One of the key issues in the FIB micro-/nano-manufacturing technology is to 
operate an FIB with optimized process parameters such as ion beam size, shape, 
and dwell time for removing a specified volume of material from a predefined 
localized area. Bitmap patterning method has been widely applied to fabricate 
versatile micro structure/nano structure patterns using FIB (Fang et al. 2010a; 
Fu et al. 2000). 

Bitmap Patterning Method 

The FIB patterning can be realized by importing a bitmap file as a pattern, in which 
the dwell time for each pixel of the ion beam location is normalized to the color 
values of the bitmap (Fang et al. 2010a). A bitmap file should be saved as 24 bits 
bitmap files. Each pixel consists of a red, green, and blue component (RGB). The 
green component determines whether the beam is blank or not. Any other values 
larger than 0 would open the ion beam. Blue component determines the dwell time 
per pixel, which can be set from 0 to 255. The dwell time corresponding to the 
pixels is linearly interpolated on the basis of the color values between the dwell 
time of zero and the user-defined maximum value. As a result, the ion beam at the 
white position has longer dwell time than the one in the black position. Figure 5 
shows the bitmap of a Chinese poem and its milled pattern fabricated by FIB direct 
writing (FIBDW). 
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Fig. 5 Nano Chinese poem fabricated by FIBDW. (a) Bitmap of poem; (b) poem fabricated by 
FIB (Fang et al. 2010a) 


Astigmatism Correction 

To fabricate nanostructures using FIBDW, detection and correction of the ion 
beam’s astigmatism are crucial issues. The astigmatism makes the shape of the 
focused ion beam spot an ellipse, which is supposed to be a circle, as shown in 
Fig. 6a, b. There might be four different orientations of the ellipse that are formed 
due to the imbalance of electrostatic forces that originate from the octopole 
electrodes (Fu and Bryan 2004b). 

The FIB system is capable of correcting astigmatism by using stigmators 
equipped in the ion beam column. By applying combinations of voltages to these 
deflectors, asymmetrical forces can be exerted on the beam to change its cross- 
sectional shape. The beam shape becomes circular when balanced stigmating forces 
are exerted by the octopoles. Unfortunately, the voltage offsets cannot be displayed 
on the operation screen. The FIB astigmatism is often judged by the naked eye of 
the operators according to their observation of the sample micrograph during FIB 
imaging, which is an obstacle to eliminate the astigmatism accurately. Dot array 
milling can be employed to inspect and correct the beam astigmatism (Fang 
et al. 2010a). The shape of the milled dots would reflect the ion beam shape. Dots 
in elliptical shape would indicate that an astigmatic ion beam has been formed. 
Then the astigmatism correcting process should be performed accurately until the 
beam shape becomes circular, as shown in Fig. 6c, d. 

Beam Overlap Control 

The FIB milling process is performed through a precise pixel-by-pixel movement, 
as shown in Fig. 7. Pixel spacing is the distance between the centers of the two 
adjacent pixels, which should be small enough to allow a proper overlapping 
between the adjacent pixels so that a smooth uniform profile can be fabricated, 
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Fig. 6 Illustration of the FIB with astigmatism and its correction by means of electrostatic force 
adjustment using the octopole electrodes. SEM images of dots array milled by FIB with and 
without astigmatism (Fang et al. 2010a). (a) An astigmatic beam, (b) a beam without astigmatism, 
(c) dots by astigmatic FIB, (d) dots by circular FIB 


Fig. 7 Schematic diagram of 
FIB milling with pixel 
overlapping (Fang 
et al. 2010a) 
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Fig. 8 Normalized ion flux distribution along a scan line with normalized pixel spacing equal to 
(a) 3.0 and (b) 1.5 (Tseng 2004) 


as shown in Fig. 8. To have a uniform scanning ion flux with respect to the scanning 
direction, the relation between the pixel spacing ( p s ) and FIB beam diameter (df) 
should be pjd f < 0.637 (Tseng 2004). In the bitmap patterning method, the beam 
overlap can be controlled by means of defining the bitmap pixel density, milling 
size, beam parameters, etc. 

Dwell Time 

The ion beam dwell time definition is vital for the FIB micro/nano fabrication (Fang 
et al. 2010a; Urbanek et al. 2010; Adams and Vasile 2006). The time that the beam 
remains on a given target pixel is called dwell time. The FIB dwell time can be 
varied from 100 ns to 4 ms in the FIB system (FEI Nova Nanolab). If the beam 
dwell time is too short (less than 1 ps), the FIB scanning loops would increase 
dramatically, which would result in unwanted ion milling (Fang et al. 2010). On the 
contrast, when dwell time is too long, the redeposition effect would also be a serious 
problem during the FIBDW process (Fang et al. 2010a; Adams and Vasile 2006). 
As the sample material is sputtered away, some is redeposited in the volume that is 
being sputtered. For a small dwell time (fast repeated scans), some fractions of the 
Ga ions are used to sputter away most of the redeposited material formed in the last 
loop. For a large dwell time (slow scans), the redeposited material cannot be 
removed completely, as shown in Fig. 9a. 

Scan Orientation 

Two different FIB scanning strategies, a conventional raster scan and a new spiral 
scan, have been compared in holes array fabrication (Hopman et al. 2007). Spiral 
scan means that each hole is scanned in a spiral pattern and turns out that better 
effect can be obtained in this way for several reasons. Firstly, the unwanted beam 
exposure or etching is much smaller in the case of the spiral scan. Secondly, 
compared with the shape of the holes produced through raster scan, in which 
process the beam is always moving in the same direction, the shape produced by 
the spiral scan is much more symmetric. In the third place, given that redeposited 
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Redeposition material 


Fig. 9 Large beam dwell time degrades the mask fabrication results due to the redeposition effect, 
(a) shows the redeposition effect under large FIB dwell time, and (b) shows the grating results with 
severe redeposition (Fang et al. 2010a) 



material can be better removed from the sidewalls as the beam progresses from the 
center of a hole outward, the spiral scan also benefits by improving the hole’s 
sidewall steepness (Hopman et al. 2007), as shown in Fig. 10. 

For nano-gratings patterned with two perpendicular slits, there would be remark- 
able height differences for the two different orientated slits if the bitmap is chosen as 
“r” or “L” shape in the FIBDW method, as shown in Fig. 11a, c. The FIB scan 
direction is set horizontally here. The FIB redeposition effect and milling restriction at 
the positions for the slits vertical to the scan direction (area B) in FIBDW would be 
larger than that of the parallel slits (area A). This results in height differences for the 
slits with different orientations, which is the reason why slits oriented differently are 
not in the same focus in the SEM image, as shown in Fig. 1 lc. By tilting the “p” shape 
bitmap 45° to the “V” shape, the orientation of the two slits relative to the FIB scan 
direction is nearly the same. The height difference can also be well minimized. 

Channeling Effect 

If the target material is a crystalline material where atoms are orderly arranged, 
incident ions may penetrate much deeper along the low indexed axis of crystalline. 
The penetration depth can be several times of that for amorphous target. Such 
phenomenon is called ion channeling effect. Ion channeling can occur when the 
angle of incidence of the ion beam yr with respect to a plane or an axis of the crystal 
is less than a certain critical angle yr c which can be approximately defined as 
(Kempshall et al. 2001) 



3A 2 ZiZ 2 
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Fig. 10 SEM images of comparing 250 nm diameter holes with 440 nm pitch milled into bulk 
silicon using different FIB scanning strategies (a) shows the raster scan results and (b) shows the 
spiral scan results (Hopman et al. 2007) 


(*A 1/3 y . 

I 128 ) X a ° 

A = — ^ m 2 

( Zl 2/3 +Z2 2/3) 1/2 

where Z 2 is the atomic number of the target material, Z x is the atomic number of ion, 

E is the energy of the incident ion, d is the distance between atom positions along 

the index direction, and A is the Thomas-Fermi screening length. 

If i/a > iff a the ion has a too large transverse energy (vertical to the ion injection 

direction) to be trapped in the transverse potential (to be channeled). On the 

contrary, if i/a < i/a c , the ion would not have enough transverse energy to overcome 

the potential barrier and would be trapped in the transverse potential well 

• • 

(Chouffani and Uberall 1999). Under this condition, a channeled particle would 
be guided along a plane or an axis of the crystal, as shown in Fig. 12. 

The ion channeling contrast shows a distinct change in Cu milling properties, as 
shown in Fig. 13. The [100] direction of the grain on the right side of Fig. 13a is 
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Fig. 11 Effect of slit orientation on the FIBDW process; (a) and (b) show the bitmaps for L- and 
V-shape mask; (c) and (d) show the fabrication results by FIB nanofabrication (Fang et al. 2010a) 


atomic 



Channeling effect 



Free moving 


Fig. 12 Schematic diagram of channeling effect 
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Fig. 13 Effects of channeling on the FIB milling of a Cu 10°/[100] twist bicrystal at (a) 0° tilting 
angle and (b) tilted a few degrees until the ion channeling contrast across the boundary is uniform 
(Kempshall et al. 2001) 


a 



x 0.200 |im/div 
z 10.000 nm/div 


|im 


0.8 


|im 


x 0.200 |im/div 
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Fig. 14 Channeling effect for the FIB nanofabrication on the single crystal diamond substrate 
(Xu et al. 2010) 


aligned with the focused ion beam and appears darker than the grain on the left side. 
When the ion channeling contrast across the grain boundary is not uniform, 
differential FIB sputtering yield as well as trench wall slope from redeposition 
would come out, as shown in Fig. 13a. Using the ion channeling contrast as a guide, 
the sample can be tilted a few degrees until the ion channeling contrast across the 
boundary is nearly uniform, as shown in Fig. 13b (Kempshall et al. 2001). From 
Fig. 13a, it is found that as the ion beam becomes incident in a channeling direction, 
the sputtering yield would decrease. The main reason for the sputtering yield 
decrease is that the channeled ions undergo mostly electronic energy losses as 
opposed to nuclear energy losses and are able to penetrate deeper into the crystal 
lattice (Kempshall et al. 2001). 

Ion channeling effects bring up many new phenomena such as reduced 
sputtering yield, low processing efficiency, groove-like morphology of the surface, 
and high surface roughness, as shown in Fig. 14. The factors that affect ion 
channeling include the angle of incidence of the ion beam, the characteristics of 
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Table 1 Calculated 
critical channeling angle i// c 
for Cu and Si (30 keV Ga + 
FIB) (Kempshall 
et al. 2001) 


Direction 

Angle from [100] 

y/ c for Cu 

y/ c for Si 

[110] 

45 

9.87 

6.39 

[100] 

90 

7.61 

4.93 

[112] 

35.3 

6.54 

4.23 

[130] 

18.4 

5.40 

3.49 

[111] 

54.7 

5.04 

4.05 

[123] 

36.7 

4.76 

3.08 

[114] 

19.5 

4.33 

2.80 

[120] 

26.6 

4.16 

2.69 

[150] 

11.4 

3.77 

2.44 

[125] 

24.1 

3.58 

2.31 

[113] 

25.2 

3.10 

2.49 


the ion, and crystallographic orientation of the target. Table 1 shows the comparison 
of the calculated critical channeling angle i \j/ c with different crystal directions for Cu 
and Si (Kempshall et al. 2001). One way to improve the sputtering roughness is to 
deliberately block the ion channels by doping the polycrystalline material with 
impurity atoms. The doped copper layer would be like amorphous layer and its 
sputtering uniformity can be greatly improved (Cui 2005). 


FIB Sputtering Applications 

Because the FIB sputtering method has advantages of high resolution, high flexi- 
bility, and maskless process, it has been widely used in the versatile applications, 
such as nano-optics (Gramotnev and Bozhevolnyi 2010), nano-manufacturing 
(Allen et al. 2009), bio-sensing (Lai et al. 2007), micro tool with nanoscale cutting 
edge (Xu et al. 2010), etc. 

Micro Optical Elements 

Micro optical elements (MOE) are key components for building compact optoelec- 
tronic systems. Micro-diffractive optical elements (MDOE) has shown many 
unique advantages, such as small volume and weight, good optical quality, and 
large apertures. With the trend of miniaturization, there is an increasing demand for 
applications of MDOE in solar condenser lenses, infrared sensors, laser beam 
shaping, and high-performance optical imaging systems. 

There are several approaches in fabricating micro optical elements, such as the 
binary optics fabrication method (Herzig 1997), direct writing method using a laser 
or a focused ion beam (Fu et al. 2010), single-point diamond turning (SPDT) 
technology (Yi and Li 2005; Fang et al. 2008), etc. The lithography method 
needs multistep processes with high cost, which needs continuous photolithography 
and etching process to produce multiphase steps structures. 

Focused ion beam direct writing method is found to be effective in MOE 
fabrication. Two-dimensional slice-by-slice approaching can be employed to 
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Fig. 15 Micro-diffraction 
lens developed by FIB 
(Fu and Bryan 2004a) 



facilitate three-dimensional micro optical elements’ fabrication, including micro- 
diffraction lens and micro hybrid refractive-diffractive lens (Fu and Bryan 2004a), 
as shown in Fig. 15. Micron-scale curved structures have been sculpted by varying 
the FIB ion dose per pixel and accounting for angle-dependent sputter yield 
provided that a range of suitable dwell times is selected, such as sinusoidal and 
near-parabolic features in Si (Adams and Vasile 2006). FIBDW is considered as a 
promising method for micro-optics device integration and optical system’s minia- 
turization. However, the FIB bitmap patterning method is time-consuming to 
accurately fabricate curved shapes in micro structures till now. 

SPDT can precisely control the fabrication process with a high machining 
accuracy, and it is much suitable for processing complex micro-diffractive optical 
elements. However, with the optical elements miniaturization, the development of 
the micro tools with nontraditional shape and sharp edges used in SPDT is becom- 
ing a very important topic. Due to the size and configuration limitations of tradi- 
tional cutting tools in SPDT, the shadowing effect would directly degrade the 
optical properties of the device and reduce its diffraction efficiency. While this 
problem can be overcome using a micro hemispherical tool (Xu et al. 2010; Zhang 
et al. 2009), an approach is needed to form the sharp cutting edge in the MOE 
fabrication. 

Micro-diffractive optical elements could be fabricated by micro tools with 
nontraditional geometry fabricated with FIB. Micro tools with nanometric cutting 
edges and complicated shapes are fabricated by controlling the tool facet’s orien- 
tation relative to the FIB (Ding et al. 2008; Xu et al. 2010; Zhang et al. 2009). The 
experiments for micro tool fabrication were performed under an FIB/SEM dual- 
beam system equipped with high-resolution rotational equipment. The translation 
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Fig. 16 Illustration of micro tool sharp edge formation (Xu et al. 2010) 


stage can be tilted within 15° ~ 60°. Tool rotation is controlled by a rotational axis 
that can rotate unlimitedly. Different tool faces can be milled by accurately 
adjusting their relative positions to the FIB by the rotation and sample tilt control 
(Xu et al. 2010; Zhang et al. 2009). 

During the process of the micro tool fabrication by the FIB milling, the FIB 
milling sequence and tool position relative to ion beam are critical factors in 
determining micro tool characteristics, such as cutting edge, rake face, and relief 
angle. The FIB would produce sharp edges on the side of facets furthest from 
the ion source, while the facet edge closest to the ion source is rounded because the 
part of the Gaussian beam intensity would extend outside the defined pattern 
boundary, as shown in Fig. 16. Therefore, in order to produce sharp cutting 
edges, tool position with respect to the ion beam and the fabrication sequences 
are crucial. Based on the sharp edge generation analysis above, three steps are 
employed to fabricate micro tool with sharp cutting edges by the FIB method. 
Figure 17 illustrates the fabrication procedure for an arc-shaped micro tool. 
A desired cross-sectional shape can be fabricated by the FIB bitmap patterning 
method, as shown in Fig. 18. The major advantages of the FIB-based micro tool 
fabrication method are the better geometry control of micro tools with sharp edge, 
but without the introduction of any machining stress comparing with precision- 
grinding method. 

The hemispherical micro tool was used to fabricate micro-diffractive optical 
component by the ultraprecision machining on a 6061 aluminum workpiece. 
The diffractive optical element designed has a diameter of 15 mm, a curvature 
radius of 30 mm, and the interval between the adjacent rings of 30 pm. The 
machined Fresnel optical lens is measured by a Veeco NT9300 interferometer, as 
shown in Fig. 19. 
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Fig. 17 Micro tool machining using FIB (Xu et al. 2010) 



Fig. 18 SEM images of the FIB -fabricated rectangular- shaped (a), spherical (b), hemispherical 
(c), and DOE-shaped (d) micro tools (Xu et al. 2010; Zhang et al. 2009) 
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Fig. 19 Data of micro-diffractive optical component, (a) Micro optical component and (b) 
morphology of the micro lens (Xu et al. 2010) 



Fig. 20 SEM images of nano patterns fabricated by the FIB. (a) 32 nm line width grating patterns 
on Cr him (Fang et al. 2010a); (b) nanogap with a width of 5 nm formed by FIB (Nagase 
et al. 2005) 

Nanofabrication 

As the FIBDW bitmap patterning method shows high resolution and flexibility, 
various nano patterns can be fabricated using this method, such as nano-grating 
patterns with 32 nm line width, as shown in Fig. 20a. Previous studies show that 
nanostructures with a width much narrower than the FIB beam diameter can 
be fabricated by precisely terminating the etching step (Nagase et al. 2005; 
Menard and Ramsey (2011)). Maskless nanogap electrode has been fabricated 
by Ga FIB direct writing technique. The etching steps are monitored in situ by 
measuring a current fed through the substrate films and terminated electronically by 
a preset current (Nagase et al. 2005). Figure 20b shows a nanogap with a 
width of 5 nm formed by FIB with a beam size of 12 nm on double-layer films 
consisting of 10-30 nm thick Au conducting layer and 1-2 nm thick Ti bottom 
adhesion layer. 

Nano-optics studies the behavior of light on the nanometer scale. A wide variety 
of novel optical properties would appear in nano-optics, such as strongly enhanced 
light transmission, beyond diffraction limitation, etc. There is a rapid expansion of 
research into nanophotonics based on surface plasmon-polaritons (SPPs) in recent 
years (Gramotnev and Bozhevolnyi 2010). Surface plasmon (SP) is a surface 
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electromagnetic wave existing in the metal and dielectric interface, which has a 
large wave vector. Surface plasmon polaritons can be excited by the nanostructures 
on metal film (Ag, Au), providing a wave vector that offsets the mismatch value 
between SPs and light. When light irradiates the rough surface, the SPs can be 
excited to a higher level and then be coupled with the light. 

With the rapid development of nano-optics, the surface plasmon-based nano- 
optical elements, which can be used in the surface plasmon interference 
nanolithography, the plasmon-enhanced sensing and spectroscopy, the nanoscale 
superfocusing, and so forth have attracted extensive attention. Due to its high 
resolution and high flexibility, FIB nanofabrication method has been widely 
adopted in the nano-optical elements’ fabrication (Liu et al. 2007a; Fang 
et al. 2005). The investigations about typical plasmonic lenses developed by FIB 
would be presented below. 

The resolution of conventional optical system is governed by the diffraction 
limit, which typically is about half of the wavelength. In recent years, plasmonic 
lens focusing by means of surface plasmon polaritons (SPPs) at the negative index 
medium (NIM) is attracting much interest, which can focus the evanescent waves in 
the near-held region with sub-diffraction-limit resolution (Fu and Zhou 2010; Liu 
et al. 2007a). 

The superlens was first developed for the superfocusing study. It was proved that 
the silver superlens can image arbitrary nanostructures with sub-diffraction-limited 
resolution (Fang et al. 2005). As a core part, the chrome him was patterned on 
quartz by using focused ion beam technique. The hrst type of plasmonic lens 
mentioned above can overcome the diffraction limitation, but it also has disadvan- 
tages such as no working distance, no amplifying function (magnihcation is 1), and 
one-dimensional imaging only. A hyperlens can be used to magnify the object in 
radial direction on a half-cylindrical multilayer structure, where sub -diffraction- 
limited objects could be fabricated on a 50 nm thick chrome layer located at 
the inner surface (Liu et al. 2007b). Plasmonic slits with different stepped grooves 
have been designed and fabricated to achieve efficient plasmonic focusing. 
The SPPs propagate along the surface of the metal him and are diffracted to 
the far held by the periodic grooves, which are usually designed with a width 
smaller than half of the incident light’s wavelength. Constructing interference of 
the diffracted beams results in the focusing effect at a certain point on the beam 
axis (Fu et al. 2010). Adjusting the periodic grooves’ parameters (such as width, 
depth, period, and number), the diffracted beams can be fully controlled to 
form a tailored ultracompact lens with sub-wavelength resolution (Shi et al. 2007; 
Liu et al. 2011). 

In order to realize superfocusing function, plasmonic lenses of metallic 
sub-wavelength structures with chirped (variant periods) slits and nanopinholes 
were put forward (Fu et al. 2010; Zhang et al. 201 1). A novel plasmonic lens with a 
micro-zone platelike (PMZP) structure can be used, in which a thin film of Ag is 
sandwiched between air and glass, as shown in Fig. 21 (Liu et al. 2011). However, 
the ring-based structures have higher sidelobes. To suppress the sidelobes and 
further improve the focusing quality, a circular and elliptical hole-based plasmonic 
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Fig. 21 Images of the plasmonic structures fabricated by FIB direct milling technique, (a) SEM 
image of the plasmonic lens; (b) the SNOM result of the plasmonic lens (Liu et al. 2011) 

lens were put forth (Fu and Zhou 2010; Zhang et al. 2011). As an example, an 
elliptical hole-based plasmonic lens was studied with eight rings on a 200 nm 
thickness Ag film coated on quartz substrate, as shown in Fig. 22. The pinholes are 
completely penetrated through the Ag film during FIBDW. The number of pinholes 
from inner to outer rings is 8, 20, 36, 55, 70, 96, 107, and 140, respectively. The 
elliptical hole-based plasmonic lens can generate ultra-enhanced lasing effect and 
realize an extraordinarily elongated DOF as long as 13 pm under illumination of 
plane wave in linear y-polarization (Zhang et al. 2011). 

Plasmonic lens with extraordinarily elongated DOF has the unique features in 
practical applications, such as nanoscale imaging and lithography (Liu et al. 2011; 
Fu et al. 2010). 3D image of cells or molecular can be obtained using plasmonic 
lens in bio-imaging systems, such as confocal optical microscope. In surface 
plasmonic structure-based photolithography systems, the reported experimental 
results were obtained near-held with about ten nanometers gap between the SP 
structure and substrate surface (Fang et al. 2005), and the gap is difficult to precisely 
control in practical operation. Fortunately, when the plasmonic lens are utilized, the 
working distance between the structure and substrate surface can be 10 pm and even 
longer, which would make the practical control and operation process much easier. 


FIB-Induced Deposition 

FIB can be employed to realize the localized maskless deposition of both metal and 
insulator materials. In FIB-induced deposition (FIBID), the precursor gas is first 
sprayed and then adsorbed locally on the sample surface using a fine nozzle. Then 
the injecting ion beam decomposes the adsorbed precursor gases. The volatile 
reaction products are desorbed from the surface and are removed by the FIB vacuum 
system, while the desired reaction products would fix on the surface and form a thin 
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Fig. 22 Pinhole array plasmonic lens, (a) Schematization of the pinhole array with focal length /. 

2 

(b) E-held intensity distribution lEyl at x-z and y-z planes for the pinhole array, (c) Pinhole array 
fabrication result by FIBDW. (d) SNOM results regarding focusing performance with sample 
scanning along propagation direction at distance Z = 5 pm (Fu et al. 2010; Zhang et al. 2011) 
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Fig. 23 Pt deposited cross- 
sectional of Si specimen with 
less than 100 nm thickness 
milled by FIB 



Fig. 24 CNT probe for high 
aspect ratio imaging 

(Xu et al. 2009b) 



film. The deposited thin film is not fully pure material, which would contain organic 
contaminants as well as Ga ions from the FIB (Reyntjens and Puers 2001). 

FIB ID can be used in the IC chips’ modifications by introducing a conductive 
connection. It is also employed in the TEM samples’ preparation by film deposition 
on the interesting areas to protect the specimen from the ion beam damage, as 
illustrated in Fig. 23. However, as the FIB would induce damage to the sample 
during the FIB ID process, electron beam-induced deposition method has gradually 
replaced the FIB ID method in preparing TEM sample. 

FIBID can be used as a flexible adhesive in the micro/nano devices fabrication. 
For example, carbon nanotube (CNT) can be fixed to the end of ordinary probe of 
atomic force microscope by the FIBID process combining with the nanomani- 
pulations, as shown in Fig. 24 (Xu et al. 2009a). Another application of FIBID 
is to fabricate three-dimensional (3D) microstructures/nanostructures. Matsui 
fabricated micro- wine-glass with 2.75 pm external diameter and 12 pm height 
on Si substrate and human hair (Matsui 2003). Various materials can be deposited 
to build 3D structures by altering the precursor gas source such as diamond-like 
carbon, tungsten, and silicon oxide (Kometani et al. 2010). 
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FIB Irradiation 
lon-Beam-Induced Damage 

In addition to emitted electron and secondary ions, ion implantations induce point 
defects, clusters and amorphous regions, depending on different ion doses and 
annealing temperature, as showed in Fig. 25 (Sebania and Antonino 2010). For 

1 A A 

low-dose (<10 cm ) implants at low annealing temperatures (<300 °C), the 
main induced damage is point defects. When the target is annealed for implantation 

Q IQ Q 

doses in the range of 10 ~ 10 cm at temperatures lower to 550 °C, the dominant 

1 Q Q 

damage is clusters. For high implantation dose (> 10 cm ) implants, amorphous 
pockets and even continuous amorphous layers are observed in the implanted 
region. High ion implantation doses annealing at high temperatures above 600 °C 
would form other extended defects like dislocations (Sebania and Antonino 2010). 

• Point Defects 

If the energy of the target atom received from an incident ion is larger than a 
threshold value, this atom displaces from its initial site in the lattice, leaving a vacancy 
defect there. Then it collides with other target atoms causing a cascade collision or 
stops in the interstitial site in the lattice forming an interstitial atom defect. When the 
transformed energy attained to a larger value, a Frenkel pair would be created 
(Sebania and Antonino 2010), which consists of one vacancy and one interstitial 
atom. Di-vacancies are also principal defect at room temperature, as shown in Fig. 26. 

• Defect Clusters and Amorphous Regions 

1 o A 

After Si self-implantation with doses over 10 ~ cm ~ and annealing tempera- 
tures over 550 °C, the point defects gradually develop to a more stable and complex 
structure, i.e., clusters, as shown in Fig. 26 (Sebania and Antonino 2010). The 
clusters form in some specific conditions such as (i) when the implanted ions are 
mobile to attain a reasonable concentration at part of the implant region, (ii) when 
the implanted ions are immiscible with target materials, and (iii) when the temper- 
ature is high enough for implanted ions migration to enable clustering. Under these 
conditions, the individual impurity atom A is possible to form a dimer A 2 , then a 
trimer A 3 , and finally growth into a cluster. The cluster could also form via 
nucleation, where an extremely high nucleation energy barrier needs to be 
exceeded. Since the nano-clusters have a large surface-to-volume ratio, they are 
quite different from that in normal equilibrium. For instance, the melting transition 
of Ga 39 + and Ga 40 + happened at around 550 k (Breaux et al. 2003), much higher 
than the bulk melting point (303 K). 

Due to the accumulation of point defects and disorder, Si would undergo a phase 
transition from a crystalline to amorphous during ion irradiation with a dose on the 

1C Q 

order of about 10 cm at room temperature, when the free energy of the damaged 
crystalline phase is higher than that of the amorphous phase (Pelaz et al. 2004). 


Annealing temperature (°C) 
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Fig. 25 Illustration of the principal damage features as a function of the ion dose and annealing 
temperature for MeV light ion implantation on Si (a) (Sebania and Antonino 2010) and illustration 
of point defects (b) 
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Fig. 26 Nanoparticles of gallium-silicon compound in the FIBI layer in Si (100). (a) Cross 
section of FIBI layer, (b) The FIBI layer after XeF 2 gas flow eliminating the redeposition 
(Xu et al. 2009a) 


The processes’ characteristic times which take place in MeV ion irradiation 
show different timescales. The ion-nucleus scattering and ion-electron scattering 
process would be in as timescale, while the defect clusters and dislocations would 
be formed in ns timescale (Whitlow and Zhang 2010). 


FIBI Used as Mask 

In the ion implantation, the bond between target atoms could be broken while the 
ion would bond with target particles forming new compounds. Nanoparticles of 
gallium-based compound with 10-15 nm diameter were found evenly distributed in 
the FIB implantation (FIBI) layer in the Si (100) substrate when the ion dose is 
larger than, as shown in Fig. 26 (Xu et al. 2009b). The FIBI layer could be used as 
etching mask in the micro-fabrication, based on that the etching rate of 
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Fig. 27 Nanoelectrodes made by FIB implantation and subsequent XeF 2 gas-assisted etching, (a) 
Structures made by FIB implantation, (b) nanoelectrodes made by combining FIBI with FIB-GAE 
(Xu et al. 2009a) 

FIB -irradiated area is much lower than that of nonirradiated area during the subse- 
quent dry (wet) chemical etching or subsequent in situ FIB XeF 2 gas- assisted etching 
(Xu et al. 2009b). The etch stop mechanism has been ascribed to the formation of a 
resistive gallium oxide layer in the doped surface layer for Si (Arshaka et al. 2004). 
By using FIBI as mask, many kinds of nanostmctures have been achieved, such as 
nanochannels, nanocups, freestanding bridges (Reyntjens and Puers 2001; Tseng 
2004), and nanoelectrodes (Xu et al. 2009b), as shown in Fig. 27. 

The ion implantation dose would affect the masking capability of Ga + -implanted 
region and its etch stop mechanism (Fang et al. 2010b). For the range of ion dose 

1 /C ^ 

<2.8 x 10 cm , implanted Ga ions are incorporated as dopants into the ZnO 
substrate without noticeable second phases or precipitates. The Ga-doped ZnO 

1 ZT rs 

layer can be used as the etch mask. For the ion dose >4.6 x 10 cm , the 
ZnGa 2 0 4 particles would segregate on the surface, which are responsible for the 
etch mask behavior, as shown in Fig. 28. Ga ions in the ZnGa 2 0 4 lattices attract O 
ions strongly and prevent Zn-0 bonds from reducing because the electronegativity 
of Ga is higher than that of Zn. 

Besides being used as the etching mask, the Ga + -focused ion beam implantation 
has also been used to in situ control the nucleation of individual Ge islands on the Si 
(001) surface at nanometer scale. The patterning technique consists of three steps: 
(i) low-dose Ga + implantation, (ii) post-implantation annealing, and (iii) high 
vacuum deposition of Ge. Particular nano-topography formed after FIB implanta- 
tion and annealing acts as island nucleation centers (Portavoce et al. 2007). 
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Fig. 28 TEM image of the Ga-implanted ZnO. (a) Cross-sectional TEM image of the 
Ga-implanted ZnO with a dose of 1.8 x 10 17 cm -2 , after H 2 etching at 700 °C for 30 min. 
(b) HRTEM image of a surface-segregated particle in (a). The inset shows the Fourier transformed 
image taken from the particle (Fang et al. 2010b) 


Nano-hole Fabrication 

Nanopores or nanochannels fabricated in solid-state membranes have been used for 
the rapid electrical detection and characterization of single biopolymer molecules 
(Chih et al. 2006). A focused ion beam sculpting or scanning technique can be 
employed to fabricate nanopores precisely (Chih et al. 2006). Relying on this 
technology, it can tune pores of various sizes and irregular shape to symmetrical 
and shrunk nanopores simply by scanning the membrane surface. The membrane 
used was made of silicon nitride (SiN) with a thickness of 100 nm, which was 
coated with 1-2 nm of gold-palladium (AuPd) to help imaging and to prevent the 
charging effect during exposure to ion radiation. The ion beam provides some 
mobility to the SiN membrane material around the pores and allows the pores to 
shrink due to surface tension. The technique provides a visual feedback over the 
nanopores ’ formation process, so that it is capable of producing highly symmetrical 
nanopores with diameters of smaller than 5 nm (Chih et al. 2006). This sculpting 
process also allows an array of multiple nanopores to be fine-tuned simultaneously, 
as shown in Fig. 29. The nanopores array can be used as templates for photonics, for 
the synthesis of nanostructures, and so on. 


3D Nanostructures Fabrication and Assembly 

Focused ion beam (FIB) has been recognized to be convenient to fabricate 
two-dimensional (2D) nanostructure. However, although 3D nanofabrication has 
been established by FIB chemical vapor deposition (CVD) using a hydrocarbon 


37 Focused Ion Beam Nanofabrication Technology 


1417 


a 


B e fo re sc a n n i ng 


b 


After FIB scanning 



* 


* 



W 05- CM I M*i 1 ■' tD&mm IPj pJ wm w im 

iM'-i* n&m\ taoix now i serai 


WIO! I 

<!$*!» 




Fig. 29 Nanopores have been reduced in size and made more symmetrical using the FIB 
sculpting process, (a) An array of asymmetric pores with ~100 nm diameter was initially milled 
by FIB. (b) All pores were tuned simultaneously to symmetrically shaped nanopores of diameter 
~20 nm by FIB sculpting (scanning) (Chih et al. 2006) 

precursor, the FIB-CVD method still has many limitations. 3D nanostructures can 
be produced based on FIB stress -introducing technique (FIB -SIT) (Xia et al. 2006). 

In the FIB stress-introducing method, as a result of the ion implantation during 
the FIB milling, a damaged layer is formed in the milled areas near the thin film 
material surface; tensile stress and strain are arise in the damaged layer. Therefore, 
the distribution of tensile and the film curling in nanometer scale can be flexibly 
controlled to form 3D nanostructures. 

The 3D nanohelix structural parameters can be calculated as follows: 

Number of helical turns : n = s - 0/360° (3) 


Pitch : p = 


360° , 

1 cos a 

0 



360 /sin a 

Helix diameter : D = ( 5 ) 

On 

where / is the FIB scanning (stress-introduced) period, a is the FIB scanning 
orientation, s is the times of FIB stress-introduced, and 0 is the bending angle of 
cantilever made by every stress-introduced, as shown in Fig. 30. Based on the 
rolling strategy, the pitches, diameters, and chiralities of the nanohelixes can be 
controlled effectively. For example, the nanohelix’s pitch and chiralities can be 
controlled by varying the FIB stress-introduced orientation. Combining with the 
FIB-CVD method, a fixed-fixed helix with 710 nm diameter was successfully 
fabricated by FIB -SIT. 3D nanohelix can be fabricated using FIB -SIT method 
on different substrates, such as Si film, Al film, N 3 Si 4 film, etc. The FIB -SIT 
technology could also be used in the 3D assemblies. With the advantages of high 
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Fig. 30 Nanohelix fabricated by FIB stress-introducing process, (a) Schematic of the FIB stress- 
introducing process. Cantilever curling happens at the part where FIB damages the surface layer, 
(b) 3D nanohelix structural parameters (Courtesy of Prof. Wu and Dr. Xu) 



Fig. 31 Nanotube probe’s orientation and length are optimized by FIB irradiation and milling 
processes, (a) Misaligned CNT probe, (b) well-aligned CNT probe, and (c) CNT probe after 
shortening (Fang et al. 2009) 


flexibility and controllability, the FIB -SIT is a promising technique to fabricate 
various 3D nanostructures and implement 3D assembly, which might find broad 
applications in the biomedical sensing systems and nano-electromechanical 
systems. 

Similar to the FIB -SIT technology, focused ion beam irradiation method can be 
employed to modify the orientation of carbon nanotube (CNT) AFM probe with 
high accuracy and reproducibility (Deng et al. 2006; Park et al. 2006). In the CNT 
probe’s orientation alignment process, repeated single scans of the ion beam 
imaging were applied. The CNT probe was gradually aligned towards the FIB 
irradiation direction under every FIB scan and finally aligned parallel to the FIB 
irradiation direction, as shown in Fig. 31. 

The variables that affect the FIB -CNT interaction process include ion acceler- 
ating voltage, ion beam current, exposure time, and ion scan size (or the scan pixel 
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density). Optimized combinations of these variable values can produce acceptable 
results (Fang et al. 2009). The CNT orientation changes are mainly due to the strain 
induced by the FIB irradiation and the CNT’s excellent plastic ability. It was found 
that the carbon nanotubes can be considered as self-healing materials under electron 
or ion irradiation (Krasheninnikov and Nordlund 2010). 


FIB Damage Annealing 

The FIB micro/nano machining processes would induce lattice damage, 
make the top layer amorphous, and implant ions into the substrate. The FIB 
irradiation-induced damages during the FIB milling process would degrade some 
properties of fabricated devices, such as silicon photonic devices (Tanaka 
et al. 2006; Tian et al. 2009; Ridder et al. 2007). The FIB -induced damages 
would introduce dramatic optical loss or absorption to the fabricated silicon 
photonic devices, which can reduce the cavity Q factors (Tanaka et al. 2006). 
Fabrication strategies have been used to minimize the optical losses, such as 
high-temperature annealing or using a protective mask and a chemical etchant 
(Schrauwen et al. 2006). 

Annealing treatment can be an effective method to reduce the FIB -fabricated 
device properties’ degradation (Schrauwen et al. 2007). Tanaka et al. have exper- 
imentally demonstrated that a high-temperature annealing treatment at 800 °C in 
vacuum can remove most of the residual FIB gallium ions penetrated into 
the silicon photonic crystal slab (Tanaka et al. 2006). Focused Ga ion-beam 
implantation has been developed to change the structural and electrical properties 
of microchannels and nanochannels in Si on SOI substrates. Thermal annealing 
results in the restoration of the majority of the damaged lattice (Pan et al. 2005). 
Structures of Ga ions implanted silicon-on-insulator (SOI) devices before and 
after annealing have been characterized by cross-sectional TEM and Raman 
spectroscopy (Pan et al. 2005). It is found that the defect structure rearranges 
after the annealing as presented in Fig. 32a, b. Raman spectrums of the 
Si implanted with different doses before and after annealing showed in 
Fig. 32c, d separately, which also indicates that the majority ion-induced damage 
is annealed out. 

A thermal oxidation process can be used to reduce the ion-induced damages to 
the silicon substrate, such as implanting ions, amorphous layers, and redeposition 
process. Experimental results show that by means of silicon thermal oxidation 
treatment, the Q factors of the silicon photonic resonators fabricated by FIB milling 
can be significantly improved (Tian et al. 2009). In order to reduce the FIB 
implantation’s degradation, an additional 200 nm thick silica layer acting as a 
protective mask in FIB machining was used to improve the device’s properties 
(Tian et al. 2009). This mask has twofold purposes: protecting the region from ion 
damage during the FIB focusing and removing the FIB redeposition when the silica 
dissolved in hydrofluoric (HF) acid. 
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Fig. 32 Cross-sectional TEM micrograph of the silicon-on-insulator after (a) Ga + implantation 

ic o 

with a dose of 10 cm ; (b) annealing at 650 °C for 300 s. High-resolution images of the 
implanted region on the left of each image. Raman spectrum of the SOI implanted with different 
doses of Ga + (c) as implanted, (d) annealed at 650 °C for 300 s (Pan et al. 2005) 


Summary and Outlook 

Focused ion beam (FIB) technologies have been widely used in micro/nano 
manufacturing, with unique advantages of high fabrication resolution, high flexi- 
bility, maskless processing, and rapid prototyping. FIB technologies have a signif- 
icant impact in various areas, such as micro-/nano-optics, surface engineering, 
biotechnology, and nanotechnology. 

FIB possesses the advantage of high resolution in comparison with the laser 
machining technology and many other micro/nano manufacturing technologies. 
However, it has two main disadvantages need to be overcome. 

Firstly, the FIB machining efficiency is generally lower than others micro/nano 
fabrication methods. How to increase the FIB efficiency without its resolution 
degradation is a key topic in the future. Manufacturer has newly combined FIB 
with pulsed laser beam (AURIGA laser), which can dramatically increase the 
material removal efficiency. Another good example is projection focused ion 
multi-beam system (PROFIB) using multi-ion beam which is emitted from the 
multi-ion source (Loeschner et al. 2002). 

Secondly, how to control the FIB milling results with nanometer- scale form 
accuracy is still an important area in milling high-precision components, especially 
for the curved micro/nano devices fabrication. Therefore, it is necessary to further 
develop a reliable in situ feedback system to detect or monitor the milled shape in 
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nanoscale accuracy during FIB operation. Integrating the atomic force microscope 
(AFM) with FIB might be an appropriate choice. Once the high-performance FIB 
system is developed, it would become the mainstream tool in the micro-/ 
nanotechnology in the future. 
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Abstract 

Single-crystal materials are widely used in optics and electronics field and are 
essential in condensed matter physics and materials science researches. However, 
the hard and brittle nature of the materials prevents it from manufacturing intricate 
features and optical quality surfaces. Nanometric cutting assisted with ion implan- 
tation surface modification method reduces the fracture and tool wear during 
machining by improving the surface mechanical properties. The mechanism, tech- 
niques, and applications of this method are specifically explained in the chapter. 


Introduction 

Due to the absence of the defects associated with grain boundaries, single crystals have 
unique properties, particularly mechanical, optical, and electrical properties, which can 
also be anisotropic, depending on the type of crystallographic structure. These proper- 
ties find wide applications in the fields of optics and electronics. Single crystals are also 
essential in research, especially for condensed matter physics and materials science, etc. 

Monocrystalline silicon is the principal material applied in microelectronic, 
micro -mechanical, and infrared optical components. Germanium “metal,” i.e., 
isolated germanium, is used as semiconductor in transistors and various other 
electronic devices. Germanium’s major end uses in the present are fiber-optic 
systems and infrared optics. It is also used in solar cell applications. Lithium 
niobate (LiNb0 3 ) is widely used in a variety of photonic devices due to its high 
electric-optical, acousto-optic, piezoelectric, and nonlinear optical coefficient. For 
its low-cost and outstanding physical properties such as high Curie temperature and 
high rigidity, LiNb0 3 has been successfully used in the applications of integrated 
optics. High-quality electric-optical modulators, optical switches, and rare earth- 
doped waveguide lasers based on LiNb0 3 have been developed. 

However, these crystal materials are nominally hard and brittle, which 
prevents them from manufacturing intricate features and optical quality surfaces 
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(Zhang and Zarudi 2001). It is currently finished by lapping and chemo-mechanical 
polishing (CMP) (Yan et al. 2001). Many efforts were made to improve the 
productivity in recent years. Ultra-precision cutting can machine crystals in a 
ductile mode without the need for subsequent polishing, but the machining scale 
must be controlled to be in a range of a few tens of nanometers (Fang and Venkatesh 

1998). 

Nanometric cutting assisted with ion implantation surface modification provides 
a novel approach for manufacturing brittle monocrystalline materials. The mechan- 
ical properties of the surface layer of the material are modified in a depth up to a few 
micrometers by decrystallization using high-energy ion bombardments so that the 
brittleness is reduced and the plasticity is enhanced. The productivity is greatly 
improved by avoiding the fractures during the surface mechanical processing (Fang 
et al. 2011). 


Mechanism of Ion Implantation Surface Modification 
Introduction 

Optical single crystals have a brittle nature at ambient temperature. However, its 
amorphous modification exhibits a plastic flow under low loads so that possible 
phase transformation undergoing machining operation is of great significance in 
producing damage-free components by machining. Single-point diamond turning in 
ductile mode of silicon surfaces leads to amorphization. This amorphization pro- 
cess can be attributed to the pressure-induced phase transformation generated by 
the hydrostatic pressure. 


Ion Implantation Surface Modification Mechanism 

Ion implantation surface modification of single crystals has attracted significant 
interest. A number of theoretical calculations and experiments were designed to 
provide a better understanding of the mechanisms behind the crystal-to-amorphous 
transition. 

A renewed interest in the modeling of amorphization mechanisms at atomic 
level has arisen due to the use of preamorphizing implants and high dopant 
implantation doses. Ion implantation surface modification of single crystals is the 
result of a critical balance between the damage generation and its annihilation 
(Fang et al. 2011). Ion bombardment and cascade collision generate different 
damage mechanisms from point defects (vacancies and interstitials) to point defect 
clusters. The superlinear trend in the damage accumulation with dose and the 
existence of an ion mass depending critical temperature above which it is not 
possible to be amorphized are some of the intriguing features of the ion-beam- 
induced surface modification. 
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Mechanism of Nanometric Cutting of Crystal Surfaces Modified by 
Ion Implantation 

In nanometric cutting, tool extrusion caused by effective negative rake angle pro- 
duces hydrostatic pressure on the workpiece (Fang and Venkatesh 1998 and Fang 
et al. 2005). Monocrystalline silicon has perfect diamond lattice at ambient tem- 
perature. The hydrostatic pressure generates lattice slip and lattice disorder, which 
could be seen in Fig. 1. Defect-free atoms in the workpiece are minimized for 
visualization, and lattice defects can be observed. When the tool imposes force onto 
the workpiece, the hydrostatic pressure induced by tool extrusion leads to lattice 
slip. Layer by layer, lattice slip extends to the scale of the cutting depth. When 
lattice slip extends to the threshold, the material is conveyed at the lattice slip layer, 
and dislocation carries out underneath the tool cutting edge. As the cutting process 
steps into an equilibrant state, lattice deformation only exists at the tool edge. The 
lattice disorder and atom flow, which contribute to a small-scale cutting force 
vibration, form the cutting chip and finished surface. 

After ion implantation, the lattice structure of the surface layer of the mono- 
crystalline silicon has been transformed due to the ion bombardment. The silicon 
atoms are no longer arranged in an ordered structure. The internal damage created 
by ion implantation acts as the core to absorb the tool-induced energy and expands 
during the cutting process which prevents lattice slip and dislocation from going 
further. As the cutting comes into the equilibrant state, all the damages inside the 
workpiece disappear due to self-relaxation. The lattice deformation also only 
occurs at the tool edge. The process is shown in Fig. 2. The ion-implanted silicon 
reduces the resistance against load. The brittleness is reduced and the elasticity is 
enhanced. It is beneficial for prolonging the cutting tool life and generating an 
optical and damage-free surface finish. 


Nanometric Cutting Approach of Ion Implantation Surface 
Modification 

Ion Implantation Surface Modification Simulation 

In real ion implantation process, the ion-material interaction is swift and in atomic 
scale, which can hardly been observed. Nanometric processing involves changes of 
the material in only a few nanometers from the surface. It is difficult to observe and 
illustrate accurately the structural changes during the process. Molecular dynamics 
(MD) simulation method becomes an important tool to analyze the mechanism of 
ion implantation surface modification and the follow-up processing. 

MD simulations are carried out to study the mechanism of monocrystalline 
silicon surfaces modified by ion implantation. Lattice transformation is observed 
during the implantation process to examine the relationship between the damage 
mechanism and the implantation parameters. Nanoindentation simulations are also 
carried out, and it is proved that the ion implantation modification indeed influences 
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Fig. 3 (a) 3-dimentional MD model system of ion implantation, (b) 3-dimentional MD model 
system of nanoindentation 


the mechanical property of monocrystalline silicon and the mechanism of the 
mechanical processing of the material. Experiments of ion implantation, 
nanoindentation, and nano- scratching are employed to verify the simulation results. 


MD Simulation Model 


The simulations are carried out using large-scale atomic/molecular massively 
parallel simulator (LAMMPS) (Plimpton 1995). The use of molecular dynamics 
simulation to study atom irradiation effects of solids (Pelaz et al. 2004) and 
nanometric processing (Fang et al. 2005, 2007) has become an effective approach. 
Three-dimensional model is employed in the simulation. Figure 3 is the 
3 -dimensional MD model of the system. In the model, the workpiece is silicon 
consisting of 57,600 atoms and has been divided into three different layers: 
boundary layer, thermo layer, and Newton layer. Atoms in the boundary are fixed 
in position to avoid unexpected movement of the workpiece during simulation and 
reduce the boundary effects. Thermo layer is used to conduct away the outward heat 
from the interaction region by maintaining a constant temperature of 300 K. The 
motion of the atoms in the Newton layer is determined by the forces produced by 
Newton’s equation of motion. Figure 3a is the ion implantation simulation model. 
The implantation position for ions is randomly chosen. Perpendicular incident is 
adopted for computationally simpler handling. To modify the surface to a few 
micrometers, heavy ions and MeV implantation energy are adopted in the experi- 
ment. Considering the scale of the simulation, 30 keV is chosen. The simulation 
time step is prolonged to 1 f. to adapt to the energy. Figure 3b is the nanoindentation 
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Table 1 Workpiece and simulation conditions (Fang et al. 2011) 


Properties 

Parameters 

Work material 

Monocrystalline silicon 

Work material and structure 

Diamond cubic 

Work dimensions 

20 a x 15 a x 24 a, a 5.431 A 

Potential for simulation 

Morse and Tersoff model 

Implantation energy 

30 keV 

Implantation fluence 

1 x 1,013,5 x 1,013,1 x l,014ion/cm 2 

Indenter dimension 

15 a in diameter 

Loading direction 

(100) surface 

Loading speed 

4 m/s 


model. The nanoindenter in the simulation is regarded as rigid body so that the 
atoms are fixed relatively to each other and with no interaction force between the 
atoms. Details of the workpiece and simulation conditions are summarized in 
Table 1. 


MD Simulation Potentials 

There are two different atomic interactions in the MD simulations processes: Eq. 1 
the interaction in the workpiece and Eq. 2 the interactions between the workpiece 
and the implant ions and between the workpiece and the diamond indenter. 

For the interaction between the silicon atoms in the workpiece, Tersoff potential 
(Tersoff 1988) is used. The interactions between silicon atoms and implant ions and 
between silicon atoms and carbon atoms of the diamond indenter were described by 
Morse potential. 

In ion implantation simulation, Si atoms interact with each other normally via 
the Stillinger- Weber (SW) potential (Stephenson et al. 1996). However, the stan- 
dard potential is Tersoff for nanoindentation simulation to observe the lattice 
evolution (Cheong and Zhang 2000), and the mechanism changes of nanometric 
processing is the main interest. SW potential is sacrificed in order for the continuous 
and quantitative result in simulation. 


MD Analysis Algorithm 

Radial distribution function (RDF) and a geometric criterion for damage detection 
(Caturla et al. 1996) are used in the simulation to analyze the simulation 
process. RDF, also called g(r), is a useful index to judge phase transformation by 
evaluating the number of neighbor atoms and obtaining atomic distribution. RDF 
quantitatively analyzes the structural changes of the workpiece during ion implan- 
tation in histogram form by evaluating the statistical information of bond length 
variation. 
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Fig. 4 Schematic for ion implantation: (a) ion-substrate interaction in ion implantation process, 
(b) simulated implantation process 


The nearest-neighbor distance in c-Si, rNN = 2.35 A, is used for damage 
detection. A sphere around each atom with radius rNN/2 is defined, and all the 
atoms at the initial undisturbed lattice of occupied atoms sites are regarded as 
ordered, which follows the concept of the Lindemann sphere (Hensel and Urbassek 
1998). A Si atom that is outside all the spheres is categorized as interstitial, while 
within the spheres, when no atom is found, a spot categorized is defined as a 
vacancy. Here the concept of lattice site is not adopted for disordered zones, for 
the atom displacements are large in both high-energy bombardment and mechanical 
processing. 


Ion Implantation Process 

High-energy ions bombard the material surface and interact with silicon atoms, as 
shown in Fig. 4. Figure 4a illustrates the ion-substrate interaction in ion implanta- 
tion process, while Fig. 4b is the simulated implantation process. One part of ions 
collides with the surface atoms and is backscattered; the other part penetrates into 
the material, and cascade collides with atoms inside the material, resulting in a 
collision cascade. The penetrated ions gradually lose their energy, finally rest in a 
certain depth of the material, and form ion implantation. Substrate atoms are 
displaced or excited by atom collision. A number of atoms affected increase 
exponentially, where some are even bounced into the inner layer or sputtered out 
of the material. Secondary electron and X-ray are also emitted during the process. 

Ion implantation surface modification is created by implanted ions and bounced 
atoms penetrating into the surface. The lattice is disturbed under the impact of 
the incident ions, resulting in dislocations and forming ion implantation damages. 
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Fig. 5 RDF results of the 
lattice transformation during 
ion implantation (Fang 
et al. 2011) 
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The structural changes in the implantation layer are the result of the critical balance 
between the damage generation and its annihilation. The depth of the modified layer 
is related to the incident energy, incident angle, incident ion species, and surface 
conditions. Ions are energy free when reaching the ion resting layer and the lattice 
structure in this region is influenced by the resting ions. The structural changes are 
smaller than the modification layer. The substrate is not influenced by the 
implantation. 

Figure 5 shows the RDF results of the lattice transformation during different ion 
implantation periods. Different peak distributions of RDF curves illustrate the 
changes in atomic arrangement. The width of the peak indicates the density of a 
specific crystal form; the narrower the peak is, the more lattices are in perfect order 
and vice versa. 

Before implantation, silicon exhibits a narrow and clear single peak, indicating 
that the substrate is perfect diamond cubic crystal. Due to the cascade collision, the 
crystal structure of the material is significantly been influenced during ion implan- 
tation. The peak is shortened and broadened, a series of second peaks appear, and 
the peak shape is no longer clear, which shows the existence of non-ideal crystal 
structure. Although how far the disordered zones contribute to the well-known 
phenomenon of ion beam-induced amorphization has not been resolved up to now 
(Pelaz et al. 2004), considering the absence of a significant peak, it can be predicted 
that the material has the tendency of becoming amorphous. After implantation, due 
to the self-annealing phenomenon of silicon, the material recrystallizes and mono- 
crystal peak reappears. However, the peak is wider, and also with a second peak, 
indicating that even self-annealing happens, the structure changes still exist. 


Ion Implantation Damage 

Cascade collision introduced by ion implantation induces lattice structure changes. 
According to the result of RDF analysis, the damage would not completely heal 
even after implantation. The factors affecting implantation include implantation 
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energy, ion fluence, and beam current. Implantation energy has an approximately 
linear relationship with the depth of modification layer, which could be adjusted to 
fulfill the application requirements. Beam current relates closely to ion fluence, but 
it would not be considered in the simulation, since the thermo layer is adopted to 
constrain temperature, and beam current is strictly restricted in the real experiment 
for reducing self-annealing. Figure 6 shows the vacancy and interstitial detected by 

1 O 1 o 

geometric criterion under ion implantation with ion fluence of 1 x 10 1 ", 5 x i(r~\ 

1 A O 

and 1 x 10 ion/cnr. Perfect lattice structure is processed to be semitransparent 
and the simulation border is defined. 

Ion implantation would also introduce defects in the form of clusters due to the 
cascade collision. With an increase in the implantation fluence, the number of 
defects rises correspondingly, and the defect clusters are more obvious and con- 
centrated, while the percentage of monocrystalline structure diminish accordingly. 
When ion fluence reaches to a certain extent, it can be assumed that the material has 
the tendency of becoming an amorphous state. Since the ion implantation location 
is randomly chosen, the defects can be regarded to be evenly distributed as a whole, 
and the mechanical property changes of the substrate are uniform and controllable. 


Mechanical Property Testing Simulation for Nanoindentation 
of Ion-Implanted Surfaces 

Molecular dynamics are carried out on both normal and implanted silicons for 
three-dimensional nanoindentation simulation to analyze the changes of mechani- 
cal property and the mechanism of the changes. 

For nanoindentation simulation, round diamond indenter with a diameter of 15a 

o 

(a = 3.57 A) is used to carry out constant load tests on material surfaces. The 
loading speed is 4 m/s. Due to the scale of the simulation, indentation depth is set to 
be 0.6 nm. 

The indenter is set to be rigid to ensure that they would not interact with the 
testing material, which makes the simulation results closer to reality. The loading 
force is calculated using Morse potential, which is the composite force workpiece 
atoms applied on the indenter. 

In nanoindentation, diamond indenter produces hydrostatic pressure on the 
workpiece. Single -crystal silicon has perfect diamond lattice at ambient tempera- 
ture. The hydrostatic pressure generates lattice slip and lattice disorder, which could 
be seen in Fig. 7b. Defect-free atoms in the workpiece are minimized for visuali- 
zation, and lattice defects can be observed. Layer by layer, the lattice slip extends to 
the scale of the indentation depth. As lattice slip extends to the threshold, the 
indentation process steps into an equilibrant state; lattice deformation only exists 
around the indenter, as shown in Fig. 7c. 

After ion implantation, the lattice structure of the surface layer of the mono- 
crystalline silicon has been transformed due to the ion bombardment. The silicon 
atoms are no longer arranged in an ordered structure. The internal damage created 
by ion implantation acts as the core to absorb the indenter-induced energy and 
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(f) equilibrant cutting of modified silicon 
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Fig. 8 Nanoindentation 
load-displacement 




expands during the indentation process which prevents lattice slip and dislocation 
from carrying out. 

As the indenter gradually presses deep into the material, the damage inside the 
workpiece disappears due to self-relaxation. The lattice deformation only occurs 
around the diamond edge. The process is shown in Fig. 7. Figure 7a-c illustrate the 
indentation process of the monocrystalline silicon, while Fig. 7d and e show the 

1 O 

structural changes with modified silicon. Low ion fluence of 1 x 10 ion/cm was 
chosen to illustrate the phenomenon. The ion-implanted silicon reduces the resis- 
tance against load. The brittleness is reduced and the elasticity is enhanced. 

The load-displacement curves are shown in Fig. 8. Compared with monocrys- 
talline material, the silicon surface modified by ion implantation requires smaller 
loading force when reaching identical loading depth. Therefore, the mechanical 
property of implanted silicon is improved and the hardness is reduced. With an 
increase in the implantation ion fluence, the loading force decreases correspond- 
ingly, indicating that more lattices are transformed to amorphous structures and the 
mechanical property is enhanced more significantly. The displacement discontinu- 
ity (pop-out) of single-crystal brittle material disappears by observing the unload- 
displacement curves, illustrating that the brittleness of the material is reduced and 
the plasticity enhanced after ion implantation, which is beneficial for generating 
damage-free surface finish. Nanometric cutting of the single-crystal brittle material 
after ion modification shares a similar mechanism. 


Ion Implantation Experiment 

Ion implantations are carried out on monocrystalline silicon in order to prove the 
validation of the method, choosing implantation depth correspond to the 
manufacturing scale and low ion fluence to minimize defects quantity. The crystal- 
to-amorphous transition is examined by transmission electron microscopy (TEM), 
and mechanical property changes are examined using large penetration depth 
nanoindentation. 
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Ion Implantation 

Ion implantation are performed on (lOO)-oriented monocrystalline silicon samples 
at room temperature using a van de Graaff accelerator, with an implantation angle 
of 7° respective to the surface direction to minimize channeling effects. Specimens 
are bombarded under vacuum at an energy of 10.0 MeV and a fluence of 1 x 10 , 

IQ 1 A Q 

5 x lO 1 ", and 1 x 10 ions/cnr. Implantation angle of 7° to the surface direction is 
adopted to minimize channeling effects. 


Focused Ion Beam Preparation of TEM Samples 

Cross-sectional transmission electron microscopy (TEM) sample lamellae is pre- 
pared to study the structural changes of the modification layer using a focused ion 
beam (FIB) system equipped with an in situ Kleindiek rotational nanomotor. The 
angle between the ion and electron beam is 52°. Electron beam is used for real-time 
accurate observation during the operation which introduces least damage to the 
sample. 

The energy and current of both beams (ion beam and electron beam) are 
optimized to reduce obvious specimen damage during TEM specimen preparation. 
The accelerating voltage and current of the incident e-beam is maintained at 5 kV 
and 98 pA, throughout the entire process. An incident ion beam accelerating voltage 
of 30 kV, with an ion current ranging from 0.5 nA to 10 pA, is used during the FIB 
milling process. 

The specimen preparation follows the in situ H-bar method, which is more 
efficient and flexible than the conventional technique (Mayer et al. 2007) for 
improving the yield and reducing the damage during FIB TEM specimen prepara- 
tion. The implanted surface layer is mechanically trimmed to about 10 pm and 
glued on the copper grid. The TEM copper grid is then clamped by nanotweezer and 
attached to the rotational nanomotor, as shown in Fig. 9. 

'"7 

The nanomotor can be rotated by 360° continuously with 10 rad resolution, 
which is used for tilting the lamellae surface vertical to the ion beam inside the FIB 
chamber. 

Before thinning, a Pt line should be deposited on the surface to protect the top 
portion of the specimen from Ga ion implantation. For e-beam deposition intro- 
duces least damage to the specimen (Decoster and Vantomme 2009), electron 
beam-induced Pt deposition is initially used to deposit a thin film (about 35-50 nm). 
Then, focused ion beam-induced deposition (30 kv, 30 pA) is used to further deposit 
a thick film (about 500 nm) for sufficient protection from the subsequent FIB cross- 
section process. After cutting two deep trenches on either side of the lamella, the 
specimen is less than 100 nm thick, which is ready for the TEM observation, as 
shown in Figs. 10 and 11a. The TEM specimen is then characterized using a TEM at 
acceleration voltage of 200 kV. 

Figure lib shows TEM and SAD patterns of the modification layer, showing the 
implantation effect on lattice structure. Both halo rings and diffraction spots can be 
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Fig. 9 SEM image of the 
nanotweezer clamping 
sample inside the FIB 
chamber (Chen et al. 2012) 




Fig. 10 Cross-sectional specimen been milled to 100 nm thick 

seen, but the spots are weak, indicating the structure is almost amorphous. 
The critical influence for amorphous is determined by ion energies when ion 
mass is settled (Decoster and Vantomme 2009). When the ion energy reaches 

14 9 

10 MeV, 1 x 10 ions cm ~ reaches the critical influence. In the ion resting 
layer, ions are energy free when reaching the resting layer and the lattice structure 
in this region is influenced by the resting ions. The structural changes are smaller 
than the modification layer. The substrate is not influenced by the implantation. 

After implantation, the surface layer of the silicon is modified to a depth up 
to 4 pm, as shown in Fig. 12. Cross-sectional TEM results are shown in Fig. 13. 
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Fig. 1 2 Ion implantation 
results 






Fig. 13 Crystal stmcture with implantation fluence of (a) 1 x 10 13 (b)5 x 10 13 and(c) 1 x 10 14 ion/cm 2 

After modification, the sample exhibits different crystal structures according to the 
implantation fluence. Different crystallographic orientations can be seen on differ- 
ent parts of the TEM image in Fig. 13a, indicating that the perfect crystal structure 
has been divided into several parts. In Fig. 13b, when the ion fluence increases to 
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Fig. 14 Load-displacement 
curves of the indentation tests 



IQ Q 

5x10 ion/cnT, crystal structure could only be observed in a few places. As the 

1 A Q 

ion fluence rises to 1 x 10 ion/cm , the sample becomes significantly amorphous, 
and the crystal structure totally disappears, as shown in Fig. 12c. 


Nanoindentation and Nano-scratching Analyses 

Nanoindentation measurements are carried out with a nanoindenter using high- 
precision Berkovich diamond tip to measure the mechanical property changes of 
the implanted materials. Surface oxide layer is removed by hydrofluoric acid to 
exclude the influence. Tests are conducted under indentation penetration depth of 
3 pm to minimize the influence of the bulk material and measure the mechanical 
property at nano -manufacturing scale. The results are shown in Fig. 14, which show 
that the actual surface modification effect by ion implantation is more obvious 
comparing to the simulation results. With the same displacements, monocrystalline 
silicon requires a larger load (up to 130 mN) to reach the identical penetration 
depth. Displacement discontinuity (pop-out) is observed in unloading curves of the 
silicon, which is related to the density change caused by the high-pressure phase 
transformation (Fang et al. 2007). While implanted silicon exhibits a smooth curve, 
indicating no phase transformation occurs during unloading, as the phase transfor- 
mation already takes place during the implantation process. With an increase in the 
implantation fluence, the hardness decreases accordingly. It should be noted that the 
loading force reduction rate is not in direct proportion to the ion fluence, but getting 
smaller when the fluence is relatively large, indicating the degree of the amorphous 
becomes saturated, close to the reduction limit. 

Figure 14 shows the load-displacement curves of the indentation tests. With the 
same displacements, normal silicon requires a larger load (up to 130 mN) to reach 
the identical penetration depth, but the implanted silicon only needs about 35 mN. 
Displacement discontinuity (pop-out) is observed in unloading curves of normal 
silicon, which is related to the density change caused by the high-pressure phase 
transformation, while implanted silicon exhibits a smooth curve, indicating no 
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phase transformation occurs during unloading, for phase transformation have 
already been carried out during implantation process. 

After ion implantation, the structure of the monocrystalline silicon’s surface 
layer turns amorphous due to the ion bombardment. The bonding force between 
atoms greatly reduced, while the Si-Si bonding has even been broken. Therefore, 
the implanted silicon performs a different mechanical behavior. The resistance 
against load was lowered, the brittleness reduced, and the elasticity enhanced. 
Figure 15 shows the results of hardness and elastic modulus measurements of 
3 pm indentations on normal and implanted silicon. Depth- sensing indentation 
hardness is evaluated from the indentation load-displacement curve using the 
Oliver and Pharr method. Young’s modulus is deduced from the unload- 
displacement curve using a Poisson’s ration v = 0.25, with a Young’s modulus of 
1,141 GPa and v = 0.07 for the diamond indenter. 

Surface topography after nanoindentation can be observed by SEM, and the 
results are shown in Fig. 16. After ion implantation, the pressure mark exhibits 
smoother surface and the indenting area shows smaller brittle cracking after 
indentation. From the results, it is predicted that ion injection offers a possibility 
of reducing surface roughness and tool wear in micromachining process. With an 
increase in the implantation fluence, the hardness decreases accordingly. It should 
be noted that the loading force reduction rate is not in direct proportion to the ion 
fluence, but getting smaller when the fluence is relatively large, which indicates the 
degree of the amorphous becomes saturated, close to the reduction limit, as shown 
in Fig. 17. 

In situ nanomechanical test system is used to carry out the nano- scratching tests 
with Berkovich diamond tip. Surface topography after nano -scratching is observed 
in situ AFM scanning, and the results are shown in Fig. 18. After ion implantation, 
the scratching mark exhibits smoother surface and smaller brittle cracks. The lateral 
force measured during scratching shows similar results as nanoindentation. 
According to the results, it is predicted that ion implantation offers a possibility 
for the high-quality nanomachining process through enhancing the plasticity of the 
brittle monocrystalline silicon. 

Ion implantation surface modification provides a novel approach for 
manufacturing brittle monocrystalline materials. The mechanical properties of the 
surface layer of the material are modified in a depth up to a few micrometers by 
decrystallization using high-energy ion bombardments; the brittleness is reduced 
and the plasticity is enhanced. The productivity is greatly improved by avoiding the 
fractures during the surface processing. 


Potential Developments 

Since the ion implantation surface modification method for ultra-precision machin- 
ing has been proved to be valid, applications of the method for actual production 
need to be considered. Low-cost and large-implanted area should be taken priority. 
Figure 19 shows various radiation methods and their damage to the substrate. 
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Fig. 17 Nanoindentation 
results of different ion fluence 




Fig. 18 (a) Nano -scratching 
results: in situ AFM 
of the scratching. 

(b) Nano- scratching results: 
later force 
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ig. 19 Schematic diagram of the most relevant radiation damage processes in glasses 
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Table 2 Test results for gamma irradiation 


Brittle-ductile 

transition 

Original 

600Gy 

5,000Gy 

lOkGy 

50 kGy 

n-type 1 

98.369 nm 

136.415 nm 

— 

147.489 nm 

133.388 nm 

n-type 2 

123 nm 

— 

133.4 nm 

— 

— 


Fig. 20 Taper cutting results 
of silicon after gamma 
irradiation (a) AFM 
measurement of the 
ductile-brittle transition point, 
(b) cross-section of 
ductile-brittle transition point 


a 


400.0 nm- 
0.0 nm - 


b 



20 pm 



After neutron irradiation, it should be waited for a half-life of 3-6 months due to the 
activation phenomenon, which is a great obstacle to the productivity. Electron 
irradiation and gamma irradiation have been applied to industrial production. The 
effectiveness of the gamma-ray irradiation is taken into consideration first. 


Gamma Irradiation 

It is found that gamma irradiation can only generate effective modification in n-type 
silicon. Ultra-precision turning is used to conduct taper cutting in order to test the 
brittle -ductile transition of gamma-irradiated silicon. The test results are examined 
by atomic force microscope. Nanoindentation results show that only the low-doped 
n-type (100) silicon wafers have a significant reduction of the modulus (Table 2). 

Test results of atomic force microscope on the brittle -due tile transition point are 
shown in Fig. 20, which is the low-doped n-type (100) silicon with 600 Gy gamma 
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Table 3 Test results for low-energy swift ion implantation 


Ion fluence 

5 x 10 14 

5 x 10 14 

1 x 10 14 




Ion 

Hardness 

Modulus 

Hardness 

Modulus 

Hardness 

Modulus 

H + 

12.73322 

177.2188 

12.83681 

173.7482 

12.82986 

177.9358 

He 

12.29011 

176.6287 

12.2996 

176.1936 

12.43458 

176.3419 


irradiation. It can be seen from the figure that the brittle-ductile transition depth is 
136.4 nm. 

However, since the changes only occur in some types of silicon, this result may 
not be valid to prove the effectiveness of gamma irradiation. The reason for this 
phenomenon is considered to be the doping on silicon wafers; some doped com- 
position can be activated by gamma irradiation, resulting in internal lattice structure 
changes, and the mechanical properties change; the silicon atom itself has not 
received much impact. For the widely used high resistance silicon which does not 
exist doped, gamma irradiation effect is not obvious. 

Therefore, it is concluded that gamma irradiation surface modification for 
certain types of silicon can be effective. 


Low-Energy Swift Ion Implantation 

Low-energy ion implantation experiments are conducted on high-resistivity silicon 
to verify the validity of this method. 

Industrial ion implantation machine can be used for low-energy swift ion 
implantation for a large area to improve the implantation efficiency. The implan- 
tation beam current is 40 pA/s, and the implantation can be completed in a short 
time. The results of H + and He ions into silicon of nanoindentation and mechanical 
properties testing are shown in Table 3. 

Low-energy ion implantations have no obvious influence on the surface mechan- 
ical properties of materials regardless of hardness or modulus. The phenomenon 
from the principle of low-energy ion implantation of the material could be 
predicted, where the damage is mainly due to nuclear energy loss, while the high- 
energy ion implantation damage is dominated by the electronic energy loss. In order 

1 A 

to achieve the effect F-ion 10 MeV 1 x 10 ion/cnr introduced, low-energy light 
ion implantation requires a higher dose, even multi-implantation. 


Multi-implantation 

With single low-energy light ion implantation, the damage layer stays from the 
surface with a depth of 2-3 pm. Figure 21 shows the SRIM simulation results of N 
ion with 550 keV nuclear energy loss. 

It is believed that through a combination of different energies, different fluences, 
and different particles, multi-implantation can be flexible and controllable to 
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Fig. 21 SRIM simulation of N implantation damage on silicon substrate 
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Fig. 22 Example of multi-implantation design 


achieve the uniform damage layer into the material surface. The following is an 
example of multi -implantation simulated by the nuclear energy loss, with the 
comparison with high-energy F-ion implantation (see Fig. 22). 

14 9 

In the example, the F-ion 10 MeV 5x10 ion/cnr nuclear damage is set as the 
target for multi-implantation simulation. He and B ions are used in the simulation to 
form a superimposed damage. The results show that the nuclear damage by four 
times implantation could meet the requirements for a uniform implant layer. 
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Nanometric Cutting Approach 
Introduction 

As nanometric cutting involves changes in only a few nanometers from the surface, 
it is difficult to observe the machining process and measure the cutting force 
accurately. Molecular dynamics (MD) simulation is developed as an effective 
way for studying the material removal process at nanometric scale. 

In nanometric machining of monocrystalline materials, cutting force is predict- 
able if the change in crystallographic orientation of the substrate material with 
respect to the cutting direction is known. However, little is known about the reason 
for the cutting force vibration which is an important factor in tool wear. Few studies 
were done on the chip formation mechanism from the view of cutting force. 


Nanometric Cutting Experiments 

High-energy ions bombard the silicon surface and initiate defects production. When 
fluence is sufficient, c-a transformation even occurred (Fig. 23b and c). In this 
process, the modification layer is formed. Ions lost energy in collision and finally 
rest in the certain depth of about 4.5 mm in the substrate shown in Fig. 23a. 


Taper Cutting 

Taper cutting experiments with a continuous change in depth of cut are performed 
on ion implantation-modified silicon to examine the ductile -brittle transition on the 
modified materials. Depth of cut is changed from 0 to 2 pm at a constant cutting 
speed of 400 mm/min. The experiments are carried out on an ultra-precision turning 
machine. Single-crystal diamond tools are specified with a nose radius of 
0.513 mm, a rake angle of 0°, and clearance angle of 12°. The taper-cut grooves 
are scanned by AFM. Topographic details of ductile-brittle transition depth are 
measured in Fig. 24. Modified silicon performs a ductile-brittle transition point of 
923.6 nm, much deeper than normal silicon of 236 nm (Fang and Venkatesh 1998). 


Nanometric Surface Generation and Tool Wear 

Experiments are carried out to generate nanometric surface and examine the tool 
wear for nanometric cutting of ion implantation-modified silicon. The workpiece is 

1 A m O. 

pre -modified by F-ions with the implantation parameters of 1 x 10 ion/cm ~ in 
ion fluence and 10 MeV in energy. The machining parameters of the depth of cut, 
federate, and cutting speed are 1.0 pm, 1 mm/min, and 1,500 r/min, respectively. 
The single-crystal diamond tool has a rake angle of 0°, a clearance angle of 12°, and 
a nose radius of 1.0 mm. 


Surface 
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(c) Implantation results: electron diffraction pattern (Fang et al. 2011) 
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Fig. 24 Taper cutting results of ion implanted silicon (a) AFM measurement of the ductile-brittle 
transition point, (b) cross-section of ductile-brittle transition point (Fang et al. 2011) 


Mirrorlike surface finish is achieved, as shown in Fig. 25. Figure 26a and b 
shows the machined surface. The feed marks on the machined surface are clearly 
displayed, and there is no existence of apparent fractures and micro cracks. The 
mean surface roughness is measured to be 0.861 nm in Ra. The machined surface 
results demonstrate a ductile material removal mode. 

Figure 26c is an SEM photograph of the cutting edge of the diamond tool after 
ductile mode cutting of ion implantation surface-modified silicon for 6.5 km. A 2 pm- 
wide flank wear with micro grooves is formed. The flank wear is smooth and uniform 
along the entire cutting edge, indicating that the tool wear has been a stable and 
gradual process. The surface roughness kept almost constant at a low level during the 
6.5 km cutting, which is much longer than normal silicon (Fang et al. 2007). 


Microstructure Generation 

Fracture-free micro structures are achieved, as shown in Fig. 27. Cutting depth is 
more than 400 nm, which is much deeper than the brittle-ductile transition point of 
monocrystalline silicon. The results demonstrate a ductile material removal mode. 
The feed marks on the machined surface are clearly displayed, and there are no 
apparent fractures and micro cracks, ductile chips remain on the edge of the cross 
cutting. Diamond tool shows no obvious wear on sharp tips after the machining 
process, as shown in Fig. 28. 


Processing with Other Materials 
GaN 

Taper cutting with maximum cutting depth of 1 pm is carried out on GaN surface. 
The results are shown as Fig. 29. GaN is hard and brittle, normally with a shallow 
brittle -ductile transition point of about 176 nm. 
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Fig. 26 (a) Cutting results: machined surface by SEM (Fang et al. 2011). (b) Cutting results: 
machined surface by AFM and tool wear of the cutting edge by SEM (Fang et al. 201 1). (c) Cutting 
results: machined surface by SEM (Fang et al. 2011) 

It is found that of GaN brittle-plastic transition depth has a close relationship 
with cutting speed. The transition depth would increase when the cutting speed 
reduces. Figure 30 shows the result of decreased cutting speed taper cutting and the 
corresponding AFM results. It can be seen that the brittle-ductile transition depth is 
234 nm and the surface fracture is significantly reduced. 

Nanoindentation is also carried out to measure the microhardness of GaN and to 
give references for the nano-cutting process. Measured average hardness is 
20.248GPa. Figure 31 shows the loading and unloading curve in the indentation 
process. It can be seen that GaN have the mechanical properties similar to mono- 
crystalline silicon, with brittle-plastic transition phenomenon in the loading process 
(the pop-in phenomenon). 

The loading and unloading curve is compared with <100> crystal plane p-type 
high-resistance silicon nanoindentation results in Fig. 32, with average hardness of 
12.992 GPa and modulus of 170.617 GPa. The hardness and modulus of Si are 
significantly less than that of GaN. This shows that the GaN would cause more 
severe tool wear than that Si but have better plasticity from the unloading curves. 








38 Nanometric Cutting of Crystal Surfaces Modified by Ion Implantation 


1457 







1458 


Y. Chen and F. Fang 






38 Nanometric Cutting of Crystal Surfaces Modified by Ion Implantation 


1459 


Fig. 29 Taper cutting results 
of GaN 
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As shown in Fig. 33, single-point diamond turning generates surfaces with 
obvious tool marks and brittle fracture. It has small possibility of achieving good 
surface quality while tool wear is serious, which makes the material not suitable for 
direct cutting. Ion implantation offers a way to reduce the surface hardness. 

Experiments of F-ions implanted into the GaN surface with a dose of 5 x 10 14 
ion/cnrT and energy of 6 MeV are carried out. A nanoindenter is used to measure the 
surface properties, as shown in Fig. 34. Modified GaN has a hardness and elastic 
modulus for the average of 22.719 and 274.877 GPa, respectively. The primitive 
GaN in the same test conditions result in 20.248 GPa for hardness and 294.627 GPa 
for modulus. The reduction of hardness is not obvious, but the modulus is, indicat- 
ing that the resistance of the material on the cutting process significantly reduced. 

The pop-in phenomenon upon loading disappears. This shows that the brittle 
behavior of materials is suppressed, which would be very beneficial in reducing the 
occurrence of tool wear during cutting. Moreover, the slope of unloading curve of 
the materials under the same load to 8,000 pN elevated, that means, plasticity of the 
material increases, and the material is easier to be processed. 


GaP 


The mechanical property of GaP is first tested using taper cutting. A single-point 
diamond tool with a nose radius of 0.5 mm is used for the taper cutting experiments. 
AFM and SEM are used to observe the brittle -ductile transition zone. The ductile- 
brittle transition point is measured to be 159.2 nm. 

Ion implantation of GaP has been carried out; considering the larger density of 

i r ^ ^ O 

material, Cl ions of 1 x 10 ion/cnrr and Si ions of 5 x 10 ion/cnrr are implanted. 
After implantation, the brittle-ductile transition depth is observed, and it is 
found that the implanted material has better cutting performance. The AFM test 
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Fig. 30 SEM and AFM images of GaN transition point with decreased cutting speed 
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Fig. 31 Load curve of GaN with pop-in 
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Fig. 32 Loading and unloading curves of GaN and silicon 


Fig. 33 Single-point 
diamond tool cutting of GaN 
surfaces 



results show that the two different kinds of particles enhance the brittle-ductile 
transition depth of more than 50 nm (211.346 nm for Cl ions and 203.902 nm for 
Si ions). 

Micro structures are machined on GaP using flying cutting as shown in Fig. 35. It 
can be seen that good surface quality can be achieved and surface modification 
using ion implantation improves the cutting efficiency. 
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Fig. 34 Loading and unloading curves for modified and primitive GaN 



Fig. 35 Mechanism of fly cutting of grooves (Fang and Liu 2004) 


LiNb0 3 

LiNb0 3 is an important optoelectronic material due to its suitable electronic and 
acousto-optic constants, high chemical and mechanical resistance, high Curie 
temperature, and large single domain crystals which are available at low costs. 

LiNb0 3 has a brittle nature, which prevents it from mechanical manufacturing. 
Traditionally it is fabricated by chemical etching, ion bombardment, etc., which are 
time-consuming and with low efficiency. 

A bare possibility in LiNb0 3 of ductile chip removal and mirror-finished surface 
can be obtained by micro machining using highly accurate machine tool under 
vibration-free operation. Ion implantation is a superb method for modifying the 
surface properties of materials since it offers accurate control of dopant composi- 
tion and structural modification at any selected temperature. 
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Fig. 36 Thermal annealing results 


Surface Annealing 

Machining would inevitably cause subsurface damages such as phase transforma- 
tions, dislocations, and fractures. It is the depth and nature of the subsurface 
damages that influence the mechanical, optical, and electronic performances of 
silicon products (Yan et al. 2005). Surface-modified silicon, for the same reason, 
needs to be annealed to recover the damages. 

Many factors could influence the annealing results, such as the temperature, 
annealing time, heating rate, and even the environmental pressure. Simulation and 
experiment are both needed for studying the phenomenon. Several methods could 
be applied for the damage recovering process, such as thermal annealing and laser 
radiation annealing. Thermal annealing is widely used as it is efficient and easy to 
control. 

Thermal annealing was carried out on (100) CZ silicon implanted with 
B (200 keV, 500 keV) and He (350 keV, 500 keV) ions. Ion fluences are 

i c o i c o 

1x10 cm and 2x10 cm , respectively. Experiment parameters and result 
are shown in Table 2 and Fig. 36. Structural recovery is measured using changes of 
bulk resistivity. Implanted ions are blocked at interstitial positions. Thermal 
annealing allow interstitial ions enter the substitutionally points which could reduce 
the resistivity. 

The results show that temperature is more influential than time. The resistivity 
descending speed is quite slow when the temperature increases. It is because the 
stable structure which ion implantation induced requires high temperature for 
annealing. Longer time can reduce resistivity as well, but it is not as effective as 
temperature. As time increases, only the damages which can be decomposed at 
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Table 4 Parameters for low-energy swift ion implantation 


Ion 

Energy 

Ion fluence 



H + 

500 keV 

5 x 10 14 40 |iA 

5 x 10 15 40 \iA 

1 x 10 16 40-50 |iA 

He 

550 keV 

5 x 10 14 30 \\A 

5 x 10 15 40 \iA 

1 x 10 16 45-60 \iA 



0 12 3 4 


Time(h) 

Fig. 37 Thermal annealing results 


annealing temperature disappear; resistivity reduces more slowly with time. In 
addition, it will form more stable damage structures when temperature is not high 
enough (as 300 °C annealing in Fig. 36). This phenomenon indicates that simple 
defect (e.g., Frenkel pairs) can move and accumulate. The new stable damages will 
survive the annealing and then absorb more defects, which increases the resistivity. 

Another group of experiments was also carried out, aiming to study how 
temperature change rate (TCR) and multiple- stage annealing influence resistivity. 
The parameters and result are shown as Table 4 and Fig. 37. 

For the same annealing condition (30 min at 400 °C), small TCR (~2 °C/min, 
furnace cooling) has better effect. While temperature is increasing slowly, damages 
have enough time to be dissolved and annihilated at each “temperature stair” than 
large TCR (~10 °C/min). Actually, though both of them keep 30 min at 400 °C, 
small TCR process has longer annealing time which reduces resistivity more. 

Multiple -stage annealing (400 °C, 120 min + 485 °C, 60 min) has very little 
improvement compared with single 120 min, 485 °C, annealing. Multiple-stage 
annealing spends more time at 400 °C, so it can dissolve more damages before 
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reaching 485 °C. But the maximum temperature has not been changed. In other 
words, they have the same recovery capability. So the final resistivity doesn’t 
change obviously. Only time delay cannot compensate temperature difference. As 
time increasing to 3 h and 4 h at 400 °C, resistivity reduces slowly. This is 
consistent with the analysis above. 

Appropriate high temperature and long time are beneficial for better annealing, 
while the latter may have negative effect at low temperature. Temperature is more 
influential and cannot be compensated only through annealing time. Reduce the 
TCR (temperature change rate) if possible. Multiple-stage annealing has no 
obvious improvement to the result; however, it may be combined with laser 
annealing. 


Applications 
THz Fresnel Lens 

A plethora of significant advances have been seen in materials diagnostics by 
terahertz (THz) systems, as higher-power sources and more sensitive detectors 
open up a range of potential uses. Applications including semiconductor and 
high-temperature superconductor characterization, tomographic imaging, label- 
free genetic analysis, cellular level imaging, and chemical and biological sensing 
have thrust terahertz research from relative obscurity into the limelight. 

The terahertz region of the electromagnetic spectrum has proven to be one of the 
most elusive. Terahertz radiation is loosely defined by the frequency range of 
0.1-10 THz (1,012 cycles per second). Being situated between infrared light and 
microwave radiation, THz radiation is resistant to the techniques commonly 
employed in these well-established neighboring bands. High atmospheric absorp- 
tion constrained early interest and funding for THz science. The past 20 years have 
seen a revolution in THz systems, as advanced materials research provided new and 
higher-power sources, and the potential of THz for advanced physics research and 
commercial applications are demonstrated. Terahertz technology is an extremely 
attractive research field, with interest from sectors as diverse as the semiconductor, 
life science, manufacturing, space, and defense industries. 

Lenses are basic elements in an optical imaging system. In imaging and 
THz-TDS technologies, THz wave focusing and collimating have relied mainly 
on parabolic mirrors, silicon lenses, and polyethylene lenses. However, for a THz 
beam, it is impossible to fabricate lenses with short focal lengths and large numer- 
ical apertures by using silicon or polyethylene. For two-dimensional CCD THz 
imaging, it is very difficult to obtain a high-quality THz image on a ZnTe sensor by 
using parabolic mirrors because of their aberrations and the difficulty of alignment. 
However, fabricating a large numerical aperture, THz binary lens with a short focal 
length is possible by using modern planar micro-fabrication technology, which 
makes such lenses attractive for THz imaging applications. Binary lenses are much 
lighter and more compact than conventional THz optics and lend themselves to 
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Fig. 38 Binary lens in T-Ray 
system (Wang et al. 2002) 




Binary lens 




Probe beam 



CCD 


Table 5 Calculated and 
measured diffraction 
efficiency of T-ray binary 
lenses (Wang et al. 2002) 


Phase level L 


n 

2 

4 

8 

Al zone plate 

Theory 

41 

81 

95 

41 

Exp. 

11 

75 

90 

38 


unique beam forming, and so they may see increasing usage in the future. The 
system is shown in Fig. 38. 

The diffraction efficiency increases rapidly with the number of phase levels. As 
shown in Table 5, for a binary lens with L = 8, the diffraction efficiency 
reaches 95 %, in contrast with an Al zone plate or a 2-level lens, which has a 
41 % efficiency. 2-Level, 4-level, and 8-level lenses, with a 30 mm diameter 
consisting of a total of 14 zones, are fabricated on silicon wafers by ion 
etching. Silicon has a refractive index of 3.42 in the far infrared region from 0.5 
to 1.5 THz; therefore, the required etch depth, l/[L(nTHz— 1)], is small and a thin 
binary lens can be fabricated. The lenses are designed for 1 THz with a focal length 
of 25 mm. 

Figure 39 shows the figures of the three binary lenses and of the corresponding 
THz intensity distribution on their focal plane. The level numbers of these lenses 
are 2, 4, and 8, respectively. They are designed to have a focal length of 25 mm at 
1 THz. The diameter of each of these lenses is 3 mm. Comparing with the THz 
distribution on the focal plane for the three binary lenses, the focused THz intensity 
increases with the level number. As the level number of the binary lens increases, a 
smaller focal area and sharper focused THz peak can be observed. This corresponds 
to the enhanced diffraction efficiency, which increases dramatically with an 
increasing level number of a binary lens. 

If the lenses could be fabricated by diamond turning with continuous level, then 
the efficiency would reach maximum. The fabrication can be achieved on silicon by 
ion implantation modification. 
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2 level T-Ray Binary lens 


^ 0.2 
=3 


cc 




4 level T-Ray Binary lens 




8 level T-Ray Binary lens 



Fig. 39 The photos of the THz binary lenses and the THz wave intensity distribution on their 
focal plane (Wang et al. 2002) 


GaP Micro-pyramid 

Optical rectification effect is a special kind of nonlinear optical effect, which is the 
interaction of ultrashort pulse laser and nonlinear media arising from the process of 
low-frequency electric field (THz). This low-frequency electric field can produce 
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ultrafast electromagnetic radiation. Optical rectification and the conversion effi- 
ciency are mainly dependent on the material nonlinear coefficient and phase 
matching conditions. 

Optical rectification method is an important way to generate broadband THz 
wave. THz time-domain spectrum technology (THz-TDS) is an ideal source of 
radiation to obtain the order of sub-picosecond THz pulse by using the 
nonlinear crystal optical rectification effect. The limit for THz wave to apply 
into application lies in the insufficient transmit power of the existing THz wave 
source. 

To improve the power of THz radiation, researchers have carried out mass 
studies such as the use of large diameter transmitter, the use of high-power narrow 
pulse femtosecond laser pump source, and the use of quasi-phase-matching method 
to extend the distance in the crystal. All these methods are effective to improve the 
THz output energy. It is also noted that in these experiments, there are still some 
problems for THz-coupled output and transmission, i.e., the nonlinear crystal for 
the transmitter and the refractive index in the THz range generally very high, such 
as GaP crystal and the LN crystal have the refractive index of 3.3 and 5.2, 
respectively. Thus, the Fresnel reflection loss could be up to 28.6 % and 45.9 %. 
As a result, not only the transmission power is restricted, but also the multiple 
reflections of the THz wave form a sub-peak, which causes interference in the 
frequency domain. 

GaP crystal has a large effective nonlinear coefficient and the THz band absorp- 
tion is small. It is widely used in ultrafast laser pulses of 1 pm-band optical 
rectification to generate THz wave. However, due to its high refractive index in 
the THz range, the ordinary massive transmitter crystal-air interface Fresnel reflec- 
tivity is high. The THz cannot effectively be coupled and output. In order to reduce 
reflection, anti-reflection coating is used, but this only applies to narrow-band THz 
wave source. 

Micro-pyramid structure of GaP can effectively couple THz wave output. The 
residual pump light in the exiting surface is scattered and greatly reduce the residual 
pump light. 

Studies have shown that the use of sub-wavelength scale, one-dimensional or 
two-dimensional protruding micro structure consisting of graded refractive index 
layer, can be used to reduce the THz reflectivity. This structure actually constitutes 
an effective refractive index gradient of the coupling layer: 





where n s is the refractive index of the substrate material; n h is the refractive index of 
the surrounding environment, for the transmitter in the air of n b = 1 ; and p is the 
area occupancy rate of each layer of base material. For pyramid micro structure 
layer, it is assumed that the micro structural layer thickness is h and, in a vertical 
position, z. The layers of equivalent refractive index are expressed as 
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Fig. 40 Designed GaP 
micro-pyramids 



tleff = \f 1 + («? - i) X (1 -z/h) 2 (2) 

The principle of THz emitter crystal coupling is applied to the design of micro- 
pyramid emitter in order to facilitate the high transmittance output THz wave 
generation. The micro-pyramid structures are shown in Figs. 40 and 41. 


LiNb0 3 Grating 

In order to improve the efficiency of THz wave generation, more effective 
nonlinear medium is adopted with higher peak power of the pump light pulse to 
excite the crystal. In traditional method, the nonlinear crystal has an effective 
nonlinear coefficiency that is not big enough, and the common transmitter shows 
low-order nonlinear absorption. It is difficult to use higher-power laser systems as 
pump source which limits the THz output power. It is suggested that by tilting the 
pulse non-collinear phase, optical rectification could realize in the crystal the 
non-collinear phase match. 

Its basic structure tilts the incident femtosecond pulse intensity wave front using 
the grating with appropriate parameters, making the pump light speed in the 
direction of the wave vector component in the crystal and the THz wave velocity 
match. LiNb03 (LN) is the most common choice with effective nonlinear coeffi- 
cient of 168 pm/V, which is the largest in the common inorganic nonlinear crystal. 
LN has high resistance to damage, which could efficiently emit THz even in high- 
power pump conditions. 

However, due to the nonlinear transmission of the generated THz wave and pump 
light in the crystal, it is difficult to observe and collect the THz wave; on the other 
hand, the LN crystal refractive index is higher, and the reflection loss of THz wave 
generated in the exit end of the Fresnel reflection is large and the efficiency is low. 
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Fig. 41 Fabricated GaP micro-pyramids 


It is necessary to design to avoid the wave front aberration and to increase the 
THz wave projection efficiency and simplify the structure of the THz transmitter. 
Compared to the traditional collinear phase matching optical rectification technol- 
ogy, by using LN crystal, the pump intensity output would be enhanced. 


Summary and Further Developments 
Summary 

A new method has been demonstrated to improve the diamond cutting process and 
overcome the shortcomings of conventionally machining monocrystalline silicon. 
From the study, the conclusions can be drawn as follows: 

1 . Ion implantation surface modification can effectively achieve ductile machining 
brittle optical materials and efficiently inhibit surface fracture and tool wear in 
the machining process. 

2. The validity of this novel is proved by both theory and experiments. 
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3. Ion implantation surface modification ultra-precision machining makes it pos- 
sible to generate intricate features and optical quality surfaces on brittle optical 
materials which have a wide range of applications. 


Further Developments 

Ion implantation surface modification of brittle crystal materials for nanometric 
cutting just developed for a few years. Further work should be done on the 
mechanism and applications. 

One example of further application is X-ray lenses (Chao et al. 2005). 

Analytical tools that have spatial resolution at the nanometric scale are indis- 
pensable for the life and physical sciences. It is desirable that these tools also permit 
elemental and chemical identification on a scale of 10 nm or less, with large 
penetration depths. A variety of techniques in X-ray imaging are currently being 
developed that may provide these combined capabilities. 

A technique to produce diffractive X-ray lenses optimized for high heat load 
applications is made from monocrystalline silicon membranes, which have uniform 
thermal conductivity and homogeneous thermal expansion. Silicon Fresnel zone 
plates with an outermost zone width down to 30 nm are produced by means of 
electron beam lithography and reactive ion etching. They are tested in conventional 
synchrotron light sources in a full-held transmission X-ray microscope and in 
a scanning transmission X-ray microscope. High resolution in combination 
with ability to withstand high heat load radiation make them a potential 
candidate for the new generation of X-ray sources based on the free electron laser 
principle. 

The fabrication of diffractive X-ray lenses can be achieved on ion implantation- 
modified silicon using ultra-precision turning with single-point diamond tool, 
which could enhance the fabrication efficiency and accuracy. 
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Abstract 

With the trends towards miniaturization, micro- systems, sophisticated devices, and 
miniaturized three-dimensional (3D) structures are in great demands, which stim- 
ulate the development of micro-/nano-manufacturing technologies. Micro-/nano- 
cutting is one of the most important methods in micro-/nano-manufacturing and it 
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is capable of fabricating micro structures on various materials. However, research 
and development of the micro-cutting tools largely determined the progress of 
micro-/nano-cutting technologies and their applications. As a novel fabrication 
technology, focused ion beam (FIB) direct writing is capable of fabricating the 
microtools with specific tool profile and nanometric cutting edge. In this chapter, 
various efforts to fabricate geometrically complex and sharp microtools are 
described. The fabrication techniques and their performance and applications 
are discussed. The characteristics of the FIB related to its material processing 
rates and surface morphologies are introduced. Furthermore, the machining 
technique and applications using microtools are discussed and their future devel- 
opments on microtool fabrication by FIB are provided as well. 


Introduction 

Micromachining 

Miniaturized high-precision components with manufactured features in the range of 
several to several 100 pm are increasingly in demand in various fields, including 
aerospace, biomedical, electronics, and automotive fields. Tolerance limits give a 
critical requirement for micromachining. The ratio of tolerance to feature size from 
10" J to 10"" is required strictly in the applications, which meets the functional and 
structural requirements to use the diverse materials, such as metallic alloy, com- 
posites, polymers, and ceramic. In addition, true 3D free-form surfaces are required 
with the appearance of many miniaturization applications (Fig. 1). 


Tool-Based Micromachining 

Miniaturization manufacturing would provide micro-systems which could enhance 
healthcare and economic growth in such applications as micro -channels for lab-on-a- 
chip, shape memory alloy “stents,” fluidic graphite channels for fuel cell applications, 



Fig. 1 Micro advices and micro components: (a) micro lathe (Kitahara et al. 1996); (b) micro- 
injection molded components (Courtesy of World Technology Evaluation Center report: 
“Micromanufacturing”) 



39 Micro Tools Fabrication by Focused Ion Beam Technology 


1475 



Fig. 2 Tool-based micromachining (Fang et al. 2003). (a) Experimental setup, (b) micromachining 
of embossing dies. Microtool material: stainless steel (Stavax) with hardness of HRC52. 
(c) Micromachining of biomedical parts. Work material in (b) and (c): brass with hardness of HB85 


subminiature actuators, and sensors (Corbett et al. 2000; Madou 1997; Week 
et al. 1999; Lang 1999). Due to these requirements, versatile micro-fabrication 
methods have been developed, such as LIGA (a photolithography method using a 
synchrotron), ion beam, and micro-electro discharge machining (Alting et al. 2003; 
Masuzawa 2000). However, the broad commercialization of micro-systems is mainly 
restricted by the low productivity and efficiency of manufacturing techniques till 
now. For example, usually only 2D micro structures on flat substrates can be fabri- 
cated using lithography, laser, and ion beam technologies, while efficiency is quite 
low using ion beam and micro-electro discharge. In general, materials that can be 
fabricated using lithography and micro-electro discharge are limited. 

As a traditional machining method, tool-based micromachining has played a 
critical role in microscale manufacturing with the abilities of achieving nanometric 
surface finish, high efficiency, and complicated 3D microstructures, as shown in 
Fig. 2. Unpredictable tool life and premature tool failure present a serious concern 
in micromachining. Systematic investigations of the various types of tool geometry 
have been conducted in micromachining (Fang et al. 2003). For the tool type of 
triangle-based end mills, the tool breakage could be avoided under the experimental 
conditions due to its high tool tip rigidity, unlike two-flute end mills where the tools 
are broken frequently. With the experimental study and FEM analysis, it was 
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Fig. 3 Overviews of produced microtools under optimum machining conditions: (a) ultraprecise 
tool, (b) extremely large aspect ratio microtool (Ohmori et al. 2003) 


confirmed that, for a tool diameter in the order of 0.1 mm, D-type end mills are 
more suitable for fabricating micro parts, as shown in Fig. 2b. Successful applica- 
tions of the micro semicircle-based end mills both in biomedical parts and micro 
embossing dies with feature size smaller than 80 pm have been demonstrated. 


Microtool Developments 

Microtools are the key parts for the development of micro-cutting. These tools 
would directly affect the processing accuracy, surface quality, etc. Various 
methods, including mechanical grinding, ion beam sputtering etching, thermo- 
chemical polishing, and vacuum plasma chemical polishing, have been employed 
to fabricate microscale fabrication tools. Ohmori et al. utilizing ELID grinding 
technology developed micromachine tools with various cross-sectional shapes 
and typical dimensions smaller than 10 pm, as shown in Fig. 3 (Ohmori 
et al. 2003). Thermochemical polishing is affected by several elements, such as 
temperature, grinding pressure, platen speed, etc., and the efficiency of the 
vacuum plasma chemical polishing for microtools is quite low, less than 0.25 
atomic layer per minute (Zong et al. 2007a; Zaitsev et al. 1998). The surface 
quality of microtools is closely related to the micromachining performance, part 
quality, and tool strength and lifetime. As a traditional process, mechanical 
grinding is hard in fabricating a tool with nanometric cutting edge as it has an 
intermittent impact to the tool edge. 

Focused ion beam (FIB) sputtering is adopted as an innovative method recently 
for fabricating micro-cutting tools with typical working dimensions in tens of 
microns and sharp cutting edges and complex tool configurations (Taniguchi 
et al. 1998; Russell et al. 1998; Stanishevsky 2001). The investigations of these 
tools for machining metals and polymers have been demonstrated in the researches 
with micro-milling and micro-cutting (Vietzke et al. 1997; Harriott 1995; Winters 
and Cobum 1992). This chapter would comprehensively review the progress and 
status of the microtool developed by FIB technologies. 
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Tool Fabrication Mechanism and Process 
Fundamentals of FIB Processing 
Surface Quality Fabricated with FIB 

Single crystal diamond is considered as an ideal tool material in ultraprecision and 
nanometric cutting because of its highest hardness, good heat conductivity at 
normal temperature, excellent chemical stability and wear resistance, etc. To 
understand the characteristics of the diamond, the material removal rate, surface 
morphology, and FIB sputtered form accuracy at different sputtering parameters 
should be analyzed for the preparation of fabricating micro-diamond tools by FIB 
milling. 

In FIB system, the ion beam ionized and emitted from liquid metal ion sources 
(LMIS) was accelerated, collimated, focused, and finally injected into the target 
with energies from several to dozens of kilo-electron volts. Then, a cascade 
collision occurs, in which energy and momentum are transferred from the incident 
ions to the target particles (atomic nucleus or electrons). 

How to fabricate nanometric surfaces is the fundamental requirement for the 
microtool fabrication based on FIB . It was found that not only the FIB parameters 
should be carefully controlled but also the tool material effects during FIB fabri- 
cation should be considered in order to get a smooth cutting tool face. 

The FIB milling process is performed through a precise pixel-by-pixel 
movement, as shown in Fig. 4 (Tseng 2004). Pixel spacing is the distance between 
the centers of the two adjacent pixels, which should be small enough to allow a 
proper overlapping between the adjacent pixels so that a smooth uniform profile 
can be fabricated, as shown in Fig. 5 (Fang et al. 2010). In order to have a 
uniform scanning ion flux with respect to the scanning direction, overlap should 
be set larger than 36 % (Tseng 2004). In the bitmap patterning method, the beam 


Beam 

diameter Pixel space 



Fig. 4 Schematic diagram of FIB milling with pixel overlapping (Tseng 2004) 
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Fig. 5 Normalized ion flux distribution along a scan line with normalized pixel spacing equal to 
(a) -27.3 % and (b) 36.3 % (Fang et al. 2010) 



Fig. 6 Micro-cutting tools fabricated by FIBDW with different tool material of hard alloy (a) and 

• • 

single crystal diamond (b) (Chouffani and Uberall 1999) 


overlap can be controlled by means of defining the bitmap pixel density, milling 
size, beam parameters, etc. 

For the crystalline materials such as diamond, the channeling effect (materials 
effect) would also contribute to the smooth surface developments. If the target 
material is a crystalline material where atoms are orderly arranged, incident ions 
may penetrate much deeper along the low indexed axis of crystalline. The penetra- 
tion depth can be several times of that for amorphous target. Such phenomenon is 

• • 

called ion channeling effect (Kempshall et al. 2001; Chouffani and Uberall 1999), 
as shown in Fig. 6. 

A series of experiments have been carried out to produce cavities with dimen- 

Q 

sions of 10 x 10 x 2 pm on the diamond blank top surface fabricated by FIB. 
Once ion current and ion incident angle changed, so did the sputtered surface 
roughness on the diamond substrate (Ding et al. 2008). In the experiment, if the 
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Fig. 7 Plot of Ra (nm) versus the ion beam incidence angle with changes of the ion current 
(nA) (Ding et al. 2008) 


incident angles are less than 30°, the sputtered surface would be very smooth with 
the surface roughness of Ra less than 1 nm, as shown in Fig. 7. The situation would 
change if the incident angle becomes larger than 30°. There would be a ripple 
pattern which was aligned nearly perpendicular to the direction of the incident ion 
beam. 

Redeposition Control During FIB Sputtering 

Redeposition is a very important effect in FIB fabrication, which would affect the 
surface quality developed. The FIB is used for the fabrication of the cavities with 
full edges. It was found that amorphous layers appeared on all cavity sidewalls. 
A damaged layer was evidently shown in Figs. 8a and 9a for FIB sputtering at 
0° and 30° incidence angles with ion currents of 2.2 nA and 20 nA, respectively. 
The formation of the amorphous layer could be explained as follows: firstly, the ion 
beam bombardment on the substrate surface provides momentum to the component 
atoms, leading to the emission of the atoms and polyatomic species out of the solid 
as well as the rearrangement of the atoms. Secondly, it could be due to redeposition 
of the sputtered material onto the sidewalls during FIB sputtering as shown in 
Fig. 10. Benefit to the easy excavation of a sputtered material, there was no 
amorphous layer on the sidewalls during sputtering the cavities with one opening 
edge. From Figs. 8 and 9, it is also found that the slanting incidence ion angle of 30° 
could produce a deeper cavity. 

Ion Implantation Damage During FIB 

It is noticed that the FIB milling would introduce nanoscale damage layers in the 
subsurface of machining surface, as shown in Fig. 11. The damage type might be 
point defects, amorphization, Ga ion implantations, etc., as illustrated in Fig. lib. 
The sidewall thicknesses of the amorphous layers were measured to be ranging 
from 3.5 to 16 nm and increasing with the FIB energy (Rubanov and Suvorova 
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Fig. 8 Cavity sputtered by FIB with an ion current of 2.2 nA at an ion incidence angle of 0° (cavity 

Q 

dimensions: 10x10x10 pm with a sputtering time of 3,1 12 s): (a) with full edges (after cross- 
sectioning) and (b) with one opening edge (Ding et al. 2008) 
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Fig. 9 Cavity sputtered by FIB with an ion current of 2.2 nA at an ion incidence angle of 30° 
(cavity dimensions: 10 pm x 10 pm x 10 pm with a sputtering time of 2,673 s): (a) with full 
edges (after cross-sectioning) and (b) with one opening edge (Ding et al. 2008) 

2012). It is presumed that during ion implantation Ga ions interact with the 

Q 

electrons and nuclei of the target atoms, which breaks some of its sp bonds and 
induces a large number of point defects. When the density of local point defects 
exceeds the amorphization threshold, the crystal turns into an amorphous state 
(Sergey and Alexandra 2011). 

The Effects of H 2 0 Vapor During FIB Milling of Diamond 

In order to increase the material removal rate, the FIB sputtering yield, and the 
microstructural damage, H 2 0 gas-assisted FIB milling processes are discussed in 
the following for the diamond microtool fabrications. 

When H 2 0-assisted processes are used, there is an increase in material removal 
rate. Figure 12 shows a broad range of angles with the enhancement in yield 
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Fig. 10 Sputtered material redeposited on the sidewalls (Ding et al. 2008) 
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Fig. 11 Nanoscale damage layers in the subsurface of machining surface by FIB. (a) Bright held 

2 

STEM image of a single crystal diamond sample exposed to a dose of 4.2 x 1,017 Ga/cm“ over 
the area indicated. The image has been electronically elongated in the vertical direction (see scale 
bars) to highlight the surface profile, (b) EELS profile of the implanted diamond region defined in 
(a) (Weima et al. 2001) 


depending on the specific angle. Large yield enhancements at 9 < 75° could be 
found. The enhancement in yield depends on the H 2 0 and ion/neutral flux, as shown 
in Fig. 13. Therefore, a strategy for maximizing the yield includes selecting a small 
pixel dwell time to effectively lessen the redeposition and properly choosing a 
suitable ion beam orientation to sample surface (Adams et al. 2003). 

Furthermore, a 20 % reduction in the amorphous thickness is found when using 
gas-assisted FIB milling. Figure 14 shows an example for bombardment at 9 = 0°. 
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Fig. 12 Plot of yield 
(C atoms/ion) and ion beam 
incidence angle, 0. Two data 
sets are shown including 
results from gallium ion beam 
bombardment ( x ) and H 2 0- 
assisted processes (■). The 
lines shown are guides to the 
eye. 0 is defined with respect 
to the substrate normal as 
shown in the inset (Adams 
et al. 2003) 
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Fig. 13 Plot of yield (C atoms/ion) versus pixel dwell time for H 2 0-assisted FIB milling of 
diamond ( filled symbols ) and FIB sputtering (A). Milling experiments at three different angles of 
incidence show the influence of pixel dwell time when using a fixed H 2 0 pressure. Dotted lines 
shown are simulated (Adams et al. 2003) 
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The reduction in layer thickness comes from an effect of reactive sputtering. The 
difference in amorphizated thickness is consistent with an enhanced material 
removal rate for H 2 0-assisted processes compared with FIB sputtering. Further- 
more, although FIB sputtering and H 2 0-assisted FIB milling both would generate 
ripples with nearly a single orientation, there were noticeable differences in mor- 
phology formed by the two processes, as shown in Fig. 15. H 2 0 gas-assisted etching 
can effectively reduce the ripple wavelength and amplitude under certain FIB 
sample orientation. 
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Fig. 14 AFM images of diamond surface FIB sputtered (a) and FIB milled in the presence of H 2 0 
(b). Experiments involved a single ion beam incidence angle, 0 = 55°, and a fixed dose of 

19 2 

1 x 10 ions/cm . Arrows indicate the projection of the ion beam vector (Adams et al. 2003) 
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Fig. 15 Transmission electron micrographs showing the damage layers resulting from FIB 

sputtering (a) and H 2 0-assisted FIB milling (b). Sample was bombarded with Ga at 0 = 0° to 

18 2 

10 ions/cm . “a-C” indicates the damaged region (Adams et al. 2003) 


Microtool Fabrication Using Focused Ion Beam Milling 
Fabrication Approaches 

Investigation of microtool fabrication can be carried out using the FIB/SEM dual- 
beam system, as shown in Fig. 16. The dual-beam system is a complete nanotech- 
nology laboratory combining a field emission SEM with ultrahigh imaging resolu- 
tion and a precise focused ion beam etching system. In addition, a stand-alone 
rotational axis is fixed to the sample stage of dual-beam system to provide another 
degree of freedom that is required for numerous SEM and FIB applications. Tool 
blanks can be inserted into the front of the axis and rotated, allowing the orientation 
of a microtool to be performed easily. Tools can be ion milled accurately by 
adjusting the relative position of tool blanks and ion column. 
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Fig. 16 Tool bank in sample 
chamber of FIB/SEM dual- 
beam system 




Fig. 17 Procedure for making a rectangular micro-cutting tool by focused ion beam sputtering. 
Arrows indicate the direction of ion beam impingement. The tool is rotated —90° between steps 
2 and 3 and +90° between steps 3 and 4. Images in left column show tilted view of model tool. 
Complementary images in the right column are end-on views (Adams et al. 2000) 


Figure 1 8 illustrates a typical fabrication process for a rectangular micro-cutting 
tool by focused ion beam sputtering. At the beginning of fabricating a microtool, a 
polished tool blank is roughly milled using a larger beam current (~20 nA). 
A smooth facet can be sputtered at the tool end, as shown in Fig. 17. After that, the 
finished facet is nearly aligned with the tool axis. Then, two facets are created on both 
sides of a tool, which defines the cutting width and tool cross-sectional profile. 
For instance, ion milling two nearly parallel facets created a tool with a rectangular 
cutting shape. 
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Fig. 18 Illustration of 
microtool sharp edge 
formation 
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During the ion milling, each side facet forms a natural taper angle of 5-10° with 
respect to the ion beam direction, using the maximum sputter yield at incidence 
angles between 75 and 85° with respect to the surface normal (Vasile et al. 1990). 
If a different taper angle of individual side facet is required, rotating the tool blank 
relative to the FIB before the FIB sputtering. After the formation of side facets, 
the focused ion beam is used to create rake features that clear chips during 
ultraprecision machining. 

Then, the tool is rotated —90° to the orientation. A projected area is milled on 
one side of the tool to form a desired back rake angle. This step also sets the rake 
facet length of typically 20-30 pm. At the last step, a sharp cutting edge having 
clearance is formed. Microtools are placed to original orientation with respect to the 
ion beam. This creates an end facet that intersects the rake facet at a sharp edge. 

The FIB would produce sharp edges on the side of facets furthest from the ion 
source, while the facet edge closest to the ion source is round because of the 
Gaussian beam intensity, as shown in Fig. 18. 

Comparing with ordinary precision grinding methods, FIB fabrication can 
produce complex tool configuration by accurately controlling the FIB milling 
time at predefined area when the flank face is prepared, as shown in Fig. 19. 
The designed DOE profile diagram is transformed into a bitmap file with 24 bits 
(Zhang 2009), as shown in Fig. 19a. The “*.bmp” file will be uploaded into the 
operating system of the FIB system as a desired milling pattern. Thus resolution of 
the bitmap pattern determines the final FIB -milled pattern. Micro diffractive optical 
elements (MDOE) can be developed using a DOE-shaped microtool on 6061 
aluminum. 
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Fig. 19 MDOE fabricated by a DOE-shaped microtool by FIB: (a) DOE bitmap used for FIB 
milling; (b) DOE-shaped microtool by FIB; (c) MDOE machined by the DOE-shaped microtool 
(Zhang et al. 2009) 


As the tool edge radius would directly affect the quality of machined surfaces, 
how to accurately measure the nanoscale tool edge radius is vital for its 
applications. Two precision metrology methods of atomic force microscope 
(AFM) and scanning electron microscope (SEM) can be adopted to the 
microtool’s edge radius measurements. When the AFM method is used, there 
would be AFM probe’s broadening effect, as shown in Fig. 20. To overcome this 
problem, the scanning probe should be sharpened and contour specified. 
AFM has the advantage of high measurement accuracy, but it cannot be used 
for the online measurement. Sandia National Laboratories’ researchers used the 
SEM to obtain the edge radius, as shown in Fig. 21 (Picard et al. 2003). The edge 
of intersection is cross-sectioned by an FIB system and viewed at a magnification 
of 200,000 x to determine radius of curvature (Zhang 2009). But this method 
would damage the microtool, as shown in Fig. 22. The Centre of MicroNano 
Manufacturing Technology (MNMT) at Tianjin University proposed a 
method using SEM to measure triangle microtool without FIB cross-sectional 
damaging, where the measurement plane must be placed vertical to the SEM 
observation direction. 
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Fig. 20 Broadening effect in 
AFM 
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Fig. 21 Scanning electron microscope images of a tool cutting edge. The lower-magnification 
image {left) and the higher-magnification image (right) (Adams et al. 2000) 

Microtools of Various Materials 
Diamond Microtools 

Single crystal diamond is considered as an ideal tool material in ultraprecision 
machining because of its highest hardness, better chemical stability at normal 
temperature, resistance to wear, and the least sharpness to be finished. Therefore, 
studies of single crystal diamond microtools are widely conducted using FIB 
milling technology. 

Figure 23 shows a turning tool made from a diamond tool blank. The width 
of the micro tool is about 23 pm after FIB milling (Picard et al. 2003). 
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Fig. 22 The cutting edge cross-sectional observation by FIB milling, (a) The tool edge cross- 
sectional processing; (b) SEM images of a diamond microtool cutting edge (Zhang 2009) 



Fig. 23 Scanning electron microscope images of a tool cutting edge, (a) Low-magnification view 
of a single crystal diamond tool shank and junction with mandrel. The tool cutting edges are 
fabricated on the last ^30 pm near the tip; (b) perspective view, high-magnification scanning 
electron micrograph of the same diamond tool showing the focused ion beam-shaped facets; (c) 
left side cutting edge of the same microtool. This image shows the intersection of multiple FIB 
sputtered facets; (d) end view of the tool (Adams et al. 2000) 
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Fig. 24 Scanning electron microscope images of two rectangular cutting surface microtools, (a) 
Two-tip, single crystal diamond tool shaped by focused ion beam sputtering; (b) end view of the 
same tool demonstrating relief behind cutting edges; (c) high-magnification view of a single tip of 
this two-tip tool (Picard et al. 2003) 


Figure 23c shows the higher magnification of the cutting edge formed by the 
intersection of two ion-milled facets. Microtools with a 0.04 pm roughness edge 
and a radius of curvature less than 40 nm are developed. There were some ripples at 
the microtool facets. They expected that these features formed as a result of a 
well-documented sputter-induced morphological instability (Bradley and Harper 
James 1988) and related phenomena (Carter 1999). 

Microtools of two rectangular cutting surfaces with 10 pm wide tips separated by 
7.7 pm were fabricated based on a single crystal diamond blank (Picard et al. 2003), 
as shown in Fig. 24. It is expected that shaping of similar two-tip diamond tools on 
this scale would be virtually impossible by conventional methods such as grinding. 
And much complex tool face geometry is possible to be developed by FIB. 

Other Materials for Microtools 

Other materials are also applied for the microtools for different applications, 
including tungsten carbide, CBN, and high-speed steel (Adams et al. 2003). 

Figure 25 shows a tungsten carbide tool with a cutting edge width of 1 8 pm and 
clearance behind all cutting edges. This tool has a 7° back facet, designed to deflect 
chips away from the cutting edge and workpiece surface (Adams et al. 2000). The 
grooving tool shown in Fig. 26 is made of CBN, with the rake and rear angles of 
2.4° and 12.4°, respectively. Besides, the length of main cutting edge is 22.87 pm, 
and cutting edge roughness is 17 nm in Ra. 

Micro-Milling Tools 

Micro-milling tools have also been fabricated using FIB direct writing. Nonplanar 
facets of a polished tool blank were processed by ion sputtering to make the micro- 
end mill, which is shown in Fig. 18. Because the Gaussian beam intensity would 
extend outside the original design area, the facet edge closest to the ion source 
would produce a round edge. In order to sharpen the edge, ion needs to continue 
sputtering this particular geometry, which has a truncated intensity distribution due 
to shadowing by the tool facet. First, approximately 5 pm was bombarded from the 
end of the tool, creating a polished facet that provides clearance for chip removal. 
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Fig. 25 Tungsten carbide microgrooving tool fabricated by focused ion beam sputtering. Tool has 
a near rectangular cross section with an 18 mm wide tool-end cutting edge. Images show different 
perspectives of the microtool but do not correlate with tool’s orientations with respect to the 
focused ion beam (Adams et al. 2000) 



Fig. 26 Grooving tool made of CBN (Zhang 2009) 


Second, the chip-cutting facets on a plane, containing the tool axis but tangential to 
the tool circumference, were processed over areas approximately 3 x 50 pm by 
FIB. The SEM image of a submicron radius of curvature along a cutting edge is 
shown in Fig. 27, which is taken near parallel to the tool axis (Adams et al. 2000). 
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Fig. 27 Sharp cutting edge 
of a four-side, tungsten 
carbide micro-end mill tool. 
Edge shown is the 
intersection of two focused 
ion beam sputtered facets 

(Adams et al. 2000) 




Fig. 28 Micro-end mills made by focused ion beam sputtering having 2 (a), 4 (b), and 5 (c) 
cutting edges. Scanning electron micrographs (Adams et al. 2000) 


Figure 28 shows three kinds of micro-end mills with 2, 4, and 6 cutting facets. 
Their facets and sharp edges were fabricated by a sequence of tool rotation and 
sputter removal steps, and the number and position of facets could affect the 
characteristics of a tool. The tool in Fig. 28a has two cutting facets, two diametri- 
cally opposed sharp edges and one tool-end clearance facet, and the one in Fig. 28c 
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Fig. 29 Scanning electron micrographs of concentric circular grooves machined in 6061 alumi- 
num alloy using a microgrooving tool shown in (a) are a micromachined annulus. Micrograph (b) 
displays a high-magnification view of a single groove (Adams et al. 2000) 


has six cutting facets and clearance behind each of the five cutting edges, since 
almost the entire circumference is sputtered. During sputtering steps, a particular 
stage rotation sequence should be selected to produce six-facet end-mill tools with 
4, 5, or 6 sharp edges. Since tool rotation direction for milling operations could be 
made for clockwise or counter-clockwise (looking down on a workpiece), a partic- 
ular placement of facets by FIB sputtering should be test, which determines tool 
rotation direction clockwise. 


Applications 

Concentric Ring Features 

Microgrooves are key features in microchips, optical elements, and other micro 
functional devices. For instance, the binary amplitude -phase mask, a concentric 
groove structure, is widely used in hybrid imaging systems to exhibit high resolu- 
tion and extended depth of field. Investigations of fabricating microgrooves by 
microtools shaped with FIB are discussed as follows. 

In Fig. 29, there are four concentric rings on the planar aluminum workpiece, 
which can be cut using a depth per pass equal to 2 pm with the feed rate of 8 pm/s. 
The cutting speed varies from 3.611 to 10.205 pm/s decided by the distance from 
the center of the rings. The depth of the inner concentric ring is 2 pm while the outer 
three are 10 pm. The resolution of the tool fed vertically is 8.6 nm so that the depth 
of the rings could be controlled precisely. The average width of the groove 
measured by the optical interferometry is 30.7 pm, which is close to the microtool 
cutting width of 30.4 pm. Due to higher cutting speed, the roughness (R a ) of the ring 
bottom is small, 0.14-0.25 pm, measured by a white light interferometer (Adams 
et al. 2000). 

An instance of circular curves machined on aluminum is shown in Fig. 30. Two 
concentric rings that have a sinusoidal cross section demand numerous passes in 
order to keep the curved surface smooth, and their maximum depth is 10 pm. The 
center of the rings was a single crest with the original height of the workpiece surface. 
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Fig. 30 Scanning electron micrographs of concentric circular grooves having a sinusoidal 
waveform cross section. Features are cut on 6061 aluminum using a microgrooving tool (Adams 
et al. 2000) 


Fig. 31 The SEM image of 
the microtool (Xu et al. 2010) 
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By this technique, similar concentric rings could be fabricated with the spindle speed 
of 300 rpm, i.e., the maximum cutting speed amounting to 20,000 pm/s. The 
minimum pitch of these rings is 62.5 pm when using a tool with a width of 30.4 pm. 

An arc edge micro-diamond cutting tool was fabricated by FIB milling in 
MNMT, as shown in Fig. 31, having a radius of curvature less than 30 nm and an 
edge roughness of 12 nm. The radius of arc is 10 pm, while the tip thickness is about 
21 pm. The sinusoidal modulation templates were machined on a 6061 aluminum 
workpiece, and the surface was measured by a white light interferometer, as shown 
in Fig. 32. To test their cutting performance, they were utilized to process micro- 
structures with an ultraprecision lathe. The period and amplitude of the sine curve 
are nearly in accordance with predefined cross-sectional profile. The machining 
results suggest that the FIB -fabricated microtools are capable of accurately fabri- 
cating the designed micro structures. 
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Fig. 32 3D and 2D images of the sinusoidal modulation templates (Xu et al. 2010) 


Micro optical elements (MOE) are considered as a key component in building 
compact optoelectronic systems. There are several methods for fabricating micro 
optical elements, such as binary optics fabrication method (Herzig 1997), direct 
writing method using a laser or a focused ion beam (Maruo and Saeki 2008; 
Fu et al. 2010), single-point diamond turning (SPDT) technology (Yi and Li 
2005), etc. SPDT is one of the most widely used fabrication methods for various 
types of MOE. With the trend towards optical element miniaturization, the devel- 
opment of the microtools with nontraditional shape and sharp edges used in SPDT 
is becoming a very significant issue in the MOE fabrication. However, because of 
the size and configuration limitations of traditional cutting tools in SPDT, some 
areas with a high aspect ratio in the MOE cannot be machined, as shown in Fig. 33a. 
This would cause the shadowing effect, which directly degrades the optical prop- 
erties of the device and reduces the diffraction efficiency (Riedl 1995). Therefore, 
to overcome this problem, the MNMT developed a micro hemispherical tool using 
FIB milling, as shown in Fig. 33b, c (Xu et al. 2010). 

The hemispherical microtool can be used to fabricate micro Fresnel optical 
component by ultraprecision machining. The machining parameters are spindle 
speed of 1,900 rpm, feed rate of 1 mm/min, and depth of cut on z-axis of 1 pm. 
The micro Fresnel lens was machined on a 6061 aluminum workpiece. Then, the 
machined Fresnel optical lens was measured by a white light interferometer, as 
shown in Fig. 34a, b (Xu et al. 2010). The optical property of the micro 
Fresnel optical component was tested using a He-Ne laser with a wavelength of 
632.8 nm. The He-Ne laser irradiated the micro Fresnel optical component surface, 
resulting in diffraction rings whose order of diffraction could be up to 22, as shown 
in Fig. 34c. It is found that the micro Fresnel optical lens fabricated using this 
method is of high resolution and high diffractive efficiency. 

Machining on Cylindrical Workpieces 

Micro-cutting tools for ultraprecision turning of grooves on cylinders or other 
rotationally symmetric shapes were investigated. The applications include 
extremely small solenoids and the rotary element of microscrew pumps. 

A polyetheretherketone (PEEK) cylinder was attached to the lathe chuck, and at 
a distance of 2.55 mm from the chuck, the groove cutting begins to fabricate a long 
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Fig. 33 Shadowing effect comparison in MDOE fabrication with spherical and hemispherical tools: 
(a) traditional shape tool; (b) micro hemispherical tool; (c) hemispherical microtool (Xu et al. 2010) 


helix and one ring at both ends with the same depth of the helix from the groove, 
showed in Fig. 35. The helix portion in Fig. 35b clearly demonstrates the smooth 
groove bottom and sidewall surfaces. 

It was measured that the width of the helical groove was 94.6 pm and the depth 
was 44.6 pm with a standard deviation of 0.63 pm or 2.3 pm, respectively. By a 
white light interferometer, the roughness of the groove bottom was measured to be 
0.22 pm (R rms ) and 0.14 pm (R a ) (Picard et al. 2003). 

Because the FIB -polished microtools significantly deflected the workpiece, the 
groove depth deviated by > 10 % of the intended value for cylinder lengths of ~ 4.0 
mm, where a rougher surface finish created most likely due to the chatter. The 
grooves of constant depth were expected to be made longer by using a tailstock for 
workpiece support. After removing the material of two PEEK cylinders amounting 

Q Q 

to 4.5 x 10 cm in the volume, the tool edge was inspected to be found that the 
edge radius increased less than 0. 1 pm. 

Triangular grooves cut into a 3 mm diameter PEEK rod excellently matched 
the tool shape in the cross section. The grooves have almost the same angle 
(approximately 62.5°) and depths (close to the intended depths, ~15 pm). 
The spacing of the groove pair was 150 pm, and the total arc length of each groove 
was 18 cm (Adams et al. 2003). 

V-shaped microtool was used to fabricate microgrooves for roller mold using 
nanoimprint in MNMT. In order to enhance the tool performance, single crystal 
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Fig. 34 Micro Fresnel diffractive optical component: (a) micro Fresnel optical component; (b) 
morphology of the micro Fresnel lens; (c) diffraction rings (Xu et al. 2010) 





Fig. 35 Grooves machined in PEEK using focused ion beam-polished, custom ground lathe tool, 
(a) shows an image of the groove ending in a circumferential ring, (b) shows a high-magnification 
view of the smooth helical feature (Picard et al. 2003) 

diamond tool blank is selected, as shown in Fig. 36. The microtool rake angle is 
0° and the relief angle is 11°. Through the micro-feed in z-axis, the microgrooves 
can be machined step by step, as shown in Fig. 37. The microgrooves were 
machined on a brass workpiece. 

A series of grooves were produced with different cutting parameters. The images 
of final microgrooves were taken by SEM, as shown in Fig. 38. It can be readily 
found that among these images, the products resulted from the second fabrication 
method are of better quality. 
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Fig. 36 SEM images of the FIB -fabricated V-shaped microtool 



Fig. 37 Image of ultraprecision machining and schematic diagrams of the UV roll 
nano imprinting system 


Fabrication Trenches 

Micro-milling tools can also be used to fabricate trenches that are several millime- 
ters long by ultraprecision machining. Trenches have nearly vertical sidewalls and 
small bottom surface roughness. A good matching of tool diameter to trench width 
is found for all materials tested including PMMA, 6061 Al, 4340 steel, and brass 
(Adams et al. 2001). 

The sidewalls of the trenches milled in PMMA, Al alloy, and 4340 steel were all 
nearly vertical, for instance, the vertical deviation of those in 6061 Al as less than 2° 
measured by scanning electron microscopy. From the SEM, it can be seen that the 
trench widths are close to the intended sizes. The trench width produced by the 
experiments was ~2 pm larger than the tool diameters, which also was adapted to 
different tool designs and workpiece materials. The radial error of the tool and 
spindle could contribute to the small deviation from the intended width. Besides, it 
was presumed that the eccentricity of tool cutting edges or possible tool vibration 
could also cause an error. 
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Fig. 38 SEM image of microgrooves developed with microtools: (a) feed rate, 400 mm/min; 
depth of cut on z-axis, 2 pm; (c) feed rate, 200 mm/min; depth of cut on z-axis, 5 pm; (b) and (d) 
are high-magnification views of (a) and (c) 

The trench milled in brass shown in Fig. 39 with 10 mm in length and 28.8 pm in 
width was completed by 25 passes, which required extremely precise positioning 
repeatability of the milling machines. 

The terminal sections of four parallel grooves were shown in Fig. 40, which were 
milled in PMMA at 20,000 rpm with 2.5 and 5.0 mm axial depths per cut. The 
curled piece of debris near the base of groove #2 indicates that PMMA is removed 
by cutting chips rather than burnish. 

Micromachining Mechanism Investigation 

Another important application for the microtools developed by FIB is the investi- 
gation on micro-/nanoscale tool-based machining mechanism. Investigations have 
been conducted on the microtool geometry effect on workpiece surface finish and 
dimensional accuracy and micro-cutting tool wear. 

Figure 41 shows the magnified tool tips of the SCD tools with a shaped angle of 
90° and 150° used in the experiments (Ding et al. 2009). The workpiece used in the 
experiments was the copper-nickel alloy C76000, which has a very good corrosion 
resistance. From the experiment it can be concluded that larger cutting depth brings 
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Fig. 39 Brass machined with micro-end mill tool having five cutting edges. Plan-view scanning 
electron micrograph (a) and a higher-magnification, perspective view (b) (Adams et al. 2001) 


Fig. 40 Four parallel 
grooves milled in PMMA at 
20,000 rpm with a 
2 mm/minute feed rate. 
Micrograph is taken prior to 
cleaning of workpiece 
(Adams et al. 2001) 




Fig. 41 SCD tool tip with a shaped angle of 90° (a) and 150° (b) (Ding et al. 2009) 
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Fig. 42 Grooves obtained with 150° shaped angle SCD tool at the cutting speed of 400 mm/min: 
(a) depth, 5.65 pm; (b) depth, 1.004 pm (Ding et al. 2009) 


O 


surface defects and burrs to grooves for both types of tools with different shaped 
angle, as shown in Figs. 42 and 43. Severe burrs would be generated at low cutting 
depth by using SCD tools with a shaped angle of 90 

In order to improve the performance of micromachining, a new machining 
strategy using a reduced cross-feed is tested. Invariant machining force, minimized 
burr size, and improved machined surface integrity were obtained while employing 
this machining strategy (Ding et al. 2012). Three grooves were fabricated using the 
microtool on the top surface of OFC (oxygen-free copper) as shown in Fig. 44. The 
depths of grooves from the 1st to the 3rd groove were 0.9, 1.4, and 2.2 pm, 
respectively. No coolant was used during the micromachining. The machining 
length of three grooves was around 2.4 mm/pass. The 1st and 2nd grooves and 
Zone Z1 at the 3rd groove were formed by removing a full layer with layer width 
equal to the machining contact length of tool-work interaction at the primary 
clearance face at a machining speed of 1.0 mm/min. The Zone Z2 in the 3rd cut 
was generated by a series of passes with the same machining speed of 1 .0 mm/min 
aligning the machining direction, however repeatedly removing the right material 
to the tool at a cross-feed of 1 pm/pass (the contact length of tool-work interaction 
equals to 1 pm). 

Burrs could be easily found at all groove edges except the right edge of the 3rd 
groove. Figure 45 showed close-up views of groove edge at the location of B and B’ 
which were formed at different cross-feeds. No severe burrs could be observed at 
the entire right edges of the machined 3rd grooves; meanwhile severe deformations 
or burrs could be easily observed at any other edges. 

It is also found that a change in crystallographic orientation results in a strong 
variation in the machining force, chip thickness, and shear angle, leading to a 
change in machined surface integrity. When a micro-size tool traverses within a 
phase at a machining direction aligned with a particular crystallographic orienta- 
tion, the work material in front of the machining tool is found to be severely 


39 Micro Tools Fabrication by Focused Ion Beam Technology 


1501 



Fig. 43 Grooves machined by SCD tools with a shaped angle of 90° at cutting of 400 mm/min: 
(a) depth, 2.99 pm; (b) depth, 7 pm; (c) depth, 1.49 pm (Ding et al. 2009) 



g roove g move groo ve 

(a I ) B e fore I ■’ [ B etch i ng 


groo v e g to o ve grow v e 
(a2) After FIB etching 



(hi) M a gn \ ji ed v i e w a t A 



( b 2 ) Magn i fied vi e w a l A L 


Fig. 44 Three grooves formed using the microtool (Ding et al. 2012) 
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C I o so - vi e w at B Close- view at B' 


Fig. 45 Strategy to improve the micromachining performance (Ding et al. 2012) 



Fig. 46 The change in crystallographic orientation, (a) The scaling observed at the free face of 
the 2nd chip; (b) machined surface at which the scaling was generated. Arrow without label shows 
the chip flowing direction in (a) or cutting direction in (b) (Ding et al. 2012) 


deformed, leading to a much reduced shear angle, a thicker chip, higher machining 
forces, and a degraded machined surface (Ding et al. 2012), as shown in Fig. 46. 
And the hard spots in the material would affect the machining surface, as shown in 
Fig. 47 (Ding et al. 2008). 

The reductions in the contact area at the interface between the chip and the 
machining face played an important role in minimizing the burr size and avoiding 
the degraded machined surface in micromachining. This machining strategy was 
subsequently applied to generate high aspect ratio micro-pillar array with high 
surface qualities, as shown in Fig. 48. 
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Fig. 47 Microgroove with a depth of less than 0.25 pm cut by the micro-size tool ((a) by SEM; 
(b) and (c) by Veeco NT3300 optical profiler; work material: A16061) (Ding et al. 2008) 


Fig. 48 High aspect ratio 
micro-pillar arrays were 
successfully generated with a 
size of 12.8 x 15.8 x 28.2 
pm (Ding et al. 2008) 



Minimum Undeformed Thickness Investigation 

Micro mechanical machining techniques have introduced many advantages to the 
fabrication of micro-sized features. Micromachining differs from macro-cutting 
operations because the chip thickness in micromachining applications is 
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Fig. 49 Chip growth at (a) a/r = oc, r = 0; (b) a/r = 3, r = 1 pm; (c) a/r — 0.6, r = 5 pm; 
(d) a/r = 0.2625, r = 8 pm (a is undeformed chip thickness in microscale; r is the tool edge radius) 
(Woon et al. 2008) 

comparable in size to the edge radius of the tool, resulting in large negative rake 
angles and elastic-plastic deformations of workpiece materials. 

The minimum thickness of cut (MTC) in metal cutting is defined as the mini- 
mum undeformed thickness of chip that can be removed stably from a work surface 
at a cutting edge under perfect performance of a machine tool. The minimum 
thickness of cut is an important concept in understanding the extreme machining 
accuracy attainable for a specific cutting condition. From the physical point of 
view, it reflects material behavior under extreme mechanical conditions at the 
cutting edge. If the depth of cut decreases to submicron level, the tool edge radius 
which can be ignored in conventional machining must be considered. When the 
depth of cut reduced to nanometric scale, the mechanism cannot be explained by 
the conventional cutting theory based on “continuum mechanics” because of the 
size effect, as shown in Fig. 49 (Woon et al. 2008). In order to find the MTC, Yuan 
introduced a simple analytical relationship between the tool edge radius, the cutting 
forces, and the friction coefficient (Yuan et al. 1996). 

The molecular dynamic (MD) simulation technique is based on interatomic 
force calculations. It can accommodate micro-material characteristics as well as 
dislocations, crack propagations, specific cutting energy, etc. Hence, MD simula- 
tion has also been used to predict the MTC. Shimada et al. used MD simulations to 
study the achievable accuracy using edge radius tooling in diamond turning 
(Shimada et al. 1993a). It was pointed out that the MTC effect was the 
major limiting factor in achieving accuracy because of the generated surface. 
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Cutting direction : < 121 > 


Fig. 50 Deformation around the cutting edge in micro-cutting of single crystal copper (r = 5 nm) 
(Shimada et al. 1994) 


Shimada et al. developed an MD model for understanding the chip removal 
mechanism in micromachining of copper with depths of cut down to 1 nm (Shimada 
et al. 1993b). The model was able to continuously generate dislocations in front of 
the tool tip and produce chip morphology showing a very good agreement with 
experimental results. In another work (Shimada et al. 1994), they also investigated 
crystallographic orientation effects on plastic deformation on single crystal copper 
using MD simulation (Shimada et al. 1994). Additionally, they found that the 
ultimate possible surface roughness was about 1 nm for both single crystal and 
polycrystalline copper materials assuming perfect cutting condition, shown in 
Fig. 50. MNMT proposed a new approach based on the maximum displacement 
of atoms in cutting direction to estimate the chip formation in MD simulation. It is 
found that the minimum rake angle for chip formation is —65° ~ (—70°) and the 
subsurface deformations of copper are mostly the dislocation and stacking faults, 
shown in Fig. 51 (Lai et al. 2012). 
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Fig. 51 The difference between (a) chip formation and (b) elastic-plastic deformation in the 
region of maximum displacement for workpiece atoms (Lai et al. 2012) 

It is worthwhile to mention that the edge radius that MD can simulate is still 
smaller than real machining process. That means MD simulation results cannot be 
effectively verified by ultraprecision experiments in the same scale. Therefore, the 
preparation of the extremely sharp cutting tool with edge radius less than 10 nm is a 
critical requirement in the MD and MTC researches of nano-cutting. 

In the early 1990s, Ikawa et al. carried out significant micromachining trials in 
order to find out the minimum cutting thickness at Lawrence Livermore National 
Laboratory. According to their reports, the minimum thickness of chip can reach to 
1 nm, and hence, a diamond tool with a cutting edge radius of 3-5 nm should be 
provided (Ikawa et al. 1992). Zong et al presented the brittle-ductile transition 
lapping mechanism for mechanical lapping of single crystal diamond cutting tools. 
Atomic force microscope (AFM) results indicate the cutting edge radius has been 
sharpened down to 2-9 nm (Zong et al. 2007a, b). 

The focused ion beam process has been used to fabricate sharp microtools. 
Compared to other preparation methods, focused ion beam direct writing can 
fabricate it with nanoscale edge radius and more importantly with high reproduc- 
ibility. By controlling the relative position between the ion source and micro tool in 
the process, Sandia has fabricated a tool with an edge radius of less than 40 nm and 
the Singapore Institute of Manufacturing Technology (SIMTech) has fabricated a 
tool with an edge radius of about 20 nm. 

The microtools with edge radius less than 20 nm have also been fabricated by 
optimizing the process parameters in MNMT. In the experiment, the MTC less than 
10 nm can be obtained. That means extremely sharp tools can be prepared by 
focused ion beam milling. Figure 52 shows the geometry of the cutting experiment 
and Table 1 conditions of the cutting experiment. The MTC t 0 can be controlled by 
the tool setting angle cp , the feed speed F (pm/min), and the spindle speed N (rpm). 
Figure 53 shows the SEM micrographs of chips, while the minimum thickness of 
cut chip is about 9 nm. 

From the cutting mechanics point of view, the extreme accuracy feasible in 
micro-cutting may be of the order of 1 nm when the cutting edge sharpness is 
maintained at the level of a few nanometers. 




39 Micro Tools Fabrication by Focused Ion Beam Technology 


1507 



Fig. 52 Geometry of the cutting experiment and microtool 


Table 1 Conditions of the 
cutting experiment 


Work material 

Single crystal copper 

Tool setting angle 

20° 

Spindle speed 

2,000 rpm 

Depth of cut 

5 pm 

Feed speed 

20 pm/min 

Feed rate 

10 nm/rev 

Coolant 

None 





Fig. 53 SEM micrographs of chips: thickness of cut 20 nm 


Summary and Outlook 

The major advantages of microtool fabrication using FIB include achieving more 
complex tool shape, better dimensional resolution, high repeatability, and online 
observation of the tool shaping process. FIB sputtering creates tool shapes that are 
beyond the capabilities of conventional techniques such as precise polishing and 
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grinding. Moreover, a negligible mechanical force would affect the tool’s quality 
during the traditional tool fabrication methods. 

The new method has a significant impact in many research areas, such as 
communications, healthcare, and electronics. However, some issues in microtool 
fabrication using FIB still need to be resolved as listed below. 

Sharpness of the Tool 

Current investigations reveal that the micro-cutting radius of as small as 15-20 nm 
is available on FIB -fabricated diamond tools, while 40 nm on hard alloy tool. 
However, a comprehensive study with the sharpness limits for diverse tool mate- 
rials is needed to be investigated. Therefore, it appears to be more important to 
investigate the highest sharpness based on materials and fabrication methods. 


Damage of FIB Sputtering on Tools and Its Alleviation Strategies 

Ion beam inevitably brings damages to tools to be processed with nanometer-sized 
damage layer. Future work should concern on how to evaluate the FIB -induced 
damage and its mechanism and effectively alleviate it. 

Morphology Control in Fabrication and Efficiency 

In the FIB milling process, ripples are apt to appear during the microtool surface 
fabrication, which would seriously influence the facial feature and degrade tool 
functionality. As a consequence, how to alleviate this effect and improve fabrica- 
tion quality is valuable. 

The FIB machining efficiency is generally lower than other micro-fabrication 
methods. Increasing the FIB efficiency is a key topic in future investigation. In 
order to improve it, the appropriate tool blank should be selected to reduce the 
removal of focused ion beam. The method of choosing the right assisted etching 
(e.g., the H 2 0) can effectively improve processing efficiency. 

Future Development 

One of the most important tendencies in manufacturing is miniaturization. Consid- 
ering the increasing demand of MEMS, ultraprecise instruments, and so forth, 
smaller feature dimension and more complex shape would be critical issues in 
future manufacturing. 

Tool with higher quality would certainly find wider market in practical use. 
To meet the requirements of use in complicated environment and obtain better 
fabrication results, properties of tool, including edge radius, form precision, effec- 
tive longevity, and cost-performance, need to be advanced. 
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Abstract 

Maskless fabrication methods for nanogap electrodes using sputter etching with 
a Ga focused ion beam (FIB) are presented. These methods are based on the in 
situ monitoring of the etching steps by measuring the current through patterned 
electrode films. The etching steps were terminated electrically at a 
predetermined current level. In the present experiment, a 30-keV Ga FIB with 
a beam size of ~12 nm was irradiated on double-layered films consisting of a 
10-30-nm-thick Au top electrode layer and a 1-2-nm-thick Ti bottom adhesion 
layer to form nano wires and nanogaps. Electrode gaps that were much narrower 
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than the beam size could be reproducibly fabricated using the presented method. 
The controllability of the fabrication steps was significantly improved by using 
triple-layered films consisting of a thin Ti top layer, Au electrode, and a bottom 
Ti adhesion layer. The minimum gap width achieved was ~3 nm, and the 
fabrication yield reached ~90 % for ~3-6-nm wide gaps. Most of the fabricated 
nanogap electrodes showed high insulating resistances, ranging from 1 G£2 to 
1 T£l. The applicability of the fabricated nanogap electrodes to electron transport 
studies of nanometer-sized objects was examined using electrical measurements 
of Au colloidal nanoparticles. 


Introduction 

Nanogap electrodes are important for various nanometer- scale devices, including 
single electron devices and molecular electronic devices. In particular, molecular 
electronic devices employing a single molecule as an active element (Aviram and 
Ratner 1974) have attracted growing interest as a possible alternative for overcom- 
ing the limits on silicon-based microelectronic devices. The downscaling of silicon 
devices will continue but will reach technological and physical limits sometime in 
the future. Below ~10 nm, for example, device performance becomes extremely 
sensitive to minute variations in size, and lithography techniques are facing tough 
demands in relation to extremely high controllability and increased cost. In nature, 
molecules are a few nanometers in size and have identical sizes and structures. 
Moreover, molecules with various unique functions can be created using chemical 
synthesis. 

To realize molecular electronic devices, the electrical characterization of indi- 
vidual molecules is a crucial issue and requires the fabrication of nanogap elec- 
trodes, the gaps of which are comparable to the size of single molecules, i.e., a few 
nanometers. However, fabricating such small nanogaps is difficult because the 
resolution of standard electron beam (EB) lithography is limited to approximately 
20 nm as a result of the proximity effect. Therefore, numerous techniques for 
fabricating nanogap electrodes have been developed (Li et al. 2010), including 
mechanically controllable break junction (MCBJ) (Muller et al. 1996; Reed 
et al. 1997), shadow evaporation (Klein et al. 1996), metal deposition over 
suspended structures (Bezryadin et al. 1997), electrochemical plating (Morpurgo 
et al. 1999), electromigration (Park et al. 1999), and molecular rulers (Hatzor and 
Weiss 2001; McCarty 2004). Among these, the MCBJ and electromigration 
methods have been widely used for the electrical measurements of single molecules 
and have shown many impressive results (Reed et al. 1997; Park et al. 2000, 2002). 
The MCBJ method is based on the mechanical breaking of an electrode wire 
fabricated on an elastic substrate by bending. The width of the resulting electrode 
gap can be precisely controlled using a piezoelectric element under the in situ 
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monitoring of the tunneling current. Although this method allows the fabrication of 
a very narrow nanogap, it involves complicated processes and has a low yield. The 
electromigration method, which utilizes electrical breaking caused by the migration 
of metallic ions by an electron flow, makes it possible to fabricate nanogap 
electrodes with gaps of a few nanometers on conventional Si/Si0 2 substrates, 
which is advantageous in the preparation of three-terminal devices. However, it 
only provides statistical control with a low reproducibility. More recently, high- 
yield fabrication techniques based on EB lithography have been developed (Liu 
et al. 2002; Saifullah et al. 2002). These techniques, however, involve precise 
alignments of the fabrication conditions, which require great skills. To investigate 
the electrical characteristics of molecular electronic devices, the development of 
reliable fabrication methods is still required. 

To address these issues, nanogap fabrication techniques using a focused ion 
beam (FIB), have been investigated which enables the reliable and direct fabrica- 
tion of narrow nanogaps (Nakayama et al. 1998; Nagase et al. 2003, 2005, 
2006a, b). Recent FIB systems provide a beam with a diameter of ~10 nm and 
even a 4-nm system is commercially available. Using these systems, nanostructures 
with a feature size much smaller than 10 nm can be fabricated. Two fabrication 
techniques are possible using FIB. These are ion beam-assisted deposition (Gamo 
et al. 1984; Gamo and Namba 1990; DeMarco and Melngailis 1999; Shigeto 
et al. 2006) and etching. In the deposition techniques, a precursor gas such as 
metal organic compounds is introduced in the proximity of a sample surface and 
FIB is irradiated to induce decomposition in the adsorbed gas molecules and the 
deposition of metal films. Using this technique, nanogaps with a low leakage may 
be formed in air bridge structures (DeMarco and Melngailis 1999; Shigeto 
et al. 2006). However, it may be difficult to form nanogap structures attached 
directly to a substrate because a Gaussian beam blur, together with an exponential 
tail distribution, will cause blurred edges in the deposited patterns and contamina- 
tions, which may induce current leakages in the gaps. On the other hand, physical 
sputter etching is a simple and clean technique that does not involve the use of 
precursor gases. The exponential tail distribution may not cause severe problems 
because of its low fraction of the beam intensity. Moreover, irradiation in metal 
films may cause a negligible degradation in conductivity. As the electrodes for 
molecular conductance measurements, very thin metal films are sufficient. This fact 
also makes FIB sputter etching techniques attractive because the redeposition 
effects and ion dose should be greatly reduced. 

In this chapter, a simple yet highly reproducible method to form nanogap 
electrodes using maskless sputter etching by FIB with the in situ monitoring of 
the current fed to the electrode film during the etching steps is reported. The effects 
of various fabrication parameters are also reported, including the density of the 
monitoring current and film structures. In addition, the insulating characteristics of 
the fabricated nanogap electrodes and their applicability to the electrical measure- 
ment of nanometer- sized objects are discussed. 
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Fabrication Procedures 
Electrode Samples 

The substrate used in the study was a highly doped Si wafer which has a thermally 
grown silicon dioxide (Si0 2 ) layer with a thickness of ~200 or ~300 nm on the 
surface. Double-layered metal (Au/Ti) and triple-layered metal (Ti/Au/Ti) films were 
deposited on the Si0 2 surfaces using Ar sputtering. The Au film was used as an 
electrode. The bottom Ti film was necessary to obtain good adhesion of the Au film to 
the underlying Si0 2 surface. The top Ti film in the Ti/Au/Ti structure was prepared to 
prevent excess etching of the underlying Au film by FIB tails during the FIB etching 
steps. This was effective because Ti has a much lower sputter rate than Au. The 
thicknesses of the top and bottom Ti films were ~6 nm and 1-2 nm, respectively. The 
thickness of the Au film was varied from ~10 to ~30 nm to examine the influence of 
the electrode thickness on the nanogap fabrication. As shown in Fig. 1, ~5-7-pm 
wide and ~3-pm long patterns with ~100-pm wide leads and contact pads were 
defined in the layered metal films by using photolithography and Ar sputter etching. 


Fabrication Processes 

A schematic diagram of the nanogap electrode fabrication setup is shown in Fig. 2. 
This setup consists of an FIB system (SII Nano Technology Inc., JFIB-2300) with 
an auxiliary beam blanking circuit, voltage source, and ammeter. The minimum 
spot size (full width at half maximum ([FWHM]) of the FIB generated from a 
30-keV Ga ion source was ~12 nm. A current density of ~1 A/cnr was used. 
A constant DC voltage ranging from 50 to 200 pV was applied to the sample to 
monitor the gap formation etch step. 

Nanogap fabrication using FIB sputter etching involves two steps. In the first 
step, the FIB was irradiated in U-shaped pattern #1 (Fig. 1) to form nano wires with 
a width of a few tens of nanometers and a length of 100-200 nm in the multilayered 
metal films. The etching step was monitored in situ by measuring the current in the 
sample, which decreased with the progress of the etching step. To correct the beam 
drift during the etching, the beam position was corrected periodically according to a 
spot marker, which was formed by FIB etching just before the first etching step. In 
the first etching step, the etching was terminated manually after the formation of the 
nanowire. In the second step, the FIB was irradiated in pattern #2 (Fig. 1) to form a 
nanogap in the nano wire with a single line scan. The nanogap formation step was 
terminated by blanking the FIB automatically. The sample current or etching depth 
at which the FIB was blanked was preset by a reference voltage, V ref . By the in situ 
monitoring of the etching steps, the gap width can be reliably controlled, and the 
etching in the underlying substrate can be minimized. 

The fabricated nanogap electrodes were observed using a scanning electron 
microscope (SEM) (JEOL Ltd., JSM-6700F). The width and electrical resistance 
of nanogaps were estimated from SEM images and DC measurements, respectively. 
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Fig. 1 Schematic diagrams of nanogap and nanowire fabrication 
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Fig. 2 Experimental setup for nanogap electrode fabrication 


Numerical Simulation 

Simulation results for the etched gap shape and dependence of the gap width on the 
dose are shown in Fig. 3. The numerical simulation was performed for single line 
scanning by a 30-keV Ga FIB on double-layered structures composed of a 27-nm- 
thick Au top layer and 1.6-nm-thick Ti bottom layer using a string segment model 
(Neureuther et al. 1979). In the present simulation, the FIB current distribution was 
assumed to be Gaussian, and the redeposition effect was neglected. Needless to say, 
it is doubtful that we can form narrow patterns with a very high aspect ratio if we 
take the redeposition effect into account. However, the present simulation may be 
useful to estimate the feasibility of nanogap formation using FIB maskless etching. 
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Fig. 3 Calculated cross-sectional profiles for gap fabricated by Ga FIB with beam diameters 
(FWHM) of (a) 4 nm and (b) 12 nm. (c) Gap width as a function of dose normalized to the critical 
dose to cut films. The film is composed of 27-nm-thick Au and 1.6-nm-thick Ti layers deposited on 
Si0 2 (Reprinted from Nagase, Gamo, Kubota, Mashiko (2005). Copyright 2005, Elsevier) 


Although the angular dependence of the etching rate was taken into account, it 
seemed to induce almost a negligible effect on the estimated pattern profiles. The 
relative etching rate ratio for the Au to Ti films was taken to be 8, the value of which 
was observed in the present experiment. Figure 3a and b show the patterns that were 
formed successively with an increasing dose using FIB single scanning with beam 
diameters (FWHM) of 4 and 12 nm, respectively. The calculated gap width as a 
function of the dose for each beam diameter is shown in Fig. 3c. In Fig. 3c, the dose 
is normalized to the minimum dose where the bottom Ti layer is removed. 

From the simulation, it can be expected that a narrow gap with a width narrower 
than 2 or 4 nm is formed using FIB with a beam size of 4 or 12 nm, respectively. It 
can also be seen from Fig. 3c that gaps with a width much smaller than the beam 
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size can be formed if the dose is controlled precisely. With a 12-nm beam, a gap 
with a width of ~3 nm is expected at a dose 5 % larger than the critical dose, while 
an increase of another 4 % yields a 40 % increase in the gap width. Therefore, in situ 
monitoring of the etching step and automatic beam blanking based on monitoring 
are indispensable for the formation of nanogap structures with a width much 
narrower than the beam spot size. It can also be expected that a narrower beam 
would have the advantage of larger process latitude. 

Nanogap Electrode Fabrication 
In Situ Monitoring of Etching Steps 

Typical current traces in some Au/Ti double-layered samples observed during FIB 
etching are shown in Fig. 4. In this experiment, the thickness of the Au film was 
~30 nm. Figure 4a shows a current trace measured during the nanowire formation step. 


Fig. 4 Typical current traces 
measured during etching 
steps in Au/Ti double-layered 
sample. The thickness of the 
Au him was ~30 
nm. (a) Current measured 
during nanowire formation 
etching, (b) Current measured 
during nanogap formation 
etching. Beam was blanked at 
a current of ~120 nA. Inset 
shows normalized observed 
and calculated currents during 
the gap etching as functions 
of normalized depth removed 
(Reprinted from Nagase, 
Gamo, Kubota, Mashiko 
(2005). Copyright 2005, 
Elsevier) 
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The small steps observed in the current trace were caused by interruptions in the 
etching, during which a prefabricated marker pattern was detected and beam drift 
was minimized. After the FIB irradiation began, the current initially remained 
unchanged and then began to decrease. At the initial stage of etching, the sample 
resistance was dominated by the resistance of the lead regions, which resulted in a 
constant current. When the remaining film thickness in the irradiated regions 
became sufficiently thin, the sample current was mainly determined by the resis- 
tance of the nano wire region. It can be seen from Fig. 4a that the current exhibited a 
rapid decrease after initially remaining constant and then decreased more gradually. 
The rapid decrease in the current trace corresponded to the final stage of the 
nano wire formation etching. The gradual decrease in the current after the rapid 
decrease reflected the narrowing and thinning of the nanowire caused by etching at 
wire edges due to FIB tail components. The nano wire formation etching was 
terminated manually after the gradual decrease was observed in the current trace. 

A current trace measured during the nanogap formation step is shown in Fig. 4b. 
Again, the two large steps caused by the interruptions in the etching for the drift 
correction are observed in the current trace. In the present step, the FIB etching was 
terminated by blanking the FIB automatically at a current level of ~ 120 nA, which was 
preset by V ref . The removal rate of the Au film for this example corresponded 
to ~3.8 nm/s or ~15 ML (monolayers)/s. The current decreasing rate (~1.2 pA/s) at 
the final etching stage suggests that the average remaining thickness should be ~1 to 
2 ML. The inset in Fig. 4b shows the normalized current trace observed without the 
interruptions in the etching, along with the simulated current as a function of the 
normalized depth removed (which is the same with the normalized etching time). The 
current trace during the nanogap formation step could be reproduced reasonably by the 
numerical simulation. A gradual decrease in current with a smaller slope by the Ti to 
Au etching rate ratio should be observed at the end of the sharp current decrease if the 
thin Ti adhesion layer had the same magnitude of conductivity as the bulk value. The 
absence of such a slower decrease in the present experiments may suggest that the 
conductance of the thin Ti films was very low and was negligible compared to that of 
the Au film. 


Monitoring Current Level 

It is important to adjust the preset current level that controls the blanking of the FIB 
for the reproducible fabrication of nanogap electrodes using the present method. It 
was observed that narrower nanogaps were formed when a higher blanking current 
was used, as could be expected. As an example, Fig. 5a shows an SEM image of a 
nanogap electrode fabricated using a relatively higher blanking current of ~230 nA 
in a Au/Ti sample. Using the present blanking, a ~3-nm wide gap was formed, and 
the average remaining thickness at the time of the blanking was estimated to be 2 to 
3 ML. However, we found that nanogaps fabricated with a blanking current higher 
than 100 nA were often broken after the current-voltage (I-V) characteristic 
measurements of the gap. The current density of the blanking current was estimated 
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Fig. 5 SEM images of nanogap electrodes fabricated with (a) high blanking current of ~230 nA 
and (b) low blanking current of ~40 nA in Au/Ti samples. The images in (a) and (b) were observed 
before and after I-V characteristic measurements, respectively. The thickness of the Au films was 
~30 nm (Reprinted from Nagase, Gamo, Kubota, Mashiko (2005). Copyright 2005, Elsevier) 
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to be ~ 10 A/nr, which is more than an order of magnitude lower than the current 
level needed to induce electromigration in Au films (Durkan et al. 1999). The break 
might have been caused by a discharge of static electricity during sample handling 
before the I-V characteristic measurement, electromigration during the I-V charac- 
teristic measurement, or by a combination of Joule heating and residual stress in the 
films. Figure 5b shows an SEM image of a nanogap electrode fabricated with a 
relatively low blanking current of ~40 nA in the Au/Ti sample and observed after 
the I-V characteristic measurements. The figure indicates that a ~5-nm wide gap 
was formed without a break. Using the present method, nanogaps with a width 
narrower than half of the FIB spot size (~12 nm) can be reproducibly obtained. 


Electrode Thickness 

Nanogaps with thinner electrodes are advantageous because the number of mole- 
cules that bridge electrodes is expected to decrease, which makes it possible to 
measure the conductivity of this smaller number of molecules connected in parallel. 
We examined the feasibility of fabricating thin electrode films by using Au/Ti 
double layer films with a thin Au electrode film with a thickness of ~10-15 
nm. Figure 6a shows an SEM image of a fabricated nanogap electrode. A nanogap 
with a width of ~7 nm was successfully formed. However, electrodes were fre- 
quently broken during nano wire formation etching, as shown in Fig. 6b. In this 
experiment, the gap was formed accidentally before the nanogap formation step. 
This was because the effect of etching at the wires due the FIB tail and the thickness 
fluctuation caused by the grain became significant compared with thick Au films, 
and the thinnest region was first broken during the nanowire etching process. It 
should be noticed that this break was not caused by a complete removal from the 
etching but rather by a coupled effect that included the monitoring current and 
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Fig. 6 SEM images of electrodes fabricated in Au/Ti samples of thin Au films with ~10-15 nm 
thickness, (a) A ~7-nm wide nanogap was formed, (b) A nano wire was broken during the nanowire 
formation etching (Reprinted from Nagase, Gamo, Ueda, Kubota, Mashiko (2006b). Copyright 

2006, SPIE) 

residual film stress. To reproducibly fabricate nanogap electrodes in thin electrode 
films, precise adjustments of the etching time and FIB focusing are needed. 


Improvement of Fabrication Reliability 

An unintentional defocusing of the FIB enhances the narrowing and thinning of 
nanowires and degrades the reliability in the nanogap fabrication using Au/Ti double- 
layered samples. The reliability of nanogap fabrications can be improved using 
Ti/Au/Ti triple-layered samples. The representative current traces observed during 
nanowire formation steps in Au/Ti and Ti/Au/Ti samples are shown in Fig. 7a. The 
thickness of the Au films was ~30 nm. In these fabrications, rectangle-shaped patterns 
were used to form nanowires instead of the U-shaped patterns in order to obtain the 
side view image of nanogap electrodes. The increase of the etching time observed for 
the nanowire formation in the Ti/Au/Ti sample was due to the etching of the top Ti 
film that has much less etching rate than Au. It can be seen from Fig. 7a that the 
decrease rate of the currents after the nanowire formation (rapid decrease of current) 
in the Au/Ti samples differed among two traces. The higher decrease rate indicates 
that the thickness and width of the nano wire are decreased by the etching. This should 
occur by FIB tail component which may vary depending on operation conditions such 
as focusing and vacuum in the optics column. We also see that that the current 
decrease caused by the narrowing and thinning of the nanowire was considerably 
reduced in the Ti/Au/Ti sample. The current traces during the nanogap formation 
steps are shown in Fig. 7b. The current in the Au/Ti sample decreased immediately 
after FIB irradiation, while the current in the Ti/Au/Ti sample showed gradual 
decrease and then rapid decrease. The gradual decrease reflects the etching of the 
top Ti film, indicating that the Ti film remained after the nanowire formation etching. 
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Fig. 7 Current traces 
measured during the etching 
steps in the Au/Ti and Ti/Au/ 
Ti electrode samples. 

(a) Currents measured during 
nanowire formation steps. 
The current traces of the 
Au/Ti samples were slightly 
shifted downwards to avoid 
overlapping of the traces 
(the initial levels of original 
currents were almost the 
same), (b) Current traces 
measured during nanogap 
formation (Reprinted from 
Nagase, Gamo, Kubota, 
Mashiko (2006a). Copyright 
2006, Elsevier) 
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Fig. 8 SEM images of nanogap electrode fabricated in Au/Ti double-layered sample, (a) Top 
view image. The gap width is ~5 nm and wire width is ~40 nm. (b) Side view image. The tilt angle 
is 70° (Reprinted from Nagase, Gamo, Kubota, Mashiko (2006a). Copyright 2006, Elsevier) 
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Fig. 9 SEM images of nanogap electrode fabricated in Au/Ti double-layered sample, (a) Top 
view image. The gap width is ~6 nm and wire width is ~33 nm. (b) Side view image. The tilt angle 
is 70° (Reprinted from Nagase, Gamo, Kubota, Mashiko (2006a). Copyright 2006, Elsevier) 
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Fig. 10 SEM images of nanogap electrode fabricated in Ti/Au/Ti triple-layered sample, (a) Top 
view image. The gap width is ~3 nm and wire width is ~50 nm. (b) Side view image. The tilt angle 
is 70° (Reprinted from Nagase, Gamo, Kubota, Mashiko (2006a). Copyright 2006, Elsevier) 

Figure 8 shows SEM images of a nanogap electrode fabricated in the Au/Ti 
double-layered sample. The side view image was obtained by rotating the sample 
at a 70° angle with respect to the substrate normal. The current traces are shown in 
Fig. 7 by solid lines. As can be seen in Fig. 8a, a nanogap with a width of ~5 nm was 
formed. Using the Au/Ti double-layered samples with ~30-nm thick Au, we 
succeeded in the fabrication of nanogaps with widths of ~5-8 nm. However, gaps 
wider than 10 nm were sometimes observed. Hence, the fabrication yield for gaps of 
<10 nm was reduced to ~50 %. Another example of a nanogap electrode fabricated in 
the Au/Ti double-layered sample is shown in Fig. 9. The current trace observed for 
this fabrication is shown in Fig. 7 by the bold line. From the comparison of Figs. 8 
and 9, it can also be seen that the effect of the FIB tail was clearly extended and the 
width and thickness of the nanowire were considerably reduced in this sample. The 
enhancement of the narrowing and thinning of a nanowire caused by the FIB blur 
presumably prevents the reproducible fabrication of narrow nanogaps. 
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Fig. 1 1 Histograms of resistances of nanogap electrodes fabricated in Au/Ti samples (a) and 
Ti/Au/Ti samples (b) (Reprinted from Nagase, Gamo, Kubota, Mashiko (2006a). Copyright 2006, 
Elsevier) 


Figure 10 shows SEM images of a nanogap electrode fabricated in the Ti/Au/Ti 
triple -layered sample. It is seen in Fig. 10a that an ~50-nm wide nanowire with an 
~3-nm wide nanogap was formed. Although both edges of the nanowire were etched 
because of the FIB tail, no apparent narrowing and thinning of the nanowire were 
observed in most electrodes fabricated in Ti/Au/Ti samples. These results indicate that 
the excess etching due to the FIB blur at the edges of Au/Ti nanowires is suppressed by 
the top Ti film. In the Ti/Au/Ti samples, gaps of ~3-6 nm were reproducibly 
fabricated. The fabrication yield increased to ~90 %. By using the Ti protective 
layer, the controllability of the nanogap fabrication steps was considerably improved. 


Gap Resistance 

To measure the electrical properties of single molecules, an electrical insulating 
resistance of a nanogap electrode higher than 1 G£2 is desirable. We measured the 
electrical resistances of the fabricated nanogap electrodes. Figure 11a shows a 
histogram of the resistances of 14 nanogap electrodes fabricated in the Au/Ti 
samples. It was found that the resistances were scattered. The highest resistance 
observed in the electrodes was ~80 G£2, while most of the electrodes showed 
resistances higher than a few G£2. Figure 1 lb shows a histogram of the resistances 
of 19 nanogap electrodes fabricated in the Ti/Au/Ti samples. Most of the electrodes 
exhibited resistances of 10 G to 1 T£2, which were more than one order of 
magnitude higher than those fabricated in the Au/Ti samples. The higher resistances 
of the Ti/Au/Ti nanogap electrodes were most likely caused by the decrease in the 
influence of the redeposition of the sputtered Au ions on the Si0 2 surface around the 
tips of the electrodes during the nanogap formation step. This is because a higher 
dose was required to form nanogaps in the triple layers. At the higher dose, a deeper 
groove was formed around the tips of the electrodes, and the redeposition around 
this groove region was reduced as a result of the shadowing effect. 
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Fig. 12 I-V characteristics of Au colloidal nanoparticle on Ti/Au/Ti nanogap electrode, (a) I-V 
characteristics measured at different temperatures between 50 and 300 K. The open circles 
represent the I-V characteristic of the Ti/Au/Ti nanogap electrode without Au colloidal 
nanoparticles measured at room temperature, (b) I-V characteristic measured at 4.2 K (Reprinted 
from Nagase, Gamo, Kubota, Mashiko (2006a). Copyright 2006, Elsevier) 


Application to Electrical Measurement of Nanosized Objects 

To demonstrate the applicability of the fabricated nanogap electrodes to the elec- 
tron transport studies of nanometer- sized objects, we measured the electrical 
characteristics of Au colloidal nanoparticles. The samples were prepared by placing 
a drop of commercial Au colloidal nanoparticles with a size of ~5 nm in solution 
(Poly science, Inc.) on the Ti/Au/Ti nanogap electrodes. Since these Au 
nanoparticles are covered with insulating organic ligands, a single electron tunnel- 
ing behavior is expected for such a device at low temperatures. Figure 12a shows 
the temperature dependences of the I-V characteristics of the fabricated device. In 
the figure, the I-V characteristic of the Ti/Au/Ti nanogap electrode (without Au 
colloidal nanoparticles) measured at room temperature is also shown. The insulat- 
ing characteristic of the nanogap electrodes stayed in the voltage range of ±1 V, 
and no increase in the current from applying a high electrical field was observed in 
this range, which was consistent with the previous result in sub-3-nm gap electrodes 
(Lambert et al. 2003). The I-V characteristics were dramatically changed by 
depositing the Au nanoparticles on the nanogap electrode. It can be seen that the 
linear I-V characteristic of the fabricated device became nonlinear with decreasing 
temperature. A similar behavior was typically observed in colloidal nanoparticles 
(Bezryadin et al. 1997; Liu et al. 2002). This result indicates that the Au 
nanoparticles were separated from the two lead electrodes with organic ligands; if 
the Au nanoparticles contacted the electrodes directly, the temperature dependence 
of the I-V characteristic should exhibit a metallic behavior. The thermally activated 
I-V characteristics suggest that the hopping transport may dominate near room 
temperature. Figure 12b shows the I-V characteristic of the device measured at 
4.2 K. This I-V characteristic exhibits a suppression of the current at a low bias 
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voltage, indicating that the coulomb blockade effect dominates the electron trans- 
port. The observed characteristics seem to be consistent with a rough estimate based 
on the capacitance of the Au particle, suggesting single electron tunneling through 
the Au nanoparticle. The results show the validity of using the nanogap electrodes 
fabricated by our method for the electrical characterizations of nanostructures such 
as single molecules. 


Summary 

Maskless methods for fabricating nanogap electrodes by using Ga FIB sputter 
etching were investigated. In the presented methods, the etching steps were mon- 
itored in situ by measuring the current fed to patterned metal films and terminated 
electrically at a preset current level. Our work demonstrated that nanogaps with a 
width much narrower than the beam size could be fabricated by precisely termi- 
nating the etching step. Using double-layered Au/Ti films, nanogaps with widths of 
~5-8 nm were reproducibly obtained by FIB with a spot size of ~12 nm. The 
controllability of the fabrication steps was considerably improved by using Ti top 
layers to prevent the etching of the underlying Au films as a result of the FIB tails. 
In the triple-layered Ti/Au/Ti films, nanogaps of ~3-6 nm were reproducibly 
generated, and the fabrication yield was increased to ~90 %. Most of the fabricated 
nanogap electrodes showed high insulating resistances, ranging from 1 G£2 to 1 T Q. 
The applicability of the fabricated nanogap electrodes to the electron transport 
studies of nanometer- scale devices was demonstrated by the electrical measure- 
ment of Au colloidal nanoparticles. The proposed simple and highly reproducible 
fabrication methods would be useful for systematically investigating the electrical 
properties of single molecules. 
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Abstract 

Atmospheric -pres sure plasma manufacturing is a very promising technique 
fabricate optical components and substrates for electronic devices with high 
form accuracy and high efficiency. The thickness correction of SOI and quartz 
crystal wafers by numerically controlled atmospheric-pressure plasma etching, 
which named numerically controlled plasma chemical vaporization machining 


K. Yamamura (ISI) 

Research Center for Ultra-precision Science and Technology, Graduate School of Engineering, 
Osaka University, Suita, Osaka, Japan 
e-mail: yamamura@upst.eng.osaka-u.ac.jp 

Y. Sano 

Department of Precision Science and Technology, Graduate School of Engineering, Osaka 
University, Suita, Osaka, Japan 
e-mail: sano@prec.eng.osaka-u.ac.jp 


© Springer-Verlag London 2015 

A.Y.C. Nee (ed.), Handbook of Manufacturing Engineering and Technology , 
DOI 10.1007/978-1-4471-4670-4 68 


1529 


1530 


K. Yamamura and Y. Sano 


(NC-PCVM), enabled us to obtain thickness uniformity with nanometer-level 
accuracy without introducing subsurface electronic defects. A numerically con- 
trolled sacrificial oxidation process using a multielectrode array system demon- 
strated its potential for realizing the high-throughput thickness correction of SOI 
wafers. A 4H-SiC (0001) surface, which is a difficult-to-machine material 
because of its hardness and chemical inertness, was processed by plasma- 
assisted dry polishing using a Ce0 2 abrasive, and an atomically smooth step 
and terrace structure without lattice strain was obtained. 


Introduction 


There are many types of substrates used in electronic devices whose thickness 
should be uniform. The thickness uniformity of silicon substrates used for large- 
scale integrated circuits (LSIs) should be less than the depth of focus of the 
exposure equipment. The numerical aperture (NA) of exposure equipment is 
expected to increase as the size of transistors in LSIs decreases, which will result 
in the requirement of more uniform thickness. According to the International 
Technology Roadmap for Semiconductors (ITRS) 2011 (International Technology 
Roadmap for Semiconductors 2011), the site front least-square range (SFQR) 
specified for each year is less than half of the dynamic random access memory 
(DRAM) half pitch, for example, it is expected to be less than 13 nm in 2020. 

Silicon-on-insulator (SOI) substrates are used for high-performance LSIs. An 
SOI substrate has a very thin single-crystal silicon layer on a thin buried oxide 
(BOX) layer formed on a silicon substrate. The thickness uniformity of the silicon 
layer directly affects the threshold voltage of each transistor in an LSI and should be 
within ±5 %. A thinner silicon layer will be required as the transistor size in LSIs 
becomes smaller, and a thickness uniformity of less than ±0.35 nm will be required 
in 2020 according to ITRS 2011. 

Moreover, the thickness uniformity of quartz substrates for quartz resonators 
used in computers or in mobile telecommunication devices is also very important 
because multiple resonators are fabricated on a single quartz wafer by a lithography 
process and the thickness of the wafer directly affects their resonance frequencies. 
The resonance frequency of a quartz crystal resonator is inversely proportional to 
the thickness of the quartz crystal wafer, and the tolerance of the thickness variation 
in commercial products is ±10 ppm. Therefore, the initial thickness of the quartz 
wafer must be uniform to reduce the time required to adjust the resonance fre- 
quency for individual resonators in the final trimming process. 

Improving the surface integrity, i.e., reducing the roughness and subsurface 
damage, of semiconductor materials is also important for the fabrication of 
high-performance electronic devices. Silicon carbide (SiC) is a promising next- 
generation semiconductor power device material for high-power and high- 
temperature applications owing to its excellent properties such as wide energy 
band gap, excellent thermal conductivity, high saturated electron drift velocity, 
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and good chemical stability. To apply SiC as a material for high-performance 
electronic devices, an atomically smooth and damage-free surface is essential, but 
SiC is one of the most difficult-to-machine materials because of its hardness and 
chemical inertness. In the present machining process for SiC wafers, a crystal ingot 
grown by a modified Lely method is first sliced by a wire saw and then planarized 
by lapping. Because of the high hardness of SiC, diamond abrasives are generally 
used in the slicing and lapping processes. These mechanical removal processes 
inevitably introduce micro scratches and subsurface damage to the workpiece sur- 
face, which deteriorate surface integrity. 

To prepare SiC wafers with a smooth surface, chemical mechanical polishing 
(CMP) is now widely used as a finishing process for the surfaces of single-crystal 
SiC and/or GaN substrates. Zhou et al. proposed the process of colloidal silica 
polishing, which requires a concentrated colloidal silica slurry at temperatures 
higher than 55 °C and a pH value higher than 10 (Zhou et al. 1997; Neslen 
et al. 2001). Surfaces finished by CMP have no subsurface damage, as evaluated 
by cross-sectional transmission electron microscopy (XTEM), and CMP leads to 
the formation of a high-integrity epitaxial growth layer in terms of surface mor- 
phology (Zhou et al. 1997; Neslen et al. 2001; Saddow et al. 2001). However, the 
material removal rate of CMP is very low (less than 0.5 pm/h); hence, a novel 
highly efficient finishing technique that will generate an atomically smooth surface 
without introducing subsurface damage is crucial for improving the quality of SiC 
wafers. 

To satisfy these requirements, we have developed some ultraprecision fabrica- 
tion techniques utilizing atmospheric -pres sure plasma. In this chapter, the method- 
ologies and experimental results of thickness correction and surface finishing for 
functional materials using the developed methods are described. 


Numerically Controlled Plasma Chemical Vaporization Machining 

Deterministic Correction of Thickness Distribution by Direct Dry 
Etching 

Thickness correction by numerically controlled plasma chemical vaporization 
machining (NC-PCVM) is performed in accordance with the following procedures: 
(1) The thickness distribution of the specimen is measured by an optical or 
electrical method. (2) The thickness error distribution is calculated by subtracting 
the target thickness from the measured thickness. (3) Since the removal or oxidation 
volume is proportional to the dwelling time of the plasma, the distribution of the 
scanning speed of the X-Y table is determined by the deconvolution of the thickness 
error distribution and the removal function that is a removal spot obtained in unit 
time. (4) Numerically controlled thickness correction is performed by raster scan- 
ning, as shown in Fig. 1, to deterministically obtain a wafer of uniform thickness 
(Mori et al. 2000a, b; Yamamura et al. 2008). In this correction process, the 
temporal stability and reproducibility of the removal function are superior to 
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Fig. 1 Correction procedures 
of thickness error of the 
substrate by controlling the 
scanning speed of the 
localized plasma 
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those of the mechanical machining process. This is because there is no wear of the 
electrode and the process gases are uniformly supplied to the plasma region, 
whereas in the mechanical removal process, the degradation of the tool head, 
which may be a grinding wheel or polishing pad, and the unevenness of the 
polishing pressure deteriorate the surface integrity. Furthermore, the removal 
function in NC-PCVM is insensitive to external disturbances, such as the vibration 
and thermal deformation of the machine and specimen owing to the noncontact 
removal or oxidation mechanism. Therefore, the PCVM equipment does not require 
high stiffness, in contrast to conventional mechanical machining equipment, and 
consequently, these processes realize cost-effective ultraprecision thickness 
correction. 


NC Thinning and Correction of SOI Thickness 

The NC-PCVM equipment, which includes a spherical rotary electrode, is used for 
thinning the silicon layer of SOI substrates (Mori et al. 2004). An atmospheric- 
pressure gas mixture of He:CF 4 :0 2 = 99.98:0.01:0.01 is filled in the chamber and 
a localized plasma is generated by supplying very high-frequency (VHF) power 
(/= 150 MHz) between the electrode and the wafer. Commercially available 6-in. 
and 8-in. SOI wafers were used in our experiments. The thickness of the top silicon 
layer was measured by a spectroscopic ellipsometer. 

A commercially available 6-in. SOI wafer with a thickness of 201.6 nm and a 
thickness variation of ±4.1 nm was thinned and uniformized to 13.0 nm and ±2.0 
nm, respectively (Mori et al. 2004). After improving the stability of the parasitic 
capacitance of the apparatus, a commercially available 8-in. SOI wafer with a 
thickness of 97.5 nm and thickness variation (standard deviation) of 2.4 nm was 
thinned and improved to 7.5 and 0.38 nm, respectively, as shown in Fig. 2 (Sano 
et al. 2007). 

Since RF power is applied directly between the electrode and the wafer in 
PCVM, the degradation of the processed surface by ion bombardment was of 
concern. Thus, we attempted to fabricate MOSFETs on SOI wafers thinned by 
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Fig. 2 Thickness distributions of SOI layer of 8" SOI wafer before and after thinning by 
NC-PCVM 

both PCVM and conventional sacrificial oxidation and compared their perfor- 
mance. A commercial 8-in. SOI wafer was thinned by PCVM to a thickness of 
63.8 nm. Another SOI wafer as a reference was thinned by a combination of 
sacrificial oxidation and wet etching of the oxide layer. MOSFETs were fabricated 
on the entire area of these wafers using 0.35-pm-process technology (Adan 
et al. 1996; Azuma et al. 1995). Figure 3 shows the I ds (source-drain current)-V g 
(gate voltage) curves for the SOI n-MOSFETs. The curves for the MOSFET on the 
wafer fabricated by PCVM and the reference MOSFET are shown by filled squares 
and open squares, respectively. The leakage currents of the MOSFETs on both 
wafers were equally small. The subthreshold slopes were steep and almost equal. 
These results were the same for all other transistors on both wafers. Thus, it was 
shown that MOSFETs on an SOI wafer processed by PCVM operate normally. This 
means that PCVM induces not only no ion bombardment damage because of the 
small mean free path of the gas molecules in the atmospheric-pressure plasma but 
also no trace metal contamination, which degrades MOSFET performance. 


Thickness Correction of Quartz Crystal Wafer 

Open-air-type NC-PCVM equipment was used to correct the thickness distribution 
of a quartz crystal wafer (Yamamura et al. 2008; Ueda et al. 2010). A plasma- 
generating unit is constructed from a coaxially arranged electrode and dielectric 
cover, and helium-based process gas is supplied to the electrode tip through the 
space between the electrode and dielectric cover. The replacement of the air in the 
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Fig. 3 Ids - V g curves for SOI 
n-MOSFETs on PCVM- 
processed wafer and 
reference wafer. The gate 
length is 0.35 pm and the gate 
width is 10 pm 
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vicinity of the electrode tip, which is surrounded by the dielectric cover and the 
specimen, with a helium-based process gas enables the generation of a stable glow 
discharge plasma at atmospheric pressure without any vacuum operation. The 
diameter of the electrode, which was made of aluminum alloy, is 3 mm. The 
composition and flow rate of the process gas are controlled by mass flow control- 
lers. The plasma is generated between the electrode tip and the quartz crystal wafer 
by applying a radio-frequency (RF) electric field (/ = 13.56 MHz). An increase in 
the electric power to the localized plasma to increase the etching rate tends to cause 
a breakage or the formation of twins in the quartz crystal wafer by thermal stress 
because quartz crystal is brittle and has low thermal conductivity. Therefore, a 
function generator is built into the RF power supply to decrease the surface 
temperature of the quartz crystal wafer by generating pulse-modulated plasma 
(Ueda et al. 2010; Yamamura et al. 2009). The quartz crystal wafer is held by a 
vacuum chuck, which is installed on the X- Y table along with a heater. The relative 
position and scanning speed between the electrode and the quartz crystal wafer are 
controlled by the X-Y table, and motion in the Z-direction is driven by AC 
servomotors. The thickness distribution of the wafer is measured by a resonance- 
frequency measurement system constructed from a noncontact electrode unit and a 
network analyzer. The frequency measurement point of the wafer is placed between 
a pair of probe electrodes with a gap distance of 30 pm. The resonance frequency of 
a small localized area of the wafer, which corresponds to the area of the probe 
electrode with a diameter of 3 mm, is measured by determining the capacitive 
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Fig. 4 Thickness distribution of AT-cut wafer (a) before and (b) after correction by NC-PCVM 

coupling between the wafer and the electrodes. The localized thickness of the wafer 
t between the probe electrodes is converted from the resonance frequency/by using 
the relationship t (pm) = 1670/f (MHz). The removal volume distribution on the 
wafer, which represents the distribution of the volume to be removed, is calculated 
by subtracting the target thickness from the measured thickness distribution. 

A double-sided polished AT-cut quartz crystal wafer with an average thickness 
of 100 pm was used as a specimen to evaluate our developed NC-PCVM technique. 
The flow rates of the reactive gases and the machining gap were He:CF 4 :0 2 = 
250:20:4 (cc/min) and 250 pm, respectively. Figure 4a shows the thickness distri- 
bution of the as-received wafer. The thickness of the quartz crystal wafer was 
converted from the resonance frequency, which was measured with a 4 mm pitch. 
Figure 4b shows the thickness distribution after correction by NC-PCVM. The 
thickness error of the wafer decreased from 122.6 nm peak-to-valley (p-v) to 
14.9 nm p-v over an area of 24 x 24 mm . The total correction time was 
107 s. This result shows that the thickness uniformity required for final ion-beam 
trimming was successfully achieved. Figure 5a, b show histograms of the thickness 
deviation on the wafer from the average thickness before and after NC-PCVM, 
respectively. The standard deviation of the thickness deviation decreased from 33.2 
to 3.2 nm. 

The thickness correction improved resonant characteristic as shown in Fig. 6. 
Before thickness correction, many unwanted spurious peaks were observed near the 
main peak of the resonance curve. In contrast, there were no spurious peaks around 
the main peak for the corrected wafer. Figure 7 shows the thickness deviations of 
the wafer before and after thickness correction, where filled symbols indicate points 
where the resonance curve was measured. The thickness variations at the measure- 
ment points of the wafer before and after correction were 34.9 and 4.6 nm/mm, 
respectively. It is assumed that the improvement of the parallelism in the area 
facing the measurement probe reduced the number of spurious peaks. 
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Fig. 5 Histogram of thickness deviation to the 
wafer (a) before and (b) after correction are 33.2 
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Fig. 6 Resonance curve of 
the AT-cut quartz crystal 
wafer before and after 
correction of the thickness 
distribution 
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The Q factor of a crystal oscillator manufactured using the wafer processed by 
NC-PCVM was approximately 1 x 10 5 , which is equivalent to that of a conven- 
tional commercial oscillator; thus, there was no characteristic degradation of the 
oscillator due to plasma irradiation. In the case of the wafer processed by a reactive 
ion etching (RIE), large spurious peaks can be observed near the main peak 
(Nagaura and Yokomizo 1999). It is considered that these spurious peaks, which 
result in a decrease of the Q factor and a change in the resonance frequency of the 
oscillator, are caused by subsurface damage formed by the collision of high-energy 
ions. In contrast, the kinetic energy of ions in PCVM is very low because of the 
small mean free path under the atmospheric -pres sure condition. In our previous 
work, the measurement of surface photovoltage (SPV) revealed that the surface 
defect density of a single-crystal silicon wafer processed by PCVM is equivalent to 
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Fig. 7 One-dimensional 
thickness deviations of the 
AT-cut quartz crystal wafer 
before and after thickness 
correction 
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that of a surface processed by chemical etching using a mixture of hydrofluoric acid 
and nitric acid. On the other hand, a surface processed by mechanical polishing and 
argon ion sputtering had a large defect density of more than two orders of magni- 
tude greater than that of the surface processed by PCVM (Mori et al. 2000b). 

These results show that numerically controlled open-air-type PCVM is a prom- 
ising process for improving the thickness uniformity of the AT-cut quartz crystal 
wafers without degrading the resonance properties. 


Numerically Controlled Sacrificial Oxidation 
Concept of NC-SO 


Numerically controlled sacrificial oxidation (NC-SO) consists of two steps: numer- 
ically controlled oxidation followed by hydrofluoric acid treatment (Sano 
et al. 2008a). Atmospheric -pres sure plasma is used for the oxidation. When plasma 
is generated at atmospheric pressure in the presence of oxygen gas, only the area 
that faces the localized plasma is oxidized, which results in the formation of an 
oxidation layer with an arbitrary thickness distribution by controlling the dwell time 
at each location on the sample. Therefore, NC-SO can be realized by controlling the 
dwell time of plasma exposure on each part of the surface. After oxidation, the 
desired surface is revealed when the sample is dipped into HF solution. An HF 
cleaning process is generally used as a final cleaning process in silicon wafer 
production. However, this final HF treatment is considered not to be necessary in 
NC-SO since it is performed simultaneously with the removal of the oxidation layer 
formed in this technique. Although NC-SO is a removal process using plasma, there 
is no need for fluorine-containing gases and no waste gases are generated. The 
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Fig. 8 Concept of raster- 
scan-type and electrode- 
array-type NC processing 
system, (a) raster- scan-type 
system, (b) multi electrode - 
array-type system 
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oxidation rate depends on the thickness of the oxide layer and rapidly decreases 
when the thickness reaches several tens of nanometers. Thus, this method is suitable 
for finishing processes that require the removal of a small amount of material. 


NC Correction of SOI Thickness Using Raster Scan System 

Figure 8a shows the concept of raster scan-type NC processing, which can be 
realized by controlling the plasma dwelling time by the raster scan of a single 
electrode with controlled velocity. A commercially available p-type 300-mm SOI 
wafer was used to evaluate the proposed method. The thickness of the top silicon 
layer was approximately 60 nm, the thickness of the buried oxide layer was 145 nm, 
the crystal orientation was (100), and the resistivity was 8-12 Q cm. Gas with 
composition He:0 2 = 98:2 was used to fill the chamber to atmospheric pressure 
after evacuating air from the chamber. The gap between the electrode and the 
specimen was 0.5 mm, and the supplied RF power was 750 W. The thickness of the 
top silicon layer before and after the process was measured by reflection 
spectroscopy. 

Figure 9a shows the thickness distribution of the silicon layer of an as-received 
300-mm SOI wafer, and Fig. 9b shows the thickness distribution of the wafer 
processed by NC-SO (Sano et al. 2008b). The dispersion of the thickness and the 
standard deviation of the thickness variation of the silicon layer were, respectively, 
improved from ±1.4 nm (p-v, 2.8 nm) and 0.33 nm to ±0.45 nm (p-v, 0.9 nm) and 
0.14 nm over a circular area with an edge exclusion (EE) of 10 mm. 


41 Plasma-Based Nanomanufacturing Under Atmospheric Pressure 


1539 


Fig. 9 Silicon layer 
thickness dispersion of 
300 mm SOI wafer before and 
after processing, (a) 
As-received wafer, (b) 
processed wafer by NC 
sacrificial oxidation 
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Fig. 10 Schematic of NC processing system using multi electrode-array system 


NC Correction of SOI Thickness Using Electrode Array-Type System 

Figure 8b shows the concept of electrode array-type NC processing, which can be 
realized by controlling the plasma generation time of each electrode in an array of 
electrodes instead of the raster scan of a single electrode (Kamisaka et al. 2010). 
The plasma area can be easily separated by separating the electrodes because of the 
small mean free path of the gas molecules under atmospheric pressure. Since the 
initial gap between the sample and the electrode is approximately 1 mm, the electric 
field intensity decreases by half upon changing the gap from 1 to 2 mm. Thus, the 
plasma can be easily switched on and off by small displacements of the electrode. 
Figure 10 shows a schematic of the NC processing system using the electrode array 
system. In this system, a quartz wall between the electrode array and the wafer 
forms a chamber for gas replacement with a very small volume, which means that 
supplied process gas is sufficient to replace it with air; thus, there is no need to use a 
vacuum pump. A prototype apparatus with 40 electrodes that covered one-sixth of 
the area of an 8" wafer was developed. A commercially available p-type 8" SOI 
wafer was used to demonstrate the proposed method. NC plasma oxidation was 
performed in a He:0 2 = 99:1 atm with a maximum RF power of 300 W. 

Figure 1 1 shows the thickness distribution of one-sixth of the area of a commer- 
cially available 8-in. SOI wafer before and after NC thickness correction of the SOI 
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Fig. 11 Silicon layer 
thickness dispersion of 1/6 
area of 8" SOI wafer, (a) 
As-received wafer, (b) 
processed wafer by electrode - 
array-type NC sacrificial 
oxidation 
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layers by using the electrode array system. The range and standard deviation of the 
thickness variation were improved from 5.9 and 1.5 nm to 3.4 and 0.8 nm, respectively; 
thus, it was shown that the electrode array system can be used for NC processing (Sano 
et al. 2010). In raster scan methods, the throughput should decrease with increasing 
wafer diameter because of the increase in the path length to be scanned. Thus, a faster 
scanning table will be required to maintain the throughput for larger wafers. On the 
other hand, in the electrode array system, there is no change in the throughput with 
increasing diameter of the wafer. Thus, it is considered that this system is effective for 
realizing a high-throughput simultaneous NC process for large-diameter wafers. 


Plasma-Assisted Polishing 
Concept of Plasma-Assisted Polishing 

In the PCVM process, a damage-free surface can be obtained because of its 
chemical removal mechanism (Mori et al. 2000b). However, the ability of PCVM 
to achieve atomic-level flatness is lower than that of conventional polishing because 
surface atoms are removed by isotropic etching under the atmospheric-pressure 
plasma condition. Yamamura et al. have recently proposed a novel dry polishing 
technique named plasma-assisted polishing (PAP) that combines the atmospheric- 
pressure plasma process with fixed abrasive polishing (Yamamura et al. 2011). In 
this technique, the irradiation of reactive plasma modifies the surface of a hard 
material to form a soft layer, and the subsequent polishing using a soft abrasive 
preferentially removes the soft layer. Atmospheric -pressure water vapor plasma is 
used to soften the 4H-SiC (0001) surface by oxidation in this study. The 
atmospheric-pressure plasma is generated by applying an RF (/ =13.56 MHz) 
electric power, and helium-based water vapor (ca. 2 %) with a flow rate of 1.5 
L/min is supplied as a process gas. Water vapor is introduced into the process gas by 
bubbling helium through ultrapure water (UPW), and its concentration is measured 
using a dew point meter (DPM). The copper electrode used for generating the 
plasma is covered with a closed-end quartz glass tube to prevent arc discharge 
through the generation of dielectric barrier discharge. In the mechanical removal 
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Fig. 12 Nanoindentation measurement results of 4H-SiC (0001) surface, (a) Load-displacement 
curves of the as-received and water vapor plasma irradiated surfaces, (b) Hardnesses calculated by 
using the Oliver-Pharr method 


part, which is separate from the electrode, a polishing experiment using a polishing 
film (#?8 mm) placed at the tip of a spindle was conducted. A specimen was placed 
on a rotary table, and surface modification by plasma irradiation and dry polishing 
using a polishing film were sequentially conducted. The coaxial electrode and 
polishing film were located at the same distance from the center of the rotary 
table. Thus, the polished field was not the whole surface of the specimen but a 
ring with a width of about 8 mm. We used a commercially available 2-in. n-type 
4H-SiC (0001) 0.29° off wafer with a specific resistance of 0.110 £2 cm as the 
specimen, and the polishing film contained Ce0 2 abrasive particles with an average 
grit size of 0.5 pm that were fixed on a polyethylene terephthalate (PET) film using 
a polyester polyurethane adhesive. In contrast to CMP, PAP is a dry process without 
the use of slurry and/or chemicals. Therefore, this technique is promising as an 
environmentally benign finishing process for difficult-to-machine materials. 


Finishing of 4H-SiC (0001) 

Nanoindentation measurements were conducted to evaluate the hardness of the 
surface modified by the irradiation of water vapor plasma (Lucca et al. 2010). 
Figure 12a shows load-displacement curves of the 4H-SiC surface measured by the 
nanoindentation method. A Berkovich- type indenter made of diamond was used, 
and the number of measurement points and the maximum load for each sample 
were 5 and 0.5 mN, respectively. The hardnesses of the surface shown in Fig. 12b 
were calculated by the Oliver-Pharr method (Oliver and Pharr 1992). From this 
figure, it was found that the irradiation of water vapor plasma reduces the average 
hardness of 4H-SiC from 37.4 to 4.5 GPa, indicating that the irradiation of water 
vapor plasma is effective for forming a soft layer on the hard SiC surface. 
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Fig. 13 XPS spectra of the processed 4H-SiC surfaces, which correspond to (a) Si2p, (b) Si2p 
(in detail), (c) Cls. (i) as-received, (ii) after irradiated water vapor plasma (iii) after PAP using 
Ce0 2 abrasive, (iv) after SPM cleaning followed by HF dipping 


Figure 13a-c show X-ray photoelectron spectra (XPS) of the 4H-SiC surface 
processed by PAP corresponding to Si2p, Si2p (in detail), and Cls, respectively. In 
each figure, (i), (ii), (iii), and (iv) denote the as-received surface, the surface after 
water vapor plasma irradiation without polishing, the surface after PAP, and the 
surface cleaned with a mixture of H 2 S0 4 and H 2 0 2 (SPM cleaning) after PAP, 
respectively. The composition of the SPM solution was H 2 S0 4 (97 wt%):H 2 0 2 
(30 wt%) =4:1, and the immersion time in the solution was 10 min. The SPM 
cleaning for the removal of organic and metal contaminants was followed by 
dipping in 25 wt% HF for 5 min to remove the oxide film. A peak corresponding 
to the Si-0 bond (102.9 eV) can be observed in Fig. 13a (ii), b (ii). The peaks 
observed at 101.5 and 283.9 eV were identified as corresponding to the interface 
oxide Si 4 C 4 _ x 0 2 (Hornetz et al. 1994). These results indicate that the irradiation of 
water vapor plasma oxidized the surface of SiC, and it is considered that the 
oxidation species in this reaction system are hydroxyl radicals because strong 
optical emission from excited hydroxyl radicals was observed in the optical emis- 
sion spectroscopy measurement of the plasma (Yamamura et al. 2011). Therefore, it 
is assumed that the formation of the oxide layer led to the decrease in hardness 
observed in the nanoindentation measurements. After PAP, the peak intensity of the 
Si-0 bond decreased as shown in Fig. 13b (iii), and a peak from C-H/C-C bonds on 
the surface (285.0 eV) was observed, as shown in Fig. 13c (iii). It is considered that 
the detection of the C-H/C-C peak is due to the contamination of the adhesive 
component of the polishing film or ambient. This contamination can be easily 
removed by SPM cleaning as shown in Fig. 13c (iv). Figure 13b (iv) shows that 
the residual oxide was completely removed by HF dipping after SPM cleaning. 

Figure 14 shows atomic force microscopy (AFM) images of 4H-SiC (0001) 
surfaces. In these figures, (al) and (a2) show the as-received surface, and (bl), 
(b2) and (cl), (c2) show the surfaces processed by PAP using Ce0 2 abrasive, respec- 
tively. The as-received surface has many scratches and is highly undulating. 
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Fig. 14 AFM images of 4H-SiC (0001) surface. As-received surface: (al) 13.40 nm p-v 1.31 nm 
rms (a2) 3.39 nm p-v 0.53 nm rms, After PAP with residual scratches: (bl) 2.26 nm p-v (b2) 0.20 nm 
rms, After PAP without scratch: (cl) 1.48 nm p-v 0.25 nm rms (c2) 1.74 nm p-v 0.26 nm rms 


In the initial stage of PAP smoothing, deep scratches remain as shown in Fig. 14b 1, b2. 
However, the rest of the surface is very smooth. Finally, as shown in Fig. 14cl, c2, the 
whole surface has an atomically flat step and terrace structure, which corresponds to the 
inclination of the crystal plane (0.29°), without any newly formed scratches. 




1544 


K. Yamamura and Y. Sano 


Fig. 15 Lattice constants of 
as-received and PAP 
processed 4H-SiC (0001) 
surfaces obtained by RHEED 
measurements. Inset is a 
RHEED pattern of the PAP 
processed surface 
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Subsurface damage, such as microcracks and lattice strain, remaining on the SiC 
substrate causes the degradation of quality of the SiO^SiC interface and epitaxial 
layer, which are formed to fabricate electronic devices. Therefore, an atomically 
smooth damage-free SiC substrate is essential for fabricating high-performance SiC 
power devices. The residual lattice strains of the surface observed before and after 
PAP were evaluated by reflection high-energy electron diffraction (RHEED) mea- 
surement. We evaluated two commercially available 4H-SiC wafers supplied by two 
different companies, which were sliced on-axis (0.29° tilt: sample 1) and off-axis 
(8° tilt: sample 2) from the (0001) basal plane. The acceleration voltage and beam 
current in the RHEED measurements were 15 kV and 20 pA, respectively. The inset 
of Fig. 15 shows the RHEED pattern of sample 1 after PAP. Kikuchi lines were clearly 
observed and the background intensity was lower than that of the as-received surface. 
These results indicate that both surface roughness and crystallinity are improved by 
PAP. Figure 15 shows the lattice constants of the two samples calculated from their 
diffraction spot patterns. The lattice constants of the surfaces processed by PAP 
approached the ideal a-axis value of 0.307 nm, which is indicated by a dashed line 
in Fig. 15, and positive strains (0.7-2.9 %) were observed in both as-received wafers. 

Figure 16a, b shows the XTEM images of the surface processed by PAP using 
ceria abrasive for 1 h. Oxide layer having amorphous structure is not observed on 
the PAP processed surface. Since the hardness of ceria is almost the same with that 
of Si0 2 (Shorey et al. 2000), the oxide layer formed by irradiation of water vapor 
plasma is preferentially removed by soft ceria abrasive as shown in Fig. 16a. It 
seems that moderate removal of the oxidized surface atom is very effective to 
obtain an atomically smooth surface. On the other hand, the deep contrast layer 
remains on the surface, and this layer is considered as an interface silicon 
oxycarbide layer (Si 4 C 4 _ x 0 2 ) Si0 2 (Hornetz et al. 1994). As a product of insuffi- 
cient oxidation, it is very reasonable to assume that the hardness of Si 4 C 4 _ x 0 2 layer 
is between that of SiC which is the base material and Si0 2 which is the sufficient 
oxidation product. As the hardness of ceria is almost the same with that of Si0 2 , 
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Fig. 16 XTEM images of the surface processed by PAP. (a) Low magnification image, (b) Close- 
up image and schematic of 4H-SiC structure 


only the top amorphous Si0 2 layer can be removed, and the silicon oxycarbide layer 
will remain on the surface. Figure 16b shows the close-up image of the PAP 
processed surface. A periodical well-ordered structure, which corresponds to the 
structure of 4H-SiC, is continuously observed from the bulk region to the top 
surface. These observation results lead to the conclusion that PAP technique 
enables us to obtain an atomically smooth surface of single-crystal SiC substrate 
without introducing crystallographical defect in the subsurface region. 

These results lead to the conclusion that PAP finishing makes it possible to 
obtain an atomically smooth surface on a single-crystal SiC substrate without 
introducing crystallographical subsurface damage. 


Summary 

We have developed deterministic thickness correction techniques and a finishing 
technique utilizing atmospheric-pressure plasma for fabricating next-generation 
electronic material substrates with high-integrity surfaces, and we have reported 
experimental results obtained by applying those techniques. 
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SOI and AT-cut quartz crystal wafers with nanometer-level thickness uniformity 
were obtained by an NC-PCVM correction process. We evaluated the performances 
of a MOSFET and an oscillator fabricated on SOI and quartz crystal wafers 
processed by NC-PCVM and found that the performances of these devices are 
equivalent to those of devices fabricated on substrates prepared by conventional 
production processes. We therefore conclude that there are few or no electronic 
defects on the processed surface introduced by NC-PCVM because the kinetic 
energy of charged particles in the atmospheric-pressure plasma is low. Further- 
more, NC-SO using an electrode array system showed the potential to correct the 
thickness error of SOI wafers with high throughput compared with the raster scan 
method. 

In the case of the PAP finishing of a 4H-SiC (0001) wafer using the soft Ce0 2 
abrasive, the irradiation of water vapor plasma reduced the hardness of the SiC 
surface by about one order of magnitude, enabling us to obtain an atomically 
smooth surface with a step and terrace structure without introducing scratches or 
crystallographical subsurface damage. 


References 

Adan AO, Kaneko S, Naka T, Urabe D, Higashi K, Kagisawa A (1996) Channel-drain lateral 
profile engineering for advanced CMOS on ultra-thin SOI technology. IEEE Int SOI 
Conf: 100-101 

Azuma K, Kishi A, Tanigawa M, Kaneko S, Naka T, Ishihama A, Iguchi K, Sakiyama K (1995) 
Application of Ti salicide process on ultra-thin SIMOX wafer. IEEE Int SOI Conf :30— 3 1 
Hornetz B, Michel H-J, Halbritter J (1994) ARXPS studies of Si0 2 -SiC interfaces and oxidation of 
6H SiC single crystal Si-(001) and C-(001) surfaces. J Mater Res 9:3088-3094 
International Technology Roadmap for Semiconductors (2011) Edition, Table FEP10. http://www. 

itrs.net/Links/201 1ITRS/201 lChapters/201 IFEP.pdf 
Kamisaka S, Yoshinaga K, Sano Y, Mimura H, Matsuyama S, Yamauchi K (2010) Improvement 
of thickness uniformity of silicon on insulator layer by numerically controlled sacrificial 
oxidation using atmospheric -pres sure plasma with electrode array system. Jpn J Appl Phys 
49:08JJ04 

Lucca DA, Herrmann K, Klopfstein MJ (2010) Nanoindentation: measuring methods and appli- 
cations. Ann CIRP 59:803-819 

Mori Y, Yamamura K, Sano Y (2000a) The study of fabrication of the X-ray mirror by numerically 
controlled plasma chemical vaporization machining: development of the machine for the X-ray 
mirror fabrication. Rev Sci Instrum 7 1 :4620-4626 
Mori Y, Yamauchi K, Yamamura K, Sano Y (2000b) Development of plasma chemical vapori- 
zation machining. Rev Sci Instrum 71:4627-4632 
Mori Y, Yamamura K, Sano Y (2004) Thinning of silicon-on-insulator wafers by numerically 
controlled plasma chemical vaporization machining. Rev Sci Instrum 75:942-946 
Nagaura Y, Yokomizo S (1999) Manufacturing method of high frequency quartz oscillators over 
1 GHz. In: Proceedings of the 1999 I.E. international frequency control symposium Besancon, 
France, pp 425-428 

Neslen CL, Mitchel WC, Hengehold RL (2001) Effects of process parameter variations on the 
removal rate in chemical mechanical polishing of 4H-SiC. J Electroact Mater 30:1271-1275 
Oliver WC, Pharr GM (1992) An improved technique for determining hardness and elastic 
modulus using load and displacement sensing indentation experiments. J Mater Res 

7:1564-1583 


41 Plasma-Based Nanomanufacturing Under Atmospheric Pressure 


1547 


Saddow SE, Schattner TE, Brown J, Grazulis L, Mahalingam K, Landis G, Bertke R, Mitchel WC 
(2001) Effects of substrate surface preparation on chemical vapor deposition growth of 4H-SiC 
epitaxial layers. J Electroact Mater 30:228-234 

Sano Y, Yamamura K, Mimura H, Yamauchi K, Mori Y (2007) Fabrication of ultrathin and highly 
uniform silicon on insulator by numerically controlled plasma chemical vaporization machin- 
ing. Rev Sci Instrum 78:086102 

Sano Y, Masuda T, Mimura H, Yamauchi K (2008a) Ultraprecision finishing technique by 
numerically controlled sacrificial oxidation. J Cryst Growth 310:2173-2177 

Sano Y, Masuda T, Kamisaka S, Mimura H, Matsuyama S, Yamauchi K (2008b) Improvement of 
thickness uniformity of SOI by numerically controlled sacrificial oxidation using atmospheric- 
pressure plasma. IEEE Int SOI Conf : 165-166 

Sano Y, Kamisaka S, Yoshinaga K, Mimura H, Matsuyama S, Yamauchi K (2010) Numerically 
controlled sacrificial plasma oxidation using array of electrodes for improving thickness 
uniformity of SOI. IEEE Int SOI Conf :68-69 

Shorey AB, Kwong KM, Johnson KM, Jacobs SD (2000) Nanoindentation hardness of particles 
used in magnetorheological finishing (MRF). Appl Opt 39:5194-5204 

Ueda M, Shibahara M, Zettsu N, Yamamura K (2010) Effect of substrate heating in thickness 
correction of quartz crystal wafer by plasma chemical vaporization machining. Key Eng Mater 

447-448:218-222 

Yamamura K, Shimada S, Mori Y (2008) Damage-free improvement of thickness uniformity of 
quartz crystal wafer by plasma chemical vaporization machining. Ann CIRP 59:567-570 

Yamamura K, Morikawa T, Ueda M, Nagano M, Zettsu N, Shibahara M (2009) High efficient 
damage-free correction of thickness distribution of quartz crystal wafer by atmospheric 
pressure plasma etching. IEEE Trans Ultrason Ferroelectr Freq Control 56:1128-1130 

Yamamura K, Takiguchi T, Ueda M, Deng H, Hattori NA, Zettsu N (2011) Plasma assisted 
polishing of single crystal SiC for obtaining atomically flat strain-free surface. Ann CIRP 
60:571-574 

Zhou L, Audurier V, Pirouz P (1997) Chemomechanical polishing of silicon carbide. J 
Electrochem Soc 144:L161-L163 



Section VII 


Nanomanufacturing and Non-Traditional 
Machining: Non-Traditional Machining 

Processes 

Hong Hocheng and Hung-Yin Tsai 



Electrical Discharge Machining Processes 



Masanori Kunieda 


Contents 


Introduction 1552 

Overview 1553 

Principle 1553 

Sinking EDM and Wire EDM 1554 

Characteristics and Applications 1556 

Pulse Generators 1557 

Relaxation Pulse Generator 1557 

Transistor Pulse Generator 1559 

Servo Feed Control (Gap Control) 1560 

Discharge Phenomena 1561 

Single Pulse Discharge 1561 

Energy Distribution 1565 

Removal Mechanism 1566 

Process Stability 1569 

Discharge Delay Time 1571 

Discharge Location 1572 

Machining Characteristics 1576 

Influence of Pulse Conditions 1576 

Influence of Polarity 1577 

Influence of Thermal Properties of Electrode Materials 1577 

Influence of Electric Conductivity of Electrode Materials 1578 

Summary 1579 

References 1579 


M. Kunieda (ISI) 

Department of Precision Engineering, School of Engineering, The University of Tokyo, 
Tokyo, Japan 

e-mail: kunieda@edm.t.u-tokyo. ac.jp 


© Springer-Verlag London 2015 

A.Y.C. Nee (ed.), Handbook of Manufacturing Engineering and Technology , 

DOI 10.1 007/978-1 -447 1 -467 0-4_7 1 


1551 


1552 


M. Kunieda 


Abstract 

Electrical discharge machining (EDM) is a removal process which exploits 
melting and evaporating of workpiece materials caused by pulse 
discharges which are ignited several thousands to tens of thousands times 
per second in the small gap between the tool electrode and workpiece. The 
advantage is that electrically conductive materials can be machined very 
precisely into complicated shapes independent of their hardness. Hence, 
EDM is preferably used in die and mold making, aeroengine manufacturing, 
and micro-hole drilling for ink jet and fuel nozzles, where complicated 
shapes in hard materials and with high precision have to be machined. 
This chapter first describes the principle of EDM. Then, the removal 
mechanism due to single pulse discharge is explained in details from 
the thermophysical aspects, followed by the clarification of the gap phenomena 
in consecutive pulse discharges. Thus, the machining characteristics of EDM 
are understood theoretically based on the fundamental insight into the 
phenomena. 


Introduction 

In the 1930s, the first attempts were made to machine metals with electrical 
discharge. The American company ELOX developed “disintegrators” to remove 
broken taps from valuable workpiece materials such as cemented carbide and high- 
speed steel (Schumacher et al. 2013). Erosion was caused by intermittent arc 
discharges occurring in air between the tool electrode and workpiece connected 
to a DC power supply. Arc discharges were initiated by mechanical contact 
between the tool electrode and workpiece and interrupted by separation due to 
vibration and rotation of the tool electrode. This process was not precise because the 
discharge energy was not controlled by a pulse generator, leading to overheating of 
the machining area. 

During World War II, physicists B. R. and N. I. Lazarenko in Moscow 
conducted studies on the minimization of wear of electric power contacts. 
They tested different materials with discharges of defined energy, generated by 
a capacitor. B. R. Lazarenko published the paper “To invert the effect of wear 
on electric power contacts,” in 1943 (Lazarenko 1943a). This idea started 
the development of EDM, using controlled discharge conditions, for achieving 
precision machining (Lazarenko 1943b). Since then, EDM technology has devel- 
oped rapidly and become indispensable in manufacturing applications such as die 
and mold making, micromachining, prototyping, etc., because electrically conduc- 
tive materials can be machined very precisely into complicated shapes independent 
of their hardness. 
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Overview 

Principle 

Figure 1 shows the principle of EDM. Pulsed arc discharges are generated with the 
interelectrode gap width of micrometer to tens of micrometer range in a bath of a dielectric 
liquid like oil or deionized water. Discharge location is only one for each pulse because 
once dielectric breakdown occurs at a location, temperature rises up to 7,000 K in the arc 
plasma. Since the ionization in the plasma channel is accelerated with increasing the 
temperature, the electrical conductivity is elevated resulting in the concentration of 
electric current at the discharge spot. The heat fluxes from the arc column thus generated 

Q 9 

exceed 10 W /nr (based on Eq. 6) on the anode and cathode spots, generating a minute 
discharge crater on both surfaces due to melting and evaporating of the electrode 
materials. Removed materials are cooled and resolidified in the dielectric liquid, forming 
spherical debris particles which are flushed away from the gap with the dielectric liquid. 

The next pulse voltage must not be applied to the gap until the temperature at the 
previous discharge spot falls sufficiently to recover the dielectric strength, enabling 
the next discharge to occur at different place. In this way, discharges are dispersed 
over the electrode surfaces; thereby the shape of the tool electrode is copied on the 
workpiece electrode. 

Thus, to copy the tool electrode shape on the workpiece, it is essential to use 
pulse discharge to randomize the discharge location. If discharge is not intermit- 
tently ignited, the temperature at the discharge location cannot decrease; thereby 
the plasma stays at the same location where the discharge was first generated, 
resulting in serious thermal damage on the workpiece. 


Feed 


Dielectric liquid 



Workpiece 


Bubbles 



Discharge current 


Gap width: several 
tens of micrometer 


Discharge column 


Fig. 1 Concept of EDM 
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Sinking EDM and Wire EDM 

Figure 2 shows the configuration of a sinking electrical discharge machine. The 
workpiece can be formed either by the replication of a shaped tool electrode (Fig. 3) 
or by 3D movement of a simple electrode like in milling (Fig. 4) or a combination of 
the above. The electrode material is normally copper or graphite. The numerical 
control monitors the gap conditions (voltage and current) and synchronously 
controls the different axes and the pulse generator. The dielectric liquid is filtrated 
to remove debris particles and decomposition products. 

Figure 5 outlines the wire electrical discharge machining (WEDM) method. 
Complicated shapes can be cut using a wire electrode of 0.02-0.33 mm in 



Pumping system 


Fig. 2 Sinking electrical discharge machine 
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Fig. 3 Sinking electrical 
discharge machining (Seibu 
Electric & Machinery) 
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Fig. 4 Milling electrical 
discharge machining (Sodick) 


Fig. 5 Wire electrical 
discharge machining 
(WEDM) 
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diameter like a wire sawing machine (Fig. 6). The wire electrode is usually a plain 
brass wire or coated wires, such as zinc coated brass or coated steel wires. 
Sometimes tungsten or molybdenum wires are used in case of thin wires. Since 
wire orientation can be changed by controlling the horizontal position of the upper 
wire guide relative to the lower guide, all types of ruled surfaces can be cut. Since 
discharge currents with a high peak value over a short duration are used, current is 
supplied through both the upper and lower feeding brushes to obtain a quick rise 
in the discharge current by reducing the inductance and to avoid wire breakage 
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Fig. 6 Wire electrical discharge machining (Sodick) 


due to Joule heating. Tension is applied to the wire to reduce vibration and 
deflection, which deteriorates cutting accuracy. 

In WEDM, water is most often used as the dielectric liquid, but its specific 
electrical conductivity should be decreased using deionizing resins to avoid elec- 
trolysis and to keep high open voltage. Use of deionized water is preferable to 
hydrocarbon dielectrics considering higher material removal rates and fire safety. 
However, finer surface finish can be obtained with hydrocarbon dielectrics under 
the same discharge energy. In contrast, hydrocarbon dielectrics are normally used 
in sinking EDM because surface roughness is better and tool electrode wear is lower 
compared to deionized water. 


Characteristics and Applications 

Since EDM is a thermal process, even hard materials such as quenched steel, 
cemented carbide, and electrically conductive ceramics can be machined. EDM 
also allows machining of complicated shapes. Since the tool electrode does not 
need to rotate for material removal like milling or grinding, holes with sharp 
corners (Fig. 3) and irregular contours (Fig. 6) can be machined without difficulty. 
Reaction forces generated in the EDM gap are insignificant, which also facilitates 
the machining of thin and flexible parts, deep grooves and holes (Fig. 3), and micro 
parts (Masaki 1993; Fig. 7) which are difficult to machine by milling. 

Generally, machining accuracy of EDM is very high in the order of several 
micrometers, and achievable surface roughness is Rz 0.4 pm. On the other hand, the 
material removal rate of EDM is low compared to other machining processes. 
Hence, EDM is preferably used in die and mold making, fuel jet nozzles drilling, 
and aeroengine manufacturing, where complicated shapes in hard materials and 
with high precision have to be machined. 
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Fig. 7 Micro electrical 
discharge machining (Masaki 

1993) 



Pulse Generators 
Relaxation Pulse Generator 


Early EDM equipment used relaxation pulse generators with capacitor discharges 
as shown in Fig. 8. The capacitance is charged according to the following equation: 




i{t)dt + i(t)R 



Here, u 0 is the open voltage of the power source, and i(t) is the charge current. 
When the gap voltage exceeds the dielectric breakdown strength, discharge is 
ignited in the gap as shown in Fig. 9. Discharge current i e (t) can be obtained from 
the following equation: 



Here, u e is discharge voltage. Discharge duration is several microseconds or less, 
even shorter than ten nanoseconds followed by recovery of dielectric strength of the 
gap. Thus, charging and discharging of the capacitance can be repeated autono- 
mously. This type of pulse generator has been used, especially in WEDM until 
recent years because discharge current with high peak values and short duration is 
needed in WEDM due to the reasons described in section “Influence of Pulse 
Conditions.” Disadvantage of the relaxation pulse generator is the difficulty of 
pulse control. For example, since the gap voltage starts rising immediately after 
each discharge, deionization of the plasma cannot be secured. Therefore, in the case 
that the plasma is not extinguished after discharge, the current from the power 
source leaks through the discharge gap, disabling the charge of the capacitance. 
Thus, with the recent development of power transistors which can handle large 
currents with high response, relaxation pulse generators have been replaced with 
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Fig. 8 Relaxation pulse generator 


Fig. 9 Gap voltage and 
discharge current with 
relaxation pulse generator 




Fig. 10 Transistor type pulse generator 


transistor pulse generators shown in Fig. 10. However, the relaxation-type pulse 
generators are still being used in finish machining and micromachining because it is 
difficult to obtain significantly short pulse duration with constant pulse energy using 
the transistor pulse generator. 
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Transistor Pulse Generator 


With the transistor pulse generator, a series of resistances and transistors are 
connected in parallel between the direct current power supply and the discharge 
gap as shown in Fig. 10. Here, consider the case that the open voltage of the power 
supply is 100 V and only one transistor is switched on. It is known that in EDM, 
when discharge is ignited, the gap voltage drops to around 20 V independent of the 
discharge current. This is because larger current brings about higher degree of 
ionization and greater diameter of the arc column resulting in increased electrical 
conductivity of the plasma. Hence, the discharge voltage u e is normally around 
20 V. The discharge current obtained is therefore 4 A using a single transistor 
circuit in this example. Thus, it is found that the discharge current can be raised by 
increasing the number of transistors which are switched on at the same time. 

Figure 1 1 shows the waveforms of the gap voltage and current, which are typical 
in sinking EDM operations. When the transistors are switched on, the open voltage 
u Q , 100 V in the case of Fig. 10, is applied between the tool electrode and 
workpiece. However, discharge does not occur immediately, but its ignition is 
delayed by t d . This is because dielectric breakdown needs a statistical time lag in 
which initial electrons should be generated by the collision of cosmic, radioactive, 
ultraviolet, and X-rays with the neutral species in the gap and the formative time lag 
in which the initial electrons are accelerated to generate the electron avalanche 
(Meek and Craggs 1978). In EDM, since the dielectric liquid is contaminated with 
electrically conductive debris particles, the ignition delay time is determined not 
only by the gap width but also by the concentration of debris particles in the gap. 

After the dielectric breakdown, a discharge current, i e9 flows in the gap. The gate 
control circuit keeps the transistors on for the preset discharge duration, t e9 after the 
dielectric breakdown, resulting in a uniform discharge crater size independent of 
the ignition delay time which can vary statistically for each discharge. Then after 
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Fig. 1 1 Gap voltage and current waveforms 
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the fixed discharge interval, t Q , the transistors are again switched on and open 
voltage is applied between the electrodes. 

The discharge energy per single pulse q is expressed as 


q = u e x i e x t e (3) 

where u e is the discharge voltage, which is around 20 V. Hence, discharge current 
and discharge duration are set by operators considering whether the process is for a 
roughing or a finishing operation. To obtain stable machining, the discharge 
interval should be sufficiently long so that the plasma is extinguished and hence 
the dielectric breakdown strength is recovered during the interval. Duty factor, the 
ratio of discharge duration to the average discharge cycle time, can be defined as 


D.F. 


te 

te + t 0 + {td) 



where (t d ) is average ignition delay time measured. With increasing D.F., material 
removal rate can be increased, while machining becomes unstable. Advanced 
electrical discharge machines can control the discharge interval adaptively by 
monitoring the gap voltage waveforms. 


Servo Feed Control (Gap Control) 

Tool electrode feed is not constant, which is different from the conventional 
machining methods where cutting tools are fed at a constant feed speed. The 
servo feed control shown in Fig. 12 keeps the working gap at a proper width. 
Larger gap widths cause longer ignition delays, resulting in a higher average gap 
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voltage. When the average gap voltage measured is higher than the servo reference 
voltage preset by the operator, the feed speed is increased. On the contrary, the feed 
speed is reduced or the electrode is retracted when the average gap voltage is lower 
than the servo reference voltage. Thus, short circuits can be avoided even when 
debris particles or humps of discharge craters bridge the gap. Hence, the feed speed 
of the tool electrode is changed adaptively according to the variation of the 
discharging surface area during the process. 

For example, suppose the reference voltage is set by the operator at 25 V, with 
the discharge duration of 100 ps and pulse interval of 40 ps. Since the servo feed is 
controlled so that the average gap voltage equals 25 V, the following equation can 
be obtained: 


100 V x (/j)ps + 20 V x 100 ps 
(t d ) ps + 100 ps + 40 ps 


25 V 



From the above equation, the average ignition delay time (t d ) is found to be 
20 ps. Then, D. F. can be obtained from Eq. 4 as 0.625. Since the average discharge 
cycle time is (t d ) + t e + t Q = 160 ps, the discharge frequency can be calculated as 
6.25 kHz. When the servo reference voltage is set high, the gap width is enlarged; 
thereby removal of debris particles from the gap becomes easy. Lower D. F. leads 
to more stable machining. However, the accuracy of replicating the tool electrode 
shape into the workpiece is lowered, and material removal rate decreases. 

In some cases, the average ignition delay time is used in place of the average gap 
voltage to monitor the gap width (Altpeter and Perez 2004). 


Discharge Phenomena 
Single Pulse Discharge 

Initial electrons, which are generated by the ultraviolet ray, X-ray, cosmic ray, and 
radiation from the earth crust, are accelerated by the electric field and ionize the 
neutral species due to collision, resulting in an electron avalanche. Thus, the electric 
field is distorted and streamers are developed toward both the anode and cathode. 
The accelerated positive ions bombard the cathode surface; thereby secondary 
electrons are emitted (secondary emission). With increasing the current density, 
the surface temperature rises, activating the electrons to jump over the work 
function of the material (thermal emission). High electric field existing on the 
cathode extracts the electrons due to the tunneling effect (field emission). When 
both temperature and electrical field are high, the thermal and field emission 
processes strongly interfere with each other (T-F theory) (Lee 1959). 

These emitted electrons are accelerated toward the anode and collide with the 
neutral atoms in the gap. Thus, great amount of positive ions are formed due to 
collision ionization, resulting in an established discharge (Meek and Craggs 1978). 
In EDM, the temperature at the discharge spot rises over the melting point and even 
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Fig. 13 Single pulse discharge phenomena 


higher than the boiling point of the electrode materials. Hence, thermal and field 
emissions of electrons from the cathode spot are dominant compared to secondary 

O Q 

emission; thereby the current density at the discharge spot reaches over 10-10 
A/m . Therefore, EDM discharge is an arc discharge. The arc plasma is highly 
ionized resulting in high current densities with the comparatively low discharge 
voltage of about 20 V. The discharge voltage is composed of an anode drop, 
cathode drop, and voltage drop in the positive column, and its value will slightly 
change depending on electrode materials, dielectric fluids, gap width, and pulse 
conditions. 

Figure 13 shows the gap phenomena due to a single pulse discharge. The 
constituents of the arc plasma are vapors of the dielectric liquid, molecules, and 
atoms generated by the decomposition of the dielectric, atoms and ions of the anode 
and cathode materials, and electrons. The arc plasma temperature in EDM was 
measured using spectroscopic analysis, and it was found that the temperature 
reaches 6,000-7,000 K (Natsu et al. 2004a). 

The diameter of the arc plasma expands with the passage of time because high- 
temperature region is enlarged due to heat conduction and because high-energy 
particles are dispersed due to diffusion and convection. Figure 14 shows the side 
view of the plasma generated between copper electrodes with discharge current of 
36A (Kojima et al. 2008). It is found that the plasma expansion is completed within 
90 ps, and the resultant crater diameter was 0.1 mm. Figure 15 shows the plasma 
and discharge crater observed from the reverse side of a transparent plate electrode 
(Kitamura et al. 2013). Ga 2 0 3 single crystal was used as the cathode plate, and 
discharge current was 20 A. The dark area at the center indicates the discharge 
crater formed on the Ga 2 0 3 single crystal plate, and the violet area surrounding the 
crater shows the plasma region. These images taken by high-speed video cameras 
show that the diameter of arc plasma is significantly larger than the gap width and 
that the growth rate of discharge crater is slower than that of the arc plasma. 

Due to the high temperature of the plasma, electrode materials and dielectric 
liquid evaporate, molecules are dissociated, and atoms ionized, resulting in a rapid 
expansion of a bubble. Figure 16 shows side views of a bubble generated by a single 
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Fig. 14 Temporal change in 
arc plasma diameter 
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Fig. 15 Expansion of arc plasma 


discharge in deionized water and EDM oil. The bubble and dielectric liquid are 
analogous to the spring and mass oscillation system, respectively (Eckman and 
Williams 1960). Starting from the initial condition, where the bubble is compressed 
in a small volume, the dielectric liquid is accelerated radially. At the moment the 
pressure inside the bubble equals the atmospheric pressure, the kinetic energy 
peaks. Hence, the bubble continues expanding. The diameter of the bubble peaks 
when all the kinetic energy is transferred to the potential energy of the bubble. 
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Fig. 16 Side view of bubble oscillation (/ e : 20 A, t e : 100 ms, gap width: 0.1 mm, anode: Cu </> 
5 mm, cathode: Cu 4> 5 mm) 


The diameter of the bubble reaches several millimeters, several tens of times greater 
than the gap width. Thereafter the bubble starts contracting until it is compressed to 
its initial diameter. In reality, the viscosity of the liquid causes damping in the 
oscillation. 

Debris particles removed from the crater proceed straight through the bubble as 
can be seen in Fig. 15, and hit the opposite electrode surface, which is wetted with 
the dielectric liquid, or penetrate the bubble wall and as a result decelerate. They 
then solidify into a spherical shape under the influence of surface tension. Thus, the 
dielectric liquid is important for the cooling and flushing of debris particles. 
Without the liquid, the molten and evaporated debris particles are reattached to 
the opposite electrode surface. 

After discharge, ions and electrons are recombined, and the dielectric strength of 
the gap is recovered. The evaporated atoms and molecules are solidified or con- 
densed to form debris particles or dielectric liquid, but gases such as hydrogen and 
methane which are generated by the dissociation of the working oil are left to form a 
bubble. The resultant diameter of the bubble is smaller than the maximum diameter 
during oscillation, but even larger than one millimeter. Hence, in consecutive 
discharges, the working gap is mostly occupied by bubbles although the discharge 
gap is submerged in dielectric liquid (Kitamura et al. 2013). 

When discharge is ignited in a gas, the plasma is easy to expand because there 
exists no liquid which serves as an inertia to resist the quick expansion of the high- 
pressure plasma. Hence, the plasma completes expanding within 2 ps after dielec- 
tric breakdown, and thereafter, its diameter remains constant during discharge 
(Kojima et al. 2008). Since the discharge gap is mostly occupied with bubbles in 
actual EDM processes where pulse discharge occurs consecutively, the rate of 
plasma expansion is more similar to the discharge in gas rather than that in liquid. 
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Energy Distribution 

It is very important to know the percentages by which energy is distributed to anode 
and cathode among the total discharge energy q given by Eq. 3 to discuss the 
material removal rate of workpiece and wear rate of tool electrode. 

First, consider the heat transfer by heat conduction due to the thermal motions of 
plasma constituents. Assuming the plasma is in thermo -equilibrium, the energy inputs 
into anode and cathode would be equal because the temperature distribution in the 
plasma is more or less symmetrical between the anode and cathode regions. Then, 
consider the kinetic energies of charged particles accelerated by the electric field. On 
the anode surface, all the discharge current is carried by electrons which are 
bombarding the anode. In contrast, the cathode surface gains energy due to the kinetic 
energy of the impinging positive ions and loses energy equivalent to the work function 
of the cathode material due to the emission of electrons. Hence, the discharge current 
on the cathode is the sum of the charges of the impinging ions and departing electrons 
per unit time. Due to the continuity of current, the discharge current on both electrodes 
must be equal. Thus, increase in the electron current results in decrease in the ion 
current on the cathode surface. Ion bombardment heats the electrode, while electron 
emission cools the cathode surface because free electrons in the metal must gain the 
energy equivalent to the work function of the metal to be emitted from the surface. 
Hence, it is considered the energy input to anode is greater than that into cathode. In 
reality however, physics is more complicated because the drop of electric field on the 
cathode (in the cathode sheath) is greater than that on the anode. Since analysis of 
EDM arc plasma is extremely difficult, the energy distribution must be obtained from 
experiments. The energy distribution in a single discharge was measured by compar- 
ing the measured temperatures of foil electrodes with the calculated results obtained 
under the assumed ratio of the energy distributed to electrodes, using a finite difference 
model (Xia et al. 1996; Hayakawa et al. 2001; Zahiruddin and Kunieda 2010). When 
the calculated and measured temperatures were in agreement, the energy distributed to 
the anode and cathode was found as shown in Fig. 17. The energy distribution into the 
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anode is consistently greater than that into the cathode regardless of discharge 
duration. In micro EDM, where the discharge duration is in the order of nanoseconds 
to sub-microseconds, energy distribution into the electrodes is lower than that in the 
macro (normal) EDM. This is because a large amount of energy is used for 
establishing the arc plasma. On the contrary, with the discharge duration extraordi- 
narily long compared to that used in normal EDM processes, most of the energy is 
distributed to electrodes, because the arc plasma is fully developed. In the normal 
EDM processes, the ratio of energy distributed to the anode and cathode to the total 
discharge energy// is about 40-48 % and 25-34 %, respectively. Thus, the heat flux on 
the electrode surface can be expressed as 



where d(t) is the arc plasma diameter. 


Removal Mechanism 


Temperature Rise in Electrode 

Given the energy distribution ratio rj and time-dependent diameter of the circular 
heat source d(t) as boundary conditions, the following heat diffusion equation can 
be solved to obtain the temperature distribution in the electrodes: 



d T 



Here, T is temperature, p is density of the electrode material, c p is specific heat, 
k is heat conductivity, and q is the rate at which energy is generated per unit volume 
of the electrode due to Joule heating. Nowadays, with the development of powerful 
computers and numerical methods, it is no longer difficult to take into account the 
temperature dependence of thermophysical properties of electrode materials and 
latent heat of melting and vaporization. However, it should be emphasized that 
correct temperature distributions cannot be obtained unless correct boundary con- 
ditions are used, i.e., energy distribution ratio r\ and plasma diameter d{t). 

From the time-dependent plasma diameter shown in Fig. 14 and the influence of 
discharge current on the plasma diameter obtained by the same authors (Kojima 
et al. 2008), the plasma diameter can be expressed as 


d(t) = 1.7 x 10“ 3 x f 035 x l, 0 ' 48 (8) 

Here, units of d(t ), t , and i e are [m], [s], and [A], respectively. This equation is 
based on the empirical equation obtained by Saito and Kobayashi (1967) and Ikai 
et al. (1992) who assumed that heat source diameters are equal to discharge crater 
diameters which were measured in experiments as follows: 


42 Electrical Discharge Machining Processes 


1567 


Fig. 18 Analysis model for 
temperature distribution in 
single pulse discharge 
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Fig. 19 Temperature distribution in steel workpiece (cathode) (/ e : 30 A, t e : 100 ps, energy 
distribution to cathode: 25 %) 


d{t) = 2.4 x 10“ 3 x t 0A x i e 0A (9) 

As described in section “Single Pulse Discharge,” the plasma diameter is larger 
than that of the discharge crater. Hence, d{t) in Eq. 8 is larger than that in Eq. 9. 
However, since the temperature distribution in the plasma is not uniform, the degree 
of ionization is not uniform. Therefore, the current density may be more concen- 
trated near the axis of the cylindrical plasma, resulting in effective heat source 
diameter smaller than that of the plasma diameter. Hence, d(t) in Eq. 9 was used in 
place of Eq. 8 as the heat source diameter. 

Figure 18 shows the analysis model to calculate the temperature distribution 
inside the electrode due to single pulse discharge. The heat transfer equation was 
solved using a simplified model assuming an expanding circular heat source with a 
uniform heat flux and ignoring the metal removal and temperature dependence of 
thermophysical properties. Figure 19 shows the calculated results when carbon steel 
is used as the electrode material, assuming the energy distribution ratio of 25 %. 
Given the boiling and melting points of carbon steel are 3,273 and 1,808 K, 
respectively, it is found that the melting region is expanding with the passage of 
time after dielectric breakdown. However, the peak temperature at r = z = 0 and 
boiling region decrease with time because the heat flux decreases due to plasma 
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diameter increase with time. During the discharge interval, 20 ps after the end of 
discharge, temperature of the whole discharged spot falls below the melting point 
and all the material is resolidified. 

Mechanism of Removal 

Calculation of the temperature distribution in the electrode only is insufficient to 
obtain the volume of removal due to a single discharge. Simulations of crater 
formation and eruption of molten pool are indispensable. There is no doubt that 
the material can be ejected when its temperature exceeds the boiling point. How- 
ever, it is still difficult to calculate the removal amount because some of the molten 
material is removed, but not all the molten region is ejected (Zahiruddin and 
Kunieda 2010; Van Dijck 1973). It is the commonly accepted view that the metal 
removal occurs due to boiling of the superheated molten mass in the crater. 
However, the thermo-hydrodynamics calculations of the crater formation and 
debris ejection due to boiling have been difficult. Hence, it has been believed that 
most of the metal removal occurs due to boiling at the end of the discharge because 
boiling of the superheated metal is suppressed by the bubble pressure which is 
significantly high during discharge (Zolotykh 1959). 

Van Dijck (1973) calculated the volume of the region inside the normal boiling 
point isothermal surface, at the end of discharge when the bubble pressure was 
considered to be at normal atmospheric pressure. The calculated volume agreed 
well with the measured material removed per pulse. It was also found that the 
material removal efficiency, which was defined as the ratio of the ejected to 
melted volume, was only 1-10 %. However, high-speed camera images of flying 
debris scattered from discharge points in Fig. 15 show that material removal occurs 
even during the discharge (Kitamura et al. 2013). Thus, the development of 
computation models which can simulate the material removal phenomena is 
eagerly awaited. 

In some cases, molecular dynamics (MD) simulation can be very helpful to 
analyze the phenomena which are difficult to model using conventional thermo- 
hydrodynamics simulations. Figure 20 shows the forming mechanism of a dis- 
charge crater analyzed by the MD simulation (Yang et al. 2010, 2013). Both the 
space and time domains which can be handled are unrealistically small compared 
with those used in EDM. Nevertheless, Fig. 20 clearly demonstrates the evidence of 
the superheating theory. It is found that removal can occur at any time when the 
pressure inside the molten pool exceeds the pressure outside (Yang et al. 2013). 

The superheating theory indicates that removal can occur without a liquid 
dielectric. In air, however, most of debris particles are reattached to the opposite 
electrode surfaces as described in section “Single Pulse Discharge.” Hence, the 
major role of dielectric liquid is cooling and flushing debris particles but not 
material removal. However, the dielectric liquid affects the expansion rate of the 
arc plasma. The rate of increase in the plasma diameter d{t) given by Eq. 8 is 
significantly higher in gas. Hence, the heat flux from the plasma generated in liquid 
is higher than that in gas, resulting in larger removal due to higher-temperature 
distribution in the discharge crater (Takeuchi and Kunieda 2007). 
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Fig. 20 Molecular dynamics simulation of material removal (Yang et al. 2010) 

Residual Stress 

The temperature rise due to discharge results in a compressive thermal stress on the 
electrode surface until the stress is released by material yielding at a high temper- 
ature. The stress becomes nearly zero when the material is molten. After discharge, 
due to the temperature decrease, the molten material resolidifies and starts shrink- 
ing, generating a tensile residual stress on the surface (Yang and Mukoyama 1996). 
Thus, a resolidified layer, called white layer, is formed on the surface. Since the 
white layer is a hard and brittle layer, the tensile stress causes micro cracks on the 
EDMed surfaces. Since such a heat-affected layer is not acceptable for practical use 
of the workpiece, the thickness of the layer should be minimized in finish processes. 


Process Stability 
Probability of Discharge 

To ignite an electric discharge in clean oil at an open voltage of 100 V, the gap 
width must be less than several microns. In actual EDM processes however, since 
the dielectric liquid is contaminated with electrically conductive debris particles, 
whose average diameter is even more than one third of the gap width (Yoshida and 
Kunieda 1998), discharge can occur at gap widths of tens of micrometers or more 
(Bommeli et al. 1979; Schumacher 1990). This fact sometimes reduces the repli- 
cating accuracy due to the uneven distribution of the debris particles. On the other 
hand, the extended gap width is favorable for easy gap control because it is difficult 
to keep the gap width constant at several microns. 

In order to study the influence of debris particles, a debris particle with a 
diameter of 5 pm was placed in a gap of 20 pm as shown in Fig. 21 (Kunieda and 
Nakashima 1998). If it were true that the discharge occurs at a point where the gap 
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Fig. 21 Probabilistic nature 
of discharge location 





50mm 



Debris particle 


a 


Toot electrode 


Feed 


Depth of cut 
due to feed 


Sinking EDM 



b 



Removal volume 
per unit area 


Gap wi 



Wire EDM 


Fig. 22 Non-uniform load removal and gap width due to probabilistic nature of discharge 


is shortest, then the discharge should occur at the point where the debris particle is 
placed. However, experimental results showed that in most cases, the discharge 
crater was not generated at the point where the debris particle was placed. This is 
because the probability of discharge in a certain area is obtained from the product of 
the probability of discharge per unit area and its surface area (area effect). Since the 
projected area of the debris particle in the direction normal to the electrode surface 
is negligible compared with the area of the electrodes (50 x 50 mm), the probability 
that discharge does not occur on the debris is greater than the probability that the 
discharge occurs on the debris. This result indicates that the discharge location is 
determined in a probabilistic way and not deterministic. 

Another example which indicates the probabilistic nature of discharge location 
is shown in Fig. 22. In sinking EDM, when the tool electrode is fed by a distance 
during a certain time, the removal volume per unit area on the tool electrode is 
greater at a curved surface than that at a flat surface. The removal volume on an 
inclined surface is smaller than that on a horizontal surface. Therefore, the dis- 
charge frequency per unit area must be higher on a convex surface and lower on an 
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inclined surface compared with that on flat and horizontal surfaces, respectively. To 
satisfy these requirements, the gap width must be smaller on a convex surface and 
greater on an inclined surface than the gap on flat and horizontal surfaces, respec- 
tively. In wire EDM, the gap width distribution around the wire is not uniform as 
shown in Fig. 22b, which never happens if discharge is located deterministically 
based on the gap width distribution. 


Discharge Delay Time 

The discharge delay time depends upon the local values of the gap width and 
concentration of debris particles at the discharge location. Hence, the tool electrode 
feed is controlled based on the discharge delay time as described in section “Servo 
Feed Control.” The average discharge delay time t d; ave [ps] was quantified empir- 
ically as a function of the gap width, gap [mm]; concentration of debris particles, 
cone [mm /mm ]; machining area, area [mm ]; and debris diameter, r [mm], under 
the following machining conditions: copper anode, carbon steel cathode, discharge 
current of 3 A, discharge duration of 300 ps, and open circuit voltage of 120 V 
(Morimoto and Kunieda 2009). 
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The machining area is a factor because the probability of discharge within a 
specific area is proportional to the ratio of its surface area to the whole discharge 
area as described in the previous section. t d> ave was obtained experimentally based 
on the fact that the discharge delay time t d conforms to the exponential distribution 
obtained from the Faue plot method (Bommeli et al. 1979; Araie et al. 2008). The 
Faue plot shows the percentage of electric insulation that does not break down until 
time t after the supply of a pulse voltage (Institute of Electrical Engineers of Japan 
1998). Suppose a single pulse discharge was generated N times and the number of 
discharges which did not occur until time t was n\ then n as a percentage of N can be 
expressed by 



where t dy ave the average value of t d is given theoretically (Mood et al. 1974) by 



ave 




The Laue plot enables easy evaluation of the discharge delay time, which 
normally has a large scatter, from the slope t d ave as shown in Figure 23. 
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Fig. 23 Evaluation of discharge delay time using Laue plot 


Fig. 24 Principle of 
measuring discharge 
locations 
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Discharge Location 

To obtain stable conditions in EDM, it is essential for the discharge locations to be 
dispersed over the working surface. Figure 24 shows the equivalent circuit to detect 
discharge locations (Kunieda and Kojima 1990). The electric feeder is intentionally 
divided, and each divided feeder is connected to the tool electrode at each end. 
While the resistance of the feeder, r 09 and contact resistance at the feeding point, r c , 
are constant, the resistance in the tool electrode from each end of the tool electrode 
to the discharge location, Rj and R 2 , respectively, varies depending on the discharge 
location. Consequently, by measuring the ratio between currents i j and i 2 through 
each divided feeder, the discharge location can be obtained in process. 
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Figure 25 shows the distribution of discharge locations measured with the 
extension of the depth of erosion. The working area of the tool electrode was a 
rectangle of 14 mm by 105 mm. The abscissa shows the number of discharges 
sampled every 50 pulse discharges, and the ordinate shows the discharge location 
along the longitudinal direction of the working surface. The dots indicate the 
location of each discharge. It was observed that when the depth of erosion was 
shallow, the process was stable. The distribution of discharge location was 
uniform, and no relationship between consecutive discharges was visible. 
The tool electrode feed rate was constant, and the distribution of gap width was 
uniform. With the increase of the eroded depth however, discharge became local- 
ized. The area of localized discharges moved periodically with a period of several 
minutes. When the machining depth increased further, the localization became 
prominent, and the area of localization narrowed. The movement of the localized 
area was mostly periodical. In this phase, the tool electrode feed was not smooth, 
and the distribution of gap width was not uniform. Ultimately, discharge was 
concentrated at a certain location. In this hazardous situation, it was no longer 
possible to continue the process, and thermal damage was left on the workpiece 
surface . 

The mechanism of the periodical movement of localized discharge is shown in 
Fig. 26. If the flushing of debris is difficult, local distribution of discharge spots 
cannot be eliminated once it is created at some area, even if the gap width of the 
other regions is narrower, since the dielectric strength is considerably decreased due 
to contamination and temperature rise in the localized area. Meanwhile, the local- 
ized area moves to other regions, where the gap width is sufficiently short. At this 
moment, the position of the tool electrode is retracted resulting in the fluctuation of 
the feed speed. 

Relation Between Distribution of Discharge Location 
and Process Instability 

Localization and concentration of discharges result from locally weakened 
dielectric breakdown strength in the gap. There seems to be two causes for the 
decrease of the dielectric breakdown strength. One of the causes is thermal. It was 
found both analytically and experimentally that it takes 5 ps or less for the 
temperature at the center of the plasma generated by the preceding discharge 
to drop under 5,000 K, which is the temperature where the dielectric strength of 
the gap is considered to recover sufficiently (Hayakawa et al. 1996; 
Natsu et al. 2004a). When deionization is insufficient during the discharge interval, 
subsequent discharges occur at the same location, resulting in the further increase 
in the temperature at the discharge spot, which accelerates the concentration. 
Formation of debris particles in the gap can be another reason. If flushing of the 
debris particles is insufficient, localization of discharge causes increased concen- 
tration of debris particles locally, leading to the acceleration of discharge 
localization. 
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Fig. 26 Mechanism of movement of localized discharge 


Fig. 27 Flushing methods 



Lift of tool electrode 



C Flushing 



Flushing through hole 


Flushing 

To maintain stable machining, it is critical to flush debris particles and cool the 
working gap in order to prevent the localization and concentration of discharge 
locations. Normally, the tool electrode is lifted periodically to replace the contami- 
nated dielectric fluid in the gap with fresh one (Fig. 27a). Linear motor equipped 
sinking EDM machines realized increased removal rate, especially in the machining 
of narrow and deep grooves due to the high-speed jump flushing operation of tool 
electrode which can prevent short circuiting. Machining accuracy was also improved 
because the lateral gaps of the groove became narrower and more uniform. The 
working gap can be flushed by a fresh dielectric fluid jetted from nozzles placed 
adjacent to the discharge gap (Fig. 27b). However, jetting of dielectric fluid merely 
from one direction causes increased density of debris particles in the downstream, 
resulting in uneven distribution of gap width deteriorating the machining accuracy. 
Pressure or suction flushing through holes in the electrode or workpiece remains one 
of the most efficient flushing methods at least if those holes have to be provided 
anyway or do not harm the workpiece (Fig. 27c). Both in pressure and suction 
flushing, one can observe lower electrode wear and larger gap width at the outlet 
point in comparison with the inlet point (Koenig et al. 1977). 
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Machining Characteristics 
Influence of Pulse Conditions 

Figure 28 shows three machining characteristics which are most important in EDM 
practice: material removal rate, surface roughness, and tool electrode wear ratio. 
Here the tool electrode wear ratio is defined as the ratio of volume of tool electrode 
wear to the volume of workpiece removal. Requirements for any two of the 
characteristics can be satisfied if the remaining one is sacrificed using the discharge 
current waveforms indicated in Fig. 28. However, there is no current waveform 
which can satisfy all the requirements. 

Current waveforms with higher peak current and longer discharge duration result 
in higher material removal rates. At the same time, low tool electrode wear can also 
be satisfied because the carbon layer deposited on the anode tool electrode is thicker 
when longer discharge durations are used as described later (Xia et al. 1996; Natsu 
et al. 2004b; Kunieda and Kobayashi 2004). However, surface roughness is not 
good because the crater generated by each pulse discharge is large. Thus, this pulse 
condition is normally used in rough machining. 

On the other hand, the pulse condition with longer discharge duration and lower 
peak current brings about both lower tool electrode wear ratio and better surface 
roughness. However, the lower heat flux due to the lower current density at the 
discharge spot results in lower energy efficiency of material removal compared 
with the pulse condition with shorter discharge duration and higher peak current 
even though the same pulse energy is applied. In this situation, the melted and 
evaporated regions in the workpiece are small, and most of the energy distributed to 
the workpiece is dissipated inside the workpiece due to heat conduction. Obviously, 
the material removal per pulse is small. In addition, longer discharge duration 
results in lower discharge frequency per unit time. Thus, this pulse condition is 
suitable for finishing. 

Finally, pulse conditions with shorter discharge duration and higher peak current 
provide better surface roughness due to smaller discharge crater. Moreover, 
although material removal per pulse is not large, it is large enough compared 
with the pulse condition with longer discharge duration and lower peak current 


Fig. 28 Relation between 
discharge current waveforms 
and machining characteristics 
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under the same discharge energy per pulse because energy efficiency is higher due 
to larger heat flux. Higher repetition rate of discharge per unit time due to shorter 
discharge duration further increases the material removal rate. However, the tool 
electrode wear ratio is high because the carbon layer deposited on the tool electrode 
is thin. This is why extremely short discharge duration and high peak current 
compared with the case of sinking EDM, for example, 1 ps and 100 A, are used 
in WEDM, where the problem of tool electrode wear is insignificant. 


Influence of Polarity 

As shown in Fig. 17, the energy dissipation into the anode is greater than into the 
cathode. Nevertheless, in sinking EDM, polarity of the tool electrode is normally 
positive except when very short discharge duration is used. This is because the 
carbon layer which is deposited on the anode surface due to thermal dissociation of 
the hydrocarbon oil (see Fig. 3) protects the anode surface from wear (Xia 
et al. 1996; Natsu et al. 2004b; Kunieda and Kobayashi 2004). Since the carbon 
layer is thick when the discharge duration is long, the tool electrode wear ratio is 
low with the polarity of positive tool electrode under the pulse condition of longer 
discharge durations. On the contrary, a negative tool electrode is used considering 
the energy distribution in the cases of finish machining and micromachining where 
deposition of carbon layer is scarce and also in WEDM where deionized water is 
normally used as dielectric liquid. 


Influence of Thermal Properties of Electrode Materials 

Since EDM is a thermal process, the influence of thermal properties of electrode 
materials on the removal volume is significant. When the heat flux from the arc 
column is equal, higher heat conductivity results in lower temperature on the 
electrode surface. Hence, materials with higher heat conductivity are suitable as 
tool electrodes. On the other hand, under the same temperature distribution inside 
the electrode, when the surface temperature is not over the boiling point or 
melting point, removal does not occur. Hence, materials with higher melting 
point and boiling point are also suitable as tool electrodes. The results of heat 
conduction analysis shown in Figs. 19 and 29 indicate that although both melting 
point and boiling point of copper are lower than those of steel, copper is more 
suitable as a tool electrode because copper has much higher heat conductivity than 
steel. With the expansion of the plasma radius, the melted zone is even resolidified 
during the discharge duration in the case of copper. During the discharge interval 
t 0 , since the surface temperature at the discharge spot drops rapidly in copper, 
the time needed for the plasma to be extinguished is short, leading to stable 
machining. 

Thus, considering the higher energy distribution to the anode, carbon layer 
deposition on the anode, and the thermal properties of electrode materials, it should 
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/ = 10 ps, t = lOOps, 20(.is after the end 

= 5 0. 8 jim d(t)= 121 pm of discharge 


Fig. 29 Temperature distribution in copper workpiece (cathode) (/ e : 30 A, t e : 100 ps, energy 
distribution to cathode: 25 %) 


be possible to select appropriate discharge current waveforms, polarity, and tool 
electrode materials to obtain minimum tool electrode wear ratio even lower than 
several percentage. 


Influence of Electric Conductivity of Electrode Materials 

In the electric circuit shown in Fig. 10, the resistances in copper tool electrode and 
steel workpiece are negligible. In the machining of electrically conductive ceramics 
and semiconductors, however, discharge current obtained is lower than 
metallic workpiece because of significant voltage drop in the workpiece electrode. 
Hence, power supplies with higher open voltage should be used, and the electric 
feeder should be connected to the workpiece at the point closer to the discharging area. 

The voltage drop is especially large adjacent to the discharge spot because the 
current density is extremely high in the order of ijn (d(t)/2) . Zingerman (1956) 
states that Joule heating does not play a substantial role in the machining of metallic 
materials, because electrical potential drops in metallic workpieces are negligibly 
small. According to the calculated results of Rich (1961), however, the Joule heating 
is extremely concentrated in a hemispherical volume directly under the discharge 
spot, and for high-resistivity metals, e.g., Hg, Sb, Fe, and Bi, the Joule heating is 
comparable to the energy input from the arc plasma. Saeki et al. (1996) calculated 
the workpiece temperature distribution considering the Joule heating as well as the 
heat flux from the arc column and found that the removal of high-electric-resistivity 
material (Si 3 N 4 -30wt%SiC) in a single discharge is greater than that of steel due to 
Joule heating under the same discharge current. However, it was found that a longer 
discharge interval is necessary to deionize the arc plasma and recover the insulation 
of the gap after each pulse, which results in a lower material removal rate of high- 
resistivity materials in a practical EDM. In the case of ceramics and semiconductors, 
which are normally hard and brittle materials, debris particles are generated not only 
by melting and boiling but also by fracture due to thermal stress. 
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Summary 

EDM processes involve transient phenomena occurring in a narrow space and in a 
short period of time. These EDM phenomena are not in thermal equilibrium, but 
include transitions between solid, liquid, gas, and plasma. Chemical reactions, mass 
transfer, and displacement of boundaries occur at the same time. Hence, compared to 
other discharge phenomena such as glow discharge in dry etching processes and arc 
discharge in welding processes, physics involved in EDM processes are obviously 
most complicated, rendering observation and theoretical analysis extremely difficult. 
Although it is a long way to fully understand the EDM phenomena, thanks to the 
advancements of various surface analyzing equipments, microscopes, high-speed 
imaging devices, and software tools for numerical analysis, fundamental studies are 
progressing step by step, and some of the common EDM knowledge which has been 
accepted for a long time is being modified. Given that numerous discharge gap 
phenomena have yet to be fully understood, the potentials of EDM technology may 
not be fully realized. For example, based on the fact that hydrocarbon dielectric is 
decomposed to deposit pyrolytic carbon on the electrode surface, a method to machine 
completely nonconductive ceramics was developed (Mohri et al. 2002). Furthermore, 
considering that removal can occur without a liquid dielectric, the dry EDM method 
was developed (Kunieda and Yoshida 1997). Undoubtedly, continuous efforts are 
expected to lead to further developments of the EDM technology in the future. 
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Abstract 

Electronics, optics, mechanics, and opto-mechatronics, which is the combina- 
tion of the first three technologies, have been developed during this half century 
and created high-performance and multifunctional devices/systems through 
research, development, and application of various functional materials such as 
semiconductors, dielectric materials, magnetic materials, ceramics, polymers, 
and glasses. To utilize the unique characteristics of each functional material, 
which is sometimes used as an active layer and sometimes as a substrate for 
high-quality epitaxy, it is necessary to machine the material into a desired shape 
and dimension with high precision, quality, and efficiency. In addition, as seen in 
the case of a magnetic head slider of a hard disk drive, the high functional parts 
are required strictly to maintain their relationships and functions for either 
contact or noncontact configurations. It is often the case that the fabrication of 
high-performance parts requires a design especially considering the interface of 
moving parts and their ambient. Among the machining processes of functional 
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materials, the ultra-high precision polishing, which is the most important 
finishing step, directly determines the performance of devices. 

This chapter reviews the recent trend of LSI devices from the viewpoint of 
machining process followed by the outline of chemical-mechanical polishing 
(CMP) and chemical-mechanical machining process (CMmP). It also presents 
the contribution of CMP technology in device processes including a personal 
opinion of the field. 


Introduction 

Processing Mechanism of Chemical Compound Polishing/CMP and 
Their History of Development 

Machining Mechanism and Characteristics 

The three categories of machining with slurry, “lapping,” “(mechanical) polishing,” 
and “ultraprecision polishing” in a simplified model are mapped as Fig. 1 by taking 
a removal unit which corresponds to the effective depth of slurry/size of debris for 
the horizontal axis. When the work is made of crustaceous material such as silicon 
and glass, lapping proceeds with a mechanical destruction through generation of 
cracks and debris caused by hard abrasives, whereas polishing attains a mirror 
surface through an integration of micro scratches with minimum mechanical effect. 
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A sequence of CMP mechanism of Silicon 

When OH~ ion contained in the slurry approaches to or 
contact the silicon surface: 

(1) A product of Si0 2 abrasive in alkaline solution 
is Si0 3 radical 

(2) Si0 3 radical bonds to Si surface 

(3) Si atom is removed from matrix exposing fresh 

bulk Si 

(4) Cyclic process returns to Step (1) 


h 2 o 

OH 


2 (=Si-OH) — ► =Si-0-Si= + I- 9 0 


Polishing pressure 


Motion 



h 2 o 


[ Si crystal ] 


Fig. 2 Polishing mechanism of Silicon from micro-viewpoint 


The ultraprecision polishing at nanoscale includes refined methods of the conven- 
tional mechanical polishing such as optical or metallographical polishing and a new 
approach using a chemical method, a physical method, or a combination of both 

(Doi et al. 2011). 

Here the CMP as chemically compound polishing is explained. The CMP aims 
to improve the quality of the polishing surface typically by utilizing slurry distrib- 
uted ultrafine abrasives and a soft pad. It is conducted in a clean room with 
high-purity de-ionized water to avoid particles and contaminations. The CMP is 
characterized with high efficiency and extremely low mechanical strain. The factors 
of machining mechanism are summarized as follows: 


(a) Soft or crustaceous reaction product on the work surface created by a chemical 
solution in the slurry is easily removed through mechanical effect of the slurry. 

(b) The rubbing interfaces on the work surface where fine abrasives are applied 
become activated to promote reaction with the chemical solution, and the 
reaction product is easily dissolved. 

(c) Temperature increase created by rubbing of the abrasives and a pad enhance a 
chemical reaction. 

(d) Formation of strained layer through micro scratches with abrasives supports a 
chemical process. 

Furthermore, if observed microscopically, 

(e) The adhesive effect between the abrasives in the slurry and atoms on the work 
surface sometimes accelerates machining. 


Figure 2 shows a model of microscopic mechanism for CMP of Si crystal with 
silica (Si0 2 ) particles. Si atoms on the silica surface and Si atoms on the crystalline 
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surface form Si-O-Si (siloxane) bonds via oxygen atoms, resulting in weaker bonds 
of Si atoms inside the crystal, which allows removal of the surface atoms through 
breakage of bonds by relative movement of pressed silica particles. This is the 
microscopic mechanism of machining which can also be applied to polishing of 
glasses (Cook 1990). The situation and the environment of the process greatly 
depend on many factors such as slurry composition, relationship of zeta potential 
between the abrasives and the work surface, type and condition of pads, and 
mechanical conditions. 

Development History of CMP Technology 

Mechanical polishing as represented by optical or metallographical polishing was 
already applied in the era of Isaac Newton (around seventeenth century) to fabricate 
lenses and mirrors for telescopes. When the mechanism of optical polishing was 
revealed, people started to recognize the importance of mirror finishing with higher 
technology and precision, inventing many kinds of ultraprecision polishing 
methods. Polishing process has not fundamentally changed from the old-day 
method in which a work and a pad are rubbed against each other with addition of 
slurry. A significant improvement in accuracy happened during the Renaissance, 
when people demanded higher magnification and better correction of astigmatism 
in telescopes and microscopes with several lenses. The optics as a scientific subject 
greatly advanced due to the improvement of polishing technology. 

Nowadays unique polishing methods using various abrasives and/or solutions 
have been proposed. The understanding of the material nature of a work and various 
mechanical and chemical effects induced by combination of process components 
such as slurry and pads led to many variations of chemical-mechanical polishing. 
The chemical-mechanical polishing (CMP) and mechanochemical polishing (MCP) 
are typical examples for such polishing methods (Doi et al. 2011). 

In fabricating devices, except for strained layer transistors utilizing increased 
carrier mobility in strained Si, it is quite rare to use a damaged layer from 
machining as an active layer. Thus, it is required to finish the wafer having a 
non-disturbed surface, i.e., a surface free of a damaged layer. A typical example 
of this finishing process to attain a non-disturbed surface is CMP. In the case of 
optical polishing for glass lenses, high-quality mirror finishing with surface rough- 
ness of ~5 nm Rz has been possible by using a pitch tool plate as a polishing pad and 
powder of Fe 2 0 3 (red iron oxide) or ceria (cerium oxide). However, the application 
of this technique to Si LSI process did not attain the same surface quality of planer 
technology in 1960s - micro scratches were observed after etching even if the 
surface looked mirror-like. This problem triggered the development of CMP as a 
chemical combined polishing. The CMP technology became the basis of polishing 
step for bare Si wafers and made a remarkable progress. Currently, the CMP for 
bare Si wafers generally uses alkali-based colloidal silica (high-purity colloidal 
silica produced by ion-exchange method or hydrolysis of alkoxysilane) which has a 
low size distribution and a high dispersion characteristics (Doi et al. 2011). 

In the actual CMP process, basic or acidic solution, which causes a chemical 
reaction with the wafer surface, is used as slurry distributed abrasives. Wafers are 
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Fig. 3 Polishing style and element techniques in CMP technology 


pressed against a rotating pad (also called as a polisher or polishing cloth) with the 
slurry dripped and spread on, and the top surface of the work is mechanically 
removed. Figure 3 shows fundamental equipment components to carry out the 
CMP. Among these components, a pad and slurry determine the CMP characteris- 
tics and thus are extremely important. A CMP pad for device wafers is different in 
specification and structure from those for bare Si wafers because of different 
objectives (Doi 2001). 

Matured CMP Technology for Si Wafers 

The industrial application of CMP as ultraprecision processing started in early 
1970s, when the CMP was used in Si semiconductor process. As the Si LSI 
fabrication demands higher-quality Si wafers, the CMP technology made a signif- 
icant progress. There is no other example than Si wafers that are mass produced 
(monthly production of more than 100,000 slices of f300 mm wafers in one factory) 
with extremely precise finishing (Doi 2001). 

There are two ways of Si CMP, both-side polishing like lapping and single-side 
polishing. This polishing process is required to meet many criteria including high 
flatness, surface roughness less than 1 ~ 2 nm Rz, free of oxidation-induced 
stacking fault (OSF), free of micro scratches, and free of haze. To attain these 
characteristics, the polishing process consists of several steps. The first polishing 
aims to obtain a mirror surface at high efficiency, the second polishing improves 
surface roughness with elimination of OSF, and the third and fourth polishing attain 
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a haze-free, contamination-free surface. Si wafer polishing in each step uses an 
appropriate pad and slurry to achieve required accuracy, quality, and efficiency. 
CMP slurry is generally an alkali solution with dispersed ultrafine silica particles of 
colloidal silica (Doi 2001). 

On the other hand, pad materials are composed of relatively soft polymers 
(synthetic resin) such as nonwoven fabric (first step) and suede-like artificial leather 
(second and third steps). The wafers are precisely cleaned immediately after the 
CMP process. A typical wafer cleaning is the so-called RCA cleaning (Kem 1970): 

SC-1 cleaning with NH 4 OH : H 2 0 2 : H 2 0 =1 : 1 : 5, diluted HF (DHF) cleaning, 
and 

SC-2 cleaning with HC1 : H 2 0 2 : H 2 0 = 1:1:5, 

which removes contaminants introduced during the polishing process. 

After the abovementioned polishing and cleaning process, the high-quality Si 
wafers are processed to fabricate devices. The similar ultraprecision processing 
explained here is also applied to planarization CMP of device wafers after 1990 
especially in multilayer metallization. Figure 4 shows the processing flow from Si 
crystal growth to wafer process and the processing flow of device process which 
applies planarization CMP. 

Additionally, the CMP technology has expanded its applications in many mate- 
rials including compound semiconductors such as GaAs and InP, quartz, metals, 
and plastics. The CMP technology is one of the most highlighted machining 
processes in many industries. 

In the next section, a planarization technique, which has become indispensable 
in the semiconductor device process, will be introduced. 


Current Status of CMP for Semiconductor Devices 

First, the reason why CMP technology has become indispensable to the semicon- 
ductor device processing is explained by introducing the historical background and 
current situation of semiconductor devices. Semiconductor devices are produced by 
constructing extremely fine patterns consisting of small components such as tran- 
sistors, resistors and capacitors, and electric wiring which connect each component 
to create electric circuits. Figure 5 is a photograph of semiconductor devices under 
formation of fine patterns on a Si wafer. The surface is covered with fine and 
complicated dips and bumps. If one sees the cross section, the bottom of the layer 
structure is an active region with numerous components such as transistors on Si 
wafer. A region of shallow trench isolation (STI) is stacked on the active region 
followed by metallization layers to connect components in the active region as well 
as plugs to connect lines in each layer. The metallization layers are insulated and 
mechanically supported with interlayer dielectric. As shown here, the multilayer 
metallization is a technology to connect various fine components created system- 
atically on the wafer with narrow lines of metal stacked for several layers and to 
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Fig. 4 Fabrication process of bare silicon wafers and the CMP technique’s application to devices 
process 


achieve designed function of the device. This technology is realized with flattening 
CMP, which is the application of ultraprecision polishing. The CMP technique is 
regarded as a standard flattening process and indispensable to the semiconductor 
device process now (Doi 2001). Before introduction of planarization CMP, the only 
practical planarization method was spin on glass (SOG), which could fabricate at 
most three layers of metallization as shown in Fig. 6, and this was the major 
bottleneck for the development of very large-scale integrated circuit (VLSI) 
(Doi 2001). 

To realize multilayer metallization after CMP of STI, several times of flattening 
CMP steps for many kinds of materials such as oxidation layer, tungsten, and 
copper as wiring metal are required at appropriate timing in the fabrication process 
(Doi et al. 2011). Figures 7 and 8 show CMP methods for dielectric film and 
interconnecting metal, respectively. In the case of the latter, Cu-CMP is called 
Damascene process (the embedding wiring planarization CMP). 
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Fig. 5 Micro-patterns formed on a silicon wafer, and multilayer wiring structures produced by 
CMP technology 




2 foyers wiring (without CMP) 


7 foyers wiring by pbnoriiatio n CMP 



Fig. 6 Effect of planarization CMP (comparison between A1 and Cu multilayer wiring) (Courtesy 
of Hitachi) 


Figure 3 in the last section presents equipment and basic components of CMP. 
Among these components, a pad and slurry determines the CMP performance, 
thus quite important. Since CMP for bare Si wafers and device wafers has 
different purposes, the specification of the pad and the slurry are different for these 
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processes (Doi et al. 201 1). Moreover, since the LSI technology evolves restlessly, it 
is necessary to follow the technological trend in a timely manner. For example, 
reduction of resistance by narrowing the line width is the major issue in the current 
wiring technology, and at this moment Cu line is used due to its low resistivity. 
However, further narrowing of the line width requires to support the Cu line with 
porous insulation layer having a low dielectric constant, which is the so-called 
Cu/low-k interconnects. In the Cu/low-k interconnect structure, an extra care is 
required during the CMP process to avoid collapse of interlayer dielectric and Cu 
metallization layers because the mechanical strength of the interlayer becomes 
extremely low and the thin film tends to peel off. Since peeling of a film is dominated 
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by a shear stress applied to the interface, one tries to avoid the peeling of the low-k 
layer by reducing the pressure applied to the wafer during the CMP process (Doi and 
Kinoshita 2001). Nevertheless, according to the Preston’s law, which is an empirical 
law in the polishing field, the removal rate (PR) is determined by, 

PR = k • p • v, 

where k is a Preston constant determined by polishing conditions, p is a pressure 
applied to a work, and v is a relative velocity or rotation. Therefore, the removal rate 
will decrease when the applied pressure is reduced. To supplement the decreased 
removal rate, it is necessary to increase the relative velocity (v) or Preston constant 
(, k ) by enhancing chemical effect of the slurry. In addition to this solution, people 
attempt to avoid reduction of Cu volume by reducing the thickness of barrier metal 
such as tantalum (Ta). When the thickness of Ta reaches its limit of thinning, 
alternative metals such as ruthenium (Ru) are considered now. Although Ru is an 
expensive noble metal, it is expected to be the most promising metal owing to its 
lattice matching, wetting property, and low resistivity (Ryuzaki et al. 2009). 

As discussed above, for the current structure of multilayer metallization having 
fragile low-k materials and new materials, the flattening CMP has an increasing 
demand for less dishing erosion and edge inclusion (EE). Recent introduction of 
novel materials such as Ru requires higher performance of a pad and slurry. 
Additionally, there is a problem of wafer expansion (450 mm) as listed in ITRS. 
The CMP process must be developed with total consideration of the entire Si 
process starting with bare Si wafer fabrication (Doi et al. 2007a). Table 1 shows 
the achievement levels for technical items related to the flattening CMP summa- 
rized by Planarization CMP Research Group/Committees in the Japan Society for 
Precision Engineering (Doi 2013). The items include machining equipment tech- 
nology, cleaning/purification technology, measurement/characterization tech- 
niques, device fabrication technology, and CMP applications. As shown in the 
table, there are many issues unsolved in each item. 

In the next section, the machining equipment technology and their supporting 
supplies, which are important in CMP technology, are explained. 


Recent CMP Equipment and Supply Materials 
Technology Trend of CMP Equipment 

There were approximately 15 manufactures of CMP equipment in about 1995. With 
intense competition and elevation of the technology level, the most of the manu- 
facturers vanished from the market. Currently, Applied Materials (AMAT) and 
Ebara Corporation dominate the market (Doi 2013). Figure 9 shows an example of 
the CMP machine by Ebara Corporation. All manufacturers employ the same 
machining method to attain the wafer- surface standard reference; however, the 
method and the head mechanism to achieve uniform pressure are different between 
each manufacturing company. Typically, an air-bag method which presses the 
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Table 1 The achievement situation of planarization CMP in 2012 


Fields 

The achievement situation of fiscal year 2012 

Processing/equipment 

technology 

cp300 mm is the mainstream of the mass production 

The mass production grail starts by cp450 mm 

Edge exclusion (EE) : 2 mm, setting removal rate variation 

<2.5 % (la) 

Throughput >50/h (interlayer, interconnection) 

Special note : Practical implementation of dual Damascene 
Cu-CMP process 

♦Cu-CMP : Polish uniformity, WIWNU <2 % WTWNU <2 %, 
WTWNU2 % 

Dishing <20 nm (20 pm isolated pattern), erosion <15 (L/S : 
20 pm/0.25 pm) 

Cu removal rate > 1 ,000 nm 

Barrier polishing : Cu/barrier him 2/3 step polishing 

Consider about the nano-topography problem, dishing, and 
erosion <20 nm 

Cleaning/washing 

technology 

In the case of Cu-CMP: 

2 

cp300 mm: Metal impurities concentration <1E10 atom/cm 

cp300 mm: Particle size (include scratch) 

Surface 80/ wafer (>0.16 pm) 30/wafer (>0.2 pm) 

Back side 200/wafer (>0.16 pm) 60/wafer (>0.2 pm) 

Measurement/ 
evaluation technology 

Practical realization of APC (Advanced Process Control) 

Integrated management of in-process and in-line monitor, 
interlocking movement with other process, CIM-ization 

Accuracy of in-process monitor < 1 % (practical value on the 
oxidation him) 

Particle detection performance >0.16 pm (practical value on the 
oxidation him) 

Construction of particle management technique at back side and 
end face of water 

In-process monitor (mainly used for EPD), in-line monitor (feed 
back the information about uniformity, planarity, defection of 
water to the process) 

Device technology 

Apply Cu-CMP to 30 nm generation device 

Number of wiring layer 10-13 layer Wiring width 0.04 ~ 0.8 pm 

Permittivity of interlayer dielectric k = 2. 3-2. 5 (dishing) + 
(erosion) <20 nm, 3 ~ 10% of wiring height 


wafer from the backside of the wafer is used. Figure 10 shows an example design of 
the polishing head by Tokyo Seimitsu Co. 

Recently each equipment manufacturer improves the perfection of the machine, 
although each company takes its own approach. Applied Materials focuses on 
control techniques such as process controls rather than improvement of hardware. 
A control technique called real-time process control (RTPC) monitors the polishing 
profiles, figures out the best polishing conditions, and adjusts the condition in the 
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CMP system "Reflexion IK" of CMP system "F-REX 300s I ) of 
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Fig. 9 Examples of CMP systems (all automatic machine of cassette to cassette and 
dry-in-dry-out) 



Locally in- plane wafer was pressed by zone compacting force 

High-precision polishing profile can be controlled 


Fig. 10 Head zone control method in CMP system (Tokyo Seimitsu) 


real time during the process. Optical endpoint detection technology is also 
improved. Conventionally the endpoint was detected with a system called ISRM, 

'T'AyT 

which utilizes laser beams. Currently, they use a technology called FullVision 
which utilizes multi wavelength light (The Semiconductor Industry 2008). They 

T^a/i 

also use a window embedded in a pad called Window-In-Pad to observe the 
wafer surface. 

Ebara Corporation utilizes their simulation technology to conduct fluid analysis 
and structural analysis. They conduct the simulation and actual process in parallel 
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Fig. 11 Fundamental mechanism of the endpoint detection using a skin effect 


and evaluate the result (Tabata 2009). As for the minimization of edge exclusion 
area, they analyze the stress at the roll-off of the edge and evaluate how the shape of 
the roll-off has an influence on the pressure distribution during polishing (Hiyama 
et al. 2008). 

Tokyo Seimitsu makes more efforts on novel component technologies. For 
example, they developed and announced an endpoint detection system using 
Eddy current from a skin effect. Figure 11 is the fundamental mechanism of the 
endpoint detection using a skin effect. As shown in the figure, the endpoint is 
determined by detecting the maximum of Eddy current induced by a skin effect 
during the polishing under a high-frequency magnetic field generated by a plane 
coil. The detection sensitivity near the endpoint can be increased by reducing the 
penetration of the magnetic field into the devices (Fukuda et al. 2009). 

Current standard CMP machines by most of the manufactures have polishing 
heads with multi-zone pressure control (Fig. 12) to improve higher uniformity 
under lower-pressure condition, which is necessary to ensure polishing process of 
Cu/low-k interconnect structure. When the pressure is reduced, the removal rate is 
also reduced. To supplement the loss of speed, more effective slurry must be 
developed. In this context, E-CMP which applies electro-machining is attractive 
since it can attain high removal rate with minimum friction during CMP. 


Supply Materials: Slurry and Pads 

Here the recent trend of slurry and pads as supply materials, which are quite 
important to realize CMP technology, as shown in Fig. 3, is explained. Among 
the several kinds of slurry for CMP, one for W has shown sufficient level of quality. 
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Fig. 1 2 An example of polishing heads with multi-zone pressure to improve higher uniformity 


Each manufacturer develops high-performance slurry for STI, Cu, and other mate- 
rials, especially one having non-Prestonian characteristics. Hitachi Chemical 
announced nano-colloidal ceria slurry of 5 nm, which can attain a removal rate of 
more than 100 nm/min for an oxide layer at the concentration of 0.01 wt% or more. 
It is worth noting that the removal rate of 400 nm/min was obtained at 0. 1 wt% of 
concentration (Ryuzaki et al. 2009). Figure 13 summarizes the characteristics of 
various kinds of slurry for STI and interlayer dielectric mainly. 

On the other hand, slurry for line metals is different from one for insulators. For 
example, oxidizer is indispensable for line metals like Cu and W or barrier metals 
like TiN, Ta, and TaN. Currently, instead of the conventional hydrogen peroxide, 
people start to use APS (ammonium persulfate) which has high flattening capability 
although the lifetime is short. This is because the metal polishing has a mechanism 
where the oxidizer forms an oxide layer on the surface of the work and the 
weakened oxide layer is removed with abrasives and a pad. For the recent buried 
Cu metallization, Ru film is considered as a replacement of Ta or TaN to improve 
the coverage in Cu electroplating. Ru film requires a new kind of slurry; therefore, 
development of higher-performance slurry is more and more active. 
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Fig. 13 Standard for selecting abrasives 

As for CMP pads, in addition to the standard IC pads made of polyurethane foam 
(Fig. 3/R&H corp.), newer pads are being developed. For example, WSP (water 
soluble particle) pads by JSR Corp. and halogen-free pads by Toray Corp. are 
available in the market. Recently, Cabot Microelectronics Corp., which is a leading 
supplier of slurry, released that an Epic Pad (D100) foamed with carbon dioxide gas 
could minimize the dishing erosion. Figure 14 shows the example of pore distribu- 
tion for the Epic Pad (D100 and D200) observed with an electron microscope. 


Challenges Toward Innovative CMP 

It has been discovered that CMP under high pressure of air or oxygen with a newly 
invented bell-jar-type closed CMP machine (See Fig. 15) could increase a removal 
rate compared with the ordinary polishing under atmospheric pressure (Fujita 
et al. 2007; Doi et al. 2004; DeNardis et al. 2005). In this setup, the existence of 
high-pressure oxygen has a significant impact on the removal rate, and chemically 
stable silicon dioxide films as well as chemically and mechanically stable sapphire 
crystals showed 1.5-2 times faster removal rate. Recently polishing of glass sub- 
strates for HDD was attempted using the bell-jar-type closed CMP machine, as a 
part of the “development of materials for rare earth replacement” by NEDO (New 
Energy and Industrial Technology Development Organization). Figure 16 is an 
example of a relationship between the removal rate of glass substrate by ceria slurry 
and the ambient pressure inside the bell jar (i.e., chamber) (Ohnishi et al. 2011; 
Yamazaki et al. 2010). When polished under various compressed gases, the 
removal rate increased with the pressure inside the bell jar. Particularly, polishing 
under high-pressure (500 kPa) air enabled 2.2 times faster removal rate. Currently, 
the mechanism of this enhancement is not clear, but it is surmised that the excess 
oxygen has an effect on the polishing process. 

Gallium nitride (GaN) crystals, which is highlighted as a promising material for 
LED lighting and laser diodes, are very stable both chemically and mechanically, 
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Fig. 14 Example for Cabot’s pad (different pore size and distribution of D100 and D200) 
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Fig. 15 Construction drawing of the bell-jar shaped CMP machine 

thus require a breakthrough to achieve efficient machining. A double removal rate 
of GaN under high-pressure oxygen or air was confirmed with the bell-jar-type 
CMP machine as shown in Fig. 17, demonstrating the effectiveness of this bell-jar 
approach (Doi et al. 2012). 
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Similarly, silicon carbide (SiC) for power devices, which also presents chemical 
and mechanical stability, has an issue of low machining efficiency. Application of 
the bell-jar- type CMP enhanced the removal rate by several tens of percent under 
high-pressure oxygen, although the rate was still low (Doi et al. 2007b). 

The abovementioned experiments implied that intentional supply of oxygen on 
the polishing surface would improve the machining efficiency. One of the 
approaches from this idea is introduction of active oxygen by a photocatalytic 
reaction. It is generally said that irradiation of ultraviolet (UV) light on Ti0 2 
particles excites electrons to a higher energy band leaving holes which convert 

o 

O ions into highly oxidizing oxygen radicals. CMP of SiC substrates under high- 
pressure oxygen was attempted in the bell-jar-type CMP machine with UV irradi- 
ation through a glass (quartz) window on the bell jar (see Fig. 18) and addition of 
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Ti0 2 particles to colloidal silica. Figure 19 is the comparison of removal rate of SiC 
single crystal substrates with and without UV irradiation for various kinds of gas in 
the bell jar (Doi et at. 2007b). With UV irradiation to Ti0 2 added to the colloidal 
silica slurry, the removal rate became twice faster than that without. This seems to 
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be the effect of photocatalytic reaction. It should be noted that over four times 
higher removal rate was obtained under high-pressure oxygen with UV irradiation. 
This condition resulted in a satisfactory polished surface of SiC substrates with the 
surface roughness (Ra) of 0.2 nm. 

The bell-jar-type closed CMP machine demonstrated efficient machining of 
extremely hard materials such as SiC with high-quality finish by introducing 
high-pressure oxygen and photocatalysis. This approach gave an important insight 
to the machining of hard materials and it is expected to become one of the important 
future machining technologies. 


Summary 

This chapter reviewed the development history of polishing techniques which have 
been the cutting-edge technology to meet the demands in any time and focused on 
the CMP technology, which is indispensable to the semiconductor device 
processing now. The background of CMP technology in the semiconductor 
processing and the recent technological trend were also covered. The future 
polishing process, which will contribute to the world through production of inno- 
vative opto-mechatronic devices, were also introduced. 

Incidentally, a part of this work was supported by JSPS KAKENHI Grant(Grant- 
in-Aid for Scientific Research(S)) Number 24226005. 
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Abstract 

Laser machining belongs in the large family of material removing or machining 
processes. It is one of the most widely used thermal energy-based noncontact- 
type advanced machining processes. Laser beams can be used in many industrial 
applications, including machining, whereby it constitutes an alternative to tra- 
ditional material removal techniques and can be used for the processing of a 
variety of materials, namely, metals, ceramics, glass, plastics, and composites. 
Laser machining is characterized by a number of advantages such as the absence 
of tool wear, tool breakage, chatter, machine deflection, and mechanically 
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induced material damage, phenomena that are usually associated with traditional 
machining processes. However, as it is the case with all manufacturing pro- 
cesses, it is the optimum operating parameters that have to be determined. In the 
present chapter, the state of the art on the laser machining is being presented. 


Introduction 

Lasers differ from the ordinary light in that they consist of photons all of which 
have the same frequency and phase (coherence). A laser’s ability to produce 
coherent light is based upon the principle that the light photons can stimulate the 
electrons of atoms and, consequently, emit photons of identical frequency. The 
basic mechanisms required for the production of laser light include stimulation, 
amplification, and population inversion and have been thoroughly described in 

(Chryssolouris 1991). 

Laser machining belongs in the broad family of material removal or machining 
processes. Laser machining is the material removal process accomplished through a 
laser beam and target material interactions. Such machining processes transport 
photon energy into the target material in the form of thermal or photochemical 
energy and remove material by melting or by direct vaporization/ablation. Con- 
ventional machining processes rely on mechanical stresses induced by tools for the 
breakage of the material bonds. This basic difference in the material removal 
mechanism states the advantages and disadvantages of laser machining, compared 
with those of traditional machining processes. 

Lasers can be used in many industrial machining processes for a variety of 
materials including metals, ceramics, glass, plastics, and composites. Laser beams, 
used as machining tools, are not accompanied by problems such as tool wear, tool 
breakage, chatter, machine deflection, and mechanically induced material damage, 
phenomena that are usually associated with traditional machining processes 
(Chryssolouris 1991; Kempfer 1995; Chryssolouris et al. 1988a). The goal of laser 
machining is that the material removal rate, the surface quality, and the dimensional 
accuracy are maximized in order for a high-quality part or product to be economically 
manufactured. Laser machining allows for a great deal of flexibility, since the laser 
beam is optically manipulated, and could potentially lead to the implementation of 
drilling, cutting, scribing, cladding, welding, and heat treating processes in one setup 
on one machine, where the different processes are executed just by changing process 
parameters. Laser machining among other applications is used in industry for pro- 
ducing one dimensional features in turbine blades, combustion chambers and aerosol 
nozzles, intricate two-dimensional shapes in workpieces or scribe substrates for 
cooling and identification labels in finished parts (Hutfiess et al. 1995; Kikuchi 
et al. 1995; Hecht 1994; Ziegert and Mize 1994; Katsuki et al. 1994; Poprawe 
et al. 1995). These applications, however, are at different stages of development and 
industrial exploitation. Some of them do not exhibit any significant exploitation, either 
because they do not offer sufficient advantages over competing processes or because 
they are not yet adequately developed and proven for industrial use. The industrial 
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Table 1 Types of lasers (Chryssolouris and Salonitis 2011) 


Laser type 


Wavelength (mm) 

Typical Performance 

Solid 

Ruby 

694 

PM, 5W 


Nd:YAG 

1,064 

PM, CW, 1-800 W 


Nd:glass 

1,064 

PM, CW, 2 mW 

Semiconductor 

GaAs 

800-900 

PM, CW, 2-10 mW 

Molecular 

C0 2 

10.6 pm 

PM, CW, <15 kW 

Ion 

Ar 

330-520 

PM, CW, 1 W-5 kW 


Excimer 

200-500 

PM 

Neutral gas 

He-Ne 

633 

CW, 20 mW 


usage of some laser-based machining applications is also dictated by an additional 
development of improved and cost-effective laser sources (Haferkamp and Seebaum 
1994; Firestone and Vesely Jr 1988; Hugel et al. 1994; Leidinger et al. 1995). 
A typical laser machining system is complex, with the laser source though, being 
the most critical component. Based on the lasing medium used, the laser sources can 
be classified into solid, liquid, or gas either in a continuous wave (CW) or in the pulsed 
mode (PM) operation. Nowadays, although there is a variety of different laser sources 
(Table 1), the most common ones used in machining are C0 2 , Nd:YAG, and excimer 

Q o i c o 

lasers. Beam power intensity may also vary from 10 W/crrT to 10 W/crrT. 


Laser Beam and Workpiece Material Interaction 

Laser machining is an extremely, spatial and temporal, localized thermal process. 
During beam-material interaction, a certain portion of the incident light energy is 
absorbed by the material (photon absorption), and high temperature is developed in 
the vicinity of the beam spot, resulting in material softening, local yielding, 
melting, burning, or evaporation. In addition to the thermal effects, laser-material 
interactions may be associated with photochemical processes, namely, the 
photoablation of the material. The effects of laser-material interaction are sche- 
matically presented in Fig. 1. 

Depending on the material to be processed, the laser beam power intensity, the 
spot size, the beam distribution, and the scanning velocity, in case of relative 
motion between the beam and the workpiece, a number of effects may be observed 
(Chryssolouris 1991; Chryssolouris et al. 1988a; Chryssolouris et al. 1997; Choi 
and Chryssolouris 1995; Haferkamp and Seebaum 1994), such as: 

Mechanical effects. In case of moderate laser power intensity and rather high 
scanning velocity, the temperature rise of the laser-impinged workpiece surface 
is typically below the material’s melting point. As a result, the workpiece is 
locally heated, the material softens, and thermal stress gradients arise. As a 
consequence, local yielding, loss of stiffness, thermal cracking, or local buckling 
may occur. 
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Fig. 1 Effects of laser-material interactions 


Phase-change effects. Laser power intensity may raise the temperature of the laser- 
impinged workpiece surface above the material’s melting point, high enough to 
change phase, and result in material melting or evaporation. 

Physicochemical interactions. Depending on the assisting material used in the 
process (e.g., gas, liquid, powder, graphite, etc.), which influences it (i.e., cools, 
removes melt, reacts, etc.), simultaneous physicochemical reactions may take 
place between the assisting material and the workpiece, if these are chemically 
reactive. This results in the activation of various phenomena such as burning, 
sintering, soldering, alloying, etc. 

In any laser machining operation, one or more of the abovementioned phenom- 
ena incur. Analytical and numerical models help the understanding of laser 
processing, but there are still many questions to be answered. Depending on the 
mechanism involved during the beam-material interaction, laser processes can be 
roughly divided into material removal processes, joining or additive processes, and 
modification processes. 


Laser Manufacturing Processes 

As indicated in the introduction, laser machining can replace the mechanical 
removal methods in many industrial applications. The fact that laser machining is 
a noncontact, thermal process enables the machining of difficult-to-machine mate- 
rials; however, the power densities introduced on the surface of the workpiece are 
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Fig. 2 Degree of absorption as a function of wavelength (Hiigel and Teubner 1992) 
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Fig. 3 Overview of laser machining processes (Meijer 2004) 


high and can lead to elevated temperatures and to material decomposition. The 
absorption by a material though depends on the wavelength of the laser beam 

(Fig. 2). 

Since the invention of the laser, there has been a constant development made for 
the shortening of pulse times, aiming at different laser machining processes. As 
indicated in Fig. 3, the different applications required a different kind of laser 
sources, in terms of pulse durations and power density. 
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Fig. 4 Schematic of one-, two-, and three-dimensional laser machining (Chryssolouris 1991) 


In general, laser machining can be divided into one-, two-, and three- 
dimensional processes as it can be seen in Fig. 4. These categories can be identified 
through the examination of the shape and the kinematics of the erosion front, 
namely, in the region of the workpiece that material removal takes place. 

In the case of a one-dimensional process (drilling), the laser beam is relatively 
stationary to the workpiece. The erosion front, located at the bottom of the drilled 
hole, propagates in the direction of the line source in order to remove the material. 
In the case of a two-dimensional process (cutting and grooving), the laser beam is in 
relative motion with respect to the workpiece. Material removal occurs when the 
line source is in a perpendicular direction to the line, thereby forming a 
two-dimensional surface. The erosion front is located at the leading edge of the 
line source. In grooving, a laser beam does not cut through the entire workpiece. 
The grooving process exhibits complicated characteristics, such as three- 
dimensional heat transfer, two material phases, a moving boundary, a spatial 
distributed heat source, etc. For three-dimensional machining, two or more laser 
beams are used, and each beam forms a surface through relative motion with the 
workpiece. The erosion front of each surface is found at the leading edge of each 
laser beam. When the surfaces intersect, the three-dimensional volume, bounded by 
the surfaces, is removed. Increased demands, placed on technical products, lead to 
the use of high-performance materials. The processing of this kind of materials 
aided by a laser beam, during machining, opens up a completely new range of 
efficient cutting options. Laser-assisted machining (LAM) is a hybrid cutting 
process during which a laser beam is used for locally heating and softening the 
workpiece in front of the cutting tool. Localized heating, directly in front of the 
cutting tool and the derived reduction in material strength, results in significantly 
increasing the material’s machinability. The temperature rise in the shear zone 
reduces the workpiece’s yield strength and work hardening that causes the plastic 
deformation of difficult-to-machine materials. Titanium-, nickel-, and cobalt-based 
alloys can already be machined easier and be of a higher quality. This process’s 
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benefits comprise reduction in the cutting forces/specific cutting energy, longer tool 
life, better surface integrity, and high productivity. 

Over the recent years, lasers with their excellent beam quality are achieving 
noticeable advantages and improvements in high precision and material processing 
at the microscale. Besides the excellent beam quality, there are more advantages of 
the laser technology such as compact installation size, high laser efficiency, and 
moderate system price. Typical applications of laser micromachining have been 
developed including drilling, cutting, structuring, lateral material removal, as well 
as marking. 

The three major issues in all the aforementioned laser machining process are: 

• Material removal rate 

• Dimensional accuracy 

• Surface quality 

The material removal rate is governed in each case by the propagation speed of 
the erosion front. In laser drilling (a one-dimensional process), the material removal 
rate is determined by the speed of the erosion front when moving toward the beam. 
In laser cutting (a two-dimensional process), the scanning velocity determines the 
rate at which cut surface increases in the workpiece. In three-dimensional laser 
machining, the two-dimensional surfaces produced by two laser beams define a 
three-dimensional volume of the material to be removed. The speed at which these 
two surfaces propagate determines the time required for the removal of a materiaTs 
given volume. The dimensional accuracy is particularly determined by the taper of 
the hole in laser drilling, the kerf geometry in laser cutting, and the groove shape in 
three-dimensional machining. The surface quality of all laser machining processes 
is related to factors such as surface roughness, dross formation, and the heat- 
affected zone. 


Drilling 

Laser drilling is a relatively modern, accurate, nonconventional process with a 
variety of applications. The major advantage of laser drilling, in comparison to the 
conventional methods, is the small size and aspect ratio (up to 1:20) of the keyhole 
that can be created. Either continuous or pulsed laser sources can be used in such 
applications. Pulsed laser sources present a number of advantages, such as less 
plasma generation. Laser drilling involves a stationary laser beam, which uses its 
high power density to melt or vaporize material from the workpiece. Laser drilling 
(Fig. 5) (Chryssolouris 1991) has a variety of applications. Laser pulsed drilling, 
also referred to as percussion laser drilling or on center drilling, utilizes several 
pulses to create the hole (Fig. 6). 

Laser drilling is based on the absorption of the laser energy by the workpiece 
material and the conversion of the photon energy into thermal energy. When the 
temperature exceeds that of the melting and/or vaporization, the workpiece material 
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Fig. 5 Laser drilling process schematic 



Fig. 6 Single -pulse drilling, percussion drilling, trepanning, and helical drilling techniques 
(Dausinger 2000) 

changes phase and hole geometry is formed. If the laser irradiance is kept below a 
certain threshold (typically ca. 106 W/cnr for steels), the workpiece material is 
melted and not vaporized (Salonitis et al. 2007). In that case, the hole is formed due 
to ejection of the melted material with the use of an assisting gas jet (LIA 2001). For 
laser irradiance values beyond the threshold value, the material is removed mainly 
due to vaporization. 

Single-Pulse Drilling 

Single-pulse (shot) drilling tackles high-speed production of blind or through holes 
and delivers aspect ratios typically below 15:1. High production throughput, how- 
ever, is a trade-off with tolerances, generally above 10 %, compared to those of 
trepanning. For many applications, where cycle time is more important than 
quality, this is acceptable. An example of single-pulse drilling is found in the 
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automotive industry, where laser drilling is used for the creation of a scribed 
guideline for breaking off (cracking) a connecting rod of diesel engines. Scribing, 
in effect, drills blind holes closely enough to create a notch. Another single-pulse 
drilling application in the automotive industry is in the manufacturing of filters. 

Percussion Drilling 

Percussion drilling delivers successive laser pulses to the same spot and is the best 
trade-off between throughput and hole quality. The process has become standard 
for the creation of cooling holes in turbine-airfoil blades. Laser percussion drilling 
is favored over the conventional drilling and the other laser drilling techniques, 
because it is by far the quickest technique. However, it still reveals some draw- 
backs. A common drawback is the formation of a recast layer , which is resolidified 
material remaining at the wall of the hole. Normally, some resolidified material can 
also be found at the entrance and exit of the hole, in which cases it is called spatter 
and dross , respectively. Another drawback is tapering. The latter refers to the 
decrease in the hole diameter with increasing depth. Nowadays, tapering is no 
longer considered a disadvantage; however, control of the taper angle and repro- 
ducibility are required. Occasionally, the hole resulting from a laser percussion 
drilling process shows barreling or a local increase in the diameter of the hole. 

Trepanning 

Trepanned laser drilling is a method used to remove a cylindrical core or circular 
disk from a substrate. Trepanning is the standard technique for holes of approxi- 
mate diameters 500 pm, e.g., holes in turbine blades. It is essentially a percussion 
drilling process, followed by a cutting procedure. The application of nanosecond 
pulses to trepanning can increase the quality of the hole. However, the drawbacks of 
percussion drilling still remain. Trepanning enables the drilling of large and shaped 
holes further to the advantage of consistency. Trepanning also reduces the holes’ 
taper. Unlike percussion laser drilling, the position of the beam or substrate is 
moved in conjunction with a predetermined laser beam “overlap” to achieve the 
desired edge quality and production throughput. Less overlap trepanning laser 
drilling can increase the throughput but produces a more jagged edge quality. On 
the other hand, more overlap creates a finer hole resolution and edge quality. Laser 
drilling through the trepanning method can also produce proportionately larger exit 
holes by “tilting” the laser beam within an already drilled entrance hole. 

Helical Drilling 

A relatively new technique, called helical drilling, makes use of the breaking up of 
the process into a multitude of ablation steps in order for accuracy to be enhanced. 
In contrast to trepanning, the helical drilling reaches at breakthrough only after 
many turns of spiral describing the path of the ablation front. Helical drilling has the 
following favorable effects on the drilling accuracy: more deviation from circular 
geometry can be reduced than it can from trepanning; the load on the opposite walls 
is minimized; and most importantly, the recast layers as observed in percussion 
drilling, using nanosecond lasers, can be greatly reduced or completely avoided. 
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Fig. 7 Spatial intensity distribution for TEM00 laser beam (Chryssolouris 1991) 

The beam path is not limited to circular geometry. With suitable optical systems 
such as scanners or by the movement of the workpiece, any shape can be formed. 
This means that helical drilling can be applied to laser micromachining when high 
accuracy and high machining quality are required. Helical drilling is a technique 
that is efficient when the helical diameter is close to the focal diameter of the 
laser beam. 

In principle, laser drilling is governed by an energy balance (Fig. 5) among the 
irradiating energy from the laser beam and the conduction heat into the workpiece, 
the energy losses to the environment, and the energy required for a phase change in 
the workpiece (Salonitis et al. 2007). 

The incident beam energy has a spatial intensity distribution, which in laser 
drilling is usually a Gaussian distribution produced by a laser operating in a TEM 0 o 
mode. The focused beam radius is usually specified as the distance between the 
beam center and a point, where the intensity is reduced from its maximum value at 
the beam center by a factor of e~ (Fig. 7); the average diameter of the drilled hole 
may be less than the beam diameter due to various heat loss effects. These heat 
losses, primarily conduction through the interior of the workpiece and losses in the 
environment, divert the beam energy away from the actual hole drilling process. 
Laser drilling has several advantages over mechanical methods: 

• Due to the thermal nature of the laser drilling process, holes can be made on 
materials such as ceramics, hardened metals, and composites that are difficult to 
be machined by conventional methods (Chryssolouris and Salonitis 2011). 

• Higher accuracies and smaller dimensions can be achieved by laser drilling than 
by conventional drilling methods. Depending on the focusing lens used, hole 
diameters between 0.001 in (0.018 mm) and 0.050 in (1.3 mm) are achievable. 
With the proper selection of beam power values, beam pulse characteristics, 
focusing lens, and interaction time, the desired hole geometry can be obtained. 

• High drilling rates can be achieved in a production environment by using a 
pulsed beam source. By coordinating workpiece motions with the pulse period, 
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Fig. 8 A schematic diagram 
of laser grooving process 
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drilling rates above 100 holes per second can be achieved. The controlling of 
process variables allows rapid changes in hole diameters and hole shapers to be 
made in process, thus eliminating the need for tool changes. 

• The laser allows holes to be drilled at high angles of incidence on the surface 
(up to 80°). Shallow angle drilling is difficult to be mechanically achieved due to 
tool deflections. 

There are limitations to laser drilling in some cases; however: 

• Holes with stepped diameters cannot be drilled using a laser. 

• Due to instabilities in the laser drilling process, depth control in a blind hole 
drilling is difficult. However, continuous monitoring of the beam mode and the 
beam power regulation can provide substantial benefits when, in the form of a 
controller, incorporated into the laser drilling system (Stournaras et al. 2010). 

• For deep holes, the effects of beam divergence may become unacceptable. This 
can be compensated for by using a longer focal length lens or by continuously 
moving the focal point from the workpiece surface to a point at the workpiece 
interior. 


Grooving 

During laser grooving (Fig. 8), a laser beam is scanned over the workpiece surface, 
resulting in increasing its temperature, above the material’s melting point, in a 
small region near the beam spot (Stournaras et al. 2009a). In cases where the heat 
flux provided by the process parameters is significant, vaporization of the material 
might also occur. A gas jet is applied coaxially or off-axially along with the laser 
beam in order to remove the molten material and produce the groove. In the case of 
coaxial gas jet, the process results in the concentration of molten material at the 
erosion front, which cannot be removed efficiently, due to the presence of the grove 
bottom. In order for the molten layer to be minimized, it is recommended that an 
off-axial gas jet be used. Laser grooving is very similar to laser scribing, only with 
the exception of the latter, where the groove depths are usually very small and their 
ratios to the corresponding widths are close to one. Laser grooving is met in various 
manufacturing applications, such as the creation of micro-channels for cooling 
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systems and the creations of slots for assembly, while in the form of laser scribing, 
it can be used for the purpose of marking parts’ identifications. The mechanism of 
laser grooving takes place also in the application of 3D laser machining (turning, 
milling). In 3D laser machining, usually more than two laser beams are used for the 
removal of a volume of material, which is produced by the intersection of two blind 
grooves (Tsoukantas et al. 2002). 

During laser grooving, several process parameters should be taken under consid- 
eration. The dominant ones include the laser power, the diameter of the laser beam on 
the workpiece surface, the laser scanning speed, the gas type, the gas pressure, the 
nozzle cross section, and the gas jet direction relative to the beam axis. The laser 
grooving process has been investigated by a number of researchers. Laser grooving 
and laser scribing have the following advantages over conventional techniques: 

• Lasers can be used to scribe or mark permanent identification patterns on 
metallic or ceramic parts. Laser-marked IDs can withstand greater amounts of 
wear than those marked with other methods. 

• Since lasers can be focused to a small spot, they are ideal for micromachining 
applications to repair defective integrated components which would otherwise 
be scrapped. 

On the other hand, laser grooving and scribing have the following drawbacks: 

• When applied to metals and ceramics, these processes result in molten material 
accumulation at the erosion front. However, unlike in laser cutting, a coaxial gas 
jet is not effective for ejecting molten material due to the presence of the groove 
bottom. 

• The groove depth can fluctuate due to disturbances caused by laser beam 
changes, mechanical vibrations, material impurities, and gas jet fluctuations. 
Consistency in groove depth can be maintained via a closed-loop control 
scheme. 


Cutting 

In the laser through- cutting process, a kerf is created through relative motion 
between the laser beam and the workpiece surface. This process allows intricate 
two-dimensional shapes to be cut on flat workpiece. The physical mechanisms of 
material removal and energy losses (Fig. 9) are similar to those of drilling, where 
the incoming laser beam energy is balanced by the conduction heat, energy for 
melting or vaporization of the material, and heat losses to the environment. How- 
ever, due to the relative beam/workpiece movement, the erosion front of the laser 
beam and the temperature field in the workpiece are stationary with respect to a 
coordinate system moving with the laser beam; therefore, laser cutting can be 
considered a steady-state thermal process. Since the workpiece thickness is 
equal to the depth of cut, conduction heat occurs in the plane of the workpiece. 
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Fig. 9 Laser through cutting 
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The temperature inside the workpiece is dependent on the distance of the erosion 
front and independent of time. When material is removed through melting, a molten 
layer is formed at the erosion front. The accumulated molten material can be 
expelled from the bottom of the kerf with the aid of a coaxial gas jet. 

Owing to the unique characteristics of the laser beam, laser cutting has several 
advantages: 

• For most industrial materials with workpiece thicknesses up to 10 mm, laser 
cutting produces a significantly higher material removal rate than the mechanical 
cutting or shearing does. 

• Laser cutting produces kerf widths, which are narrower than those achievable by 
mechanical cutting. This results in less material being wasted during the cutting 
operations. 

• When coupled with a multi-axis position control system of the workpiece or 
beam, the shapes can also be cut from curved workpieces. Conventional 
mechanical methods can only cut flat workpieces effectively. Lasers can be 
applied to trimming operations to remove flash and burrs from curved parts. 

• For the cutting of fibrous material such as wood, paper, or composites, the laser 
beam vaporizes the volume of the material to be removed, thereby eliminating 
the residue and debris that remain after mechanical cutting. This reduces both the 
need for solid waste collection and disposal and the health hazard in the work 
environment. 

The drawbacks of laser cutting in comparison to cutting by conventional 
methods are: 

• The laser cutting effectiveness reduces as the workpiece thickness increases and 
depends on the type of the material. Workpieces greater than 15 mm in thickness 
generally cannot be cut effectively by modem industrial lasers. Furthermore, 
materials such as aluminum and stainless steel with high reflectivity can be cut at 
lower thicknesses compared to those of carbon steel (Stournaras et al. 2009b). 
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• Laser cutting produces a tapered kerf shape, compared with the straight vertical 
kerf walls achievable by conventional methods. The kerf taper results from the 
laser beam ’ s divergence and becomes more pronounced as the workpiece thick- 
ness increases. The kerf taper can be reduced by adjusting the focal point of the 
laser beam to the interior of the workpiece instead of the workpiece surface. 


Three-Dimensional Machining 

Flexible manufacturing requires that three-dimensional shapes be produced in a 
short period of time. Although lasers can be easily and accurately manipulated with 
optical components, until recently, it was difficult for three-dimensional shapes to 
be processed and/or produced efficiently. The development of advanced optical 
scanning systems in combination with laser sources of increased power, quality, 
and overall efficiency has paved the way for the so-called 3D laser materials’ 
processing. The term 3D laser processing has been used so far for the description 
of a group of different three-dimensional laser processing concepts. At each of 
these concepts, the 3D aspect refers to a different manipulation of one or more laser 
beams, so as to process and/or produce three-dimensional geometries by 
performing material removal, welding, or heat treating. The most important con- 
cepts mainly focus on laser machining and laser welding processes with the 
incorporation of one or two laser beams simultaneously. The main concepts that 
have been introduced so far as research or industrial solutions for 3D laser mate- 
rials’ processing can be isolated into three main categories: 

• 3D laser machining with two converging laser beams 

• 3D remote laser processing 

• 3D laser processing with 5-axis laser heads 

3D Laser Machining with Two Converging Laser Beams 

By utilizing two converging laser beams (Fig. 11), this concept is capable of 
producing parts with shapes similar to those produced by conventional machining 
methods (Chryssolouris et al. 1988b; Chryssolouris et al. 1991). The primary 
feature of three-dimensional machining is the laser grooving process. Each beam 
creates a blind kerf or groove in the workpiece through single or multiple passes. A 
volume of material is removed when the two produced grooves intersect. Turning 
operations can be accomplished by ring or helix removal (Fig. 10a, b). 

The ring removal method uses two perpendicular beams to remove concentric 
rings from a workpiece, while in the latter method, two angular beams create a 
continuous thread. In the case of laser milling, two laser beams are positioned 
usually at oblique angles from the workpiece surface, in order for converging 
grooves to be created on the workpiece (Fig. lie, d). The volume of material 
removed is prismatic in shape with a triangular cross section (Chryssolouris 1991, 
1994; Chryssolouris et al. 1991). Gear cutting is also possible with the certain 
concept; however, more sophisticated adjustments of the relative angles between 
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Fig. 1 1 Three-dimensional laser machining using two converging laser beams 


the two laser beams are required, due to the complex gears’ geometry 
(Chryssolouris et al. 1991; Chryssolouris 1994). By manipulating the orientation 
of the two beams, the geometry of the removed chip varies correspondingly in the 
variation of the cutting tools’ geometry in mechanical machining. Energy is only 
expended in melting and/or vaporizing the material in the grooves. This results in 
greater energy efficiency and material removal rates comparable to single-beam 
processes. Issues related to process efficiency and control have been investigated 
both theoretically and experimentally. 
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3D Remote Laser Processing 

The principle of 3D remote laser processing is based on the implementation of high- 
quality laser sources and optical assemblies that combine lenses with adequate focal 
lengths (up to 1 .600 mm) along with one (Fig. 11a) or two (Fig. lib) deflecting 
mirrors (Goebel et al. 2000; Tsoukantas et al. 2007). Such configurations produce 

o 

fairly large working volumes (approximately 1 m or greater) of spherical (one 
mirror) or conical (two mirrors) geometry. In many cases, such scanning systems 
implement F-theta optics, which produce planar image fields. 

Due to the very low inertia of the scanning optics, high “duty ratios” can be 
achieved and large- sized workpieces can be processed in three dimensions by only 
one remote laser processing station. One of the most significant applications, which 
adopt the remote aspect for laser processing, is remote laser welding, with the 
keyhole welding to constitute its dominant mechanism; however, it could be easily 
used for trimming and scribing applications as well. The angles between the laser 
beam axis and the workpiece surface can reach 600, resulting in a beam spot area 
distortion of 25 % (Kautek et al. 1996). Regarding the process monitoring aspect, 
quite sophisticated sensing solutions are required due to the relatively large work- 
ing volumes that have to be covered. Gantry and robot arm stations, used in research 
or at industrial level, constitute the existing remote laser welding systems. Gantry 
stations, with applications in the automotive industry, are equipped with high power 
(>3,5 kW) C0 2 lasers and use focal lengths of 1,000-1,600 mm, resulting in 

o 

working volumes that range between 0, 1 and 1 nr . On the other hand, robot arm 
stations integrate Nd: YAG (>4 kW) or HPDL beams that can be guided to a robot’s 
end effector via fiber optics. Such systems provide lower working volumes, ranging 

Q 

between 22 and 6,400 cm . Other laser processes, adopting the 3D remote principle, 
include the majority of laser heat treatment processes such as hardening, alloying, 
sintering, etc. Mainly, the elliptical beam spots on the workpiece surface have an 
effect on the mechanism of each one of these processes that reduce power density, 
especially for low incidence angles. Laser shaping constitutes another significant 
laser process that adopts the 3D remote principle. Laser shaping systems use 
two-mirrored deflection systems in combination with Q-switched Nd:YAG lasers 
of approximately 100 W. Such systems have already been commercially available, 
and they have been used successfully in the manufacturing of molds and dies for 
rapid tooling applications (Holthaus 2002). 

3D Laser Processing with 5-Axis Laser Heads 

The majority of laser processes can be realized in 3D via 5-axis laser heads. Their 
concept is based on the realization of optical assemblies in combination with 
special motion controlling systems, which provide the movement of a laser head 
in five axes, i.e., three linear movements and two rotational around perpendicular 
axes (Fig. 12). 

Although such laser heads can implement a variety of focal lengths, they have to 
be in close range from the processed material, especially in the case of laser 
machining, since an inert or an active gas jet has to be applied during the 
processing. As with remote laser processing systems, the 5 -axis laser head ones 
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Laser Cutting Laser Welding 

Fig. 12 Laser processing examples with a 5-axis laser head 

present advanced flexibility, since they can apply multiple laser processes (e.g., 
cutting, drilling, welding, hardening) to quite bulky workpieces of complex geom- 
etry with just one setup. Since the laser head is in close range with the workpiece, it 
allows for the implementation of an efficient and simple process monitoring system 
in contrast to remote laser processing. The perpendicular or angled processing is 
also possible in 3D geometry, depending on the users’ wish. Other laser processes 
such as cladding or sintering are not recommended in these setups due to the nature 
of the processes, which require a rather distant approach of the beam to the 
workpiece for the protection of the optics. Such systems are commercially available 
and present quite a high level of automation. 


Micromachining 

The excellence attributes of laser radiation, combined with a high degree of 
flexibility, contact- and wear-less machining, the possibility of high automation, 
as well as easy integration, allow the use of laser tool in a wide field of 
macromachining processes. Laser machining is divided into micro- and 
macromachining. This classification is not based on the size of the workpiece but 
rather on the fineness of the impact caused by the laser tool. The laser system used 
for micromachining normally employs pulsed beams with an average power of well 
below 1 kW, while those used for macromachining generally use continuous-wave 
(CW) laser beams up to several kW. 

Laser micromachining is a technology that can produce parts in the micro- and 
sub-microscale. For such applications, lasers with pulse duration in the femtosec- 
ond range (FEMTOLASER) are widely used. Laser ablation occurs due to the 
irradiation of the laser beam onto the material causing a combination of sublima- 
tion, vaporization, and melting. It is characterized by small temporal and spatial 
scales and an extremely high material temperature and pressure. FemtoLaser 
(FL) drilling uses pulses with duration in the order of tens to hundreds of 
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Left Entrance of a hole produced in a 1 mm thick steel plate using with 150 fs pulses 
at a wavelength of 7 SO nm, having an energy of L3mJ, Right Exit of the hole 


Fig. 13 Experimental results on FemtoLaser drilling 


femtoseconds. During the recent years, the FL ablation has shown a rapid devel- 
opment due to its distinct advantages compared with those of the longer pulse lasers 
(Kautek et al. 1996; Staurt et al. 1996; von der Linde and Sokolowsi 2000). As a 
result of its extremely short pulse duration, the heat diffusion is confined, and the 
heat-affected zone (HAZ) is rather limited. This high localized heating, in each 
laser pulse, results in smaller material volume removal, and therefore, more precise 
machining results compared with the ones obtained from longer laser pulses. The 
laser ablation process depends on the physical properties of solid matter, laser 
parameters, and environmental conditions (Semerok et al. 1999; Stavropoulos and 
Chryssolouris 2007; Stavropoulos et al. 2010; Stavropoulos et al. 2012). Experi- 
mental investigations have revealed that with FL, along with the crater, there 
coexist a debris formation, the height of which depends on the target matter, the 
laser energy, and the number of pulses fired at the target. The crater diameters have 
been found to be greater than those of the laser beam. There has been coincidence 
found on the crater shape with radiation intensity distribution only at low laser 
fiuences (~1 J/cm ). The crater’s widening is the result of the laser beam being 
scattered in the near-surface plasma and the plasma action on the crater formation. 
The debris generation is the result not only of the liquid phase explosion but also of 
the plasma particle recondensation (Kelly and Miotell 1994). 

Experimental results (Nolte et al. 1999) are shown in Fig. 13. Laser 
microprocessing is used by many industrial sectors - semiconductors, electronics, 
medical, automotive, aerospace, instrumentation, and communications. 

Lasers for micromachining offer a wide range of wavelengths, pulse duration (from 
femtosecond to microsecond), and repetition rates (from single pulse to megahertz). 
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Table 2 Typical characteristics of lasers for micromachining operations 


Laser type 

Wavelength (nm) 

Pulse length 

Frequency (kHz) 

TEA C0 2 

10,600 

200 ps 

5 

Nd:YAG 

1060, 532, 355, 266 

100 ns, 10 ns 

50 

Excimer 

193-351 

20 ns 

0.1-1 

Copper vapor 

611-578 

30 ns 

4-20 

Ti sapphire 

775 

100 fs 

1-250 


These attributes allow for micromachining with high resolution in depth and 
lateral dimensions. The field of micromachining comprises manufacturing 
methods such as drilling, cutting, welding, as well as ablation and material surface 
texturing, whereby it is possible for very fine surface structures in the micrometer 
domain to be achieved. Such processes require a rapid heating, melting, and 
evaporation of the material. The use of extremely short nano- and pico- and even 
femtosecond pulse durations helps minimize thermal effects such as melting 
and burr formation, thus eliminating the need for any postprocessing measures. 
Typical characteristics of laser, used in micromachining operations in terms of 
source, wavelength, pulse durations, and frequency, are displayed in Table 2, while 
Table 3 categorizes the laser micromachining applications per source, application, 
and material. 

General qualitative results, highlighting the opportunities of laser 
micromachining, are presented in the following paragraphs. A comparative study 
with laser macromachining is also presented in some cases. Classification is based 
on the target material. 

Micromachining of Metals 

The micromachining of bulk metals with the use of nanosecond (or longer) pulse 
lasers was historically a difficult task due to issues that have to do with melting 
effects around the machined site, recast material, and general edge quality of the 
micro structure. The fine control that was required for precision microstructures 
could not be obtained with such lasers. Recently, the femtosecond lasers have 
overcome these problems and high-quality micromachining is now possible 
(Nolte et al. 1997) in a range of bulk metals, as shown in Fig. 14. 

Another recent development that has affected the area of metal micromachining 
is that of high repetition rate solid-state lasers, particularly the neodymium vana- 
date (Nd:YV04) lasers. The relatively short pulse duration of these lasers (typically 
a few nanoseconds to a few tens of nanoseconds) and the repetition rates of a few 
tens of kilohertz have enabled the sufficient development of robust processes. 
Extremely high-quality micromachining has also been demonstrated using these 
nanosecond pulse systems. Figure 15 shows an example of high precision 
micromachining using a 355 nm Nd: vanadate laser, operating at 10 kHz and with 
an average power of 6 W and a pulse duration of 38 ns. 

It can be seen in Fig. 15 that even with stainless steel metals, which are relatively 
difficult to be laser machined in good quality, comparatively good results can be 
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Table 3 Typical application of lasers in micromachining (Ogura et al. 1998) 


Laser 

Applications 

Material 

Microelectronics 

packaging 



Excimer 

Via drilling and interconnect drilling 

Plastics, ceramics, 
silicon 

Lamp-pumped 
solid state 

Via drilling and interconnect drilling 

Plastics, metal, 
ceramics, silicon 

Diode -pumped 
solid state 

High volume via drilling, tuning quartz 
oscillators 

Plastics, metal, 
inorganics 

C0 2 sealed or TEA 

Excising and scribing of circuit devices, 
large panel via 

Ceramics, silicon 

Semiconductor 

manufacturing 



Excimer 

U V-lithography IC repair, thin films, 
wafer cleaning 

Resist, plastics, metals, 
oxides silicon 

Solid state 

IC repair, thin films, bulk machining 
resistor 

Plastics, silicon, 
metals, oxides silicon 

C0 2 or TEA 

Excising, trimming 

Silicon 

Data-storages devices 



Excimer 

Wire stripping, air bearings, heads micro 
via drilling 

Plastics, glass, silicon, 
ceramics, plastics 

Solid state 

Disk texturing servo etching micro via 
drilling 

Metal, ceramics, 
metals, plastic 

C0 2 or TEA 

Wire stripping 

Plastics 

Medical devices 



Excimer 

Drilling catheters, balloons, angioplasty 
devices, micro-orifice drilling 

Plastics, metals, 
ceramics, inorganics 

Solid state 

Stents, diagnostic tools 

Metals 

C0 2 or TEA 

Orifice drilling 

Plastics 

Communication and 
computer peripherals 



Excimer 

Cellular phone, fiber gratings, flat panel 
annealing, ink jet heads 

Plastics, silicon, glass, 
metals, inorganics 

Solid state 

Via interconnect coating removal tape 
devices 

Plastics, metals, oxides, 
ceramics 

C0 2 or TEA 

Optical circuits 

Glass, silicon 


obtained using nanosecond pulse lasers provided that a proper process optimization 
has been undertaken. 

Micromachining of Glass 

The use of glass-based microstructures in disciplines such as biomedicine, bio- 
chemistry, lab-on-chip devices, sensors, and MEMS devices has led to increasing 
the attention that is being paid to glass micromachining. In almost all cases, any 
damage and micro-cracks around the laser-machined site should be avoided, and 
the femtosecond lasers have proven to be excellent sources for such precise 
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Fig. 14 


Femtosecond laser micromachining of aluminum (left) and steel (right) 


Fig. 15 100 pm diameter 
hole drilled in stainless steel 
using a 355 nm nanosecond 
pulse Nd:vanadate laser 



Exitech Limited 


machining work. Figure 16 shows two examples of borosilicate glasses machined 
with the use of a femtosecond laser. 


Micromachining of Diamond/CVD 

Diamond, due to its transparency in a wide wavelength range, is considered as a 
difficult-to-machine material. At high power densities, however, the diamond is 
transformed into graphite, which absorbs the laser power and is subsequently 
removed by ablation. Diamond machining is currently carried out by microsecond 
pulse Nd:YAG and nanosecond pulse excimer lasers. The synthetic CVD diamond 
has received a lot of attention as an attractive material for different applications, 
namely, in IR optical applications, detectors, sensors, and thermal management 
systems. The diverse range of applications results from the unrivalled set of 
mechanical, thermal, electrical, and optical properties exhibited by diamonds. 
However, one of the drawbacks resulting from the inertness and mechanical 
hardness of a diamond is its relative difficulty to be machined or etched conven- 
tionally. Lasers can overcome these difficulties, while various nano-millisecond 
pulse lasers (e.g., IR Nd:YAG lasers) have been used for a number of years for the 
cutting and dicing of diamonds. As diamond devices have matured, there are a 
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Fig. 16 Glass samples micromachined using a femtosecond laser 




Fig. 17 (a) Cutting of a CVD diamond wafer using a femtosecond laser, (b) sample of CVD 
diamond where the surface has been smoothed using a femtosecond laser and the edge has been cut 
using a nanosecond pulse 355 nm Nd:YV04 laser 


number of more stringent criteria since the existing “long” pulse lasers can no 
longer meet the necessary quality. Laser smoothing can be used for the removal of 
the surface’s facets and the creation of a flatter surface. UV excimer laser smooth- 
ing has been demonstrated and results in a very good quality, “smooth” surface. The 
use of femtosecond laser smoothing does not graphitize the surface in any notice- 
able manner, while electrical tests conducted on femtosecond laser-smoothed 
diamond have confirmed this. Figure 17a shows an fs-laser cut piece of a diamond, 
and fs-laser smoothing is shown in Fig. 17b. 

Laser micromachining is a rather precise process requiring accurate simulation 
methods (Stavropoulos and Chryssolouris 2006). Aiming at the simulation of such 
small-scale temporal and spatial phenomena, several techniques have been devel- 
oped. Molecular dynamics (MD) is based on the solution of Newton’s second law 
and aims at monitoring the movement of an atom within a system (Stavropoulos 
and Chryssolouris 2007; Stavropoulos et al. 2010). Each particle in the ensemble of 
N particles is treated as a mass point, interacting through force fields, which in turn 
have been derived from interacting potentials. MD works through the solution of 
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Newton’s equation of motion. Therefore, the system’s evolution can be calculated 
in certain time steps, where its total information (particle positions, velocities, 
kinetic and potential energies) is available for each time step. All further properties 
such as temperature, pressure, and autocorrelation functions can be determined 
without any additional parameters (Rieth 2000; Allen 2004). The reaction process 
of a laser beam with a bulk solid is the interaction between the electromagnetic field 
of light and the electric field of the atoms or molecules of the solid material. The 
light absorbed by the solid material can be divided into three categories, depending 
on the level of the light energy: 

• The excitation of the electronic state of atoms and molecules of the lattice 
particles 

• The excitation of the intermolecular vibration state of the lattice molecules 

• The excitation of the intermolecular or interatomic vibration of rotational state 
of the lattice molecules 


Emerging Industrial Applications 

There are a number of emerging candidates for the application area, which might 
drive the femtosecond laser technology into mainstream, mass-market industrial 
use - the “killer application,” as it is often called. Some of the application sectors 
include (Kautek et al. 1996; Staurt et al. 1996): 


• Biomedical devices: use of femtosecond lasers for stent manufacture or eye 
surgery. This is thought to be having huge benefits in terms of precision, damage 
to the surrounding area, and patient recovery. 

• Surface structuring: surface smoothing without imparting any thermal influence 
on the substrate or for the selective machining of multilayer devices without 
affecting particular layers. 

• Micro-optics: machining of micro-lenses or diffractive optical elements in 
optical materials with a high surface finish. 

• Photonics devices: machining of optical waveguides in bulk glasses or silica for 
photonic lightwave circuits and other telecommunications devices. 

• Micromachining: drilling of diesel injector nozzles or other thick metal machin- 
ing. It should be noted, however, that this particular application has been touted 
for many years without being significantly advanced, while it may now be the 
case that the nanosecond lasers may offer sufficient machining quality, suitable 
for the production of injector nozzles. 


Assisted Manufacturing Processes 

Laser-assisted manufacturing (LAM) is a hybrid cutting process during which 
a laser beam is used for locally heating and softening the workpiece in front of 
the cutting tool. The temperature rise at the shear zone reduces the workpiece’s 
yield strength and work hardening that makes the plastic deformation of 
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Fig. 18 Schematic of (a) 
laser-assisted turning and (b) 
laser-assisted grinding (Rozzi 
et al. 2000) 
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difficult-to-machine materials easier during machining. The technology of hot 
chipping/machining is not new. Previous generations of this technology had 
employed low-grade heat sources. Along with the development of more 
heat-resistant and difficult-to-cut tools, in conjunction with cost-effective lasers, 
hot machining became more and more important. LAM has some inherited 
advantages such as increased material removal rates, reduction in chatter, 
tool failure, decrease in cutting forces, and tool wear, and more importantly, 
LAM enables the machining of brittle work materials without extensive cracking 
(Chryssolouris et al. 1997). On the other hand, its thermal impact on the material 
is the main disadvantage of this process. The laser source of energy is used for 
softening the workpiece’s material when it is combined with conventional 
machining processes, such as turning, shaping, and grinding. The hybrid 
machining processes developed with laser and non-laser conventional machining 
processes are shown in Fig. 18. 

In laser-assisted turning, the laser heat source is focused on the un-machined 
section of the workpiece, directly in front of the cutting tool. The addition of heat 
softens the surface layer of difficult-to-tum materials, so it is ductile deformation 
that takes place rather than brittle deformation during cutting. This process yields a 
higher removal rate while maintaining the workpiece’s surface quality and 
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dimensional accuracy (). Work material classes that are well suited to the applica- 
tion of LAM technology include: 

• Hard and brittle materials such as ceramics 

• Heat-resistant materials such as nickel alloys 

• Materials with abrasive constituents such as high- silicon-content aluminum alloys 

• Materials inclined to significant strain hardening such as the austenitic stainless 
steels 

Klocke and Zaboklicki (1998) confirmed the feasibility of cutting sintered silicon 
nitride through LAM. For workpiece temperatures in the range of 100 °F, chip 
formation was observed to be realized predominantly through ductile plastic defor- 
mation, resulting in machined surfaces of a quality similar to that obtained in diamond 
grinding. Based on the electron microscopic analysis of chips obtained, Lei 
et al. (2000) inferred that the plastic deformation of silicon nitride in the shear zone 
was sustained by a reduction in the viscosity of the intergranular glassy phase at 
increased workpiece temperatures. Rebro et al. (2002) developed a double-ramp laser 
profile protocol for the LAM of ceramic to preclude the workpiece’s thermal fracture. 
The LAM of an alumina reinforced aluminum metal matrix composite was studied by 
Wang et al. (2002), who reported that laser has assisted in reducing the cutting forces 
and wear of the carbide tool by 30-50 % and 20-30 %, respectively. Westkaemper 
(1995) has further reported a laser-assisted grinding process whose stock removal rate 
in the continuous dress grinding of hot-pressed silicon nitride was increased by a 
factor of 6 with no adverse effects on the machined surface quality. Hybridization of 
laser machining by way of unconventional machining processes has also proven to be 
advantageous for improving the quality of machining oriented to microscale products. 
Ultrasonic -assisted laser beam machining (UALBM), laser- assisted electrochemical 
machining (LAECM), laser-assisted electro-discharge machining (LAEDM), and 
laser-assisted etching (LAE) are examples of laser hybrid machining processes. 


Summary 

Laser machining is a widely used nonconventional manufacturing process. It is 
based on thermal, noncontact interaction between a beam and the material to be 
machined. It can be applied to a wide range of materials. In this chapter, the state of 
the art of laser machining has been presented, and the major research areas in laser 
machining were discussed. The majority of the research work published is related to 
laser cutting, drilling, grooving, LAM, and micromachining; however, material 
thickness during laser machining is a major constraint, which can be addressed 
by improving the beam quality. The majority of the experimental works presented 
in the frame of laser machining studies the effect of parameter variations on quality 
produced characteristics. Among the various types of lasers used for machining in 
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industries, the C0 2 and Nd:YAG lasers are most established. Laser machining 
presents a number of advantages, compared with those of conventional machining. 
However, there is space for the process to be further improved with respect, in 
particular, to the energy consumption. 
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Abstract 

Ultrasonic machining (USM) is a nontraditional mechanical machining 
process and can be used in many applications. This chapter presents USM 
and rotary ultrasonic machining (RUM), including definitions, machine 
elements, input variables and their effects, applications, and advantages and 
disadvantages. In addition, ultrasonic vibration-assisted (UV-A) machining 
processes will also be introduced. These processes include UV-A turning, 
UV-A drilling, UV-A milling, UV-A grinding, UV-A electrical 
discharged machining (EDM), and UV-A laser beam machining (LBM). The 
machining principles, input variables, and major features for each process will 
be discussed. 


Introduction 

Wood and Loomis first reported the method of ultrasonic machining (USM) in 1927 
(Nishimura 1954; Neppire 1956). The first patent on an ultrasonic machine was 
granted in 1945. Later, many ultrasonic machines were manufactured worldwide 
(Springborn 1967). In 1954, the first ultrasonic tool was manufactured (Rozenberg 
et al. 1964). Ultrasonic machining has other names, such as ultrasonic impact 
grinding (Moore 1986). By the early 1960s, hundreds of papers have been published 
on ultrasonic machining and ultrasonic machine tools with various sizes and types 
(Churi 2010). 

In order to overcome USM’s shortcomings (the details will be presented in 
section “Advantages and Disadvantages”), a rotary ultrasonic machine was 
invented in 1964 by Percy Legge (Legge 1966). Brown and Roney proposed the 
idea of combining drilling and vibration in his patent (Brown and Roney 1960). In 
the patent, Brown and Roney used low-frequency (lower than 1 kHz) vibration to 
assist the drilling process. At that time, this drilling process was used only for 
machining of wood materials. 

By adding a rotating ultrasonic transducer to the old version of RUM (the 
workpiece vibrates up and down), a newer version of RUM was developed 
(Legge 1966). This improvement led to the possibility of RUM to drill holes in 
stationary workpieces to close tolerances. Using tools of various shapes, RUM 
could be used for end milling, tee slotting, dovetail cutting, screw threading, and 
internal and external grinding (Legge 1966). 

The purpose of this chapter is to introduce ultrasonic machining (as well as 
rotary ultrasonic machining), including definitions, machine elements, input vari- 
ables and their effects, applications, and advantages and disadvantages. In addition, 
ultrasonic vibration-assisted (UV-A) machining processes will also be introduced. 
These processes include UV-A turning, UV-A drilling, UV-A milling, UV-A 
grinding, UV-A electrical discharged machining (EDM), and UV-A laser beam 
machining (LBM). The machining principles, input variables, and major features 
for each process will be discussed. 
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Fig. 1 Illustration of 
ultrasonic machining 



Ultrasonic Machining 
Definition 

USM is used to make holes and cavities in hard and brittle workpieces by shaped 
tools, ultrasonic frequency mechanical vibration, and an abrasive slurry 
(Komaraiah and Reddy 1993). It forms a mirror-imaged cavity corresponding to 
the tool shape (Boothroyd and Knight 2006). During the machining process, as 
illustrated in Fig. 1, the tool is vibrating ultrasonically in the direction perpendicular 
to the machined surface and is fed into the workpiece by a constant force. The 
abrasive slurry is pumped into the gap between the tool and workpiece. Abrasive 
grains in the slurry act as indenters to generate brittle fracture in workpiece 
material. The tool vibration results in indentation of abrasive grains into the 
workpiece material. Material is removed primarily by abrasive actions of the slurry: 
impact between free abrasive grains and work surface due to vibration of the tool 
(Jana and Satyanarayana 1973; Markov 1966; Rozenberg et al. 1964; Pei 
et al. 1995b). The secondary material removal principles in USM include cavitation 
erosion and chemical action related to the slurry (Springbom 1967). 

Material removal mechanism in USM is mainly attributed to brittle fracture 
(Lawn et al. 1980; Bangalone 2001). To understand the material removal mecha- 
nism, single abrasive indentation fracture process needs to be understood. The 
sequence of single point indentation that occurs during USM process includes 
indentation, the initiation and propagation of cracks, and chipping of brittle material 
(Lawn and Wilshaw 1975; Komaraiah and Reddy 1993). The crack formation under 
single point indentation process is illustrated in Fig. 2 and can be summarized in the 
following steps: 

1. The single sharp indenter produces a plastic deformation zone (Fig. 2a). 

2. At a certain threshold, a median crack suddenly develops (Fig. 2b). 
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Fig. 2 Illustration of crack 
formation under single point 
indentation (+P, increasing 
load; — P, decreasing load; 
after (Moreland and Moore 

1988 )) 



3. With the increase of load, median crack grows (Fig. 2c). 

4. During unloading, the median crack begins to close (Fig. 2d) and lateral cracks 
form (Fig. 2e). 

5. After unloading, the lateral cracks propagate towards the workpiece surface and 
may accordingly lead to chipping (Fig. 2f). 

In some cases, lateral cracks may not extend to workpiece surface and 
not generate chipping. The material will be removed from the workpiece if 
the lateral cracks form due to two adjacent indentations (Komaraiah and Reddy 

1993). 

Machine Elements 

The ultrasonic machine mainly consists of two major systems, as illustrated in 
Fig. 3, an ultrasonic spindle system and an abrasive slurry circulation system. 

The main machine elements for ultrasonic spindle system include (1) high- 
frequency generator, (2) ultrasonic transducer, (3) ultrasonic amplitude transformer 
and tool holder, and (4) tool (Boothroyd and Knight 2006; Singal et al. 2008). The 
ultrasonic machining system is shown in Fig. 3. 

The ultrasonic transducer, ultrasonic amplitude transformer, tool holder, and 
tool must be in resonance to obtain useful amplitude and power output (Springbom 

1967). 
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Fig. 3 Schematic diagram of ultrasonic machining system (After (Singal et al. 2008)) 


High-Frequency Generator 

High-frequency generator (also called ultrasonic power supply or electronic oscil- 
lator) converts conventional electrical supply (typically, 50 Hz or 60 Hz) to high- 
frequency electrical energy. Frequencies of 20-40 kHz are the most commonly 
used (Boothroyd and Knight 2006; Singal et al. 2008). The high-frequency electri- 
cal energy is supplied to the ultrasonic transducer. 

Ultrasonic Transducer 

Ultrasonic transducer converts the high-frequency electrical energy into mechani- 
cal motion. The length and strength of the transducer material control the vibration 
amplitude (Boothroyd and Knight 2006; Singal et al. 2008). There are two major 
types of transducers - piezoelectric transducer and magneto strictive transducer. 

Piezoelectric transducer is made from quartz crystals or polycrystalline 
ceramics. 

Quartz crystals have volumetric shrinkage and swelling, if they are subjected to 
high-frequency electrical energy. Volume change due to electrical energy leads to 
mechanical motion (ultrasonic vibration). The maximum amplitude of vibration 
can be found at the resonant frequency of the crystal. In this case, the length of the 
crystal must equal to the half wavelength of the ultrasound wave in the crystal 
(Bradford and Richardson 1980; Singal et al. 2008; Boothroyd and Knight 2006). 

As illustrated in Fig. 4, the transducer made from polycrystalline ceramics has a 
sandwich structure which is stacked-up ceramic disks, high-density base, and 
low-density block (Boothroyd and Knight 2006). Such transducers can convert 
96 % of electrical energy to mechanical motion without cooling (Bradford and 
Richardson 1980; Singal et al. 2008; Boothroyd and Knight 2006). 

Magnetostrictive transducer is made from ferromagnetic materials, such as 
nickel and nickel alloys. When magnetostrictive transducer is magnetized, an 
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Fig. 4 Piezoelectric 
ultrasonic transducer (After 
(Bradford and Richardson 

1980 )) 
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Fig. 5 Magnetostrictive 
ultrasonic transducer (After 
(Bradford and Richardson 

1980 )) 
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pack 


expansion occurs. The structure of this kind of transducer is shown in Fig. 5. 
Applying high-frequency electrical energy to the coils surrounding the ferromag- 
netic material leads to vibration at the applied frequency. This kind of vibration can 
occur over a wide frequency range (e.g., 17-23 kHz for a 20 kHz generator). 
Generator with a wide frequency range allows greater design flexibility of ultra- 
sonic amplitude transformer and accommodates tool wear (Thoe et al. 1998). The 
energy conversion efficiency is 20-35 % with water cooling (Bradford and Rich- 
ardson 1980; Singal et al. 2008; Boothroyd and Knight 2006). 

Ultrasonic Amplitude Transformer (Horn) and Tool Holder 

The vibration amplitude generated from transducer, with the limit being approxi- 
mately 0.025 pm, is not adequate for machining. An ultrasonic amplitude trans- 
former is used to increase the vibration amplitude. The length of the transformer is 
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Fig. 6 Different shapes of cylindrical tool holder (After (Thoe et al. 1998)) 


integral times of half the ultrasound wavelength in transformer material. The 
reduction of cross-sectional area results in the increase of vibration amplitude 
(Boothroyd and Knight 2006; Singal et al. 2008). The increase in vibration ampli- 
tude can reach 600 %. Tool holder not only mounts the tool but also increases the 
amplitude with a similar function as ultrasonic amplitude transformer. In other 
words, the tool holder acts partially as an acoustic resonance transformer. The 
ultrasonic amplitude transformer and tool holder can be in different shapes, as 
illustrated in Fig. 6 (Thoe et al. 1998). 

The tool holder can be classified as amplifying tool holder (stepped, exponential, 
and tapper tool holder) and non-amplifying tool holder (hyperbolic tool holder). 

The commonly used materials for ultrasonic amplitude transformer and tool 
holder include monel, titanium alloy (Ti-6Al-4 V), stainless steel (AISI304), 
aluminum, and aluminum bronze (Thoe et al. 1998). 

Tools 

Since tools should not have brittle failure due to abrasive indentation, the cutting 
tools are usually made of tough, strong, and ductile materials, such as nimonic 
alloy, tungsten carbide, stainless steel, titanium, and copper. The tools should be 
designed to provide the maximum amplitude of vibration at the free end under a 
given frequency (Legge 1966). The tools can be mounted to a tool holder by 
brazing, soldering, or screw fastening (Miller 1957; Legge 1966). Alternatively, 
the actual tool can be machined onto the end of the tool holder. Screw fastening is 
most commonly used because this method is quick and easy. 

Generally speaking, the tool is moved towards the workpiece by static load 
produced by weight, spring, pneumatic/hydraulic feeding system, or solenoid 
system (Kennedy and Grieve 1975; Neppire 1956). 
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Machining variables 
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Fig. 7 Input variables of USM 


Abrasive Slurry 

The slurry is a mixture of abrasive grains and a carrier fluid (normally water or oil) 
with abrasive weight concentration of 30-60 %. The abrasive grains in the slurry 
are hard materials that can cut or abrade the workpiece. Common abrasive materials 
include boron carbide (BC), silicon carbide (SiC), silica (Si0 2 ), and alumina 
(A1 2 0 3 ) (Singal et al. 2008; Boothroyd and Knight 2006). Boron carbide is the 
hardest among these materials and leads to the highest material removal rate 
(Springborn 1967). 

The abrasive slurry circulating system is composed of a pump, a refrigeration 
system (optional), a slurry reservoir, and pipes. It is used to circulate the abrasive 
slurry to the cutting interface between tool and workpiece. During the ultrasonic 
machining process, heat is generated at the cutting interface and increases the 
temperature of the abrasive slurry. The refrigeration system is sometimes used to 
keep the slurry cool. The slurry is kept in a reservoir and cutting interface can be 
merged in the slurry (Kennedy and Grieve 1975; Neppire 1956). 


Input Variables and Their Effects 

The input variables of USM include variables related to machining, slurry, tool, and 
workpiece material, as illustrated in Fig. 7. 
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The guidance for these input variables is as follows: 

• Static load: about 0.1-30 N (Thoe et al. 1998) 

• Ultrasonic vibration frequency: 20^-0 kHz (Singal et al. 2008) 

• Ultrasonic vibration amplitude at the tool end: 5-50 pm (Singal et al. 2008) 

• Abrasive type (hardness): BC > SiC > Si0 2 > A1 2 0 3 

• Abrasive grain size: 15-150 pm (Singal et al. 2008) 

• Abrasive weight concentration: 30-60 % (Singal et al. 2008; Boothroyd and 
Knight 2006) 

• Tool materials (hardness): nimonic alloy > tungsten carbide > stainless steel > 
titanium > copper 

The USM experiments have been carried out under a large range of input 
variables by many researchers (Neppiras and Foskett 1956; Singal et al. 2008; 
Springborn 1967; Miller 1957; Thoe et al. 1998) to assess the effects of these input 
variables. 

Effects on Material Removal Rate (MRR) 

MRR increases with the increase of ultrasonic amplitude, frequency, abrasive 
concentration, abrasive material hardness, and the decrease of viscosity of carrier 
fluid and tool cross-sectional area (Singal et al. 2008; Springborn 1967; Miller 
1957; Thoe et al. 1998). Machining of soft and brittle materials (such as soft 
ceramic) leads to a higher MRR than machining of hard and ductile materials 
(Singal et al. 2008; Springborn 1967; Miller 1957; Thoe et al. 1998). Hardness of 
the tool material also affects the MRR. Harder tool material leads to a higher MRR. 

There is no monotonic relationship between the two other input variables 
(abrasive size and static load) and MRR. Increase in abrasive size increases MRR 
until it equals to the ultrasonic amplitude. Beyond this point, further increase in 
abrasive size decreases MRR. This trend is similar to the relationship between static 
load and MRR (Singal et al. 2008; Springborn 1967; Miller 1957; Thoe et al. 1998). 
MRR increases from 0 to a maximum at an optimum static load and then decreases 
(Neppiras and Forkett 1956). MRR also increases with the abrasive concentration 
of slurry to a value of 30-40 %. Further increase in concentration leads to a decrease 
in MRR (Bang alone 2001). 

Effects on Surface Roughness 

Lower surface roughness can be obtained by a decrease in abrasive grain size or an 
increase in static load, ultrasonic amplitude, and abrasive hardness (Nishimura 
1954; Thoe et al. 1998; Neppiras and Forkett 1956; Thoe et al. 1998). 

Effects on Tool Wear 

Harder and bigger size abrasive or softer tool leads to higher tool wear (Adithan and 
Venkatesh 1974). Tool wear increases with the increase of cutting depth (Thoe 
et al. 1998). 
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Table 1 Material machinability by USM (Springbom 1967) 


Aluminum 

Steel 

Super alloy 

Titanium 

Refractories 

Ceramics 

Plastic 

Glass 

Poor 

Fair 

Poor 

Fair 

Good 

Good 

Fair 

Good 


Effects on Hole Oversize 

The size of the machined hole by USM is equal to the size of tool plus an overcut 
which is approximately equal to two times of abrasive size (Bangalone 2001). For 
deeper hole machining, the oversize is greatest at the side of entrance and increases 
rapidly with the increase of cutting depth (Bangalone 2001). 


Applications 

Ultrasonic machining can be used to drill holes in almost all types of materials, such 
as metallic, ceramic, and plastic materials. However, it is not economic in machin- 
ing of materials with hardness less than 40 HRC (Bangalone 2001). Therefore, 
USM is primarily used in machining of hard and brittle materials. Table 1 shows 
material machinability by USM. 

In USM, material is removed by crack initiation, propagation, and brittle fracture 

Q 

of material. MRR in USM is not high (commonly less than 1.5 cm /min). The size 
range for holes that could be drilled is from 0.076 to 75 mm. Machining depths 
could be as large as 50 mm, but usually less than 5 mm. 

The shape of machined cavity is the inverse of the shape of the tool. A cavity of 
almost any cross-sectional shape can be machined (Thoe et al. 1998). 

Applications of USM include (Springborn 1967): 

• Machining slot of different shapes or contour profiles in hard and brittle 
materials 

• Producing holes in various cross sections using appropriate tools 


Advantages and Disadvantages 

The advantages of USM include (Springborn 1967; Singal et al. 2008; Markov 

1966; Rozenberg et al. 1964): 

• The process does not produce heat-affected zone and residual stress on machined 
parts. USM may also increase the high cycle fatigue strength of the 
machined part. 

• Compared with conventional machining methods, brittle materials can be 
machined more efficiently by USM. 

• Complex three-dimensional contours can be machined as quickly as 
simple ones. 
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Table 2 Comparison of output variables between conventional machining (milling) and ultra- 
sonic machining (Springbom 1967; Singal et al. 2008) 


Parameter 

Value 

Conventional machining 

Ultrasonic machining 

MRR 

(mm 3 /min) 

50,000 

300 

Tolerance 

(pm) 

50 

7.5 

Surface roughness 

(pm) 

0.5-5 

0. 2-0.5 

Subsurface damage 

(pm) 

25 

25 

Cutting force 

Level 

High 

Low 

Cutting temperature 

Level 

High 

Low 


• The dimensional accuracy (±0.005 mm) and surface roughness (0.1-0.0125 pm) 
can be good. It is a burrless and distortionless process. 

• The equipment is safe and easy to operate. 

There are disadvantages associated with USM (Springbom 1967; Singal 
et al. 2008). They are listed as follows: 

Q 

• MRR is low (about 3 mm /s). 

• Power consumption is higher than conventional machining processes. 

• Deep holes are difficult to be machined due to the restricted movement of slurry. 
Generally speaking, the depth of cylindrical holes could not be larger than the 
diameter of the tool. 

• There is a problem of tool wear. In order to produce a hole with accurate 
dimension, frequent tool replacement is needed. 

Table 2 compares the output variables between conventional machining and 
ultrasonic machining. 


Rotary Ultrasonic Machining 
Definition 

Rotary ultrasonic machining (RUM), as illustrated in Fig. 8, is a hybrid machining 
process that combines material removal mechanisms of abrasive grinding and ultra- 
sonic machining (USM) (Churi 2010; Legge 1964). In RUM, the tool is a core drill 
with metal-bonded diamond abrasives. During drilling, the rotating tool vibrates axially 
at an ultrasonic frequency (typically 20 kHz) and feeds along its axial direction towards 
the workpiece at a constant feedrate or under a constant pressure. Coolant pumped 
through the core of the drill washes away the swarf and keeps cutting temperature low. 

Machine Elements 

The rotary ultrasonic machine mainly consists of two major systems, as illustrated 
in Fig. 9, an ultrasonic spindle system and a coolant system. 
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Fig. 8 Illustration of rotary 
ultrasonic machining 
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Fig. 9 Schematic diagram of rotary ultrasonic machining 
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Fig. 10 Illustration of 
cutting tool in RUM 
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The major components in ultrasonic spindle system include an ultrasonic spin- 
dle, an ultrasonic power supply, an electric motor, a feeding device, and a control 
panel. Same as in ultrasonic machining, the ultrasonic spindle includes (1) high- 
frequency generator (ultrasonic power supply or electronic oscillator), (2) ultrasonic 
transducer, and (3) ultrasonic amplitude transformer and tool holder. 

There are two different RUM machines with two kinds of feeding devices 
(constant feedrate and constant pressure). For the type of machines with constant 
feedrate, constant feedrate is applied on the tool (or workpiece) to feed the tool 
(or workpiece) towards the workpiece (or tool). For the other type of machines, 
constant pressure is applied on the tool (or workpiece) to feed the tool 
(or workpiece) towards the workpiece (or tool). 

The motor atop the ultrasonic spindle supplies the rotational motion of the tool, 
and different speeds can be obtained by adjusting the motor speed controller on the 
control panel. 

The coolant system is comprised of pump, coolant tank, pressure regulator, flow 
rate and pressure gauges, and valves. The coolant system provides coolant to the 
spindle and the interface of machining. 

The cutting tool used in RUM is a superabrasive core drill which is a kind of cup 
shape grinding wheel, as illustrated in Fig. 10. 


Input Variables and Their Effects 

Since RUM is a hybrid machining process, some input variables are related to 
abrasive grinding and others are related to ultrasonic machining. For machines with 
constant feedrate, cutting force is variable; for the machines with constant pressure, 
the feedrate is variable. 
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The important input variables in RUM can be classified into four categories: 


Machining variables: ultrasonic vibration amplitude, ultrasonic vibration fre- 
quency, tool rotation speed, and federate (or static load) 

Tool variables: abrasive type, abrasive size, abrasive concentration, bond type, 
and tool geometry (slots, outer diameter, and wall thickness) 

Cooling variables: coolant type, coolant flow rate, and coolant pressure 
Major output variables in RUM include cutting force (for constant feedrate type 
machine); material removal rate, torque, surface roughness, tool wear, and edge 
chipping (for brittle materials); edge burr (for ductile materials); and delamina- 
tion (for composite materials). 


Effects on Cutting Force 

The increase of tool rotation speed or ultrasonic power and the decrease of feedrate 
lead to the decrease of cutting force in RUM of silicon, stainless steel, carbon fiber- 
reinforced plastic (CFRP) composites, ceramics, and titanium (Cong 2013; Liu 
et al. 2012; Churi 2010). In RUM of alumina, cutting force decreases remarkably as 
feedrate decreases, tool rotation speed increases, or abrasive size increases; and it 
decreases lightly with increase of ultrasonic amplitude or decrease of abrasive 
concentration (Liu et al. 2012). In RUM of silicon carbide (SiC), the effect of 
ultrasonic power is not significant. Higher tool rotation speed and lower feedrate 
result in lower cutting force (Churi 2010). 


Effects on Surface Roughness 

In RUM of stainless steel and CFRP composites, surface roughness increases as 
spindle speed or feedrate increases. Surface roughness is the lowest when ultrasonic 
power is 20^4-0 % (Cong 2013). In RUM of titanium, surface roughness decreases 
as the tool rotation speed increases, feedrate decreases, or ultrasonic power 
increases (Churi 2010). In RUM of SiC, higher tool rotation speed, lower feedrate, 
higher ultrasonic power, and higher abrasive grain size lead to lower surface 
roughness (Churi 2010). 

Effects on Edge Chipping (for Brittle Materials) 

In RUM of silicon, higher tool rotation speed, higher ultrasonic power, and lower 
feedrate lead to smaller edge chipping thickness and lower cutting force. Larger 
edge chipping is almost always accompanied by higher cutting force (Cong 2013). 
In RUM of alumina, ultrasonic power does not have a significant effect on edge 
chipping. Lower federate and higher tool rotation speed lead to smaller edge 
chipping (Jiao et al. 2005a, b; Li et al. 2005, 2006). In RUM of SiC, higher tool 
rotation speed, lower feedrate, lower ultrasonic power, and higher abrasive grain 
size lead to smaller edge chipping (Churi 2010). 


Effects on MRR (for the Machine with Constant Pressure) 

In RUM of ceramics, MRR increases with the increase of vibration amplitude, static 
load, tool rotation speed, or abrasive size (Pei et al. 1995b). MRR increases up to a 
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maximum value as the abrasive concentration increases. A further increase in 
abrasive concentration leads to lower MRR (Pei et al. 1995b; Hu et al. 2002). 

Effects on Tool Wear 

In RUM of alumina, with the decrease of ultrasonic frequency or diamond concen- 
tration or with the increase of static load, abrasive size, and bond strength, tool wear 
decreases. The tool wear decreases to a minimum value and then increases as the 
vibration amplitude increases (Zeng et al. 2004). 


Applications 

RUM is most commonly used in drilling. RUM is also extended to face milling (Pei 
et al. 1995a), disk grinding (Khanna et al. 1995), and complex contour machining 
(Ya et al. 2002). 

RUM has been used in machining alumina, beryllium oxide, canasite, compos- 
ites, ferrite, glass, polycrystalline diamond, silicon carbide, silicon nitride, stainless 
steel, titanium, uranium oxide cermet, zirconia, etc. (Cong 2013). 


Advantages and Disadvantages 

RUM is different from USM in several aspects. First, fixed abrasive tool is used 
instead of free abrasive slurry. Second, the tool is rotating. RUM inherits some 
advantages of USM including low cutting force, low cutting temperature, and little 
damage to workpiece. 

In comparison with USM, RUM is about ten times faster (Cleave 1976). 
Especially, it is much easier to drill deep and small holes with RUM than with 
USM. Other advantages of RUM include improved hole surface roughness (Graff 
1975), low cutting pressure and torque (Cusumano et al. 1974), low energy input 
(Cleave 1976), reduced chipping (Cleave 1976; Legge 1966), and little surface 
damage and low tool wear (Markov 1977). 

There are some disadvantages associated with RUM. (1) Only circular holes can 
be machined efficiently. (2) The tool may wear the wall of the machined hole which 
limits the accuracy, particularly for small holes. 


Ultrasonic Vibration-Assisted Machining 

In ultrasonic vibration-assisted (UV-A) machining, ultrasonic vibration is applied 
on either cutting tool or workpiece. For ultrasonic vibration-assisted (UV-A) 
conventional machining processes, the tool loses contact with the workpiece peri- 
odically under appropriate combinations of cutting speed, vibration amplitude, and 
vibration frequency (Berhl and Dow 2008). Generally speaking, applying ultrasonic 
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Fig. 11 Illustration of UV-A 
turning (After (Babitsky and 
Kalashnikov 2003)) 
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vibration in machining can increase cutting speed and reduce cutting force, cutting 
temperature, tool wear, cutting chip, and surface roughness (Berhl and Dow 2008). 

UV-A machining processes primarily include UV-A turning, UV-A drilling, 
UV-A grinding, UV-A electrical discharge machining (EDM), and UV-A laser 
beam machining (LBM). This section will discuss machining principles, input 
variables, and features of these machining processes. 


Ultrasonic Vibration-Assisted (UV-A) Turning 

In UV-A turning machines, the ultrasonic vibration generator is usually installed in 
the cutting tool holder base. As illustrated in Fig. 11, the workpiece is rotated, and a 
single-point cutting tool is fed longitudinally into the workpiece to machine exter- 
nal cylindrical (or conical) surface. In the meantime, cutting tool can vibrate in 
three directions: feeding direction, depth of cut direction, or cutting direction (Feng 
et al. 2012). 

Vibration can be applied in a single direction (ID system, most commonly used) 
or both cutting and feeding directions (2D system) (Brehl and Dow 2008; Babitsky 
and Kalshnilov 2003). Major input variables include tool type, workpiece rotation 
speed, feedrate, depth of cut, and vibration amplitude and frequency. 

Compared with conventional turning, UV-A turning can increase machinability 
with a wide range of feedrate, depth of cut, and tool material hardness (Brehl and 
Dow 2008). The advantages include reduced cutting force, extended tool life, 
improved surface roughness and accuracy, decreased cutting temperature, reduced 
cutting chip size, suppressed burr size, decreased residual stress and strain harden- 
ing, and reduced machining time (Thoe et al. 1998). 


Ultrasonic Vibration-Assisted (UV-A) Drilling 

In UV-A drilling, as illustrated in Fig. 12, ultrasonic vibration is applied on either 
drill or workpiece. In most of the UV-A drilling processes, ultrasonic transducer is 
located in the workpiece holder, making workpiece vibrate up and down. If the 
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Fig. 12 Illustration of UV- A 
drilling (After 
(Xu et al. 2009)) 
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Fig. 13 Illustration of 
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workpiece is too large or too heavy, a vibrating tool is preferred. Input variables of 
UV-A drilling include tool rotation speed, feedrate, tool geometry, tool material, 
and vibration amplitude and frequency. 

Compared with regular drilling, UV-A drilling has many advantages: reduced 
cutting force and torque, increased hole accuracy, reduced edge burr, lower hole 
surface roughness, decreased tool wear, and increased MRR (Xu et al. 2009). 


Ultrasonic Vibration-Assisted (UV-A) Milling 

The cutting tool used in milling is known as a milling cutter. Equally spaced 
peripheral teeth on the milling cutter intermittently engage and machine the work- 
piece (DeGarmo et al. 2010). In ultrasonic vibration-assisted (UV-A) milling, the 
ultrasonic vibration drives the workpiece along normal (vertical) or feeding direc- 
tion, as illustrated in Fig. 13. 

Compared with regular milling processes, UV-A milling leads to lower surface 
roughness, higher machine accuracy, and lower cutting force (Shen et al. 2012). 
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Fig. 14 Illustration of UV- A 
grinding (After (Zhao 
et al. 2008)) 
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Fig. 15 Illustration of 
ultrasonic vibration-assisted 
(UV-A) EDM 

Ultrasonic Vibration-Assisted (UV-A) Grinding 

In grinding, a grinding wheel is used to realize material removal by the interaction 
of abrasive grains with the workpiece at high cutting speeds and shallow penetra- 
tion depth (DeGarmo et al. 2010). In UV-A grinding, ultrasonic vibration can be 
applied to the grinding system (normally workpiece) in three different directions, as 
illustrated in Fig. 14, normal direction, longitudinal direction, and transverse 
direction. 

Ultrasonic vibration can be applied not only in either one of these directions but 
also in a combination of two directions (Zhao et al. 2008). Reported input variables 
include wheel cutting depth, wheel rotation speed, feedrate, vibration amplitude 
and frequency, as well as tool and workpiece variables (Feng et al. 2012). 

MRR in UV-A grinding can be higher than those in conventional grinding. 
UV-A grinding leads to lower grinding force and lower surface roughness than 
conventional grinding. 
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Ultrasonic Vibration-Assisted (UV-A) Electrical Discharge 
Machining (EDM) 

As illustrated in Fig. 15, EDM uses a series of rapid recurring current discharges 
(sparks) across a thin gap between tool electrode and workpiece to achieve material 
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Table 3 Summary of feasible materials and advantages 


Process 

Feasible material 

Advantage 

Ultrasonic 

machining 

Hard and brittle 
materials 

Lower cutting force, lower cutting temperature, less 
damage to workpiece, no heat-affected zone, less 
residual stress, higher dimensional accuracy, lower 
surface roughness, quick machining of complex 
contour 

Rotary 

ultrasonic 

machining 

Hard and brittle 
materials and some 
alloys 

Lower cutting force, lower cutting temperature, less 
damage to workpiece, higher machining accuracy, 
lower cutting pressure, smaller edge chipping, 
lower energy input 

UV-A turning 

Most metals and 
alloys 

Lower cutting force, longer tool life, lower surface 
roughness, higher machining accuracy, lower 
cutting temperature, smaller cutting chip, smaller 
burr size, lower residual stress and strain hardening, 
shorter machining time 

UV-A drilling 

Lower cutting force, higher hole accuracy, smaller 
edge burr, lower hole surface roughness and size 
tolerance, lower tool wear, smaller built-up edge 

UV-A milling 

Lower surface roughness, higher machine accuracy, 
lower cutting force, lower tool wear 

UV-A 

grinding 

Most materials 

Lower grinding force, lower surface roughness, 
lower tool wear 

UV-A EDM 

Most conducting 
materials 

Improved dielectric circulation, enhanced debris 
removal, shorter machining time 

UV-A LBM 

Most materials 

Higher machining efficiency and machin ability, 
better quality of machined hole, smaller heat- 
affected zone 


removal by melting and vaporizing (DeGarmo et al. 2010). The dielectric fluid is 
pumped through the interelectrode gap and flushes out the chips and bubbles while 
confining the sparks (DeGarmo et al. 2010). However, flow rate of the dielectric 
fluid in this gap is slow, and removal of discharged debris becomes difficult, 
resulting in lower machining efficiency and accuracy (Tong et al. 2008). Applying 
ultrasonic vibration in EDM is one of the methods used to improve dielectric fluid 
circulation, which can enhance debris removal. 


Ultrasonic Vibration-Assisted (UV-A) Laser Beam Machining (LBM) 

In LBM, intense laser beam focuses on the workpiece and generates high temper- 
ature in a very short time and cuts the workpiece by melting and vaporizing the 
workpiece material (Miyamoto et al. 2001). It is widely used in cutting and hole 
making on sheet materials. Ultrasonic vibration can be applied to the laser generator 
or workpiece in the direction perpendicular to laser beam. The illustration is similar 
to EDM in Fig. 15. 
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Compared with conventional LBM, UV-A LBM can facilitate machining effi- 
ciency and machinability, improve quality of machined hole, and reduce heat- 
affected zone (HAZ) (Liu et al. 2005; Xu et al. 2009). 


Summary 

This chapter has introduced ultrasonic machining (including rotary ultrasonic 
machining) as well as ultrasonic vibration-assisted (UV-A) machining processes. 
UV-A machining processes primarily include UV-A turning, UV-A drilling, UV-A 
grinding, UV-A electrical discharge machining (EDM), and UV-A laser beam 
machining (LBM). Table 3 summarizes the feasible materials and the advantages 
of these processes. 
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Abstract 

This chapter presents the waterjet as a tool for several processes, applications, 
and industries. To form a waterjet, the water must be pressurized to ultrahigh 
pressures (UHP), reaching 650 MPa, and then released across a small size orifice 
(~0.25 mm). A high-velocity jet is formed with capabilities of cutting soft 
materials such as plastics, leather, carpets, and fabrics. When abrasives, such 
as garnet, are added to the waterjet, an abrasive waterjet (AWJ) is formed. This 
AWJ is capable of cutting any material including metal, composite, glass, and 
ceramics. Also briefly discussed in this chapter are other types of cutting jets 
such as fan jets, cryogenic, and slurry jets. 

The waterjet system platforms are discussed in this chapter with focus on the 
UHP platform including the UHP pumps and plumbing. The working princi- 
ples of the intensifier and direct drive pumps are presented. The cutting 
platform is presented to include the cutting heads and jet formation along 
with several hydraulic parameter relationships considering the water com- 
pressibility effect. Also discussed is the effect of waterjet orifice upstream 
conditions on forming coherent waterjets with higher power densities and thus 
more efficient cutting performance. Mixing abrasives with a waterjet occurs in 
a mixing tube which must be of optimal length and diameter to produce 
coherent and efficient AWJ. 

The cutting trends and attributes of an AWJ are presented with focus on 
achieving accurate results using automatic kerf taper compensation. Several 
applications are also presented in this chapter addressing cutting, drilling, 
turning, and milling along with predictive models for these applications. 
The versatility of the AWJ process was demonstrated for machining complex 
3D parts. 

Introduction 

The power of waterfalls and rivers has been known since creation. This power is 
what alerted man to using water as a tool. In ancient Egypt, river branches were 
diverted to wash out soil in search of gold and other minerals deposited in relatively 
soft formations In the nineteenth century, higher-velocity streams were needed for 
washing out and carrying the ore from harder formations such as coal. Early in the 
twentieth century, pumps were developed to obtain more energetic jets for profit- 
able mining practices. The recent waterjet cutting technology was introduced to the 
industry in the early 1970s, enabled by the development of intensifier pumps 
capable of generating pressures in excess of 300 MPa pressures. The initial focus 
was on cutting tissue paper and cardboard, as well as mining and rock fragmenta- 
tion. Soon after, in the 1980, the abrasive waterjet (AWJ) technology was intro- 
duced which enabled the cutting of almost any material including metal, glass, and 
composites. This boosted the acceptance of waterjet technology and introduced it 
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into many industries and applications. Today, there are over 30 industries that apply 
waterjet technology. Examples of these industries and the materials being cut are: 

• Pulp and paper: Cardboard, tissue, diapers, paper 

• Aerospace and aircraft: Metal, composites, plastics, rubber 

• Optics industry: Mirror cores of fused silica, ULE, Zerodur 

• Automotive: Carpets, dashboards, glass 

• Arts and architecture: Stone, glass, metal 

• Food processing: Chicken, fruits, meats, chocolate, candy, fish 

• Mining: Phosphate, uranium, tar sands 

• Construction and civil engineering: Concrete, soil, metal, rock 

• Nuclear decontamination and decommissioning: Jet pumps, steel (control rod 
drives), concrete 

• Oil and gas/offshore: Metal and concrete cutting and piercing 

• Demilitarization: Steel, aluminum, propellants, chemicals 

• Petrochemical: Scales, crude residue, monomers 

• Marine and shipyard: Rust removal, steel 

• Electronics: PWBs, PCBs, ceramics 

In this chapter, we will present the waterjet tool(s), systems, and processes 
covering a few disciplines and interests. We will first discuss the waterjet tools 
and how they are formed. This will be followed by presenting selected system 
components such as pumps and cutting heads. Processes and applications such as 
cutting, drilling, turning, and milling will be discussed at the end of the chapter. 


Waterjet Tools 

A waterjet will be used as the general term for a wide range of fluid, fluid/fluid, and 
fluid/solid jets. Waterjets or waterjet tools (Hashish 1998a) can be classified as 
follows: 

• Fluid jets 

- Waterjets (WJ): Plain water is used to form the jet. These are round jets and 
range in size from 0.076 mm to 0.760 mm. 

- Pulsed waterjets (PJ): The waterjet is pulsed to increase impact stresses. 
Typically, these jets utilize lower pressure and higher flow rates, and by 
pulsing the flow, the impact pressure intensifies to that of water hammer 
pressure. 

- Polymer waterjets (PWJ): Polymer is added to the water. It has been observed 
that adding certain high molecular weight polymer to the water, jet coherence 
increases with additional benefits to efficiency improvement (Hashish 

1998a). 
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- Cavitating waterjets (CWJ): Cavitation is formed at the impact zone mainly 
when submerged. The collapse of cavitation bubbles on the surface of the 
material results in water hammer level pressure and thus accelerated material 
removal. 

- Cryogenic and liquefied gas jets (CJ): Liquefied gasses are used (Dunsky and 
Hashish 1994) to form the jet, which then becomes gaseous at ambient 
conditions. Liquid nitrogen jets have been formed (Dunsky and Hashish 
1996) up to 400 MPa pressures using specially designed pumps. Although 
not cryogenic, liquid ammonia jets have also be demonstrated (Hashish 
2002a) for demilitarizing chemical weapons. 

Fluid/fluid jets 

- Gas waterjets (fuzzy jets): Air (or another gas) is entrained in waterjets. Air 
entrainment helps break up the jet into droplets which in some cases may 
clean the material or affect the surface with certain desired characteristics 
(Chillman et al. 2010). 

- Liquid waterjets: A liquid is entrained in a waterjet. This type of jet is 
commonly known as a jet pump but not for the purpose of cutting. When a 
relatively small flow rate of the secondary fluid is used without significantly 
affecting the jet’s capability for cutting, some useful results may be obtained. 
For example, latex has been entrained into a waterjet to cut and seal the edges 
of fiberglass while cutting. 

Fluid/solid jets 

- Abrasive waterjets (AWJ): Abrasives are entrained in a waterjet. This is the 
most commonly used jet in the waterjet industry today (Hashish 1982, 

1984a). 

- Abrasive suspension jets (ASJ): A premixed slurry is pumped through a 
nozzle (Hollinger and Mannheimer 1991; Hashish 1991). An order of mag- 
nitude increase in abrasive power density is obtained in ASJ as compared to 
AWJs. However, significant hardware challenges slowed down the spread of 
this tool thus far. 

- Ice waterjets (IWJ): Ice is entrained into a waterjet (Geskin et al. 1999). 
Crushed ice has been entrained into waterjets mainly for cleaning 
purposes. Special preparation for the ice particles is needed to be of 
certain size and shape. Also, dry ice (C0 2 ) has been tested as an 
abrasive material for cleaning but more commonly used with air jets (dry 
blasting). 

- Abrasive cryogenic jets (ACJ): Abrasives or vanishing abrasives are used in a 
CJ. Jets formed with garnet entrained in liquid nitrogen showed similar 
performance to AWJs (Dunsky and Hashish 1996). 


Among all of the above waterjets tools, the most commonly used in the industry 
are plain waterjets, fan waterjets, and abrasive waterjets. The rest of this chapter 
will be limited to them. 
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Waterjet Systems 

There are several platforms (or subsystems) in a waterjet system. Figure 1 shows a 
schematic of a typical abrasive waterjet (AWJ) system. The waterjet system 
platforms can be divided as follows: 


Ultrahigh-pressure (UHP) pumps and lines: This is the most upstream compo- 
nent in a waterjet system where the water enters the pump at ambient pressure 
and exits the pump into the plumbing system at higher pressure. The UHP 
plumbing is used to transport the pressurized water to the jet-forming nozzle. 
Cutting heads: An on/off valve is used upstream of the nozzle to start and stop the 
UHP flow to the jet-forming orifice. The orifice is the most critical component that 
transforms the pressure energy into kinetic energy. If abrasives are used, then a 
mixing tube is used to mix and accelerate the abrasives using the high-velocity jet. 
Motion system: The motion system is used to manipulate the cutting head (or the 
workpiece) to affect the cutting process when the jet interacts with the material. 
These motion systems are most commonly of a gantry or cantilever architecture 
and may be one to five axes of motion. A CNC controller is typically used to 
control the interpolating motion of the machine. A robotic arm system may also 
be used as a manipulator. 
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Fig. 1 Waterjet system 
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Fig. 2 Pressure intensifier 


• Software: The software is used to enable operators to interface with 
the machine controller. Models that relate jet parameters to cutting results 
are used in the front end of the software. The models aid in identifying the 
motion kinematics in a transparent way to the operator whose concern may 
be limited to the CAD portion of the software for such tasks as path planning 
and nesting. 

• Ancillaries: There are several ancillaries in a waterjet system. These ancil- 
laries are used depending on the application. For example, a catcher tank is 
used for shape cutting, while a point catcher is used for trimming on the 
edge of a part. Probes, machine vision, and other sensors are also considered 
ancillaries. 


Waterjet UHP Pumps 

Two main classes of pumps are used in ultrahigh-pressure (UHP) waterjet cutting: 
intensifier- type and direct drive pumps. 

Intensifier pumps (Fig. 2) work on the principal of pressure intensification. A 
low pressure acting on a large area results in a higher pressure acting on a smaller 
area. Conventional variable output hydraulic pumps are used to generate pressures 
of about 20 MPa over a piston of an intensifier. A plunger with 20 times less area 
attached to that piston is used to transmit this hydraulic force to water, thus 
pressurizing it to 400 MPa. In double-acting intensifies, two plungers are attached 
to one piston, so while high-pressure water is being discharged on one side, 
low-pressure water is charged on the other side. Typical pump frequency is 
60 cycles per minute. Significant energy is lost due to flow and shifting of the 
hydraulic oil on the low-pressure side of the intensifier pump which is converted to 
heat. An important component of the intensifier pump is a pressure attenuator which 
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Fig. 3 Direct drive pump (one of three cylinders shown) 


is a pressure vessel used to store energy to compensate for pressure drop during 
intensifier position reversal. 

In direct drive pumps (Fig. 3), plungers are connected to a crankshaft drive rather 
than use hydraulic pressure to affect pressure on water. Most pumps of this type use 
three plungers (triplex pumps). The rotational speeds (400-2,200 rpm) of the 
crankshaft determine the flow rate and the required drive power. Current direct 
drive pumps are capable of 380 MPa operating pressure; however, higher reliability 
is obtained at lower pressures. These pumps are suitable for high flow rates as may 
be needed for cleaning or surface preparation. 


WaterJet System Plumbing 


Ultrahigh-pressure (UHP) plumbing consists of tubing, hoses, fittings, swivel joints, and 


other components as shown in Fig. 4. Small-diameter ( I.D . = 2 mm, O.D. 


6 mm) 


tubing is commonly used due to its relative flexibility in the form of long whips 
and coils. The pressure drop in these relatively small-diameter tubing can be 
significant, and thus its use must be limited to sections needing flexibility. An alternative 
option to achieving flexibility is to use swivel joints. In this case, larger tubing 
(less flexible) can be used with a number of swivels to provide the needed degrees 
of freedom. Pressure drop can also be significant in swivels and, thus, the 
plumbing system components and tubing. High-pressure flexible hoses with working 
pressures up to 380 MPa have been developed for field applications and also for 
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Fig. 4 Examples of plumbing components used in waterjet systems 


plumbing gantry and robot manipulators for factory applications. These hoses are 
composed of nylon liners with steel wire winding. The steel wrapping limits the 
hoses radius of curvature. 

A critical component in waterjet system is the on/off valve. Typical on/off 
valves are naturally closed and pneumatically actuated. They consist of two 
principal parts: an actuator and a poppet/seat assembly. Standard shop air pressure 
(~0.4 MPa) acts upon the actuator piston against a spring to allow lifting the poppet 
off the seat (containing a hole) and enables the flow to occur. It closes when the air 
pressure is released. 


Water Compressibility 

Water compressibility becomes an important factor at the pressure level used in 
waterjet cutting. The relationship between pressure, P, and density, p , of water can 
be obtained from 



It was found that the above equation fits Bridgman’s data (Bridgman 1970) with 
L = 300 MPa and n = 0.1368 at 25 °C. Figure 5 shows how the density and 
volume changes with pressure. This compressibility of water affects the overall 
energy performance of the system. For example, if the water compresses by 10 % 
before it reaches the required waterjet cutting pressure, then at least 10 % of the 
energy is lost due to compression. This is also a critical factor in designing high- 
pressure pumps where the dead volume must be minimized to maximize the pump 
volumetric efficiency. 
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Fig. 6 Examples of waterjet mounts 


Jet Formation 
Waterjet 

Typically, an ultrahigh-pressure waterjet is formed by an orifice made out of sapphire, 
ruby, or diamond to allow for more wear resistance. Figure 6 shows two methods of 
holding and sealing the orifice. The picture on the left shows a ruby orifice sealed with 
a plastic seal typically made out of UHMW polyethylene. The picture on the right 
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Fig. 7 Effect waterjet velocity and compressibility factor 


shows a natural diamond orifice held and sealed by metal sintering. Recently, 
synthetic diamond orifices have been introduced. They are typically held in the 
mount using a metal ring rather than a plastic seal. 


Waterjet Velocity 

The jet velocity Vj (considering compressibility) and its kinetic energy E can be 
expressed using Bernoulli’s equation in conjunction with the Eq. 1 for water 
compressibility. The resulting expression (Hashish 2002b) is as follows: 



The power E of a waterjet emerging from an orifice with a diameter d n at 
pressure P is simply the pressure multiplied by the flow rate, which can be 
expressed as follows (Hashish 2002b): 






where yr is water velocity compressibility factor (less than 1) and is shown in Fig. 7, 
C c is the jet contraction coefficient, and C v is the coefficient of velocity. The orifice 
overall coefficient of discharge, C d , includes these coefficients. 

Without considering the compressibility, the following are simple relations 
among the different hydraulic parameters of a waterjet: 
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Fig. 8 Waterjet flow, power, and orifice size relationship 
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The relationship between the waterjet pressure, flow rate, orifice size, and power 
is shown in Fig. 8. 


Waterjet Coherency 

An important feature of a plain waterjet is its coherency. Higher coherency 
increases the power density, and it has been observed that coherent jets are more 
effective in cutting. Also, coherent jets can operate at longer standoff distances. 
Figure 9 shows waterjets with different coherency levels. Yanaida (1974) presented 
equations describing the waterjet structure and its velocity distribution. Basically, a 
high-pressure waterjet consists of an initial coherent waterjet core followed by a 
zone of droplet jet. The critical factors that affect the jet coherency (and its power 
density) include the following: 

• Upstream tube diameter and length: The effects of upstream tube diameter, d, 
and the level of turbulence above the orifice were studied. Results show that 
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Fig. 9 Waterjets with different coherency levels at 350 MPa pressure 


the jet coherency improves when the tube size reaches a certain critical size. 
Any further increase in tube diameter does not further improve coherency. The 
upstream Reynolds number correlates well with jet coherency length (defined as 
the length at which the jet diameter is doubled) as laminar upstream flow results 
in more coherent jets. The addition of long-chain polymer to the water was found 
to enhance the jet coherency due to drag reductions and upstream turbulence 
suppression. An upstream length of at least 20 tube diameters was found to be 
important in producing coherent jets. 

• Orifice edge geometry and condition: A chipped edge on an orifice is the most 
important factor that affects jet coherency. Accordingly, water filters are typi- 
cally used upstream of water pumps. In the abrasive waterjet nozzle, the migra- 
tion of abrasives upstream of the orifice should be eliminated by proper 
sequencing of abrasive and waterjet shutdown. Also, controlling the pressure 
field in the mixing chamber is of critical importance. 


Fan Waterjets 

For cleaning and surface preparation applications, it was found that fan-type jets 
are more practical and more efficient than round jets. In these jets, the energy is 
spread linearly in order to cover a wider zone. The power density is thus lower 
but only needs to exceed the material threshold for material removal. Shimizu 
(2006) showed the effect of fanjet parameters on material erosion. Fanjet 
nozzles are typically made out of metal (recently from diamond as well) 
using special manufacturing techniques to obtain the correct geometry, surface 
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Fig. 10 UHP fanjet and nozzle 


finish, and the material prestress condition. The design and fabrication processes 
aim at evenly distributing the jet power and minimizing the edge “hot spots” 
where the jet tends to be more powerful. Figure 10 shows the shape of the fanjet 
orifice whose projected shape is elliptical, while the front and side views of the 
fanjet (Shimizu 2006) show an initial laminar zone followed by turbulent and 
structured zone. 


AWJ Formation 

Several methods for forming an ultrahigh-pressure AWJ have been reported by 
Hashish (Chillman et al. 2010). Figure 11 shows a schematic of the current AWJ 
nozzle. Typical waterjet jet diameters are 0.08-0.5 mm, and typical jet velocities 
are up to 900 m/s at 400 MPa. The flow of the high-velocity waterjet into the 
concentrically aligned mixing tube creates a vacuum, which is used to transport 
abrasives from a hopper to the nozzle abrasive chamber via a suction hose. A 
typical abrasive material is garnet, which has flow rates from few grams per minute 
to 2 kg/min. Medium and fine abrasives (mesh 60 to mesh 200) are most commonly 
used for metal, glass, and resin composites. The abrasives are accelerated and 
axially oriented (focused) in the mixing tube, which has a length-to-diameter 
ratio from 50 to 100. Typical tube diameters are 0.5-1. 3 mm with lengths up to 
150 mm. A hard and tough material such as tungsten carbide is used as a mixing 
tube to resist erosion. 

Based on the momentum and continuity equations of the liquid and solid flow, 
the following equation by Hashish (2003) results for the particle velocity V a at a 
distance x inside the mixing tube: 
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Fig. 1 1 Abrasive waterjet nozzle and parameters 





where 



3C D {\+r) 2 


4S a dp 



( 10 ) 

( 11 ) 


where r— rh a /rh w is the abrasive loading ratio. The above equations suggest that 
larger particles require longer mixing tubes. Note also that as the abrasive flow rate 
increases, shorter tubes can be used to attain the maximum velocity. For 100 mesh 
abrasives, for example, a mixing tube length of only 33 mm is required for an 
abrasive loading ratio of 0.12. The typical mixing tube length used in industry for 
this case, however, is about 76 mm. The additional length is used to collimate the jet 
and to raise the value of A to about 0.95, as can be calculated from the above 
equations. Note that the additional 43 mm of mixing tube length contributes only 
5 % to the maximum possible velocity. 

While the addition of abrasives to waterjets was found to significantly enhance 
its cutting power, the abrasive kinetic energy in the abrasives will not be more than 
10-15 % of the waterjet. Moreover, this energy is distributed over the mixing 
nozzle cross section, which is typically 10 times more than the area of the waterjet. 
Accordingly, the power density of the abrasives in an AWJ is about two orders of 
magnitude less than that of a waterjet. However, this power is refocused through the 
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Jet Motion 



Fig. 12 Cutting stages 

tips of the abrasives on the workpiece to cause more local energy concentration and 
thus significantly more material removal than is possible with plain waterjets. 

The use of vacuum assist was introduced to allow jets with weak air entrainment 
performance to draw more air and, thus, provide a more effective abrasive-carrying 
capacity. 


Waterjet Processes 

The above waterjet tools have been demonstrated for a wide range of material 
removal applications. The following is a list of applications where waterjets have 
been applied. While some of these applications are in commercial use today, other 
applications are still emerging: 

• Cutting: The jet is used to cut shapes or sever materials. 

• Drilling (piercing): The jet is used to drill a hole without trepanning. 

• Turning: The jet is used to create a surface of revolution. 

• Milling: The jet is used to remove material to a specific depth. 

• Fragmentation: The jet is used to fragment the workpiece. 

• Jet assist: The jet is used to assist other material removal processes such as 
cooling, lubrication, debris removal, and laser beam guiding. 

• Surface modification: The jet is used to modify the surface such as cleaning, rust 
removal, paint removal, peening, texturing, stripping, or polishing. 

• Other: Deburring, peeling, and powder fabrication. 


Cutting 

A visualization study conducted by Hashish (1988) in transparent materials such as 
glass, Lexan, and Plexiglas illustrated the macro nature of the penetration process of 
the AWJ. Figure 12 shows a schematic of the different cutting stages. The devel- 
opment (or initial) stage of cut is observed before the jet reaches the maximum 
depth of cut, h. Beyond this initial stage, the cutting proceeds in a cyclic manner. 
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Fig. 13 Images of jet/material integration 

A steady-state interface to a depth h c exists at the top of the kerf. Below h c , a step of 
material exists and appears to move under the impact of the jet until it reaches the 
final depth h. This step progression process is illustrated further in Fig. 13 with a 
series of pictures of the jet/material interface. 

Based on erosion studies terminology by Bitter (1963) and Finnie (1958), the 
zone up to the depth h c can be termed the “cutting wear zone,” as material removal 
occurs primarily by particle impact at shallow angles. The step formation zone 
below h c can be termed the “deformation wear zone,” as impact at large angles 
causes material removal. Another marked difference between the zones of cutting is 
that cutting wear is a steady-state process in which the material removal rate equals 
the jet material displacement rate by traversal. When this condition terminates at 
the depth h c , the jet penetrates the material by removing a step at a decreasing rate 
as the depth increases. During this process, the effective jet diameter changes, and 
jet instabilities contribute to waviness formation. 

Traverse-induced waviness often appears on the cut edge starting from the upper 
surface either randomly or regularly depending on the traverse system. Waviness can 
also be caused by unsteadiness of the waterjet pressure or the abrasive feed rate. The 
cutting wear mode, however, may not contribute to kerf generation at certain ranges 
of parameters. For example, increasing the traverse speed will decrease the depth of 
cutting wear (Hashish 1984b). Over a certain critical traverse speed, no steady-state 
(cutting wear) kerf will be generated, and all cutting will be due to deformation wear. 

An interesting observation seen in kerfs made by all stream-like cutting tools 
such as waterjets, lasers, and plasma jets is that they have the common feature of 
exhibiting a wavy or striated surface at certain ranges of parameters. 
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Cutting Model 

Based on a simple mathematical derivation, Hashish (1984b) expressed the depth of 
cut h c as 



where c represents the portion of the abrasives used for cutting. Inclusion of a 
threshold velocity below which all deformations are elastic, as in Bitter (1963), can 
be incorporated by replacing V a with (V a — V c ), where V c is the critical velocity. 
The intrinsic velocity (Vi) combines both particle and material characteristics and 
may be used as an erosion condition for erosion characterization. It is expressed as 



where Rf is a particle roundness factor defined by 





The equation for deformation wear depth (h d ) was also derived by Hashish 
(1984b) based on Bitter’s (1963) equation for erosion by solid particle impact at 
large angles. This equation is 




JT dj (7^ U 

2(1 - c )m a (V a -V c ) 




Then, the total depth of cut (h) is the sum of h c and h d . 

It should be noted that at high traverse rates, no steady zone ( h c ) will be 
established. In this case, all the cutting action will belong to the deformation 
wear zone, and its attributes will cover the entire cut surface. The critical rate at 
which this transition occurs is related to the critical angle (a c ) of erosion by particle 
impact. 

The critical traverse rate can be determined from 
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Fig. 14 General trends of cutting with an abrasive waterjet 


Linear Cutting Trends 

The AWJ operation is controlled by the following parameters: 

• Hydraulic parameters 

- Waterjet pressure (P) 

- Nozzle (orifice) diameter ( d n ) 

• Abrasive parameters 

- Abrasive material (density ( q a ), hardness, toughness, etc.) 

- Abrasive particle size ( d a ) and size distribution 

- Abrasive particle shape (roundness, sphericity, etc.) 

- Abrasive flow rate ( m a ) 

• Mixing parameters 

- Mixing tube length (l m ) 

- Mixing tube inner diameter (d m ) 

The effects of these parameters on the cutting speed or cutting depth is qualita- 
tively shown in Fig. 14. Figure 15 shows an example of the effect of pressure on 
cutting speed. 


Cut Attributes 

Figure 16 shows the different geometrical attributes that may or may not exist on an 
AWJ cut surface. These attributes are: 

• Trailback 

• Kerf width (taper) 
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Fig. 15 Effect of pressure on cutting speed 



Fig. 16 AWJ cut attributes 


• Kerf shape (bow) 

• Surface finish 

• Top edge rounding 

• Exit burr 

Trailback 

When jets (or any other beam cutting tool) cut through and separate the material, 
three phenomena are observed. The first is that the jet is deflected opposite to the 
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direction of the motion (Hashish 1984b; Henning and Anders 1998; Henning 
et al. 2002; Henning and Westkamper 2000). This means that the exit of the jet 
from the material lags behind the point at the top of the material where the jet 
enters. The distance by which the exit lags the entrance is typically called the 
trailback, lag, or drag as shown in Fig. 16. Crow and Hashish (1989) and Hashish 
(2002b) developed a universal AWJ kerf equation by dividing the kerf zone into an 
upper direct impact zone and a more significant centrifugal abrasion zone. Ignoring 
the upper zone, the trailback (t h ) at depth ( h ) has been derived as follows: 






where 





While the above equation does not include the effect of particle velocity decay as 
depth increases, it shows that jet power density is an important factor in reducing 
the trailback. 


Kerf Width and Taper 

The second phenomenon is that the width of the cut varies along the cut from top to 
bottom (see Fig. 16). This difference in width is typically called the taper of the cut. 
A taper can be either positive or negative, that is, the width at the exit of the cut may 
be either smaller or larger than the width at the top. Typically, the kerf width at the 
exit side is smaller than that at the entry at practical cutting speeds. Hashish 
(Hashish 2002b), based on a waterjet model (Hashish and DuPlessis 1979), 
proposed a kerf width profile equation in the form 



An example of the cut profile cross section is shown in Fig. 17 at different 
cutting speeds as percentages of the maximum possible cutting speed (Hashish 
2007). The cutting parameters are also shown in the same figure. Observe that the 
zero taper condition is somewhere between 5 % and 10 % cutting speeds. A 
striation-free cut is also observed to be at about 30 % cutting speed from Fig. 17 
or about 1.4 mm/s (i.e., 4.1 times the zero taper cutting speed condition). 

The kerf profiles suggest that tilting the jet to compensate for taper will be of an 
advantage to increase the cutting speed. 

For shape cutting, the trailback and taper phenomena manifest themselves in 
distortions to the geometry of the cut at the exit side. The sketch in Fig. 18 shows 
an undercut due to the trailback phenomena (Hashish 2007; Knaupp et al. 2002). 
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Fig. 17 Width of cut profiles in 1 in. thick titanium 





Fig. 18 Undercutting at the bottom of cuts 


The picture in the same figure shows distorted square-shaped cuts at the bottom 
surface of the material due to trailback and taper. 

Surface Finish 

The third phenomenon is related to the surface waviness which is the macro-level 
surface finish of the cut. Figure 19 shows a typical striated (wavy) surface produced 
by AWJ. Observe that the upper surface of the cut is free from waviness but 
still rough due to the abrasive erosion process (micro-level material removal). 
The hypothesis of the waviness is that the jet/material interface is not steady. 
A step of material moves under the jet until it reaches the bottom of the workpiece. 
During this time, the jet traverses, and its effective diameter is reduced as 
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Fig. 19 AWJ cuts showing Kerf top, bottom, side, and surface morphology 


it penetrates deeper. Hashish (1992) developed the following simplified 
expression based on this hypothesis: 




djhu 

°-5 m a V 2 Jo f 




This equation shows the effect of jet power density on the surface waviness. 

The general trend of taper, trailback, and surface finish as functions of speed is 
illustrated in Fig. 20. This figure shows general speed zones separated by four 
critical cutting speeds. The first critical cutting speed u x is the one at which zero 
taper occurs. Speeds slower than u x will result in divergent cuts with negative taper 
and no waviness. The second critical cutting speed u 2 characterizes the beginning of 
waviness formation. Increasing the speed beyond u 2 will continue to increase taper 
to a maximum value at the third critical speed u 3 . Beyond u 3 , the taper will 
decrease, and the surface will be highly wavy and irregular. At speed w 4 , the jet 
will barely cut through the material, not cut through completely. 

The cut surface at speed slightly below u 2 will produce a waviness-free surface 
similar to, but slightly rougher, than that obtained at speed u x . Usually, u 2 is several 
times faster than u \ . 

To capitalize on dynamic waterjet angle tilting capability (Knaupp et al. 2002; 
Zeng et al. 2005), the cutting speed can be maximized based on the required surface 
finish regardless of taper (and trailback). In this case, taper angles are used to obtain 
the required part accuracy by correcting the wall taper on the required side of the 
cut. Assume that an acceptable surface finish is Ra. This will identify a cutting 
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Fig. 20 General cutting speed zones 


speed Uf. The taper obtained at this speed is then determined as shown in Fig. 21. 
This will define a taper angle to be used. The same applies for trailback. 


Selected Applications 
Small Hole Drilling 

Jets (or any energetic beam) do not necessarily produce straight-walled holes or 
holes with uniform diameters, like solid tool drills, due to the nature of a jet and its 
interaction mechanics with the material. Figure 22 shows different hole shapes that 
may result from waterjet piercing. 

As the jet is penetrating through the material, the return flow is also exiting the 
hole and may cause secondary erosion. The strategy for controlling the qualitative 
and quantitative features of a hole, as well as reducing the drilling time, includes 
both before-breakthrough and after-breakthrough techniques (Hashish 2002c; 
Ohlsson et al. 1992). Some of the most effective techniques, which can be grouped 
into the categories of jet dynamic parameters and kinematic drilling parameters, are 
discussed below. 

Jet dynamic parameters are those that can be changed during drilling, such as 
pressure, abrasive flow rate, and abrasive material (or particle size). Useful jet 
dynamic parameters include: 

• Pressure ramping during drilling: This is particularly important when drilling 
relatively thick sensitive materials that may crack, such as glass; delaminate 
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Fig. 21 Gain in speed due to dynamic jet tilting 



Fig. 22 Hole geometries Straight Convergent Rounded Divergent Divergent 

obtained with AWJ entr Y Convergent 


such as composites; and/or chip such as coated (or TBC) metals. This results in 
faster drilling and better control over hole shape than using a constant “safe” 
lower pressure. 

• Vacuum assist: With small jets or at lower pressures (e.g., during ramping), 
vacuum assist enhances the jet suction capability by entraining the abrasives 
more reliably into the mixing chamber and also before the waterjet is fired. 

• Flushing: For high-volume hole drilling (hundreds and thousands), it was found 
that periodic flushing of the mixing chamber with water is important to maintain 
process reliability. The flushing water removes the abrasives build up in the 
mixing chamber. 

Additional dynamic and kinematic techniques that could be used with AWJ hole 

drilling are: 

• Abrasive ramping during drilling: This is to keep the abrasive flow rate above a 
certain critical limit (Hashish 2002c). 

• After-breakthrough parameter. 
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Fig. 23 AWJ-drilled holes 


• Continuous angle change during drilling: This method is of importance when 
drilling at shallow angles (Hashish 2002c). 

• Trepanning while drilling: Trepanning while drilling can be achieved by rotation 
or orbital motion. Traversing the manipulator (in a circular path) is a simple and 
common approach (Hashish 2002c). This results in faster penetration due the 
enlarged escape path for the return jet. 

• Increase or decrease in standoff distance after breakthrough: This method is used 
to improve the hole taper and also to round the edges of the hole if needed. 

• Dwelling after breakthrough: This adds a control parameter that affects the hole 
shape, and it also improves hole roundness. 

Figure 23 shows examples of AWJ-drilled holes. 

Drilling Model 

Several analyses and modeling studies have been performed on the AWJ drilling 
(Ohlsson et al. 1992; Guo and Ramulu 1999; Ramulu et al. 2005); a simple analysis 
of the drilling process by Hashish (2002c) yields the following equation for the 
drilling time: 



2E d K 2 





This equation shows the importance of increasing the power density of the 
abrasive particles in order to reduce the drilling time. It also shows that the drilling 
time is exponentially proportional to the depth. As depth increases, the abrasive 
kinetic energy needs to be increased, and thus pressure ramping is important. 
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Increasing the factor K 2 in the above equation will also increase the drilling time; 
thus, efforts should be made to reduce K 2 . The factor K 2 from Eq. 21 can be 
reexpressed as follows: 




4 S a d p 



where s is the abrasive particle -specific gravity and d p is the abrasive particle 
diameter. Now, from this equation, it is obvious that reducing the drilling time 
requires reducing the drag coefficient as can be intuitively understood. This can be 
accomplished by reducing the water flow rate out of the hole (which is equal to the 
inlet flow rate), at least before breakthrough occurs. Trepanning, because it enlarges 
the hole and directs the return flow, reduces C d significantly. This results in a 
dramatic improvement in drilling time. Increasing the abrasive- specific gravity and 
size also helps to reduce the drilling time. Dense, large abrasives will carry their 
momentum deeper into the hole. 


Controlled-Depth Milling 

To control and limit the depth of cut by a waterjet, repeated passes are needed, and 
only small amounts of material are removed per pass. This requires the jet either to be 
weak or moving at relatively high traverse rates (Hashish 1989; Fowler et al. 2005). 
The latter approach involves the use of masks made out of more resistant materials 
because high-speed contouring motion is difficult to achieve in practice. Three 
methods are typically used for controlled-depth milling (Hashish 1994): 

• Liner milling: In this method, Cartesian motion is used to scan the jet over the 
masked workpiece. 

• Radial milling: The jet moves radially over masked workpieces mounted on a 
rotating platter. 

• Cylindrical milling: Controlled-depth milling can be achieved on both the 
outside and inside of cylindrical workpieces or workpieces mounted on a 
rotating cylinder. 

Figure 24 shows radial and cylindrical milling concepts. 

Masks can be cut with an AWJ or laser and mounted on the workpiece. This 
method has been used to mill isogrid patterns on the inside and outside of cylinders, 
cones, and domes (Hashish 1994, 1998b). Figure 25 shows several examples of 
milled shapes. A depth control accuracy of about 50 pm has been demonstrated. 

The milling of variable-depth pockets can also be achieved by controlling the 
exposure time of the AWJ over the different areas to be milled, which can be 
accomplished by varying the traverse rate and the number of passes. Milling can 
also be used for multiple pattern cutting such as holes which can be milled through 
simultaneously by using a mask that has been predrilled with the hole pattern. 
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Fig. 25 AWJ-milled parts 


The milling of thin, closely spaced slots can be accomplished with or without a 
mask as shown in Fig. 25. To control the slot (or groove shape), both lead and taper 
angles may be used. 

Milling Model 

The milling process generally involves traversing the jet many times over the area 
to be milled. During a single “sweep” over an area, due to the width of the swath of 
the jet and the overlap at the edge of each swath, the jet may cut over some zones 
more than once. Because of the high traverse rates used in milling, no cutting wear 
mode will be encountered, and the depth of milling per pass can be determined from 
Eq. 7. The depth of milling will vary along the direction of the cross-feed due to this 
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difference in exposure time (or number of passes) resulting in “lay” on the surface. 
For a general case with a nondimensional overlap number ( O n ), the number of 
sweeps can be expressed mathematically as 

N p = INT(0„) + 1 (23) 

Assuming that the standoff distance does not affect the depth of milling per pass, 
the average depth of milling per sweep is derived as 
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The above equation can further be simplified by neglecting V c << V} for soft 
materials and setting Cf = 0, as the depth of milling per pass is very small. The 
resulting simplified equation is 


hav = [INT(0„) + 

TC dj u 



The cross-feed rate is implicitly expressed in the lateral feed increment in the 
above equation. 


Turning 

Waterjet turning is a relatively simple process where a workpiece is rotated while 
the AWJ is traversed axially and radially to produce the required turned surface 
(Hashish 1987). Figure 26 shows AWJ turning methods. Work on AWJ turning has 
addressed the volume removal rate, surface-finish control, visualization, and 
modeling of the turning process (Ansari et al. 1992; Henning 1999; Zeng 
et al. 1994; Manu and Babu 2008), and the development of a hybrid 
AWJ/mechanical lathe (Ansari et al. 1992). 

The depth of cut, which is determined from the radial jet position, is a critical 
parameter for process optimization. Unlike conventional turning, AWJ turning is 
less sensitive to the original part shape. For example, a highly irregular geometry 
can be turned in one pass with a relatively large depth of cut to a surface of 
revolution. Also, AWJ turning is not sensitive to the length-to-diameter ratio of 
the workpiece. Long- and small-diameter parts have been turned to precise dimen- 
sions. Underwater turning has also been demonstrated to significantly reduce the 
noise on the AWJ. 

A hybrid AWJ/mechanical lathe was built by modifying a conventional lathe to 
allow simultaneous turning with the AWJ and a solid tool, with the solid tool 
performing the finishing process. The AWJ nozzle can be mounted either on a 
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Fig. 26 AWJ turned and sliced parts 


separate manipulator for flexibility of pattern machining or on the tool carriage 
when accurate synchronization between the rotational and axial motion is required. 
The AWJ was used to produce a diameter 0.25 mm greater than the required 
diameter. The machined surface was simultaneously finished using a solid single- 
point tool immediately behind the AWJ. This approach was found to be most 
efficient to obtain surface finishes better than 5 pm. 

Turning Model 

In a recent study of turning with AWJs (Ansari et al. 1992), the following was 
found: 

• The material removal for the range of traverse speeds and depths commonly 
encountered in AWJ turning takes place at the “face,” rather than the circum- 
ference, of the workpiece (the face refers to the plane transverse to the rotational 
axis of the workpiece). 

• No significant deflection of the AWJ occurs in the radial direction at the material 
removal site. 

• The axial deflection of the AWJ changes cyclically. 

The problem of turning is to determine the final diameter (dy) as a function of 
AWJ and turning parameters. 

The relationship between the penetration depth and workpiece radius can be 
expressed (Ansari et al. 1992) as 
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The volume removal rate can also be written as 
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From the above equations, we get 
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Using a simplified form of Finnic’ s theory (Eq. 18) of erosion, the following 
expression results for h c where a becomes zero at the end of the shallow angle 
impact zone: 


h 


C 


2(2 r,S 


8 2 ) 1/2 he + 


Vjm 


a 


7T U(7f Iff K 


sin (2a t 


0 


(29) 


This equation is quadratic in h c and can be solved numerically. The angle a 
which is the deflection angle at the top of the cut, was derived as 
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Similar to linear cutting, a critical traverse rate ( u c ) should exist beyond which 
no steady-state turning can be achieved. This can be determined from the above 
equation by substituting a 0 instead of a t \ a 0 is an angle related to the material 
characteristics. 

For the case of deformation wear, whereby the steps are cut, the volume removal 
rate is 
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Using Eqs. 31 and 26, the volume removal rate can be written in terms of the 
penetration depth ( h ), which upon simplification becomes 
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For abrasive particles impacting the step in the kerf at angles of 90°, Bitter’s 
model (Eq. 17) under normal impact is expressed as 
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Fig. 27 Three-dimensional AWJ machining of complex geometry 





The above two equations can be used to yield the depth for deformation wear 
turning ( h d ) as 
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To determine the final radius (or diameter) of the specimen, the following 
equation is used: 



Multiprocess Machining 

AWJs can be used for prototyping by cutting layers in the actual material and stacking 
them to form the required prototype geometry. Joining techniques can also be assisted 
by the AWJ machining process by providing holes or prepared surfaces for bonding. 

Figure 27 shows the machining sequence used for a three-dimensional part made 
with an AWJ. This machining exercise demonstrates the flexibility of the AWJ 
process for cutting, turning, and drilling using the same setup and the same nozzle. 
A controller program was written to fully automate the machining process, which 
includes automatic quick-change nozzles. Also implemented was an intelligent 
manufacturing process involving parameter changes. 
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Hybrid Processes and Systems 

There are two approaches to using waterjets as a hybrid tool with other traditional or 
nontraditional tools. These approaches are a hybrid process or a hybrid system. 

Hybrid Process 

In this method, the material removal is modified either directly or indirectly by the 
waterjet. For example, the use of waterjets to assist drag bits was investigated by 
Hood (1977) in mining tools and tunneling equipment is a hybrid process. The 
waterjet may modify the cutting field by weakening the rock or creating free 
surfaces ahead of the tool tip such that less overall energy is consumed in 
the excavation. The waterjet may also act as a cooling tool to the otherwise 
excessively hot tool tip and thus elongate its service life and enhance its cutting 
performance. In jet-assisted machining, the performance of cutting tools and 
grinding wheels is enhanced by using waterjets in different ways: cooling tool 
tips or dressing grinding wheels. The cutting field itself is still primarily con- 
trolled by the solid tool process. The use of low-pressure waterjets to collimate 
laser beams (Iscoff 2003) is another example of an indirect hybrid process as the 
waterjet is not used to assist in the material removal process but rather as a guide 
to the laser. 

Hybrid Systems 

In this method, two or more tools are integrated in one system to achieve technical 
and/or overall productivity objectives. Some of the commercially available hybrid 
systems are: 

• Waterjet-mechanical: In these systems, a waterjet is used to perform certain 
functions, and the mechanical tools are used to either alter the produced surface 
or perform other operations in different locations on the workpiece. A waterjet 
composite machining center is used to trim composite structures with a waterjet 
tool and drill countersunk holes using special drills. Figure 28 shows a typical 
dual-mast hybrid waterjet-router system used for such parts as aircraft wings, tail 
fins, and some fuselage sections. In the stone and tile industry, a hybrid waterjet- 
saw machine is used; the waterjet is used to cut the interior shapes, and the saw is 
used on external straight-line cutting. Other machines may include edge 
finishing tools. 

• Waterjet-EDM: A wire EDM cuts faster if the wire is partially engaged with the 
surface. For example, the cutting rate may double or triple if the wire is used on 
the surface to remove a depth equals to only one wire radius. A hybrid waterjet- 
EDM system is used to cut the part with a waterjet to within one wire radius 
tolerance. The wire is then used to finish the part. Figure 29 shows a picture of a 
hybrid waterjet-EDM machine. 

• Waterjet- thermal methods: Waterjet band plasma has been combined on one 
machine. The plasma is used when accuracy or heat-affected zones are not 
critical. Waterjets are used to produce higher-quality edges or when the material 
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Fig. 28 Trim and drill 

system 



Fig. 29 Hybrid waterj et- 
wire EDM 



cannot be cut thermally. The two processes can be used on the same part to 
maximize productivity using special software that selects the waterj et or the 
plasma based on the edge specifications. 


Summary 

The waterjet was first introduced in 1970 for cutting soft materials such as plastics 
and tissue paper. To cut hard materials, the abrasive waterjet (AWJ) technology was 
introduced in the early 1980s. This has significantly widened the range of applica- 
tions and markets of the waterjet tool. Almost any material can be cut with AWJ 
such as stone, glass, composites, metal, carbides, and ceramics. Today, there are 
over 30 industries using the waterjet tool such as aircraft, jet engines, construction, 
job shop, signage, and architecture. 
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The basic platforms (subsystems) of a waterjet system are the ultrahigh-pressure 
(UHP) platform including pumps and plumbing and the cutting platform including 
cutting heads, abrasive feed, and cutting predictive models. These models are 
incorporated in the software platform and used integrally with CAD/CAM systems. 
The motion platform includes the manipulators required for cutting and automation. 
Finally, the ancillary platform includes other hardware or software to enhance the 
performance of the system or to meet specific customer requirements. 

There have been several advances in the above platforms over the past decade. 
Among the most notable advances are the introduction of UHP pumps for 650 MPa 
operation and the development of automated taper control wrists and software for 
precision 2D and 3D cutting. 

AWJs have been used for many applications such as cutting, trimming, turning, 
milling, and drilling. However, the majority of the applications today are related to 
cutting and trimming. In general, the use of AWJs for machining has many 
technical and economic advantages over existing methods. 
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Abstract 

There are three methods, namely, physical, chemical, and biological, that can be 
used for machining of metals. The physical and chemical methods have been 
widely applied. These processes require mechanical, thermal, electrical, or 
chemical energy to be concentrated at the machining point. Such machining 
methods could cause damage to metallurgical properties of the workpiece. 
Machining processes that use microorganisms to remove metal from a work- 
piece is known as process of biological machining (biomachining). Until 
recently, Acidithiobacillus ferrooxidans (At. ferrooxidans) and Acidithiobacillus 
thiooxidans (At. thiooxidans) were the microorganisms used mostly for machin- 
ing of metals in various studies. Along with these microorganisms, Staphylo- 
coccus sp. and Aspergillus niger (A. niger ) were also used in biological 
machining. The process of biological machining is advantageous over physical 
and chemical methods. In biological machining, microorganisms are used which 
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are easily available. These microorganisms can be produced continuously with 
low-energy consumption. Moreover, because metabolic processes of microor- 
ganisms are utilized, no damage or heat-affected zone is generated in the 
machined workpiece. Thus, a use of microorganisms for the micromachining 
of metals opens up the possibility of biological machining as an alternative to 
conventional metal processing methods. In addition, it is easy to control meta- 
bolic activities of microorganisms. Hence, it is possible to manufacture the 
machined part with desired surface finish. 


Introduction 

Machining is among the most ancient technique to shape a part. Rapid advancement 
in the field of manufacturing engineering has resulted in the fabrication of fine 
mechanical components that require micromachining (i.e., small metal removal 
volume) due to the challenges of decreasing dimensions of devices (Yang 
et al. 2009). The micromachining methods in use are the physical or chemical 
methods such as micro-milling, micro-EDM, and chemical etching (Ting 
et al. 2000). However, environmental issues are rapidly emerging as one of the 
most important topics in manufacturing (Istiyanto et al. 2012). Some of 
abovementioned machining processes require chemical or thermal energy to be 
concentrated at the machining point. Hence, these methods may create a damaged 
layer in the workpiece. Also they may potentially damage metallurgical properties 
of the workpiece. In chemical machining, hazardous materials (i.e., acids) are used 
which have safety issues as well as toxicity concerns for environment. Also as the 
process of machining proceeds, an example of chemical agents, ferric chloride, 
works too quickly at eroding the metal for fine depth control and becomes less and 
less efficient due to the buildup of dissolved ions, and ultimately, it ends up as a 
hazardous waste (Uno et al. 1996). As future devices may utilize multiple three- 
dimensional features rather than uniform-depth monolayers, a slower, more con- 
trolled process is useful. 

Recently, few researchers introduced various green strategies and techniques to 
overcome these problems. Biological machining is one of such candidates. The use 
of bacteria as the tool to remove metal from a workpiece, also known as 
biomachining, is a relatively new manufacturing technology with potential appli- 
cations in the construction of microscale features (Uno et al. 1996; Uno 2002; 
Kurosaki et al. 2003; Johnson et al. 2007; Hocheng et al. 2012a). These studies are 
inspired by developments of bio -hydrometallurgy, which is a relatively novel 
concept (Brierley and Brierley 2001). In this branch of science, various microor- 
ganisms are employed to recover metal from their respective ores (Suzuki 2001; 
Cui and Zhang 2008). The technique is gaining importance due to its wide com- 
mercial application (Brandi and Faramarzi 2006). It is also relatively free from 
environmental concerns unlike conventional hydro- and pyrometallurgical pro- 
cesses (Hoque and Philip 2011). 
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Also, conventional processes often demand significant investment of money. A 
new form of technology should be able to overcome such barriers in both econom- 
ical and eco-friendly ways. A use of microorganisms can be thought to be a suitable 
alternative over the conventional technologies for the recovery of metals. Micro- 
organisms can influence metal mobility by modifying their chemical and/or phys- 
ical characteristics. Microbes either get attached to the metal ions or transport them 
into the cell. The biosorption and bioaccumulation are the processes where the 
metal ion is exchanged for a counterion attached to biomass. In bioleaching, the 
microbes dissolve the metals present in the solid matrix into soluble form. Most of 
acidophilic microorganisms play key role for bioleaching process. These microbes 
are extensively used for mineral dissolution from ores. They are also termed as 
biomining microbes. The microbiological leaching is relatively inexpensive. Also 
the microbes can easily adapt to the variations of conditions (Gadd 2000). 

The chemolithotrophic acidophiles, namely, At. thiooxidans. At. ferrooxidans , 
and Lepto spirillum ferrooxidans, and heterotrophs like Sulfolobus are involved in 
bioleaching process. Penicillium and Aspergillus niger are also applied for 
bioleaching of metals (Hoque and Philip 2011). This technique is further applied 
for recovery of metals from waste materials. Studies have been carried out to 
recover metals such as Al, Ni, Zn, Cu, Cd, and Cr by using At. ferrooxidans as 
well as At. ferrooxidans and fungal species like Aspergillus niger (Zhang and Li 
1999; Lilova et al. 2007). Additionally, different waste sludge and river sediments 
have also examined for bioleaching study to recover metals (Lilova et al. 2007). In 
twenty -first century, electronic waste is emerging as a major industrial waste. 
Several reports have been published regarding the bioleaching of such waste 
recently (Brandi et al. 2001). Apart from E- waste, another major solid waste in 
the present society is the spent battery which comes from digital camera, cellular 
phone, or laptops. Pure and mixed cultures of acidophilic sulfur oxidizers and iron 
oxidizers have been applied for the recovery of metals from battery waste (Mishra 
et al. 2008; Xin et al. 2009a). Petrochemical industries use large quantity of 
catalysts for purification or upgrading of various petroleum streams or residues. 
Spent petroleum catalyst is the major solid waste from petrochemical industries and 
is known to have serious impact upon environment. A number of researches 
reported bioleaching of metals from the spent petroleum catalyst (Beolchini 
et al. 2010; Bosio et al. 2008). The acidophilic microorganisms that take part in 
dissolution of metals from the wastes are autotrophic in nature. They can grow in 
inorganic medium having low pH values. They can tolerate high metal ion concen- 
trations. Depending on their tolerance to temperature, acidophilic microorganisms 
are categorized into three subgroups as mesophiles, moderate thermoacidophiles, 
and extreme thermoacidophiles. Mesophiles are those microorganisms which grow 
at a prevailing room temperature, i.e., 28-37 °C. Among the mesophiles, the most 
popular and widely used strain is At. ferrooxidans. Although many strains of At. 
ferrooxidans have been isolated from different sources, most of the strains showed 
the following optimum growth conditions, i.e., pH 1.5-2. 5 and a temperature range 
of 28-37 °C. Moderate thermophiles are able to grow at a temperature of around 
50 °C. There are a number of thermophilic strains isolated from different 
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geothermal environments and mine sites (Kinnunen et al. 2003). An important 
moderate thermophile is Sulfobacillus thermo sulfidooxi dans which has an ability 
to oxidize sulfur and iron (Xia et al. 2010). Extreme thermophiles are those which 
can grow actively even at a temperature as high as 80 °C. The most important 
extreme thermophiles belong to the genus Sulfolobus (Jordan et al. 2006). A 
number of Sulfolobus species have been isolated such as S. acidocaldarius , 
S. solfataricus , S. brierley , and S. ambioalous. They show the following properties 
such as (i) anaerobic growth coupled with reduction of elemental sulfur, (ii) aerobic 
growth coupled with oxidation of sulfur, (iii) optimum growth temperature of 
65-70 °C, and (iv) able to oxidize both Fe(II) to Fe(III) and sulfur to sulfate. 
Since an extreme thermophile can grow at higher temperature, so oxidation kinetics 
is stronger than mesophiles and moderate thermophiles (Norris et al. 2000). 

The knowledge gained in these industries helped in its translation to 
bio-micromanufacturing of parts (Yang et al. 2009). The main advantage in the 
biomachining technique is an ease of operation as well as limited use of process 
controls, thus making an operation more user-friendly (Ting et al. 2000). In 
addition, a use of microorganisms consumes low energy, which reduces the cost 
of process. The process is carried out in close loop generating minimum effluents. 
The microorganisms used can be cultured continuously. Thus, this process is 
preferred as green technology (Johnson et al. 2007; Zhang and Fi 1999; Filova 
et al. 2007). 


Process of Biological Machining 
Microorganisms Used and Process Principles 

Every organism can be classified based on how it obtains its energy to run all 
metabolic processes for growth. Those organisms that obtain carbon from inorganic 
compounds (including carbon dioxide) are known as lithotrophs. On the other hand, 
organotrophs obtain all the carbon they use from organic compounds, including 
those from other organisms. Each of these groups can be further subdivided based 
on how included organisms obtain energy. The chemolithotrophs typically process 
inorganic matter. The chemoorganotrophs process organic matter for their growth. 
Many animals (including humans) and bacteria are classified as chemoorga- 
notrophs. The organisms that use light as an energy source are called as 
photo trophic organisms. Plants are the most common examples. A summary of 
the classifications, species, is displayed in Fig. 1. 

Organisms fitting into the category of chemolithotrophs have been the subject of 
intensive research lately for their unique ability to either oxidize or reduce certain 
inorganic compounds, especially heavy metals. Such organisms have been applied 
for bioremediation process. The natural reduction reactions of the Shewanella 
oneidensis have been used for removal of toxic chemicals from the groundwater 
(Viamajala et al. 2003). Species such as Geobacter metallireducens and 
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Orgaiiotroph 



Fig. 1 A summary of the classification of various organisms based on energy requirements 


Rhodoferax ferrireducens have shown potential application for biologically based 
fuel cells (Bond et al. 2002; Chaudhuri and Lovley 2003). 

It is a need of time that the biological processes have to be integrated into future 
industrial processes. This is important to meet environmental regulations and to 
establish eco-friendly, energy-saving processes. Hence, such processes are charac- 
terized by an application of biocatalysts (e.g., microorganisms, enzymes) in an 
industrial process and the substitution of existing processes (Liang et al. 2010). It 
was reported that certain microorganisms can dissolve metal compounds (Wang 
et al. 2009). At. ferrooxidans and At. thiooxidans are the principal microorganisms 
involved in the above process. These bacteria can grow by obtaining energy from 

o ■ n 

oxidation of ferrous iron (Fe~ ) or elemental sulfur (S ) and thereby carrying metal 
solubilization (Ubaldini et al. 2003; Bosecker 1997). Such metal bioleaching 
technique has been used in biomining on industrial scale (Ewart and Hughes 
1991; Rawlings and Silver 1995). Moreover, these bacteria were also used to 
recycle metals from various wastes (Wang et al. 2009; Zhao et al. 2008; Xin 
et al. 2009b; Ishigaki et al. 2005; Vestola et al. 2010). This metal-extraction 
property of microorganism was exploited for machining of metals. At. ferrooxidans 
was the species used in the preliminary biomachining work (Uno et al. 1996; Uno 
2002). Uno et al. (1996; Uno 2002) applied three At. ferrooxidans strains (ATCC 
13598, 13661, and 33020) for biomachining of copper and iron. They used basal 
9 K medium for growth of At. ferrooxidans. The composition for medium was 3.0 g 
(NH 4 ) 2 S0 4 , 0.5 g K 2 HP0 4 , 0.5 g MgS0 4 .7H 2 0, 0.1 g KC1, and 0.01 g Ca(N0 3 ) 2 in 
1 1 distilled water. The pH was adjusted to 2.5 with 1.0 % sulfuric acid and sterilized 
by autoclave. The FeS0 4 is used as energy source for growth of At. ferrooxidans 
(Uno et al. 1996). In another study, Hocheng et al. (2012a) used 510 medium for 
growth of At. ferrooxidans. The basal 510 medium composed of solutions A and 
B. Solution A contains the following substances per 800 ml of glass-distilled water: 
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0.8 g (NH 4 ) 2 S0 4 , 2.0 g MgS0 4 .7H 2 0, 0.4 g K 2 HP0 4 , and 5 ml of 0.22 mm filter- 
sterilized Wolfe’s mineral solution. Wolfe’s mineral solution contains the follow- 
ing substances: 1.5 g nitrilotri acetic acid, 3.0 g MgS0 4 .7H 2 0, 0.5 g MnS0 4 .H 2 0, 
1.0 g NaCl, 0.1 g FeS0 4 .7H 2 0, 0.1 g CoC 1 2 .6H 2 0, 0.1 g CaCl 2 , 0.1 g 
ZnS0 4 .7H 2 0, 0.01 g CuS0 4 .5H 2 0, 0.01 g A1K(S0 4 ) 2 .12H 2 0, 0.01 g H 3 B0 3 , 
and 0.01 g Na 2 Mo0 4 .2H 2 0 in 1 1 distilled water. The pH was adjusted to 2.5 
with 1.0 % (weight/volume) sulfuric acid and autoclaved for sterilization. The 
presterilized solution B was added to solution A. The freshly prepared solution B 
contained 20.0 g FeS0 4 .7H 2 0 per 200 ml of glass-distilled water (Hocheng 
et al. 2012a). Chang et al. (2008) and Hocheng et al. (2012b) used At. thiooxidans 
for the biomachining of metals. They used basal 317 medium for growth of 
bacteria. The basal 317 medium contained the following per liter of glass-distilled 
water: 0.3 g (NH 4 ) 2 S0 4 , 3.5 g K 2 HP0 4 , 0.5 g MgS0 4 .7H 2 0, and 0.25 g CaCl 2 . The 
pH was adjusted to 4.5 with sulfuric acid. 1.0 % (weight/volume) elemental sulfur 
was presterilized and added to 317 medium for growth and maintenance of At. 
thiooxidans. Shikata et al. (2009) used Staphylococcus spp. for biomachining pro- 
cess. Staphylococcus spp. will grow on any media. They are salt tolerant, as most 
organisms are killed by high NaCl. “Salt agar” is 7-10 % salt containing broth and 
agar or salt cooked meat broth. Shikata et al. (2009) revealed the possibility of using 
bacteria to drill metallic surfaces (Shikata et al. 2009). Many species of bacteria are 
reported to be involved in the corrosion of copper. They used Staphylococcus sp. for 
biomachining experiments. Staphylococcus sp. is a facultative anaerobic bacterium. 
They isolated it from corroded copper piping installed in groundwater environment in 
Japan. An experiment involved exposure of copper coupons (25 mm x 15 mm x 
3 mm) to a culture of Staphylococcus sp. for a maximum period of 7 days. The total 
pit area and volume on these coupons were determined using image analysis. They 
showed that both the biomachined area and volume increased with the duration of 
coupon exposure. In the drilling experiment, a copper thin film (2 pm thick) was 
perforated by this bacterium within a period of 7 days (Shikata et al. 2009). 

There are two main principles behind the biomachining process. They are 
oxidation and acidolysis. At. ferrooxidans use mainly oxidation process, while At. 
thiooxidans use acidolysis process for metal removal. Again an oxidation process 
can be further divided into direct and indirect oxidation of metals (Uno et al. 1996; 
Rohwerder et al. 2003; Istiyanto et al. 2011). 

Oxidation Process 

The direct mechanism is based on an attachment of bacterial cell on metal surface 
(e.g., copper) (Fig. 2). At. ferrooxidans contains iron-oxidizing enzyme in the 
periplasmic space and inner membrane (Zhang and Li 1999; Istiyanto 

o - 

et al. 2011). The bacterial growth medium contains Fe^ which is the main energy 

o I 

source for At. ferrooxidans (Uno et al. 1996; Rohwerder et al. 2003). This Fe 
radical is then transported to the periplasmic space from the culture fluid. Here it 
loses an electron through the catalysis of iron oxidization. This electron is then 
transported to oxygen with an electron transport chain, through a set of mediating 
biochemical reactions (Istiyanto et al. 2011). The overall reaction is as follows: 
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Fig. 2 Biomachining mechanism (Uno et al. 1996; Rohwerder et al. 2003; Istiyanto et al. 2011) 

2Fe 2+ + 1/2 0 2 + 2H+ -*• 2Fe 3+ + H 2 0 (1) 

o i 

This reaction generates energy. Also during this reaction Fe^ is converted into 
Fe 3+ . This Fe 3+ is then expelled from the cell. The Fe 3+ is a strong oxidant. It has an 

n 9 1 

ability to oxidize pure copper (Cu ) into Cu . Hence, a metal workpiece can be 
machined by Fe 3+ that is produced by At. ferrooxidans: 

Cu° + 2Fe 3+ -> Cu 2+ + 2Fe 2+ (2) 

During the machining process the Fe produced by At. ferrooxidans is reduced 

o ■ o ■ o ■ 

to Fe . Again this Fe can enter At. ferrooxidans cell and get reoxidized to Fe by 
oxygen. Thus, a redox cycle can be formed (Uno et al. 1996; Zhang and Li 1999; 
Rohwerder et al. 2003; Istiyanto et al. 2011). 

Recently, it is found that an indirect mechanism of bioleaching of metals also 
works well during metal -extraction process. Liang et al. (2010) found that copper in 
printed circuit boards (PCBs) can be continuously leached out by applying an 

Q I Q ■ 

indirect mechanism. During this process a cycle between Fe and Fe is built. 
In a similar way, Wang et al. (2009) divided the metal dissolution process into two 
continuous periods. First, the dissolution of metals such as copper is due to an attack 
of printed wire boards (PWBs) by Fe . In the first stage of attack, Fe is reduced to 
Fe 2+ and causes the increase of Fe 2+ concentration in leaching solution. Then in 

o ■ 

second stage, bacteria use Fe^ as an energy source, which leads to a rapid increase 

Q Q ■ 

in bacterial population and the regeneration of Fe upon Fe oxidation. They 

o ■ Q ■ 

proposed that the role of bacteria is just to oxidize Fe into Fe . It is now known 
that this is applicable to biomachining process as well. The machining of copper, 
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nickel, and aluminum was experimentally investigated using both the cells and the 
culture supernatant of At. ferrooxidans. 

Higher metal removal rate was achieved for copper by using the culture super- 
natant (Hocheng et al. 2012c). This is done to find out actual mechanism behind 

o ■ 

biomachining process. It has been shown that an initial concentration of Fe 
decreased from 0.072 to 0.0024 M after 48 h incubation. At the same time 
concentration of Fe 3+ increased with time. Also an effect of cell number on 
conversion of Fe 2+ into Fe 3+ was studied. Optimization of process parameters for 

o ■ 

oxidation of Fe is an important aspect because the produced ferric ions play key 
role in the biomachining process. The results indicate that all the cell concentrations 

o ■ o , 

used were able to transform Fe into Fe but the rate by which the transformation 
occurred was different for each cell concentration. The cell concentration of 5 x 

o o o 

10 for copper, 2.0 x 10 for nickel, and 1.0 x 10 cells/ml for aluminum were 
found optimal for oxidation of Fe 2+ into Fe 3+ (Fig. 3). 

Acidolysis Process 

At. thiooxidans use acidolysis process for metal removal. During the growth At. 
thiooxidans produce sulfuric acid. This acid is further used for metal dissolution 
(Hocheng et al. 2012b). The overall reaction is as follows: 

S° + 1.5 0 2 + H 2 0 -► H 2 S0 4 (3) 

M + H 2 S0 4 + 0.5 0 2 -► MS0 4 + H 2 0 (4) 


where M is a metal. 

Material Removal Rate 

The key criterion to characterize any industrial machining process is metal removal 
rate. Uno et al. (1996) determined an amount of metal removed during machining 
process. They also studied the effects of the initial conditions on metal removal. 
They found that the removed depth was approximately proportional to machining 
time. For Cu, the material removal rate in depth was about 20 pm/h and for Fe was 
14 pm/h. The experimental conditions were a temperature of 28 °C and a shaking 
speed of 160 cycles per minute. They also run a control in parallel by using the 9 K 
medium. They showed that the bacteria are responsible for the material removal 
effect despite the corrosive effect of medium (Fig. 4). 

They incubated one strain (ATCC 13598) at various temperatures and observed 
the mean material removal rate. The peak material removal rate for Cu (~23 pm/h) 
is near 30 °C, while for Fe, the max material removal rate (~20 pm/h) occurs near 
40 °C (Fig. 5). 

The temperature proves to be a relatively key variable in the efficiency of 
biomachining. There is a need to control temperature to keep the material removal 
rate near its theoretical maximum. These authors also studied the effect of electric 
field on biomachining rate. They reported that the depth of grooves generated on an 
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Fig. 3 The change of ferric ion concentration during the removal of (a) copper, (b) nickel, and (c) 
aluminum at different cell concentrations (Hocheng et al. 2012c) 
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Fig. 4 Biomachining experiments on Fe (left) and Cu (right) (Uno et al. 1996) 


Fig. 5 Effect of temperature 
on biomachining (Uno 

et al. 1996) 



Tempera lure (1C) 


anodic workpiece was about twice than in a normal biomachining. Hence, an 
applied electric field has pronounced effect on biomachining (Uno et al. 1996). If 
positive applied voltages are used, the material removal rate is accelerated over 
baseline (Fig. 6). For iron, if the voltage applied is negative (0.5 V in this case), the 
material removal rate is actually negative, indicating the material is deposited. 
Thus, an applied voltage can have an accelerating effect on the biomachining 
rate. They found that there is a need of further research to determine if there is a 
maximum voltage which yields a benefit to the process. 

Zhang and Li (1998) showed that machining of pure iron, pure copper, and 
constantan was possible by using At. ferrooxidans. They biomachined a microgear 
and grooves on pure copper workpiece (Fig. 7). 

They found that the depth of the groove biomachined was directly dependent on 
the machining time. In addition to this, Zhang and Li (1998) explained the Fe 
oxidizing and reducing reactions involved. They also speculated about what parts of 
the cell (periplasmic space, cytochromes, etc.) are used in the reactions. They found 
that the biological recreation of Fe 3+ has a major role in the biomachining process. 
Zhang and Li (1999) also analyzed the kinetics and thermodynamics of 
biomachining of pure copper. They studied the kinetic effect of bacteria on the 
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Fig. 6 Effect of various applied voltages for Fe (left) and Cu (right) on biomachining (Uno 
et al. 1996) 


Fig. 7 SEM photograph of 
the microgear on a Cu 
workpiece (Zhang and Li 

1998) 



Q ■ Q ■ 

ion cycle between Fe~ and Fe . Also they studied the kinetics of each sub-reaction 
and how experimental concentrations affected each of these reaction rates. The first 
reaction relates the consumption of Fe 3+ to the mass change of a Cu sample. 
Equilibrium equations of generating and consuming Fe 3+ show that these quantities 

\ Q | 

are directly proportional to the Fe~ and Fe concentrations, respectively. They 
revealed the thermodynamic effect of hydrolysis on the changes of pH in 
biomachining process. In their study Zhang and Li (1999) found that biomachining 
rate depends directly on bacteria concentration. Also the necessary conditions for 
maintaining the stable equilibrium of the biomachining system were the supply of 

■ Q ■ 

H and the removal of Cu^ . Their study does not show the presence of any 
maximum bacterial concentration at which biomachining can occur. The bacterial 
count cannot increase indefinitely. Thus, their study does not explain how to 
overcome this problem. Also there is a need to keep the pH around 2 so that there 
will be no negative impact on the process. Finally, Zhang and Li (1999) concluded 
that the biomachining process slowed down due to the buildup of Cu ions in the 
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solution. They bubbled H 2 S gas into the solution to solve this problem. This forms 
precipitate of CuS, thus removing the Cu ions from solution. 

Ting et al. (2000) applied At. ferrooxidans (ATCC 13598 and ATCC 33020) for 
the biomachining of mild steel and copper (Ting et al. 2000). They did not find any 
difference in the mean removal rate of copper and mild steel between ATCC 13598 
and ATCC 33020. 

They showed that the mass removed increased proportionately with machining 
time and shaking speed. Like Uno et al. (1996) Ting et al. (2000) also found that an 
increase in temperature led to an increase in the mean removal rates. An optimum 
temperature for biomachining found between 30 °C and 35 °C. Ting et al. (2000) 
compared the mass removal rate and undercutting occurred during biomachining 
and chemical machining. They reported that the mass removal rate was signifi- 
cantly higher in chemical machining than biomachining. When the width of the 
machined groove extended into the coated region, an undercutting occurred. An 
undercutting was more severe in chemically machined workpieces (Fig. 8). It was 
also found that during biomachining, undercutting occurred to a greater extent for 
mild steel. Nonetheless, an undercutting was not as significant (compared with 
chemical machining), although an extent increased with machining time, as 
expected. They feel that there is a need of further research to control this effect 
during the biomachining process. Ting et al. (2000) also manufactured the micro- 
structure by using organic photoresistive materials to mask. In their experiments 
they were able to biomachine unmasked regions of the metals. They found that the 
final machined profile was similar to the coating image on the original metal 
(Fig. 9). Also an undesired leaching of the metal in the region under the masked 
area (termed undercutting) was not severely encountered during the biomachining 
process. 
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Fig. 9 Metal workpiece with organic photoresistive coatings (a) before biomachining and (b) 
after biomachining (Ting et al. 2000) 



(D Etcfcjnn brass @ Etching bronze ® Etc lung copper CD Etching mbiodean 


Fig. 10 Change in machining depth with time (Yang et al. 2009) 


Kumada et al. (2001) also used the microbially influenced corrosion for the fine 
biomachining of steel and copper. They found that loss in mass of SS 400 was larger 
than that of copper. Also the surface of SS 400 was rougher as compared to copper. 
Yang et al. (2009) used At. ferrooxidans in the machining process of metal 
components (Yang et al. 2009). They studied the change in machining depth with 
machining time and found linear relationship. An etching rate of brass is about 9.20 
pm/h; etching bronze is about 10.82 pm/h and etching copper is 13.62 pm/h 

(Fig. 10). 
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Fig. 11 Change in pH value 
during biomachining (Yang 

et al. 2009) 



Besides this the researchers studied the factors that influence the biomachining, 
including temperature, pH, kinds of the culture medium, etc. They stated that the 
pH is an important influencing factor during biomachining process. Yang 
et al. (2009) showed that pH value of etching solution increased gradually (before 
40 h) along with the etching process (Fig. 11). The growth of microorganism was 
affected by increase in the pH. This affected the machining process and the metals 
removal decreased gradually. To overcome this problem they used H 2 S0 4 to 
maintain the pH in desired range. 

Hocheng et al. (2012c) found that the SMRR was different for different metals. 
2.0, 1.6, and 0.55 mg/h.cnr SMRR were achieved for copper, nickel, and alumi- 
num, in 48, 49, and 41.5 h, respectively. Now it is also known that the material 
removal rate is slow and appears to decline over time. This happened because the 
bacteria are aerobic and the depletion of available oxygen might have slowed down 
the material removal rate. It is also known that the SMRR is independent on the cell 
concentration, except when the cell concentration is too high. It has been proved 
that during biomachining process, the metals are found dissolved by the chemical 
action of iron ions (Fe 3+ ) (Hocheng et al. 2012a). These primary iron ions are 
supplied by the bacteria At. ferrooxidans and are proposed to be oxidizing agents 
for the dissolution and removal of metals. Therefore, there is no absolute require- 
ment for the bacteria to attach to the surface of metals. Unattached bacteria are also 
able to catalyze mineral dissolution through a noncontact mechanism. The culture 
supernatant is an effective medium for metal removal. Hocheng et al. (2012c) found 
that large amount of metal removal is achieved by using culture supernatant as 
compared to bacterial cells. The SMRR obtained were 5.5, 4.2, and 0.7 mg/h.cnr in 
6 h for copper, nickel, and aluminum, respectively (Hocheng et al. 2012c). All these 
experimental results support an indirect mechanism during the process of 
biomachining. 


Surface Integrity 

Now the method of biomachining is well established. Various researchers success- 
fully fabricated parts by using biomachining (Yang et al. 2009; Ting et al. 2000; 
Hocheng et al. 2012a; Zhang and Li 1999). During a manufacturing process, the 
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Fig. 12 SEM micrographs of polished Cu surfaces before and after 48 h of bacteria machining; 
sample polished by (a) 320 grit SiC and (b) 600 nm diamond (Johnson et al. 2007) 


quality of the surface produced is a very important aspect. Every manufacturing 
process, however, has limitations regarding this characteristic. It is therefore 
important to explore the surface finish characteristic of the biomachining process 
(Istiyanto et al. 2011). Recently, many studies were carried out considering these 
objectives. Johnson et al. (2007) reported an increase in overall roughness after the 
exposure of workpiece to the bacteria. They stated that an initial surface roughness 
did not have a significant effect on the change in Ra. The researchers studied an 
effect of biomachining on surface roughness for 24 and 48 h. The change in Ra 
obtained for 24 h was 1.8-2. 6 pm while for 48 h 1. 7-2.4 pm. There was larger 
change in Ra corresponding to a higher initial Ra for the 24 h test. Even if the 
machining time was increased up to 48 h, the surface roughness increased slightly. 
They found that bacterial machining is not uniform on poly crystalline Cu. 

The surfaces became rougher due to such uneven material removal on the 
surface (Fig. 12). On the bacterially machined surfaces, grooves from polishing 
disappeared and grain boundaries and annealing twins are visible. Rather than 
preferentially attacking grain boundaries like the sterile media, seemingly random 
changes in topography appeared all over, not only at grain boundary regions. 
Scanning electron microscope (SEM) micrographs demonstrate that bacterial 
machining is anisotropic, and roughness measurements of the polycrystalline Cu 
samples showed a deterioration of Ra values of 1.5-2. 5 pm. 
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Fig. 13 Effect of different culture medium on surface roughness of the metal (Yang et al. 2009) 


Yang et al. (2009) studied the surface roughness effects during biomachining 
process. They found that the surface roughness increased rapidly with time, from an 
initial less than 0.2 pm to the final near to 2 pm (Fig. 13). They observed that in the 
first 2 h, the surface quality deteriorates rapidly and at end of the 2 h the surface 
roughness of the three kinds of materials reach the worst. After 2 h biomachining, 
the surface roughness no longer increased significantly, but fluctuated inside certain 
scope and then reached the stable stage (Fig. 13). 

Istiyanto et al. (2010) studied the changes in Ra for 6, 12, and 18 h of machining 
processes. They observed an increase in Ra from 6 to 12 h of machining was less 
than those from 12 to 18 h. These changes occurred with reference to initial Ra 
values before machining of about 0.4 pm. The randomized surface profile might be 
responsible for this effect. The longer machining time will result in a more 
randomized surface profile. They carried out a second experiment, in which the 
workpieces were polished using a 220-grit SiC abrasive wheel and had initial Ra 
values of about 0.54 pm. In this experiment also, the surface appearance signifi- 
cantly changed after biomachining. The straight-line pattern of the surface changed 
to random patterns after biomachining. In this experiment, the change of Ra also 
varied significantly after 12 h of machining time, similar to the behavior in the first 
experiment. The increase in Ra from 12 to 18 h of machining was higher than that 
from 6 to 12 h. An increase in machining time did not linearly affect the change of 
Ra. This explains the relationship between Ra and machining time. They also 
suggested that phenomenon of increasing Ra could be a limitation of the 
biomachining process, if one wants to conduct biomachining for fine surface 
roughness. 
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Fig. 14 Profile of copper 
made by biomachining before 
photoresist removal (Istiyanto 
et al. 2011) 
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Fig. 15 The effect of machining time on profile dimension of the groove (Istiyanto et al. 2011) 


Istiyanto et al. (2011) further studied the final machined part by using scanning 
electron micrograph and a noncontact 3-D microsurface profiler. They revealed that 
the profile form of copper was U-shaped (Fig. 14). Also they found that the width of 
the groove profile as well as the depth increased with time. An increase was 
relatively linear for both width and depth from 12 to 48 h. Thus, the required 
width and depth of the groove can be obtained by selecting an appropriate machin- 
ing time (Fig. 15). 

It is now known that the metal removal rate and surface roughness can be 
controlled by using various physicochemical parameters. In previous section it 

o ■ Q ■ 

has been discussed that At. ferrooxidans oxidizes Fe to Fe and derives energy 
required for its growth. The Fe 3+ generated by this process can be used for 
dissolution of metal. A study was carried out to explore an effect of Fe 3+ on 
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metal dissolution. An increasing concentration of Fe 3+ in culture supernatants was 
obtained by oxidation of increasing concentrations FeS0 4 . These culture superna- 
tants were used for machining of copper metal workpieces for 1 h. At 10 g/1 FeS0 4 
concentration, the SMRR was 15.91 mg/h.cnr with 1.07 pm surface roughness. 
With increase in FeS0 4 concentration, the SMRR increased linearly with rapid 
deterioration in surface quality. The 40.2 mg/h.cnr SMRR with 5.18 pm surface 
roughness was achieved by using 40 g/1 FeS0 4 . Therefore, the further experiment 
efforts were taken to control the quality of the surface produced by varying different 
parameters. Effect of shaking speed on biomachining was studied. The results for 
this study showed that the SMRR increased with an increase in shaking speed. For 
initial three shaking speeds, SMRR was very low. The SMRRs for 0, 50, and 

o 

100 rpm shaking speeds were 0.5, 1.88, and 4.96 mg/h.cnr, respectively. It is 
found that at 150 and 200 rpm shaking speed, the SMRR increased significantly. 
The surface roughness was found increased in parallel with increase in shaking 
speed from 0.38 pm (0 rpm) to 1.46 pm (200 rpm). Effect of volume of culture 
supernatant on biomachining was studied. The copper metal workpieces were 
covered with variable volume (50-200 ml) of culture supernatant, and the flasks 
were incubated at 150 rpm. It is found that the SMRR increased up to 100 ml and 
then decreased. The surface roughness also showed similar pattern. The parameters, 
volume of supernatant, and shaking speed showed pronounced effect on SMRR and 
surface roughness. To further control the surface roughness below 1.0 pm, the same 
study was carried out at 50 rpm. Again the SMRR increased up to 100 ml and then 
decreased, while the surface roughness decreased with increase in volume of 
supernatant at 50 rpm shaking speed. Large volume of the working medium 
facilitates the transport of ionic product from metal removal process off the 
machined surface. This effect saturates beyond a certain amount of the supplied 
volume. Effect of incubation temperature on biomachining was studied. Uno et al. 
(1996) showed that incubation temperature has pronounced effect on metal removal 
rate. Recently, it has been found that no such effect on SMRR and surface 
roughness was observed when culture supernatant was used for biomachining 
process. The 2.39 mg/h.cnr SMRR with 0.38 pm surface roughness was achieved 
by applying 10 g/1 FeS0 4 , 100 ml volume of culture supernatant, 50 rpm shaking 
speed and 30° C temperature. For all the temperatures studied, similar SMRR and 
surface roughness were obtained. This might be due to the use of culture superna- 
tant. In previous studies bacterial cells were used for biomachining. At. 
ferrooxidans growth was affected by incubation temperature which resulted in 
change in metal removal rate (Uno et al. 1996). In the culture supernatant the 
bacterial cells were not present during machining process. Therefore, no effect of 
incubation temperature on SMRR and surface roughness was found (Jadhav 
et al. 2013). 

While various researchers extensively used At. ferrooxidans for biomachining of 
metals, the potential of At. thiooxidans for biomachining remain unused until 
Chang et al. (2008) tried it for biomachining of metals. At. thiooxidansis an 
extremely acidophilic bacterium. It is remarkably tolerant of an acidic environment 
at a pH of 1 or below and thus differs from At. ferrooxidans (Chen and Lin 2004). 
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Fig. 16 SEM photograph of 
biomachined grooves (80 pm 
in depth and 45 pm in width) 
on a pure copper piece 
(Chang et al. 2008) 
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Fig. 17 Surface appearance of copper metal, (a) Control, (b) After oxidation with culture 
supernatant (Hocheng et al. 2012b) 


The metabolite, mainly sulfuric acid, plays a major role in bioleaching (Wong and 
Henry 1998). Chang et al. (2008) reported that biomachining of copper, nickel, and 
aluminum can be carried out by using At. thiooxidans (Chang et al. 2008). They 
found that nickel is the most readily solubilized metal, while aluminum was poorly 
solubilized. Also the cell concentration enhanced the machining rate. 

They put forth a two-dimensional machining process for metal removal from an 
exposed workpiece (Fig. 16). The machining depth was approximately 15 pm per 
day. An amount of undercut was very small and influences the degree of 
biomachining during an overall machining process. An aspect ratio under the 
biomachining condition was around 1.85. Further Hocheng et al. (2012b) showed 
that an indirect mechanism predominates during the biomachining of metals by 
using At. thiooxidans. An extracellular supernatant showed higher machining rate 
than the cells in solution. They studied aspect ratio in micropattem machining of 
copper and nickel and found that the behavior of aluminum in micromachining was 
unsatisfactory. 
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Fig. 18 Surface roughness of copper metal, (a) Control, (b) After oxidation with culture super- 
natant (Hocheng et al. 2012b) 


Authors tried to explore the surface finish characteristics during machining of 
copper by using At. thiooxidans culture supernatant. During an oxidation of copper, 
the changes in surface appearance and surface roughness were observed (Figs. 17 
and 18). 
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Fig. 19 Shim fabricated by biomachining method (Y ang et al. 2009) 



Fig. 20 Rectangular pattern writing by using culture supernatant (a) control workpiece without 
mask, (b) control workpiece with vacuum tape mask, (c) workpiece after 2 h treatment, and (d) 
workpiece after 4 h treatment (Hocheng et al. 2012a) 


Examples of Microsize Features 


The feasibility of biomachining processes in fabricating microsize features has been 
studied by various researchers. Yang et al. (2009) fabricated several shims with a 
thickness of 0. 1 mm and a diameter of only 2 mm and with a thickness of 0.07 mm 
and a diameter of 15 and 16 mm (Fig. 19). 

Hocheng et al. (2012a) applied indirect mechanism for the formation of 
patterned copper films (Fig. 20) 
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Fig. 21 Cutting rate of line 
feature (Istiyanto et al. 2012) 
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Istianto et al. (2012) investigated the feasibility of biomachining processes in 
fabricating microsize features on pure copper. They produced several complex 
features such as lines, circles, and rectangles, as well as combination features. 
The dimension of 25 pm for these circular-island and gear-shaped features was 
obtained. A small feature down to 3 pm was successfully manufactured by using 
single crystal copper. They biomachined the line feature after 12 h machining time. 
The widths of lines were about 110 pm spaced apart by about 300 pm. The lengths 
of the lines were 770, 850, and 920 pm. Rounded ends were produced. The cutting 
rate or etching rate (terms commonly used in chemical etching) was reduced with 
the increase of machining time (Fig. 21). 

In another experiment they developed a circular pattern (Fig. 22a). The circles of 
various diameters were perfectly developed after 24 h of machining. 

The diameters of 245, 348, and 450 pm were obtained corresponded to the mask 
diameters of 200, 300, and 400 pm, respectively. They observed that the lateral 
cutting rates were 0.96, 1, and 1.04 pm/h, respectively. These cutting rates did not 
significantly increase as the diameter increased. They used a 3D microsurface 
profiler to measure the depth. The mean maximum depth for all circles was 
48 pm, equivalent to a depth formed at 2 pm/h cutting rate. The vertical rate of 
the circular feature was similar to that of the line feature for the same machining 
time. However, the lateral rate of the circular feature was slightly lower than that of 
the line feature. Istianto et al. (2012) also fabricated different size rectangular 
features (200^1-50 pm) by using biomachining (Fig. 22b). They found that inside 
surfaces of the features were flat. These authors also showed that other combination 
features can be produced by using the biomachining process (Fig. 23). 

Figure 23a represents the multiline feature and Fig. 23b represents the 
multicircular feature, the so-called circular-island feature, whose island diameter 
was 25 pm. The gear-shaped island feature, which is a more complex feature, with 
tooth depth of 45 pm, was obtained, as shown in Fig. 24. 
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Fig. 22 SEM micrographs of 
(a) circular and (b) 
rectangular features (Istiyanto 
et al. 2012) 




Abovementioned biomachining studies clearly show an application of 
microorganisms in machining of various metals and in fabrication of machined 
parts. Biomachining is based on the natural ability of microorganisms to 
transform solid compounds to a soluble form. This involves an attack on the 
solid compound by metabolic products of microorganisms. The main thrust 
of work has been done with sulfur-oxidizing microorganisms, such as 
At. thiooxidans and At. ferrooxidans. In case of At. ferrooxidans , Fe 3+ acts as 
a metabolic product, while in case of At. thiooxidans , sulfuric acid acts as a 
metabolic product. It is now known that organic acids produced by heterotrophic 
fungi can be used for biomachining of various metals (Jadhav and Hocheng 
2014). Hence, biomachining approaches are generally considered a “green 
technology” with low-cost and low-energy requirement. 
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Fig. 23 SEM micrographs of 
(a) multifeature and (b) 
circular-island feature made 
by a 24 h biomachining 
process (Istiyanto et al. 2012) 
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Summary 

Alternative micromachining processes are continually sought to obtain the best results 
with specific advantages. The process of biological machining is one of the efforts. 
Instead of using highly toxic chemicals to etch the metals, bacteria can be used to 
produce the same effect at a slower rate and at lower temperatures, all for very low 
energy input and cost. Several microbial species have been identified that can extract 
specific metals from their ores. These microorganisms are able to oxidize and reduce 
metals as part of their energy production cycle. The process of biological machining 
uses the potential of these microorganisms. Bacteria of the genera Thiobacillus are 
recognized to be among the most important microorganisms in the process of biolog- 
ical machining. At. ferrooxidans and At. thiooxidans are the bacteria from this genera 
which have been used extensively for biological machining. Also recently, the Staph- 
ylococcus sp. has been applied for biological machining. These bacteria produce 
certain products during their metabolism. The metabolic products then can be used 
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Fig. 24 Gear-shaped feature 
produced by biomachining 
process, (a) Top view, (b) 3D 
view (Istiyanto et al. 2012) 



for the process of biological machining. Along with bacteria metabolic products of 
fungi can also be applied for machining process. Application of microorganisms for 
machining of metals makes the machining process cleaner which is very important for 
environment. Also it opens a new paradigm in manufacturing. If this process can be 
properly understood, controlled, and even accelerated, it holds a huge promise as a 
commercially viable alternative to chemical and other machining methods. 
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Abstract 

Mechanism is made up of links and kinematic pair joints which move in the 
three-dimensional space. Many physical objects are considered as rigid bodies 
for the convenience of theoretical analysis. For rigid body motions, the repre- 
sentations of rigid body rotation have a wide range of approaches including the 
representations from the directional cosine matrix to the exponential coordi- 
nates. The kinematics of rigid body is the motion analysis without considering 
any external forces acting on a rigid body, which is actually the foundation of the 
dynamics of rigid bodies. This chapter firstly presents the representation method 
of the position and orientation of rigid bodies using both the algebraic and the 
geometric methods. Then, an example of a SCARA robot is given to show the 
applications of the basic theoretical tools for rigid body motions. The purpose of 
this chapter is to provide the basic mathematical tools and the main results for 
the kinematics of rigid body though many objects may have elastic deformation 
in practical engineering problems. 


Introduction 

A rigid body is an ideal object which does not deform or change shape during 
motions; more specifically, the distance between any two points on the object 
preserves a fixed value during motion. Although in most practical engineering 
problem, the objects are elastic with deformations subjected to external forces, 
when the deformation is sufficiently small, it can be treated as a rigid body for the 
simplification of motion analysis. 

Kinematics focuses on the kinematic geometry of a mass point or a rigid body. 
Thus, the classical geometric method and theory play an important role on the 
development of kinematics. Moreover, it is more convenient to perform the mathe- 
matical analysis, especially the quantitative analysis, when the geometric description 
of rigid body motion is transformed into the algebraic or the analytical form. Among 
the mathematical theories for the theoretical kinematics, the linear algebra and the 
matrix theory are the most common approaches for the description of rigid motions. 
In recent years, the screw theory, Lie group, Lie algebra, and Clifford algebra receive 
more attentions and become popular tools for the analysis of rigid body motion. 

Rigid body motions are made up of translations and rotations. In order to 
describe positions and orientations, a certain mathematical method should be 
adopted to construct the motion model. A transformation is usually thought of in 
the form of a homogeneous transform with the rotation matrix and the position 
vector information. The rotation matrix can be represented by a directional cosine 
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matrix, which is the common approach for the motion description. In 1983, Brocket 
first introduced the exponential mapping to the robotic kinematics (Brockett 1984). 
In the years after, Murray et al. (1994), Herve (1999), Selig (1996) and McCarthy 
(1990) investigated the applications of the Lie group and Lie algebra in the 
mechanism of robotics. Earlier in 1900, Ball finished the book of “A Treatise on 
the Theory of Screws,” which used the screw theory to discuss the kinematics of 
rigid body (Ball 1998). Hunt (1990) gave the further discussion about the screw 
theory for the kinematics in the book “Kinematic Geometry of Mechanisms.” 

The pursuit of this topic presents the kinematics of rigid body especially 
focusing on the kinematics relative to mathematical foundation. Euler’s theory 
for rigid body rotation is first presented in section “Euler’s Theorem,” followed 
by the spatial descriptions and the rigid body transformation with the directional 
cosine matrix in section “Spatial Description and Rigid Body Transformation.” 
From sections “Quaternion for Rigid Body Motion,” “Screw and Rigid Motion,” 
and “Exponential Coordinates for Rigid Motion,” the quaternions, the exponential 
coordinates and the screw motion-based rigid body descriptions are introduced. 
Finally, a kinematic example of a four-degree-of-freedom robot is analyzed by the 
above mentioned methods in section “Application in Robotic Forward 
Kinematics.” 


Euler's Theorem 


Euler’s theorem is for the motion of a rigid body that has a fixed point in the three- 
dimensional space, which states that the general displacement of a rigid body with a 
fixed point is equivalent to a rotation of the body about an axis passing through that 
fixed point. If more than two rotations happen, one can be combined with another 
rotation, and two rotations can be further reduced into one rotation. Suppose an 
arbitrary fixed point O of a rigid body undergoing an arbitrary motion at time t is 
selected as shown in Fig. 1. The rotation angle at the interval from t to t + dt is 


Fig. i A rotating rigid body 
with a fixed axis 
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Fig. 2 Point A is moving 
relative to O with angular 
velocity co 



denoted by dO around the fixed axis. Thus, at any time t, the rigid body’s rotation 
rate is its angular velocity: 

, w do 

i”( ,) i =*■ a> 

The axis around which the rigid body rotates is defined as the instantaneous axis 
of rotation. The angular vector, denoted by co, specifies both the directions of the 
instantaneous axis of rotation and the angular velocity shown in Fig. 1. The angular 
velocity is a vector and its direction is defined to be parallel to the instantaneous 
axis of rotation. When dt approaches to zero, the rigid body motion around the fixed 
point can be seen as the composite motion around a series of instantaneous axes of 
rotation. As the direction of the instantaneous axis of rotation changes in the three- 
dimensional space, the locus of the axes forms a fixed space cone in the universe 
reference frame or the primary reference frame, which is named as instantaneous 
axode or static cone. At the same time, the instantaneous axis of rotation also 
generates a cone in the rotating reference frame attached to the rigid body, which is 
named as dynamic axode or dynamic cone. It is also a cone with the peak O. The 
instantaneous axode is fixed corresponding to the universe reference frame. The 
dynamic axode is attached to the rigid body and is static in the rotating reference 
frame. When the fixed point motion occurs, these two cones meet along the 
instantaneous axis of rotation. The dynamic axode rolls either on the inside or the 
outside of the instantaneous axode. 

From Fig. 1, the orientation changing of the rigid body can be taken as rotating 
d6 around the instantaneous axis OC. Once the angular velocity vector co is defined, 
suppose that A is a point of rigid body with angular velocity co as shown in Fig. 2, 
the velocity of A relative to point O at any instantaneous time is defined by 

dr 

V ~dt ’ (2) 

= co x r 

where r is a vector from the origin O on the axis of rotation to the point of A. 
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Fig. 3 A relative motion of a 
point on a rigid body 



The point A is moving with a circular path of radius Irlsin/? corresponding to the 
fixed point O, where fi is the angle between the vectors r and co as shown in Fig. 2. 
Therefore, the velocity vector is defined as a cross product between the angular 
velocity co and the position r. The direction of the relative velocity v is perpendic- 
ular to the plane formed by the position r and the angular velocity co. 

The magnitude of the velocity vector of point A relative to O is equal to the 
product of the radius of the path and the angular velocity of the rigid body: 


v 


sin/^H’ 



The direction of the velocity vector is tangent to the path of the motion or tangent 
to the circle with the radius Irlsin/?. 

The general motion of a rigid body is further considered in the three-dimensional 
space as shown in Fig. 3. If the velocity, acceleration, angular velocity, and the 
angular acceleration of the point O are known, the corresponding physical quanti- 
ties of point A can be derived according to those of point O. The coordinate 
transformation system is located at point O with the angular velocity zero. For 
obtaining the relative motion, the position vector of point A is 


Y A 


o 


+ r A / 


o • 


( 4 ) 


Taking the derivative of above equation with respect to time t yields the 
following relative velocity: 


va 


V 


O 


+ v Alo- 


is) 


The velocity of A of a rigid body relative to the universe reference frame { C } is 
the sum of the velocity of the position O relative to the universe reference frame 
{ C } and the velocity of the position A relative to the reference frame { O } . The 
velocity v A is used to describe the absolute motion. And the velocity v A/o is used to 
describe the relative motion. Using Eqs. 2 and 5, the relative velocity of the point 
A on a rigid body can be described by its angular velocity: 


v A = Vo +co x r A/o . 


( 6 ) 
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By taking the time derivative from both sides of Eq. 2, the acceleration of point 
A at any time instant is defined as 


dv 



= d) x r + a) x r 

— e x r -f w x v 
= e x v + co x (co x r), 



where e — cb = / dt is the angular acceleration that is determined by the 

magnitude and direction of &>. The direction of angular acceleration e is not 
along the instantaneous axis of rotation. An angular acceleration is defined as the 
time rate of change of an angular velocity. Since the angular velocity is a vector 
quantity, angular acceleration is also a vector quantity according to the definition of 
the derivative of a vector. From Eqs. 2 and 6, it can be seen that the velocities and 
the accelerations of all points on the rigid body are not equal since the position 
vector r is different. For the above cases, the rotation axis is fixed in the three- 
dimensional space. The first term in Eq. 7, e x r, is parallel to the velocity vector 
and tangent to the circular path of motion. This term is named as tangential 
acceleration: 


a T = e x r. 


The second term is called the normal acceleration, which is denoted by 



a n = co x v 

= co x (co x r). 

From Eq. 6, the acceleration of point A is given as 



a A = a 0 + e x r A/o + co x {co x r A/o ) . (10) 

The geometric approach is adopted to describe the fixed point motion. In order to 
determine the position of the rigid body in the primary reference frame, two 
noncollinear points A and B with a fixed point O are needed. 


Spatial Description and Rigid Body Transformation 

Rigid body motion means the distance between any two different points on the rigid 
body is fixed or a constant with time varying. From a mathematical point of view, 
the rigid body is defined as Murray et al. 1994 

I \p(t) -q{t ) II = I \p(t) -q{t) 


constant, 


(id 
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where p,q £ ¥ l are two different points attached to a rigid body, t denotes the time 
of rigid body motion, and ||-|| denotes the distance operation. Rigid body moves 
from one position to another one along a continuous path, which is named as rigid 
body displacement including the translation and the rotation. Thus, a representa- 
tion method to describe the position, the orientation, the translation, and the 
velocity is needed in the three-dimensional space. Moreover, descriptions of the 
mechanism or the objects with their relations in the space should be given in order 
to perform the motion analysis, which leads to the coordinate transformation 
between different reference frames. This section gives the approach based on 
linear algebra, which combines the motion, transformation, and mapping with 
the matrix operation. 


Position and Orientation 


In the three-dimensional space, a universe reference frame or a primary reference 
frame (Craig 2005) is defined and fixed for a rigid body motion analysis. Other 
coordinate systems are defined relative to the universe coordinate system for 
representing the position and the orientation of a rigid body. It should be noted 
that the universe coordinate system is fixed in which every rigid body being 
analyzed can be represented. The most common coordinate system is the Cartesian 
coordinates, which is a special reference frame with three orthogonal unit vectors. 
The letters x OJ y 0 and z 0 in the universe coordinate frame { O } denote the Cartesian 
coordinates as shown in Fig. 4. 

Once a universe coordinate frame is defined, the position and the orientation of 
rigid bodies can be given accordingly. In the universe coordinate frame {O}, the 
position of any point p relative to the origin O is expressed as a 3 x 1 column vector 
with 



o 


The superscript O in the position vector p represents the universe coordinate 
frame {O}, and p x , p y and p z are the coordinates in the frame { O } , respectively. 


Fig. 4 A universe reference 
frame { O } 




O 
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Fig. 5 Frame attached to a 
rigid body 



The orientation of a rigid body can be represented with a reference frame which 
is attached to the rigid body itself. A body coordinate frame {A} is attached to the 
rigid body which is different from the universe coordinate frame { O } as shown in 
Fig. 5. The three unit vectors of the body coordinate frame {N} are x N , y N , and z N 
which are written with respect to the coordinate frame { O } with 


o o 

N ^ 


°x °y °z 

a N yN Z N 


O 


r \ l r 12 r 13 

f 21 F 22 F 23 

T31 f 32 ? 33 


(13) 


where is the rotation matrix and denotes the orientation of the body coordinate 
frame {N} in the universe coordinate frame {O}. The rotation matrix R has the 
following properties: 


RR 


T 


I 


3x3 


R r R = I 


3x3 


(14) 


R -1 = R r ,det(R) = 1, (15) 

where the superscript T represents the transpose of a matrix and I 3x3 is the unit 
matrix. The rotation matrix has the orthogonal property from Eq. 14. Another 
important characteristic of the rotation matrix is that the determinant is equal to 
+1 for the right-handed coordinate system. The determinant of the rotation matrix is 
equal to — 1 if the coordinate system is left-handed system. Using the properties in 
Eqs. 14 and 15, the relation between the universe coordinate frame and the body 
coordinate frame can be determined. 

To summarize, the position and orientation of a rigid body determine the 
location of the rigid body in the three-dimensional space. In the universe coordinate 
frame { O } , the position and orientation of a rigid body are given as 
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Fig. 6 Translation 
transformation 



m = ixp 0 }- 



o 


o 


o 


The attached body coordinate frame is defined by N R and p , where p is the 


corresponding position vector. 


Rigid Body Transformation 


Any point in the three-dimensional space has different representations in different 
reference frames. When the rotation matrix is the identity matrix, the two coordi- 
nate systems have the same orientations, that is, the pure translation transformation 
occurs as shown in Fig. 6. Since the two frames have different origins, the 
description of the origin of {A} relative to the universe coordinate frame {0} is 
PNorg- The position of the point p in the universe coordinate frame denotes p ° , while 
its position in the body coordinate system {N} denotes p N . 

Thus, from Fig. 6, the position of the point p in the universe coordinate frame is 

P°=P N +Pl s - (17) 

Equation 17 is the translation equation. 

As shown in Fig. 7, different from the translation equation, when two coordinate 
frames have the same origins instead of the same orientations, any point p in two 
different coordinate frames has the following relation: 

p°=°Rp N . (18) 

In Figs. 6 and 7, the same point p in different coordinate frames is denoted as p° 
and p N , respectively, with 
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Fig. 7 Rotation 
transformation 




Fig. 8 Geometric representation of rotation matrix 





From Eq. 18, the coordinate point p° can be represented by the vector p N using 
the rotation matrix 


p x = cos a n p u + cos a u p v + cos a u p w 

p y = cos ci 2 \p u + cos a 22 p v + cos a 22 p w , (20) 

p z = cosa 2 ip u + cos a 32 p v + cos a 33 p w 

where a t j is the angle between the ith coordinate axis vector in the universe 
coordinate frame and the jth coordinate axis vector in the body coordinate frame 
fixed on the rigid body with i — 1, 2, 3 and j — 1, 2, 3 as shown in Fig. 8. 
According to Eqs. (18) and (20), the rotation matrix is defined by 


Op 


COS Cl\\ 

cos a 2 \ 

COS <231 


COS <2i2 
COS <222 

COS <232 


COS <2 13 
COS <223 
COS <233 



From the discussion above, the inverse transformation from the universe coor- 
dinate frame to the body coordinate frame can also be written in matrix notation as 
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Three rotation transformations are often used for a right-handed coordinate 
frame with a, /?, and y angles rotated about the x Q , y a , and z 0 axes, respectively: 





cos a 
sin a 


— sin a 
cos a 


9 



cos P 0 sin p 

0 1 0 

— sin p 0 cos P 



cos y 
siny 
0 


— sin y 
cosy 
0 






When a coordinate frame is rotated about an axis, the diagonal element in the 
rotation matrix corresponding to that axis is equal to unity, and other elements in the 
corresponding row and column are zero. For composite rotation motions, the 
transformation is given by 


r' = Ri r and r" = R^, 



which yields 


r" = R 2 Rir, (27) 

where Ri and R 2 are two rotation matrices. It should be noted that Eq. 27 means that 
when the rigid body rotates twice, the composite rotation matrix is equal to the 
product of the two rotation matrices. However, the multiplication sequence cannot 
be exchanged since the matrix multiplication is not commutative. Therefore, the 
rigid body rotation with a finite angle displacement cannot be permuted. 

For a more general situation, the origins of the two coordinate systems are not 
superposition and the orientations are also different as shown in Fig. 9. The 
following transformation holds for describing the positions of p N and p° in the 
coordinate systems {N} and [0)\ 

p° = °Rp N + p° ore . (28) 

The rotation matrix ^R is used to describe the orientation of {TV} with respect to 
{ O } , and the p° 0 r g represents the position of origin of {N} in the universe coordi- 
nate frame { O } . Equation 28 is a composite transform including the rotation and 
translation between the two coordinate frames. 
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Fig. 9 General coordinates 
transformation 



The distances between any two points on a rigid body are constant for a rigid 
body motion, which is not a necessary condition when the internal reflections are 
considered. Therefore, a more rigorous mathematical definition of the rigid body 

o o 

transformation is given. A mapping g: is a rigid body transformation 

when the following properties are satisfied (Murray et al. 1994): 


(a) Length is preserved: || g(p) — g(#)|| = \\p — q 

(b) The cross product is preserved: g*(v x w) = 

W e fR 3 . 


O 

for all points p, q £ 9t . 
g*(v) x g*(w) for all vectors v, 


In condition (b), g* is vector transform related to the rigid body displacement 
g*(v) = g(q) — g(p ). The cross product between two vectors in the rigid body 
eliminates the possibility of reflection. A rotation matrix satisfies the properties of 
the rigid body transformation, so the rotation motion is a rigid body transformation. 


Euler Angles Based Rotation 


The Euler angles y/, 0, and cp are one of the most commonly used attitude repre- 
sentations of a rotation rigid body. As discussed in (O’Reilly 2008; Cheng and 
Gupta 1989), the representation of Euler angles can be dated to the works by Euler 
in 1751. The Euler angles show a better way to describe the rotation motion of rigid 
body. It is simpler than a directional cosine matrix which uses nine parameters to 
describe the rotation motion. The reference frames are defined in Fig. 10. Let {0} 
with X, Y, and Z axes be a universe coordinate frame, and let {N} with U, V , and 
IT be a rotating body coordinate frame attached to the rigid body. There are many 
options to attach a body frame to a rigid body. However, the coordinate axes are 
typically aligned with the coordinate axis of the primary coordinate frame. 
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Fig. 10 Universe and 
rotating body coordinate 
frames 



Table 1 Three types of Euler angle systems (Fu et al. 1988) 



Euler angle system I 

Euler angle system II 

Euler angle system III 

Sequence of rotations 

yr about OZ axis 

y/ about OZ axis 

cp about OX axis 


0 about OU axis 

6 about OV axis 

6 about OY axis 


cp about OW axis 

cp about OW axis 

yr about OZ axis 


There are three types of Euler angle representations which are widely used and 
shown in Table 1 (Fu et al. 1988). Note that the sequential rotations provided in 
Table 1 yield instantaneous geometric attitude for the universe coordinate frame 
OXYZ and the Euler angles are time varying in a general coordinate transformation. 

The Euler angle system I is usually associated with the gyroscopic motion. The 
rotation matrix associated with the Euler angle system I is given by 



R 


z,w ' R « 

, e • 











cos#a 

— sint/A 

0 " 


"1 

0 

0 " 


COS (p 

— sin#? 

0 

sini/r 

cos y/ 

0 


0 

cos 6 - 

sin 6 


sin#? 

cos#? 

0 

0 

0 

1 


0 

sin# 

cos 6 



0 

0 

1 

cy/ccp — 

sy/cOsrp 

—cy/scp 

- sy/cOccp 

sy/sO 




sy/crp + cy/cOscp 

sy/scp + cy/cOccp 

—cyrsO 





sOscp 




sOccp 

cQ 






where the symbol c denotes cosine function and s denotes sine function. In a certain 
condition, the rigid body orientation can be uniquely described by a set of Euler 
angles. It should be noted that the changing of one of the Euler angles does not affect 
other Euler angles’ values, which is a characteristic of Euler angle representation. 
Any rigid body orientation can be obtained by three sequence composite rotations. 
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The rotation matrix associated with Euler angle system II is given as 


R — Rz, i// • Rv, o ■ Rw, cp 

cos cp 
sin cp 

0 

cy/ cOccp — sy/scp —cy/cOscp — sy/ccp cy/sO 

sy/cOccp + cxi/scp —sy/cOscp + cy/ccp sy/sO 

— sOccp sOccp c6 


COSl/A 

— sin i/z 

0 


COS0 

0 

sin# 


sin y/ 

COSl/A 

0 


0 

1 

0 


0 

0 

1 


— sin# 

0 

COS0 



sin<p 

cos cp 
0 




The Euler angles in system III are roll, pitch and yaw, which rotate around 
the principal axes of the primary coordinate frame OXYZ. A roll angle cp around 
OX axis, a pitch angle cj) around OY axis, and a yaw angle y/ around OZ 
axis are named and the rotation matrix associated with Euler angle system III is 
given by 



Rz, xp • Ry, o * Rx, cp 






COSl/A 

— sini/A 

0" 



cos 6 0 

sin# 

sini/A 

COSl/A 

0 



0 1 

0 

0 

0 

1 



- sin 6 0 

cos 6 

cyrcO 

cyrsOscp - 

sy/ccp 

cyrsOccp - 

- sy/scp 

sy/cO 

sy/sOscp + cy/ccp 

sy/sOscp + cy/scp 

—sO 


cOccp 


cOccp 


1 

0 

0 


0 0 
cos cp — sin cp 
sin cp cos cp 



Aircraft and spacecraft orientations are commonly described through the Euler 
angles roll, pitch, and yaw with the Euler angle system III. 

The relation between the Euler angles and the directional cosine matrix is 
constructed through Eqs. 29, 30, and 31. The directional cosine matrix can be 
obtained when the Euler angles are given. However, the Euler angles cannot be 
easily obtained from the directional cosine matrix since the arccosine function 
holds multiple values. Therefore, some geometric constraints on the Euler angles 
should be imposed in order to make the solution of Euler angles unique. Given a 
rotation matrix R, the Euler angles y /, #, and cp can be computed by equating each 
element in R with the corresponding element in the matrix product R z ,^'Rf/ ,<rR W ,<? 
in Eq. 29. In order to obtain a unique solution of the Euler angles, the Euler angles 
are usually specified as 

0 < y/ > 2 n, 0 < 6 < n, and 0 < cp > 2 n. 


(32) 
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Equation 29 is taken as an example for solving the Euler angles. A general 
rotation matrix has the following form: 




*11 

R\1 

R 13 

R = 


Ri\ 

R22 

R23 



R31 

R32 

R33 


Equating each elements of above rotation matrix with Eq. 29 yields 



Ru = 

C\jfC(p — S\jfc0s(p, 

(34a) 

R\2 

— c\ffscp — sy/cOcq ) , 

(34b) 

R\3 

- sy/sO , 

(34c) 

R21 

sy/ccp + cyrcQscp , 

(34d) 

R 22 

sy/scp + cyrcQccp , 

(34e) 

R23 

—cyrsO, 

(34f) 

R 3 I 

- sOscp , 

(34g) 

R 32 ~ 

- S0C(p , 

(34h) 

R 33 

cO. 

(34i) 


Using the above equations, when \R 33 \ ^ 1, the solutions are 


0 


cos 1 (R33), 0 < 6 < n. 


(35a) 


cos 


-1 


R 


23 


¥ 


sin# 


>^13 > 0 


2 n — cos 


-1 


R 


23 


sin# 


,R\ 3 < 0 


(35b) 



However, when \R 33 \ approaches to 1, 6 is 0, or =b n, the other two Euler angles y/ 
and (p cannot be determined by Eq. 35b and 35c. If the rigid body motion is known 
in advance with \R 33 \ ^ 1, Eq. 35a-c are a better selection to solve the Euler angles. 
Therefore, in numerical solution given by Paul et al. (1981), the Euler angle 6 is 
defined by 
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' 0 < 0 <—, x > 0 and y > 0 
~ ~ 2 

- < 6 < n, x < 0 and y > 0 
2 _ _ _ 

—n < 0 < — — , x < 0 and y < 0 
- - 2 

— -< 6 < 0, x > 0 and y < 0 
2 “ “ “ y ~ 



Using the relation between Eqs. 29 and 33, the Euler angles can be solved by 
pre -multiplying the matrices Rz,ii/,Ru,e and Rw,cp successively with (Cheng and 

Gupta 1989) 





tan 


-l 


R 

R 


13 


23 


cos i J/ R n + sin 1/^22 
cosi//7?n + siny/R 2 i 


siny/R 


13 


cos y/R 2 3 


R 


33 


(37a) 

(37b) 

(37c) 


Certainly, one should post-multiply a given rotation matrix to solve the Euler 
angles. For other two Euler angle representation systems corresponding to Eqs. 30 
and 31, similar mathematical operations can be used to solve the Euler angles. 


Homogenous Representation 

The homogeneous coordinate transformation is introduced with the N component 
vector described by an N + 1 dimensional vector, namely, the position vector in 
Eq. 19 with the homogeneous coordinate representation is given by 




From Eq. 28, the homogenous transformation matrix is given by 



where p° and p N are the homogenous coordinates of point p in the different 
coordinate frames; the 4x4 matrix in Eq. 39 is named as a homogeneous 
transformation matrix. The homogeneous transformation matrix includes 
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the information relative to the rotation and the translation transformation. For a 
more general representation, the point coordinates and its homogeneous coordi- 
nates are given by 





where w is a scaling factor. 

For a special case of pure translation transformation with the translational vector 

T 

dx dy dz] , the translation transformation is defined as 


10 0 d x 
0 10 d y 
0 0 1 d z 
0 0 0 1 



T 

Given a point with homogenous coordinates u = [x, y, z, w] , the translation 
transformation yields 



1 0 0 d x 


X 


x + wd x 


x/w + d x 

0 10 d y 


y 


y + wd y 


y/w + d y 

0 0 1 d z 


z 


z + wd z 


z/w + d z 

_0 0 0 1 


w 


w 


1 



Similarly, the homogeneous rotation matrices corresponding to Eqs. 23, 24, and 
25 are given by 



x, a 



o o 

cos a — sin a 
sin a cos a 

0 0 



cos /? 0 sin/? 0 

0 10 0 

— sin /? 0 cos /? 0 

0 0 0 1 



cosy 

siny 

0 

0 


— siny 
cosy 
0 
0 


0 

0 

1 

0 
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Using above homogenization coordinate representations, the general 
4x4 homogenization transformation is given by 



The translation and the rotation transformations are the special cases of the 
general transformation in Eq. 46, where R 3x3 is the rotation matrix, d 3xl represents 
the position vector of point p , and w 0 = 1 is the scaling factor. From Eq. 46, the 
inverse of a homogeneous transformation matrix can be written as 



Quaternion for Rigid Body Motion 

It is well known that three angles cannot afford a regular representation of the 
rotation group 50(3), since there are singularities (Arribas et al. 2006). The use of 
four Euler-Rodrigues symmetric parameters, which introduce a set of four quanti- 
ties based on relations among the Euler angles, to parameterize a rotation dates back 
to Euler in 1771 and Rodrigues in 1840 (Senan and O’Reilly 2009). In fact, these 
parameters can be considered as a later proposed quaternion. 

William Rowan Hamilton developed the quaternion in 1843, in his effort to 
construct hypercomplex numbers or higher dimensional generalizations of the 
complex numbers. Failing to construct a generalization in three dimensions in 
such a way that division would be possible, he considered systems with four 
complex units and then arrived at a quaternion. Vector was introduced by Hamilton 
for the first time as pure quaternion, and vector calculus was first developed as part 
of this theory (Coutsias and Romero 2004; Widdows 2006; Hamilton 1844). Soon 
after that, it was discovered that rotations may be represented by quaternions. 
Rodrigues, Cayley, Klein, Whittaker, etc. also contributed to the quaternion repre- 
sentation (Dai 2006). 

By 1873, Clifford obtained a broad generalization of these numbers that he 
called bi-quatemion and referred here as double quaternion. A double quaternion 
defined a double rotation in four-dimensional space (Ge 1998). Ge, Ztzel, and 
McCarthy introduced the double quaternion, which achieves the mapping from 
transformation of rigid displacement in the three-dimensional space to rigid rotation 
in the four-dimensional space. Furthermore, Ge applied the double quaternion to 
the motion interpolation in computer graphics (Ge et al. 1998) and McCarthy used it 
for dimensional synthesis of robots (McCarthy 1990). 

At the turn of the twentieth century, Kotelnikov (1895) and Study (1901) 
developed dual vectors and dual quaternion for the study of mechanics. The dual 
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numbers represent algebra over 9\ . Its difference from complex numbers is that the 
basic object of the dual number algebra is a symbol e that is nilpotent. The effect of 
introducing e is to produce a non-division algebra. When one uses such numbers for 
the components of a quaternion, it allows representing the three-dimensional rigid 
motion by means of the group of unit dual quaternions, just as the unit quaternion 
carries a representation of the group of three-dimensional Euclidean rotation 
(Delphenich 2012). 

Nowadays, dual quaternions have been widely adopted in the mechanical 
engineering field because the dual quaternion gives a computationally-efficient 
algorithm for dealing with the rigid body motion that includes translations, such as 
the motion of joints in robotic manipulators (Gouasmi 2012; Aspragathos and 
Dimitros 1998). 


Euler-Rodrigues Parameters 

The column vector of Euler-Rodrigues symmetric parameters is a special 
case of a more general object, the quaternion, whose operation rules can be 
defined from the angle of algebra. In 1840, Rodrigues explicitly defined four 
parameters by presenting a scalar with the cosine of half the rotation angle 
and further defined three numbers by integrating the direction cosines of the 
rotation axis with the sine of half the rotation angle. But before talking about the 
Euler-Rodrigues parameters, let’s take a look at the two invariants of a rotation 
matrix. 

Linear Invariants of a Rotation Matrix 

Any 3-by-3 matrix A can be rewritten as the sum of a symmetric part A s and a skew- 
symmetric part A ss , which can be defined as follows: 

A.s = 1 (A + A r ) , A ss = 1 (A — A r ) . (48) 

Then the two invariants of the matrix A can be defined as 


vect( A) 




tr( A) = an + a 2 2 + ^ 33 - 



It can be verified that the vector of A, i.e., a , satisfies a x v = A ss v for any vector 

Q 

v G 93 , and tr( A) = tr( A ss ). According to Cayley-Hamilton theorem, a rotation 
matrix Q, denoting a rotation about an axis (denoted by a unit vector n ) with an 
angle a, can be expressed as (McCarthy 1990) 


1736 


Z. Xiong et al. 


Q = I + sinaS(w) + (1 — cosa)S 2 (w), 


(51) 


where S(n) is the cross product matrix of n and can be calculated by 


S (n 


nx 


0 

n 3 


ni 


~n 3 

0 

n\ 


n 2 

-n\ 

0 


It can be deduced that 


tr( Q) 


vect( Q) = v ect (sin crS(n)) = sinew. 


tr(l + (1 — cosa)S 2 (w)) = 1 + 2 cos a 


(52) 


(53) 


(54) 


Now, two parameters will be defined to denote the invariants of the matrix Q, 


vect(Q) and tr( Q), respectively: 


r4 


cos a 


tr( Q) - 1 
2 


(55) 


sin an . 


(56) 


Obviously, 


rl + 


2 


1. 


(57) 


Euler-Rodrigues Parameters 

The natural and linear invariants of a rotation matrix are just one kind of invariants 
that are used in kinematics. Therefore, the Euler-Rodrigues parameters are defined 
as 



These are usually arranged in four-dimensional column vector rj, given by 


n 


n 

n\ 


m ni n 3 n* 


T 


which satisfies a simple normalization constraint 


—T— 

rj 1 1 


2 , .22 _ i 


*1 +>h 




The rotation matrix Q can be denoted easily by using Euler-Rodrigues 
parameters, namely, 
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•u - n 2 - % + fi 4 


2 ('/i ) 72 + VaV 3) 

~tl\ + til - til + til 

2(ti 3 t]2 ~ rjAth) 


2(??i??3 - tl^th) 

2{thth + tl^th) 

-til -tll+tll + rjj 



Euler-Rodrigues symmetric parameters can also be extracted from the rotation 
matrix. Noting that 


tr (Q) — Qn + Q22 Q33 — 4^/4 1, 


if a 7 ^ n, it can deduced that 




1 


±2 V 1 + tr( hti 





if a = jt, rj 4 = 0 , and tj = n. 

To illustrate the relationship between the invariants of a rotation matrix and the 
Euler parameters, the square roots of a matrix are introduced. In fact, one square 
root of the matrix is just a case of its analytical functions and can be expressed by 
the power series of the matrix. Besides, geometrically, if a matrix denotes a rotation 
about the axis n by the angle a, then the matrix squared denotes another rotation 
about the same axis, but by 2 a and the square root of matrix by a 12. So generally, 
the correspondent Euler-parameters of the matrix Q can be represented by the 

invariants of the matrix 


rj = vect(Q_),r] 4 


try Q) - 1 

2 



However, it is of great significance to realize the difference between the linear 
invariants and Euler-Rodrigues parameters. Besides what is shown in Eq. 63, the 
Euler-Rodrigues parameters can be calculated correctly at any rotation angle while 
linear invariants can’t be sought when the rotation angle is n. 

The Euler-Rodrigues symmetric parameters have several advantages over the 
rotation matrix as an attitude representation. There are fewer elements and fewer 
constraints. Besides, the composition rule is simpler. 


Definition and Algebraic Properties 
Definition of Quaternion 

A quaternion q is defined as an expression below: 


q = xn+x\i + X2j + x 3 k, 


(65) 
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where i 2 = j 2 = k 2 = ijk= — 1, ij = kjk = i, ki = j,ji = — k, kj = — i, and /A: = —j, 
with xq, Xi, x 2 , x 3 G R. 

This description was given by Hamilton; the later three units i, j, and k can be 
considered as the basic vectors in a Cartesian coordinate. So the alternate notation 
of a quaternion is 


q = (5, v) 




where s is the scalar and v is the vector, and s, a , b , and c are real numbers. 


Algebraic Properties of Quaternions 

The conjugate of the quaternion q can be denoted as 


q* = s — ( ai + bj + ck ) . (67) 

The modulus of the quaternion q can be expressed as 


q 


2 2 . 2 . 7 2 , 2 

= £ + <2T + £ + C 


The inverse of the quaternion q is defined to be 




s — (ai + bj + ck) 
s 1 + a 2 + b 2 + c 2 



Suppose #1 = £1 + Vi = Si + + bj + CiA: and <72 = s 2 + v 2 = s 2 + a 2 i + ^2/ + c ^k\ 

then the principles of algebraic operations of quaternions are shown as follows. 
Addition of quaternions is defined component-wise, i.e., 


<7i + q 2 — (s\ + S2) + (a 1 + (22)1 + (^1 + ^2)7 + ( c i + c 2 )k. (70) 

Grassman product of q\ and is defined by 


q x °q 2 = sis 2 - Vi • v 2 + siv 2 +s 2 Vi + Vi x v 2 , (71) 

q { °q 2 = 2 vi x v 2 + <7 2 0< 7i • ( 72 ) 

Quaternions are far less predictable than their lower-dimensional cousins, 
due to the great complicating property of noncommutativity. Many of the 
ideas, which work beautifully for real or complex numbers, are not suited to 
quaternions, and attempts to use them often result in lengthy and cumbersome 
mathematics. However, quaternions can be used to recreate many of the 
structures over the real and complex numbers with which people are familiar 

(Widdows 2006). 
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Fig. 1 1 Rotation of a vector 
about a fixed axis 



Quaternionic Representation of Rotation of Rigid Body 
Rotating a Vector About a Fixed Axis 

According to Euler’s theorem, if a rigid body rotates around a fixed point, it must be 
in the form of rotating about an Euler axis by an angle. In Fig. 1 1, r Q is rotated by a 
about the axis denoted by the vector n and produces a new vector r N . There is a unit 
quaternion 


q = ( cos (a/ 2), sin (a/2)n), (73) 

relating r° with in the same frame [O'] by 

r N = qor 0 oq\ (74) 

where r N and r Q are two quaternions with zero as scalar part and r N , r Q as vector 
part, respectively. In following context, a vector is treated equivalent as its 
corresponding quaternion with a vanishing scalar part when calculated with other 
quaternions. 

The proof of the formula above is given by 

q°xo°q* = (0, ( n • ro)n + (n x ro) sin a + (n x r 0 ) x n cos a). (75) 

Due to the vector r Q can be decomposed into the following form (Arribas 
et al. 2006) 


v o — { r o * n ) n + (n x vq) x n. 


(76) 
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Fig. 1 2 Frame rotation 
depicts fixed vector in 
rotating frame { O } 



Substitute Eq. 76 into Eq. 75, and then 


q°Y 0 °q* = (0,1*0 + (n x r 0 ) sin a + [n x (n x 1 * 0 )] (1 — cos a)). 



This is equivalent of Euler-Rodrigues formula in quaternionic form. 

Now consider the case with a succession of rotations by variant axes with 
different angles. Suppose the original vector is rotated about axes tii and n 2 by 
angles 6\ and 0 2 , respectively and successively. The new vector can be also 
obtained by one rotation about axis n 3 by angle 0 3 . The relationship between 
these rotations can be represented in quaternionic form as follows: 


<?3 = 42 0 <7 1 > (78) 

where q t = (cos(0 z /2), sin(0 z -/2)#i/), i — 1, 2, 3 (Coutsias and Romero 2004). 

Coordinate Transformation Based on Quaternion 

A quaternion can be used to facilitate the representation of rotation instead of other 
parameters. Consider Fig. 12; the rigid body is represented by the body-fixed frame 
{ 0x o y o z o \ , and the vector r is assumed to be fixed with the frame. For the frame 
rotation about a unit axis n with an angle a, r is rotated to r' when the frame 
{0x o y o z o } is rotated to frame {O^q^}. Supposing i u h and i 3 are basic vectors of 
frame { 0x o y o z o \ and e x , e 2 , e 3 are those of frame and A is the rotation 

matrix, it is obtained that 



Note that 








it is deduced that 
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In fact, if a fixed vector r is defined in the space, r° and are their expressions in 

frames { 0x o y o z o } and { Ofy ^ } , respectively, and v° and r^ are their corresponding 
quaternions with zero scalar part, and then the rotation of coordinate can be denoted 
as 


r N = q*or°oq, (82) 

where q = (cos(a/2), sin(a/2)#i) (Wu et al. 2005). 

The Establishment of Quaternionic Differential Equations 

The quaternionic form of the coordinate transformation description is shown in 
Eq. 82. Perform differential operation at both sides of the equation 



dq* 0 dv° 

« r <■<,+«<■- 


o q -f q* or 




Note that q is a unit quaternion and consider that 






and then it can be deduced that 



N w jv (dq* \ jv jv (dq* 

—co x r = ( o q o — p o o q 

\dt J \dt 

= 2Vect(^^oq S j x r^, 

where Vect { •) denotes the vector part of a quaternion and co N is the angular velocity 
of frame {N} relative to frame { O } , expressed in the frame {N}. 

Obviously, 





Further, the kinematic equations of a quaternion are obtained, namely, 


dq 

dt 


1 

2 


qo(o 


N 


1 

2 


(0°°q, 


(87) 


<9 


where is the angular velocity of frame { N } relative to frame { O }, expressed in 
the frame { O } . 
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Fig. 13 Translation and 
rotation of a rigid body 



Dual Quaternion 

Although unit quaternions can be employed to represent the rotation of rigid bodies, 
they can’t deal with the general motions of a rigid body, which consist of both 
rotation and transition motions. In order to overcome the limitation, hypercomplex 
quaternions are invented to denote the displacements of a rigid body uniformly, 
instead of the traditional method (R, t), which denotes the displacements separately. 

A set of hypercomplex numbers called dual quaternion may be obtained 
from the even Clifford algebra of four-dimensional Euclidean space E 4 . Let 
£i, i = 1, . . ., 4, be the natural coordinate vectors of E 4 ; then, the multi-linear 
algebra of points in E 4 can be constructed (Fig. 13). 

Introduce the Clifford product 

e { ej + efii = -2e z - • e j9 (88) 

where the dot operation denotes the Euclidean scalar product. The even sub-algebra 
C + {E 4 ) is of rank 8, and a typical element can be written as 

q = q + eq f , (89) 

where q and q' are Hamilton’s quaternions and e = eie 2 e 3 e 4 satisfies the identity 

8 2 = 0 . 

Clifford shows that depending on the definition of the scalar product in Eq. 88, 
complex quaternions are obtained when e = — 1 and double quaternion when 

e 2 = 1 (Clifford 1873). 

Dual Quaternion 

Dual numbers were first introduced in the nineteenth century by Clifford and further 
developed in their principle of transference. A dual number is defined as 

z = a + sb, (90) 


r\ 

where e = 0, but 8 / 0, and a is called the real part and b the dual part. 
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By definition, dual numbers satisfy 

+ Z 2 = CL\ + (22 T ^(^l + ^> 2 ), (91) 

Xz = Xa + cXb, (92) 

fiZ 2 = Cl\Cl2 + e(fll&2 + «2^l). (93) 

A dual number with zero real part cannot be inverted, because the inverse is not 
well defined. The function in dual number arguments has an important property. 
Since all powers of e equal to or greater than two are zero, a Taylor expansion yields 

f(a + eb) = f(a ) + sbf'(a). (94) 

A dual quaternion is in fact a dual number with quaternion components, i.e. 

q = q + sq f = s + ai + bj -f ck + e(s f + a'i + b'j -f c'k ) . (95) 

Alternatively, dual quaternion can also be interpreted as a quaternion whose four 
terms are dual numbers: 


q = (£, v) = s + ai + bj -f ck, (96) 

where s is a dual number and v a dual vector. Obviously, a dual vector can be 
regarded as a dual quaternion with zero scalar part. 

Dual quaternion has many properties in common with a quaternion, which can 
be demonstrated by the definitions of the algebra operations between dual quater- 
nions. Suppose q x — (s x ,v x ) and q 2 = (s 2 , v 2 ), and then 


<?1 + <?2 - 

(5i +S 2 ,Vi + V 2 ), 

(97) 

Xq 

- {Xs,Xv), 

(98) 

/\ /\ / /\ /\ /\ 

<h °Qi ~ ( s i s 2 ~ vi 

A A A A A A /N \ 

• v 2 , 5 \V 2 + S 2 V 1 + Vi X v 2 ). 

(99) 


In fact, according to the principle of transference by Kotelnikov, the character- 
istics of a quaternion are completely inherited by a dual quaternion. 

Representation of Screw Motion of Rigid Bodies Using Dual Quaternion 


Plucker Line 

A unit dual vector is one of alternative representations of lines in space. The real 
part is the unit direction vector of a line and the dual part is the line moment with 
respect to the origin of the coordinate frame. In this case, a unit dual vector is 
known as the Pliicker coordinate or the Pluckier line. Accordingly, a line with unit 
direction vector l and passing through a point p , as shown in Fig. 14, can be 
represented as a Pliicker line (Wu et al. 2005). 
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Fig. 14 Pliicker line 


/ = / + sm, 

where m is the line moment and satisfies that 



m = p x l. (101) 

Pliicker lines can be manipulated to derive the relationship between lines. For 
example, the scalar product between two Pliicker lines is equal to the cosine of a 

dual angle 0 = 6 + sd that has a nice geometric interpretation: 0 is the crossing 
angle and d is the common perpendicular distance, as shown in Fig. 15. It can be 
formulated as 


l\ • h = cos <9, (102) 

/\ /\ 

where l\ = l\ + e,m\ and l 2 = h + , respectively. The cross product can be 

calculated by 


h x / 2 = n sin 6 , 


(103) 


where n represents the common perpendicular line in the direction of/i x l 2 , as is 
shown in Fig. 15. 

Representation of Rigid Body Displacements with Dual Quaternions 

For a general rigid motion (R, t), the equivalent screw motion parameters, say, 

where / = l+sm, can be deduced. The geometry of the screw motion is 

illustrated in Fig. 16. In the figure, p is the vector that points from the origin of 
frame { O } to its projection on the screw axis /. 

Suppose that the relationship between the frame { O } and the frame {N} shown 
in Fig. 16 is described by a rotation q succeeded by a translation ^ (or a translation 

O /s O 

t succeeded by a rotation q ), and the Pliicker line I is expressed as / in the frame 
{ O } and T in the frame { N } , then 
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Fig. 15 Dual angle 



Fig. 16 The screw motion of 
a rigid body 



1° = 1° + sm ° , / = / Ar + era A / (104) 

The rotation quaternion q can be derived from the rotation matrix directly, i.e., 

q = (s,v) = ( cos (a/2), sin (a/2)/). (105) 

From Eq. 101, it can be known that 


r 


>K 

q o 


foq. 


006) 


n t N = q*°m°oq + - °q* °l° °q + q* °l° oqo^^j 

jLml 


(107) 


Define a new quaternion q' = \q°t N , and then it can be deduced that 


/ = q °l °q. 


(108) 
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where the unit dual quaternion q is a function of q and t , or <7 and t^, i.e., 


1 N 1 n 

q = q + £-q°t =q + s-t °q. 

A* A* 


(109) 


Equation 108 is the dual quatemionic representation of the rigid body displace- 
ments. Given the facts that 


c = - ( t — dl + cot °zl x , m 


2 


2 


and that the unit vector l satisfies (/ x t) x l = t — (t • 1)1, then 


a d 

sin —m + - cos 
2 2 


Finally, it can be calculated that 


q = q + s\ ^toq 


(s, v) + £ 


a a 

cos — , sin — / 1 + g 

A* A* 


d 

2 



1 

< / : 



2 

) X 

/- 

a . 

1 

- 

— 

2 

2 

( 

1 

- 

- V 

V 

2 

• 

a 

sin 



^ x /+ (/ x t) x /cot ( 110 ) 


(jt+t X V 


1 , 

t, - (st + t X V 
2 v 

a d a 
in — w + — cos - / 
2 2 2 


(111) 


( 112 ) 


The dual quaternion can also be explicitly written as a function of elements of 
the screw motion 








q 


cos 1 sin (| |/|, 


(113) 




where / is the screw axis and a is the dual angle of the screw motion and 


a = a + £d,/=/ + <;m. 


(114) 


Kinematic Equations in Dual Quaternion 

The rate of change of a dual quaternion, which denotes the velocity of the displace- 
ment, can be deduced from Eq. 109 by 


2 q = 2q + £ ( i° °q + t° oq 


Considering Eq. 85 gives 


(115) 


2q = oeC o q 4- g ( t° oq -\- (t° x to 0 ) °q -f ^co°ot 0 oq 


o 


co° -f- £ ( t ° + t° x (o° ))o[ q -\- oq 


(116) 


* o * 

go o q 
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where 


co° = co° + £ ( i° + t° x Q)°),q = q + & °q 


( 11 7 ) 


And then 


co 


N 


o ^ 
q ocd o q 

i 


* 


q-\-£-t°°q) ° l co° + s( t° + x co° 


1 


o 


° 1 <7 + e-t °# 




co* + e(^*° (t° x oq -f co N x 


(118) 


Due to the fact that q ° q* = 1 , t N = q* o t° o g, it is obtained that 


;/V * ( :0 . .n O 

t = q ° ( t + t x co i oq 


(119) 


Substituting Eq. 119 into Eq. 118 gives 


r JV _ r JV | _ ( lN N v JV 
CO — (0 + £ t H - CU X i 


( 120 ) 


Thus Eq. 116 can be rewritten as 


/-> ^ ^ O ^ ^ ^ ]\J 

2q = oo o q = q°(S) . 


( 121 ) 


with the dual vectors Vectfco 0 ) and Vect(co iV ) called twists 


N 


Double Quaternions 

Brief Introduction of Double Quaternion 

According to Clifford algebra, if £ satisfies £ = 1 in Eq. 96, a double quaternion is 
defined. Now, introduce the symbols £ = (1 — e )/ 2 and rj = (1 + £)/2, and then the 
double quaternion can be reconstructed as 

<? = (<?-</)£+(<? + </> v ( 122 ) 

So a double quaternion is redefined as 

G = £G + r\H, (123) 

where r\ satisfy that f = 77 , £77 = 0 and G, H are unit quaternions (Akyar 

2008). In short, the double quaternion can be denoted as G = (G,//). 
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A double quaternion can be used to represent a rotation in the four- 
dimensional space. Supposing two continuous rotations are denoted as 

G\ = and G 2 = (G 2 ,// 2 ), respectively, then the combination of them is 

G 3 = G!oG 2 . (124) 

According to the definition of a double quaternion, it can be obtained that 

G 3 = G\ °G 2 ,// 3 =H 1 oH 2 . (125) 

That is to say, the rotation G and H can be operated separately. 

Spatial Motion and Rotations in E 4 

For a displaced rigid body, 0, cj), and y/ are supposed to be the longitude, latitude and 
roll angles defining the orientation, and the four-order homogeneous transform can 
be referred as 


where 



A (0,(p,ys) 

0 T 






cOcy/ — sOscpsy/ 

—c6s(j) — s6s(j)cy/ 

s6c(j) 


ccfrsyf 

c(pcy/ 

S(j) 

(127) 

—s6cy/ — cOscffsyr 

sOsyr — c6s<pcy/ 

cOccj) 


T 

d — (d.xi dy , dz'j , 


(128) 


with s and c representing the sine and cosine functions. 
Now define the angles a, /?, and y as 



(129) 


where R is the radius of the hypersphere to which the translational elements are 
computed. Successive rotations of a in the W-X plane, in the W-Y plane, and y in 
the W-Z plane determine a four-order rotation matrix 



ca 

0 

0 

sa 

J 

—s/3sa 


0 

s/3ca 

—sycfisa 

—sys/3 

cy 

sycfica 


—cycfisa 

—sficy 

— sy 

cycfica 


( 130 ) 


Defining a four-order matrix, 
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A(0,<j>,yf) 

0 T 



(131) 


A general rotation in E 4 can be expressed as ( Ahlers and McCarthy 200 1 ) 


D = J(a,f},y)K(0,<f>,ifr). 


(132) 


Considering the limiting case that a, /?, and y approach to zero, it is obtained that 

cos a = cos /? = cos y = 1 , and sin a = ^ , sin fi = and sin y = % 

Thus, 



(133) 


Calculate the Taylor series expansion of matrix D (Etzel and McCarthy 1996): 



0 T 




(134) 


and it can be deduced that the first-order term contributes components in the 
W direction and D is identical to a homogeneous transform representing a displace- 
ment in W = R hyperplane to the second order in R~ l . Thus, by increasing R , 
displacements in E 3 can be approximated as closely as desired using rotation in E 4 . 


Double Quaternion for a Spatial Displacement 

By using Cayley’s formula, the skew -symmetric matrix [5] can be obtained as 



D-I][D+I\ 



0 

-u 3 

U 2 

Vl 

1 

u 3 

0 

— U\ 

V2 


— U2 

U\ 

0 

v 3 


_-Vi 

-V 2 

-v 3 

0 


Now, introduce two matrices 


Pi 





0 

-s 3 

si 

si 

B 

+ 

B' 


^3 

0 

-S\ 

s 2 


2 ki 


-S2 

S\ 

0 

S3 





-Sl 

S3 

0 




" 0 

—t 3 

h 

-h 

B 

— 

B r 


h 

0 

—t\ 

~h 


lk 2 


~h 

h 

0 

—t 3 




h 

h 

h 

0 




(137) 


r* 

where [B ] is obtained by interchanging the u, and v, terms and Y s‘ 


It] 
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Define /u and v by the following equations: 


ju = tan l (k\ + k 2 ) + tan l (k\ — k 2 ), (138) 

v— tan _1 (/:i + ^ 2 ) — tan ~ l {k\ — k 2 ), (139) 

and then the double quaternion is given by G = + rjH , 

where 

G = cos// + ssin fi,H = cos v + f sin u, (140) 

rjr nr 

where s = (s x , s 2 , s 3 ) and t = (^, t 2 , t 3 ) define the axes to which the angles fi and u 
are to be rotated, respectively. 


Spatial Displacement from a Double Quaternion 

Any double quaternion in form of Eq. 140 has a correspondent displacement, whose 
rotation parameters can be calculated as follows. Supposing D is the four-order 
rotational matrix, it can be calculated by 


D = e'' s+uT . 


(141) 


Noting the properties of S and T that ST = TS, S 2 = T 2 
that 


—I, it can be deduced 


D = e tlS e vT , 

where 



COS // 

—s 3 sin ji 

s 2 sin ju 

si sin ji 


S3 sin ju 

cos ju 

—si sin ju 

s 2 sin ju 


—s 2 sin ju 

s 1 sin ju 

cos ju 

S3 sin ju 


—si sin ju 

—s 2 sin ju 

—S3 sin ju 

cos ju 


cosu 

— 1 3 sin v 

t 2 sin v 

—ti sinu 

e vT 

t 3 sin v 

cosu 

—ti sinu 

— 1 2 sinu 


— 1 2 sin v 

t\ sin v 

cosu 

— 1 3 sinu 


t\ sin v 

t 2 sin v 

t 3 sin v 

cosu 


(142) 


(143) 


(144) 


The rotation matrix can be calculated by substituting Eqs. 143 and 144 into Eq. 142. 

In this part, the Euler-Rodrigues parameters are introduced. In fact, they are a 
special case of quaternion, which was proposed earlier than quaternion. Based on 
that, quaternions and their mathematical backgrounds are introduced. Quaternions 
can be applied to the representation of the rigid body motions. Due to the limitation 
that quaternion can only deal with spatial rotation, dual quaternion and double 
quaternion are then introduced to represent the spatial displacement in the three- 
dimensional space. 
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The Euler angles, directional cosine matrices, and quaternions can all be adopted 
to represent the movement of rigid body, while quaternion equations are composed 
of only four linear differential equations and just have one constraint that the 
modulus of the quaternion is 1 . This yields a benefit of small amount of calculation. 
And it is simple to combine two rotations using the Grassman product. So quater- 
nions are a better alternative to represent the attitude of rigid bodies. Although 
quaternions are limited to represent the rotation of rigid bodies, there are some 
extensive tools that defined to overcome the limitation. 

Dual quaternion is a powerful mathematical tool for the spatial analysis of rigid 
body motion. It makes the solution more straightforward and robust and allows us 
to unify a translation and a rotation into a single state instead of having to 
define separate vectors. While matrices offer a comparable alternative to dual 
quaternions, they can be inefficient and cumbersome in comparison. In fact, dual 
quaternions give us a compact, un-ambiguous, singularity-free, and computa- 
tional minimalistic rigid transform. In addition, dual quaternions have been 
shown to be the most efficient and compact form of representing rotation and 
translation. 

On the other hand, a double quaternion can be employed to represent the rotation 
in four-dimensional space, for the displacement of a rigid body in the three- 
dimensional space can be considered as a rotation along a spherical surface with 
an infinite radius, which can be defined large enough in applications. 


Screw and Rigid Motion 

The early work of screw theory could be dated back to the Italian mathematician 
Mozzi’s finding (Mozzi 1763) of the instantaneous motion axis, and then in 1830, 
Chasles’s well-known theorem (Coolidge 2003) revealed that any motion of a rigid 
body could be replaced by a screw motion, which is a translation and a rotation 
about an axis. Poinsot (1851) revealed that any combination of forces acting on a 
rigid body can be duplicated by a wrench, a single force along a line, and a torque 
couple about the same line. 

Based on the former works of Chasles, Poinsot, and the others, Ball published ‘A 
Treatise on the Theory of Screws' and laid the foundation of screw theory 
that integrates rotation and translation quantities into screw (Ball 1998). After a 
long period of silence, Yang and Freudenstein (1964) applied dual quaternions 
to obtain the screw displacement in 1964. Dimentberg (1965) presented the 
finite screw displacement with a screw axis attached with the tangent of the dual 
angle of half the rotation in 1965. In 1978, Kenneth H. Hunt (Hunt 1990) published 
Kinematic Geometry of Mechanisms, which is the modern development of screw 
theory. In 1986, Young and Duffy (1986) applied screw theory in analyzing 
the extreme postures of manipulators. In 1990, Phillips (1990) applied screw theory 
in the analysis of mechanism and published the book Freedom of Machinery . Tsai 
and Roth (1973), Dai and Jones (2001) and Huang et al. (2013) also made lots of 
efforts in the study and application of screw theory (Dai 2006). 
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Fig. 17 Line vector 


Line Vector 



A line can be determined by point A (x | , y 1; z t ) and point B(x 2 , y 2 , z 2 ) in a specific 
order. This line can be represented by vector S. And in the coordinate system 

(Fig. 17), 


S = Li + Mj + Nk, 

x 2 — x\ = L 

y 2 ~y\ = M 

Z 2 - z\ = N. 

The distance between the two points is 



(145) 

(146) 


(147) 



/ = 
m = 
n = 



(148) 


o '7 9 

where / + m + n = 1 . Obviously /, m , w is the direction cosine of line S. If point p 
is on the line and is the vector pointing from origin to /?, then the vector equation 
of the line is 


(r — i*i ) x S = 0. 


(149) 


The standard form is 


(r-r l )xS = 0. (150) 

Sq is the cross product of r 1 and S and is referred as the moment of line which is 
also a vector. Obviously, S • S 0 = 0 and ( S ; 5 0 ) comprise what is called the Pliicker 
coordinates of the line, and it is a set of homogenous coordinates since (AS; AS 0 ) 
represents the same line in the space, except with different length. 
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Let d be the distance from origin to the line; then, if S is a unit vector, the 
magnitude of S 0 is equal to d, that is, 



Also the vector q perpendicular to the line with the length of d is 



Sx S 0 
S S 0 • 



When S 0 = 0, the line goes through the origin and the Pliicker coordinates are 
0 S ; 0). When S = 0, q -> + oo and the line lies in a plane at infinity. 

If a vector is confined by a line whose orientation and position are fixed, only 
allowing the vector slide along the line, then this vector is called line vector. Line 
vector is then defined by vector S and its moment 5 0 . The Pliicker coordinate of 
line vector is then ( S ; 5 0 ). Vector S represents the orientation of the line and is 
irrelevant to the position of origin while moment S 0 changes as the origin changes 
(Huang et al. 2013). 


Screw 


For line vector S ■ S 0 = 0 and in the general case when S , S 0 may not be 
perpendicular to each other, a screw is defined as Davidson and Hunt (2004) 

$= (S;S°) = (L,M,N]P*,Q*,R*) 

= (S-,S 0 +hS)= (. L,M,N\P + hL,Q + hL,R + hL ). (153) 

= ( 5 1 ; r x S+hS) 

For each screw, there is a screw axis associate with it, defined by the line vector 
0 S ; S 0 ) = ( 5 ; r x S ), where vector r points to an arbitrary point on the screw axis. 

The magnitude M of a screw is equal to the magnitude of vector S. When 
M = 1, $ is a unit screw. Let $j be a unit screw and a screw $ can be expressed as 


$ = (S\ S°) = MS/ 



Also, every screw has a pitch h: 


(154) 

(155) 
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The pitch of line vector as a special case of screw equals to 0. If the origin of the 
coordinate frame is changed from { O } to { N }, then the new screw will be 


S N = ( S ; S£) = (S'; S° + NO x S) . (157) 

Since S ■ S% = S ■ (S° + NO x S) = S ■ S°, this indicates that pitch h is origin- 
independent. 

A screw $ can be represented as two vector S(L,M,N ) and S°(P and 

S is called the real unit, and S° is called the dual unit. 

When S = 0, $ = (0; S° ) is called a couple, and h = +oo. 

A general screw $ can be fully determined by six independent parameters, and 
$ = (S\ r x S + hS) consists of homogeneous coordinates, so when multiplied by a 
scalar only changes the magnitude of screw. 

Though the change of origin does not change pitch h and clearly the direction 
and magnitude of screw, screws expressed in different frames need to be clarified. 
Let the transformation between two coordinate frames { O } and { N } be given by 
and °p N and the screw expressed in { N } be °$ = (°S; °S°) and N $ = ( N S; N S ° ), 
respectively; then (McCarthy 1990), 


1 

o 

Co 

i 
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1 

1 
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1 


5: 

1 

& 

1 


A 
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where °p N is the skew-symmetric matrix corresponding to vector °p N . 



Operation on Screw 
Screw Sum 

The sum of two screws Si = (5i; S?) and $ 2 = (S 2 ; S%) is still a screw $ z , whose real 
unit and dual unit are the sum of real unit and dual unit of Si and $ 2 , respectively, 
that is, 



As in the definition of a screw 



$=(S;S 0 + hS) = (S; So) + (o, AS), (160) 

this means that a screw can be considered as the sum of a line vector and a couple 
while S 0 is perpendicular to S and hS is parallel to S. 
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Scalar Multiplication and Reciprocal Product 

The multiplication of a scalar A and a screw $ is defined as 



o 


o 


The reciprocal product of two screws Si = ( S S\) and $ 2 = (S 2 ; S 2 ) is the sum 
of the scalar product of one screw’s real unit and the other’s dual unit, that is 


$ x o$ 2 = Si ■ZP 2 +S 2 -tf > l - (162) 

If the origin of the coordinate frame moves from { O } to { N }, the two screws Si 
and $ 2 become 



and in this new coordinate frame, the reciprocal product of these two screws is 



= ^i-(^+JVOx 

= ^ • S° + S 2 • 5? 


S 2 ) + s 2 ■ (S° + NO x Si) 
= S { oS 2 . 



This means that the reciprocal product of two screws is irrelevant to the origin 
(Yu et al. 2008). 


Rigid Displacement and Screw Motion 

As an arbitrary rotation in the three dimensional space corresponds to an orthogonal 
rotation matrix R, as stated in Euler’s theorem, there is always a fixed axis associate 
with the rotation. This axis is called the rotation axis. Let c be a vector pointing to a 
point on the axis; then, 


c = Rc (165) 

or 

(I 3 x 3 - R )c = 0. (166) 

Solution of c is the eigenvector corresponding to the unit eigenvalue of the 
rotation matrix R, and the other two remaining eigenvalues are cosO =b isinQ. If 
R has the following form 



r\ l 

r 12 

r \ 3 

ri\ 

1*22 

1*23 

y 3i 

1*32 

1*33 
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then the amount of rotation angle 6 and the unit vector w parallel to the rotational 
axis would be 



cos 


-i (r\\ + r 22 + 7* 33 

2 




1 

2 sin# 



7* 32 — 7*23 
7*13 — 7*31 
7*21 - 7*12 




Considering the case of a general rigid motion, point c is thought to be a fixed 
vector, then 


c = Rc + p (170) 

or 

(I 3x3 -R )c=p. (171) 

Since I 3x3 — R is a singular matrix as 1 is the eigenvalue of all rotation 
matrices, the conclusion would be there is no fixed point in a general rigid motion 
(McCarthy 1990). 

Let the unit vector w be the eigenvector corresponding to the unit eigenvalue of 
the rotation matrix R and the translation part p be decomposed into two parts, where 
p* is orthogonal to w and p° is parallel to w. After projected to the plane vertical to 
w , the remaining part of the rigid motion is a planar motion composed of rotation 
represented by R and a translation represented by p * . Again let c be the fixed vector 
associate with this planar motion, then, 


(I 3x3 -R )c=p*. (172) 

As this is a general planar motion, there would be a solution q except the case of 
pure translation while R = I. So when R ^ I, let the line passing through point q 
with the direction of w be l = q+ tw, t E M. Then, l could be represented by the line 
vector (w, q x w) = (w. So) naturally. 

Furthermore, the whole rigid motion could be considered as a rotation angle of 
magnitude of 0 about line l and a translation of amount of d along the line /, where 
d = \\p°\\. The ratio between 9 and d is the pitch: 



(173) 


Then, this rigid motion could be represented as a screw displacement 


$ = ( Qw ; q x 0w-\-h0w) = 6{w ; So~\-hw) = 0$j , (174) 


where $j is a unit screw displacement, and (Waldron and Schmiedeler 2001) 
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T 


6 = sign w p 


cos 


-1 ( r\\ + F 22 + F 33 


2 




(176) 

(177) 

(178) 


Note that q could be replaced by any point on the screw axis. Here, it can be said 
that a general rigid displacement is equivalent to a general screw displacement 
while screw is vividly analogue to the motion of a mechanical screw. 

As in the case of pure translation, that’s to say R = I, let w be the unit vector 
parallel to the translation direction and v be the magnitude of translation. Then, this 
translation can be represented by screw $ = (0, vw), where 


w = 

V = 



(179) 

(180) 



As shown in Fig. 18, the position vector of a point r before and after a screw 
motion (9 ^ 0) can be represented as Waldron and Schmiedeler (2001) 


r(9, h) = r( 0) + Ohw + sin Ow x (r(0) — q) — ( 1 — cos 0) (r(0) — q) 

— (r(0) — q) -w, (181) 

and this equation sometimes is referred to as Rodrigues’ equation and can be 
rewritten in a matrix form using homogenous transformation: 

r(0, h) = Rr(0) + p. (182) 

Expanding equation 1 80 and comparing it with Eq. 181 yield 



/ W 2 x Vg + Cg 

W5 

WyWo - W z S0 

W x WyVg + W Z S0 \ 


R = 

W x W y Vg + W z Sq 


WyVg + Cg 

W x w z vg - W x Sg , 

(183) 


\ W x W 2 Vg - WySg 

w 

r W z Vg + W x Sg 

W 2 Vg + Cg ) 



P = 

-(I 

— R)q + dhto. 


(184) 


where c e = cosQ , s e — sin# and v 0 = 1 — cos#. 
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Fig. 18 Screw motion 



d= hO 


In the case of pure translation, the conversion is quite straight forward: R = I and 
p = vco. 


Instantaneous Rigid Motion and Screw 
Instantaneous Rotation 

As in the case of pure rotation, line vector co (S; So) can be used to represent pure 
rotation, while co is the magnitude of angular velocity and S is a unit vector parallel 

O 

to the rotation axis. Let r E M be the point on the axis; then, the equation of 
rotation axis is 


rxS=S 0 ; 




where v 0 is the velocity of a point passing through the origin with the rigid body. 
Then the line vector as a special case of screw representing instantaneous rotation 
has two parts: the angular velocity co and the linear velocity v 0 . When the rotation 
axis passes through the origin, the Pliicker coordinates of the screw are (co; 0). 

Instantaneous Translation 

For pure translation, screw $ = o(0, S ) = (0, v) as a couple can be used to represent 
it, while o is the magnitude of the velocity and S is the unit vector in the translation 
direction. Since a parallel offset of the screw does not affect the motion of a rigid 
body, the exact position of the screw axis is irrelevant. Thus, $ can also be regarded 
as a free vector. Translations can also be viewed as rotation whose axis is orthog- 
onal to S and lies at infinity (Huang et al. 2013). 
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Instantaneous Screw Motion 

In a general case, a rigid body could rotate about a screw axis and translate along the 

Q 

axis simultaneously. Let r E be a point on the screw axis and S be the unit 
vector of the axis; then, the rotational velocity with magnitude co can be represented 
by 


((oS,r x (oS), (187) 

and the translational velocity with magnitude o can be represented by 

(0,o S). (188) 

Adding these two screws together yields the standard form of screw 

(go S,r x GOiS + o S), (189) 

where the second part can be viewed as the speed of the origin. Pitch h is the ratio of 
rotational velocity and translational velocity: 



cols' • (r x coiS* + Das') 

(x)S • CdS 



A screw can also be written as 


(190) 


co(iS, r x S + hS) = oo$/, 


(191) 


where $i is a unit screw. 

In this section, screw is introduced as a generalization of line vector, and a 
general rigid displacement could be viewed as a screw motion about a screw axis 
with pitch h and angle 6. Besides representing rigid motion, screw is often used to 
represent forces known as wrenches. Screw systems (Dai 2006), the composition of 
several screws, also hold a fundamental place in screw theory, which is not 
discussed here. 

Exponential Coordinates for Rigid Motion 

The exponential coordinate representation of rigid motion has a solid mathematical 
background of Lie group and Lie algebra because all the possible rigid 
motion transformation groups are a Lie group. Lie groups are named after Lie, a 
Norwegian mathematician who laid foundations of the theory continuous transfor- 
mation groups (Lie 1880). Klein, a friend of Lie, studied the spatial geometry 
groups using the transformation groups (Klein 1933). Then, Brockett (1984) 
first introduced the product of exponential formula into robotic field in 1983. This 
formulation was further developed by Murry, Li et al. in 1994 (Murray et al. 1994). 
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Exponential Coordinates for Rotation Displacement 

Pure rotation is a relatively simple but representative case of rigid motion especially 
in machinery. As mentioned in the preceding section, a rotational matrix 
R associated with each rotation has the following properties: 

RR t = R t R = I, (192) 

det(R) = 1. (193) 

The set of all orthogonal matrices with determinant 1, which is the set of all 
rotation matrices R, is a group under the operation of matrix multiplication denoted 

o o 

as 50( 3) C 9v . SO stands for special orthogonal, where special means the 
determinant of R being +1 instead of ±1 (Murray et al. 1994): 

50(3) = {ReR 3 x3 :RR T = I,detR = +l}. 

Since 50( 3) is a group, then it satisfies the four axioms of group: 

Closure: R X R 2 G 50( 3), V R x , R 2 G 50(3); 

Identity: There exists an identity element, for 50(3) is the 3x3 identity matrix 
I, such that RI = IR = R, V R G 50(3). 

Inverse: R T G 50(3) is the unique inverse of R, that is, RR T = R T R = I V R G 
50(3) 

Associativity: (RiR 2 )R 3 = Ri(R 2 R 3 ) V R 1? R 2 , R 3 G 50(3). 

According to Euler’s theorem, each rotation with a fixed point is equivalent to a 
rotation about an axis. Supposing for a general rotation, co is a unit vector denoting 
the direction of the rotation axis and 6 is the rotation angle. Since there is a rotation 
matrix R (R G 50(3)) corresponding to each rotation of the rigid body, R can 
thought to be a function of co and 6. Let p be a point on the rigid body, then the 
speed of the point can be expressed as 

p(t) = (O x p(t) = <op(t), (194) 

where co is the skew- symmetric matrix corresponding to the unit vector co 





( 197 ) 
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The operator extracting the vector from a skew- symmetric matrix denotes V, 
that is 


(co 


V 


CO. 


Then, ^(S) is defined as the set of all third-order skew -symmetric matrices, 
which can be expressed as 


so( 3) 


{se 



3x3 


: S 


T 


s}. 


Eqaution 193 is a linear first-order differential equation with its solution as 


p(t) 


e m, p{e o), 


(198) 


where p(# 0 ) is the initial position of the point p. Since the angular velocity is a unit, 
thus for 6 amount of time, the transformation could be expressed as 


R (<w,0) = e 60 , 


(199) 


where e 0)9 is an exponential matrix whose Taylor series expansion is as follows: 


6)6 


n2 n3 

I + 0 CO -f — ft) 2 + — ft) 3 + 


2 ! 


3! 


( 200 ) 


Notice that do G so( 3), so 


CO 


2 __ _T r -.3 


COCO 


1,(0 




CO. 


( 201 ) 


Equation 199 can be rewritten by substituting Eq. 200 into it, 


(o0 


8 3 8 5 

1 + 8 - — + 


3! 


5! 


(O + 


6 


0 


2! 4! 


+ 


6 


6 


/\ 


6 ! 


co 


( 202 ) 


It can be further simplified as 


6)6 


I + cb sin 6 + co 2 ( 1 


cos#). 


(203) 


If the module of co is not 1 , Eq. 202 should be modified as 






0)6 


/ + 


CO 


CO 


sin (| \co\ \6) + 


CO 


CO 


(1 


COS (||fi)||0)), 


(204) 


where e 0)6 can be verified as a rotation matrix indeed belongs to S<9(3) using 
Eqs. 202 and 203. 
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For a rotation matrix R as in Eqs. 167 and 168, the corresponding co and 6 can be 

o 

deduced. The components of the vector Oco E 91 are called the exponential 
coordinates (Waldron and Schmiedeler 2001). 


Exponential Coordinates for Rigid Displacement 

A general rigid motion can be represented as the relative change between two 

Q 

coordinate frames using a vector p E pointing from the origin of the frame { O } 
to the origin of the frame {N} and a rotation matrix R which is an element of 50(3). 

Q 

Then, the pair (/?, R) can fully define a rigid motion. The product space of 93 and 
50(3) is defined as 5£{3} implying special Euclidean: 

SE( 3) = {0,R) :/>efK 3 ,R€SO(3)} = fH 3 x 50(3). (205) 

Consider a rotation whose axis does not pass through origin, the rotation axis is 

o 

represented by a unit vector c o parallel to it, and q E 93 representing a point to the 
axis. If the body rotates about the axis with unit angular velocity, the speed of point 
r on the rigid body can be expressed as 

r(f) = co x ( r(t ) — q). (206) 


Now, a four-by-four matrix | is introduced 





(207) 


in which v = q x co, so Eq. 205 can be rewritten in a homogeneous form 


r 


co 

q x co 


r 


co 

V 


r 

0 


0 

0 


1 


0 

0 

1 


(208) 



r = |r. 

Solving the differential equation above yields 


(209) 


r(t) 


e Sf r( 0), 


( 210 ) 


A 

where r(0) is the initial position of the point r and e^ is the exponential matrix. The 
Taylor series expansion of e^ is 



t 1 ~2 


t 3 -3 


, + S ' + 2X + 3! { + 


( 211 ) 
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On the other hand, for a pure translation with the direction represented by unit 
vector v, the speed of point r on the rigid body can be expressed as 

r(t) = v. (212) 

Solving this equation yields 

r(t) = A-(O), (213) 


where 



(214) 


The four-order matrix f can be considered as the generalization of the three-order 
skew-symmetric matrix co. Furthermore, se(3) is defined by 

se( 3) = {(cu, v); v G 93 3 , co G s<9(3)}. 

An element of se(3) is called a twist representing the instantaneous velocity of 
rigid motion. In homogeneous coordinates, a twist f G se(3) can be expressed as 








CO V 
0 0 


g93 


4x4 


(215) 




Operator V is used to map the four-by-four matrix £ to six-dimensional vector 


that is, 



— 

V 

— — 

CO 

V 


CO 

0 

0 


V 




The map from the elements of se(3) to the elements of SE(3) is called the 
exponential map, which has the following two important properties: 


(1) For any | G se( 3) and 6(6 G 93), e 6 ^ is the element of SE( 3), that is, e 6 ^ G SE( 3). 

(2) For any given g = (p,R) G SE( 3), there exist | G se(3) and #(# G 93) satisfying 
that g = 


The proof of property (1) is direct calculation. If co = 0, 



I V0 
0 1 


and if co ^ 0, then 






06 ) 


CD X v) + 6(O l (OV 



(217) 


( 218 ) 
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/\ 

For each of the two cases, e 6 ^ belongs to SE{ 3) as desired. 
The proof of property (2) is by construing. If R = I, let 


Then, 


ifR^I, 



A 

e 0 ^ 

'e 0 * 

(I — e* 9 ®) {to x v) + 0(o J (ov 


R 

P 

C 

9 

\ / / 

1 


9 

1 


(219) 


( 220 ) 


( 221 ) 


6 and a> can be solved from R. And let 


A = (I — e 6 ^)® + Ota 1 to. (222) 

Then, Av = p. As A is always invertible (Murray et al. 1994) while R ^ I, 
so v = A J p. Since each rigid transformation corresponds to an element g 
from SE{ 3) and given the properties of the exponential map, there always be a 

twist 9^ G se( 3) that can generate this transformation. The vector 9^ G is called 
the exponential coordinates for the rigid transformation of g. 


Exponential Coordinates for Screw Motion 


Consider the screw motion shown in Fig. 18. The position vector of a point q before 
and after a screw motion using exponential map can also be represented as 


r(0,h) 


gr(0) 


q + e w0 {r{O)—q) + 8hw, 


(223) 


and in the homogeneous coordinates, 


8 


i 

1 

1 

1 

1 

1 


w 6 


9 


I -e* e )q + h0w 


1 


1 

1 

§ 

5* 

I 

1 

1 

i 


(224) 


Q 

As this map holds for all q G R , then the rigid transformation g is 


8 


wd 


9 


I — e w6 )q + h6w 

1 


(225) 
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As shown before, the exponential of a twist is 




wG 


W 


x v) H- ww T vO 




When v = r x w + hw, twist f = (w, v) generates the screw displacement with 
Pliicker coordinates (w, r x w + /zw). Till now, the relationship between exponential 
coordinates and screw motion is clearly shown. 


Twists and Rigid Motion Velocity 


In the case of pure rotation, frame {N} which attached to a rigid body coincides 
with a fixed ground frame { O } at start, and then the rigid body rotates about the 
origin. For a given time t , the coordinates q Q in the fixed frame of point q attached to 
frame { N }, i.e., the rigid body, can be described as 

q 0 (t)=R(t)q N , R(f)GS0(3), (227) 

while q N is the fixed coordinates representing point q in frame { N }. 
Differentiating the equation yields 




= Mt)q N , 


(228) 


as q N is irrelevant to time. Since using R (t) is quite inconvenient, this equation 
could be rewritten in the form 


v ?o (0 = R (t)q N = R(f)R _1 (0R(0^ = R(f)R -1 (0«o(0- (229) 

The reason for this is that R(t)R _i (t) is a skew-symmetric matrix given by 

T' 

differentiating RR — 1. Furthermore, a skew-symmetric matrix could be 
represented using a vector. Thus, the instantaneous ground angular velocity is 
defined as 


mo = RR 1 , (230) 

O 

and vector m 0 G corresponds to the skew matrix wo- Similarly, the instanta- 
neous body angular velocity is defined as 

6> n = RR -1 , (231) 

Q 

and vector co N E corresponds to the skew matrix The body angular velocity 
represents the angular velocity as seen from the instantaneous body frame. 
The relationship between the body angular velocity and ground angular velocity 

is rfiyv = 
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Now, the instantaneous velocity of the rigid body in fixed frame can be expressed as 


v ? o (0 = = ®o (t) X q 0 (t). 


and this velocity as viewed in the body frame is given by 



(0 = = 0 J N (t) x q N . 


(232) 

(233) 


As in the general case, a transformation g e SE( 3) could be used to represent the 
rigid transformation. Also for a given time t , the coordinates q Q of point q attached 
to frame {A}, i.e., the rigid body in the fixed frame, can be described as 


?o(0 = g(t) q N ,g(t) 


R 

0 



(234) 


Differentiating the equation yields 


v 


Vo 


(0 


d 

dt 


g(t)<lN = g(t)9N = g(t)g(0 l g{t)<lN = g(t)g(t) l q 0 


(235) 


Similar to the case of rotation, gg 1 and g : g have some particular properties. As 


-i • 



-l 


— • 

R 

°D 1 

Pn 


0 

0 



R 


T 


r to 


Pn 


0 


1 


RR 

0 


T 


TO 


O 


R R" 'p N + p N 

0 


(236) 


Given the fact that RR 1 
velocity Vq Ese( 3), 


is skew-symmetric matrix, gg 1 is defined as ground 



(237) 

(238) 


and then v qo (t) = V 0 q 0 = o) 0 x q 0 + v a . 

The ground velocity consists of two parts: co 0 is the instantaneous angular 
velocity as observed in the ground frame, while v 0 is the velocity of a point on 
the rigid motion passing through the origin of the ground frame at the very moment. 

And also the velocity V# of the rigid body with respect to the moving body frame 
could be defined in a similar way: 




(239) 


( 240 ) 
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The velocity of a point viewed in the instantaneous body frame can be 
expressed as 


v 


Qn 


8 \ 0 


/\ 


VnQn — 60 N X q N + V N . 


(241) 


For body velocity, v N is the velocity of the origin of the body frame { N } with 
respect to the ground frame, but viewed in the instantaneous body frame. And co N is 
the angular velocity of the moving frame also viewed in the instantaneous body 
frame. Please note that body velocity does not mean the velocity of the rigid body 
viewed in body frame (Murray et al. 1994). 

From the definition of body and ground velocity, 


Vo 





(242) 



Q ) 0 


R 

0" 

Vo 


1 

R 



(243) 


The 6x6 matrix associating with a general element g E SE( 3) transforms twists 
from one coordinate frame to another, denoting Ad ? , and is referred as the adjoint 
transformation. That is, for 


R p 

0 1 


(244) 



It could be proved that Ad ? is invertible and 







Please note that any two coordinate frames that are associated by g E SE 
(3) have a corresponding adjoint transformation, i.e., adjoint transformation is not 
confined to a fixed frame and a moving frame. 


Application in Robotic Forward Kinematics 


Generally, a robot manipulator can be viewed as the connection of n rigid links 
through n joints with a tool, i.e., an end effector attached to the end of the last link. 
Most joints used in industrial applications are revolute joints or prismatic joints. 
The task of forward kinematics is to determine the tool’s pose and position given 
the relative configurations of the rigid links. Usually when studying the forward 
kinematics of a robot, two frames, the base frame and tool frame, are attached to the 
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Fig. 19 The coordinate system of SCARA robot for quaternionic kinematics 


Ta ble 2 DH parameters of 
the SCARA robot 
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base and the tool of the robot, respectively. Here, several methods are given to study 
the forward kinematics using the example of a SCARA manipulator as shown in 
Fig. 19, and then the comparisons are given. 


Application of Dual Quaternion in Robot Kinematics 

For a robot manipulator, the transformation between the base frame and the tool 
frame consists of several successive rotations and translations. To show these 
intermediate transformations clearly, DH method is referred to establish the coor- 
dinate system and DH parameters are acquired to represent these transformations. 

The coordinate system of the SCARA robot is assigned as shown in Fig. 19, 
where frame { <9 0 1 is the base frame, and { 0 4 } is the tool frame, and it is assumed to 
be overlapped with { 0 4 } . Then, the DH parameters can be obtained, and they are 
shown in Table 2. 
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For each set of parameters, there is a transformation matrix corresponding to it 



Then in this particular case, 


( 247 ) 
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( 248 ) 


( 249 ) 


Thus the overall transformation from the base frame 


{CM is 


{( 9 0 1 to the tool frame 



Orp lrp 2rp 3 rp 

1 1 1 2 I3 1 4 
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0 c(#i + 62)12 T c 6 \l\ 
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Similar with DH method, the transformation between these frames can be 
obtained by multiplication of the four dual quaternions corresponding to each 
transformation: 


Qi— 1, i 




1 , &i - 1 


°Qtrans (A - 1 > @i— 1 ) °drot (A - — 1 > ®i) ° d tram (• ^i— 1 ’ ) • 


Then the forward kinematic model in dual quaternion space can be calculated for 
the SCARA robot as 


#05 _ #04 — #01°#12°#23°#34- 

Dual quaternionic forms of the transforms from the (/ — \) th coordinate to I th 
coordinate, where / = 1 , 2 , 3 , 4 , can be written as 
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(251) 


#2 . 0 2 , If 0 2 . . 0 2 . 

#12 = cos — + sin —k,q 12 = - ( /i cos — i+Zi sin —7 j. 
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(252) 
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2 


2 


(254) 


Then the dual quaternion <7 relating the tool frame to the base frame can be 


calculated by 












# — #01 °#12 °#23 °# 34 ’ 


and the results are 
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It can be deduced from q' = l / 2 (t°°q) that 
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6 ? 4 )&. 


It shows that the tool frame can be obtained by a translation of t followed by a 
rotation about z axis by 0 X + 0 2 — 0 4 based on the base frame. 


Forward Kinematics with Exponential Coordinates 

Using exponential coordinates to deduce the forward kinematics is quite straight 
forward. Here, a general procedure of deduction for an arbitrary open-chain manip- 
ulator with n joints is introduced: 

( 1 ) Choose a base frame and a tool frame. Define g 0 as the reference configuration 
while all the joints are at the origin position. 
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Fig. 20 The coordinate 
system of SCARA robot 
exponential coordinates 



▲ 


(2) While at the reference configuration, in the base frame, for each joint /, define a 
twist & which can represent the motion of the joint. And for a revolute joint, the 
twist is 



o 

where q x G 93 is an arbitrary point on the axis whose unit vector is denoted as o)\ 
and </i is usually selected for the convenience of expression. For a prismatic joint, 



where Vi is the unit vector in the direction of translation. 

(3) The forward kinematic map from joint space to the configuration of the tool can 
be expressed as 



e ilOl e Z202 e y3... e i»0n 



R(0) p{0) 
0 1 



And below the procedures for a SCARA manipulator are given in detail. 

For the SCARA manipulator, as is shown in Fig. 20, when all the rotation joints 
and the prismatic joint are at their home position, i.e., = 0 and d f = 0, the 

transformation between the tool frame and base frames is 





h + h 

0 

k) 

1 
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For the three revolute joints, 
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and the points are chosen as 
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and then the twists would be 
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As for the prismatic joint 3, the twist can be expressed as 
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Thus, the forward kinematic transformation is 


8 ( 0 ) 
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And the exponentials are given from Eqs. 203, 216 and 217, 
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Forward Kinematics with Screw Theory 

Screw theory is mainly used for analyzing instantaneous motion. However, Tsai 
and Roth (1973) proposed a method to represent the kinematics of manipulators. 
For a screw motion $ = (0w; q x 6w + hQw) = 0(w; So + hw), the corresponding 
transformation while 6 ^ 0 is given by Eqs. 183 and 184, and is rewritten here: 




/ W 2 x Vq + Co 
W x WyV0 + W Z S0 

\ W X W Z V 0 - WyS0 


W X WyW0 - W z S0 

WyV0 + Co 
W x W z Vo + W x S0 


W x W z V0 + WySo \ 

w x w z v 6 - W x S 0 , 

W 2 z V0 +C0 ) 


p = (I— R )q + Oho). 


(271) 


(272) 

(273) 


For pure translation, the expression is similar. In a kinematic chain, screw $ f can 
be used to represent the movement of a joint i connecting two links. Generally in an 
open-chain serial, the transformation between base frame and tool frame can be 
achieved by successive screw displacement. The process of obtaining the forward 
kinematic model of a manipulator can be summarized by the following steps 
(Rocha et al. 2011): 


(1) Choose a fixed frame where the screws will be expressed, and generally it’s the 
base frame. 

(2) Define a reference configuration for the manipulator, from which screw param- 
eters for each joint of the manipulator are determined. 

(3) Determine the homogeneous transformation for each joint. 

(4) Determine the overall homogeneous transformation by pre-multiplication. 

The above procedure is applied to the SCARA manipulator to clarify the 
process. For the four joints, the screw can be defined in the base frame as follows 
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Here the transformation corresponding to $ 2 is 
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(278) 


It is the same with the exponential in the method using twist, and the result is the 
same too. 

In this section, several methods have been given particularly in the application of 
a SCARA manipulator. DH method has the minimal set of scalar parameters. For 
the exponential coordinate method and successive screw method, it’s quite straight- 
forward to deduce the kinematics by only defining two coordinate frames which can 
be chosen for the convenience of expression. Besides, there is a clear geometric 
significance associated with the twist. The proper choice of reference configurations 
results in having the joint variables corresponding to the actual joint displacements, 
which are not common for DH notation. Exponential coordinate method and 
successive screw method bear much resemblance to each other while the former 
one has a solid mathematical background and the later one is more often used in 
velocity analysis (Dai 2006). 


Summary 

This chapter presents a fundamental theory for the rigid body motions. The screw 
theory, Lie group, and Lie algebra have been successfully applied to the modern 
physics and rigid body motion and become a powerful tool for the analysis of 
various mechanisms and robots. A basic overview from the conventional algebraic 
methods to modern geometric methods is presented. An application example of a 
SCARA robot is given in the last section, which instructs the usage of the basic 
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mathematical tools presented in this chapter. Certainly, this content is not a 
comprehensive version of robot kinematics, so interested reader can refer to the 
books listed in the references for obtaining more detailed derivation processes. 
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Abstract 

This article deals with the kinematics of serial manipulators. The serial manip- 
ulators are assumed to be rigid and are modeled using the well-known Denavit- 
Hartenberg parameters. Two well-known problems in serial manipulator kine- 
matics, namely, the direct and inverse problems, are discussed, and several 
examples are presented. The important concept of the workspace of a serial 
manipulator and the approaches to determine the workspace are also discussed. 
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Introduction 

A serial manipulator consists of a fixed base, a series of links connected by joints, 
and a free end carrying the tool or the end-effector. In contrast to parallel 
manipulators, there are no closed loops. By actuating the joints, one can position 
and orient the end-effector in a plane or in three-dimensional (3D) space to 
perform desired tasks with the end-effector. This chapter deals with kinematics 
of serial manipulators where the motion of links are studied without considering 
the external forces and torques which cause these motions. The serial manipulator 
geometries are described using the well-known Denavit-Hartenberg (D-H) 
parameters. Two well-known problems, namely, the direct and inverse kinematics 
problems, are posed, and their solution procedures are discussed in detail and 
illustrated with examples of planar and spatial serial manipulators. It is shown that 
closed-form analytic solutions to the inverse kinematics problem are possible 
only for serial manipulators with special geometries, and the most general 
six-degree-of-freedom serial manipulator requires the solution of at most a 
16th-degree polynomial. The solution of the inverse kinematics problem leads 
to an important and useful concept of the workspace of a serial manipulator, and 
the approaches to obtain the workspace and determine its properties are also 
presented. 

As mentioned above, a serial manipulator consists of joints and links. The 
number of joints and links determine the degree of freedom of a manipulator 
which determines the capabilities of a serial manipulator. We start this chapter 
with a discussion on this important concept of degrees of freedom of a serial 
manipulator. 


Degrees of Freedom of a Manipulator 

The degrees of freedom of a serial manipulator 1 can be obtained from the well- 
known Chebychev-Griibler-Kutzbach criterion 


j 

dof = X(N-J- l) + ^2 F < (!) 

i = 1 

where dof is the computed degree of freedom with N as the total number of links 
including the fixed link (or base), J as the total number of joints connecting two 
consecutive links, F z as the degrees of freedom at the ith joint, and 


1 The Chebychev-Griibler-Kutzbach criterion can be used to find the degree of freedom of arbitrary 
connections of links and joints and is not restricted to serial manipulators. It, however, does not 
work for over-constrained mechanisms. More details can be found in Gogu (2005). 
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The quantity, dof, obtained from Eq. 1 is the number of independent actuators 
that can be present in the serial manipulator. In a broad sense, dof determines the 
capability of the serial manipulator with respect to dimension of the ambient space 
X. We have the following possibilities: 

1 . dof = X - In this case, an end-effector of a manipulator can be positioned and 
oriented arbitrarily in the ambient space of motion. 

2. dof < X - In this case, the arbitrary position and orientation of the end-effector 
are not achievable, and there exist ( X - dof) functional relationships containing 
the position and orientation variables of the end-effector. 

3. dof > X - These are called redundant manipulators, and the end-effector can be 
positioned and oriented in infinite number of ways. 

In serial manipulators with a fixed base, a free end-effector and two links 

connected by a joint, from Eq. 1, N = / + 1 and dof = ^A =1 E7. If all th e actuated 
joints are one-degree-of- freedom joints, then J = dof. 

If / < dof, then one or more of the actuated joints are multi-degree-of-freedom 
joints and this is not used in mechanical serial manipulators. This is due to the fact 
that it is difficult to locate and actuate two or more one-degree-of-freedom joints at 
the same place in a serial manipulator. In biological systems, muscles are used to 
actuate multi-degree-of-freedom joints - in a human arm muscles actuate the three- 
degree-of-freedom shoulder joint. 

In manipulators, the J joint variables form the joint space. The variables 
describing the position and orientation of a link or the end-effector are called the 
task space variables. The dimension of task space is <6 for 3D motions and <3 for 
planar motion. Finally, there are often mechanical linkages, gears, etc. between 
actuators and joints. The space of all actuator variables is called the actuator space. 
If the dimension of the actuator space is more than 3 for planar motion and more 
than 6 for 3D motion, the manipulator is called redundant. If the dimension of the 
actuator space is less than the degree of freedom, then the manipulator is called 
under-actuated. 


Representation of Links Using Denavit-Hartenberg Parameters 
and Transformation Matrix 

As mentioned earlier, the serial manipulator consists of a sequence of links 
connected by joints. In most industrial manipulators, the links are designed to 
minimize deflection and consequent loss of accuracy and repeatability, and, in 
this sense, the links can be assumed to be rigid bodies. It is well known that a 
rigid body in 3D space can be described (with respect to another rigid body or a 
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/\ 



Link i-2 


Fixed Coordinate System 


Fig- 1 Intermediate links and D-H parameters 


reference coordinate system) completely by six independent parameters - three for 
the position vector of a point of interest on the link or the origin of a coordinate 
system attached to the rigid body and three angles for the orientation of the rigid 
body. In 1955, Denavit and Hartenberg (1955) presented a formulation for describ- 
ing links connected by rotary (R) or prismatic (P) joints which required only four 
independent parameters and thus leading to more efficient computations. Unfortu- 
nately, over time, several conventions have emerged with slightly different inter- 
pretations of these four Denavit-Hartenberg or D-H parameters (see, e.g., textbooks 
by Paul (1981), Fu et al. (1987), and Craig (2005)). In this chapter, we follow one of 
the commonly used and modern convention described in Craig (2005) or 
Ghosal (2006). 

Figure 1 shows three rotary (R) joints connecting link i — 2 and link / — 1, link 
i — 1 and link /, and link / and link i + 1, respectively - although R joints are used in 
the development here, analogous definitions of D-H parameters with prismatic 
(P) joints and a sequence containing both R and P joints can be similarly obtained. 
The key elements of the convention used here are: 


• The joint axis i is labeled as Z*. 

• The coordinate system { / } is attached to the link i. 

• The origin O t of { i } lies on the joint axis i. 
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It may be noted that the link i is after the joint i as shown schematically in 

/\ /\ 

Fig. 1. The X, axis is along the mutual perpendicular between the lines along Z, and 

Z /+ i and the intersection of the mutual perpendicular line and the line along joint 

/\ 

axis i determines the origin O z of coordinate system { / } . The Y z axis perpendicular 

/\ /\ 

to both X/ and Z z so as to form a right-handed coordinate system, is not shown for 
clarity. 

The first D-H parameter for link i is the twist angle , a z _ j , defined as the angle 

/\ /\ 

between the lines along Z z _i and Z z and measured about the common perpendicular 

X/_i according to the right-hand rule (see Fig. 1). The twist angle is a signed 
quantity between 0 and ±7i radians. 

The second D-H parameter for link i is the link length , a t _\, defined as the 

/\ /\ /\ 

distance between the lines along Z z _i and Z z along the common perpendicular X z _i 
(see Fig. 1). It is a positive quantity or zero. 

The third D-H parameter is called the link offset , d h defined as the displacement 

/\ ^ ^ 

along Z i from the line parallel to X z _i to the line parallel to X 7 If the joint i is rotary, 
then dj is a constant, and for a prismatic joint d z is the joint variable (see Fig. 1). The 
quantity d z can be positive, negative, or zero. 

The fourth D-H parameter is the link rotation angle, 0 Z , defined to be the angle 

between X/_i and X, measured about Z, according to the right-hand rule. If the joint 
i is prismatic, then is constant, and if joint i is rotary, then 0 Z is the joint variable 
(see Fig. 1). The rotation angle 6 t is a signed quantity between 0 and ±7t radians. 

If the axes of two consecutive joints i — 1 and i are parallel, then there exists 
infinitely many common perpendiculars. In this case, the twist angle a t _i is 0 or n, 
and the link length a t _\ is the distance along any of the common perpendiculars 
since all are equal. If the joints i - 1 and i are parallel and the joint i is rotary, then d t 
is taken as zero. If the joint i is prismatic, then is taken as zero. It may be noted 
that if the two consecutive joints are prismatic and parallel, then the two joint 
variables are not independent. 

If two consecutive joints are intersecting, then there are two choices for the 

/N 

direction of the common perpendicular X z _i either along or opposite to the normal 

/\ /\ 

to the plane defined by the intersecting Z/_i and Z z . The link length a t _ x is 
clearly zero. 

The above assignment of origins and coordinate axes fails for the first and the 

last link. For the first link, the choice of Zo and thereby Xi is arbitrary, and for the 

last link, Z n+) does not exist. As a consequence, a t _\ and a f _i for first link and J z and 
6j for the last link are not defined. In order to overcome this problem, the following 
is used: 


" In serial manipulators, the notion of after is clear and natural - the fixed link is denoted by { 0 } , 
and link 1, denoted {1}, is after the joint 1 connecting {0} and {1}. In parallel and hybrid 
manipulators, with one or more loops, one needs to be careful since the loop can be traversed in 
more than one way. 
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Fig. 2 Parallel jaw gripper 
with {Tool} frame 


Manipulator 

hand 



Parallel jaw gripper 


Tool 


• For the first link, if the first joint is rotary (R), then { 0 } and { 1 } are chosen to be 
coincident with a t _\ = a t _\ = 0. This also implies d\ = 0 if first joint is rotary and 
the only nonzero variable is 0\. 

• For the first link, if the first joint is prismatic (P), then the coordinate axes of { 0 } 
and { 1 } are chosen to be parallel and a t _i = a t _i =6 X = 0. For the first link with a 
prismatic joint, the only nonzero parameter is d\. 

• For the last ( n ) link, if the joint n is rotary (R), then the origins of { n } and { n + 1 } 
are chosen to be coincident and d n = 0. The angle 6 n is taken to be zero when the 


axis X n -i aligns with axis X n . The joint variable in this case is 0 n . 


If the last joint is prismatic, X n is chosen so that 0 n = 0, and the origin O n is 

/\ /\ 

chosen at the intersection of X n _\ and Z w when d n = 0. The joint variable in this 
case is d n . 


In the above described convention, two of the four parameters of the link /, a t _\ 
and a t _\ , have subscripts / — 1, and two of them, d t and 6>i, have subscript i. Another 
consequence of the convention is that the link length a n and the twist angle, a n , need 
not be defined. The link n is the end-effector or the tool of the manipulator and, to 
represent the tool or the end-effector, a separate coordinate system {Tool} on the 
tool is used. Usually, this end-effector or tool coordinate system has the same 
orientation as { n } , and its origin is at some point of interest in { Tool } . For example, 
in the case of a parallel jaw gripper schematically shown in Fig. 2, the origin of 
{Tool} is at the midpoint of the jaws. 

Once the links of a serial robot are represented with the Denavit-Hartenberg 
parameters, the position and orientation of link i can be obtained with respect 
to link / — 1. The position vector to the origin of link /, O h from the origin of 
link i — 1, Oj_ i, is given by 
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i_ 1 0/ = + di‘-% (2) 

• ^ A rp . ^ A J 1 

where is (1,0,0) and ,_i Z 7 is given by (0, — s a/1 , c a/1 ) . It may be noted 

that throughout this article, sine and cosine of an angle 6 are denoted by s 0 , c e , 
respectively. 

• -| 

The rotation matrix, [/?], describing the orientation of link i with respect to 

link i — 1, is given by the product of two rotation matrices, namely rotation of angle 

/\ /\ 

a/_x about X axis and rotation of angle 0 Z about Z axis 





In terms of the commonly used 4x4 homogeneous transformations, the link 
i with respect to i — 1 is given by 




It can be seen that ! _1 [T] is a function of only one joint variable — a function of Of 
if joint i is rotary and a function of d z if joint i is prismatic. 

To obtain the transformation matrix of a link i with respect to any other 
link, product of transformation matrices can be used. For example, the link 
i can be described with respect to the fixed base or reference coordinate system 
{0} as 








7-1 


Since each of z [7] is a function of only one joint variable, the right-hand side of 
Eq. 5 will be a function of i joint variables with the other 3*/ Denavit-Hartenberg 
parameters being constant. 


Examples of D-H Parameters and Link Transformation Matrices 

To illustrate the concept of Denavit-Hartenberg parameters and link transformation 
matrices, three examples are presented in this section. 

The Planar 3R Manipulator 

As a first example, the D-H parameters and link transformation matrices of a simple 
planar three-link manipulator with three rotary (R) joints, shown in Fig. 3, are 
obtained. 
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Fig. 3 The planar 3R 
manipulator 


{Tool} 


X Tool , X 3 


Y 1 



° 0’°1 


Table 1 The D-H 

parameters of a planar 3R 
manipulator 


• 

l 

^7—1 

01-1 

dj 

0, 

1 

0 

0 

0 

0! 

2 

0 

h 

0 

82 

3 

0 

h 

0 

8 3 


In this example, all the rotary joint axes are parallel and are pointing out of the 
paper. The Denavit-Hartenberg parameters are obtained as follows. 

/\ 

The fixed or reference coordinate system, {0}, is chosen with its Zo coming out 

of the paper and Xo and Yo pointing to the right and top, respectively. For the first 

/\ ^ ^ 

coordinate system, the origin 0\ and Zi are coincident with O 0 and Zo, and Xi and 

/\ /\ /\ /\ 

Yi are coincident with Xo and Yo when 0 1 is zero. The Xi is along the mutual 

/\ /\ /\ 

perpendicular between Zi and Z 2 . Similarly, X 2 is along the mutual perpendicular 

/\ /\ /\ 

between Z 2 and Z 3 For the last frame, X 3 is aligned to X 2 when 0 3 = 0. The origin 

0 2 is located at the intersection of the mutual perpendicular along X 2 and Z 2 The 
origin 0 3 is chosen such that d 3 is zero. The origins and the axes of { 1 }, {2}, and 
{ 3 } are as shown in Fig. 3. 

From the assigned origins and axes, the Denavit-Hartenberg parameters can be 
obtained by inspection. They are presented in Table 1. 

In Table 1, l\ and l 2 are the link lengths as shown in Fig. 3. It may be noted that 
the length of the end-effector does not appear in the table. To describe the 
end-effector, we attach a tool frame, {Tool}, aligned to {3} at the midpoint of the 
parallel jaw gripper. In Fig. 3, the origin of { Tool } is shown at a distance of / 3 from 

0 3 along X 3 . 
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From the D-H table, using Eq. 4, the link transformation matrices can be 
obtained by substitution. For i = 1, a^i = 0, a t _i = 0 and d t = 0. Denoting sin 
6 1 and cos 6i by Si and c 1? respectively, one can get 


0 

1 



C1 

~S\ 

0 

°\ 

S 1 

C\ 

0 

0 

0 

0 

1 

0 

\o 

0 

0 

1 / 


Similarly, for i = 2 (, a t _i = / ls a z _! = 0 and d t = 0) and for i = 3 = / 2 , 


a,- 1 — 0 and d f — 

0), 

( C 2 

-S2 

0 

A 

( C 3 

S 3 

0 

h\ 

2 cn = 

^2 

C2 

0 

0 

. 3 [T} = 

A 

C 3 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 



0 

0 

1 7 

A 

0 

0 

1 / 


Q 

To find the transformation matrix Too/ [7], the orientation of { Tool } is assumed to 

be the same as the orientation of {3}, and the origin is at a distance / 3 along X 3 . 
Hence, 




0 

0 

h\ 

0 

1 

0 

0 

0 

0 

1 

0 

\0 

0 

0 

1 / 


To find the transformation matrix §[7], multiply 0 i[T]\[T%[T\ resulting in 



^ 123 0 hc\-\-hcn\ 

C123 0 l\S\ + hsyi 

0 1 0 

0 0 1 / 


Finally, to obtain too?[T], multiply °[T] To J[T] and get 



— ^123 0 liC\ + he 12 + I3C123 \ 

c 123 0 h$l + ^2^12 + /3S123 

0 1 0 

0 0 1 ) 




The PUMA 560 Manipulator 

The PUMA 560 is a six-degree-of-freedom manipulator with all rotary (R) joints. 
A schematic drawing of the manipulator is shown in Fig. 4 with the assigned 
coordinate systems to the links of the manipulator. 
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The PUMA 560 manipulator Puma 560 - forearm and wrist 


Fig. 4 Schematic of a PUMA 560 manipulator 


Table 2 The D-H 

parameters of a PUMA 
560 manipulator 


• 

l 

&i— 1 

a-i-\ 

dj 

0, 

1 

0 

0 

0 

<h 

2 

—nt 2 

0 

0 

0 2 

3 

0 

a 2 

d 3 

0 3 

4 

—nt 2 

a 3 

d 4 

04 

5 

n!2 

0 

0 

05 

6 

—nt 2 

0 

0 

06 


The coordinate systems { 0 } , { 1 } , and { 2 } have the same origin. In many 
industrial manipulators, the last three joint axes intersect at a point called the 
“wrist,” and the PUMA 560 is one such example. The origins of the coordinate 
systems {4}, {5}, and {6} are located at this wrist point. Once the origins and the 
coordinate systems are assigned, the Denavit-Hartenberg parameters can be 
obtained by inspecting Fig. 4, and they are presented in Table 2. 

The link transformation matrices relating successive coordinate systems can be 
obtained by using Eq. 4. 



( ^ 

-si 

0 

°\ 

( 

Cl 

-si 

0 

°\ 

S 1 

Cl 

0 

0 

. 2 [71 = 


0 

0 

1 

0 

0 

0 

1 

0 


-S2 

-ci 

0 

0 

\o 

0 

0 

1/ 

K 

0 

0 

0 

1/ 

c 3 

-S3 

0 

a 2\ , 

( 

c 4 

-S 4 

0 

a 3 \ 

^3 

C3 

0 

0 

4^ U) 
1 1 

■ 1 


0 

0 

1 

d 4 

0 

0 

1 

J 3 


-s 4 

-c 4 

0 

0 

\0 

0 

0 

1/ 

K 

0 

0 

0 

1 / 
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C 5 

-S 5 

0 

0^ 


( 

C6 

-S6 

0 

°\ 

0 

0 

-1 

0 

. 5 6[T} = 


0 

0 

1 

0 

S 5 

C5 

0 

0 


—^6 

~C6 

0 

0 

\0 

0 

0 

ly 


K 

0 

0 

0 

1 J 


The transformation matrix $[ 7 ] can be obtained by multiplying all the above 
transformation matrices. The matrix 3 [ 7 ] is obtained by multiplying 1 [ 7 ] \ [ 7 ] 3 [ 7 ] 
and is given by 



( C1C23 

-C1S23 

-s\ 

CI2C1C2 - d^Si \ 

S1C23 

“^1^23 

C\ 

a 2 s\C2 +di,ci 

^23 

C23 

0 

- a 2 s 2 

V 0 

0 

0 

1 / 


The matrix $[ 7 ] is obtained by multiplying 4 [ 7 ] 5 [ 7 ] ^[ 7 ] 




C4C5C6 — 

S4S6 

C/\.C$S() ^4^*6 

-C 4 ^ 5 

a 3 \ 


^ 50 ) 


—S5S6 

C 5 

J4 


—S4C5C6 - 

- C4S6 


^ 4^5 

0 


0 


0 

0 

1/ 




The transformation matrix 5 [ 7 ] can be obtained by multiplying §[ 7 ] and 5 [ 7 ] 
given in Eqs. 8 and 9 . 

A SCARA Manipulator 

A manipulator with a SCARA configuration is very popular for robotic assembly 
due to its compliance and rigidity in desired directions. A SCARA manipulator has 
four degrees of freedom with three rotary (R) joints, and the third joint is prismatic 
(P). Figure 5 shows a schematic drawing of a SCARA manipulator and the assigned 
coordinate systems. 

As shown in Fig. 5 , the coordinate systems { 0 } and { 1 } have the same origin, 

and the origins of { 3 } and { 4 } are chosen at the base of the parallel jaw gripper. The 

/\ /\ 

directions of Z3 have been chosen pointing upward. It may be noted that the Z3 can 
be chosen in the opposite direction and 0 3 can be chosen at some other point - the 
kinematic properties of the SCARA manipulator will not change but d 3 values will 
be different. In an actual SCARA manipulator, the translation at the third joint may 
be realized by means of a (rotary) motor and a ball screw. However, in this example, 
the third joint is assumed to be a prismatic (P) joint. 

The Denavit-Hartenberg parameters can now be obtained by inspecting Fig. 6, 
and they are presented in Table 3 . 

The link transformation matrices relating successive links or coordinate systems 
can be obtained by using Eq. 4 and are given below. 
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Fig. 5 ASCARA 
manipulator 







Fig. 6 The [Base] and 
{Tool} frames 



{Base} 

1 




The transformation matrix 4 [7] is obtained as 
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Table 3 The D-H 

parameters of a SCARA 
manipulator 


• 

l 

^7 — 1 

m-i 

d t 

e, 

1 

0 

0 

0 

<h 

2 

0 

a\ 

0 

62 

3 

0 

a 2 

— c/3 

0 

4 

0 

0 

0 

04 


0 

4 



= ?cn 1 m i m i in 

f Cl24 ^124 0 

_ S 124 C124 0 

“001 
\ 0 0 0 


d\C\ + <22^12 

<^1^1 + ^2^12 
— J 3 

1 




In the next section, one of the main problems in serial manipulator kinematics is 
posed and discussed in detail. 


Direct Kinematics of Serial Manipulators 

The direct kinematics problem of a serial manipulator can be stated as follows: 
given the link parameters and the joint variable, a t ^ u a t _ 1? d h and 0 h find the 
position and orientation of the last link in the fixed or reference coordinate system. 

The direct kinematics is the simplest possible problem in manipulator kinemat- 
ics, and it follows directly from the notion of the link transformation matrix of 
section “Representation of Links Using Denavit-Hartenberg Parameters and 
Transformation Matrix”. If the fixed coordinate system is {0} and the coordinate 
system of the end-effector is { n } , one can write 

nin=°dn 1 2in--- n n~ 1 in cm 

In the above matrix equation, the right-hand side contains 1 , a z _i, d z , Of with 
i = 1, . . ., n. In the direct kinematics problem, all these are known, and hence all the 
4x4 matrices on the right-hand side are known. The direct kinematics problem for 
serial manipulators can thus be solved by simple matrix multiplication and extrac- 
tion of the rotation matrix, °[R] and the vector to the origin, 0 O„, of {n}. 

Instead of orientation and position of °[7], it is often of interest to obtain n ase [T\ 
(see Fig. 6). To obtain B n ase \T\ pre-multiply ^[7] by a known o ase \T\. Likewise, to 
obtain 7) 00l [T] post-multiply ^[7] by a known Tooi\T\- I n general, 

B T z\n = i*nnlin'Too,in a 2 ) 

One of the advantages of the D-H convention used here is that manipulator 
kinematics, represented by ^[7] is independent of the choice of the {Base} or the 
{ Tool }, and very little change is required if the serial robot changes its end-effector 
or is moved to some other location on the factory floor. This feature is not available 
in several of the other conventions in use for defining D-H parameters. 
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The direct kinematics problems for three serial manipulators are solved in the 
following examples. 


Example: The Planar 3R Manipulator 

In the case of the planar 3R manipulator (see Fig. 3), the orientation of the tool or 
the gripper can be described by an angle </>. From Eq. 7, one can directly write the 
position coordinates, x, y, and orientation of the tool as 

X = l\C\ + he 12 + hc\23 

y = l\S\ -\- hs \2 + h s \ 2 ?> (13) 

<fi = 0\ + O 2 + 03 

where, c 12 , s i2 , etc. represent cos($! + 0 2 ), sin( 0 x + 0 2 ), etc., respectively. 


Example: The PUMA 560 Manipulator 

The transformation matrix 5 [7] for the PUMA manipulator is obtained by multi- 

O ^ 

plying 3 [7] and 6 [J] given in Eqs. 8 and 9. Denoting the elements of the rotation 
matrix ®[R] by r ih i,j = 1, 2, 3, and the components of the origin of the coordinate 

0 T 

system { 6 }, 0 6 , by (0 6x , 0 6y , 0 6z ) , one can write 

''ll = Ci{C23(c 4 C 5 C 6 - S 4 S 6 ) - S23S5C6} + Si (s 4 C 5 C 6 + C 4 S 6 ) 

''21 = Sl{c 2 3(c 4 C 5 C 6 - S 4 ,S 6 ) - S23S5C6} + C\ (S4C5C6 + C 4 S 6 ) 

''31 = -523(C4C 5 C 6 - S 4 S 6 ) - C 23 S 5 C 6 

r 12 = C 1 {c 2 3 ( C4C5S6 - S 4 C 6 ) + S23S5S6} + Si (-S4C5S6 + C4C6 ) 

''22 = Si {c 23 ( C 4 C 5 ^6 - S 4 C 6 ) + S23S5S6} - Ci (~S 4 C 5 S 6 + C 4 C 6 ) 

r 32 = — .V23(£' 4 C5.Vf, — S 4 Ce) + C23S5S6 
''13 = -Cl(c 2 3C 4 S5 + S23C5) - S1S4S5 

r 2 3 = -Sl(c 2 3C 4 S5 + S23C5) + C1S4S5 

33 = S23C4S5 - C23C5 

0 6x = Ci(a 2 c 2 + CI3C23 - d 4 s 2 3 ) - CI3S1 

0 6y = si(a 2 c 2 + CI 3 C 23 - d 4 S 2 i) + d 3 ci 

Of, z = —a 2 S2 — CI3S23 — d 4 C22 


Example: A SCARA Manipulator 


For the SCARA manipulator, the matrix 4 \T\ is given in Eq. 10. The orientation of {4} 
can be described by the angle <p, and the position (x, y, z) of origin of { 4 } is given by 


x = aici +a 2 ci2 
y = a\S\ + a 2 si 2 
z = -d 3 

d> = di -\- 9 2 -\~ 9 4 



It may be noted that the direct kinematics problem for a serial manipulator is 
well defined, easily solvable, and has a unique solution for any number of links. 
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Inverse Kinematics of Serial Manipulators 

The inverse kinematics problem for serial manipulators can be stated as follows: 
given the constant link parameters and the position and orientation of { n } with 
respect to the fixed frame { 0 }, find the joint variables. For the planar 3R manipulator 
example, the inverse kinematics problem would be to obtain 0 h i = 1, 2, 3, given the 
position of the end-effector x, y, and its orientation <p. In the case of the spatial PUMA 
manipulator, the transformation matrix ° [7] is given, and the goal is to find the joint 
angles 6 h i = 1, . . ., 6. In general, six task space variables are given for 3D motion 
(three task space variables for planar motion), and the goal is to find the n joint 
variables which make up n [T] Depending on n , the following cases are possible: 

Case 1: n = 6 for motion in 3D or n = 3 for a planar motion. In this case, there exists 
required number of equations to solve for the unknowns. 

Case 2: n < 6 for motion in 3D or n < 3 for a planar motion. In this case, the number 
of task space variables is more than the number of equations, and hence, for 
solutions to exist, there must be 6 - n (3 - n for planar case) relationships 
involving the task space variables. 

Case 3: n > 6 for motion in 3D or n > 3 for a planar motion. In this case, there are 
more unknowns than equations, and hence there exists infinite number of 
solutions. These are called redundant manipulators. 


Inverse Kinematics for Case 1 

It can be seen from the direct kinematics equations of the planar 3R or the spatial 
PUMA manipulator (see Eqs. 13 and 14) that the position and orientation of the 
end-effector are related to the joint variables by means of nonlinear transcendental 
equations. In order to solve the inverse kinematics problem, one needs to solve these 
nonlinear equations. Ideally, the goal is to obtain closed-form and analytical expres- 
sions for the joint variables in terms of the given position and orientation of the last 
link. Like most sets of nonlinear equations, no known general methods for solving the 
inverse kinematics problem for an arbitrary serial manipulator were available till the 
work of Raghavan and Roth (1993). Before looking at this general method, to obtain 
insight, it is useful to solve the inverse kinematics for a few simple serial manipulators. 

Example: The Planar 3R Manipulator 

For the planar 3R manipulator shown in Fig. 3, the direct kinematics equations are 
given in Eq. 13. To solve for 6 h i = 1,2,3, given x, y, </>, one can proceed as follows: 
Define X = x- l^c^ and Y = y - l 3 s $ and thus X and Y are known. Squaring and 
adding, one can get 

A 2 + T 2 = / 2 + l\ + 21 \ I 2 C 2 


and one can get 
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0 2 


=b COS 


-1 


X 2 + Y 


l 


1 


/ 


2/i/ 


(16) 


1*2 


Once 0 2 is known, 6 x can be found using the four quadrant arc tangent formula as 


6 


1 


Atan2(7,X) — Atan2 (k 2 ,k 


1 


(17) 


where k 2 = l 2 s 2 and k\=l\ + l 2 c 2 . Finally, 63 can be obtained from 


0 3 =4>-0i -0 2 (18) 

The workspace of the planar 3R manipulator is defined as the set of values of 
{x, y, <fi) for which the inverse kinematics solution exists. The workspace for the 
planar 3R manipulator can be obtained by examining Eq. 16. It is known that 



which implies that 


(h - hf < (x 2 + Y 2 ) < (/1 + l 2 ) 2 (19) 

where X = x — / 3 c^ and Y = y — / 3 ^. Fig. 7 shows a 3D plot of the region in {x, y, (j ) } 
space where the inequalities in Eq. 19 are satisfied and the inverse kinematics 

solution exists. A projection of the workspace on the Xo — To plane is shown in 
Fig. 7 for l x > l 2 > l 3 . We have four circles of radius l\ + l 2 + / 3 , h + h~ h-> h~h + h, 
and ly- 1 2 - l 3 . 

It can be seen from Fig. 7 that the maximum reach of the planar 3R manipulator 
is points on the circle of radius h + h + h and the closest it can reach from the origin 
is points on the circle of radius l\- 1 2 - / 3 . This annular region is called the reachable 
workspace. Points in between the other two circles of radii + 1 2 — I 3 and l\ — l 2 + l 3 
can be reached with any <p, and this region was named as dexterous workspace by 
Kumar and Waldron (1980, 1981). It can be seen that as / 3 increases, the reachable 
workspace increases whereas the dexterous workspace decreases. 

Given any point in the workspace, the inverse kinematics procedure gives two 
sets of values of 0 l9 0 2 , and 63. This is shown schematically in Fig. 7 - the given 
X and Y can be reached by the two configurations. As with any nonlinear equation, 
the solutions are nonunique, and one could have several solutions. In the planar 3R 
manipulator, one can have two sets of solutions. 

Example: The PUMA 560 Manipulator 

For the six-degree-of-freedom PUMA 560 manipulator shown in Fig. 4, the direct 
kinematics equations are given in Eq. 14. In the PUMA 560 manipulator, the last 
three axes intersect at a “wrist” point. Due to this geometry, the position of the 
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Fig. 7 Workspace of a planar 
3R robot 


(/) 

X 

05 


CL 


3D workspace of 3R manipulator 




origin of the last link ({6}) is only a function of joint variables 0 1 , 0 2 , and 0 2 . 

r ~r 

Denoting the coordinates of the wrist point by (x, y, z ) , one can write 

x = ci(a 2 c 2 + a 3 c 23 - d 4 s 23 ) - d 3 s { 

y = si ( a 2 c 2 + a 3 c 23 - d 4 s 23 ) + d 3 c x (20) 

z = —a 2 s 2 — a 3 s 23 — d 4 c 23 

From the first two equations, one can write 

—s\x + c x y = d 3 

The above transcendental equation can be solved for 6 X by making the well- 
known tangent half-angle substitution 
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X\ 


tan 


# 


1 


2 


cos# 


1 


X 


1 


1 


1 +*?’ 


sin# 


l 


2xi 

1+JC? 


which converts the transcendental equation to the quadratic 



x\ {d 3 + y) + 2 xxi + (J 3 - y) = 0 

The above quadratic can be solved in closed form for x 1? and one can obtain 


#i =2 tan 1 




It may be noted that tan -1 gives an angle between 0 and n and hence 0 < #! < 
2n. However, due to the =b sign before the square root, there are two possible values 
of #! . 

To obtain # 3 , observe that 

rs r\ r\ r\ r\ r\ Q 

x + y + z = d 3 + a 2 + a 3 + d 4 + 2 a 2 a^c^ — 2 a 2 d/^S 2 

which involves only sine and cosine of # 3 . Again using tangent half-angle sub- 
stitutions one can get 


/— #4 ± \/i di + al — K 2 \ 

#3=2 tan" 1 (23) 

\ K + a 3 J 

0 0 0 0 O o 0 

where the constant K is given as (1/2 a 2 )(x + y + z — d 3 — a 2 — a 3 — d 4 ) Again, 

tan -1 gives an angle between 0 and n and hence 0 < # 3 < In. Due to the =b sign, one 
can get two possible values of # 3 . 

Finally, to obtain an expression for # 2 , observe that the Z component of the wrist 
point is only a function of # 2 and # 3 . The third Eq. in 20 can be written as 

-s 2 {a 2 + a 3 c 3 - d A s 3 ) + c 2 {-a 3 s 3 - d A c 3 ) = z 

and one can obtain 


# 2 = 2 tan 1 




Again 0 < # 2 < 2n and there exists two possible values of # 2 due to the ± 
sign. Since # 3 appears on the right-hand side of Eq. 24 in c 3 and 53 
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and one can have two possible values of # 3 , there exists four possible 
values of # 2 . 

Q 

The last three joint angles # 4 , # 5 , and # 6 can be obtained from 6 [/?] given by 



/ C 4 C 5 C 6 - S4S6 

-C 4 C 5 S 6 

- ^ 4 C 6 

“C4^5 \ 

R 

- ^5^6 

—S 5 S 6 

C5 


\ ^ 4^5 Qs ^4^6 

^4^5^6 

C4C6 

^4^5 / 


One can also write 





and since # 1? # 2 , and # 3 are now known, the right-hand side is known. Let the right- 
hand side matrix elements be denoted by r ij9 ij = 1, 2, 3. The following algorithm 
can be used to obtain # 4 , # 5 , and # 6 . 


Algorithm r t j => # 4 , # 5 , and 0 6 If r 23 ^ ±1, then 

0 5 = Atan 2 ^±^/ ~+rf),r 2 3 ) 

0 4 = Atan2 (r 33 /s 5 , - r 13 /s 5 ), 

06 = Atan2 (-r 22 /s 5 , r 2 i/s 5 ) 

Else 

If r 23 = 1, then 

# 4 = 0 

# 5 =0 

# 6 = Atan2(— r 12 , r n ), 

If r 23 = - 1 , 

# 4 = 0 

0 5 = 7T 

#6 = — Atan2(r 12 , -rn) 

The inverse kinematics of a PUMA 560 gives rise to two values of # l5 two values 
of # 3 , and four values of # 2 , i.e., there exists 4 sets of values of (# l5 # 2 , # 3 ) for a given 
wrist location. The above algorithm gives two sets of values for (# 4 , # 5 , # 6 ) for each 
set of (#!, # 2 , # 3 ). Hence, a PUMA 560 manipulator can have at most eight possible 
configurations. 

The workspace of a PUMA 560 manipulator is the set of values of the position and 
orientation of the last link, n , for which the inverse kinematics solution exists. It is not 
possible to imagine or describe the workspace since it is a six-dimensional entity. One 
can, however, obtain the reachable workspace of the wrist point. The position vector 
of the wrist point is given in Eq. 20, and the set of all (x, y, z ) satisfying Eq. 20 is a 
solid in 3D space. The bounding surface(s) of the solid can be obtained as follows: 

Squaring and adding the three equations in Eq. 20 and simplifying, one can get 


R 2 = X 2 + y 2 + z 2 = K\ +K 2 c 3 -K 3 s 3 

where K\, K 2 , and K 3 are constants. The envelope of this family of surfaces satisfies 
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which gives 


K2S3 + K 2 c 2 — 0 

^ r\ r\ 

Eliminating 0 3 from these two equations and denoting a 3 + by / , one can get 

x 2 -j-y 2 -h z 2 - ([a 2 + l ) 2 + d^j 

The above shows that the bounding surfaces for the wrist point are spheres. 

The orientation workspace for a PUMA 560 is like a set of three Euler angles 
about two distinct axes. In particular, they are the so-called Z - Y - Z rotations 
except that the second Y rotation is —0 5 . As it is well known, except for two special 
“singular” configurations, the three Euler angles can be obtained for any arbitrary 
rotation matrix. Hence, at every point in the 3D solid, the last link can be oriented 
arbitrarily except at the two “wrist singular” configurations. 


2 , 2,2 
jt + y + z 



1 ) 


2 



0 (25) 


Numerical Example of a PUMA 560 

For the PUMA 560, the numerical values of the constant Denavit-Hartenberg 
parameters and the ranges of the joint variables are given by 


• 

l 

a i_ ] (degrees) 

Oj-\ 1 (m) 

di (m) 

Range of <9, (degrees) 

1 

0 

0 

0 

-160 < 0 X < 160 

2 

-90 

0 

0 

-245 < 0 2 < 45 

3 

0 

0.4318 

0.1245 

-45 < 0 3 < 225 

4 

-90 

0.0203 

0.4318 

-110 < 64 < 170 

5 

90 

0 

0 

-100 <e 5 < 100 

6 

-90 

0 

0 

-266 <e 6 < 266 


For an arbitrarily chosen 0\ = 90, 0 2 = 30, 0 3 = 60, 64 = 135, 0 5 = —60, and 
0 6 = 120, the transformation matrix ° [T] is obtained as 



-0.7891 

-0.4330 

0.4356 

0 


0.0474 

-0.7500 

-0.6597 

0 


0.6124 

-0.5000 

0.6214 

0 


-0.1245 

-0.0579 

-0.2362 

1 



0 


The above 6 [J] can be used as an input to the inverse kinematics algorithm. 
Using the inverse kinematics equations, eight sets of solutions are obtained as 
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Fig. 8 Workspace of the wrist point of the PUMA 560 


• 

l 

di 

02 

03 

04 

05 

06 

1 

139.85 

2.48 

60.00 

-0.80 

65.29 

-122.53 

2 

139.85 

2.48 

60.00 

179.20 

-65.29 

57.47 

3 

90.00 

30.00 

60.00 

-45.00 

60.00 

-60.00 

4 

90.00 

30.00 

60.00 

135.00 

-60.00 

120.00 

5 

139.85 

150.00 

125.38 

-178.64 

147.61 

58.28 

6 

139.85 

150.00 

125.38 

1.36 

-147.61 

-121.72 

7 

90.00 

177.52 

125.38 

-111.60 

138.80 

155.68 

8 

90.00 

177.52 

125.38 

68.40 

-138.80 

-24.32 


As expected, one of the solutions (set 4) matches the chosen values of 
i = 1, . . 6, in the direct kinematics. 

The workspace of the wrist point of the PUMA 560 for the numerical values 
assumed in this example is shown in Fig. 8. It may be noted that the actual workspace 
of the PUMA 560 robot is only a subset due to the presence of joint limits. 
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In the above two examples and in general for Case 1, analytical solution of the 
inverse kinematics problem requires the elimination of joint variables from sets of 
nonlinear transcendental equations to finally arrive at a single equation in one joint 
variable which can then be solved. When three consecutive joint axis intersect (PUMA 
560 is such an example) instead of dealing with nonlinear equations containing all six 
joint variables, the inverse kinematics problem can be decomposed into two problems 
each involving only three equations in three joint variables (Paden 1986). This 
decoupling was first noticed by Pieper (1968) who worked on the well-known 
Stanford Arm. It was later shown that when three consecutive joint axis intersect, 
one needs to solve at most a fourth-order polynomial in the tangent of a joint angle, 
and the manipulator wrist point can reach any position in the 3D workspace in at most 
four possible ways. Since fourth-degree polynomials can be solved in closed form (see 
p. 24 in Kom and Kom (1968)), the inverse kinematics of all six-degree-of-freedom 
serial manipulators with three intersecting axes can be solved in closed form. 

For the PUMA 560, the workspace of the wrist point is bounded by two spheres. 
This is due to the fact that the PUMA has a special geometry. In a general serial 
robot with three joint axes intersecting at a wrist, it can be shown that the 
boundaries of the solid region traced by the wrist point form a torus which is a 
fourth-degree surface (Tsai and Soni 1984). Typically, it is not a complete torus 
since the joints cannot rotate fully, and some points on the torus and inside cannot 
be reached. The effect of joint limits on the workspace has been studied by several 
authors (see, e.g., Rastegar and Deravi 1987, Dwarakanath et al. 1992). 

For serial manipulators where three consecutive joint axis do not intersect, the 
elimination procedure to solve the inverse kinematics problem is much more 
complex. Several researchers worked on this problem - Duffy and Crane (1980) 
first derived a 32nd-order polynomial, and it was finally demonstrated by Raghavan 
and Roth (1993) that the inverse kinematics of an arbitrary 6R serial manipulator 
required the solution of at most a sixteenth-degree polynomial. Alternately, the 
inverse kinematics of any serial manipulator can also be solved numerically , and 
this is illustrated for a serial robot where only the last two joint axes intersect. 

Manipulator with Nonintersecting Wrist 

The three axes intersecting wrist, as in the PUMA 560 manipulator, is a fairly 
complicated design and also difficult to manufacture. In addition, there is always 
some manufacturing tolerance that makes it impossible to have three axes 
intersecting exactly at a point. Consider a robot similar to the PUMA 560 but 
with a nonzero offset d 5 which makes the last three joint axes nonintersecting. For 
such a robot, the last column of °[ 7] can be written as 

x = ci(a 2 c 2 + a 3 c 23 - d 4 s 23 ) - d 3 s { + d 5 (s x c 4 - c { s 4 c 23 ) 
y = s 1 (a 2 c 2 + a 3 c 23 - d 4 s 23 ) - d 3 c { - d 5 (cic 4 + s x s 4 c 23 ) (27) 

z = —a 2 s 2 — a 3 s 23 — d 4 c 23 — d 3 s 4 s 23 

Equations 27 are in four joint variables (0 l5 0 2 , 0 3 , 0 4 ) in terms of known x, y, 
and z. To solve for the unknown joint variable, one additional equation is required 
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containing the same joint variables. The additional equation, in this example, can be 
obtained by examining the rotation matrices. Since the inverse of a rotation matrix 
is the same as its transpose, 





where ®[R] have known elements r,y, /, j = 1,2,3. The above equation can be 
expanded to obtain 


( C4C5C6 ^4^6 C4C5S6 ^4^6 

S 5 C 6 —S 5 S 6 

^4^5^6 ^4^5 S /\. C 3 S ^, C4C5 



/ C1C23 s 1 C23 -523 \ / r n r\2 r u \ 

~C\S 23 ^ 1^23 -C23 U >1 r 22 r 23 

V -Si Cl 0 / \r 3 1 r 32 r 33 / 



From the (1,3) and (3,3) terms of the above matrix equation, for 6*5 / 0, it, one 
can get 


s 4 (r 13 CiC23 + r 2 3SiC 23 - r 33 s 2 3) = c 4 (ri 3 Si - r 23 Ci) (29) 

Equations 27 and 29 can be solved for (0 l5 6> 2 , ^3, ^4) by using any available 
numerical scheme. Using the D-H parameter values for the PUMA 560, d 5 = 
20 mm, the 5 [7] given in Eq. 26 and using the numerical solver f solve in Matlab 
(2012), one can obtain 

0i = 96.11, 0 2 = 169.11, 0 3 = 128.93, 0 4 = -115.51 

Once (0 1? 0 2 , 0 3 ) are obtained, 0 4 , 0 5 , and 0 6 can be solved by considering left- 
and right-hand sides of the matrix Eq. 28. For 0 5 7^ 


0 5 = Atan2(±^/(l,3) 2 + (3,3) 2 , (2. 3)j 
O 4 = Atan2((3,3)/s5, -(3,l)/s 5 ) 

0 6 =Atan2(-(2,2)/s5,(2,l)/s 5 ) 

where the terms (3, 3), (2, 2), etc. are from the right-hand side of Eq. 28 and are 
known once 0 1? 0 2 , and 0 3 are known. For the numerical values chosen above, two 

sets of values, namely, 0 4 = -115.51, 64.49, 0 5 = 141.99, -141.99, and 0 6 = 
146.54, -33.46 are obtained. It can be verified that one value of 0 4 is the same as the 
value obtained using fsolve. It may be noted that 0 5 = 0 or n is a singular 
configuration for the nonintersecting wrist. 

Efficient numerical techniques, such as the Newton-Raphson method using 
available Jacobian matrices of serial manipulators (see Chapter on velocity 
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analysis), can solve the inverse kinematics problem of a serial manipulator extremely 
quickly in very few iterations (typically 2 or 3). The two main issues associated with 
any numerical technique are the choice of the initial guess and effort required to find 
all solutions. In practice, the choice of the initial guess is not serious since the initial 

Q 

guess can be the previous values of the joint variables. The second difficulty is not 
easily overcome even though continuation methods (Tsai and Morgan 1985) can be 
used to obtain all the solutions. Analytical expressions have the advantage of finding 
all possible inverse kinematics solutions and also give insight into the nature of the 
workspace. This has been a motivating reason for finding general algorithms for the 
inverse kinematics of arbitrary serial manipulators. 


Inverse Kinematics of a General 6R Robot 

As mentioned earlier, in the 1990s, Raghavan and Roth (1993) developed a general 
algorithm to solve the inverse kinematics of an arbitrary 6R serial manipulator. The 
manipulator is arbitrary in the sense that none of the fixed Denavit-Hartenberg 
parameters (link lengths, twist angles, or link offsets) have special values, such as 
0, jc/ 2, or 7t, which results in simpler equations and easier elimination of one or more 
joint variables. The salient features of the algorithm are presented next. 

For a general 6R serial manipulator, the direct kinematics equations can be written as 


° 6 m 


? m \ [T] 1 [T] l [T] t [7] l [71 


(30) 


where 1 [ T] is in terms of the four Denavit-Hartenberg parameters and is a function 
of only one joint variable 6 t (or d z ) and the other three Denavit-Hartenberg param- 
eters are constants (see Eq. 4). For the inverse kinematics problem, the left-hand 
side 6 [7] is given, and the six joint variables in each of - _1 [T] / = 1, 2, . . ., 6 are to 
be obtained. 


i 


The first step is to recognize that ' _A [Y] can be written as a product of two 4x4 
matrices, Cr 1 [Ti)stCT l [T\)jt. The first matrix (- _1 |Y])^ is a function of a t _\ and a t _i 
and is constant, and the second matrix (j~ l [T])j t is a function of the joint variables 
(for a rotary joint) or d t (for a prismatic joint). Thus one can write 


r 1 m = (r 1 m),(r l m). 


/I 0 

0 

«,-! \ , 

( c e> 

~ s 0i 

0 

°\ 

0 Qr;_l 

0 S OCj - 1 

— s ai- 1 
C OCi- 1 

0 

0 


S 0, 

0 

CQi 

0 

0 

1 

0 

di 

(31) 

\0 0 

0 

1 J 

V 0 

0 

0 

1/ 


The second step is to rewrite Eq. 30 as 


Q 

For accurate control, a manipulator end-effector is commanded to move by a small amount from 
its present position and orientation. The joint values at the future commanded position and 
orientation are expected to be close to the known current values, and these current values of the 
joint variables serve as a very good initial guess for the numerical procedure. 
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pfflGPiL = (i ) ■ 1 a m ) ■ 1 (? [^ ) _ 1 ^ [rj a [r] 1 02 ) 

It may be seen that the left-hand side is only a function of (0 3 , 0 4 , 0 5 ) and the 
right-hand side is only a function of (0 l9 0 2 , 0 6 ). In addition, it can be found that in 
the six scalar equations obtained by equating the top three elements of columns 
3 and 4 on both sides of Eq. 32 do not contain 0 6 . Denoting the third and fourth 
columns by p and 1 , one can rewrite the six equations as 

T T 

[A] (S 4 S 5 S 4 C 5 C 4 S 5 C 4 C 5 54 C 4 55 C 5 1 ) = [ B}(s l S2S l C2C l S2C l C2S l C l S 2 C2 ) (33) 

where [A] is 6 x 9 matrix whose elements are linear in 53 , c 3 , 1, and [B] is 6 x 
8 matrix of constants. 

The third step is to eliminate four of the five variables, 0 U 0 2 , 03 , # 4 , and 0 5 , in 
Eq. 33. As pointed out by Raghavan and Roth (1993), the minimal set of equations 
is 14 in number. These are the three equations from p, three equations from 1 , one 
scalar equation from the scalar dot product p • p, one scalar equation from the scalar 
dot product p • 1 , three equations from the vector cross product p x 1 , and three 
scalar equations from (p • p)l — (2p • l)p. It is shown that all these 14 equations 
contain the same variables as in Eq. 33 with no new variables. The 14 equations can 
be written as 



T T 

[F](s 4 ^5 S4C5 C 4 S 5 C4C5 54 C 4 S 5 c 5 1) = [ 0 ](sis 2 S1C2 C1S2 C1C2 Si Cl s 2 c 2 ) ( 34 ) 

where [P] is a 14 x 9 matrix whose elements are linear in c 3 , s 3 , 1, and [Q] is a 
14 x 8 matrix of constants. To eliminate four out of the five joint rotations, use any 
eight of the 14 equations in Eq. 34 and solve for the eight variables SiS 2 , S\C 2 , C\S 2 , 
C\C 2 , s u ci, s 2 , c 2 . Note, this is always possible since this involves solving eight 
linear equations in eight unknowns. Once this is done, substitute the eight variables 
in the rest of the six equations and get an equation of the form 

T 

[fl](s 4 S 5 S4C5 c 4 s 5 C4C5 s 4 c 4 s 5 c 5 1 ) =0 ( 35 ) 

where [R] is a 6 x 9 matrix whose elements are linear in 53 and c 3 . 

In the fourth step, the tangent half-angle formulas (see Eq. 21) are introduced for 
s 3 , c 3 , s 4 , c 4 , s 5 , c 5 , and after simplification, obtain 



where [S] is a 6 x 9 matrix and x ( .) = tan(0 ( .)/2). 

In the fifth step, x 4 and x 5 are eliminated using Sylvester’s dialytic method (for a 
treatment of elimination of variables from a set of nonlinear equation, see refer- 
ences Salmon (1964) and Raghavan and Roth (1995)). Six additional equations are 
generated by multiplying equations in Eq. 36 by x 4 . In the process, three additional 

O O O Q 

“linearly” independent variables, namely, x 4 x 5 , x 4 x 5 and x 4 are generated, and a 
system of 12 equations in 12 unknowns is obtained. The equations can be written as 
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Following Sylvester’s dialytic method, the determinant of the coefficient matrix 
is set to zero, and this gives a 16th-degree polynomial in x 3 . The roots of this 
polynomial can be solved numerically and then one can find 0 3 = 2tan _1 (x 3 ). Since 
there can be 16 real solutions to the polynomial, the general 6R serial manipulator 
has 16 possible configurations. 

In the final step from known 0 3 , x 4 and x 5 (and 0 4 , 0 5 ) are obtained from Eq. 37 
using standard tools of linear algebra. Once # 3 , 0 4 , and 0 5 are known, using Eq. 36, 
the right-hand side variables SiS 2 , SiC 2 ,- . s 2 , c 2 are solved from the eight linearly 
independent equations, and thus unique 6 X and 0 2 are obtained. Finally, to obtain 0 6 , 
Eq. 30 is rewritten as 

ePi =^[7r l 5[7ri[n- i ‘[7r i ?[7r i »[ri es) 

Since 6 h i = 1, 2, . . ., 5 are known, the (1, 1) and (2, 1) element gives two 
equations in s 6 and c 6 which gives unique values of 0 6 . 

It may be noted that if the 6R manipulator has special geometry, i.e., some Denavit- 
Hartenberg parameters are 0, ^/2 or /r the 1 6th-degree polynomial in x 3 can be of lower 
order. In addition, if one or more joints are prismatic, then the inverse kinematics problem 
becomes simpler since the prismatic joint variable is not in terms of sines or cosines. The 
1 6th-degree polynomial is not amenable to analysis as in the case of a planar 3R or the 
PUMA 560, and it is not possible to obtain the workspace or the boundaries of a general 
6R serial manipulator. However, if all the roots of the 16th-degree polynomial are 
complex, then ®[T] is not in the workspace of the manipulator, and this reasoning can 
be used to get some idea of the workspace of the general 6R serial manipulator. 

Finally, in industrial applications, a choice needs to be made from the possible 
inverse kinematic solution sets to move the end-effector to the desired position and 
orientation. This is done in two steps - first, all solution sets in which at least one 
joint variable is outside the joint limits are rejected. For example, out of the 
8 possible solutions in the numerical example for the PUMA 560 serial manipula- 
tor, solution sets 2, 5, and 7 are not possible since the value of 0 4 is outside the joint 
limits of 0 4 . Likewise, solution sets 6, 7, and 8 have 0 5 outside its range. In the 
second step, out of the remaining possible inverse kinematic solutions (solutions 
1,3, and 4 for the PUMA 560), the solution set which is closest to the previous joint 
values is chosen. This strategy will not work if the solution branches come close to 
each other or if they intersect. Typically, path planning for a serial manipulator is 
done such that it is away from a singularity where the solution branches intersect. 


Inverse Kinematics for Case 2 

In many robotic applications such as welding or painting, the rotation of the 
welding torch or the paint gun about its own axis is not required. To decrease 
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cost and to simplify the design and manufacture, many welding or painting robots 
have five degrees of freedom with five actuators. Likewise, in the assembly of 
electronic components on a printed circuit board, it is simpler to assemble from one 
direction. Hence, an assembly robot such as the SCARA robot has only four degrees 
of freedom. In these industrial robots n < 6, and the tool or the end-effector cannot 
be positioned and oriented arbitrarily in 3D space. There exists 6 - n (3 - n for 
planar manipulators) relationships or constraints involving the position and orien- 
tation variables, and the given °[7] must satisfy these relationships or constraints. In 
most situations, the constraints are obvious since the manipulator was designed 
with the constraint in mind. For example, in the case of the SCARA manipulator the 

tool can be positioned arbitrarily in 3D space, but its orientation capabilities are 

/\ 

restricted to rotations about Z 4 (see Fig. 5). Hence, the constraints would be that 

/\ /\ 

rotations about X 4 and Y 4 are zero. For the SCARA manipulator, given x, y, z, and 
<fi, the inverse kinematics problem can be solved as 



zb COS 


-1 


x 2 + y 2 


l 


1 


/ 


Atan2(y,x) 


2/1/2 

Atan2 



,h + 



z 






The constraints or the relationships between the position and orientation vari- 
ables in any serial manipulator can be, in principle , determined from the direct 
kinematics equations. For example, in a five-degree-of-freedom robot, one 
can obtain six equations from 5 [7] in terms of the five joint variables. Elimination, 
using Sylvester’s dialytic method or any other approach, of the five joint 
variables from the six equations can yield a single equation in terms of the position 
and orientation variables of the tool. This single equation would be the constraint 
and would represent a five-dimensional subspace of the reachable workspace. 
A manipulator with four joints would lead to two expressions involving the 
position and orientation variables, and thus one would get a four-dimensional 
subspace. Obtaining explicit expressions for the constraints is, however, difficult 
in practice due to the difficulties in elimination of variables from nonlinear 
equations. 


Inverse Kinematics for Case 3 

In the case of a redundant manipulator, the number of joint variables is more than 
the number of equations. As an illustration, consider the planar 3R serial manipu- 
lator shown in Fig. 3. In addition, consider that the orientation of the last link is not 
of interest. In such a situation, there are only two equations relating the (x, y) 
coordinates of the end-effector with the three joint variables, 6 h i = 1, 2, 3, and 
these are 
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(40) 


For the inverse kinematics, (x, y) are given, and the task is to find 0 1? 0 2 , and 0 3 . 
Since there are only two equations, there exists an infinite number of solutions for 
0i, 0 2 , and 0 3 for a given (x, y). In order to obtain a finite number of 0 1? 0 2 , and # 3 , 
one more equation involving 0 1? 0 2 , and 0 3 is needed. One can impose a simple 
constraint such as 0 3 equals constant (the third joint is locked), but that defeats the 
purpose of designing and building manipulators with more than required joints and 
actuators. The availability of an extra joint can be used for optimization. Several 
researchers have suggested the use of redundancy to minimize joint rotations, joint 
velocities, and accelerations. Several others have also suggested the use of redun- 
dancy for avoiding singularities and avoiding obstacles (see the textbook by 
Nakamura (1991) and the references therein). Obtaining meaningful and useful 
equations is known as the resolution of redundancy, and this is the key issue in 
inverse kinematics of redundant robots. The resolution of redundancy can be 
achieved at various levels, such as position, velocity, accelerations, and torques. 
In this chapter, the example of the redundant planar 3R manipulator is used to 
illustrate the minimization of joint rotations - minimization of joint velocities, 
accelerations, and torques uses the pseudo-inverse of the manipulator Jacobian 
matrix, and this is discussed elsewhere. 

OOO 0 0 

A candidate function for optimization is 6\ + 0 2 + 0 3 . Minimization of 6\ + 0 2 + 

o 

0 3 subject to constraints given in Eq. 40 results in the planar 3R manipulator 
following a given trajectory with least rotation of the joints and can be formulated 
as follows: 


Minimize f(0) = 0\ + 0\ + 6\ 


subject to 


£l(0) — — ^ + l\C\ + hC\2 + ^3 C 123 — 0 

82 $) = — + hs \2 + h s i23 = 0 

where 6 denotes the three joint variables (0 1? 0 2 , 0 3 ), and (x, y) are points on the 
trajectory of the end-effector. Using the classical method of Lagrange multipliers, 
differentiating the modified objective function, one can get three equations 


de Al do + 89 

82 ( 0 ) = 0 

82V) = 0 

To solve the above three equations, eliminate the Lagrange multipliers Xi and X 2 
by rewriting the first equation as 
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< Y co-ordinate of the trajectory > 



Fig. 9 Plot of joint and task space variables for redundant planar 3R robot 
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For nontrivial Xi and X 2 , the determinant of the 3x3 matrix must be zero, and 
one can get, after simplification, 


I 1 I 2 O 3 S 2 + hh{0\ ~ # 2)^3 + hh(@3 ~ 0 2 )s22 = 0 


(41) 
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Joint 2 restricted between -120 to 120 degrees 
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Y co ordinate of the trajectory 
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A view of work space 




Trajectory on which 
minimization points 
were found and plotted 
above 





Fig. 10 


Plot of joint task space variables for redundant planar 3R robot with joint limit 



The inverse kinematics of the redundant planar 3R manipulator can now be 
solved from the three Eqs. in 40 and 41. Figure 9 shows the plot of 6\, 0 2 , 0 3 , and 
f{0) (/ 1? / 2 , and / 3 are chosen to be 5, 3, and 1 , respectively) when the end-effector of 
the planar 3R manipulator traces a straight line parallel to the Y axis as shown in the 
bottom of Fig. 9. One can also solve the optimization problem subject to joint 
limits. Figure 10 shows the plots of 6\, 0 2 , 0 3 , and f(0) when 0 2 is constrained to lie 
between ±120°. One can observe the difference in all joint variables when 0 2 is 
constrained. 

The inverse kinematics of redundant serial manipulators is an active topic of 
research and finds application in elephant trunk manipulators, snake robots, actu- 
ated endoscopes, and various other multi-degree-of-freedom devices. The reader is 
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referred to references (Klein and Huang 1983; Nakamura 1991) for the well-known 
pseudo-inverse-based approaches, reference (Chirikjian and Burdick 1994) for the 
modal approach, and references (Reznik and Lumelsky 1995; Ravi et al. 2010) for 
the tractrix curve-based approach. The references cited above are by no means 
comprehensive, and there exists a vast amount of literature on redundant 
manipulators. 


Summary 

Serial manipulators are extensively used in industry for a variety of applications. A 
serial manipulator consists of series of links and joints connected one after another, 
with one end fixed and one free end carrying the tool or the end-effector. Most serial 
manipulators have heavy and “rigid” links and the well-known four Denavit- 
Hartenberg parameters can be used to mathematically represent the links and joints 
of a serial manipulator. 

There are two important problems in serial manipulator kinematics, namely, the 
direct and inverse kinematics problem. The direct kinematics problem involves 
obtaining the position and orientation of the tool or the end-effector for given values 
of the joint variables. This is probably the simplest problem in robotics and can be 
always solved uniquely by simply multiplying appropriate matrices. The inverse 
kinematics problem involves obtaining joint variables for the given values of the 
position and orientation of the tool or the end-effector and is much harder to solve. 
Closed-form analytic solutions can be obtained only for special geometries of the 
serial manipulator such as when the last three joint axes intersect at a wrist point. 
For such special geometries, the inverse kinematics involves solution of a fourth- 
degree polynomial, and the manipulator can reach a point in the 3D workspace in 
four possible ways. At each such location, there can be two possible orientations. For 
a general six-degree-of-freedom serial manipulator with arbitrary geometry, the 
inverse kinematics problem involves numerical solution of a 16th-degree polynomial, 
and the manipulator can reach a desired position and orientation in at most 1 6 possible 
ways. When the degree of freedom is less than 6 for spatial motion and less than 3 for 
planar motion, the end-effector cannot achieve arbitrary positions and orientations, 
and there exists one or more constraints relating the position and orientation variables 
of the end-effector. When the number of actuated j oints is more than the dimension of 
the ambient space in which the manipulator operate, there exists an infinite number of 
solutions, and the manipulator can reach the desired position and orientation in 
infinite number of ways. For such redundant manipulators, optimization of a useful 
objective function is performed to make use of the redundancy. 

The workspace of a serial manipulator is the region in 3D space which the 
manipulator can reach. For serial manipulators with a wrist, the workspace is a torus 
which is restricted when joint limits are present. The workspace of a general 6R 
serial manipulator cannot be described analytically. However, whether a desired 
position and orientation of the end-effector is achievable or not can be determined 
by solving the inverse kinematics problem. 
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Abstract 

Robotic manipulators are mechanisms that are used to transmit motions and 
forces. Their kinematic and static properties are thus basic characteristics that 
must be analyzed when controlling them but also within the design phase. Such 
kinematic properties are the transmission of joint rates to the end-effector 
velocities. The dual property is the transformation of end-effector forces to 
joint forces. This requires adequate modeling of the manipulators, where serial 
and parallel manipulators must be distinguished. Special situations where these 
transformation properties change drastically are so-called singularities. 

In this chapter, the kinematic modeling is reviewed making use of recursive 
frame transformation. The velocity and force transformation relation is derived 
and explained for several manipulators, serial as well as parallel manipulators. 
The phenomenon of singularities is discussed, and the conditions for the exis- 
tence of singularities are presented. 

The motion planning and control requires the solution of the inverse kine- 
matic problem. This is also discussed in this paper. In particular, kinematic 
redundancy and redundant actuation are introduced, and the inverse problem 
discussed. 

Introduction 

The study of manipulator velocities constitutes one of the most important parts for 
robotic kinematics science, which is about manipulator’s position, velocity, and all 
higher-order derivatives of the position variables without regard for the forces that 
cause it. 

There are many applications in industrial processes that the velocity control is 
crucial, such as automatic arc welding and automatic spraying of latex. For exam- 
ple, the PUMA robot, a rotary -joint manipulator with six degrees of freedom, is 
usually used to weld together two parts of a car’s metal shell, throughout the 
manufacturing process of which the linear velocity of the end effector has to be 
maintained at a stable value and the angle velocity must be real-time changed. The 
torch on the tip of the end effector must move uniformly with its axis keeping a 
specific angle with the curved surface of the shell, so as to form a qualified weld. 
Therefore, a link of velocities would be built between two Cartesian spaces, the 
joint space, where the joints’ values of the manipulator are represented, and the 
operational space, in which the position and orientation of the end effector are 
defined. 

Furthermore, in some actual control cases, making use of a velocity loop in 
addition to a position loop can improve the manipulator performance (Kellya and 
Morenob 2005). And in some situations, it is more efficient to utilize the model of 
manipulator velocities to solve kinematic problems for there to be a linear-mapping 
relationship between the joint space and the operational space (Sciavicco and 
Siciliano 1986). In the previous chapters, the basic concepts and methods were 
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introduced, such as spatial descriptions and transformations, modeling a manipu- 
lator mathematically, deriving the forward and inverse position equations, and 
establishing the relationship between the joint variables and the end-effector 
pose. In this chapter, the study of velocities will be derived based on differential 
kinematics, for the linear velocity and angular velocity can be regarded as the 
derivative of the position and orientation with respect to time. The basics about this 
chapter are introduced first, including differentiation of vectors, the representation 
and notation of velocities in different frames, and differential relationships of 
frames. And then, one critical kinematic model termed Jacobian is derived to link 
the velocities’ relationship from the joint space to the operational space. 

The Jacobian is a matrix consisting of partial derivatives; moreover, in the field 
of robotics, they are usually first-order derivatives of multivariate functions of end 
effector’s position and orientation with manipulator’s joint values as the indepen- 
dent variables. The Jacobians are used not only for relating velocities from one 
space to the other but also for many aspects of robotic manipulator analysis: 
determining inverse kinematic algorithms, finding singularities, analyzing redun- 
dancy, and describing the mapping between static forces applied to the joints and 
the end effector, being utilized in manipulator dynamics, which will be derived in 
the following chapter. 

The goal of studying static forces in this chapter is to determine the relationship 
between the generalized forces applied to the end effector and that applied to 
manipulator joints, as well as to describe static forces in different frames. For 
there is a dual relationship between velocities and static forces, the similar methods 
can be used to establish as well as analyze the relationships of forces and torques 
represented in different spaces and frames. 


Basic Concepts and Notation 

Differential Movement and Velocity Representation 

The differential movement refers to an object’s instantaneous small movement in a 
very short period of time, such as point, coordinate frame, rigid body, and manipulator. 
When measuring or calculating this movement during a very small period of time, one 
can get the velocities between different objects or different parts of the same manip- 
ulator. Therefore, this characteristic can be used to derive relationships of velocities in 
robotic kinematics. For points and objects that can be considered as points, the 
differential movement refers to a change in position, corresponding with the linear 
velocities; for rigid bodies and objects with inner stmcture, the differential movement 
might also be caused by orientation changes, corresponding with angular velocity. In 
this chapter, all the discussed are based on the Cartesian coordinate system. 

When the vector P needs to be represented in a frame { A } , the notation A P can be 
used and one can also term that “P is a position vector in the frame { A } .” Moreover, 
the derivative of A P in the frame { A } can be written as 
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A 


The velocity of P reference to { A } is given by the equation 



(1) 

(2) 


where 




lim 

At->0 


A P(t+ At) 

At 




A velocity is generally measured related to a specific item, such as a certain point 
or a frame, but it can be expressed in any coordinate frames, i.e., if represent A V P in 
a frame [B }, that is, 





where {A } is the frame with respect to which the differentiation was done and {B } 
is the frame in which the resulting velocity vector is expressed. 

If the representation and measurement are taking place in one coordinate system 
frame, the outer subscript can be omitted, i.e., 


A ( A V P )= A V P (5) 

As introduced in the previous chapter, the transformation from frame {A} to 
frame {5} can be represented by a 4 x 4 matrix, consisting of a rotation 
element and a translation element. The structure of the transformation matrix can 
be written as 


B 

A 



B 

A 1V 3x3 


R 


Oi 


x3 




where 3 x 3 is a rotation sub-matrix and p 3xl is a translation sub-matrix. 

If vector P is a function of time, one can link the representation of the vector 
between {A} and {5} by using the relationship 


B P(t)=p 3xl + B A R 3 * 3 A P(t). (7) 

By differentiating both sides of the upper equation, one obtains 




T does not change with time, the first two items of Eq. 8 would be zero, i.e., 
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P 3 xl — 03x1 

B d Ap 

A K 3x3 " — 




Hence, one gets 


B P(t)= B R 3x3 A P (t). (10) 

And the result could also be rewritten as follows: 

B ( A v p) = B R A V p . (11) 

By utilizing Eq. 11, one can transform the expression of a vector’s velocity from 
one frame to another when the two frames are stationary to each other (Craig 2005). 

For example, Fig. 1 shows a seeding robot which can be used in an automatic 
planting factory. During the entire process, pots filled with soil move at a constant 
speed one by one on the conveyor. Meanwhile, the end effector must repeatedly 
suck up one seed from a specific container beside the conveyor, catch up with a pot, 
and press the seed into it. Besides, a camera can collect all the velocity information 
for the navigation system. In this case, it is easy to obtain the velocity relationships 
in terms of the camera, and one probably needs to express them in other frames for 
controlling purpose. 

To this end, one can locate the frame { C } on the camera, attach frame { G } to the 
current pot, and fix frame {T} to the end effector. Moreover, the rotation matrices 

T 

g R and g R are known at any instant. 


Fig. 1 Example for linear 
motions of different frames 
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At a certain moment, one obtains that the speed values of the pot and the end 
effector are 0.1 m/s and 0.8 m/s, respectively, by the camera and both are heading in 

the Y c direction with reference to frame {C}. That is, 


C V GORG — 0.17c 

c Vtorg = 0.87c 


( 12 ) 


If representing the pot’s velocity with respect to the end effector, one can write 


T fC V gorg) — cR C Vgorg — T c R 1 (0.17c). 


(13) 


If representing the velocity of the end effector in the frame { G } , one can write 


G(C Vtorg ) — G R C Vtorg — gR 1 (0.87c)- 


(14) 


Velocity Relationships Between Different Frames 
Case 1: Only Linear Velocity Case 

Next, the velocity representation relationships between {A} and {5} is going 
to be found, in the situation that there is a linear velocity between them. 
As shown in Eq. 6, p 3x i can be considered as the origin of {A} expressed in the 
frame of {5}, i.e., 

B Paorg = P3 x 1 • (13) 

The velocity of frame {A } with reference to frame { B } can be written as B V aorg, 
and in this case, the p 3xl can be considered as a function of time. And then, 
substituting Eq. 15 in Eq. 8, one gets the equation 

• • • 

B P (f) = B P A OR G (t ) + B A R 3X 3 A P ( t ), (16) 

which leads to the relationship that 

B Vp = b V A0RG + a R a V p . (17) 

By utilizing Eq. 17, one can transform the expression of a vector’s velocity from 
one frame to another when there is a linear velocity between the frames (Siciliano 
et al. 2009; Spong et al. 2005). 

Case 2: Both Linear Velocity and Angular Velocity Case 

Whereas linear velocity describes an attribute of a point, angular velocity describes 
an attribute of a body. When studying manipulator kinematics, a frame is usually 
fixed to a rigid body so as to use the movements of the frame to replace that of the 
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rigid body completely. In the most general cases, there are both linear velocity and 
angular velocity between two frames. Therefore, one may be interested in how to 
link the velocities between two frames even though they rotate with respect to each 
other. And one can begin this study by finding the relationship between the angular 
velocity co and the rotation matrix R 3x3 . 

Supposing that the angular velocity of the frame {A} with respect to the 
frame {5} is B co A , the rotation matrix ^/? 3x3 can be considered as a time-varying 
matrix %R 3x 3 (t). 

Since a rotation matrix must be an orthonormal matrix, it is now known that 


A R3x3 {f^R^ 3x3 if) — hx3 

Differentiating both sides of Eq. 18, one yields 



B 

A 


X 3x3 (t) *R r 


3x3 


(0 + 


A*3x3 (t)*R T 


3x3 


(t) — 03x3, 


which can also be written as 



B 

A 


R3x3 (0 B A R T 3x3 



B 

A 









Then one can see that ^/? 3x3 (2) B A R T ^^{t) is a skew -symmetric matrix. 
Set 



B 

A 


*3x3 (0 B R T 




B 


and postmultiplying both sides of Eq. 21 by a^3x 3(0 gives 



B 

A 



S3.At) B A R T 3, 




Since 

B P = A ^3x3 P , (23) 

which leads to 


B P{t)= B A R{t)^ A P, 

by substituting Eq. 22 in Eq. 24, one obtains 



B P(t)=S 3x3 (t) B R 3x3 (t) A P. 


(25) 
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The physical significance of rigid body rotating shows 

B P(t) = B Vp = b w a x B P. (26) 

By substituting Eq. 23 into Eq. 26, one can write 



Therefore, the matrix operator 63*3(0 describes the vector cross product between 
the vector B oo A and the vector behind the “x.” Indeed, the skew -symmetric matrix 
63*3(0 is so that its symmetric elements with respect to the main diagonal represent 
the vector b oj a in the form 


S 3x3 {t)=S( B CD A (t)) 


0 

co z 

~(Dy 




where 



And hence, the operator 3*1 x” is equivalent to the operator 6 3x3 (ft>(0), he., 


ft> 3 xlX «=> 6 3X 3(®(0)- ( 3 °) 

Now, let us return to the starting point. If there is not only linear velocity B V AORG 
but also angular velocity B co A between the frame {A} and the frame {5}, by 
combining the views of Eqs. 7, 19, and 30, it is easily verified that 




V AORG + 


b a R a V p + b co a x ( b R a P ). 



Velocity Propagation in Serial Kinematic Chains 

Robotic manipulators are usually composed of a series of links which 
propagate motions from one part of the robot to another part. In this section, the mles 
about how the linear velocity and angular velocity propagate from link to link will be 
introduced. One can start by assigning frames to the links as shown in Fig. 2: 

• Align the z, with the axis of motion (rotary or sliding) of Joint / + 1. 

• Locate the origin of the zth coordinate at the intersection of the z, and z t _\ or at 
the intersection of common normal between the z, and z t _i axes and the z, axis. 
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Fig. 2 Description of 
manipulator links by 
Denavit-Hartenberg 



• Establish x t along the common normal to axes z t _i and z 7 - with positive direction 
from Joint i to Joint / + 1. 

• Establish y t so as to complete a right-handed frame (Zhang 2010). 

Our discussion will depend on three Cartesian frames shown in Fig. 2: the base 
frame {B }, the link frame { / — 1 } locating at Joint i connecting link i — 1 and link /, 
and the link frame { i } locating at Joint i + 1 connecting link i and i + 1 . And the goal 
is to find the relationship of velocities between the frame { / — 1 } and the frame { i } . 
As shown in Fig. 2, it is now known that 

i - 1 P i = B P i - l +f_ l R i ~ 1 r. (32) 

By differentiating Eq. 32, that is, 



B 

i—1 


R i ~ l r + 


B 

i—1 





In views of the previous relationships introduced in Eq. 1, Eq. 33 can be 
rewritten as 


i_1 Vi) +V --1 x B r. (34) 

And hence, one gets the relationship between the linear velocity of frame { / — 1 } 
and that of frame {/}, i.e., 

,W V/) +%-! x B r. (35) 

Next, the propagation relationship for linear velocity and angular velocity 
can be derived separately as follows: 
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Linear Velocity Propagation 

If the Joint i is a revolute joint, the value of relative linear velocity between { i — 1 } 
and {/} is zero, i.e., Z_ V; = 0. 

Hence, Eq. 35 would be rewritten as 

B V t = B Vi _!+%■_! x B r. (36) 

Alternatively, if the Joint i is a prismatic joint, the relative linear velocity 
between { i — 1 } and { i } would be written as 



In the Eq. 37, Z t _ { is the unit vector of frame { / — 1 }, fixed to Joint i axis, and it 

T 

will always be [0, 0, 1] expressed with respect to its own frame; z\ is a scalar, 
which is the derivative of the value of Joint /, that is, the translation velocity of 
Joint i. 

Accordingly, Eq. 35 would be rewritten as 



B 


Vt-i+z 


l 



o)j— i x r. 



Angular Velocity Propagation 

For the angular velocity propagation, the introduction can start from the rotation 
transformation. 

As known in the previous chapter, there is the relationship as follows: 



Differentiating both sides of Eq. 39 with respect to time, that is, 




B 

i - 1 





B 

i - 1 




which, by substituting Eq. 22 in it, can be written as 




For a 3 x 3 matrix, R , and a 3 x 1 vector, co, it is easy to verify that (Spong 

et al. 2005), 


RS(a>)R T = S(Ra>). 



Therefore, one can rewrite Eq. 41 as 

f* = S( B m-r)?R +l 1 RSr* i ) B _ 1 R TB _ l I?r'R. 


(43) 
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Substituting Eq. 22 into Eq. 43, one gets 




that is, 



Contrasting both sides of Eq. 45, one gets the relationship between the angular 
velocity of the frame { / — 1 } and that of the frame { /}, i.e., 



B 


B 


(Dj-i + j_jR 


i—1 


COj. 



If the Joint i is a re volute joint, the relative angular velocity between { i — 1 } 
and {/} could be calculated by differentiating value of Joint /, i.e., 0j with its 
orientation along the unit vector of frame { i — 1 } . And then, Eq. 46 would be 
rewritten as 



Alternatively, for prismatic Joint /, there is no relative rotation between the 
frame { i — 1 } and { / } , and Eq. 43 would be written as 



Insummary, the relationships described in Eqs. 36, 38, 47, and 48 show us the 
rule of the propagation of velocities from one link to the next. Therefore, the 
recursive relation of Eqs. 36 and 47 allows us to find the velocities of any link of 
the manipulator, including the end effector, in the case that the current joint is a 
revolute joint; alternatively, Eqs. 38 and 48 can help us calculate the velocities if the 
current joint is a prismatic joint (Siciliano et al. 2009; Spong et al. 2005). 

In some cases, representing the frame velocity with respect to itself would bring 
more convenience to the robot control algorithm (Craig 2005). To this end, 
premultiplying both sides of Eqs. 36, 37, 47, and 48, by rotation matrix, such as 
1 b R and \-\R, one can find the formulations of computing the velocity of link 
i with respect to frame { i } . For the case that the frame { i } locating at a revolute 
joint, that is, 




cot - 1 + Qi l Zi_ i 

C' -7 Vi_ 1 + '-V-1 X 




For the case that the frame { / } is fixed to a prismatic joint, that is, 
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I 


l 


a>i 

V { 


Ur 1 - 1 co ,- 1 

i - 1 (i—1 


UR 


i + x 


Pi)+Zi %- 1 



Note that one can transform the result velocities derived from Eqs. 49 and 50 
from its own frame to any frame by premultiplying a rotation matrix. 


Differential Relationships of Coordinate Frames 

In order to describe a micromotion of robot, differential coefficient is utilized for 
coordinate transformation (Zhang 2010; Paul 1984; Zhang et al. 2008). 

Given a transformation T, 



F n (x) 

F 12 M 

F 13 M 

F 14 w 

Fn(x) 

F 22 (x) 

F 23 ( X ) 

F 24 (*) 

F 31 (x) 

F 32 (x) 

F 33 (x) 

F 34 (x) 

_F 4l(jc) 

F 42 0) 

F 43 (*) 

F 44 (x) _ 



whose elements are functions of some variable x, which would be a scalar or a 
vector, then its derivative can be written as 



~dF n (x ) 

dF n (x) 

dF i 3 (x) 

dF i 4 (x)' 

dx 

dx 

dx 

dx 

dF 2 1 (x) 

OF 22 (x) 

dF 23 (x) 

dF 24 (x) 

dx 

dx 

dx 

dx 

OF 3 i(x) 

dF 32 (x) 

dF 33 (x) 

dF 3 4 (x) 

dx 

dx 

dx 

dx 

dF 4l (x) 

dF 42 (x) 

dF 43 (x) 

dF 44 (x) 

<9x 

dx 

dx 

<9x 



If pose of the manipulator or one of its links is represented with T, one can use 
T + dT to describe its pose after an instant when a micromotion takes place. 
Moreover, the micromotion can be expressed by a differential translation of dx , 
dy, dz, and a differential rotation d<fi about the vector /. And if the micromotion is 
respect to the base frame, one can write 


T + dT = Trans(dx,dy,dz)Rot(f ,d(j))T , 



which leads to 


dT = [Trans(dx, dy, dz)Rot(f , dtp) —I]T. (54) 

As known in the previous chapter, the differential translation homogeneous 
transformation can be expressed by a 4 x 4 matrix, that is, 
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Trans (dx, dy , dz) 


10 0 dx 
0 10 dy 
0 0 1 dz 
0 0 0 1 


Since the formula of general rotation transformation is 



Rot(f, (p ) 


fx fx Vers <P + COS (p 
fxf y v e rs(p+f z sin(p 

f x f z vers(p — / z sin </> 

0 


fyf x vers(p —f z sirup 
fyf y vers(p+ cos (p 
f y f z y ers(p + f x sirup 

0 


f z f x vers(p+f y sm(p 
f z f y vers(p — f x sin</> 
f z f z vers(p + cos (p 

0 



the differential rotation d(p about the vector / can be derived by substituting the 
following relationships into Eq. 56, i.e., 







lim ( sin cp) 

(p^0 

lim ( cos d) ) 
^ o 

lim ( verscp ) 




As a result, one obtains the differential rotational homogeneous transformation, 
that is, 


Rot(f , dcp) 





One can define the differential transformation operator, which is denoted by A, 
that is, 


A = Trans(dx,dy,dz)Rot(f,dxp) — I. 
By substituting Eqs. 55 and 58 into Eq. 59, one gets 





The differential transformation operator derived above is based on the differen- 
tial rotation dtp, which is about a given vector in the frame. Next, the differential 
transformation is going to be discussed based on the differential rotation around the 
coordinate axes, i.e., x-axis, y-axis, and z-axis. And in this case, one can write 


A = Trans(dx,dy,dz)Rot(x,d6 x )Rot{y,d0y)Rot(z,d0 z ) — I. 


(61) 
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As introduced in the previous chapter, the rotation transformation matrix about 
x, y, and z can be written as 


Rot(x , #) 


Rot(y , #) 


1 

0 


0 

0 

cos# 


— sin# 

0 

sin# 


cos# 

0 

0 


0 

cos# 

0 

sin# 


0 

1 

0 


sin# 

0 

cos# 


0 

0 

0 


0 

0 

0 

1 


0 

0 

0 

1 


and 


Rot(z , #) 


cos # 
sin# 
0 
0 


- sin # 0 0 

cos #00 


0 

0 


1 0 
0 1 


In views of the following relationship, 


#-+ 0 
lim ( sin #) 

0—tQ 

lim ( cos #) 

O—’Q 


d0 

1 


one gets 


Rot(x , d0 x 


1 

0 


0 


0 

1 


0 d0 x 


0 


0 

d6 x 

1 

0 


0 

0 

0 

1 


Rot(y,dO y 


1 

0 


y 


0 


0 dO,, 0 


1 


d0„ 0 


0 


y 


0 

1 

0 


0 

0 

1 


and 


Rot(z , d0 z 


1 

d6 z 

0 

0 


d0 z 

1 

0 

0 


0 0 
0 0 
1 0 
0 1 
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By substituting Eqs. 66, 67, and 64 in Eq. 61, one obtains the representation of 
the differential transformation operator based on the differential rotation about the 
coordinate axes of the frame, i.e., 



0 

—d0 z 

d6y 

dx 

d0 z 

0 

—d0 x 

dy 

—dOy 

d0 x 

0 

dz 

0 

0 

0 

0 



Note that during the process of deriving Eq. 69, the result is independent with the 
order that the differential rotations take place by neglecting second-order terms. For 
example, if the terms such as dO x d6 x are considered as zero, the following equation 
can be established: 


Rot(x,dO x )Rot(y,dO y ) = Rot(y,d6y)Rot(x,d6 x ). (70) 

Comparison of the elements in Eqs. 60 and 69 shows that a differential rotation 
about a vector axis in the frame is equivalent to three differential rotations about the 
x-, y-, and z-axes regardless of the rotation order. And indeed, the differential 
rotation operator can be considered as the combination of a differential translation 
vector, 


d = d x i + d y j + d z k , (71) 

and a differential rotation vector, 

5 = S x i + 8 y j + d z k. (72) 

Moreover, the translation vector d and the rotation vector 3 defined above can be 
combined into the differential motion vector SD. Thus, 



For example, suppose that the frame {A} can be represented as follows with 
reference to the base frame: 


-1 0 0 25 

0 0 19 

0 10 6 
0 0 0 1 


A 


(74) 
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When it makes a series differential motion, i.e., dx = 0.2, dy = —0.1, dz = 0.1, 
dO x = 0.1, dO y = —0.2, and d6 z = —0.1 in terms of the base frame, one can write in 
differential translation and rotation as follows: 


d = 0.2/-0.1/ + 0.U 
<5 = 0.1/-0.2/-0.U' 


(75) 


In addition, the differential motion can also be written in vector form, that is, 


8D 


0.2 

- 0.1 

0.1 

0.1 

- 0.2 

- 0.1 


(76) 


In view of Eq. 65, the differential transformation operator is 


A 


0 

0.1 

- 0.2 

0.2 

- 0.1 

0 

- 0.1 

- 0.1 

0.2 

0.1 

0 

0.1 

0 

0 

0 

0 


(77) 


and then, one can solve for dA as follows 


dA 



0 

0.1 

- 0.2 

0.2 " 


"-1 

0 

0 

25 " 

AA - 

0.1 

0 

- 0.1 

- 0.1 


0 

0 

1 

9 

0.2 

0.1 

0 

0.1 


0 

1 

0 

6 


0 

0 

0 

0 


0 

0 

0 

1 

0 

- 0.2 

0.1 

- o.r 






0.1 

- 0.1 

0 

- 3.2 






- 0.2 

0 

0.1 

6 

• 





0 

0 

0 

0 







(78) 


Moreover, one can compute the pose of the frame {A} after the differential 
motion by adding its original matrix to dA, i.e., 


-1 

- 0.2 

0.1 

24 

0.1 

- 0.1 

1 

5 

- 0.2 

1 

0.1 

12 

0 

0 

0 
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Ac ur — Aqri + dA 


(79) 
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Jacobian and Manipulator Velocities 
Jacobian Matrix 

In vector calculus, the Jacobian matrix is the matrix of all first-order partial 
derivatives of a vector-valued function (Craig 2005; Waldron and Schmiedeler 
2008). For example, there are m functions, each of which is a function of n 
independent variable: 


*i =/ i(<7i,<72> • • •<? 

*2 =/ 2 (<?l,<? 2 . •••<? 


n 


n 


(80) 




m 


^2’ * * * Qn) 


One can also use a vector notation to write the above equations, 


x 


/(<?)> 


( 81 ) 


where 


x 


[xi , X2,' ' ', X m 

rj-i # 

[q\,q 2 ,- ■ -,q n } 


(82) 


If representing the differentials of Xj as a function of the differentials of q t , one 
can use the derivation rule for multivariate functions to calculate and obtain 


5x\ 


< 


<5x 2 
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df i . d/, 

Sq< + t , — Sq 2 + 
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dq 2 


df 2 , Of 2 , 
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dfm O , dfm , , 

Sq i + 7 ^— <5<? 2 + 
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+ — — oq 


dq 
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df 2 c 

+ ^<5<7 


dq 


n 
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df ni c 

+ ^<5v 


dq 


n 


n 


(83) 


Moreover, Eq. 83 can also be written in vector notation, that is, 


<5x 


mxl 


1 

d/il 

dq i 
• 

dq„ 

• • 

dfm 

• • 

df m 

- dq 1 

dq„ . 


dq 


nx 1 


(84) 


The m x n matrix of partial derivatives is called the Jacobian, and one can use 
the notation 
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Sx 


mx 1 


J 


m x n 


1 - 


(85) 


Dividing both sides of Eq. 85 by the differential time element, one can get the 

• • 

mapping relationship from q to x , 


x = J{q) q . (86) 

For q is usually assigned a different value at each new instant, it is a linear J(q) 
transformation, which is changing over time. Therefore, Jacobians are ordinarily 
time-varying linear transformations (Kelly a and Morenob 2005; Whitney 1969). 

Generally, in the field of robotics, the Jacobians are used to relate joint velocities 
to spatial velocities where the manipulator’s position and orientation are expressed 
(Khatib 1987). For example, x can be used to represent the end-effector pose in 
operation space and q can be used to represent the vector in related joint space. 
Therefore, Eq. 86 can be rewritten as 


V = J(q) 'q , (87) 

where V is a vector of spatial velocities which is defined in Cartesian coordinate 

frame and q is the vector of joint velocities of the manipulator (Craig 2005). 

Note that, the Jacobians might be defined of any dimension. The number of rows 
equals to the number of degrees of freedom in the Cartesian space being considered, 
and the column number is equal to the number of the joints. 

Take the Two-Link Planar Arm, for Example 

As seen in Fig. 3, the two-link planar arm has only two revolute joints with their 
axes parallel to each other; therefore, the end effector’s position can be represented 
by two independent variables in Cartesian coordinate frame, i.e., x and y. Hence, 
they can be computed according to the manipulator’s geometric characteristic, i.e., 


x = l\ cos 0\ -j- I 2 cos (0i + 0 1 ) 
y = l\ sin 6\ -j- I 2 sin {6\ + 0 \ ) 



Fig. 3 Structure of two-link 
planar manipulator 


X, y) 
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Moreover, the differentiation of Eq. 88 gives 


8x = —[l\ sin^i + I2 sin (6 \ + 6\)]86\ — I2 sin (6\ + 61)862 
Sy = [l 1 cos 6\ + I2 cos (1 9 \ + 6\)]86\ — I2 cos (6\ + 61)862 ' 



Dividing both sides of Eq. 89 by the differential time element, one gets 


V — 

• ~ 

X 


-y -l 2 sin (<9i +# 2 ) 


0i 

v — 

• 

_y_ 


x l 2 cos (6 1 + 0 2 ) 


.02. 


from which one can find the Jacobian, that is, 



-y -/2 sin (0i +6> 2 ) 
x / 2 cos (6\ + 0 2 ) 



As seen in Eq. 91, the Jacobian is a 2 x 2 matrix, for the two-link planar arm has 
two degrees of freedom in the Cartesian space and two revolute joints. 


Transformation of Jacobian's Frame of Reference 

The Jacobian depends on the coordinate frame in which the manipulator velocities 
are expressed. Therefore, it is necessary to find the rule of transforming the 
representation of the Jacobian from one frame to another. 

For example, given a 6 x 6 Jacobian matrix with reference to frame {A}, 



and one can derive its representation in frame {B } as follows. 
Since one has 






B 

A 


R A V 3x1 




B 


&>3xl 


BnA 
A 


R 


&>3xl> 


(94) 
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one can write 


If substituting 



^3x1 


1 

0 

oq 

1 


1 

X 

CO 

> 

1 

^3x1 


.0 B a R_ 


A ^3xl 



into Eq. 95, one obtains 










which allows us to change the frame of reference of a Jacobian to another frame. 


Geometric Jacobian and Analytical Jacobian 

Generally, manipulator end-effector velocities can be defined with reference to a 
stationary Cartesian frame which would usually be the base frame, and the related 
Jacobian can be computed according to a geometric technique. In this case, the 
linear and angular velocities are determined as well as the corresponding joint 
velocities. Therefore, this type of Jacobian matrix is sometimes called the geomet- 
ric Jacobian, to distinguish it from the analytical Jacobian. 

If the end-effector orientation is specified in terms of a minimal number of 
parameters based on the end-effector frame, the analytical Jacobian is introduced 
and used to link the contributions of each manipulator joint velocity and the 
end-effector velocity (Orin and Schrader 1984). 

One uses the notation J(q) to represent a geometric Jacobian, that is, 





where 6 is defined with respect to the base frame, that is, 




( 100 ) 


Alternatively, one uses the notation / A (g) to represent an analytical Jacobian and 
express the relationship between joint velocity and the end-effector velocity as the 
following equation: 
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( 101 ) 


If the orientation of the end effector is represented by a vector of Z-Y-Z Euler 
angles with reference to the current frame, i.e., 



the relationship between 6 and <fi is 



0 — sin /? 

0 cos a 

1 0 


cos a sin /? 
sin a sin /? 
cos /? 



(103) 


Hence, the relationship between Je(q) and J<s>(q) can be expressed as follows: 


where 


Je(q) = 0 r o (^)/ o (v), 



0 

0 

1 


— sin/? 
cos a 
0 


cos a sin /? 
sin a sin /? 
cos/? 


(104) 



Therefore, the relationship between J(q) and /a(v) can be expressed as the 
following equations, which could be used to compute one of them from the other: 



< 

JA{q) 


I 

0 




Jacobian Computation for Serial Manipulator 
Computation of Linear Velocity Components 

Given a serial manipulator of n joints, the position vector of its end effector can be 
expressed as a function of the joint values, that is, 


P e =P e (q l ,q 2 ,---,q n ). 


(107) 
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Fig. 4 Representation for the computation of the contribution of a joint to the end-effector linear 
velocity 


By differentiating Eq. 107, one can express the linear velocity of the end effector 
as follows: 



As shown in Eq. 108, at any instant, the contribution of each joint’s velocity is 
independent to the end effector’s velocity. Therefore, as shown in Fig. 4, one can 
analyze the relationship between the end effector and each joint’s velocity, respec- 
tively, and combine all of them as a result (Cai 2000). 

In the view of Eq. 108, the velocity contribution of Joint i can be written as 



(109) 


where 



(110) 


If Joint i is a prismatic joint, it is now known that 


50 Manipulator Velocities and Static Forces 


1831 


V ei — P ei ~ Qi J vi — z i Zj- i , (111) 

• ^ 

where z t represents the moving velocity of Joint i and Z/_ i is the unit vector of 

z-axis of frame { i — 1 } located at Joint /, 

and then, one gets the relationship of Eq. 112 from Eq. Ill: 

J vi = Z l _ l . (112) 

If Joint i is a revolute joint, as shown in Fig. 4, one can obtain 

Vei — P ei = hi J VI =a>i-iX'~ l r e =^/Z/_ 1 X (P e -Pi- 1 ), (113) 

from which one derives 

Jvi=Z i - l X (P e — Pi-\). (114) 

Computation of Angular Velocity Components 

Since angular velocity is classified free vector, the angular velocity of the end 
effector can be considered as the superposition of all the angular velocities of the 
manipulator joints (Orin and Schrader 1984), i.e., 


CO e — CO i H - 0)2 + • • • + O) n 




where J mi is a 3 x 1 matrix and q { is scalar quantity determined by joint velocity. 
If Joint i is a prismatic joint, one has 


a>i =J coi q j = 0, (116) 

where 

Jon = 0. (117) 

If Joint / is a re volute joint, one can obtain 

Wi = 0 j Zi-i=J <oi q j , (118) 

where 

J coi = Zj—i (119) 

In conclusion, considering Eqs. 108 and 115, for serial manipulator, one 
can write 
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J 6xn (<?) Q 


nx 1 


( 120 ) 


where 



( 121 ) 


Furthermore, in view of Eqs. 1 12, 1 14, 1 17, and 119, the following relationships 
can be used to find the Jacobian for a manipulator: 

For a prismatic joint 



( 122 ) 


For a re volute joint 


1 

> 

1 


Zi-i X (P e -Pi-i) 

T . 

J an 


1 

* ^ 

<N 

l 


(123) 


Now take the well-known Stanford manipulator, for example, to illustrate the 
derivation of the serial manipulator Jacobian. The Stanford manipulator has six 
joints, and they are arranged in the order RRPRRR (revolute-revolute-prismati- 
c-revolute-revolute-revolute). The axes of the last three joints intersect at one 
point. The DH parameters are shown in Table 1. 

Considering Eqs. 121, 123, and the DH parameters in Table 1, the Jacobian can 
be written as 









(124) 


Table 1 DH parameters 
for Stanford manipulator 
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0 
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—n /2 
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d 2 (determined by mechanism) 

0 

n /2 

e 2 

3 

d3 

0 

0 

0 

4 
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—n /2 

e 4 
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0 

0 

n /2 

85 
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d 6 (determined by mechanism) 

0 

0 

06 
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where 


< 
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)> J (0 2 
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(n 

-A 

) ? J 0)6 




By calculating the position vectors in terms of the base frame, P 0 to P 6 can be 
found as follows: 




c\s 2 d 3 - s\d 2 
s\s 2 d 3 + ci d 2 
c 2 d 3 


(127) 




C\s 2 d 3 - s x d 2 + [ci(c 2 c 4 s 5 + s 2 c 5 ) “ sis 4 s 5 \d 6 
s\s 2 d 3 + c x d 2 + [si(c 2 c 4 s 5 + s 2 c 5 ) + cis 4 s 5 \d 6 

c 2 d 3 + (-^ 2 c 4 ^ 5 + c 2 c 5 )d 6 


(128) 


And computing the unit vectors with reference to the base frame, one obtains 



(129) 





C\C 2 S 4 - s x c 4 

-S\C 2 S 4 + C\C 4 

S 2 S 4 


(130) 


(131) 


and 


9 


(132) 
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Ci(c 2 C 4 S 5 + S2C5) - SiStfs 

5-l(c2C45-5 + ^ 2 ^ 5 ) + CiS 4 S 5 


9 


-^ 2 C 4 ^5 + C 2 C 5 


(133) 


where S, denotes sin(0 z ) for short, while c z denotes cos(0 z ). As a result, the Jacobian 
6 x 6 is obtained by substituting F z and Z z in Eq. 125. 


Jacobian Computation for Parallel Manipulator 
Computation Based on Differentiation 

The Jacobian of a parallel manipulator is traditionally formulated directly from the 
time derivative of the vector loop equation (Coppola et al. 2013). As shown in 
Fig. 5, a planar two degrees of freedom parallel manipulator is used to position a 
point on the plane. This 2-DOF parallel manipulator has five revolute joints noted as 
Oj to O5. Joints Oj and 0 3 are actuated joints with their base fixed to the ground, 
and the end effector is located on the Joint 0 5 . 

Our goal is to find the relationship between the velocities of the two actuated 
joints and the velocity of the end effector. To this end, one can attach the origin of 
the base frame on Joint Oi with x-axis pointing Joint 0 3 . Accordingly, x and y are 
used to represent the position of the Joint O 5 , i.e., the position of the end effector 
with reference to the base frame. At points 0\ to 0 4 , one gets 


Fig. 5 A planar two degrees 
of freedom parallel 
manipulator 


*1 - 

ll 

CO 

O 

*2 ~ 

l\ cos 6 \ + x\ 

< v 2 = 

li sin 0 \ + v | 

X 4 

I 3 COS 02 T X3 , 

>4 = 

- h sin 0 2 + y 3 

Y t 




(134) 


END-EFFECTOR 
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where (x h y,) represents the position of Joint O t in terms of the base frame. From 
Fig. 5, one obtains 


Ui = (x - x 2 ) 2 + (y- y 2 ) 2 _ 
l ll = (x ~ x 4 ) 2 + (y - y 4 ) 2 ' 

By substituting Eq. 134 into Eq. 135, one gets 




Hence, one can obtain the kinematic relationship as follows: 





where 







(137) 


< 





By differentiating Eq. 137 with respect to p and 6, one obtains 


(138) 




B 


d F 

dp 

OF 

d6 


(pc — l\ cos 0 \ ) (y — /i sin ) 

(x — I 3 cos O 2 — X 3 ) (y — /3 sin 6 ^ 2 ) 

(xsin^i — y cos^i )/i 0 



(x — X 3 ) sin 62 


— y cos d 2 ]h 


Since the velocity equation can be written as 


(139) 


A p + B 6 = 0, 


one can derive that 


9= (-5 _1 A) p. 

Hence, the Jacobian matrix can be expressed as follows: 




(140) 


(141) 


( 142 ) 
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in which 





a\ 

x - 

- l\ COS01 

a 2 - 

x - 

- l 2 cos 0 2 — x 2 

b\ = 

-y- 

- l\ sin^i 

b 2 ~- 

-y- 

- l 2 sin 0 2 

d\ 

—\ 

(xsin#i — y cos^i)/i 

d 2 - 

— 

(x — x 2 ) sin 0 2 —y cos 0 2 \ h 


(143) 


Computation Based on Screw Theory 

Screw theory has become an important tool in the field of robotics, which can be 
utilized to find the Jacobian for hybrid parallel manipulator. As shown in Fig. 6, the 
manipulator consists of three identical limbs. Each limb has two revolute joints, one 
prismatic joint, and one spherical joint (Coppola et al. 2013). 

In views of the screw theory, each of the three limbs is modeled as a serial chain: 



(144) 


where T( 6 x 1) is the platform twist motion in Pliicker axis coordinates, 6 x 2) 
and Pj( 6 x 4) are the active and passive joint screws, q- (1 x 2) and cp\ (1 x 4) are 
the active and passive joint velocities, and i denotes the ordinal number of the 
limbs. Letting Wi(6 x 2) be a pair of wrenches in Pliicker ray coordinates that are 
reciprocal to the passive joint screws, i.e., the value of wJP t is zero, one can 


Fig. 6 Example of a spatial 
parallel manipulator 
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eliminate the passive joint velocities in Eq. 144 by premultiplying with W T . This 
yields an expression of the form 




M '[ Q | 


~k\~ 

1 1 

m 

£ 

<N 

£ 

£ 

t T = 

*2 02 


<?2 



wlQ 3 _ 


-<? 3 - 



where W =[w\ w 2 W 3 ] is the wrench matrix (Ebert-Uphoff et al. 2002). 

The relationship expressed in Eq. 145 links the velocity of the mobile platform 

with that of the actuator velocities. Its form is analogous to the well-known form 

• • 

first proposed in (Gosselin and Angles 1990), Ax = B q, where A is denoted as the 
instantaneous direct kinematic matrix (or the wrench matrix if the equation is 
directly in screw form) and B is denoted as the instantaneous inverse kinematic 
matrix. Both the matrices are Jacobian matrices that are in screw form in Eq. 145. 

If the instantaneous twist frames associated with all the joints for each limb are 
defined as Fig. 7, the twists can be expressed as follows: 


\%bij ~ /— 1,2,3, (146) 

where , = [/,, m n n n p n q n r n f. 

The hybrid parallel manipulator structure is dually actuated in each limb; thus 
two reciprocal screws are expected. By computing the reciprocal condition, two 
screws reciprocal to all the passive joints of the zth limb can be found to be 



Fig. 7 Vectoral 
representation of the 
manipulator 


$1,2 


1838 


D. Zhang et al. 



(147) 


where £3,/ is the direction vector of S/,3, while S 4,/ is the direction vector of S/,4. 

For the screw wrenches of Eq. 147 form a two-system to yield w h the velocity 
equation for the ith limb can be derived as follows. 



w • 

where /,• 1 and 0 i are the transform velocity of the prismatic joint and the angle 
velocity of the actuated revolute joint on the ith limb. 

Moreover, Eq. 148 can be augmented for i = 1, 2, 3 to form the full velocity 
relationship of Eq. 145. As a result the 6 x 6 Jacobian can be obtained via reciprocal 
wrench screws. 


Kinematic Singularities 
The Basic Phenomenon 


In all preceding considerations, it was assumed that the essential kinematic char- 
acteristics of the manipulator do not change during its motion. These essential 
characteristics are the “type” of its motion, its instantaneous mobility, and the 
number of independent task space velocities it can generate. Situations in which 
these change, i.e., the manipulator exhibits a qualitative change of its kinematic 
properties, are collectively called “singularities.” 

As a simple example consider the planar serial manipulator in Fig. 3 in the 
horizontally stretched configuration q 0 = (#i, 0 2 ) = (0, 0), shown in Fig. 8. In this 
configuration the manipulator Jacobian Eq. 91 becomes 




(149) 


T 


r 9 9 ~\ 1 

and the end-effector velocity is V = [0, (l\ + l 2 )0\ + h02\ • That is, the manip- 
ulator is not able to generate any velocity that has a component in the x-direction 
of the global coordinate frame. The reason for this is that Jacobian drops rank, 
i.e., it becomes singular. Therefore, the manipulator is said to be in a singular 
configuration, and 0 o is called a singularity of the manipulator. It is important to 
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Fig. 8 Planar 2R serial 
manipulator 



infr:i\ihlr 



recall the consequences of such a singularity for the transmission capability of 

r\ 

the manipulator as defined by the forward kinematic mapping/ : T — » R given 
in Eq. 88. If the Jacobian is full rank 2, it follows from the inverse function 
theorem that for any end-effector (EE) position in a sufficiently small neighbor- 

hood of x G R , there is a unique q G T , and thus the motion of the EE is 
uniquely mapped to the joint motion. Furthermore, the relation of x and q is 
bijective. Now this is different at q 0 , where rank J (0 O ) = 1. Apparently any 
instantaneous joint motion <5q G ker /(q 0 ) does not produce an instantaneous 
EE motion, where the kernel (or null space) of J (q 0 ) is ker /(q 0 ) = span ([1, — 

T 

(/ 1 + l 2 )/l 2 \ )• Hence, any instantaneous motion of the form <5q = [s, — s(l x + l 2 )l 

nr 

l 2 \ does not affect the EE configuration. It is therefore called the self-motion of 
the manipulator at the singularity. The practical consequence that affects the 
stability of the manipulator is that instantaneous EE mobility is reduced which 
does affect the basic functioning of the manipulator. Even more from kinematic/ 
static duality (see “Static Forces”), according to which EE forces W transform to 
joint forces/torques r via the transformation r = JW, it follows that in a 
singularity there are EE forces W 0 G ker J (q 0 ) that can be balanced without 
producing joint forces/torques. This may be strategically exploited in certain 
situations as pointed out in (Kieffer and Lenarcic 1994). In the singularity of the 
2R example in Fig. 8, it is clear that the manipulator can balance any EE force 
along the x-axis with zero joint torques. 

While the situation is rather clear for serial manipulators, it becomes more 
complicated for parallel manipulators and closed-loop mechanisms in general. As 
a simple but instructive example, consider the planar 4-bar mechanism in Fig. 9. 
This closed-loop mechanism is known to have 1 DOF. That is, the motion of the 
mechanism is fully determined by the motion of one of its angles, like q x . This is 
true until the mechanism reaches the configuration q$ in Fig. 2b. From this config- 
uration the mechanism can either return to a configuration as in Fig. 9a or enter the 
motion mode in Fig. 2c where the links with equal lengths overlap so as to make the 
axes of joints 1 and 3 coincide. Apparently the joint angle q x alone is not sufficient 
to determine which mode the mechanism will enter, but this requires specification 
of the motion of an additional joint, say joint 2 with angle q 2 . That is, the 
mechanism gains mobility and has instantaneously 2 DOFs in the bifurcation 
configuration in Fig. 9b. The bifurcation phenomenon and the gain in mobility 
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Fig. 9 Planar 4-bar mechanism 
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Fig. 10 C-space of the planar 4-bar mechanism projected to the ql/q2/q4 subspace 


are geometrically visible from the visualization of the configuration space in Fig. 3 
showing the section of the 4-dimensional joint space. The 1 -dimensional 

7 2 

c-space consists of smooth curves V and V intersecting at singular points. Hence, 
in regular points the c-space is locally approximated by tangent vectors. Clearly at 

1 9 

the singularity q 0 , where V and V intersect, the linear approximation of the 
c-space is the tangent plane determined by the sum of the tangent vectors to either 
component. Now it is clear that it needs two coordinates to decide in which motion 
mode the mechanism moves. 

This observation regards the mechanism as a passive system. A parallel 
manipulator is a closed-loop mechanism with some of its joints actuated (inputs) 
and a dedicated output link (output). Taking again the geometric view, this means 
that the input joint variables are used as local coordinates on the c-space. If, for 
instance, joint 1 is used as the input joint, then the mechanism can be controlled 
until meeting the c-space singularity q 0 . However, it can also happen that a 
selected joint ceases to be a proper input for a mechanism away from c-space 
singularities. For example, using joint 4 of the 4-bar mechanism fails to locally 
uniquely determine the mechanism’s motion in the configuration in Fig. 9a. 
Denoting this configuration with q x , it is apparent from Fig. 10 that the joint 
variable q 4 of joint 4 cannot parameterize the curve V 2 (since qx is a singularity of 
the projection of V). The practical consequence is that at qi the mechanism cannot 
be controlled by joint 4. This is also a singular phenomenon referred to as input 
singularity. 

It shall already be clear from these two examples that the phenomenology 
of parallel manipulators is by far more rich than that of serial manipulators. An 
exhaustive study of the principle phenomena that can occur is reported in (Zlatanov 
et al. 1998). 
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Forward Kinematic Singularities of Serial Manipulators 

A serial manipulator is kinematically described by its forward kinematic mapping 

f :Y n — > SE (3) 


C =/(q) (150) 

that determined the EE configuration C G SE (3) in terms of the joint variables 
q G V /z . The joint space ¥ /z is a smooth manifold, and SE (3) is the Lie group of 
frame transformation, which describes the configuration of the EE frame relative to 
the spatial reference frame (Selig 2005). The corresponding velocity transformation 
is the forward velocity mapping Eq. 120, where J is the first-order approximation of 
the forward kinematic mapping. A manipulator can have an arbitrary number n of 
1-DOF joints. If dim im /= n , the serial manipulator is nonredundant, and if dim im 
/ < n, it is called kinematically redundant. That is, it is kinematically redundant, if 
the dimension of the manipulator’s EE motion space is less than that of the joint 
space. Denote with r m the maximal rank of J, which is the dimension of the EE 
motion space is equal to r m . Clearly, r m < min (n, m). 


Definition 1 A configuration q is called a forward singularity of the serial manip- 
ulator, if and only if rank J< r m . 

This is the same as saying that q is singular if and only if the rank of the forward 
kinematic Jacobian is not constant in any neighborhood of q in V /? . Singularities in 
general are strictly defined as properties in an open neighborhood and not in one 
particular point. This is in accordance with the very meaning of the attribute 
“singular” as solitary or unique, i.e., a property of some points adjacent to points 
not having this property. In fact the set of singular points is closed. It is hence not 
sufficient to evaluate the rank of Jacobians in a particular point in order to deduce 
whether the point is singular or regular. 

The forward kinematic Jacobian can be expressed in terms of the instantaneous 
joint screw coordinates as in Eq. 120. Hence, the condition for a singular constraint 
Jacobian is the linear dependence of more than n-r m of the instantaneous joint 
screws. Finding the condition that a set of given screws forms a screw system of 
certain rank is a very difficult problem that has been addressed in series of 
publications (Gibson 1999; Gibson and Hunt 1990a, b; Hunt 1986; Karger 1996; 
Kieffer 1992, 1994; Lipkin and Pohl 1991; Litvin et al. 1986; McCarthy 2000; Rico 
and Duffy 1992a, b; Rico et al. 1995). 

Based on these conditions the singularities of a large number of mechanisms 
were studied. This provides a geometric setting for the systematic singularity 
analysis. Nevertheless, it shall also be noticed that for special mechanism a direct 
investigation of the analytical Jacobian is feasible. 
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Configuration Space Singularities of Closed-Loop Mechanisms 

The c -space of a serial manipulator is the smooth manifold Y n but the c-space of a 
parallel manipulator is the subvariety 

y = {qG¥”,/i(q) =0} (151) 

defined by the m loop closure constraints 

h{ q) = 0. (152) 

In the case of the planar 4-bar mechanism, there are m = 3 loop constraints 
imposed on the 4 joint variables, and the c-space is a variety of dimension 
8 := n — m = 1. Here 8 is the global finite mobility of the mechanism, i.e., the 
dimension of V. The constraints ( 1 ) give rise to the velocity constraints 

J c (q)q = 0 (153) 

with J c being the Jacobian of h. The differential (or instantaneous) mobility of the 
mechanism in configuration q <G V is ^diff(q) := n — rank J c (q). For the 4-bar 
example <5 diff (q) = 1 except at the singular configurations discussed above. More- 
over, except at these points, the finite and differential mobility are identical. That is, 
the differential mobility is constant. This leads to the following: 

Definition 2 A configuration q £ Y is a regular point of the mechanism if and only 
if 8 diff is constant in a neighborhood ofq in V. Othei^wise it is a c-space singularity. 

It is important to notice that the condition is the differential mobility being 
constant, rather than being equal to the finite mobility, since there are mechanisms 
where the constraints (1) are not independent on the c-space V , and hence its 
dimension is greater than that estimated by n — m (so-called overconstrained 
mechanisms), which also means that the rank of the constraint Jacobian is less 
than m. 

The problem of singularities of general varieties is a major topic in differential 
geometry and topology (Gullemin and Pollack 1974). For mechanisms the concept 
of c-space singularities was originally introduced in (Zlatanov et al. 2002). An 
important result is that c-space singularities form closed subsets. This means that 
c-space singularities form lower-dimensional subvarieties of measure zero, so that 
“almost all” configurations are regular. In the 4-bar mechanism example, the 
c-space singularities are isolated points, i.e., 0-dimensional subvarieties of the 
1 -dimensional c-space. 

The necessary condition for q E V being a c-space singularity is that rank J c (q) 
< m. This means that the loop constraints are instantaneously dependent. Hence, 
the crucial step in identifying c-space singularities is to find the conditions for the 
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Jacobian to be rank deficient. As for the serial manipulator Eq. 120, the geometric 
constraint Jacobian can be expressed in terms of instantaneous joint screw 
coordinates. 


Singularities of Parallel Mechanisms 

Parallel mechanisms are special closed-loop mechanisms with designated input and 
output links. A “moving platform” is connected to the fixed platform (ground) in 
parallel by several (usually identical) actuated limbs. Such mechanisms are com- 
monly called fully parallel manipulators. As shown in section “Jacobian Compu- 
tation for Parallel Manipulator,” the kinematics can be expressed in the form 

AV = Bq (154) 

where V is the platform velocity and q are the actuator joint rates. This kinematic 
model involves two Jacobians, each of which can become singular. This has been 
first addressed in (Gosselin and Angeles 1990). Configurations where B becomes 
singular are called type I singularities, and when A becomes singular, they are 
called type II singularities. Type I and II singularities are also called serial and 
parallel singularities, respectively, as type II only occurs for PKM. 

In fact, the model (1) reduces to that for a serial manipulator by setting A = I. 
Type II singularities are also termed force singularities, since certain EE wrenches 
cannot be equilibrated by control forces. This can be concluded from the force 

r T 1 r T' r T 1 

transmission: A W = B r. EE wrenches in ker A cannot be balanced by any 
actuator force. This can lead to serious stability problems. 

As an example consider the 5 -bar parallel manipulator in Fig. 5 in the configu- 
ration on Fig. 1 1. In the shown configuration, the manipulator cannot withstand any 


Fig. 11 Planar 5R parallel 
manipulator in a singular 
con_guration 




50 Manipulator Velocities and Static Forces 


1845 


vertical force as indicated in the figure. Moreover, actuator forces can be produced 
that have no effect on the manipulator motion but merely produce internal self- 
stress. It is hence clear that such situations must be avoided by the motion planning 
if possible. For this 5 -bar example, it is not possible to reach any point on the lower 
half of the workspace without crossing a singularity. The geometric conditions for 
fully parallel manipulators were considered in (Merlet 1989). 


Further Considerations 
Cuspidal Serial Manipulators 

Most manipulators exhibit kinematic singularities, and an important question is 
whether a manipulator can transit between any two specified configurations without 
passing through a singularity, or whether it can change its postures without passing 
through a singularity. This problem has been addressed and intensively studied for 
regional serial manipulators with three joints as reported in a series of publications 
(Wenger and El Omri 1993; Wenger 1998, 2007). It is has been shown that a 3R 
manipulator can in fact change its posture without meeting a singularity if it 
possesses cusp singularities in the joint space. This is an important statement for 
the practical manipulator operation. The cuspidality concept has been adopted to 
parallel manipulators (Husty 2009; Wenger and Chablat 2009). 

Higher-Order Analysis 

The mere existence of singularities does not fully reveal the behavior of the 
manipulator in the vicinity of a singularity nor the control problems associated 
with it. The motion that a manipulator may perform when passing through a 
singularity can be revealed by looking at higher-order approximations. This is in 
particular instructive since in a singularity the mobility changes qualitatively and 
the manipulator may not be fully controllable by the designated actuated joints. 

For serial manipulators such higher-order approximation proceeds by series 
expansion of the forward kinematic mapping up to a certain order. The crucial 
observation is that the forward kinematic Jacobian is expressed in terms of the 
instantaneous screw joint coordinates and that their derivatives are given by screw 
products (Lie products) of the screw coordinate vectors (Selig 2005). In other 
words the series expansion does not require analytic derivatives but only involves 
vector cross products. This has been first reported in (Karger 1996) and was later 
advanced in (Lerbet 1998, 1999, 2003; Muller and Rico 2008; Muller 2012; Rico 
et al. 1995). 

Since also the constraint Jacobian of a closed-loop mechanism is expressed with 
help of screw coordinates, the series expansion of the geometric loop constraints 
can be carried out with repeated screw products of the joint screws. This 
provides the necessary formulations for the higher-order mobility analysis. How- 
ever, the sufficient order of the local analysis of closed-loop mechanisms is yet 
unknown. 
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Redundancy in Robotic Manipulators 

Traditionally the number of actuated joints in a serial manipulator equals the 
dimension of task space, i.e., the task space and joint space have the same dimen- 
sion. The existence of singularities as well the presence of obstacles in the 
workspace motivates the introduction of additional joints so that the manipulator 
is equipped with more actuated joints than strictly necessary. The DOF of a serial 
manipulator is S = n. 

Definition 3 A manipulator is called kinematically redundant if S > r m . The degree 
of kinematic redundancy is n — r m . 

This applies to serial as well as parallel manipulators. The anthropomorphic 
7R serial manipulators are a well-known example. With degree of redundancy 

n - r m = 1. 

Parallel manipulators possess a further type of redundancy. This stems from the 
fact that they comprise more joints than their DOF. Moreover, it was recognized 
that the force transmission can be improved if more than S joints are actuated 
(Chakarov 2004; Kim et al. 2001; Kock and Schumacher 2011; Nokleby et al. 2005; 
O’Brien and Wen 1999). Denote with n a the number of actuators and again with r m 

the EE mobility. Then, the inverse kinematic solution AV = q of Eq. 1 involves the 

n a xr m matrix A. 

Definition 4 A parallel manipulator is called redundantly actuated if 3 < n a . The 
degree of actuation redundancy is n a — 3. 

Dual to the velocity transformation, the force transmission is determined by 

W = A T T. (155) 

Since always r m < 3 , this is an overdetermined system for the actuator forces t. 
Hence, given a desired EE wrench W, the solution is not unique. Moreover, a 
solution can be determined taking into account the distribution of the overall load 
among the individual drives. This can be achieved using a pseudoinverse solution 
as reported in (Kock and Schumacher 2011; Muller 2005). 

More on the terminology of manipulators can be found in (Conkur and 
Buckingham 1997; Muller 2013). 


Inverse Kinematics of Serial Manipulators 

The forward kinematic mapping (1) for a serial manipulator explicitly determines 
the EE configuration for a given vector of joint variable q. The inverse kinematic 
problem (determining the joint variables that produce a desired EE configuration) is 
only implicitly defined, however. It requires solving the nonlinear equation (1) for 
q. This central topic has a long history in robotics starting with early works like 
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(Whitney 1969). A major problem is the nonuniqueness of the solution since, even 
for a nonredundant manipulator where n = r m , for a given EE configuration there 
may exist multiple q (i.e., postures of the manipulator) that give the same config- 
uration (Baker and Wampler 1988; Borrel and Liegeois 1986). The mapping/maps 
from the joint space manifold to the task space manifold. Using the standard 
description in terms of joint angles and displacements / is an analytic mapping. 
When using algebraic parameterizations like the tan-half- angle substitution, the 
mapping /is algebraic (which is always possible except for proper screw joints). 
The consequence is that either formulation allows for application of different 
mathematical tools. Such algebraic methods have been proposed for instance in 
(Baillieul 1985; Deo and Walker 1993; Lee and Liang 1988; Nakamura and 
Hanafusa 1986; Raghavan and Roth 1993), but it is the subject of ongoing research. 

Instead of aiming at closed form solutions, a practically feasible approach is to 
solve the inverse kinematic problem iteratively. To this end the first-order approx- 
imation in terms of the manipulator Jacobian is employed. Starting from a 
given configuration q, with C = /( q), the transition to a neighboring EE configu- 
ration C = C + dC can be approximated as 

C=/(q)=/(q)+J(q)dq, (156) 

where q = q + dq. For a nonredundant manipulator, the Jacobian is square and a 

solution is dq = J _ 1 (q)(C — /( q)). This has a unique solution except in forward 

kinematic singularities where J becomes singular. For kinematically redundant 
manipulators, on the other hand, the system (1) does not have a unique solution. 
Moreover, there is a n — r m dimensional variety of solutions, which is called the 
self-motion manifold (Burdick 1989). That is, for a given EE configuration, the 
manipulator can perform n — r m dimensional motions. Solving Eq. 1 in this case 
requires the use of the pseudoinverse of J, and its null space solution allows for 
incorporation of secondary tasks. As such obstacle avoidance is the most prevailing 
tasks. To it was proposed to define a repelling artificial potential representing 
obstacles and select such a solution from the self-motion manifold that minimizes 
this artificial potential and thus increases the distance to the obstacles (Khatib 1986; 
Khosla and Volpe 1988). An extension of more general symmetric obstacles was 
presented in (Muller 2004). An example is the redundant planar serial 5R manip- 
ulator in Fig. 12. The manipulator has n = 5 DOF; its task space is 3 -dimensional 
and thus possesses a degree of redundancy 2. That means that there is a 
2-dimensional manifold of self-motions. The terminal link can be navigated to 
pass through the gap between the obstacles without any other link colliding with the 
obstacles. The latter are represented by repelling potentials. Another secondary task 
is the maximization of kinematic dexterity (Muller 2003). 

The analytical or numerical computation of the inverse or pseudoinverse of 
the Jacobian is usually rather complicated, and it can be desirable to avoid it. To 
this end algorithms were proposed using the Jacobian transpose instead of its 
inverse. This needs careful stability considerations, however, as shown in 

(Antonelli 2009). 
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Fig. 12 A planar 5R serial 
manipulator can circumvent 
the obstacles 



Static Forces 

There are two goals of studying static forces with the manipulator at an equilibrium 
configuration in this section. One of them is to determine the relationship between 
the generalized forces which are composed by the vector of forces and torques 
applieds to the end effector and manipulator joints. The other is to find the 
transformation relationship of the generalized forces in different Cartesian frames. 

In the field of analytical mechanics, the work of a force on an object along a 
virtual displacement is known as the virtual work, of which the related principle can 
be used to analyze manipulator forces in static statements. 


Determination of Joint Forces/Torques 


A generalized force in Cartesian frame can be represented as 



where / and m , respectively, denote the force element and the torque element 
(Craig 2005; Paul 1984). 

Consider the end effector that makes a “least action” while the force F is applied 
to it. Then, the work done by F is a virtual work, denoted by 5W, and one can 
compute it by the following equation: 


SWe = F T 5D, 


(158) 
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where 



Suppose that r nx i is the vector of forces applied to each joint of the manipulator 
and the differential motions have taken place. Then, the increments of each joint’s 
displacement can be expressed in vector form, that is, 




Accordingly, the virtual work applied to each joint can be written as 



SWj = t T Sq. (161) 

In view of the principle of virtual work, when the manipulator keeps in a static 
situation, F and r are a pair of forces that realizes the equilibrium configuration. 
Therefore, it can be considered that the virtual work done by the two cancels each 
other out, i.e., 


which is essentially 


SW F = SWj , 


F T 8D = z r 8q. 

In view of the previous section of this chapter, one has 

SD = J(q)5q. 





Hence, 


F T J(q)8q = z T Sq, (165) 

and then, one can obtain the relationship, expressed by Jacobian, between F and t, 
that is, 


t = J T (q)F. (166) 

The generalized force applied to the end effector is expressed in the operational 
space, while that applied to the end effector is expressed to the joint space. As a 
result, Eq. 166 allows us to transform the forces’ representation from one space to 
the other. 
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Cartesian Transformation of Static Forces 


In some robotic applications, users may be interested in the static forces and torques 
applied to a particular point of the manipulator. However, it is easier to obtain the 
values of the generalized force of another part, which is fixedly connected with the 
attentive point, because it is more convenient to assemble a sensor probably. To this 
end, one may fix two frames to the two parts of the manipulator, respectively, and 
probe the static forces concerning one of them. And then, calculate the equivalent 
forces and torques represented in another frame (Craig 2005; Siciliano et al. 2009). 

Defining the link coordinates systems according to the rules shown in Fig. 2, one 
can suppose that f t _\ and m i _ l are forces and torques applied to the origin of the frame 
{ i — 1 }, while fi and m, are forces and torques applied to the origin of the frame { / } . 

For all the static forces and torques applied to the link i balance each other, one 
can describe the relationships in frame { / — 1 } as follows: 



Since the forces and torques are vectors in Cartesian frames, they can be 
rewritten in terms of their own frames, that is, 


i-i 


fi- 


i— 1 ni 


1 


l 


R'fi 


i - 1 


mi- 1 


i— 1 ni 


i-1 


, Rm- + tx 

V 1/ 


i-i 


fi - 1 



Furthermore, in considering static forces, all the joints of the manipulator can be 
seen locked to each other. To this end, the statics applied to any frame can be 
derived from that acting on another frame, for all the links between the two frames 
can be considered as a whole. And a more general relationship can be written as 


B f 
J B 


B 


m 


B 


aR A /a 

B R A m, + B P 


A 


A 


AORG 


which may also rewritten in matrix form 




where 




B 


P 


AORG 


x b a R 




(170) 


( 171 ) 
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Fig. 13 Frames of the 
interested point and the 
measured point 



For example, in the case of studying the static forces about the seeding robot 
which is introduced in Fig. 1, one may be interested in the forces applied to the tip 
of the end effector, when the seed is pressed into soil. However, it would be very 
difficult to measure the forces and torques of the tip directly. But by attaching strain 
gauges, one can easily calculate the forces applied to another part of the end 
effector, such as the dashed region in Fig. 13. 

Attach the frame {S} to the measured point, and it is easy to derive the 
transformation relationships between {5} and {T}, by utilizing the mechanical 

'T T 

structure of the robot. Hence, one can get S R and Psorg • And then, in view of 
Eq. 170, the static forces applied to the tip of the end effector can be calculated, 
that is, 



S R 0 

T p Y In Tp 

r SORG A S A S A 



(172) 


o 

where S F is the generalized forces applied to the origin of frame {S}, which has 
been measured or computed from the sensor information directly. 


Summary 


Researches about the kinematic and static characteristics are of basic contents in the 
field of robotic manipulator technology. Moreover, the study of manipulator veloc- 
ities and the statics, which is critical to the transmitting of manipulators’ motions 
and forces, requires a series of scientific methods, including modeling and analyz- 
ing special situations of the manipulators. 

In this chapter, basic concepts and notions are introduced, and the kinematic 
modeling is reviewed making use of recursive frame transformation. The relations 
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of velocities and forces between end effectors and manipulator joints are derived 
and explained for several different manipulators, serial as well as parallel manip- 
ulators. The special situations, so-called singularities, where the transformation 
properties change drastically are discussed. 

For the solving of inverse kinematics are very important for manipulator con- 
trolling, it is also discussed in this chapter. Furthermore, the concept and applica- 
tion of kinematic redundancy are introduced with examples. 
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Abstract 

This book chapter is about fundamentals of manipulator dynamics and their 
applications. Two approaches of manipulator dynamics, namely, recursive 
Newton-Euler approach and the Lagrange equations , are introduced and 
discussed. Examples are included to demonstrate their application in manipula- 
tor dynamics simulations and analysis. This book chapter can provide basic 
understanding on manipulator dynamics, which is applicable to manipulators, 
including serial and parallel manipulators. 
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Introduction 

Manipulator dynamics concerns with the actuation forces required to generate 
motion of manipulators. The manipulator dynamics is expressed in terms of 
equations of motion, which describe the relationships between the actuation, 
reaction forces, and the accelerations and motion trajectories of manipulators. In 
robotics, dynamics is a fundamental for the manipulator design, analysis, and 
control. 

Two types of problems are involved in the dynamics study, namely, the inverse 
and forward dynamics. The inverse dynamics calculates the required actuating 
forces at joints for a prescribed trajectories or known kinematic information 
(position, velocity, and acceleration). The forward dynamics, on the other hand, 
determines the joints’ and end-effector accelerations with applied actuating forces 
at j oints . 

The equations of motion can be established with several approaches. The most 
used approaches include recursive Newton-Euler approach and the Lagrange 
equations , which can be found in literatures (Murray et al. 1994; Angeles 1997; 
Featherstone and Orin 2000). In this chapter, the dynamics equation 
formulation with the above two approaches is introduced. The formulations are 
confined within the rigid-body dynamics. Readers with interests in flexible 
body manipulators are directed to literatures (Shabana 2013; Low 1987; 
Dwivedy and Eberhard 2006). Two examples are included to demonstrate 
their applications, one applicable to a serial manipulator and the other to a 
parallel robot. 


Recursive Newton-Euler Method 

The motion of a rigid body can be described by the Newton and Euler’s equations, 
the former for the translation and the latter for the rotation. Basically, the equations 
of motion of a manipulator can be established for each body of the manipulator on 
the basis of free-body diagrams. Alternatively, the equations of motion can be 
formulated through the recursive Newton-Euler method for serial manipulators, 
with which the free-body diagrams can be omitted. The formulation presented is 
based on the method introduced by Luh et al. (1980). This formulation is well suited 
for inverse dynamics problems, i.e., to find the driving torques with known kine- 
matics information. 

The recursive Newton-Euler method consists of two steps. The first step is an 
outward recursion, from the base to the end link, to calculate velocity and acceler- 
ation. The second step is an inward recursion, from the end link to the base, to 
calculate the forces and torques at joints. In the following formulation, the recursive 
method is applied to a manipulator with n rigid links. Coordinate systems are 
established by following Denavit-Hartenberg’s convention (D-H convention) 
(Denavit and Hartenberg 1955). 
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Outward Recursion to Calculate Velocities and Accelerations 

• • • 

In the inverse dynamics problems, the positions, velocities, and accelerations (6> z , Oj , <9 Z 

• • • 

for a revolute joint and d h dj , dj for a prismatic joint) of the joints are known. The 
outward recursion propa gates velocities, and accelerations of each link from the base 
to the end link. 

Revolute joints 

Referring to Fig. 1, the angular velocity of a re volute joint, propagated from link 
i— 1 to link /, is given by 


COi = (Oi-i + 6 (Li 



or in local components 


R i<o\ = R z -i<w- + 6 R/z • (2) 

where R, stands for the orientation matrix of link i and co • is the angular velocity of 
link i with respect to the ith frame (local coordinate system). Premultiplying both 
sides of Eq. 2 with R J and simplifying yield 


co 


“t" @i Z i 


( 3 ) 


■ 

where R z ] is the rotation matrix relating link /— 1 and link i. i! t is the unit vector 


T 

[0,0,1] parallel to the z z axis. Similarly, the propagation of linear velocity from link 
i— 1 to link i can be derived as 



f 


i + 1 


Axis z-1 


Axis z 


n i+ i 


*+i 


Axis z+1 


Fig. 1 The illustration of the symbols 
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i - 1 


where p, denotes the position vector of the frame of link i in the (/— l)th frame. 
The angular acceleration is calculated through 



The linear acceleration with respect to the link-attached coordinate system is 
given by 





where v- is the linear acceleration of link i in the ith frame. 

V 

The linear acceleration of the center of mass of link i is further calculated from 
the velocity and acceleration of the link 


ii’a = v' + (b\ x p',, + (o\ x («' x Pc;) (7) 

where denotes the position vector of the center of mass of link i in its own frame. 

I 

Prismatic Joints 

If the joints in question are prismatic, the angular velocities are identical for link 
i and link i— 1, which means (», = o),_ t . Expressing with local components yields 





/ 

/-I 


that is, 






/ 

i-1 


where matrix RJ_! is independent to rotations. 

Likewise, the local angular acceleration of link i is 






/ 

i-l 


Furthermore, the local linear velocity and acceleration become 


( 8 ) 

( 9 ) 




v' = R' , (v'_! +<»;_, x pi *) + di z' 


/ 


i-l 


1 


1 


+ <°i-\ XPi + fl», ! X (ffl, ! X P 


/-I 

l 



• • • 

+ 2 ( 0 ) x djZj + djZ) 


( 11 ) 

( 12 ) 


where d , is the displacement from axis .v,_ i to x, along the z, axis. 
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Inward Recursion to Calculate Forces and Torques 


With the velocities and accelerations found, the inertial force and torque applied to 
link i at its center of mass can be calculated by applying the Newton-Euler equations: 


f'. 


Ci 


miv' Ci 


(13) 


n 


/ 


Ci 


ljQ) • + CO • X I jCo'j 


(14) 


where m z is the mass of the link and I z denotes the inertia tensor of link i about its center 
of mass, calculated in a frame identical to the link’s coordinate system except the 
origin. In the equations above, all items are expressed in the local coordinate systems. 

The forces and torques acting on the links are calculated through the outward 
recursion. The joint torques that generate the forces and torques will be calculated 
through the inward recursion. 

Referring to Fig. 1, the inward recursion propagates the joint forces and torques 
from the end link to the base as 





Ci 


+ k + ifm 









where f/ and nf are the force and torque exerted on link i with respect to the zth 
frame. 

The actuating torque on a revolute joint can be calculated by 




IT / 

n i Z i 


For prismatic joints, the driving force is 





Forward Dynamics Problem 

In the forward dynamics problem, the joint accelerations are to be calculated with known 
joint torques. One method called articulated-body algorithm (ABA) (Featherstone 1983) 
is applicable to this type of problem with recursive approach. More dynamics modeling 
methods were reviewed and discussed in (Featherstone and Orin 2000). 


Lagrange's Equation 

The equations of motion for a manipulator can also be formulated with Fagrangian 
dynamics method. Two types of Fagrange’s equations can be formulated, one 
with dependent coordinates (1st kind) and the other with independent coordinates 
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C 2nd kind). This section starts with the Lagrange’s equation of the 1st kind. 
Hereafter, the vector q represents a set of n unknown dependent coordinates, m is 
the total number of independent kinematically constraint equations, and f = n — m 
is the number of dynamic degrees of freedom (de Jalon and Bayo 1994). The 
constraint conditions for the entire system are written in the following general form: 

0>(q) = 0 (19) 

Differentiating Eq. 19 with respect to time leads to 



d<t> dq 

dq d t 


= <£ q q = 0 



where matrix <I> q is the Jacobian matrix of the constraint Eq. 19. 

Let L = T — V be the Lagrangian of the system, where T = T( q, q) and V = V(q) 
are the kinetic and potential energies, respectively, and is the vector of 
generalized external forces acting along the dependent coordinates q. Considering 
a manipulator as a constrained multibody system, the Lagrange’s equation can be 
derived by generalizing the Hamilton’s principle by means of Lagrange multiplier 
technique(de Jalon and Bayo 1994). Skipping details of derivation, the equations of 
motion become 







which is the Lagrange’s equation of the 1st kind. In the equation, L is a 

'T 

m-dimensional vector of Lagrange multipliers. Item represents the generalized 
forces due to reaction forces at joints. 

With independent coordinates, the constraint Jacobian vanishes, and the term 
with Lagrange multipliers is dropped out. The Lagrange ’ s equation of the 2nd kind 
is thus obtained: 







The Lagrange’s equation of the 1st kind stands for a system of n equations for 
n dependent co-ordinates and m unknown Lagrange multipliers. To solve the 
equations, additional acceleration equations from the kinematic constraints are 
required, as presented below. 

The kinetic energy of a robotic system can be written as follows: 



where M(q) = M is the mass matrix of the system. The first and second terms in 
Eq. 21 can be expressed as below: 
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d (dL ^ ™ dL 

= Mq + Mq; 


d t \ dq 


<9q 


Lq — T q Vq 


(24) 


For the general case in which the kinetic energy depends on q, Eq. 21 becomes 


Mq + 0^ = Q CT +L q -Mq 


(25) 


On the other hand, differentiating the constraint Eq. 19 twice with respect to time yields 


O q q + <J> q q = 0 


(26) 


Let c 


• • 

<E> q q and n = + L q — Mq. Combining Eqs. 25 and 26 leads to: 


M 

<J> 7 

q 


• • 

q 


n 


0 




c 


(27) 


Equation 27 can solve simultaneously the accelerations and the Lagrange 
multipliers. 


Examples 

Two examples are included to apply the equations of motion to robotic manipula- 
tors. In the first example, a lightweight robotic arm consisting of five revolute joints 
developed at Aalborg University (AAU), Denmark, is considered. In the second 
example, the dynamics of a spherical parallel manipulator is presented. 

Example I: A 5-dof Lightweight Robotic Arm 

The lightweight robotic arm is demonstrated in Fig. 2a. Following the D-H con- 
vention, Cartesian coordinate systems are attached to each link of the manipulator, 
as shown in Fig. 2b. The D-H parameters of the manipulator are listed in Table 1. 

Dynamics Modeling 

The kinematics of the robotic arm can be found in (Zhou et al. 2011). This chapter 
includes here only the dynamics formulation. 

Jacobian matrix The joint angular velocity can be calculated with the Jacobian 
matrix 


0 = 3 *Vrf 


(28) 


where 6 = { 61,62 ... ,6 


-1 T 


n 


denotes an ^-dimensional ( n denotes the number of dof) 


vector of the joint angular velocities, J is the Jacobian of the robotic arm, and v e /the 
velocity of the end-effector. 
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Fig. 2 AAU 5-dof robotic arm: (a) a prototype and (b) coordinate systems 


Table 1 D-H parameters 
of the 5-dof robotic arm 


Joint i 

CXj 

Qi 

dj 

e, 

1 

jt/2 

0 

ht 

6i 

2 

0 

h 

0 

02 

3 

n! 2 

0 

0 

e 3 

4 

—n!2 

0 

h 

04 

5 

n/2 

0 

d\ 

05 


For a revolute joint, the Jacobian matrix can be calculated by (Tsai 1999) 




ni 



Z /-1 X 
Z /-1 



where z t _i and p,_i are given by 




R'-iq,! + P, 
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rp 

where q,_! = [a, cos 0 { , a, sin 0 h d z ] . The local coordinates of the end-effector are 

defined as p„ = [0, 0, 0] . When the desired end-effector velocity \ e f is given, the 

joint angular velocity can be solved by Eq. 28. 

• • 

The joint angular acceleration 0 can be calculated through time derivative of the 
velocity 0. 

0 = j l V ef + y V Vef (31) 


Simulation Results 

The dynamics of the 5-dof robotic arm was simulated with a program implemented 
in MATLAB. The trajectory of the end-effector in the global coordinate system is 
defined as x e f(t) = 5 + 400 (1 — cos(0), y e /f) = —990 + 800(1 — cos(^/2)), and 
z e ff) = 280 + 250(cos(^/2) — 1), all with unit of mm, which leads to 



400 sin (t) 

400 sin (t/2) 

— 125 sin (t/2) 


T 


The Euler angles for the end-effector are given as i/a = [0, cos(^/180), 0] , 
following ZXZ convention. The rotation implies the end-effector remains horizontal 
during the prescribed motion. For the prescribed motion, the rotation matrix 
R and angular velocity co can be readily found. The velocities of the end-effector 

-i T 


are \ ef 


T 

ef 


x;r, co 


T 


, from which the joint angular velocities and accelerations are 


calculated. 

To implement the Newton-Euler method to solve the inverse dynamics problem, 
the equations in section “Recursive Newton-Euler Method” are summarized in a 
recursive manner: 

Outward recursion: / : 1 — > 5 


co' = 

- R i _\Co i _i + Oi^i 

(32a) 

co' -- 

- K l i _ 1 co i _ l + OjZj + R|_ 1 6U / _ 1 x 0 i z i 

(32b) 

v' = 

■- Rj_i(vj_i +©;_! X Pp 1 +Co'_ 1 X X Pp 1 )) 

(32c) 

V C, 

P_! +<»;_! x Pc ( +©;_i x (<»;_! x pp) 

(3 2d) 

fa = 

- 

(32e) 

n 'a 

I \(b [ + co'j X I jCo'j 

(32f) 


Inward recursion: / : 5 — » 1 





Ci 


+ R ■ , i f • 


i+ \'i+\ 
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(32h) 




rT / 

n , h 



The joint angular velocities and accelerations are solved through Eq. 28, with 
results displayed in Fig. 3. Let the payload at the end-effector of the robot be 5 kg, 
which implies f 6 = 5g N, n 6 = 0, where g is the vector of gravity acceleration. 
Through the modeling of inverse dynamics of Eqs. 32g-32i, the joint torques are 
solved, as shown in Fig. 4. 


Example II: A Spherical Parallel Manipulator 

In this example, the Lagrange’s equation is illustrated with a spherical parallel 
manipulator (SPM) (Gosselin and Angeles 1989). The SPMs are of closed- 
kinematic chain. The Lagrange’s equation of the 1st kind is adopted for the 
dynamics modeling of this manipulator. 

A general SPM is shown in Fig. 5a with the parameterized zth leg in Fig. 5b. The 
zth limb consists of three revolute joints, whose axes are parallel to the unit vectors 
u ? , v„ and w All three limbs have identical architectures, defined by angles and 
a 2 . Moreover, and y define the geometry of two triangular pyramids on the base 
and the mobile platforms, respectively. The origin O of the base coordinate system 
xyz is located at point O. The z axis is normal to the bottom surface of the base 
pyramid and points upward, while the y axis is located in the plane made by the 
z axis and Ui. 


Kinematic Modeling 

Under the prescribed coordinate system, unit vector u, is derived as 



— sin rjj sin y 


cos rjj sin y 




where = 2(7 — 1)^/3, z = 1, 2, 3. 

The unit vector v z of the axis of the intermediate revolute joint in the zth leg is 
expressed as: 



— s^-sycai + (crjjSOj — srj^ycO^sai 
c^-sycai + (srjjSOi + crj^ycO^sai 

—cycai + sycOjSai 



where s(-) = sin(-) and c(-) = cos(-). 

The unit vector w z of the top revolute joint in the zth leg is a function of the 
orientation of the mobile platform (MP), namely, 
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Fig. 3 Angular velocities and accelerations of the robotic arm joints 


where Q is the rotation matrix that carries the MP from its reference orientation 
to the current one and w, is the unit vector of the axis of the top revolute joint 
in the ith leg when the mobile platform is in its reference orientation, which is 
given as 
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Fig. 4 Joint torques of the robotic arm 



Fig. 5 


A general SPM and the coordinate system, where only one leg is shown for clarity 


w 


* 

l 


sin rjj sin cos 77, sin fi cos ] 


T 



Kinematic Jacobian Matrix 

T' 

Let (o = [co x , co y , co z \ denote the angular velocity of the mobile platform; the 
velocity equation via the zth leg can be stated as 
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• • 

<0 = OjUi + 1 if {Vi + &w i 


(37) 


To eliminate i//, and <*■, dot-multiplying Eq. 37 on both sides with v, x w, yields 


(V; X W /) • CD = (u,- X V,) • W,(9, 

from which the SPM velocity equation is obtained as 


(38) 


with 


where 0 


Ao) = B0 


A 


Vl X Wi V 2 x W 2 V 3 x W 3 


T 


(39) 


(40a) 


B = diag [ (ui x vi) • wi (112 x V2) • W2 (U3 x v 3 ) • W3 


(40b) 


. i T 


@ 1 , 02,03 


. Matrices A and B are the forward and inverse Jacobian 


matrices of the manipulator, respectively. The kinematic Jacobian matrix J of the 
manipulator can be expressed as follows as long as matrix A is not singular: 


J = B -1 A 


. . . iT • 

Ji h h 5 Jz 


V/ X W / 


Ui X V/ • w i 


(41) 


Equation 38 is thus rewritten for the single joint velocity as 


T 


0i = j, (O 


(42) 


Velocity and Acceleration Analysis 

The motions of the link and mobile platform are shown in Fig. 6. The angle rates 

* 1 T 

(p = [(p, 0, 6\ and the angular velocity co are linearly dependent, namely, co = 
Differentiating the equation with respect to time yields 


• • • 
CO = \\i(j) + \\i(p 



Matrix 'F is dependent on the rotations. For example, a rotation with the Euler 
convention ZY Z of Q = R Z (<^)R V (6 > )R Z (<7— </>), matrix 'F is 



—sOccj) 
—sOscp 
1 -c 6 


— s <p s6c(j) 

c (p sOs (p 

0 ccp 



The velocity \j/ of the intermediate joint of zth leg is found by making use of 

• • 

Eq. 37 to eliminate 0j and <*•. Dot-multiplying Eq. 37 on both sides with u, x w, 
leads to 


1868 


S. Bai et al. 



Fig. 6 The movement of the mobile platform and links 


(ll; X W /) • 0} = X W;) • V/ OT Iff = 





The angular velocity of the distal link in the zth leg in the reference frame xyz is 

found as = Of u, + v,-. Let pu denote the corresponding angular velocity in the 

local frame xj/Z/, which means 





with 



+ w j 
Vi + w j 



Vj + Wj 
Vi + Wi 



Vj + Wj 

Vi + W/ 



which yields 



Dynamic Modeling 

The Lagrange’s equation for the SPM is expressed as 


d / dL 




dt \dqj dq 


(49) 
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where q = [0 1? 0 2 , 0 3 , 0 , <r] r , = [r r , 0] r G M 6 is the vector of external 

'T 

forces, and vector t = [r 1? t 2 , t 3 ] characterizes the actuator torques. Moreover, 

nr 

k = [ki, k 2 , ^ 3 ] is a vector of Lagrange multipliers. Matrix is the system’s 
constraint Jacobian, which can be found from the velocity Eq. 39, namely, 


B0 — Am = 






that is, 


C,q = 0 



with C q = [B-A'F]. 


Lagrangian of the Mobile Platform 

The local frame x p y p z p of the MP is established with the origin located at point P, 
i.e., the center of mass of the MP of position vector p. Henceforth, the Lagrangian 
of the mobile platform is obtained as 




CO I p co 


— m pg T p 



where \ p denotes the local inertia tensor of the mobile platform and co' its local 

T 

angular velocity. Moreover, g = [0, 0, 9.81] . 


Lagrangian of a Single Leg 

The Lagrangian of the zth leg is derived as 



where In is the proximal link’s mass moment of inertia about u, and I /2 is the distal 
link’s local inertia tensor about point O. Moreover, h z and k, indicate the position 
vectors for the centers of the mass of the proximal and distal links, respectively. 

Substituting the Lagrangian L p and L h i = 1, 2, 3, into Eq. 49, the inertia matrix 
can be derived as 



1870 


S. Bai et al. 


Table 2 Parameters of the SPM and the initial simulation condition 


a x [deg] 

a 2 [deg] 

P [deg] 

Y [deg] 

[(p, 6, a] [rad] 

[0i,e 2 ,0 3 ] [rad/s] 

m [Nm] 

60 

80 

75 

0 

[ 0 , tt/6, 0 ] 

[- 8 , -7, -9] 

[ 0 . 1 , 0 . 1 , 0 . 1 ] 


Table 3 Mass and inertia properties of the SPM 


Mobile platform 

Proximal link 

Distal link 

m p [kg] 

Ip [ 10 4 kg m 2 ] 

m i [kg] 

hi [10 3 kg m 2 ] 

m n [kg] 

l l2 [10 4 kg m 2 ] 

0.332 

[3.855 3.855 
7.688] 

0.123 

1.169 

0.114 

[1.852 0.089 
1.921] 


and the other terms also can be derived. Moreover, differentiating Eq. 51 with 
respect to time yields 


with 



Cfltjj = c 


q 


"T "T -IT 

e (/) 


C«q 


B-A'F-A'F 


q 


where 


and 


A 


WWW 

JA1 JA2 J A3 . 


T 


B = diag [j B i j B2 j B 3 _ 


j Ai = ^( u < x v/) x w ,• + v,- x (w x W/) 


j Bi = 9, [u, x (u,- x v,-)l • w, + (u,- x v,-) • (w x w /) 


Combining Eqs. 49 and 56 leads to 


(56) 

(57) 

(58a) 

(58b) 

(59a) 

(59b) 


[M(q) CH 


• • 

q 


"r — M(q)q 4- r q — V q 

c, 0 J 




~C q q J 


With an external moment vector m, the actuator torques are expressed as 





Simulation Results 

The previously developed dynamic model is applied to a co-axial SPM (Bai 2010), 
whose properties are given in Tables 2 and 3, where inertia tensors l p and I /2 contain 
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time [s] 

Fig. 7 The simulation of inverse dynamics: actuator torques 


only diagonal entries. Simulations were conducted for motion of constant angular 
accelerations at actuating joints, their initial conditions being specified in Table 2. 
The results of the actuating torques are shown in Fig. 7. The results agree well with 
the simulation results obtained from CAE software, namely, the MSC Adams. 


Summary 

This book chapter introduces the fundamental of manipulator dynamics, with focus 
on the recursive Newton-Euler approach and the Lagrange equations , which are 
approaches mostly common used in robot dynamics analysis and control. Both 
inverse and forward dynamic problems are discussed. Applications to manipulators 
are demonstrated. 

In solving the dynamics equation, numerics problems may arise. One is compu- 
tational efficiency. The recursive approach presented in this chapter is quite effi- 
cient, while the Lagrange’s equations, as a closed-form equation, implies high 
computational cost in finding solutions, which uses numerical methods, for exam- 
ple, Newton-Raphson method. Other problems include the singular Jacobian and 
the instability of the solution, etc., among others. The Jacobian matrix can become 
singular when a singular configuration is reached or redundant constraints are 
included, which may lead to a crash of simulation or a large error. Instability 
takes place in numerical integration of the equation of motion, when the roundoff 
errors increase with time, which can lead to the constraint equations not satisfied. A 
method to improve the stability is the Baumgarte stabilization method, which is 
introduced in (de Jalon and Bayo 1994). 
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The approaches presented in this chapter are closely relevant to multibody 
dynamics, which was extensively studied in the last century. With the advances 
of robotics in the twenty-first century, manipulator dynamics has significantly been 
expanded, not limited to multibody dynamics, but get involved with other subsets of 
the dynamics. A new development among others is the continuum manipulators, 
where continuum mechanics is the major theory applicable (Walker 2013). The new 
development of manipulator dynamics, which is not discussed in this chapter, 
presents new opportunity and challenges for robotics. 
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Abstract 

Trajectory planning consists in finding a time series of successive joint angles 
that allows moving a robot from a starting configuration towards a goal config- 
uration, in order to achieve a task, such as grabbing an object from a conveyor 
belt and placing it on a shelf. This trajectory must respect given constraints: for 
instance, the robot should not collide with the environment; the joint angles, 
velocities, accelerations, or torques should be within specified limits, etc. Next, 
if several trajectories are possible, one should choose the one that optimizes a 
certain objective, such as the trajectory execution time or energy consumption. 
This chapter reviews methods to plan trajectories with constraints and optimi- 
zation objectives relevant to industrial robot manipulators. 
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Introduction 

For a robot to accomplish a given high-level task - such as grabbing an object on a 
conveyor belt and placing it on a shelf - this task must be translated into low-level 
commands understandable by the robot operating system. This whole process, 
known as “motion planning,” can be broken down into a number of steps, as 
illustrated in Fig. 1. Note that, for clarity, the different levels and steps were 
represented in a sequential, linear fashion. In practice, however, several levels or 
steps might sometimes be addressed at the same time: for instance, when planning 
at the “trajectory” level, one might have to take into account the robot torque 
bounds, which appear at the “actuator commands” level. 

Until recently, motion planning has mostly been performed by human operators: 
for a given task, a skilled, experienced operator would manually find the commands 
(using, e.g., a teach pendant and visual feedback) that would allow the robot to 
achieve the task. However, this approach is time-consuming and usually produces 
suboptimal trajectories. Further, if the task changes (even slightly), the whole 
tedious teaching process has to be done over again. 

Impressive theoretical advances in the field of motion planning in the past 
few decades have brought about a new picture: it is now possible for a computer 
to find, in minutes or seconds, optimal commands for a robot to achieve a given 
task, even in very challenging, cluttered environments. Several companies, some 
of them spinning off from academia (e.g., Siemens Kineo, France/Germany or 
Mujin Inc., Japan), are developing software solutions that are in use in actual 
factories. 

The current chapter discusses trajectory planning , which is a part of motion 
planning. Specifically, we shall focus on the problem of finding a trajectory of the 
joint angles between given starting and goal joint angles, see Fig. 2. It is thus 
assumed that appropriate computations have been carried out before (e.g., grab 
synthesis, inverse kinematics, etc.) or will be carried out after (e.g., robot instruc- 
tion synthesis, command synthesis, etc.) this stage. 

Two important concepts, namely, constraints and optimization , are essential in 
trajectory planning. Constraints restrict the range of motions that the robot can 
execute. One can classify them into two categories. First, geometric constraints are 
the constraints that can be expressed solely in terms of the robot joint angles: these 
include bounds on the joint angles, avoidance of self-collision and of collision with 
obstacles, etc. These constraints can thus be fully taken into account in the path 
planning step. Second, kinodynamic constraints are the constraints that include 
higher-order time derivatives of the joint angles, such as bounds on the joint 
velocities, accelerations, torques, or motor current inputs. These constraints cannot 
be taken into account by path planning alone and must be considered at the 
trajectory level. 

Optimization comes into play when there are more than one path or trajectory 
that allow achieving the task while satisfying the constraints. It is then interesting to 
select the path or trajectory that optimizes a given objective. At the path level, one 
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Work-space task 


Joint-space task 


Joint-space path 


Joint-space trajectory 


Robot-language instructions 


Actuator commands 



Inverse kinematics 



Time parameterization 


Instruction synthesis 


Command synthesis 


Fig. 1 From task to commands. The work-space task may consist in, e.g., bringing the end effector 
of the robot manipulator from a starting position and orientation in space, where it has just grabbed 
the object, towards a goal position and orientation, where it can safely release the object onto the 
shelf. Inverse kinematics finds the manipulator joint angles that allow achieving the desired 
end-effector position and orientation. The joint-space task would then consist in connecting the 
starting joint angles and the goal joint angles. Path planning finds a collision-free path, that is, a 
continuous set of intermediate joint angles between the starting and goal joint angles, such that the 
manipulator does not collide with the environment or with itself at any of the intermediate joint 
angles. Time parameterization consists in finding the time stamps for the joint angles along the path, 
while respecting, e.g., the torque bounds and/or optimizing the traversal time or the energy 
consumption. Finally, instruction synthesis translates the desired trajectory into a set of instructions 
in the robot language, and command synthesis converts these instructions into low-level commands, 
such as the electrical current fed into the motors. Usually, this last step is performed internally by 
the robot controller provided by the robot manufacturer and is not accessible to the end user 



b Joint angle 



Fig. 2 Trajectory planning for a two-link manipulator, (a) The manipulator and the obstacles 
( black rectangles ) in the work space, (b) Trajectories connecting q star t and q goal in the joint space 
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might be interested in finding the shortest path in joint space or in the end-effector 
space. At the trajectory level, other optimization objectives, such as minimum time, 
maximum smoothness, minimum energy, etc., can be considered. Note that because 
of kinodynamic constraints, a shortest path might not correspond to a minimum- 
time or minimum-energy trajectory. 

Scope of this Chapter From the above discussion, it is clear that there exists a 
very large variety of trajectory planning problems (Hwang and Ahuja 1992; 
LaValle 2006), in terms of robot structures, constraints, optimization objectives, 
etc. The present chapter focuses on problems arising from the industry: we shall 
specifically consider serial robotic manipulators, constraints arising from obstacle 
avoidance and from bounds on the joint angles, velocities, accelerations and 
torques, and optimization objectives related to the trajectory execution time. 
Also, we shall concentrate more particularly on the methods that are actually in 
use in the industry rather than cover the whole panorama of existing planning 
methods. 

Organization The remainder of this chapter is organized as follows. In section 
“Path and Trajectory Planning,” some methods for geometric path planning and 
kinodynamic trajectory planning are presented. The focus of that section is on 
finding one feasible path or trajectory - optimization is not considered. In section 
“Path and Trajectory Optimization,” methods for path and trajectory optimization 
under geometric and kinodynamic constraints are reviewed. As stated above, we 
shall mostly consider the minimization of trajectory execution time. Finally, section 
“Conclusion” offers a brief conclusive discussion. 


Path and Trajectory Planning 
Important Concepts 

We first introduce the important concept of configuration space (Lozano-Perez 
1983) in the context of serial robotic manipulators. 


Definition 1 (Configuration space). A set of joint angles q = (0 U . . ., 0 n ), where n 
is the number of joints of the robot manipulator, is called a configuration. The set of 
all possible joint angles is called the configuration space and denoted C . For 
instance , C=[— n, n] n for a manipulator with n revolute joints. If a robot is in 
collision with the environment or with itself at configuration q, then q is called a 
collidingconfiguration. The subset of C consisting of all non-colliding configura- 
tions is called the free space and denoted C free . 

It is assumed that the starting configuration q start and the goal configuration q goa i 
are given. The problem thus consists in finding a path or a trajectory (see definition 
below) connecting q start and q goa i and respecting the constraints. 
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Definition 2 (Path and trajectory). Formally, a path can be defined as a continuous 
function P 


P : [0, 1] C 

s q(s). 

For a path connecting q start and q goa b one thus has 

q(o) = q st arP q(i) = q g0 ai- 


A path can be regarded as a geometric object devoid of any timing information. 
Next, a time parameterization of P is a strictly increasing function 

s : [0, T] -► [0, 1] 

t s(t) 

with s(0) = 0 and s(T) = 1 . The function s gives the position on the path for each 
time instant t. 

The path P endowed with a time parameterization s becomes a trajectory IT 

n= [o,n 

with T being the duration of the trajectory. Note that the same path P can give rise to 
many different trajectories fl. 


Path Planning Under Geometric Constraints 

There exists a large variety of approaches to path planning: combinatorial methods, 
potential field methods, sampling-based methods, etc. 

Combinatorial methods make use of algebraic geometry and mathematical 
programming tools to provide complete algorithms to the path planning problems. 
“Complete” means that the algorithm can decide in finite time whether the problem 
has a solution and provide a solution when it exists. An example of a combinatorial 
method is the cell decomposition method: when the obstacles are polygonal, the 
free space C free can be decomposed into cells (e.g., triangular cells by a Delaunay 
triangulation). Solving the problem then boils down to finding a sequence of 
adjacent cells such that q start and q goal belong to respectively the first and the last 
cell of the sequence (see, e.g., LaValle 2006, Sect. 6.3.2). While combinatorial 
methods are valuable to obtain theoretical results (for instance, they allow proving 
that the general motion planning problem is NP complete), they are too slow to be 
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Fig. 3 Roadmap-based methods, (a) Grid search. The starting and goal configurations are 
represented by respectively the blue square and the yellow star. Green disks represent sampled 
configurations lying in C free , while red disks represent obstacle configurations, (b) Probabilistic 
roadmap. Same legends as in (a) 


used in practice, especially in high-dimensional problems. Another issue is that 
these methods require an explicit representation of the obstacles, which is very 
difficult to obtain in most practical problems. 

The potential field method was introduced in (Khatib 1986). This method 
constructs a potential field which is high near the obstacles and low at the 
goal configuration. Steering towards the goal configuration while avoiding the 
obstacles can then be naturally achieved by letting the robot configuration 
evolve in that potential field. This method is interesting in that it allows real- 
time control. However, the issue of local minima - the robot may get trapped in the 
local minima of the potential field - prevents its use in highly cluttered 
environments. 

Sampling-based methods are perhaps the most efficient and robust, hence prob- 
ably the most widely used methods for path planning currently. The next two 
sections review these methods in detail. 

Grid Search and Probabilistic Roadmap (PRM) 

A first category of sampling-based methods requires building, in a preprocessing 
stage, a roadmap that captures the connectivity of C f ree • A roadmap is a graph 
G whose vertices are configurations of C free . Two vertices are connected in G only if 
it is possible to connect the two configurations by a path entirely contained in C free . 
Once such a roadmap is built, it is easy to answer the path planning problem: if 
(i) q start can be connected to a (nearby) vertex u and (ii) q goa i can be connected to a 
(nearby) vertex v and (iii) v and v are in the same connected component of G (in the 
graph-theoretic sense), then there exists a collision-free path between q start and q goa i 
(see Fig. 3). 

Methods for building the roadmap fall into two families: deterministic or 
probabilistic. A typical deterministic method is the Grid Search , where the config- 
uration space C is sampled following a regular grid, as in Fig. 3a. A sampled 
configuration is rejected if it is not in C free . Next, one attempts to connect every pair 




52 Trajectory Planning 


1879 


of adjacent configurations (adjacent in the sense of the grid structure) to each other: 
if the straight segment connecting the two configurations is contained within C free , 
then the corresponding edge is added to the graph G. 

In the Probabilistic Roadmap method (Kavraki et al. 1996), instead of following 
a regular grid, samples are taken at random in C free , see Fig. 3b. Since there is no a 
priori grid structure, several methods exist for choosing the pairs of vertices for 
which connection is attempted: for instance, one may attempt, for each vertex, to 
connect it to every vertices that are within a specified radius r from it. 

Advantages of the Roadmap-Based Methods The strength of the roadmap- 
based methods (both deterministic and probabilistic) comes from the global/ 
local decomposition - the difficult problem of path planning is solved at two scales: 
the local scale , where neighboring configurations (adjacent configurations in 
Grid Search, configurations within a small distance r of each other in the Probabi- 
listic Roadmap) are connected by a simple straight segment; and the global scale , 
where the graph search takes care of the global, complex connectivity of the free 
space. 

Note also that it is not necessary for these methods to have an explicit represen- 
tation of C free - one only needs an oracle which says whether a given configuration is 
in C free . To check whether a straight segment is contained within C free , it suffices to 
call the oracle on every configuration (or, in practice, on sufficiently densely 
sampled configurations) along that segment. 

These methods also possess nice theoretical guarantees. Specifically, it can be shown 
that the Grid Search is resolution complete , which means that if a solution exists, the 
algorithm will find it in finite time and for a sufficiently small grid size. Similarly, it can 
be shown that the Probabilistic Roadmap method is probabilistically complete , which 
means that, if a solution exists, the probability that the algorithm will find it converges to 
1 as the number of sample points goes to infinity (LaValle et al. 2004). However, the 
converge rate of both methods is difficult to determine on practical problem instances. 

Regarding the comparative performances of the deterministic and probabilistic 
approaches, it has been shown both theoretically and practically that randomness is 
not advantageous in terms of search time. However, it can be argued that probabi- 
listic methods are easier to implement (LaValle et al. 2004). 

Rapidly Exploring Random Trees (RRT) 

The methods just discussed require building the entire roadmap in the 
preprocessing stage before being able to answer any query [a query being a pair 
(q start Qgoai) to be connected]. In applications where only a single or a limited 
number of queries are needed, it may not be worthy to build the whole roadmap. 
Single query methods, such as the Rapidly Exploring Random Trees (RRT), are 
much more suited for these applications (Lavalle et al. 1998). 

Specifically, RRT iteratively builds a tree (see Algorithm 1), which is also a 
roadmap, but which has the property of exploring “optimally” the free space. The 
key lies in the EXTEND function, which selects the vertex in the tree that is the 
closest to the randomly sampled configuration (see Algorithm 2 and Fig. 4). Thus, 
the probability for a vertex in the tree to be selected is proportional to the size of its 
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9 start 


9 rand 


new 


near 


Fig. 4 Illustration for the EXTEND function. The tree is rooted at the black disk. Red disks and 
plain segments represent respectively the vertices and edges that have already been added to the 
tree. EXTEND attempts at growing the tree towards a random configuration q ran d. For this, q near is 
chosen as the vertex in the tree that is the closest to q rand . The tree is then grown from q near towards 
qrand? stopping at q new , which is at the specified radius r from q near 


Voronoi region, causing the tree to grow preferably towards previously under- 
explored regions (Lavalle et al. 1998). 

Algorithm 1 BUILD_RRT 

Input : A starting configuration q start 
Output : A tree T rooted at q start 

T. INITIALIZE (q start ) 
for rep = 1 to N maxre pdo 

grand <- RANDOM_CONF I G ( ) 

EXTEND ( T , q rand ) 

end for 


Algorithm 2 EXTEND 

Input : A tree T and a target configuration q 

: Grow T by a new vertex in the direction of q 



near 


q 

if q 


4 


NEAREST NEIGHBOR (T,q) 


new 


<- STEER (q 
Add vertex q 
Add edge [q 


near 


new 


, q) succeeds then 

to T 


near / 


q 


new 


] to T 


end if 


Note: STEER(q near , q) attempts making a straight motion from q near towards q. 
Three cases can happen: (i) q is within a given distance r of q near and [q rand , q] is 
collision-free, then q is returned as the new vertex q new ; (ii) q is farther than a given 
distance r of q near , and the segment of length r from q near and in the direction of q is 
collision-free, then the end of that segment is returned as q new (see Fig. 4); (iii) else, 
STEER returns failure 

Finally, to find a path connecting q start and q goal , one can grow simultaneously 
two RRTs, one rooted at q start and the other rooted at q goal , and attempt to connect 
the two trees at each iteration. This algorithm is known as bidirectional RRT 
(Kuffner and Lavalle 2000). In practice, bidirectional RRT has proved to be easy 
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to implement, yet extremely efficient and robust: it has been successfully applied to 
a large variety of robots and challenging environments. 


Trajectory Planning Under Kinodynamic Constraints 

As mentioned earlier, kinodynamic constraints involve higher-order derivatives of the 
configuration and cannot therefore be expressed in the configuration space. One approach 
to kinodynamic planning thus consists of transposing the path planning methods (such as 
PRM or RRT) to the state-space A, that is, the configuration space C augmented with 
velocity coordinates, where the kinodynamic constraints can be appropriately taken into 
account (Donald et al. 1993; LaValle and Kuffner 2001; Hsu et al. 2002). One drawback 
of this approach is that moving into the state space is associated with a twofold increase in 
the dimension of the search space: if C is of dimension n, then A is of dimension 2 n. Since 
planning algorithms usually have complexities that scale exponentially with the dimen- 
sion of the search space (Hsu et al. 2002), a twofold increase in the dimension may make 
state-space kinodynamic planning algorithms impractical even for relatively small values 
of n (Shiller and Dubowsky 1991). 

A second approach avoids the complexity explosion by decoupling the problem: 
first, search for a path in the robot configuration space C under geometric constraints 
(using path planning methods discussed earlier) and, in a second step, find a time 
parameterization of that path that satisfies the kinodynamic constraints (Bobrow 
et al. 1985; Kuffner et al. 2002). The drawback here is that the path found in the first 
step may have no time parameterization at all that respects the kinodynamic 
constraints. This drawback may be addressed by introducing an Admissible Veloc- 
ity Propagation scheme, which allows checking, at each PRM/RRT extension, the 
existence of an eventual admissible time parameterization (Pham et al. 2013). 


Path and Trajectory Optimization 

As stated in the Introduction, we shall focus here on minimizing trajectory execu- 
tion time, which is the most common optimization objective in the industry. When 
planning at the path level , minimizing execution time can be equated to minimizing 
the path length, which is covered in section “Minimizing Path Length.” At the 
trajectory level , however, shortest paths may not correspond to minimum-time 
trajectories when kinodynamic constraints come into play. Section “Minimizing 
Trajectory Duration” reviews methods to specifically minimize trajectory duration. 


Minimizing Path Length 
Asymptotically Optimal Methods 

While the roadmap-based methods presented in section “Grid Search and Probabi- 
listic Roadmap (PRM)” address the feasibility problem, i.e., the problem of finding 
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one feasible path, it is straightforward to modify them to include path length 
optimization. Indeed, once q start and q goa i have been connected respectively to 
vertices u and v in G, classical graph algorithms, such as Dijkstra’s search (Dijkstra 
1959) or A search (Hart et al. 1968), can be applied to find the shortest path 
between u and v in G. In turn, the path lengths between q start and q goa i are 
minimized. 

It can be shown that these algorithms are asymptotically optimal in the sense that 
the path length of the solution returned by these algorithms converges to the minimal 
path length as the grid size - in the case of Grid Search - goes to 0, or as the number 
of sampled points goes to infinity - in the case of the Probabilistic Roadmap. 

Regarding the single-query problem, it has been shown that the RRT method, if 
modified in the same way as above, would not yield an asymptotically optimal 
algorithm. However, RRT - a modified version of RRT where the EXTEND function 
tries to extend from not only the nearest vertex but from a specific number of nearest 
vertices - possesses the asymptotic optimality property (Karaman and Frazzoli 2011). 

Path Shortcutting 

While the RRT algorithm just mentioned has the nice property of being asymptoti- 
cally optimal, it is too slow to be used in practice. For single-query problems, it turns 
out that a two-step approach consisting of (i) finding one path using RRT, followed by 
(ii) post-processing this trajectory by repetitively applying shortcuts realizes a good 
trade-off between path quality and computation time (Geraerts and Overmars 2007). 

The shortcut method is presented in Algorithm 3. It is simple to implement, yet 
very effective. A modified version, called partial shortcut , consists in shortcutting 
one joint angle at a time, can yield even higher-quality paths but also requires a 
longer computation time (Geraerts and Overmars 2007). 

Algorithm 3 PATH_SHORTCUT 
Input: A collision- free path P 
Output : A shorter collision- free path 
for rep = 1 to N m axrep do 

Pick two random points q 2 q 2 along the path 
if [qi,q 2 ] is coll ision-free then 

Replace the portion of P between q 2 and q 2 by [q l7 q 2 ] 

end if 
end for 

Note: [q 1? q 2 ] denotes the straight segment between qi and q 2 in the joint space. 


Minimizing Trajectory Duration 
Fixed-Path Time Minimization 

Once a collision-free path has been found, one can give sample configurations 
along the path as input to the robot. However, for most modem robot manipulators, 
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execution time can be greatly reduced if each sample configuration is accompanied 
with a time stamp , i.e., the time instant when the robot should reach that configu- 
ration. This requires time parameterizing the path, that is, transforming it into a 
trajectory. 

More specifically, with the notations introduced in Definition 2 of section 
“Important Concepts,” minimizing the traversal time of a given path P is to find 
the time parameterization s such that T is minimal and that the parameterized 
trajectory (q(s( 0 ))/e[o, 7 ] satisfies given kinodynamic constraints. 

When the constraints are bounds on the joint torques, a very efficient solution to 
this problem, based on Pontryagin’s maximum principle, was proposed in the 1980s 
(Bobrow et al. 1985; Shin and McKay 1985) and has been continuously improved 
until today (Pfeiffer and Johanni 1987; Slotine and Yang 1989; Shiller and Lu 1992; 
Pham 2013). This method can also be applied to other types of kinodynamic 
constraints such as gripper and payload constraints (Shiller and Dubowsky 1989) 
or bounds on the joint velocities and accelerations (Kunz and Stilman 2012). More 
recently, another family of algorithms to solve this fixed-path time minimization, 
based on convex optimization, was proposed in (Verscheure et al. 2009; Hauser 

2013). 

Global Time Minimization 

A number of exact (Geering et al. 1985; Meier and Ryson 1990) and approximate 
(Yang and Slotine 1994) methods exist to directly find the time-optimal trajectory 
subject to torque bounds between two configurations. However, these methods are 
only practical for low-dimensional problems and cannot deal with geometric 
obstacles. 

To take into account both geometric and kinodynamic (e.g., torque bounds) 
constraints, an effective approach consists of generating a large number of paths 
and on each path, apply the fixed-path time minimization described in the previous 
section. In (Bobrow 1988), the author considers a family of paths consisting 
of Bezier curves. A path of this family can be represented by a set of control points 
p = (/q,..., p n ). One can then define the cost C(p) by the duration of the time- 
minimal parameterization of the Bezier curve represented by p. Finally, one 
can search for the time-minimal trajectory by a gradient search, where the gradient 
dC/dp is evaluated numerically. 

Another method (Shiller and Gwo 1991) consists in building roadmaps as in 
section “Minimizing Path Length,” but where the cost of an edge in the graph 
search would not be the distance between the adjacent vertices but a heuristic 
quantity related to the fixed-path time minimization algorithm of section 
“Fixed-Path Time Minimization.” 

Shortcutting with Kinodynamic Constraints 

As in the case of path planning (section “Path Shortcutting”), it turns out that 
shortcutting is the most effective method to obtain trajectories with short durations 
(Hauser and Ng-Thow-Hing 2010). There is, however, an important difference 
between trajectory and path shortcutting: in trajectory shortcutting, one usually 
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needs to ensure that the new portion can be inserted into the original trajectory 
while preserving the smoothness properties of the original trajectory. For instance, 
if one wants to preserve the (^-continuity of the trajectory (i.e., the property that the 
trajectory is differentiable and that the derivative is continuous), then it is necessary 
to generate shortcuts that begin and end, not only at the same configurations qi, q 2 , 
but also with the same velocities q 1? q 2 as in the original portion. 

Algorithm 4 presents shortcutting under velocity and acceleration (or pure 
kinematic) bounds (Hauser and Ng-Thow-Hing 2010). This algorithm is very 
effective thanks to the following property: given the beginning and ending config- 
urations and velocities (q^qj, (q 2 ,q 2 )> it i s possible to compute analytically the 

1 • 

time-optimal trajectory portion II (Qi» Qi» Q2» <fe) un der given velocity and accel- 
eration bounds (Hauser and Ng-Thow-Hing 2010). 

Algorithm 5 presents shortcutting under general kinodynamic bounds (Pham 
2012). Contrary to the case of velocity and acceleration bounds, there is no analytic 
expression of the time-optimal trajectory for general kinodynamic constraints. One 
thus have to resort to the path decoupling approach presented earlier: (i) interpolate a 
path between (q x , q 1 ) and (q 2 , q 2 ) respecting C 1 -continuity, and (ii) time -parameterize 
the path optimally under the given kinodynamic constraints. Note that the heuristic to 
choose the path in step (i) is cmcial for the performance of the algorithm. 

Algorithm 4 TRA JSHORTCUTK INEMAT I C 

Input : A collision - free , C 1 traj ectory Yl satisfying veloc- 
ity and acceleration constraints 

Output: A collision-free, C 1 trajectory satisfying the 
velocity and acceleration constraints and with shorter time 
duration 

for rep = 1 to N maxrep do 
Pick two random points q 2 , q 2 along Yl 
if n kin (qi,qi,q 2 ,q 2 ) is collision-f ree then 

i ■ 

Replace the portion of Yl between q 2 and q 2 by [] (qi,qi> 

q 2 ,q 2 ) 

end if 
end for 

1 • 

Note: n (q, , q, , q 2 , q 2 ) denotes the optimal trajectory between (q, , q^ and (q 2 , q 2 ) 
under given velocity and acceleration bounds (see text) 

Algorithm 5 TRAJ SHORTCUT GENERAL 

Input: A collision-f ree , C 1 trajectory IT sati sfying general 
kinodynamic constraints 

Output: A collision-free, C 1 trajectory satisfying the 

kinodynamic constraints and with shorter time duration 
for rep = 1 to iV maxrep do 
Pick two random points q 2 , q 2 along Yl 
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Joint-space task 

(starting and goal configurations) 


Joint-space path connecting start and goal 
and respecting geometric constraints 


Joint-space trajectory respecting geometric 

and kinodynamic constraints 


Joint-space trajectory respecting constraints 

and with short time duration 




Kinodynamic shortcutting 


Fig. 5 Typical work flow as practiced in a company specialized in motion planning for industrial 
robots 


Generate a path P int (qi,qi,q 2 ,q 2 ) 

if pint (q 1 ? q 1 ? q 2 ,q 2 ) is collision- free then 

Time-parameterize P 1Tlt (q 1 ,q 1 ,q 2 ,q 2 ) into [g nt (q^qt^.^b) 

■ , 

if rrMq,,^ , q 2 ,q 2 ) has shorter time duration than the orig- 
inal portion then 

■ , 

Replace the portion of Yl between q 2 and q 2 by n in (q x , <(i, 

end if 
end if 
end for 

Note: P m (q, , qi , q 2 , ^ 2 ) denotes an interpolated path between (q { , q { ) and (q 2 , q 2 ). 
n m ( qi,qi,q 2 ,q 2 ) denotes the time-optimal trajectory obtained by time- 
parameterizing 

^ mt (qn Qn q2’ Q2) un der given kinodynamic constraints (see text) 


Summary 

We have presented an overview of trajectory planning and optimization methods, 
with a special emphasis on those relevant to industrial robotic manipulators. It 
appears from this overview that very efficient methods exist for planning high- 
quality trajectories when the environment (consisting of the robot, the obstacles, 
etc.) is well defined and static. A typical work flow, may integrate some of these 
methods as sketched in Fig. 5 . The main current challenge of trajectory planning in 
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classical factory automation lies mainly in the development of robust software, as 
well as practical integration into the work place. 

The next major step in factory automation is to integrate the robot more tightly 
with human operators. For this, new methods must be developed, taking into 
account environments that are by nature time changing, and sometimes in an 
unpredictable way, because of the close, possibly physical interaction with human 
operators. In this context, other types of constraints and optimization objectives 
must also be considered, such as safety or compliance. 
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Abstract 

Robot manipulators have been widely used in industrial automation. In many 
modem robot control applications, sensory information such as visual feedback 
is used to improve positioning accuracy and robustness to uncertainty. This 
chapter introduces basic concepts and design methods that are employed for 
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motion control of robot manipulators with uncertainty. The chapter covers both 
basic methods in joint- space control and advance topics in sensory task-space 
control. 


Introduction 

Robotic manipulation has shown to be a key technology in factory automation. 
In order for a robot to perform some specific tasks, the robot is required to move 
according to the commands from a motion controller. Most motion control applica- 
tions of robot manipulators can be categorized into two main classes. The first is 
the point-to-point motion control or set-point regulation where the robot is 
required to move from an initial position to a final desired position in the workspace. 
Pick-and-place operations are typical examples of the point-to-point motion control 
applications. Other examples that require set-point control are spot welding and hole 
drilling. The second type is trajectory tracking applications, such as arc welding, 
machining, and painting, where the robot has to follow a desired trajectory. 

Robot manipulator Robot manipulator consists of rigid links connected by joints. 
One end of the manipulator is fixed to a base and an end effector or tool is connected 
to the other end. The vector space in which the joint displacements are defined is 
often referred to as the joint space, and the coordinates in which the manipulator 
task of the end effector is specified is referred to as the task space, which can be a 
Cartesian space or an image space depending on the task requirements. The motion 
control problem of a robot can be formulated either in the joint space (Kelly 
et al. 2005) or in the task space (Spong et al. 2006). In the joint-space control 
methodology, the desired position of the end effector is converted to a 
corresponding desired joint configuration by solving an inverse kinematic problem, 
and a feedback control law is designed so that the robot joints follow the desired 
joint position. To eliminate the problem of solving the inverse kinematics, the robot 
motion control problem can be directly formulated and designed in task space. 
A transformation matrix Transformation matrix or Jacobian matrix Jacobian matrix 
is used to transform the task-space feedback error to joint control inputs. 

The most commonly used controllers in industrial applications are PD and PID 
controllers (Ziegler and Nichols 1942). The main advantages of such controllers are 
the simplicity and ease of implementation. However, the kinematics and dynamics 
of a robot manipulator are highly nonlinear with coupling between joints, and 
hence, the linear control theory cannot be applied directly to design PD or PID 
controllers for a robot manipulator. By exploring physical properties of the robot 
dynamics, Takegaki and Arimoto (Takegaki and Arimoto 1981) first showed using 
Lyapunov method (Slotine and Li 1991) that a simple PD controller with gravity 
compensation is effective for set-point control of a robot manipulator Robot 
manipulator. The result was an important landmark in robot control theory. Inspired 
by the original work (Takegaki and Arimoto 1981), much progress has been made 
in understanding the robot motion control problem, and various control methods 
have been developed for a robot manipulator (Arimoto and Miyazaki 1984, 1985; 
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Arimoto et al. 1994; Ortega et al. 1995; Arimoto 1994; Wen and Bayard 1988; 
Niemeyer and Slotine 1991; Berghuis et al. 1993; Cheah et al. 1998, 2004, 2007, 
2010; Cheah 2003; Dixon 2007; Liang et al. 2010; Braganza et al. 2005; Wang and 
Xie 2009; Garcia-Rodriguez and Parra-Vega 2012; Cheah and Liaw 2005; Kelly 
1997; Wang et al. 2007). 

This chapter focuses on motion control methods of robot manipulators that were 
developed based on the Lyapunov method. In particular, several set-point and 
adaptive tracking controllers are presented in both the joint space and task space, 
for a robot manipulator with uncertainty. The robot kinematics and dynamics are 
nonlinear with coupling between joints, and a good understanding of the structure 
and properties of the models is essential for the design of simple and effective 
controllers. The dynamics and kinematics of robot manipulators and their basic 
properties are first introduced in section “Dynamics and Kinematics of Robot 
Manipulators” and several examples are given to illustrate the properties. The 
next two sections of this chapter present the standard motion controllers in robotics, 
which serve as foundation works for the design of most robot controllers based on 
the Lyapunov method. Section “Set-Point Control by PD Plus Gravity Controller” 
introduces the set-point controllers based on the PD plus gravity control strategy. 
Section “Adaptive Control of Robot Manipulators” presents adaptive control meth- 
odology for tracking control of robot manipulators. Recent advances in sensing 
technology has led to the research and development of sensory task- space feedback 
control laws for robot manipulators. The use of task-space sensory feedback 
information such as visual information improves the endpoint accuracy in the 
presence of uncertainty. Section “Approximate Jacobian Set-Point Control” pre- 
sents the basic task-space sensory feedback control problem of a robot manipulator 
with kinematic and dynamic uncertainty for set-point control applications. 
Moreover, the results are extended to deal with uncertain gravitational force. 
Section “Adaptive Jacobian Tracking Control” presents the adaptive Jacobian 
controller for task-space sensory feedback tracking control applications. Simulation 
results are presented in section “Simulation Results.” A brief review of the basic 
concepts and theories for stability analysis of nonlinear systems is also provided in 
the Appendix. 


Dynamics and Kinematics of Robot Manipulators 

In this section, the dynamic and kinematic equations of robot manipulators Robot 
manipulator are introduced and the properties which constitute the basis of con- 
troller design in this chapter are presented. 


Dynamic Equation of Robots 

T 

For a robot manipulator of n DOF with the joint coordinate q = [q i,. . ., # w ] , the 
equation of motion using Lagrange approach takes the following form: 
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Af 1# M 2 : mass of the first and second link 

L 1# L 2 : length of the first and second link 

L cl , l c2 : distance between the center of the 
gravity and the joint 



Fig. i A two-link robot manipulator 



such that r kji is called the Christoffel symbols and it is defined as 



1 \ dMij{q) dM ik (q) 

2 1 dq k dq. 



Equation 1 can be expressed in the following compact form: 



M(q)q + C(q, q)q + D(q)q + g(q) = u (3) 

where M(q) £ R nxn is the inertia matrix, C(q,q) is the matrix of Coriolis and 
centripetal terms, D(q) £ R nxn is the matrix of damping coefficients, g(q) £ R n 
denotes the gravitational force, and u £ R n is the vector of control input. The 
following example is presented for better understanding of the dynamic equation 
expressed in Eq. 3. 


Example 1 Consider a two-link planar robot manipulator as depicted in Fig. 1. 
The dynamic equation of the robot is expressed as follows: 


'M n 

M 12 


• • 

<7i 

+ 

'Cn 

Cl2 



+ 

'Du 

0 



+ 

8 1 

M 21 

M 2 2 


• • 

_<h_ 

c 21 

C22 _ 



0 

E> 22 



82 . 


u (4) 


where parameters M t j, Cy , and g t for /, j = 1, 2 are expressed in Appendix 2. 
Parameters D n and D 2 2 are positive constants that represent damping coefficients 
for each joint. 

Some properties of dynamic equation of robot manipulators expressed by Eq. 3 
are as follows: 


Property 1 The inertia matrix M(q) is symmetric and positive definite for all 

q £ R n . 
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Property 2 The matrix M(q) — 2 C(q, q) is skew symmetric so that 

y T [M{q) — 2C(q, q)}y = 0 forally<ER n . (5) 

To show property 2, the elements of matrix C(q, q) are obtained by using Eqs. 1 
and 2: 


Cij (<?,<?) 


J2 1 J dM iM) + dM * (?) dM kj (?) 1 , 


k= 1 


2 


<9* 


% 


<9?« 



q k 


( 6 ) 


Therefore, the elements of matrix M(q) — 2C(q , <?) can be expressed as 


(M(<?) - 2 C(q,q)) 


n 



k= 1 


dMiM 

dq k k 


dMij(q ) dM ik {q) 


dq k 


dq. 


dq< 


q k 


n 



dM ik {q) dM k j(q) 

H - >klk 


k= 1 


°<ij 


oq, 


Switching i and j and considering the symmetricity of matrix M(g) yield 



which shows that the matrix (M(q) — 2C(q, q)) is skew symmetric. 

Property 3 The dynamic equation Eq. 3 can be linearly parameterized with 
respect to the constant parameters , as follows: 


M(q)q + C(q,q)q + D(q)q + g{q) = Y(q,q,q,q)O d (7) 

where Y(-) G R nxp is the known regressor matrix and Oj is the unknown parameter 
vector. 

The following example is presented to illustrate properties 1-3: 


Example 2 Consider the two-link robot manipulator depicted in Fig. 1. The inertia 
matrix is expressed by Eq. 113 in the Appendix. To show the positive definiteness 
of the inertia matrix, it is sufficient to show that all leading principal minors of the 
matrix are positive. Hence, the following inequalities must hold: 


f I : — MiL^i + —M 2 L 2 C 2 + M 2 L\ + 2M2L.L.2 cos (q 2 ) > 0 

1 II : det(M) > 0 

For inequality /, the worst case is cos ( q 2 ) = — 1 ; then it gives 
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A -M x L 2 cl + l -M 2 L 2 c 2 +M 2 {L x 




For the second inequality, the determinant of M(q) is computed as follows: 

M iM 2 L 2 cl L 2 c2 +M\L\L 2 c2 (2 - cos 2 {q 2 ) \ > 0 (10) 

which yields the positive definiteness of the inertia matrix. 

To show the skew-symmetric property of the matrixM(g) — 2 C(q, q), let us form 
the matrix M(q): 


-2M 2 L l L c2 q 2 sin (q 2 ) -M 2 L l L c2 q 2 sin (q 2 ) 

—M 2 L\L c2 q 2 sin (q 2 ) 0 

Hence, the matrix M(q) — 2 C(q, q) can be expressed as 




M(q) - 2 C(q,q) 


0 ¥ 
-¥ 0 



where ¥ = 2M2LiL c2 qi sin (q 2 ) + M 2 L\L C 2 q 2 sin (q 2 ). It can be seen from Eq. 12 

that the matrix M(q) — 2C(q, q) is skew symmetric. 

To show the property Eq. 3, the elements of known regressor matrix Y E R (2x7) 
are given as 


Y n =qi, Y l2 =q 2 

Y 13 = (2 q x + <72 ) cos (<h) - (2<7i?2 + 4l) sin (<? 2 ) 

Yu = q u 715=0, 7i6 = cos (q { ), Y X1 = cos (q i + q 2 ) 

Y 2 \ = 0, 7 22 = q x + q 2 

723 = <7i COS (q 2 ) + q\ sin (q 2 ) 

Y 24 = 0, Y 25 = q 2 , Y 26 = 0, Y 21 = cos (q L + q 2 ), 

and the unknown parameter vector is expressed as follows: 



A -M x L 2 c ,+ A -M 2 L 2 c 2 +M 2 L\ 

4 9 

M2L\L C 2 


D 

D 


11 

22 


(M\L C \ + M 2 L\)g 

M 2 L C 2g 
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Kinematic Equation Kinematic equation of Robots 

The kinematic model represents the relationship between the joint angles and the 
end-effector position. Hence, the kinematic model connects the task space to joint 
space as follows: 

x = h(q) (13) 

where x E R m is the task-space vector and /?(•) E R n — » R m is a transformation 
describing the relation between the joint space and the task space. The relationship 
between task-space and joint- space velocities is given as 

x = J{q)q (14) 

where J(q) E R mxn is the which provides a transformation from the joint space to 
the task space. 

Remark 1 The task-space vector can either be defined in a Cartesian space or an 
image space. If cameras are used to measure the end-effector position, the task 
space is defined as an image space. Let x denote the vector of image feature 
parameters; the image velocity vector x is related to the joint velocity vector q as 

x = Ji(q)J e {q)q (15) 

where Jfq) is the image Jacobian matrix from the Cartesian space to the image 

space (Hutchinson et al. 1996) and J e (q) is the manipulator Jacobian matrix from 
the joint space to the Cartesian space. Therefore, the overall Jacobian from joint 
space to task space is J(q) = Jfq)J e {q). If a position sensor is used to measure the 
end-effector position directly, the task-space vector x is defined as a Cartesian space 
and hence J(q) = J e (q). 

A property of the kinematic equation Kinematic equation described by Eq. 14 is 
stated as follows: 

Property 4 The right-hand side of Eq. 14 is linear in a set of constant kinematic 

qr 

parameters 6 k = (6 ki , • • •, 0 kq ) , such as link lengths and joint offsets. Hence , 
Eq. 14 can be expressed as 


X = J(q)q = Y k (q, q)0 k , (16) 

where Y k (q,q) ER" yq is called the kinematic regressor matrix. 

Example 3 Consider the two-link robot manipulator as depicted in Fig. 1. The 
kinematics of the robot can be expressed as follows: 

_\L\ cos {q x ) + L 2 cos (< 7 j + q 2 ) 

I L\ sin (q x ) + L 2 sin (q x + q 2 ) 


( 17 ) 
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The task-space velocity can be obtained by differentiation Eq. 17 with respect to 


time, as 


4i 

Vi 

J(q) 

Equation 18 can be written as follows: 


-U sin (q x ) - L 2 sin (q x + q 2 ) -L 2 sin (q x + q 2 ) 

Li cos {q x ) + L 2 cos [q x + q 2 ) L 2 cos (q x + q 2 ) 




Set-Point Control by PD Plus Gravity Controller 

This section considers the set-point control problems or point-to-point control 
problems in which the robot is required to move from one point to another point 
without controlling the path taken by the robot between the two points. A simple 
and useful set-point controller for motion control is the PD plus gravity controller. 
This control strategy employs feedback of joint angles, velocity, and compensation 
of gravitational force as depicted in Fig. 2. 

The feedback control law is expressed in the following form: 

u = —K v q - K p q + g(q) (20) 

where K p is the proportional gain matrix, K v is the derivative gain matrix, and 
q = q — q d such that q d represents a constant desired position. Both K p and K v are 
positive diagonal matrices. The closed-loop system can be obtained by substitut- 
ing Eq. 20 into the dynamic Eq. 3: 

M(q)q + C(q, q)q + D(q)q + K v q + K p q = 0. (21) 

The stability of the closed-loop system is examined by using the Lyapunov- 
based stability analysis. Consider the following Lyapunov function candidate as 

V= l -q T M(q)q+ l -q T K p q. (22) 

Differentiating the Lyapunov-like candidate with respect to time gives 

V = q T M(q)q + ^q T M{q)q + q T K p q. (23) 

Substituting M(q)q from the closed-loop Eq. 21 yields 
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Q 


Robot 


Manipulator 

q 





Fig. 2 A block diagram of the PD plus gravity controller 


V 


1 

2 


q T (M{q) - 2 C{q,q))q - q T D{q)q 


q T K v q - q' K„q + q‘ K„q. 


T 


T 


(24) 


• • 

Considering the skew-symmetric property of the matrix M(q) — 2 C(q, q), V is 
simplified as follows: 


V = -q r D(q)q - q T K v q < 0. (25) 

Using LaSalle’s invariance principle (see lemma A1 in the Appendix) yields 

V = 0 => q T {D(q ) + K v }q = 0^q = 0. (26) 

From the closed-loop Eq. 21, one can conclude that 

K p q = 0=>q = 0 (27) 


yields q — * as t — > oo. 

In most robotic applications, the end effector of a robot manipulator is required 
to move to a desired position in the task space. This can be obtained by either 
solving the inverse kinematic problem to find the corresponding desired joint angles 
or developing task-space control methodology directly by using the task-space 
error. For task-space control, the Jacobian matrix is used to transform the task- 
space errors to joint control inputs. In the following, a task-space set-point control 
approach is presented. Consider the following feedback control law for the task- 
space regulation: 



where x = x — xj such that x d is the desired position of the end effector. A block 
diagram of the task-space regulator described by Eq. 28 is illustrated in Fig. 3. 
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x d 



Fig. 3 A block diagram of the task-space PD plus gravity control system 


Substituting the feedback law Eq. 28 into the dynamic Eq. 3 yields the following 
closed-loop system: 

M(q)q + C(q, q)q + D(q)q + J T (q)(K v x - K p x) = 0. (29) 

The following Lyapunov-like candidate can be considered for the stability 
analysis of the task-space regulation problem: 

V = l -q T M(q)q + ^x T K p x. (30) 

Differentiating Eq. 30 with respect to time and substituting Eq. 29 into it yield 

V = —q T D(q)q - q T J T (q)K v q - q T J T (q)K p x + x T K p x - q T D(q)q - x T K v x < 0. 

(31) 

If the Jacobian matrix is of full rank, by using LaSalle’s invariance principle (see 
lemma A1 in the Appendix), it is obtained 

V = 0^ q T D(q)q + x T K v x& | I q • (32) 

From the closed-loop Eq. 29, one can conclude that 

J T (q)K p x = 0^x = 0. (33) 

Hence, the position of the end effector converges to the desired position such as 

x — » x d as t — » oo. 

Remark 2 It was first shown in (Takegaki and Arimoto 1981) using the Lyapunov 
method that the simple PD controller with gravity compensation Eq. 20 is effective 
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for set-point control of a robot manipulator Robot manipulator. The gravity com- 
pensation term can also be computed off-line by using the desired position (Tomei 

1991; Kelly 1993; Arimoto 1996): 

u = —K v q - K p q + g(q d ). (34) 

A main drawback of the PD plus gravity controller is that an exact knowledge of 
the gravity force of the robot manipulator is required. The existence of uncertainty 
in modeling the gravitational term results in steady- state position error. One way to 
alleviate the problem is to increase the P control gain which may lead to saturation 
of the control torques. A common practice to remove the steady-state error is to add 
an integral term (Arimoto and Miyazaki 1984; Arimoto 1996). 



—K v q - K p q 


K 


i 


qdt 



where K 1 is a symmetric positive definite matrix. The closed-loop system of the 
linear PID robot control system is asymptotically stable in a local sense. To achieve 
global asymptotic stability, a saturated or bounded nonlinear function of the 
position error can be used (Arimoto et al. 1994): 



-K v q - K p q 


K 


i 


s(q)dt 



where s(.) is the saturation function. Other regulators for robot manipulators 
without using the gravitational term can be found in (Ortega et al. 1995; Kelly 
1998). If the structure of the gravity force is known, adaptive PD controller with 
gravity regressor can also be used to eliminate the steady-state position error in the 
presence of uncertain parameters (Tomei 1991; Arimoto 1996): 


u = -K v q - K p q + Z(q)<p 
with the parameter update law 



cp = —LZ T (q)(q + as(q )) (38) 

where a is a positive constant. The gravity regressor matrix can be computed 
off-line by using the desired position (Kelly 1993). 


Adaptive Control of Robot Manipulators 

For tracking control applications, the manipulator is required to follow a desired 
time- varying trajectory. The simple PD controllers in the previous section are 
usually not effective for tracking control, especially for high-speed manipulation 
tasks. In this section, adaptive control method of robot manipulators is presented for 
tracking control applications. 
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Fig. 4 Filter-like representation of Eq. 40 (p = ( d/dt ) is the Laplace operator) 

To design the adaptive control, a reference vector is defined as follows: 


q r = q d ~Aq (39) 

where A is a positive constant. Using the reference vector Eq. 39, a sliding vector is 
defined as follows: 


$ = q-q r = q+Aq m (40) 

In fact, Eq. 40 represents a low-pass filter which is also depicted in Fig. 4. Hence, 
q can be obtained as 



For a bounded sliding vector, i.e., 



< O, then it yields 


t 


< 7(011 <® 


e A<yt ^s(t)Jt < 


0 

1 





Therefore, inequality Eq. 42 shows that increasing A leads to smaller tracking 
error. 

By differentiating Eq. 40 with respect to time, i> is obtained as follows: 


5 = q-q r = j+Aq. (43) 

Substituting Eqs. 40 and 43 into Eq. 3 and using property 3, the following 
equation is obtained: 

M(q)s + C(q,q)s + D(q)s + Y(q,q,q r ,q r )0 d = u (44) 

where Y (q,q, q r , q, ) is a known regressor matrix and 6 d is an unknown parameter 
vector, which are defined as follows: 

Y (q,q,q n q r )0 d = M(q)q r + C(q, q)q r + D(q)q r + g(q). (45) 

In fact, adaptive control strategies for a nonlinear system are mainly based on the 
property of linear parameterization of the nonlinear dynamics with respect to 
unknown parameters. 
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Fig. 5 A block diagram of the adaptive feedback law (46) 


The adaptive law is expressed as follows: 



where K s is a positive definite matrix and Oj is the vector of estimated parameters, 
which is updated based on the following update law: 


9 d = ~LY T (q, q , q r ,q r )s (47) 

such that L is a symmetric positive definite matrix. A block diagram of the adaptive 
law Eq. 46 is illustrated in Fig. 5. 

The closed-loop equation for the robot manipulator is obtained by substituting 
the adaptive feedback law Eq. 46 into the dynamic Eq. 3, as follows: 

M{q)s + C{q, q)s + D(q)s + K $ s + Y(t/, q , q r , q r )A0d = 0 (48) 

where A 6d = 0d — Oj. To examine the stability of the closed-loop system described 
by the Eq. 48, the following Lyapunov-like candidate is proposed: 

V = ^s T M(q)s+^A0 T d L- l A8 d . (49) 

By differentiating the Lyapunov-like candidate with respect to time, substituting 
the closed-loop Eq. 48 into it yields 


V = s T M(q)s + 1 s T M {q)s + 0 d L~ l AO d 
= —s T D(q)s - s t K s s - s T Y(q,q,q r ,q r )A0 d 

+ X -5 T {M{q)-2C{q,q)}s + e d L- l A0 d . 


( 50 ) 
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Considering the skew symmetricity of the matrix M(q) — 2 C(q, q) and substitut- 
ing Eq. 47 into Eq. 50, V is simplified as 

V = -s T D(q)s - s T K s s < 0. (5 1) 

Since M(q) and L are positive definite, V is positive definite in s and A 6 d . 

Therefore, s and A 0 d are bounded. Hence, from closed-loop Eq. 48, one can 

• • 

conclude the boundedness of s, and this leads to the boundedness of V, which 
means that V is uniformly continuous. By Barbalat’s lemma (see lemma A2), it then 
follows V goes to zero as t — ■» oo, so it can be concluded that s converges to zero. 
The convergence of s yields the convergence of q and q to zero. 

Remark 3 The globally tracking convergent adaptive controller Eq. 46 was derived 
in (Slotine and Li 1987) for a robot with dynamic uncertainty. The dynamic 
regressor matrix can also be computed off-line by using the desired trajectory 
(Arimoto 1996; Sadegh and Horowitz 1990). An adaptive controller with off-line 
computed dynamic regressor is as follows (Arimoto 1996): 

• / ^ 

u = —k p q - K v q + Y d (q d , q d , q d , q d )0 d (52) 

with the parameter update law as 


d d = ~LY T d (, q d , q d , q d , q d ) {q + as(q) } (53) 

where a is a positive constant and s(.) is the saturation function. Many other robot 
adaptive controllers have also been proposed for a robot with dynamic uncertainty 
(Wen and Bayard 1988; Niemeyer and Slotine 1991; Berghuis et al. 1993; Paden 
and Panja 1988; Ortega and Spong 1989; Craig et al. 1987; Middleton and Goodwin 
1988; Koditschek 1987; Lee and Khalil 1997). 

This section focuses only on adaptive control of robot manipulators. 
Other tracking control methods for robot manipulators with dynamic uncertainty 
include robust control (Slotine 1985; Spong 1992; Abdallah et al. 1991), learning 
control (Arimoto 1996; Arimoto et al. 1984), and neural network control 
(Lewis 1996). 

Remark 4 Since the closed-loop system Eq. 48 is directly dependant on time due to 
the presence of the time-varying trajectory qjf), LaSalle’s invariance principle 
cannot be utilized to show the stability of the system. 

Remark 5 A key point in adaptive control is that the tracking error will converge 
regardless of parameter convergence. That is, one does not need parameter conver- 
gence for tracking error convergence. However, parameter convergence can be 
obtained if the persistence of excitation condition is satisfied. 
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Remark 6 In many practical applications, the physical parameters are bounded by 
prescribed upper and lower limits. Using these bounds as a priori information, the 
parameter update law can be modified in a way that the estimates are constrained to 
a specified region. In this regard, a commonly used method is the standard projec- 
tion algorithm (Ioannou and Sun 1996). 


Approximate Jacobian Set-Point Control 

In many modem control applications of robot manipulators, task-space sensory 
feedback information such as visual information is used to monitor the position of 
the end effector. By using sensory feedback, the control systems are robust to 
various modeling uncertainties. This gives the robot a high degree of flexibility in 
dealing with unforeseen changes and uncertainties. When a robot picks up several 
tools of different dimensions, unknown orientations, or gripping points, the kine- 
matics changes and is difficult to derive exactly. In the presence of kinematic 
uncertainty, inverse kinematic algorithms cannot be used to derive the desired 
position in the joint space. This section presents a simple approximate Jacobian 
set-point controller for a robot manipulator with uncertain kinematics by using task- 
space sensory feedback of the position of the end effector. Though the position of 
the end effector can be measured by task- space sensors, the uncertainty of the 
Jacobian matrix poses a challenging robot control problem. 

First the quasi-natural potential function Sff) is defined which will be used in 
developing the feedback law. 

Definition 1 Consider the scalar function Sfcp) and its derivative sff) as depicted 
in Fig. 6 , with the following properties'. 

1. Sff) > Ofor <fi £ R — {0} and 5/(0) = 0. 

2. Sff) is twice continuously differentiable, and the derivative Si(cf)) = ds ^ is 

strictly increasing in f for /(/)/< yi with some y\ and saturated for /(p/ > y it 
i.e., sM) = ±Sifor f > +Yi and <fi < -Yi, respectively, where s t is a positive 
constant. 

3. There are constants Ci > 0, dj > 0, df> dj) > 0 such that 

disj(<p) > 4>Si(4>) > disf(4>) > 0, Si(4>) > Cisf(4>), (54) 

for tp 0 

The approximate Jacobian set-point controller is expressed as follows: 

u = -f(q) ( K p s(x ) + K v x) + g{q) (55) 

where J(q) E R ,nxn is the estimation of the Jacobian matrix; J(q ), K p , K v are positive 
definite diagonal feedback gains for the position and velocity, respectively; and 
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Fig. 6 (a) Quasi-natural potential Si((p), (b) derivative of Sj((p ) 


57 O, / = 1, • • *, m are saturated functions defined in definition 1. Here, it is assumed 
that J (q) is chosen so that 


\\J T {q) - f {q)\\ < Y , (56) 

where y is a positive constant to be defined later. Substituting the feedback law 
Eq. 55 into the dynamic Eq. 3 gives the closed-loop equation as 


M{q)q + C(q, q)q + D{q)q + J (q) ( K p s(x ) + K v x) = 0. (57) 

To examine the stability of the closed-loop system, consider the following 
Lyapunov -like candidate: 

V = -q r M(q)q + aq T M(q)J ^ (q)s(x) 

m (58) 

H - ock V i ) Si (xi ) , 

/= l 

AJ. A 

where a is a positive constant and J ' ( q ) denotes the pseudo-inverse of J(q) such 

A Arj-< A A J. 

that J(q)J (q) is non-singular and J(q)J J (q) = I. The parameters k ph k vi denote the 
zth diagonal elements of K p and K v , respectively. Consider the following inequality: 

i q r M(q)q + aq T M(q)f (q)s(x) + Y^"Li (**» + ak vi) s i(*i) 

= i (q + 2aj\q)s(x)J M{q) (q + 2od\q)s(x)'j - 

a 2 s{x) T (f (q)^j M(q)f ( q)s(x ) + \ (kpi + ak vi )Sj(xj) 

— V- { kpjt'j -)- a(k vi Cj ) j .v, (x/) 


(59) 
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where A m A A max 



. Substituting the inequality Eq. 59 into 


Eq. 58 yields 


m 

+ 22 {kpiCi + a (kviCi - aA m )}si 2 (xi) > 0 

i=i 

where K v and a can be chosen so that 

k V jCj — aA m > 0. (60) 

Hence, V is positive definite and represents a Lyapunov function candidate for 
the set-point control of the robot with uncertain Jacobian matrix. Differentiating the 
Lyapunov function candidate V with respect to time, it can be shown that (refer to 
Appendix for details) 


1 


V > -q r M(q)q 


V < 


A 


max 


A 


max 


(* 


2 */)) - 

ac 0 \ 


Kp) 

( aSi - £ 

1 - acA 


\2 


such that c 0 , c x are positive constants, bj is the upper bound of \\J(q)\\, and 



51 

5 2 

s 3 



If the following inequality holds 


min 


28\ 

S 2 + 2b j 






then by proper selection of parameters a, K p , and K v , such that 



it can be guaranteed that V < 0. This implies the convergence of the task-space 
error and joint- space velocity such that x — > 0, q 0, as t — > oo. 
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Remark 7 For redundant robots, the dimension of x is different with the dimension 
of q, but the LaSalle’s invariance theorem can be used to deduce the convergence 
of x, q to zero. 

An alternate condition for the bound y can be derived by substituting J T \q) = 

a J^ 

J (q) + J(q) into Eq. 120 so that 



Therefore, condition Eq. 63 is written as follows: 



min 


28 i 

5 2 + 2bf 







where S 


x 


mm 


1 


T 


J (q)K,J (q)+D(q) 


^max (K v ) 


and bf is the upper norm bound of J(q). This implies 


that the bound y can be represented by the actual or estimated Jacobian matrix 
whichever is larger. 

In order to further explore the condition Eq. 63, let K v = k v I , K p = ak v I. Hence, 
condition Eq. 63 is simplified as 



For a sufficiently large a[ i.e., a 



, condition Eq. 67 is 


written as 



The condition Eq. 63 is illustrated in Fig. 7. In order to guarantee the stability of 
the system with approximate Jacobian matrix, the allowable bound of the Jacobian 
uncertainty y must be smaller than the curve as shown in the figure. 

The following conditions can be obtained from Fig. 7: 


(i) If y is small, a can be chosen small and therefore from Eq. 60 it can be deduced 
that a smaller controller gain is required. 

(ii) If y is small, a wider range of the feedback-gains ratio a can be chosen. In fact, 
in the extreme case where y = 0, a could be chosen as any value. 

(iii) If y is large, a larger controller gain is required and a narrower range of the 
feedback-gains ratio a is allowed. 
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^ min [j T J + D / £ J 

2bj 

^min [j T J + D / \ v 

2bj 


Fig. 7 An illustration of condition Eq. 63 where the vertical axis represents the bound of the 
Jacobian uncertainty and the horizontal axis represents the feedback-gains ratio 


Moreover, if a is increased, then the allowable bound y of the Jacobian uncer- 

— aJ 1 m 

tainty J (q) = J (q) — J (q) is reduced. Therefore, the feedback-gains ratio 
a should be kept smaller so that the allowable bound of the Jacobian uncertainty 
is larger. This can be easily done by either reducing K p or increasing K v . Though the 
condition is a sufficient condition, it is reasonable because increasing K p amplifies 

the estimated Jacobian J(q) and hence more accuracy on the estimation or more 

damping is required. An important and piratical conclusion of the result is that when 

/\ 

the system is unstable, redesign of J(q) may not be necessary as the instability may 
be due to the reason that the feedback-gains ratio a is not tuned properly. In 
practice, we should therefore try to stabilize the system or increase the margin of 
stability first by reducing the feedback-gains ratio a (Fig. 7). 

In the presence of uncertainties in gravitational force and Jacobian matrix, the 
task-space control of robot manipulators can be obtained by using the concept of 
regressor for the robot dynamics (see Eq. 7) such that a gravity regressor is 
introduced to compensate for the gravity force. 

T 

The gravity term can be characterized by a set of parameters (p — {cp i, • • •, (p p ) 
as 


g(q) = Z(q)<p, (69) 

where Z(q) <G R nxp is the gravity regressor. 

Example 4 For the two-link robot manipulator depicted in Fig. 1, Z(q) can be 
expressed as follows: 


cos (<?!) cos (q l +q 2 ) 

0 cos (q x + q 2 ) 


Z(q) 


9 
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*d 



Fig. 8 A block diagram of the task-space approximate Jacobian regulator with adaptive gravity 
compensation 

and the unknown parameter vector cp is expressed as 


(M\L C \ +M 2 Li)^ 

M2L c 2g 


The feedback control law is obtained as 


u = - f(q ) ( K p s(x ) + K v x) + Z{q)cp, (70) 

such that the estimated parameter vector cp is updated as 

<p(t) = -LZ T (q) (q + af(q)s(x)\ (71) 

where L E R pxp is a positive definite matrix. A block diagram of the task-space 
approximate Jacobian regulator Task-space approximate Jacobian regulator with 
adaptive gravity compensation described by Eqs. 70 and 71 is illustrated in Fig. 8. 
Substituting Eqs. 69 and 70 into the dynamic Eq. 3 gives 


M(q)q + C(q, q)q + D(q)qf \q) ( K p s(x ) + K v x) + Z(q)Lcp = 0, (72) 

where A cp = cp — cp. To examine the stability of the closed-loop system Eq. 72, the 
following Lyapunov function candidate is considered: 



( 73 ) 
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The convergence of the task-space error and joint-space velocity can be shown 
by differentiating the Lyapunov function candidate with respect to time (similar to 
previous section), and using LaSalle’s invariance principle. 

Remark 8 The approximate Jacobian set-point controller Eq. 55 with task-space 
damping was proposed in (Cheah et al. 1998, 1999). The controller requires the 
measurement of a task-space position by using a sensor such as vision systems. The 
task-space velocity is obtained by numerical differentiation of the position. In 
(Cheah et al. 2003), an approximate Jacobian feedback controller using only 
joint- space damping was developed as 

u = -f(q)K p s(x) - K v q + g{q). (74) 

The joint-space velocity is obtained by numerical differentiation of the joint 
position, which is usually less noisy than in the task space. In the presence of 
uncertainty in the gravitational force, the update law Eq. 71 or the integration 
control term (Cheah et al. 1999) also requires the use of the inverse Jacobian 
matrix. An approximate Jacobian controller with adaptive gravity compensation 
was developed as (Cheah et al. 2003) 

u = -J T (q) (K p s(x) + K v x) + Z{q)q > , (75) 

such that the estimated parameter vector cp is updated as 

cp(t) = —LZ T (q) [q + af (q)s(x)^j , (76) 

where a is positive constant and s(.) is the saturation function. The main advantages 
of the controller are that the task- space velocity and the inverse of the approximate 
Jacobian matrix are not required. An update law can also be used to update the 
kinematic parameters of the estimated Jacobian matrix online (Cheah 2003; Dixon 
2007), and the simpler stability conditions can be obtained as compared to using a 
fixed approximate Jacobian matrix. 


Adaptive Jacobian Tracking Control 

The approximate Jacobian controller in the previous section is only effective for 
point-to-point control problem. This section presents an adaptive Jacobian control- 
ler for tracking control of a robot manipulator with uncertain kinematics and 
dynamics. The concurrent adaptation to both kinematic and dynamic uncertainties 
is something “humanlike” as in tool manipulation. The key idea is to introduce a 
task-space adaptive sliding vector based on estimated task-space velocity and a 
dynamic regressor matrix based on the estimated kinematic parameters. 
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Fig. 9 The estimated task- 
space velocity and the actual 
task-space velocity 



In the presence of kinematic uncertainty, the parameters of the Jacobian matrix 
are uncertain and an estimated velocity is defined using Eq. 16 as 

i = j(q,0 k )q = Y k (q,q)d k (77) 

A A / A \ 

where xeR' 2 denotes the estimated task-space velocity, J{q,0k) £R ,2Xn is an 

estimated or adaptive Jacobian matrix, and Ok G R q denotes a set of estimated 
kinematic parameters. Figure 9 illustrates the difference between the estimated 
task- space velocity and the actual task- space velocity. 

A reference vector x r G R n is defined as follows: 

x r =Xd- a(x-x d ) (78) 

where x d {t) G R n is the desired trajectory in the task space. Differentiating Eq. 78 
with respect to time yields 


• • 




r 


x d - a(x - x d 


(79) 


where x d 


dx 


d 


dt 


G R n is the desired acceleration in the task space. Using the reference 


vector, an adaptive task- space sliding vector is defined as 


5 


X 


x — X 


1 


j(q,0 k )q -x 


r 


(80) 


where j(q. Ok) q = Yk(q, q)0k as indicated in Eq. 77. The above vector is adaptive in 

the sense that the parameters of the estimated Jacobian matrix are updated by a 
parameter update law which will be introduced in the later development. The 
adaptive task-space sliding vector can also be written as follows: 


5 


x 




J ( q , Ok) q — ij + ax+i — x 

l 


J{q,9 k )q - j(q,O k )q 


\ 


x + ax 



V Y k (q,q)0k 



Y k (q,q)0 k 



x + ax - Y k (q,q)A9 k . 


(81) 


Equation 81 shows that 5* converges to the sliding vector (x + ax), as A 0 k goes 


to zero. 
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Differentiating Eq. 80 with respect to time yields 



J{q, 9k)q + J{q,6k)q - x 


1 



where x denotes the derivative of x. 

Now, a reference vector is defined in the joint space as 




where J ( q , Ok) is the inverse of the approximate Jacobian matrix / [q. Ok). Here, it 

is assumed that the robot is operating in a finite task space such that the approximate 
Jacobian matrix is of full rank. From Eq. 83, it is obtained: 


Or 


* — 1 / ^ 

J (q, e k 


• • 


X 


r 


\ 1 /s 

+ J ( q,0k)x 


r 


Hence, an adaptive sliding vector in joint space can be defined as 





$ = q — q r . (86) 

To obtain the relationship between the adaptive sliding vectors in the joint space 

and task space, multiply both sides of Eq. 85 by J ( q , Ok) and using Eq. 80 as 
follows: 



Substituting Eqs. 85 and 86 into Eq. 3 and using property 3, the equations of 
motion can be expressed as 


such that 


M(q)s + C(q,q)s+D(q)s + Y{q,q,q r ,q r )d d = u 


Y (q,q,q r ,q r )0d =M(q)q r + C(q,q)q r + D(q)q r + g(q). 

The adaptive Jacobian tracking controller is presented as follows: 





u 


(90) 
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Fig. 10 A block diagram of the adaptive Jacobian tracking controller Eq. 90 


*d 


Adaptive Jacobian Tracking Control 


x d . 


X 


X 


1 

Q 


Parameters update laws 


d k = -L k Yl(K v x + K p x) 


G d = —L d Y T (q, q, q r , q r )s 


e 


Adaptive feedback 
law 


e d u = e k ){K v x + K p x) 

+Yd d 


u 


Fig. 11 A schematic diagram of the adaptive Jacobian tracking controller (Eq. 90) together with 
parameter update laws (91) and (92) 


where x = x — Xd , x = x — id , and K v E R nxn and K p G R nxn are symmetric 
positive definite gain matrices. The kinematic and dynamic adaptation laws are 
introduced as follows: 

6 k = L k Y T k (q, q) (kjc + K p x) (91) 

6 d = -L d Y T (q, q, q r , q r )s. (92) 

A block diagram and an illustration of the adaptive Jacobian tracking controller 
Eq. 90 are depicted in Figs. 10 and 11, respectively. 

Substituting Eq. 90 into Eq. 88, the closed-loop equation can be expressed as 

M(q)s + C(q,q)s+D(q) S +J T (q,0 k )(KX + K p x)+Y(q,q,q r ,q r )Ae d =O 

(93) 
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where A Od = Od — Od- For stability analysis, a Lyapunov-like function candidate is 
defined as 


V = l -s T M(q) s + X - A0 T d L^ 1 A0 d + 1 A0 T k L k l A0 k + X -x T {K p + aK v )x (94) 

where A 0 k = 0 k — Ok . Differentiating V with respect to time; substituting the closed- 

• • 

loop Eq. 93, Ok from Eq. 91, and Od from Eq. 92 into it; and using property 2 yield 


V = - s r D(q)s - f x (K v x + K p x) + x T (K p + aK v )'x 
-A0 T k Y T k (q,q)(K v k + K p x). 

Substituting Eq. 8 1 into Eq. 95 yields 



V = -s T D(q)s - xK v x - ax T K p x < 0. (96) 

Since M(q) is positive definite, V in Eq. 94 is positive definite in s, Ax, A 0 k , and 
A 0 d . Therefore, Eq. 96 leads to boundedness of V, and consequently 5, Ax, A 0 k , and 

A A /\ / /N \ 

A 0 d are bounded, which implies that 0 k and Od are bounded and 5 X = J(q,0 k )s is 

bounded as seen from Eq. 87. If x d and its derivatives are bounded, then x and x r (see 
Eq. 78) are bounded. Therefore, q r in Eq. 83 is also bounded if the approximate 
Jacobian matrix is non-singular. From Eq. 85, ^ is bounded and the boundedness of 
q means that x is bounded since the Jacobian matrix is bounded. Hence, Ax is 

A 

bounded and x r in Eq. 79 is also bounded if xj is bounded. From Eq. 91, 0 k is 
therefore bounded since Ax, Ax, q are bounded and Y k (-) is a trigonometric function 
of q. Therefore, q r in Eq. 84 is bounded. From the closed-loop Eq. 93, one can 
conclude that s is bounded. The boundedness of s implies the boundedness of q as 

seen from Eq. 86. From Eq. 82, $ x is therefore bounded. Finally, differentiating 
Eq. 8 1 with respect to time and rearranging yield 


x+ax — 5 X + Y k (q,q,q)A0 k - Y k (q,q)0 k 


which means that x 
follows: 


• • 

x — Xd is also bounded. Therefore, V , which is shown as 


V = 


2s t Bs - 2 Ax' KJc - 2 ax 1 K„x, 


T 


• • 


T 


P 


is bounded. Hence, V is uniformly continuous. Using Barbalat’s lemma, it gives 

x = x — Xd — * 0, x — x — Xd — * 0, and $ — » 0 as t — » oo. 


Remark 9 The adaptive Jacobian tracking controller Eq. 90 for robots with uncer- 
tainties in kinematics and dynamics was developed in (Cheah et al. 2004). The 
proposed controller requires the differentiation of the task-space position which is 
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noisy. An adaptive Jacobian tracking control law based on filtered differentiation of 
the measured task- space position was developed in (Cheah et al. 2006a). Observer 
design techniques can also be used to estimate the velocities (Liang et al. 2010). To 
avoid singularities associated with Euler angle representation, adaptive Jacobian 
tracking controller based on unit quaternion was developed (Braganza et al. 2005). 
An adaptive Jacobian controller based on prediction error was proposed in (Wang 
and Xie 2009). Sliding PID tracking control schemes with uncertain Jacobian were 
proposed in (Garcia-Rodriguez and Parra- Vega 2012). 

When cameras are used to measure the position of the end effector, the vector of 
image feature parameter rates of change x is related to joint variables by the 
following equation (Cheah et al. 2007): 



where Z (< q ) is a diagonal matrix, which contains the depth information of the 
feature points with respect to the camera image frame, and L(x) is a Jacobian 
matrix. Since the overall Jacobian matrix J(q) is inversely proportional to the 
depths, it is not linearly parameterizable. Therefore, the kinematic parameters in 
the image Jacobian cannot be extracted to form a lumped kinematic parameter 
vector that includes the unknown depth parameters, i.e., 

J(q)q^Y(q,q)0. (98) 

Thus, these abovementioned adaptive Jacobian controllers are effective only in cases 
where the depth information is constant or slowly time varying. Vision-based tracking 
controllers with uncertain depth were proposed in (Wang et al. 2007), but the uncertainty 
of robot kinematics and dynamics was not considered and the depth information was not 
updated online. However, the following linear parameterizations hold 

Z(q)x = Y z (q,x)0 z (99) 

J e {q)q = Y k {q,q)0 k (100) 

where Y z {q,x) is called the depth regressor matrix and Y k (q,q) is called the 
kinematic regressor matrix. Therefore, an adaptive Jacobian tracking control with 
uncertain depth information can be proposed as (Cheah et al. 2007) 

u = -f(q,e k )Z~ l (< 7 , 4 ) (K p x + K v k) + Y d (q,q,q r ,q r )0 d ( 101 ) 

A 7^ / A \ A 1 / A \ 

where/ [q,6 k jZ ( q,Q z ) is the adaptive Jacobian matrix and K p and K v are 

symmetric positive definite matrices. The uncertain dynamic, kinematic, and 
depth parameters are updated by the following update laws: 


53 Motion Control 


1915 



Remark 10 In the redundant case (Cheah et al. 2006a, b), the null space of the 
approximate Jacobian matrix can be used to minimize a performance index. Hence, 
the reference vector in the joint space can be defined as 

q r = f (, q , 6 k )x r + (/„ - f (q, 6 k )j (q, 6 k ) ji// (105) 

where j\q,0k) is the pseudo-inverse of the approximate Jacobian matrix and 
y/ G R n is minus the gradient of the convex function to be optimized. 


Simulation Results 

In this section, various feedback laws presented in the previous sections are 
applied on the two-link robot manipulator (depicted in Fig. 1). The parameters of 
the robot are shown in Table 1. The simulation was carried out in MATLAB/ 
SIMULINK. 

PD Plus Gravity Control Law 

For the set-point regulation, PD plus gravity control law Eq. 20 is applied on 
two-link robot manipulator. The control gains were chosen as K p = diag { 10, 10} 
and K v = diag {5, 5 }. The initial values of joint angles were chosen as q i(0) = n!4 
and q 2 ( 0) = n!6, and the initial values of joint velocities were chosen as q\{0),q 2 
(0) =0. The desired joint angles were chosen as q d \ = n/2 and q d2 = tt/ 3. The 
simulation results are depicted in Figs. 12 and 13 which show error of joints angles 
and joint velocities. The actuator torques are shown in Fig. 14. 

Now assume there is 10 % mismatch in measurement of physical 
parameters, i.e., 


M\ — M\ H- 0.1A/J M 2 — 4^2 + 0.1M 2 
L\ = L\ + O.lLi L 2 = L 2 -j- O.IL 2 . 

The error of joints angles is depicted in Fig. 15. Figure 15 shows that there is an 
offset between the final values of joints angles and their desired values. To alleviate 
this steady-state error, let the controller gains increase such that K p = diag {60, 60 } 
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Table 1 Parameters of the 
two link robot manipulator 


• 

1 

M,{ kg) 

Lit m) 

L C i( m) 

Dj( Nms/rad) 

1 

5 

0.2 

0.1 

1 

2 

5 

0.2 

0.1 

1 



Fig. 1 2 Error of joints angles q x , q 2 



Fig. 13 Joints velocities qi,q 2 
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Fig. 14 Actuator torques U\, u 2 



Fig. 15 q x , q 2 in the presence of 10 % mismatch in measurement of physical parameters 


and K v = diag{ 25, 25}. Figure 16 shows that the steady-state errors improved 
considerably. However, if one plots the actuator toques as depicted in Fig. 17, 
it shows that the actuator torques are also increased which might lead to saturation. 
Hence, there is a trade-off between the accuracy and the control torque. 
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Fig. 16 q x , q 2 for high control gains 
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Fig. 17 Actuator torques U\, u 2 


Another method to improve the steady- state error in the presence of inaccurate 
measurements is to utilize an integral term to the PD plus gravity control 
law. Therefore, the feedback law for PID plus gravity law can be expressed 
as follows: 
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Fig. 18 q x , q 2 for PID plus gravity feedback control 
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Fig. 19 Actuator torques u\, u 2 for PID plus gravity feedback control 



-K v q - K p q - K, 


qdt + g(q). 


u 



The simulation results for PID plus estimated gravity in the presence of 
10% mismatch in measurement of physical parameters are depicted in Figs. 18 
and 19. The control gains were chosen as K p = diag{ 10, 10}, K f = diag{2, 2}, 
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Xi (m) 

Fig. 20 The end-effector position for the task-space PD plus gravity regulator 


and K v = diag{ 5, 5}. It is shown that both accuracy and control torques are 
improved significantly. 


Task-Space PD Plus Gravity 

The task-space PD plus gravity control law Eq. 28 is applied for two-link 
robot manipulator to control its end-effector position to reach the desired position. 
The controller gains were chosen as K p = diag{ 200, 200} and K v = diag{ 50, 50}. 
The initial position of robot was x = [0.4, 0], and the desired end-effector position 
was considered as x = [0.1, 0.3]. The end-effector position is depicted in Fig. 20. 
The robot arms are depicted for different snapshots. The actuator toques are 
depicted in Fig. 21. 


Adaptive Control 

The adaptive feedback law Eq. 46 together with the parameter update law 
Eq. 47 is applied to the two-link robot manipulator for trajectory tracking. 
The controller gains were chosen as K s = diag{ 15, 15} and A = 10. The gain 
for update law was chosen as L — 50. The initial position and velocity of 
the robot’s joints were q = [0, 0] and q= [0,0] , respectively. The desired 
trajectory was set as 
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Fig. 21 Actuator torques U\, u 2 for the task-space PD plus gravity regulator 



0.1 + O.lf + 0.2 sin 
0.1 0.1 1 H - 0.2 cos 



(107) 


The initial values of the unknown parameters were chosen as 

A rji 

0d{ 0) = [0, 0, 0, 0, 0, 0, 0] . The position and the desired reference of joint one and 
two are depicted in Figs. 22 and 23. The velocity and the desired velocity of joint one 
and two are depicted in Figs. 24 and 25. The actuator toques are depicted in Fig. 26. 


Approximate Jacobian Set-Point Control with Uncertain 
Gravitational Force 

For this part, the approximate Jacobian set-point control in the presence of uncer- 
tainties in gravitational force was considered for the simulation. The approximate 
Jacobian regulator together with adaptive gravity compensation (Eqs. 75 and 76) 
was applied for the two-link robot manipulator to control its end-effector 
position to reach the desired position. The controller gains were chosen as 
K p = diag{ 1000, 1000} and K v = diag{ 100, 100}. The adaptive gains were chosen 
as L — 500 and a = 50. The initial values of the unknown parameters were chosen 

T 

as <p( 0) = [0,0] . The initial position of the robot’s end effector was x = [0.4, 0], 
and the desired end-effector position was set as i = [0.1, .3]. The end-effector 
position is depicted in Fig. 27. The robot arms are depicted for different snapshots. 
The actuator toques are depicted in Fig. 28. 
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Fig. 22 


The position and the desired reference of joint one for the adaptive trajectory tracking 



Fig. 23 


The position and the desired reference of joint two for the adaptive trajectory tracking 


Adaptive Jacobian Tracking Control 

In this case, the adaptive Jacobian tracking control Eq. 90 together with the 
kinematic parameter update law Eq. 91 and dynamic parameter update law 
Eq. 92 is applied to the two-link robot manipulator for trajectory tracking in the 
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Fig. 24 The velocity and the desired velocity of joint one for the adaptive trajectory tracking 



Fig. 25 The velocity and the desired velocity of joint two for the adaptive trajectory tracking 

presence of uncertain kinematics and dynamics. The controller gains were chosen 
as K p = diag{ 800, 800} and K v = diag{ 80, 80}. The adaptive gains were 
chosen as L k = 5 and L d = 5. The initial values of the unknown parameters were 

T 

chosen as <p(0) = [0,0] . The initial position and velocity of robot’s end effector 
were x = [0, 0.15] and x d = [0, 0], respectively. The desired trajectory was set as 


1924 


C.C. Cheah and R. Haghighi 



Fig. 26 Actuator torques U\, u 2 for the adaptive trajectory tracking 



Xi (m) 

Fig. 27 The end-effector position for the task-space approximate Jacobian regulator with adap- 
tive gravity compensation 


53 Motion Control 


1925 



time (s) 

Fig. 28 Actuator torques u x , u 2 for the task- space approximate Jacobian regulator with adaptive 
gravity compensation 



0.15 + 0.05 sin 
0.15 + 0.05 cos 



(108) 


The initial values of the unknown kinematic and dynamic parameters 

(0*(O) and 0^(0)) were chosen randomly from the interval [0, 10]. The 

end-effector position is depicted in Fig. 29. The robot arms are depicted for 
different snapshots. 


Summary 

This chapter introduces robot control design methods using the Lyapunov based 
method. By utilizing the physical properties of the robot kinematics and dynamics, 
several set-point and adaptive tracking controllers have been presented in both the 
joint space and task space. A simple motion controller that is effective for set-point 
regulation is the PD controller with gravity compensation. For tracking control 
tasks, the adaptive controller has been presented for a robot manipulator with 
uncertain dynamics. Using sensory feedback of the robot end-effector position in 
the task space, approximate Jacobian set-point controllers and adaptive Jacobian 
tracking controllers have been presented for robot manipulators with uncertainties 
in both kinematics and dynamics. 
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Fig. 29 


The end-effector position for the task-space adaptive Jacobian tracking control 


Appendix 1 

Preliminaries on control theories: 

Consider the following nonlinear system: 

x=f(x,t) (109) 

where x E R n is a vector of state of the system and /(.) is a nonlinear function. 


Definition A1 The equilibrium points of the system Eq. 109 are defined as the state 
vectors x e of x for which if at specific time x = then x will remain unchanged 
for all t > to. In other words, at equilibrium points the state of the system satisfies f 
(x e ) = 0. 

It is often important to know whether the equilibrium point is stable or not. In the 
following, a definition of stable equilibrium point is put forward: 

Remark A1 It can be always assumed that the equilibrium point is zero by using 
change of variable y = x - x e . 


Definition A2 The equilibrium point x e = 0 of the system Eq. 109 is said to 
be stable if for any s > 0, there exists 3 > 0 such that if x(0) — Oil <3, then 


I x(t) 


Oil < e for all t > 0. It can be mathematically represents as follows: 
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Vs > 0 35 > 0, if ||x(0) — 0 



x(t) — 0 


< sfor t > to . 


( 110 ) 


To examine the stability of the equilibrium point, the Lyapunov theory can be 
utilized. The main advantage of the Lyapunov theory is that the stability of the 
system can be determined without solving the differential equations of the system. 
Moreover, the Lyapunov theory can be used to design controllers that stabilize 
nonlinear systems. 

Before presenting the Lyapunov theory, a certain class of functions is introduced 
as follows: 


Definition A3 A continuous function V : R n x R + — » R is a locally positive defi- 
nite function (Ipdf) if for some s > 0 and some continuous, the following conditions 
hold: 


I I:V( 0 , 0=0 

\//:L(x,0 >0 WxeB s ,\/t> 0 


( 111 ) 


where B 8 is a ball of size s around the origin which is mathematically expressed as 


B 


{x G R 


n 


X 


< s}. 


In addition, V is a positive definite function (pdf) if the condition (II) is true for 


all x E R. 


If in condition (II) V (x, t) > 0, then V (x, t) is a (locally) positive semi-definite 


function. 


T 

Remark A2 If V (x) = x Mx, where M is a real symmetric matrix, then V is a pdf if 
and only if M is a positive definite matrix. 


Theorem A1 Lyapunov stability theorem: Suppose x e = 0 is an equilibrium point 

of the system Eq. 109. Let V(x, t) be a nonnegative function with derivative W along 
trajectories of the system dynamics: 


dV 

dt 



( 112 ) 


(i) IfV (x, t) is a locally positive definite function andV is a locally positive semi- 
definite function in x and for all t, then the origin of the system is locally stable. 

(ii) If V (x, t) is a locally positive definite function and V is a locally positive 
definite function in x and for all t, then the origin of the system is locally 
asymptotically stable. 

(iii) IfV (x, t) is a positive definite function and V is a positive definite function in x 
and for all t, then the origin of the system is globally asymptotically stable. 

If the function V (. x , t) exists in the above theorem, then it is called a Lyapunov 
function. 
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Remark A3 The Lyapunov stability theorem only provides sufficient conditions for 
the stability of nonlinear systems; hence, the failure of finding a Lyapunov function 
does not prove the instability of the nonlinear system 

In the case that —V(x,t) is a positive semi-definite function, the Lyapunov 
stability theorem cannot provide any information on the asymptotic stability 
of the system. To deal with the stability of nonlinear autonomous systems when 

—V(x) is a positive semi-definite function, LaSalle’s invariance principle has been 
presented. 


Lemma A1 LaSalle's invariance principle: Let V : R n — » R be a positive definite 

function such thatV(x) < 0 in compact set Q. Let D be the set of all points in Q 

where V{x) =0. Therefore, every solution of the system x = f(x) starting in Q 
approaches to the largest invariant set inside D. In particular, if D contains no 
trajectories other than x = 0, then 0 is locally asymptotically stable. 

LaSalle’s invariance principle enables one to conclude asymptotic stability only 
for autonomous systems. For non-autonomous systems, Barbalat’s lemma can 
be used. 


Lemma A2 Barbalaf s lemma: If a function V ( t , x) satisfies the following 
conditions: 

(i) V(x, t ) is lower bounded. 

(ii) V{x,t) is negative semi-definite. 

• • • 

(iii) V(x, t)is uniformly continuous in time or equivalently V(t,x) is bounded. Then, 
V(x, t) goes to zero as t — » oo. 


Appendix 2 


Parameters of dynamic Eq. 4 of the two-link robot manipulator which is depicted in 
Fig. 1: 

Elements of inertia matrix M(q) are 


4 4 

M\\ = -M\L 2 cl + -M 2 L 2 2 + M 2 L\ + 2M 2 L x L c2 cos {q 2 ) 

4 2 

M 12 = - M 2 L c2 + M 2 L\L c2 cos (q 2 ) 

4 2 

M 2 i = -M 2 L c2 + M 2 L\L c2 cos (q 2 ) 

4 9 

M 22 — -M 2 L c2 . 


(113) 
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Elements of matrix C(q, q) are given as 


M 2 L l L c2 q 2 sin (q 2 ) 

-M 2 L\L c2 \q x +q 2 ] sin (q 2 ) 


C 11 = - 

C 12 = 

C 2 1 = M 2 L\L c2 q x sin (<y 2 
C 22 = 0. 


Elements of gravitational force matrix are 


£i = {M\L C i + M 2 L\)q cos (^) + M 2 L c2 q cos {q x + q 2 ) 

g 2 =M 2 L c 20 COS (<?! + <? 2 ) 

where g is the gravity due to acceleration. 


(114) 



Appendix 3 


Part of the proof of the task-space control law Eq. 55: 

Differentiating the Lyapunov candidate V (expressed in Eq. 58) with respect to 
time gives 



Substituting the closed-loop Eq. 57 into Eq. 116, using property 2 and simpli- 
fying, yields 


V=- q r D{q)q - q T J (q ) (K p s{ x) + K v x) 

- aq T {C(q,q) + D(q) - M(q))f (q)s(x) 

— as(x) T K p s(x) + aq T M{q)J (c[)s[x) 

+ aq T M(q)J ^ (g)i(x) + x T K p s (x) . 

Using Eq. 14, V can be written as follows: 


(117) 



Since s(x) is bounded, there exist constants c 0 > 0 and C\ > 0 so that 
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aq T {C{q,q) + D(q) — M(q)}j\q)s(x) + aq T M[q)J (yq)s{x) 


t 


+aq T M(q)f {c[)s{x) < aco||^|| 2 + ac\ 

Substituting inequality Eq. 119 into Eq. 118 yields 



V < 



T 


q T \j T (q)K v J(q) -\-D(q) — acol \q — as(xY ( K 


p 



q T \r ( q)-J 


T 


T 



KpS 



Hence, Eq. 120 can be written with respect to J(q) and J (q) as follows: 


V < 


q T [J T (q) K V J (q) + D(q) - acol}q 



T 


K 


p 



+ q T f \q)K v j[q)q + q 1 J 1 (< q)K p s(x ) . 


•T tT 


(119) 


( 120 ) 


Let J{q) be the Jacobian estimation error which is defined as J(q) = J(q) — J(q). 


( 121 ) 


Since the Jacobian matrix contains trigonometric functions of q, ||/(g)|| < bj 


and Eq. 121 can be rewritten as 


V < 



min 


(j T {q)K v J(q) + D(qj) - ybjX 


max 


+ 


max 




oc ( /t m i n ( Kp 



( 122 ) 


where y is defined in Eq. 56. Since 


2 






(123) 


therefore, Eq. 122 is simplified as follows: 



(124) 

Equation 124 can be written as follows: 


such that 



51 

5 2 




A 


max 
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Abstract 

Robotic force control refers to the control and programmable specification of the 
interaction forces between a robot end effector and the work object, where either 
the end effector or the work object is attached to the robot manipulator. A 
rudimentary approach is to consider the joint torques and controlled variables 
and then to compute those torques such that a presumably rigid manipulator 
executes the desired forces. In practice, however, manipulators are not rigid, 
joint torques are accomplished from servo-controlled motors via joint trans- 
missions with nonlinear dynamics, and the control structure has to obey stake- 
holder aspects in industry. Based on algorithmic insights and experiences from 
industrial applications, the subject matter of force control is explained with core 
scientific approaches as the starting point, then extending the descriptions such 
that the industrial aspects are covered via established principles for joint servo 
control. 


Introduction 

Robotic force control refers to the control of robot motion such that forces and 
torques between the end effector and the workpiece are both controlled by the robot 
controller and programmable (or instructable) by the end user. While other robot 
motion control (and, hence, associated mathematical models) can be covered in that 
scope, with a corresponding subsystem in mind, force control is more challenging 
due to its crosscutting nature: 

• It is straightforward to accomplish position-controlled robot movements such 
that they are robust and stable irrespectively of the dynamics of the user’s 
application. Force control, on the other hand, includes the application dynamics 
such that the control properties are much more challenging to manage. 

• Force control is typically applied in some degrees of freedom of the end effector 
motion, while the complementary subspace remains position controlled. That is, 
while position control can be designed without force control in mind, force 
control needs to cope with the aspects of position control too. 

• In the case of pure position control, the robot programmer may be aware of 
visible issues such as kinematic singularities and cope with these with support of 
control system functions. That programmer or user may also, without safety and 
robustness problems, remain unaware of hidden limitations such as joint friction 
and torque saturations, such limitations being well managed for position control 
with a modern robot controller. The corresponding level of usability of force 
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control is more challenging, both due to lack of control system solution and due 
to the fact that forces are only indirectly visible within the physical robot 
application. 

All together these items, in combination with the lack of awareness and 
established practices in force control, call for the further introduction to application 
aspects as of the remaining part of this introduction, and then in the next section, 
some fundamental control aspects are presented. The core of the chapter follows 
thereafter, covering direct and indirect force control in two subsections, respec- 
tively. The final three sections cover the tailoring of the control toward industrial 
applicability, examples of industrial application. 


Application Demands 

Robots have been working in applications with programmed force interaction since 
they were introduced in the 1970s, even if the forces came from secondary effects 
of the setup and programming. Either compliant tooling was used or the servo 
control was made compliant by detuning the servos. In both these cases, there was 
no explicitly stated desired force, and hence, such robot applications were difficult 
to tune and hardly reusable. Nevertheless, detuning the servo to accomplish com- 
pliance (at that time implemented in analog hardware), and having that dynamically 
enabled from the robot program, was in the ABB case a standard option in the S2 
control system that was launched in 1982. To be able to accomplish a force by 
softening the servos and programming a path behind the desired one is still today a 
required feature. 

Similar to the benefits of having kinematic models built into the controller, 
thereby permitting the user to express motions in operational space rather than in 
the joint space of the robot, it is highly desirable to specify and express desired 
forces in the robot program. That is not only a matter of simplification for the robot 
programmer; it is also necessary for portability and reuse of robot programs. The 
need for force control in operational space basically comes from two different 
application demands that both have to do with managing (the always more or less 
present) uncertainty: 

Position Proximity: Motion in close proximity to work-cell objects entails a risk 
for collision or undesired contacts as a result of uncertainties in work object 
positions and orientations. In particular for stiff robots and stiff work objects, the 
impact forces may build up very fast, which requires very high-bandwidth force 
control in order to avoid damage. Moreover, the impact forces reflected on the 
robot structure will give deflections which, in turn, will cause inaccuracy of the 
position control. Hence, any presence of position uncertainties in a robotic work 
space provides a justification for force control. 

Force Interaction: As originally available in joint space, another purpose of 
robotic force control is to make the robot capable of accomplishing the specified 
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interaction forces such as contact forces, the result being that the robot can sense 
its surroundings and work space and apply a suitable force on objects and 
surfaces also when the exact position of the work object is not known. Examples 
from automotive power train assembly where force control is useful include 
assembly of torque converters, clutches, and pistons. Other examples are found 
in machining such as grinding, polishing, milling, and drilling. 

Both position proximity and force interaction relate to the dynamic and geomet- 
ric properties of the robot. 


Robot Properties 

As for robot design, the traditional application areas for industrial robots involve 
highly repetitive operations such as welding. Hence, robotic development has been 
focused on high precision (repeatability) in repetitive operations. For example, a 
standard ABB IRB4400 robot for 60 kg payload has a repetitive accuracy of ±0.05 
mm. If, however, the same robot is given a new coordinate that it has never visited 
before, the accuracy is in general around ±3 mm, which is 60 times the size of the 
repetitive accuracy. Today it is possible to buy the same robot with an option pack 
that includes calibration for high accuracy. This will improve accuracy to be within 
±0.5 mm, which is still 10 times the repetitive accuracy. In addition, this accuracy 
is only guaranteed under the condition that no unmodeled external forces act on the 
robot, i.e., only during motion in free space. 

For contact operations such as polishing, drilling, and riveting, the effects of the 
limited mechanical stiffness of the robot must be taken into account in order to 
maintain the positioning accuracy, which is not feasible unless a detailed model of 
the particular robot specimen is available. Robots in interaction with its work space 
and environment will therefore increasingly need more sensory information - e.g., 
force, vision, and touch - and information processing to support fast feedback 
control, e.g., control software architectures with sensor interface (Nilsson and 
Johansson 1999; Blomdell et al. 2005) and high-bandwidth signal processing 
capacity. Such robot systems, as of the topic of this chapter, not only need suitable 
algorithms that by means of the control system give the robot its desired properties 
in the nominal force interaction case but also need to be engineered with modularity 
as a primary concern. 

For position-controlled robots, as covered by the other chapters of this book, 
modularity is of course valuable for efficient engineering of prototypes or products, 
as well as for repair and maintenance. For force-controlled robots, such modularity 
still applies, but there is also a stakeholder or business reason that so far has been 
overlooked in the literature. A force-controlled robot, as will become evident at the 
end of this chapter, is based on feedback around dynamics of both the robot (from 
the technology provider) and the application (from the user), with risks for inter- 
fering control boundaries and saturations, such that any deficient robustness and 
performance can be “blamed on the other part/party.” Successful applications 
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therefore require a higher degree of modularity and separations of concerns, 
including those of servo control of individual joint, despite a crucial dependency 
between the robot properties and the required control principles. 


Approaches 

The dynamics and inaccuracies of the robot and its environment imply that both 
desired and accidental transitions between force interaction and free-space motions 
need to be well controlled. For instance, vibrations or instabilities may not occur 
regardless of the stiffness of the environment, despite that disturbances of the partly 
unknown stiffness enter the control loop as a partly unknown gain. Specifically, in 
force interaction with a stiff surface during execution of a robot task, interaction 
dynamics may involve diverse forces and torques of inertial, viscous, elastic, and 
dissipative nature as well as friction. Whereas a detailed, accurate robot model may be 
developed, it would be difficult and impractical to have detailed compliance models of 
work objects in all their various stages of manufacturing. This circumstance sets a 
limit to task planning and accuracy using (nominal) position-based motion control. 

Without resorting to explicit force feedback control, force interaction control 
can be approached using either passive compliance resulting from structural com- 
pliance of end effector, links, joints, gear box, and backdrivable actuators or via the 
so-called indirect force control where no explicit force feedback loop is closed 
(Chiaverini et al. 1999). Whereas the passive compliance approach is simple and 
does not require any force sensors or feedback control, it lacks accuracy and 
flexibility and/or needs specifically tailored end effectors uniquely designed for a 
specific task. For intrinsically safe interaction with humans, mechanically compli- 
ant lightweight links have been a commonly adopted solution. 

These basic approaches toward fulfillment of the needs within manufacturing 
will now be detailed and refined, starting with fundamentals for high-performance 
control. 


Model-Based Control 

The productivity needs within flexible manufacturing imply a need for high- 
performance robot motion. The established and very successful way to accomplish 
such performance both in academia and in industry is by means of model-based 
control which applies both to the control of the individual servos and for the overall 
multivariable manipulator control. Therefore, proper modeling is essential for 
performance, and since force control also needs to deal with dynamics of the 
environment, we have to pay special attention to modeling and control design 
such that the resulting robot motion properties are manageable and understandable. 
That is, in cases where there is a risk for shattering or undesired vibrations, the 
system needs to be designed such that the sources of the problems can be deter- 
mined. Otherwise, no force control product will be successful with obvious 
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implications for robot applicability. Note that this is another viewpoint of the 
algorithmic approach that is dominant in the literature (Siciliano and Khatib 2008). 

Thus, while motion control has been covered in the previous chapter, we will 
now revisit the modeling such that industrially applicable force control can be 
presented in a coherent manner in the following sections. We start with well-known 
rigid-body manipulator models, gradually approaching the joint-wise dynamic 
effects that make the purely academic approaches less applicable in industry. 
Apart from the nominal-intended control, the saturated and exceptional cases also 
need to be considered, as in any engineering of industrial controls. 


Models and State-Space-Based Control 


The principles of feedback control and feedforward control are assumed to be 
known. Control schemes are, however, presented in a variety of ways in the 
literature, often such that the basic components are mixed in confusing ways. 
While a feedback controller for a linear time-invariant system, such as a simple 
robot joint servo, may consist of a linear digital filter between a sensor and an 
actuator, the notion of model-based control implies an engineering need for 
maintaining and computing the filter parameters based on parameters of the phys- 
ical model. It is then important to maintain the understanding of how the control 
design depends on the models, which could be nonlinear. 

The basic control loops in robotics aim at reducing effects of load disturbances 
while not injecting too much measurement noise into the system, making the closed 
loop insensitive to variations in the process and making the desired outputs accu- 
rately follow command signals/desired reference trajectories. To be prepared for 
the use of nonlinear models, and to maintain the understanding of the physical 
states, it is advisable to base the control on a state-space formulation. Figure 1 
shows such a generic structure. 

In the linear and time -invariant case, the feedback gains will be an array of 
constants (a proportional controller in the scalar case). The u re f is the required 
control action in case of a perfect model knowledge, so feedback is used only 
for handling unknown variations. For normal robot motors, the signal is equiv- 
alent to the desired acceleration times the joint inertia, and in more general 
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Fig. 1 General formulation of state-space-based feedback control (Astrom and Wittenmark 1990) 
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terms, this is the inverse dynamics of the system. Since the system as such will 
have integral action, because that is what nature does (and there will be at least 
as many poles as there are zeroes in the case of linear process dynamics), the 
inverse will of course have derivative control action. Numerical differentiation 
of the possibly stepwise changing set point to obtain the state references can of 
course be problematic since state references typically need to be smooth. 
Therefore, the state-space model needs to include some low-pass filtering, and 
there can be a need for providing smooth or planned reference velocities and 
acceleration from the planning level. 

The state estimation includes model-based dynamics both for the internal (not 
measurable) states and for states representing the disturbances (such as the I part of 
a PID controller representing a constant unknown disturbance). When all states are 
measurable and the disturbances are managed without an internal model, as in the 
case of ideal computed torque control of a rigid-body manipulator, the state 
estimator will simply forward the joint value vector q , or the state estimator can 
be a stateless transformation if the feedback loop is closed in the operational space. 

Rigid-Body Manipulator Dynamics 

For motion control, rigid-body dynamics is often modeled by Euler-Lagrange 
equations in the configuration space (joint space). For a serial manipulator with 
n links and n joints with q denoting the vector of joint position variables, the Euler- 
Lagrange equations for fully actuated unconstrained motion are 

M(q)q + C(q,q)q + G(q) = t, q G R'\r G R” (1) 

where M(q) is the inertia matrix, C(q,q)q the vector of Coriolis and centripetal 
forces, and t the actuated joint forces. The inertia matrix M(q) is a symmetric, 
positive definite matrix. From the serial robot kinematics, the joint positions result 
in an end effector positions and orientations denoted by the vector x G R in the 
operational space (task space). From the kinematic mapping x=f(q) relating joint 
positions to end effector positions, the velocity mapping form joint velocities to end 
effector velocities can be found as 





where J(q) is the Jacobian matrix. When J(q) is invertible, the operational space 
dynamics obey the motion equations 


A(q)x + T(q, q)x + y(q) = J T (q)r 

where A(q) is the pseudo-inertia matrix 



A (q) = J (<? )M{q)J{q) 


( 4 ) 
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r(q,q)=J T (q)C(q,q)J 1 (q) - A(q) J *(<?) (5) 

r(<?) = J~ T {q)G(q) (6) 

During constrained motion with force interaction with the environment, the 
motion equations under the influence of external end effector forces F in the 
operational space are modified to 

M(q)q + C(q,q)q + G(q) = t + J T (q)F, q e R'\t € R n ,F e R 6 (7) 



A(q)x + r(< 7 , q)x + y(q) = J T (q)r + F 



Linear Joint Actuation Dynamics 

In contrast to Eq. 1, most industrial robot systems are not actuated by direct-driven 
motors. Extension of the robot model by taking the drive chain into account has the 
benefit that the dynamic interaction between joints is attenuated and to some extent 
decoupled by the often highly geared motors (Fig. 2). On the other hand, additional 
resonances and significant disturbances like backlash and friction are introduced; 
see, e.g., the joint model introduced by Spong in (1989). Most often the joint 
position sensors are colocated with the actuation (i.e., mounted on the motor 
axis). For most industrial robot systems, explicit velocity sensors, such as tachom- 
eters, are often not present in motor drives, so velocities may need to be estimated 
which may put constraints on the achievable motion and force performance when 
used for feedback. 

The Spong model offers an extension of the rigid-body modeling with compli- 
ance of the actuator and drive chain dynamics in the form of the motion equation 


/ 

Actuator 

space 

\ 



Fig. 2 Motion spaces of relevance for force control 
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M{q)q + C(q, q)q + G(q) + K a (q - q a ) = 0 (9) 


M a q a + D a q a - KJq - q a ) = u 




~M{q) 

0 " 


• • 
q 

+ 

0 

M a _ 


Aa_ 




where M a , D a , and K a are the inertia matrix, damping matrix, and stiffness matrix of 
the actuator dynamics; q a is the vector of actuator position variables; and u is the 
torque reference signal to the actuator. 

Note that these equations can be interpreted as an underactuated system. 


Nonlinear Joint Actuation Dynamics 

Torque actuation control in the presence of nonlinearities is important in industrial 
robotics (Fig. 3). Whereas some possible nonlinearities of the joint actuation have 
gotten attention in the literature (Ozgoli and Taghirad 2006), some such nonline- 
arities are the less relevant ones. Specifically: 

1. Without motor-current feedback, the back EMF, as well as the temperature- 
dependent torque constant, of the joint motor is a nonlinear effect since the 
desired linear relation between motor torque and acceleration is not 
implemented. The speed, torque, and time dependencies have inspired research 
on adaptive control with respect to those effects. That is, however, of no value 
since all industrial controllers use actuator current feedback so that the motor 
voltage can be controlled and so that the nonlinear effects become insignificant. 

2. Another nonlinear effect is the torque saturation, which is due to combinations of 
the drive electronics power and the permitted torque on the mechanical system. 
While this has also inspired research (Spong 1989), it is another example of 
non-applicable solutions. There are ways to manage torque saturations by means 



Fig. 3 Generic motor control for joint actuation with high-bandwidth inner control or torque by 
means of current feedback 
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of anti- windup protection, appropriate reference models (e.g., in trajectory 
generation), adaptation of path coordinates (Dahl 1992), and backpropagation 
of saturations as in the process industry (Elmqvist 1978). 

3. Friction is also a source of nonlinear control analysis, for instance, to avoid limit 

o 

cycles (Olsson and Astrom 2001). However, using tight inner-loop feedback and 
considering the two previous items, there would not appear to be any stability 
problem associated with friction. If friction causes stability problems, then the 
robot mechanics need to be improved. In practice, friction compensation can be 
implemented by feedforward control (part of the inverse dynamics; see Fig. 1), 
without harming the stability margin of the system. 

In total, the published techniques for handling nonlinear joint actuation 
dynamics are hardly applicable. On the other hand, there are physical and 
experimental evidence that such nonlinear effects are crucial, so what are they? 
To answer that question, we should look into the mechanical structure of robot 
joints. 

The resonant behavior caused by the compliance between the inertias of the 
joint transmission is an effect that is best managed in the local joint- wise servo 
control, with implications for the overall manipulator control according to Eq. 11. 
Considering the possible backlash in the gearbox of each joint, there is in a 
similar way a need for stabilizing the motor locally, and, then in the manipulator 
control, to deal with the closed-loop control properties, which also would need to be 
tailored to the manipulator control. Otherwise, consider the implication of backlash 
on Eq. 7 and in the following control design. That is, for low motor torques the 
diagonal elements of the matrix M will be very small, and the gains tuned for the 
full inertia will result in joint motors vibrating within the gearbox play. The 
conclusion is that for designing robotic force control, we need to consider also 
the actuator space in addition to the joint space and the operational space (Fig. 4, 
Table 1). 



Fig. 4 Robot joint model with nonlinear dynamics, depicted as a prismatic joint for simplicity 
(Olofsson 2013). For definition of notation, see Table 1 
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Table 1 Variables and 
parameters in the nonlinear 
robot joint model depicted 
in Fig. 4 


T 

Torque from controller to motor 


External disturbance torque 

q a 

Actuator angle (motor side) 

q 

Joint angle (link side) 

a h 

Backlash angle 

kn 

Nonlinear spring constant 

T f 

Friction torque 

Jo 

Actuator/motor inertia 

J, 

Transmission inertia 

J, 

Link side inertia 

Je 

End effector inertia 

ka 

Stiffness accomplished by controller 

da 

Damping accomplished by controller 


Basic Approaches to Force Control 


Since each robot joint is driven by a torque (or force in case of prismatic joints), it is 
natural to approach the force control problem by directly mapping the force 
(or torque for the orientations) errors into torque compensations of the motors 
(Khatib 1987; Khatib and Burdick 1986). That implies a direct relationship between 
the external force and the joint torques (or the actuator torques according to the 
previous section), which is the basis for the so-called direct force control that will 
be covered next. Thereafter, the indirect force control will be covered, also consid- 
ering the inner loops for joint control. 


Direct Force Control 


In direct force control we ignore the dynamics of the environment. Instead we 
consider the interaction force as such, first explicitly with pure force control and 
then combining force and position control in a hybrid manner. 


Explicit Force Control 

Explicit force control may be approached by the force feedback control law 


* =J T (<l)(r(q,q)x + y(q)) 


+ J T \q)(—F r H -A(q)x r — A(q)D(x — x r ) T -A(q)K(x — x 


r 



(ii) 


where F r , x r , x r , x r are reference values for the forces, position, velocities, and 
acceleration and D and K are feedback parameters to be designed. The closed-loop 


dynamics of the errors x = x — x r ,F = F — F 


r 


A(q) (i + Dx+Kx) = F - F r 


( 12 ) 
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During unconstrained motion with F = 0, F r = 0, the error dynamics are 
reduced to 

Z+Dk + Kx = 0 (13) 

for which the velocity feedback parameter matrix D and the position feedback 
parameter matrix K may be chosen according to principles for optimal control or 
robust control or impedance control (Johansson and Spong 1994; Zhou and Doyle 
1998; Hogan 1984; 1985). 

Hybrid Force/Position Control 

Methods involving force control in some geometric degrees of freedom and 
motion control in others that are known as hybrid force/position control or 
hybrid force/motion control were proposed by Raibert and Craig (Raibert and 
Craig 1981) and extended by Mason (1981). In hybrid force/position control, the 
decomposition of the robot motion into force control and position control is 
described in Fig. 5. 

By decoupling the different directions by projection matrices, the control task 
may be resolved into tasks of force control and trajectory motions in the various 
different dimensions of operational space. 

Note that in Eqs. 7 and 8, the force F may depend on external dynamics, whereas 
in Eq. 12 it is assumed that the effect of any environmental dynamics can be 
neglected. Since a robot user will be unaware of this situation, it is not surprising 
that the direct force control has problems with unstable behavior during contact 
with the environment. Particularly in stiff contact, impact phenomena which 
prompt for control schemes for the noncontact-to-contact transitions (Eppinger 
and Seering 1987; Johansson et al. 2009) have been analyzed. 





Fig. 5 Selection of control spaces in hybrid force/position control (Adapted from Craig 1989) 
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Indirect Force Control 


In contrast to direct force control, where the interaction aims at explicitly control- 
ling the contact force to a desired value, indirect force control may achieve a 
compliant contact behavior through motion control, possibly, but not necessarily, 
using force measurements for feedback (Siciliano and Villani 1999). 


Stiffness Control 

A manipulator given a suitable stiffness by means of position control will react with 
interaction forces when pushing against the environment. The pushing can be on the 
user level of the system as in the case of a compliant end effector as mentioned in 
section “Introduction.” In a similar way the user has to maintain the force indirectly 
when position servos are detuned, which we refer to as stiffness control. 

The drawback of not having a well-defined and reusable position specification 
when stiffness control is being used motivates more advanced indirect force 
control. 


Impedance Control 

Impedance control was introduced by Hogan as a unified way of controlling robots 
in both free-space motion and constrained environments under contact situation. 
Impedance control was proposed as a remedy to the undesired transient behavior 
which switching between free-space and force-controlled motion can induce 
(Hogan 1985). It can be viewed as regulating a generalized mass -spring -damper 
behavior - a desired resulting dynamic combination of contact force, position, and 
velocity - which may differ quite significantly from the original mechanical 
properties of the manipulator and its environment (Nagai and Yoshikawa 1994). 

There exist several suggested implementations of the general impedance control 
relation (see, e.g., Hogan 1989; Glosser and Newman 1994; Johansson and Spong 
1994; Kelly et al. 1989; Johansson and Robertsson 2003), which exhibit different 
dynamical properties and robustness to model uncertainties and heavily depend on 
the properties of the manipulator actuation (direct drives vs. cascaded servo control 
drives). 

Example 1 Lyapunov design of impedance control. Consider a mechanical system 
with mass m and position coordinate x affected by a constant contact force F and a 
control force u with the equation of motions 


mx = u + F 


Assume that the equation 


dx + kx = F 

where constants d and k would represent a suitable impedance relationship between 
force F and position x. Introduce the Lyapunov function candidate 
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1 


1 


V (x, x) = - m ( dx F kx — F) z + - kx 2 — xF 


2 


2 


1 

2 


m(dx F kx — F ) 2 + — - (&x — F) 2 F 2 


2k 


with the partial derivatives 


dV 

dx 


md (dx F kx — F) 


dV 

dx 


and the time derivative 


• dV . dV 
V = —x + — x 


dx 


dx 


2k 


mk(dx + kx — F) + kx — F 


(mk(dx + kx — F) + kx — F)x + md(dx F kx — F)x 


(mk(dx F kx — F) F kx — F)x F d(dx F kx — F)(u F F) 


It is straightforward to verify that, for instance, the control laws 


u 


F 


jn + 


km 

d 


(dx F kx — F) 


1 — km 


(kx-F) 


for an arbitrary constant /r > 0 results in a negative-definite derivative 


(14) 

(15) 


(16) 

(17) 


(18) 

(19) 

( 20 ) 


V = -n(dx + kx- F ) 2 1 (/ kx - F f 

with the stationary point 

dxF kx — F = 0 Impedence property (21) 

kx — F = 0 Static stiffness property, (22) 

i.e., we have a stable system with the desired impedance properties. 

As an alternative to the hybrid force/position control concept, a combination of 
force-controlled and impedance-controlled directions was suggested in Anderson 
and Spong (1988). 


Admittance Control 

Impedance control with inner motion control is often nowadays sometimes referred 
to as admittance control, and in the sequel we will use the term impedance control 
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for the direct-driven case, also referred to as dynamics-based impedance control. 
Another usage of the terminology is to characterize impedance as the velocity-to- 
force response, whereas admittance characterizes the compliance or force-to-veloc- 
ity response in contact operations. 

As mentioned, most industrial robot systems are based on a low-level drive 
control for each motor in combination with feedforward based on the robot dynam- 
ics. The dominating servo structure is the cascaded architecture with position, 
velocity, and torque/current controllers for each individual joint; see Figs. 1, 2, 
and 3. Although not optimal with the restricted number of parameters, it is an easy 
and straightforward structure to tune with possible gain scheduling and allows for a 
simple but important feedforward structure of velocity and torque references which 
considerably improve the tracking performance. 


Comparing Concepts on a Single Joint 

We will illustrate indirect force control through the design of compliance control, 
impedance control, and admittance control, respectively, in a simple 
one-dimensional link example. 

Example 2 Consider the rotational dynamics of a single-link robot 

r\ 

m£ q + dq + m£g cos q = r + r d (23) 

where m is the mass of the (homogeneous) link of length £, d is the viscous 
damping, g is the gravity of earth, r is the applied torque, and r d is an external 
torque acting on the joint like a matched disturbance; see Figs. 1 and 3. 

Compliance Control: A control law based on computed torque/feedback line- 
arization in combination with a standard PD controller 

r = ( m£g cos q + dq) + k p (q r - q) + k v (q r - q) (24) 

will result in the controlled dynamics 

mi q + k v (q - q r ) + k p (q - q r ) = T d (25) 

where k p and k v are the feedback gains for the PD controller, corresponding to the 
proportional gains in cascaded position and velocity controllers, respectively; see 

Fig. 7. 

Without any integral part in the controller, it is straightforward to see that the 
static angular error ( q r — q) is inversely proportional to k p and proportional to the 
external torque T d . Thus, by detuning the controller parameter k p , one can decrease 
the stiffness of the controlled joint with respect to an external disturbance (and by 
detuning k v one changes the transient behavior corresponding to the effect of a 
viscous damper). 
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With similar pose-dependent transformations as used in Eq. 4, a resulting 
Cartesian stiffness and damping can be generated by several coordinated joint 
controllers - the so-called Cartesian soft servo or Cartesian compliance control. 
Note that there is no feedback of the external torque in the controller and the 
original inertia is not changed. 

Impedance Control: Returning to the single link of Eq. 23, a desired impedance 
relation in the form of a linear second-order system 

Mq+Dq+Kq = r d (26) 


can be achieved by the torque 



m 


(mig cos q + dq) -z d + — (z d + mq 0 + K(q 0 - q) + D(q 0 - q)) 


M 



Here M, D, and K have the interpretation of the desired inertia, damping, and 
stiffness with respect to a deflection q = q — q 0 , q 0 denoting a desired reference 
angle for the outer loop, whereas q r is the reference value for the inner loop. 

In contrast to the compliance control of Eq. 24, we see that the impedance 
control Eq. 27 requires feedback of the external torque. One can also note that the 
control law (27) assumes full knowledge of the joint dynamics, which may render it 
sensitive to model uncertainties. 

Admittance Control: For a given servo controller with an internal position/ 
velocity loop for the link already present, the impedance controller would need to 
deactivate or override this. 

Admittance control on the other hand has the same desired mass-spring-damper 
relation (26) as the impedance controller above but uses the inner position/velocity 
loops to track the desired impedance model according to 


M(q r - q 0 ) + D(q r - q 0 ) +K(q r - q 0 ) = z d (28) 

together with the inner controller/manipulator dynamics 


mq + k v (q - q r ) + k p (q - q r ) = z d (29) 

A comparison of the different controller structures for the single-link example 
and the utilized feedback signals can be seen in Fig. 6. 

However, although aiming for the same impedance relation, admittance and 
impedance controls show different dynamical properties. Contact with a very stiff 
environment impedance control still allows for small interaction forces by tuning 
for a low artificial spring constant K , directly influencing the motor torque r from 
the feedback signals. The (static) angular deviation will grow correspondingly in a 
contact situation, but the main drawback is in the free- space motion where the low 
gain will cause poor tracking performance and render it sensitive to load distur- 
bances such as friction (Shiriaev et al. 2003). 
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Fig. 6 Comparison of 
control structures for 
compliance control Eq. 24, 
impedance control Eq. 27, 
and admittance control 
Eqs. 28, 29, respectively, for 
the single-link example 





♦ 



The performance of admittance control, on the other hand, relies on a well-tuned 
inner position and/or velocity loop with high bandwidth and is well suited for 
implementation in industrial robot controllers with high transmission ratio drive 
trains. The inner loops attenuate disturbances and reduce the sensitivity to model 
uncertainties. Free- space motion and soft contact interaction will thus show good 
tracking performance. However, due to the added dynamics of the cascade (see 
Fig. 6) stability issues may still occur in the transient contact situations with stiff 
environments. Stability properties with respect to model uncertainties and latency 
for impedance and admittance control are investigated in Lawrence (1988) and 
Newman (1992), and the inherent trade-off between accuracy and robustness is 
discussed in, e.g., Valency and Zacksenhouse (2003), Ott et al. (2010). 

Admittance control is also referred to as position-based impedance control or 
impedance control with inner motion control, whereas the form of impedance 
control described above also is referred to as dynamics-based impedance control. 


Adaptations Toward Industrial Applications 

Force interaction control ranges from low-force tactile exploration of work space to 
stiff interaction with a stabilized tool in machining operations (Natale et al. 2000). 

Interaction forces with a rigid environment or an unpredicted object will be 
reduced if the robot approach is made with lower mechanical impedance. During 
force interaction, the dynamic properties of the work objects and work space 
become important with limiting behavior caused by contact instabilities. 
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Cascaded Joint-Based Control 

As shown in Fig. 7, the joint motor feedback control is based on (sensed or 
reconstructed) velocity forming an inner loop and an outer position control loop. 
Integral action is needed to reject load disturbances z d . All robot servos have a 
limited maximum torque (electrically and mechanically). A model of that limitation 
is part of the regulator to avoid windup of the integrator state. Limitations of the 
references for the positions and velocity loops are omitted for clarity, and they are 
less crucial since properly limited q r (and its derivatives, from the trajectory 
generation) prevents exceeding the permitted positions (no overshoot in outer 
loop) and velocities. 


Modeling and Sensing the Contact Force 

Most force/torque sensors used are wrist-mounted or base-mounted sensors based 
on piezoelectric, piezoresistive, or strain-gauge measurements. As the resulting 
force measurement is not measured at the end effector contact with the environ- 
ment, there is no pure measurement of contact forces without disturbances from 
friction and inertia forces distal of the force sensor. Separation of the contact force 
from the measured force requires additional measurement of acceleration or model- 
based estimation of the inertia forces (Fig. 8). 


Redundancy Management in Task-Posture Decomposition 

In cases where the robot has more degrees of freedom in joint space than the 
degrees of freedom in operational space, it is possible to use the redundant degrees 
for projection of the operational tasks into the constraint null-space (Sentis and 
Khatib 2006), thus accommodating constraints, operational tasks, and postures and 
their prioritization. Let J c (#), Jt (<?)> and J P (q) denote the Jacobians of constraint- 




q 

q 


Fig. 7 Cascaded structure for motion control with feedforward of velocity and acceleration 
references 
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i) 


Fig. 8 Schematic picture of multiple-point contact scenario between robot and environment. 
Local coordinates (x^, y ( c L : \ zJT) are attached to the workpiece at each contact point, while the 
dynamics of the environment is modeled by a linear mass-spring-damper system (Olsson 2007) 


handling tasks, operational tasks, and postures, respectively. Let N c ( q ), N T (< q ), and 
N P (< q ) denote the null- space of the Jacobians of constraint-handling tasks, opera- 
tional tasks, and postures, respectively, satisfying 

Jc(q)N c (q)= 0, J T (q)N T (q) = 0, J P {q)N P {q)= 0, (30) 

A control hierarchy exploiting additional redundancy may be proposed as 

t = t c + N T c {q) ( z T + Nj(q)r P ) (31) 

= J T c (q)Fc + N T c (q)J T T (q)F T + N T c (q)N T T (q)N T P (q)F P (32) 

where F c , F r , and F P are the forces in operational space used to maintain position 
and forces of the constrained points, task motion and forces, and posture, respec- 
tively. The corresponding joint space torques are 


*c = 

= J T c (q)Fc 

(33) 


= J T T {q)F T 

(34) 

Tp = 

= J T P (q)F P 

(35) 


This approach is useful in a scenario of robotic machining - e.g., drilling - with 
composition of the task-dependent forces while maintaining posture and support 
forces. 

Figure 9 shows an example of a controller structure with superposition of task 
specified and null-space motion. 

A force-controlled assembly operation with a 14 DOF dual-arm robot (Linderoth 
et al. 2013) is shown in Figs. 10 and 11. The null-space motion of the assembly task 
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Fig. 9 Robotic force control structure (Adapted after Vuong et al. 2010). Note that the computed 
torque r c and the actuated torque r may differ if the actuator control has insufficient bandwidth 


Fig. 10 A force-controlled 
assembly with a 14 degree-of- 
freedom dual-arm robot 
(FRIDA) (Linderoth 
et al. 2013). The force/torque 
sensor, mounted at the wrist 
of the robot’s right arm, is 
only used for verification and 
evaluation of the estimated 
forces 



could, for instance, be optimized for accurate contact force estimation and/or 
keeping safety distance to a human operator sharing work space with the robot as 
described by Ceriani et al. (2013). 


Sensorless Force Control 

Robotic assembly is a task that requires physical contact between the robot and its 
environment. Traditionally this has been solved using position control together with 
fixtures to achieve the desired accuracy. When the task contains uncertainties, 
however, additional sensing is needed to accomplish the assembly. One way to 
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Fig. 11 Zoom-in illustration 
of the frame in which the 
contact forces are measured 




incorporate sensors and specify general tasks is to use the iTaSC framework 
(instantaneous Task Specification using Constraints) (De Schutter et al. 2007). In 
Stolt et al. (2012) it was described how this framework was used in assembly of an 
emergency stop button (Stolt et al. 2012). 

There exist a few different strategies for performing robotic assembly, where 
different amounts of sensor information are used. One way is to use pure position 
control of the robot. To be able to do this, one has to rely on the accuracy of the 
robot, of the involved parts, and of the work cell that is good enough. Usually task- 
specific fixtures and toolings are needed. Further, it is necessary to ascertain that 
nothing unexpected will happen during the assembly operation, as this is hard to 
discover without an external force sensor. One option is to handle some degree of 
part variation by using a compliant tool. A second strategy is to use binary 
information from sensors. This means that the assembly is divided into several 
steps, in which the information from the sensors is used to trigger transitions 
between the steps. This strategy can be used when there are a few uncertainties, 
e.g., some variations in the parts. One example would be to use a sequence of search 
motions in order to find a certain feature of an object, and once this feature is found, 
it is possible to use pure position control to finish the assembly operation. Yet 
another alternative is to use sensors for continuous feedback control. This strategy 
makes it possible to cope with large uncertainties, but it is also the strategy that is 
the most difficult to program for a robot operator. One example of this strategy is 
force-controlled assembly, where force sensing can be used to identify contact 
formations, keep contacts, and find new contacts during an assembly operation 
(Park and Khatib 2008). If contacts only are detected in a binary fashion, as 
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described in the previous paragraph, there is a risk of losing contact or getting very 
large contact forces during sliding motions. Hence, continuous sensing can make 
assembly possible when more uncertainties are involved and also reduce the risk of 
damaging equipment. The strategies described in the previous paragraphs may 
handle different amounts of uncertainties, and the type of effort necessary for 
task execution is different. In the case of pure position control, one has to assume 
that the position accuracy is very good and that everything will go as planned. 
These requirements can be relaxed when binary sensor information is used, but then 
one has to take care of the sensor signals in an appropriate way instead. Using 
continuous sensing demands even more sensor processing and feedback control 
strategies, which may be hard to tune. The two last strategies also require a sensor, 
which may be expensive. The reusability and the increased robustness to uncer- 
tainties, however, are incitations to use the strategy based on continuous sensing. 
Whereas assembly without a force sensor was experimentally verified in a real- 
world small part assembly task using a redundant robotic manipulator by instead 
estimating the forces from the joint position control errors, the force estimates may 
be error prone for reasons of friction disturbances. 

An industrial case study for comparing a robot made compliant by detuning 
the motion control (soft servo) and a force-controlled robot in a high-precision 
valve-insertion operation was presented in Chen et al. (2009). The authors showed 
that the soft servo strategy developed can perform assembly tasks with small part 
location errors” where the relative part location was within 1 mm, whereas for 
larger part location errors, force control or RRC (remote center of compliance) was 
needed. 

Force Control Applications and Industrial Control Systems 
Assembly 

Assembly of automotive power trains is a traditionally manual work, performed by 
skilled operators with years of experience. This circumstance is because gears and 
other critical components of the clutches, torque converters, and so on have to be 
aligned with very high precision. Such operations, however, take their toll on 
human labor. Tedious and fatiguing, they can lead to repetitive motion stress injury, 
lower product quality, and a drop in efficiency. A robot able to perform at the same 
level as a human operator would obviously make the prospect of automation in this 
application area very attractive. If a position-controlled robot tries to align a pair of 
gears and its control program does not have precise information about the gear tooth 
positions, the robots sole option is iterative trial and error, repeated until the relative 
tooth positions are found. Any attempt by the robot to fit the gears as long as they 
are misaligned will cause one gear to press hard against the other, generating 
unacceptably high contact forces. Even, if the teeth were chamfered to facilitate 
mating, misalignment would still produce large side forces as the robot struggles to 
move the gear along the preprogrammed path toward the centerline of insertion. 
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More than likely, the gears would even jam unless some means of mechanical 
compliance is provided (Johansson et al. 2004; Wang et al. 2008). 

A case study is the robotic assembly of Fords F/N torque converter case (Wang 
et al. 2008), which weighed about 25 kg. Inside this case, there was a double splined 
gearset into which a pump gear had to be inserted (Johansson et al. 2004; Wang 
et al. 2008). The seal of the pump gear was critical, and great care had to be taken to 
ensure that it was not damaged in any way during the insertion. An internal splined 
shaft had to be fitted at the same time, thus complicating the assembly. The tests 
carried out with advanced force control in the automotive industry convincingly 
demonstrated its ability to improve the cycle time and agility in different assembly 
applications. In one application involving the insertion of a forward clutch, a work 
cell with an IRB4400 robot averaged 5.7s for the insertion with a reaction force of 
less than 100 N. In another, F/N torque converters were assembled in an average 
time of 6.98 s with the contact force limited to 200 N, noting that, in addition to the 
part itself weighing about 25 kg, the allowed positional tolerance was ±2 mm. 


Grinding and Polishing 

The use of industrial robots for automated deburring, grinding, and polishing is an 
interesting example of a process where external sensing capabilities are crucial. 
Accurate control of the contact forces can help increase the quality of the final 
product, as well as flexibility in the deburring process. To handle the deviations 
from the nominal workpiece geometry that are inevitable consequences of the 
foundry process, some compliant behavior needs to be included in the system 
used for deburring. As an alternative to physically adding mechanical compliance 
to the system setup, for instance, by using a compliant tool, force control can be 
used to program a desired compliant behavior and to maintain a desired contact 
force during the deburring process. 

In the European project Autofett, grinding experiments were carried out on an 
ABB IRB6400 robot at the company Kranendonk Production Systems BV, the 
Netherlands, using a special grinding tool developed at KU Leuven, Belgium; see 
Fig. 12. The contact force was maintained at a constant value (F = 150 N), while 
the grinding tool was moved across the surface (Johansson et al. 2004; Fig. 13). 


Drilling 


Systems for automatic drilling have a long history in both the industry and the 
research community. In particular, the use of industrial robots for drilling is 
interesting due to their flexible programming and the comparatively low cost of 
industrial robot systems. However, robot drilling is a very challenging task due to 
the comparatively low mechanical stiffness of the typical serial industrial robots in 
use today. In general, clamp-up is necessary in robotic drilling to avoid vibration 
during the drilling process as the drill tool generates vertical, horizontal, and axial 
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Fig. 12 Stub grinding (left) and finishing on a propeller blade (right) (Olsson 2007) 



Fig. 13 


Force-controlled cutting operation in titanium with parallel kinematic robot 


forces during the cutting process. The compliance makes the robot deflect, some- 
times up to several millimeters, due to the externally applied forces during clamp- 
up and drilling. Due to the bending of the robot links and the elasticity in the gears, 
the local deflection at the contact point does not necessarily occur in the (axial) 
direction of the applied force, but may have tangential components which are on the 
same order of magnitude as the axial deflection. This tangential deformation results 
in poor hole quality and inaccurate positioning. In contrast, aerospace tolerances 
require drilled holes to be accurate within ±0.2 mm (Summers 2005). As for a state- 
of-the-art comparison on robotic drilling and fastening, we refer to the recently 
reported robot capability test targeting applications in the aerospace industry with 
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Fig. 14 Robot, drilling tool, and tripod with JR3 force sensor mounted on the drilling tool. Two 
6 DOF force sensors are used to handle both the normalizing of the drilling tripod orthogonal to the 
surface and to measure the resulting drill forces of the process (Olsson 2007) 


the test robots KUKA KR240, KUKA KR60, ABB IRB7600, and Staubli RX170 
(Application Manual - Force Control for Assembly 2006). 

In this study from Airbus UK, limitations related to static and dynamic deflec- 
tion, repeatability, absolute accuracy, temperature error, and hysteresis were sur- 
veyed, the conclusion being that an absolute accuracy ±0.2 mm was not achievable. 
Using state-of-the-art anti-skating approaches, Atkinson and coauthors from 
Boeing-Hawker de Havilland concluded that absolute accuracy remained on the 
edge of acceptability for aircraft assembly (Atkinson et al. 2007). 

A drilling process involves moving a drilling end effector to the correct position 
of the hole. Prior to drilling, a pressure foot is used to press the parts together in 
order to avoid burrs entering in between the plates (Fig. 14). In addition, the 
pressure foot assures that the drilling machine is kept stable throughout the drilling 
cycle. A self-feeding mechanism is normally used to feed the drill through the stack 
of materials. Automated drilling in the aerospace industry today uses large robots 
for two major purposes: to handle the large assemblies and to accurately counter- 
balance the drilling forces involved in the drilling process. There are many different 
ways to overcome forces in drilling and fastening using industrial robots. One 
approach is to divide the process in two steps, where in the first step, the robot 
stiffness is mapped by applying forces to the robot TCP and measuring its deflec- 
tion, while in the second step, the robot is adjusted back to the nominal position 
under load. These compensation values are then applied as a filter to program the 
robot during process execution (Degoulange et al. 1994). Mapping the robot 
stiffness in this way can, however, be extremely time-consuming. Other methods 
to solve the skating problem have been tested by using metrology systems to 
supervise the robot; see, for example, (Kihlman 2005; Van Duin 2006; Summers 
2005; Van Duin and Kihlman 2005). In such methods, a metrology system is 
connected to the robot controller via an external feedback loop to update the 
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nominal position of the robot with measurements in real time. However, using 
metrology is not a straightforward solution, as the robot will deflect as the pressure 
foot is engaged. 

Common to the traditional approaches is the lack of high-performance sensor 
feedback to the robot, whereby the robot cannot be updated fast enough to cope 
with the dynamic process that drilling involve. Robot control systems are tradition- 
ally closed, a circumstance which has hampered system integration of manipula- 
tors, sensors, and other equipment, and such system integration has often been made 
at an unsuitably high hierarchical level. As a more cost-effective solution, high- 
bandwidth feedback techniques can be used to control the properties of the drilling 
process. Research and development on force-controlled drilling has not received as 
much attention as many other applications of industrial force control, such as 
assembly, deburring, milling, or polishing (Blomdell et al. 2005; Zhang 
et al. 2005). The reason is probably the difficulties involved in robotic drilling 
and the lack of available industrial robot systems with capacity for high-bandwidth 
force control (Siciliano and Villani 1999; Caccavale et al. 2005; Johansson and 
Spong 1994). Some results on force control for special drilling machines have been 
reported in Kawaji et al. (2001). Experimental systems for force-controlled robot 
drilling have been presented in Alici (1999), where a force controller with inner 
position control was used for the drilling thrust force control, and in Lee and Shih 
(2006), where an application to bone drilling in orthopedic surgery was presented. 
Similar conclusion can be made for cutting processes (Fig. 13). 

While there exist commercially available products for force control - e.g., from 
ABB Robotics (Application Manual - Force Control for Machining 2006; Appli- 
cation Manual - Force Control for Assembly 2006) - none of the available 
packages include the particular features and/or level of flexibility required for the 
drilling application, i.e., the ability to redesign the inner-loop servo control for 
improved disturbance rejection. 


Milling 

As a result of the increased demand on cost efficiency and flexibility in the 
manufacturing industry, industrial robots have emerged during the past decade as 
an appealing alternative to the dedicated machine tools for performing high- 
accuracy machining operations, such as milling and grinding. Because of the 
significant process forces required during machining operations, joint flexibilities 
in combination with lack of measurements on the arm side of the robot limit the 
achievable accuracy (Zhang et al. 2005). Consequently, deviations from the nom- 
inal path may occur during the machining operation. To overcome the limitations of 
industrial robots when performing machining operations, a piezo-actuated micro- 
manipulator has previously been developed (Puzik et al. 2009). The purpose of the 
developed micromanipulator is to compensate for path deviations, which the robot 
per se is unable to compensate for (Yoshikawa et al. 1994). The micromanipulator 
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achieves this by stiff actuation, realized by piezo actuators, combined with a 
mechanical design such that the bandwidth of the micromanipulator is significantly 
higher than that of the industrial robot. Following the concepts introduced by 
Sharon et al. (1993), the terms micromanipulator and macromanipulator will be 
used for the compensation mechanism and the robot, respectively (Arifin 
et al. 2013). We presented a prototype control scheme without explicit modeling 
of the nonlinear behavior of the piezo actuators in Olofsson et al. (2011, 2012) and a 
subsequent experimental evaluation in Sornmo et al. (2012). However, since the 
piezo actuators in the micromanipulator are inherently nonlinear in their 
input-output dynamics, the performance of the control scheme, and hence the 
machining performance, can be increased by modeling the nonlinear dynamics 
and utilizing the models for feedforward control in the control scheme. 


Friction-Stir Welding 

Robotic friction-stir welding (FSW) is a solid-state joining process that uses a 
robotic tool to join two surfaces (Fig. 15). A very soft region near the FSW tool 
is generated between the tool and material caused by frictional heat. Mechanically 
intermixing the two pieces of metal at the place of the joint, the heated and softened 
metal can be joined using mechanical pressure applied by the tool. FSW is mainly 
used on materials such as aluminum and copper. The challenge in robotic force 
control is to accomplish high FSW tool interaction forces while maintaining high 
accuracy despite the robot compliance caused by the reflection of tool interaction 
forces on the robot structure. New FSW developments like the floating bobbing 
technique significantly reduces the needed process forces (Threadgill et al. 2009). 
FSW was invented and patented by the Welding Institute, UK, in 1991 (Thomas 
et al. 1991). 



Fig. 15 Force-controlled friction-stir welding (FSW) with industrial robot (Courtesy of TWI, UK) 
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Fig. 16 Robot control products with force control functions as of 2009 (From Marvel and 
Falco 2012) 
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First-Generation Industrial Force Controllers 

Force control products clearly will become more and more advanced, and further 
application experiences will contribute to a more widespread use of force control, in 
particular, if that option is not overly costly. How to package the force control is 
also a topic that will evolve. A snapshot of the available robot controllers that are 
capable of force control is presented in Fig. 16. 


Summary 

Force/torque control is a challenging and crosscutting topic in robot control. Model- 
based control is the basis, as for other types of motion control. In ideal cases, with 
tasks only depending on rigid-body multivariable dynamics and linear joint- wise 
dynamics, the wide variety of approaches presented in the literature can be applied. 
Performance tuning can be used to reduce the sensitivity to modeling errors and 
unmodeled disturbances. 

For better understanding of the core issues concerning robustness and perfor- 
mance, and to thereby promote wider applicability in industrial applications, this 
chapter brought forward: 

• The industrial need for performance, for force control in combination with 
position control, as exemplified by typical applications. 

• An overview of published scientific principles for force-controlled robot motions, 
structured such that both the scientific and the industrial aspects get explainable. 

• A control architectural view that permits implementation of the algorithmic 
approaches such that industrial requirements on modularity can be fulfilled. 

• Application examples and how they relate to the control approaches all the way 
down to the torque control of the individual joint servo motors. 

Force control for robot manipulators has been an active research area for several 
decades with numerous scientific publications on different approaches but has 
despite this only recently reached industrial practice. Our aim with this chapter 
has been to give a summary and overview of current research and provide insight to 
current best practice and the challenges still remaining. 
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Abstract 

The present work aims at providing a wide view of the actuators used in 
automation and robotics. The idea was to describe most of the actuation solu- 
tions without entering too deep into details, offering a general panorama of the 
trends that are followed in robotics to address several problems. Due to their 
large diffusion, electromechanical actuators will be discussed extensively, cov- 
ering several topics from constructive specifications to elementary control prob- 
lems. This chapter starts with an introduction on descriptive scheme of an 
actuation stage. Control blocks and feedback types are illustrated to offer to 
the reader the possibility of identifying the different structures composing 
electronic and mechanical counterparts. Working principles for actuations are 
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then described and reported to classify the different technological approaches. 
AC and DC motors are presented and illustrated. Dynamics of DC motor is 
discussed in details and problems related to the presence of reduction stage and 
load are reported. Introduction on compliant actuation is also included due to the 
emerging field in robotics for human machine interaction. Series elastic and 
variable stiffness actuation solutions are mentioned. Piezoelectric effect and the 
widely used constructive solutions are depicted highlighting the obtained spec- 
ifications with the different architectures of piezoelectric elements. Shape mem- 
ory alloys and polymeric actuator are also introduced highlighting their dynamic 
behavior and the most famous applications in the robotic field. 


Introduction 


In robotics the importance of the choice of actuator typology is crucial for realizing 
the desired movement sequences and optimizing the production line or any kind of 
action requiring accuracy and specific dynamics features. The actuator can be 
mainly seen as an energy converter using a known physical principle that can be 
of various natures to generate work. Considering the actuator as a subassembly of 
the whole control system, different blocks specifically dedicated and designed to 
match the actuator’s specifications and to perform the required action can be 
differentiated (Fig. 1). 

Usually the whole actuation stage is made by different subsystems. The servo 
stage comprises all the components which are dedicated to the power supply and the 
conversion to control signals and consequently driving the actuator itself. The 
primary power is mainly an electrical source (AC 230/400 V 50 Hz), a battery, or 
a generator and it is the first block of the whole servo: the converter is responsible 
for translating the signal from the controller in current or voltage to the actuator for 
performing the desired trajectories or delivering a prescribed dynamics. The con- 
troller is the main core of the drive systems and is dedicated to compute the right 
signals to be sent to the actuator: for industrial applications usually a position 
controller is the widely used one, but hybrid position-force control is sometimes 
implemented where the robot end effector must deliver forces at the interface with 
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Fig. 1 General block scheme for a feedback-controlled drive system 
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the load (Hollerbach et al. 1992). The strategy for selecting the right actuator can be 
driven by many different factors in order to match the specific requirements. In this 
chapter the aim is to introduce an overview of the actuation technology providing a 
qualitative comparison of the performance and showing some examples of system- 
atic selection procedures. An actuator is defined in its characteristics in terms of 
physical dimensions, force or torque specifications, and operating frequency. 

In automation the actuator runs a certain kind of mechanical process and the choice 
must depend on the required task that can be a position, speed, acceleration, torque 
(force), power, or a combination of them. Given the wide variety of applications 
ranging from industry to service robotics such as medical and haptic devices, 
humanoids, and biomimetics, there are some important classifications to be 
followed for a correct choice of the actuator. Mechanical requirements such as 
force, displacement, bandwidth, stiffness, efficiency, and accuracy/re solution must 
be matched with the required performance always keeping in consideration the main 
working principle. 

It is helpful to recall actuators features considering some readily extractable 
information related to physical dimension, max stroke (Fig. 2), force output, 
bandwidth, and operating frequency (Huber et al. 1997). These characteristics are 
usually unique for any individual products or class, but it is also true that 
two actuators of the same class can have extremely different features. Therefore, 
it is useful to define also a second classification based on normalized attributes able 
to define a whole class of actuation using the same working principle. One can 
define: 

• Actuator strain : the maximum stroke delivered by the actuator itself divided by 
the dimension of the actuator parallel to the delivered direction of motion 

• Actuator stress : the ratio between the maximum force generated and the cross 
section perpendicular to the stroke 

• Actuator density : the ratio between the mass of the actuator and the capacity that 
can be expressed in stroke, power, or velocity 

By using such kind of classification, actuators relying on the same working 
principle have similar values for the previously mentioned properties (Zupan 
et al. 2002). Following the classification suggested by Huber et al. (1997), an 
actuator can be defined by the maximum stress <r max and maximum strain e max , 
and these two parameters for a given size of the actuation define the force and 
displacement limits and provide a second set of classification attributes: 

• Actuator modulus is defined as the ratio between a small increment of g and e. 

• Stroke work coefficient is the maximum actuation stress <r max and max actuation 
strain e max , and it is usually shown as a family of curves depending on the 
control signal of the actuator and the external condition determining the type of 
working cycle. 

• Cyclic power coefficient is the maximum work per output cycle ranging from zero 
to unity, and it is mainly a sort of parametrical identification of the curve g — e. 
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Fig. 2 Classification of different classes of actuators in function of maximum stroke and max 
operating force 


Figure 2 depicts different kinds of actuation technology which nowadays is used 
for robotics and automation. Of course, given the wide variety of solutions, the 
figure tries to provide a general scenario of the capacity of the different classes of 
actuators in delivering dynamics and motion. 


Actuation Technology 

Given the high variability of the working principles on which the different actuation 
technologies are based, this chapter will briefly overview most of them and will be 
focused mainly on electromechanical actuators which are the widely used solution 
in automation and robotics. 


Electromechanical Actuators 


Electrical motors are the most commonly used in industries: an electromechanical 
actuator is usually configured as a rotary system whose electrical signal driving the 
working principle and generating the torque is commuted mechanically (bush 
motor) or electronically (brushless). A initial classification at the functional level 
can be done between tunable speed motor and fixed rotating regime motor (Green- 
wood 1965). The first is used in all those applications where a continuous tuning of 
the velocity and the generation of complex trajectory is required; the second type is 
mainly used in those applications where three different regimes of rotations are 
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Fig. 3 Classification of electrical machines based on magnetic fields and power source 

required: forward, rewind, and stop. A second classification of the electrical rotary 
machines can be done by considering the constructive features as reported in 
Fig. 3a. Asynchronous motors are alternate current-powered machines where the 
rotational speed of their rotor is lower than the one of the rotating magnetic field, 
whose rotational velocity is multiple of the frequency of the power source (Wavre 
and Thouvenin 1995). In synchronous machines the rotational speed is synchro- 
nized with the one of the electrical field, and the rotor has approximately the same 
velocity. 

A first application of these two categories of motors was a constant speed with no 
possibility of fine regulation, because the rotational regime had been imposed by 
the primary power source 50 or 60Hz. Collector motors were designed in order to 
allow a fine-tuning of the speed; usually they are powered by direct current and they 
have two main architectures which are “brushes” and “brushless”; the first archi- 
tecture needs the presence of a mechanical commutation because the magnetic field 
is generated by permanent magnet (or wounded coils) in the stator. The brushless 
motor represents the most widely used rotational electromechanical actuator in 
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automation and robotics, due to its high performance and reduced weight power 
ratio and will be discussed in details in the following paragraphs. In brushless motor 
the windings are on the stator, while the rotor has permanent magnets. A transducer 
(hall sensor and/or optical encoder) reads the angular position of the rotor feeding 
back the information to the electronics which powers the three phases winding in 
the stator (Beaty and Kirtley 1998). In terms of working principle, this kind of 
motor is similar to the synchronous ones where the electrical fields of the stator and 
the rotor are always on phase and with multiple advantages related to the reduced 
friction (no brushes for mechanical commutation), reduced inertia, and heat dis- 
persion due to the winding which are on the external structure of the stator. Then in 
following paragraph, some typologies of electromechanical actuation, which rep- 
resent nowadays the widely used ones, will be described in details. 


AC Motor 

AC motor represents the lowest point in terms of control complexity and is also the 
least expensive in terms of technological effort and performance. 

Production lines typically involve a variety of variable speed motor drives which 
serve to power conveyor belts, robotic arms, cranes, steel process lines, paper mills, 
and plastic and fiber processing lines to name only a few. These applications are now 
prevalently serviced by what can be called general-purpose AC drives. In general, 
such AC drives often feature a cost advantage over their DC counterparts and, in 
addition, offer lower maintenance, smaller motor size, and improved reliability. 

However, as previously introduced, the control flexibility available with these 
drives is limited and their application is restricted to fan, pump, and compressor 
where the speed need be regulated only roughly and where transient response and 
low-speed performance are not critical. 

More demanding drives used in machine tools, high-speed elevators, dynamom- 
eters, mine winders, rolling mills, glass float lines, and the like have much more 
sophisticated requirements and must afford the flexibility to allow for regulation of 
a number of variables, such as speed, position, acceleration, and torque. Such high- 
performance applications typically require a high-speed holding accuracy better 
than 0.25 %, a wide speed range of at least 30:1, and a fast transient response, 
typically better than 50 rad/s, for the speed loop. 

Due to the main robotic focus of the book, a major effort will be dedicated to the 
DC motor which nowadays represents the most widely used actuation solution; for 
a wider view and detailed description of AC motor, the reader is invited to look at 
bibliographies Beaty and Kirtley (1998) and Sen (1997). 


DC Motor 

The working principle of a DC motor is based on the presence of a magnetic field 
generated by permanent magnets on the stator or field coils, and coils on the rotor in 
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which current is flowing. Wound field coil motors are used above a certain power 
requirement and have the highest performance in terms of maximum torque and 
speed and usually can have different configurations as reported in Fig. 3b (series, 
parallel, and compound). A commutated DC motor has a set of rotating windings 
wound on an armature mounted on a rotating shaft (rotor). The shaft also carries the 
commutator, a long-lasting rotary electrical switch that periodically reverses the 
flow of current in the rotor windings as the shaft rotates. The rotating armature 
consists of one or more coils of wire wound around a laminated, magnetically “soft” 
ferromagnetic core. The magnetic field produced by the armature interacts with a 
stationary magnetic field produced by either permanent magnets or also winding 
field coils, as part of the motor frame. The current flowing in the rotor windings is 
responsible to generate a second magnetic field which will interact with the one by 
the permanent magnets on the stator consequently producing a torque which will be 
variable depending on the angle between the two magnetic fields. The coils 
mounted on the rotor are powered alternatively depending on the rotor position 
itself in order to obtain a constant torque: selection of the coil is obtained by a 
commutator fixed to the ground which uses brushes to conduct the current in the 
rotating coils (Sen 1997). The force between the two magnetic fields tends to rotate 
the motor shaft generating a torque. The commutator switches power to the coils as 
the rotor turns, keeping the magnetic poles of the rotor from ever fully aligning with 
the magnetic poles of the stator field, so that the rotor never stops but rather keeps 
on rotating as long as power is applied. 

By using this solution, only one or some coils have current flowing and generate 
a magnetic field. In the real constructive solutions, the rotor is designed in such a 
way to have multiple slots and conductors called armature windings, while the 
magnetic flux by the stator is generated by the pole core. From the above, one can 
get some important relations between mechanical and electrical parameters of the 
motor: the magnetic flux generated by the current I a flowing in the rotor windings is 
given by the following formula: 


O r = kl a (1) 

where the constant k depends on multiple constructive factors. One can derive the 
torque which will result proportional 

C m = kl a <£> s = K,I a (2) 

The quantity K t is called torque constant and it is usually provided by the 
constructor of the motor. The torque direction is therefore determined by the fluxes 
in both rotor and stator, and the link between the supplying current and voltage and 
motor speed can be determined by writing the equation of the equivalent circuit 
(Fig. 4d) where R and L represent the armature equivalent resistance and induc- 
tance, respectively; V is the source voltage of the electrical machine; and k^> s co m is 
the back efm (V b ) generated by the current flowing in the rotor windings. More 
precisely L and R are strictly related to the specifications and materials by which the 
motor is built. 
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Fig. 4 General concept of a DC motor and equivalent electrical circuit 
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Equation 3 is valid for all the typologies of motors where the back efm can be 
expressed by a V b = k^> s co m = k e co m where k e is called voltage constant. The torque 
of the motor can be derived by the formula reported below where k t is a previously 
mentioned parameter called torque constant: 


Cf n k(I 

(4) 

C m J(Om T Cr 

(5) 


The overall torque applied to the mechanics of the actuator is provided by the 
contribution of the load J and the resistive torque C R of the rotor. By substitution of 
Eq. 4 in Eq. 3, one gets a general equation relating the angular speed and the 
voltage: 
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Fig. 5 The graph above shows a torque/speed curve of a typical DC motor. Note that torque is 
inversely proportional to the speed of the output shaft. In other words, there is a trade-off between 
how much torque a motor delivers and how fast the output shaft spins 

From the previous equation, one may easily infer that if the motor works at 
constant rotational regime without any load applied, the angular velocity will be 
proportional to the voltage and no current would be absorbed. In the mentioned 
hypothetical condition, considering constant the parameters of Eq. 6, one can obtain 
the following: 










(7) 

( 8 ) 


From the equation, one can notice that in the presence of the resistive torque C R 
(which is the opposite of the torque delivered by the motor C m ), the motor speed can 
be written parametrically as function of the voltage V as shown in Fig. 5a where co 0 is 
the no-load speed and C sta u is the stall torque which represents the max value of 
which the motor can deliver close to null speed. Equation 9 defines the static features 
of the motor and the relation between torque, voltage, speed, and torque constant: 




If a situation is considered in which one has a certain degree of variability in the 
voltage V and resistive torque C R , Eq. 6 assumes a different form and differentiation of 
the current flowing in the windings is not negligible. By using Laplace one can express 
the relation between voltage and angular speed in a simpler form: 
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The equations above provide information on how the motor behaves for a 
variable voltage by introducing a function A(s) which refers to the motor 
dynamics and its features r m , r e , respectively, called mechanical and electrical 
time constants: 



(13) 

(14) 


As one can notice from Eq. 12, the electromechanical actuator can be assumed as 
a second-order system with the following values for natural frequency and damping 
factor: 




(15) 

(16) 



In the common applications, usually the two constants have different 
values, r m r e , and therefore the term A(s) can be approximated to the 
following form: 




In the considered conditions, the frequency response of the system can be easily 
identified by the Bode diagram (Fig. 5b). If the control voltage V is slowly varied, 
the proportionality between the speed and the voltage is matched. For frequency of 
oscillation of the voltage higher than co i = — , the response of the motor will be 

slower and attenuated depending on the value of the two electrical and mechanical 
constants. 
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Example of open-loop voltage and current control for velocity regulation in a DC motor 


DC motors can be controlled in both voltage (V) and current (I). In the first case, 
it is possible to control the angular velocity of the motor with a resulting final value 
that will be always dependent on the resistive torque C R applied to the motor shaft. 
The control scheme for a velocity-controlled DC motor is reported in Fig. 6a: 
usually the input command is a voltage V ctrl sent by the controller which is 
proportional to the desired velocity. 

There is also another way of controlling the motor by using current input instead 
of a voltage as depicted in Fig. 6b. From the equation relating the torque and the 
current in a DC motor C m = k t I, one can express the current in terms of the motor 
inertia and the angular acceleration: 



( 18 ) 

( 19 ) 


where using Laplace, 
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Fig. 7 DC motor torque voltage characteristic in voltage and current control: resulting charac- 
teristic curves of a voltage-controlled velocity (a) and current-controlled velocity (b) 


The control of the motor by armature current is a technique which allows to 
choose the velocity in the presence of a resistive load and the correspondent 
torque. The diagram is in Fig. 7b where the motor speed is not directly chosen 
by the input current but depends on the resistive torque that can be seen as the 
mechanical damping of the motor itself or a load applied to the output shaft. 
The main problem arising is when no load is applied, in this condition when 
the resistive torque is very low, the motor will tend to accelerate and rotating a 
high speed. The solution adopted is providing a velocity/position feedback and 
including the motor in a closed loop. In such kind of configuration, the 
constructive elements and the electronics of the system result in a more 
complex architecture but it allows a fine-tuning respect to the previous 
solution. 


Actuator Dynamics and Control 

Once the choice of the electromechanical actuation has been done, there are several 
additional issues to be addressed. The most important one is the control implemen- 
tation and robustness, and there are many techniques which strongly influence the 
performance of the manipulators. That is why the actuator dynamics play a funda- 
mental role and must accurately modeled in order to predict the overall behavior of 
the machine. In most of the application, the DC motor is usually coupled to a gear 
train: this solution can present several advantages, like decreasing the size of the 
motor and power consumption, but there are nonlinearities which must be accu- 
rately considered - the presence of a gearhead introduces additional friction, a 
certain degree of compliance, and also backlash (Canay 1993). Consequently the 
overall dynamic of the actuator results more complex and less predictable if some 
details of the system are unknown. Starting from the results obtained in the previous 
section of this chapter in which the electromechanical actuator had a linear 
dynamic, one can now model the overall system made by the series coupling the 
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Fig. 8 Dynamics of a geared transmission: schematic representation of a DC motor coupled to a 
gear train (a). Block diagrams of the transmission considering the presence of electrical constant in 
the transfer function (b) and approximating the transfer function by neglecting the electrical 
constant of the DC motor (c) 


DC motor with a gear train with reduction ratio 1 : r. Referring to Fig. 8a, one can 
write the dynamic equation of the entire transmission considering the inertial 
contribution of the rotor, the gearhead, and the load: 
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(Ls + R)I = V — k e s$ m 

(24) 

{j m s -E B m s)S m k m I rzj 

(25) 


where Eq. 22 is the dynamic equation that can be combined with the back efm 
Eq. 23 and the Eq. 24 of the electrical equivalent of the motor. One can write the 
transfer function by considering the block diagram of the system in Fig. 8b where 
the input is the armature voltage V and the output is the angular position of the 
motor (assuming the torque of the load T/ constant). 




s\(Ls H - R ) (J m s "E B m ) ~E k e k m 



While if one considers the transfer function having as input the torque from the 
load T/ (assuming V = 0), then 


K -r(Ls + /?) 

V s[(Ls + R) (/ m s + B m ) E k e k m 



As previously mentioned the electrical constant is much smaller than the 
mechanical one and can be neglected by dividing the numerator and the denomi- 
nator by R and putting | = 0; hence the two transfer functions can be rewritten in a 
simplified form: 
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(28) 

(29) 


where applying the superposition principle and inverse Laplace transform obtains 
the second-order differential equation of the reduced system and the corresponding 
block diagram (Fig. 7c). 



Usually the actuator used in robotics is servo-controlled mechanism and their 
control is mainly based on the error between the reference position and a sensed 
one. The primary sensor is collocated directly on the shaft of the motor in order to 
have no backlash: most used are rotary optical encoder, sincos encoder, resolvers, 
potentiometers, and tachometers. The secondary sensors can be located at the end 
effector of the robotic mechanism and provide additional information on the 
position in the workspace: this is particularly important where the required position 
of the mechanism end effector must have a high degree of accuracy; having the 
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actual rotation of the actuator sometimes is not sufficient to assure a correct final 
position especially in those cases when the presence of heavy loads and the 
consequent deflection of the members cannot be neglected. The controller may 
differ depending on the application and the required performance. Usually the 
approach is based on the so-called set-point tracking problem consisting in tracking 
a constant or step reference called desired command, and the performance of the 
system is defined as considering dynamic response (rise time) and static character- 
istic (steady-state error). The widely used control schemes for such kind of appli- 
cations are PD (proportional-derivative), PI (proportional-integrative) or PID 
(proportional-derivative-integrative). Having a look to Fig. 1, the main controller 
block can be identified: after the primary and secondary feedback has been fed 
back, the error with respect to the desired input is evaluated and, consequently, a 
control action which is the sum of different control efforts - proportional, deriva- 
tive, and integrative - has been sent to the actuation stage. Basic control schemes 
are reported in Fig. 9 and each of the controller gain introduces an effect that shape 
the frequency response of the actuator and therefore its performance. The propor- 
tional element of the controller affects the transitory response of the actuation, 
increasing the promptness and reducing the rise time when it is increased. The 
effects are visible from Fig. 8 where modulation of the response to a step input is 
different depending on the tuning of the single gains. 


Brushless DC Motor 

The main problem with DC motors illustrated in the previous section is the presence 
of a commutator and brushes that are used to periodically activate the current in the 
different armature coils. The brushes must be periodically changed and they 
introduce additional friction to the rotary motion of the rotor because of the sliding 
contact with the commutator (Miller 1989). In order to improve efficiency and 
durability, brushless motors have been developed; in the actual DC motor, the 
permanent magnets are typically on the stator and do not rotate; however, the 
resulting torque given by the rotation of the rotor is also experienced by the stator 
and the magnets. This principle has been used to design DC brushless motors, 
where the rotating parts are made by permanent magnets on the rotor and the 
current-carrying coils are in the stator. Since the permanent magnet has polarity, 
the magnetic field generated by the coils in the stator must be opportunely activated 
and switched. Figure 10 shows a basic form of a brushless motor. The rotor is 
usually ferrite or ceramic permanent magnet, and the current on the stator coils 
(A A, BB, CC) is electronically controlled by transistors in order to keep the 
magnetic field always delivering a torque for the rotation of the rotor. In order to 
opportunely activate the stator coils, the position of the rotor must be known and 
hall sensors positioned on the stator sensors are generally used to sense the angular 
position of the rotor. 

The transistors are activated by the output from the hall sensor and the different 
coils in the stator are energized in sequentially. From Table 1 the switching order is 
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ROTOR PERMANENT MAGNETS 



Fig. 10 Brushless motor constructive scheme. (Top) Schematic representation of a brushless DC 
motor and (Bottom) hall sensors and commutator circuit for generation of the synchronous 
magnetic held 


Table 1 Commutation 
table for a brushless motor 
with switching sequence 


Rotor position [Degree] 

Sensor signals 

Transistor on 

a 

b 

c 

0 

0 

0 

1 

A+ 

B- 

60 

0 

1 

1 

A+ 

C- 

120 

0 

1 

0 

B+ 

C- 

180 

1 

1 

0 

B+ 

A- 

240 

1 

0 

0 

C+ 

A- 

360 

1 

0 

1 

C+ 

B- 
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presented: when the rotor is in a vertical position, the output is from sensor c only 
and the active transistors are A+ and B-. When the rotor is 60° clockwise rotated 
than the sensors b and c, activate transistors A+ and C-. Using an improved 
automated manufacturing process, the brushless solution allows the motor to attain 
good heat dissipation and high overload capability. In addition, because the new 
motors are electronically commutated, the short life associated with mechanical 
brushes is eliminated and reliability is increased. The electronic commutation 
allows for minimum electrical noise and avoiding arcing that was present in the 
previous design of traditional brush motors; electrical noise is due to the control 
electronics, but the brush noise is no longer a factor influencing the control of such 
kind of motors. Another feature of the new brushless motors utilizes high-energy 
neodymium magnets, which results in a very responsive mechanical time constant - 
as low as 5 ms - while minimizing overall size. The neodymium magnets result in a 
higher torque output in a given volume increasing the ability to accelerate quickly 
via the “mechanical time constant,” which is defined as the time it takes for an 
unloaded motor to accelerate from standstill to 63 % of its no-load speed with the 
nominal voltage applied. A motor that has the capability to accelerate quickly is 
important for minimizing move times, which is critical for many applications, and 
also increases throughput in production applications. New generations of such 
motors are extensively used in robotic and automations. Some companies are able 
to provide motors that vary in diameter from 16 to 40 mm, with length sizes ranging 
from 24 to 88 mm and continuous output power from 5 to 120 W, while maximum 
speed is up to 20,000 RPM. The motors’ ambient temperature range is —40 °C to 
100 °C. Due to their high performance and long lifetime, these motors are well 
suited for a variety of applications, including automation, biomedical devices, pick- 
and place-machines, robotics, pumps, and semiconductor manufacturing. 


Introduction on Compliant Actuation: Series Elastic, Variable 
Stiffness, and Pneumatic Actuation 

Over the last two decades, technological advancements have contributed to out- 
standing innovations, and human in the loop became one of the design criteria for 
several applications, ranging from medical devices and industrial robotics where 
the coexistence between robots and human is often avoided for safety reason. The 
formulation of a new approach in robotic control, based on a gentle physical 
interaction, leads many groups to propose different solutions in actuation technol- 
ogy which include compliant elements in the transmission. There are several 
examples of compliant structures used in the design of novel actuators or studies 
focused on the control of flexible link robots (De Luca and Book 2008; De Luca and 
Lucibello 1998; Albu-Schaffer et al. 2007). A solution was proposed with the 
introduction of series elastic actuators (Pratt and Williamson 1995). Indeed, reduc- 
ing the interface stiffness by inserting series elasticity at the interaction port 
between the electromechanical actuator and the load can substantially decrease 
the peak output force and acts as low-pass filter (Hurst et al. 2004), thus increasing 
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Fig. 1 1 Control of a series elastic actuator based on force detection at the end effector at the 
interaction with the load 


the system safety. With such architecture, the force-control problem can be 
addressed by a fine position control because the output force is directly related to 
the mutual distance between the load and the end effector across the series elasticity 

(Fig. ID- 

Such kinds of actuators are probably less suited for classical position-controlled 
applications, but they offer valuable advantages, where safe human-robot interface, 
actuated prostheses, and orthoses require an interaction at the end effector that must 
be accurately modulated (Migliore et al. 2005). The range of compliance that is 
required depends on the application and the torque limits of the actuator. Most 
architecture embedding controllable stiffness consists of classical stiff actuators 
that can be easily dimensioned for the required torque at the output shaft and a 
compliant element. Usually brushless motors are used because of the higher power 
density and reduced friction at the transmission level. The compliant structure is 
made by an elastic passive element, i.e., a spring that can store energy and the 
overall transmission will allow deviations from its own equilibrium position, 
depending on the applied external force. The equilibrium position of a compliant 
actuator is defined as the position where the actuator generates zero force or zero 
torque. 

In the so-called equilibrium-controlled applications, a linear/rotational elastic 
spring is added in series to the main output of the primary actuator, and the 
equilibrium position of the spring is controlled to exert a desired force or desired 
stiffness (English and Russell 1999). The compliance is actively changed using an 
control algorithm and by converting the force-control problem into a position- 
control problem. The motor position is adjusted based on the deflection of the 
spring to alter the tension or compression of the spring itself and therefore the 
interaction at the end effector. 

Another architecture that is widely used in human-robot interaction is based on 
the concept of variable stiffness (VS A). Such kind of technology takes inspiration 
by nature where the muscular stiffness is continuously adjusted by the central 
nervous system to interact with the external environment and manipulate objects. 
The peculiar property of stiffness modulation can be implemented in the actuator by 
mechanically varying the configuration of internal spring mechanism as 
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demonstrated by the work of Tonietti et al. (2005): the geometrical variation of the 
internal compliant mechanism provides a lower or higher mechanical stiffness at 
the output of the actuator with the possibility to modulate the dynamic behavior in 
terms of bandwidth and therefore having a variable disturbance rejection depending 
on the applied load and the nature of the interaction forces. 

Based on a different working principle but used in the same field as SEA or VS A 
to “gently” modulate the interaction with the load or a human user, the pneumatic 
artificial muscles generate linear displacement by using air pressure. Air compress- 
ibility provides to the system an intrinsic compliance and a springlike behavior 
(Brunell 1979). The architecture involves piston driven by pressurized gas or also 
inflatable elastic tubes or bladder surrounded by braided mesh able to shorten when 
higher pressure is applied in the chamber and they are known as McKibben muscles 
(Hannaford and Winters 1990; Chou and Hannaford 1996). There are several 
advantages related to the absence of moving parts and geared mechanism which 
confer to such kind of actuator a high power-to-weight ratio (Hollander and Sugar 
2004) and an intrinsic simplicity. On the other side, the nonlinearity of the rela- 
tionship between force and pressure makes the control implementation demanding 
due also to the presence of hysteresis and viscous effects (Tabata et al. 2000). An 
example of robotic application is the Utah/MIT dexterous hand (Jacobsen 
et al. 1984) controlled by a single-stage suspension valve in a glass cylinder and 
graphite piston, mainly used in anthropomorphic robotics. The system has the 
pneumatic commercial version developed by Sarcos Inc. (Salt Lake City, Utah). 
Other previous works which are worth to be mentioned are more related to the high- 
performance control implementation for robotic joints (Liu and Bobrow 1988) or 
hierarchical feedback controller with high-performance valves (Kawamura 

et al. 1989). 

Such kind of actuation presents high nonlinearity on the characteristic curve 
force/pressure or pressure/position: Lig. 12 shows the block diagram of a position- 
control system comprising an inner-loop pressure control and an outer-loop posi- 
tion control. In the pressure control a proportional valve converts an analog 
electrical input signal into appropriate sectional opening to allow the fluid to flow 
inside the cylinder. The differential pressure in the cylinder is directly controlled by 




Fig. 12 Schematic diagram of a typical position-control system for pneumatic actuation. It is 
mainly a PID controller augmented with friction compensation 
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the flow rate through the valve, and the flow rate is related to the input voltage in a 
complex way. The presence of friction requires an additional model-based block in 
the inner loop to estimate the friction. 

A PID controller with feedback linearization is used in the pres sure -control loop 
to compensate for the nonlinearity arising from the compressibility of air. The 
position controller is mainly a PID controller augmented with friction 
compensation. 


Piezoelectric Actuator 

The piezoelectric effect (Morgan Matroc Ltd; Shen 1994; Venison 1986) was 
discovered for the first time by Jacques and Pierre Curie. The main feature of 
such kind of material which is usually quartz crystals (Fig. 13 depicts a crystal of 
Si0 2 ) is that a certain mechanical displacement or deformation applied to the 
crystal results in a creation of microscopic internal dipoles and a consequent 
measurable voltage at the macroscopic level. This effect is called “reciprocal 
piezoelectric effect,” and it can be reversed to the “direct piezoelectric effect” 
meaning that if a voltage is applied on a piezoelectric material, a mechanical 
deformation happens along the crystal orientation (Balias 2007). 

Piezoelectric materials present anisotropic characteristics, and therefore there is 
a directional dependency on the geometry of the applied force field or electrical 
voltage. Considering the strain displacement directions, there are three major 
effects that are used in piezoelectric applications: longitudinal, transversal, and 
shear effect, where longitudinal effect has the same direction of the causal effect. 
For example, the orientation of the force field is the same of the electrical field 
applied to the crystal or vice versa. In transversal effect, mechanical action and 



Fig. 13 Working principle of a piezoelectric material 
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electrical potential field are orthogonal, while the shear effect produces a mechan- 
ical tension which is tangential to the polarization plane. A complete and exhaustive 
description of piezoelectric effect and a mathematical approach to the problem of 
energy conversion can be found in literature (Brissaud 1991; IEEE Standard on 
Piezoelectricity 1988). 

The design of a piezoelectric actuator can be optimized depending on the 
application. The transformation from electrical energy to mechanical displacement 
happens without the use of moving parts, and it is characterized by a high band- 
width with a frequency much higher than all the previous actuation principles 
discussed in this chapter. Another important feature is that such kind of actuator 
has a high durability. Depending on the application, different designs can be 
implemented and the main specifications can be from large/low displacement, 
high stiffness, or high frequency. The main effects which are considered during 
the design are the longitudinal, transversal, and shear effects (uncommon in actu- 
ator design) : stacked actuators use the longitudinal effect to generate static force up 
to 10 kN with an achievable displacement of about 200 pm and an applied force 
field ranging from 30 to 1,000 V; these actuators are made of ceramic layers of 
opposite polarity stapled above each other (Fig. 14a). 

Transversal effect is mainly used in actuated cantilever mechanisms that have 
two active layers. The application requires relatively low voltage (10-400 V) and 
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Fig. 14 Architecture in piezoactuation. Example of piezoactuators based on longitudinal effect 
(a) and transversal effect (b). Ultrasonic actuators in monomodal (c) and bimodal (d) configuration 
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results in large displacements (1,000 pm) but the output force is low (<10 N) 
(Fig. 14b). Besides the simple architectures previously discussed, piezoelectric 
effect allowed to design more complex systems for the generation of motion and 
force delivery. An example which has been diffusing in many precision engineering 
applications is represented by piezoelectric stepper motors, resonators, and oscil- 
latory travel waves actuators. 

Bar- shaped resonators are designed to have a stator and a rotor (Fig. 14c-d); 
mainly the stator is a piezoactuator working as mechanical resonator (Uchino 1997; 
Uhea 2001) and it oscillates transferring motion to the rotor with a frequency of 
25 kHz for 4 pm amplitude of oscillation; the rotation can be generated by the 
different vibrational modes of the stator which can be configured and designed as a 
monomodal (Fig. 13c) or bimodal resonator (Fig. 13d); the first does not allow to 
invert the rotation of the rotor without moving the stator and positioning the mutual 
contact point, while the second is controlled by two frequencies summed up in one 
oscillatory signal which allows to drive the rotor in both direction by changing the 
phase lag between the two oscillatory modes. 

Traveling wave actuators are based on circular design and (Uchino and 
Giniewitz 2003) are controlled by typical frequencies of oscillation ranging from 
20 to 100 kHz. The stator is made by piezoelectric elements alternatively polarized 
and disposed on a circular structure each oscillating and performing an elliptical 
trajectory and enabling the formation of the rotating traveling wave which delivers 
the motion to the rotor by a frictional coupling. The great advantage is the 
possibility to transfer high torque, weight reduction of the actuation stage, rejection 
of electromagnetic noise, and high position-control accuracy. 

Other architectures that are worth to be mentioned are Uchino Motor and 
piezostepper motors. The first is made by a metallic tube with two orthogonal 
piezoelectric elements which oscillate with a phase lag and deform the tube 
transferring the motion. The moving part is a coil passing through the tube, and 
frictional contact delivers the linear motion to the coil. Piezoelectric stepper motors 
combine longitudinal and shear effects to provide motion by a drive-release 
two-phased action; they can be controlled with either open or control loop (with 
additional encoder feedback needed), and they are able to generate about 
400-600 N of linear force. Besides their capacity to deliver high force, such kind 
of motor provides a good durability and resolution in positioning. The main 
application in robotics is related to micromanipulation (Umetani and Suzuky 
1980) such as micro-gripper and micromotion for high-precision positioning 
systems. 


Shape Memory Alloy and Polymeric Actuators 

Shape memory alloys (Advanced Materials and Technologies; Bidaux et al. 1994; 
Duerig 1990; Heinemann; Jayender et al. 2008; Shea 2002) were discovered for the 
first time in 1938, when it was noticed that certain metal alloys once bent into odd 
shapes tend to return to the original configuration when heated. The field literally 
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took off in 1961 by the great effort in research by William Buehler at the US Naval 
Ordnance Lab; he and his team developed the first commercially viable shape 
memory alloy (SMA) called nitinol (NiTi) that is the acronym of Nickel Titanium 
Naval Ordnance Laboratory. Since the introduction, nitinol was used for several 
applications for pipe fittings that automatically seal when cooled, antenna arrays, 
sunglasses, frames, etc. The working principle of such kind of material is given by a 
martensite/austenite-phase transition occurring for a determined range of tempera- 
tures. For lower temperature, the NiTi has low crystal symmetry and can be easily 
deformed, but at increasing temperature, the alloy assumes a higher symmetry 
drifting towards the austenite phase and resulting in a mechanically harder config- 
uration. Another important feature is the variable electrical resistivity of the NiTi 
which tends to increase of about 20 % in the transition phase from austenite to 
martensite and this change in resistivity can be effectively used to control the NiTi 
actuation. Contraction time of a NiTi actuator is highly dependent on the amplitude 
of the current flowing and of course of the cooling time and thermal capacity of the 
material. The linear force that such kind of fibers can deliver is huge and abut 
200 MN/m and also the power mass ratio is large 50 kW/kg. Applications in 
robotics range from robot positioning systems, agonist and antagonistic bioinspired 
actuators proposed by Kuribayashi (1991), to wormlike endoscopes (Dario 
et al. 1991). 

Another important class of actuation principle is based on the use of synthetic 
polymers that are able to convert chemical or electrochemical energy into mechan- 
ical one. Poly electrolyte, synthetic rubbers, and collagens can shrink or swell 
depending on external conditions such as electrical field, temperature, and pH of 
the environment. One of the most widely used is the electroactive polymer (EAP) 
based on polyvinyl alcohol polyacrylic acid (PVA-PAA), where contraction/dila- 
tion is caused by three main factors: rubber elasticity, polymer-polymer affinity, 
and hydrogen ion pressure. The remarkable aspect is that this solution allows to 
have an efficiency of 30 % in converting energy to mechanical work. De Rossi 
et al. (1986) demonstrated that the contraction is mostly a result due to the pH 
gradients induced by an electrical field and causing a consequent mechanical 
deformation of the gel. In such kind of architecture, the forces are reduced but 
Chiarelli et al. (1987) demonstrated that a PVA-PAA 10 mm strip wide 0.11 mm 
electromechanically activated can generate an isometric force up to 0.1 N, which 
corresponds 91kN/nT. One of the main limitations of the EAP is the bandwidth, 
where contractions are very slow, taking several seconds to contract and reach the 
20-30 % of the maximum expected force. Robotics applications are relatively 
limited due to the slow response of the EAP, but there are some cases which are 
worth to be mentioned: Caldwell (1990) designed a gripper made by agonistic and 
antagonistic polymeric actuation and Suzuki (1991) designed an artificial elbow 
actuator. Despite the limitations, such kind of actuation technology allows to have 
architectures which are similar to human muscles and therefore their use is mostly 
employed in biomimetics and humanlike robotics. 
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Summary 

In this chapter a number of various actuators has been introduced, and one tried to 
highlight the advantages of using a certain class of actuator for a specific applica- 
tion. Furthermore, a brief discussion on the limitations of each actuation technology 
was reported, in order to allow the reader to visualize the spectrum of possible 
applications and performance. It results anyway clearly that electromechanical 
actuation still represents the most diffused solution for industry and robotics. The 
main motivation is in the compelling knowledge on the laws of electromagnetism 
and the control theory which is mainly based on theoretical assumptions consider- 
ing an electromechanical actuator in the loop. Furthermore, it is worth mentioning 
that the power-to-weight ratio of a DC motor is often suitable for most of the 
applications in automation and control stages and that electronics represent nowa- 
days the state of the art. Nevertheless, this survey tried to provide the reader with 
other descriptions of some unconventional classes of actuators: if on one side the 
use of shape memory alloys and polymeric architecture still represents a niche, the 
actuation principle based on piezoelectric material is at present the best solution for 
micromanipulation and actuation. Furthermore, the introduction on compliant 
actuation technology aims at illustrating a new trend in robotics which is gradually 
arising, and it is driving the robotics itself towards the use of flexible structure to 
increase the dexterity of manipulation and mimicking the efficiency in interaction 
of biological systems. 
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Abstract 

Industrial robotic manipulators have excellent repeatability. However, accuracy 
is significantly poorer due to the numerous error sources in the robot work cell. A 
literature survey on recent calibration and compensation methods is presented as 
well as an overview of existing commercially available solutions. Subsequently, 
two methods to improve robot work cell accuracy are proposed to illustrate the 
concepts behind calibration and error compensation. The first method is a 
model-based calibration approach, where the end-effector poses and 
corresponding joint angles are measured and used to improve the kinematic 
model of the robot. The second method is a non-model-based compensation 
approach where sensor information is used to establish the relative pose of the 
work object and tool frames at discrete locations. Following this, a robot 
accuracy enhancement framework is proposed in which both techniques are 
integrated for an industrial robotic work cell where the strengths of one method 
are used to compensate for the inherent weaknesses of the other. Specifically, the 
non-model-based compensation approach improves the robot accuracy locally at 
the point of compensation, accounting for unmodeled effects which cannot be 
compensated by the model-based calibration approach, while calibration 
improves the nominal robot kinematic model extending the compensation 
effects to a larger working envelope. 


Introduction 

The traditional robot teaching approach, termed as “online” teaching, involves the 
use of teach pendants to manually control the robot through a series of waypoints to 
achieve the desired motion. The accuracy of the robot motion is determined via 
visual feedback from the operator, the skills of the operator in the use of teach 
pendants to move the robot, and the repeatability of the robot. This process is time 
consuming and the quality of the robot path is dependent on the skills of the 
operator. In addition, due to the need to use the robot during the teaching/program- 
ming process, it will also lead to significant downtime of the robotic work cell when 
a new program is required. The combination of all these factors makes the tradi- 
tional robot teaching approach challenging to handle “high-mix low-volume” 
processes or for continuous type operations such as machining and polishing of 
freeform surfaces where a large number of points are required. 
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In order to overcome the above issues, alternative programming methods have 
been proposed and were summarized in (Pan et al. 2012). One useful approach to 
handle large work objects which require many teaching points is “off-line” pro- 
gramming. In off-line programming, the physical robotic work cell is replicated in a 
virtual environment. Subsequently, the robot path is generated based on the work 
object geometry and its relative placement to the robot in the virtual environment. 
Benefits of off-line robot programming include: 

1. Reduced downtime on the production line since the physical work cell is not 
required during the programming of the robot 

2. Increased safety factor by eliminating the need for the operator to program the 
robot in close proximity to the robot 

3. Increased consistency of the path quality by removing the reliance on operator 
proficiency in online teaching 


Effective implementation of off-line programming requires the virtual model of 
the work cell to be an accurate representation of the actual work cell. Unlike online 
teaching, waypoint accuracy is largely dependent on the effectiveness of the robot 
nominal kinematic model in determining the actual joint to end-effector pose rela- 
tionship. In addition, the virtual work cell must accurately describe its real-world 
counterpart, in particular the relative poses of the robot, work objects, and other 
critical work cell elements. Failure to meet the above conditions will prevent the 
robot from completing the desired tasks and may even lead to collisions within the 
work cell. Hence, robot work cell accuracy becomes an important issue, and solutions 
such as robot calibration, error identification, and compensation have been proposed. 

Industrial robotic manipulators generally have much better repeatability com- 
pared to its accuracy due to the emphasis of repeatability in traditional robot 
programming and applications (Greenway 2000), since accuracy of the waypoints 
is compensated via operator visual feedback in online teaching. Error sources that 
can compromise robot accuracy can be broadly categorized into two types, namely, 
(i) geometric and (ii) non-geometric errors (Mooring et al. 1991; Greenway 2000). 
These errors cause the actual kinematic model of the robot to deviate from the 
known nominal model. To achieve better accuracy, calibration of the robot is 
required to identify the actual kinematic model thereby compensating for the errors. 

In this chapter, the focus is on methods for increasing work cell accuracy to 
achieve a desired tool to part motion. Firstly, in section “Literature Survey and 
Commercial Solutions,” a survey of existing commercial solutions to increase work 
cell accuracy is presented along with recent works in the academia. Following this, 
in section “Mathematical Preliminaries,” some mathematical notations and con- 
cepts that are required in the subsequent formulations are introduced. In section 
“Calibration,” a robot calibration solution is presented using the product-of-expo- 
nentials (POE) representation to identify an improved kinematic model for a robot. 
This method is implemented on an industrial robot and experimental results verify 
its effectiveness. Subsequently, in section “Compensation,” a non-model-based 
compensation approach is developed where improved robot accuracy in a localized 
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region of the robot task space is achieved through the use of sensors attached to the 
robot end effector. In section “Calibration with Compensation Framework,” a 
framework to improve the robot work cell accuracy incorporating the benefits of 
both model-based calibration and non-model-based compensation is presented. 
Finally, the contents and findings of this chapter are briefly summarized in section 
“Summary.” 


Literature Survey and Commercial Solutions 

The presence of errors and uncertainties in the robotic work cell poses a great 
challenge to the effective implementation of off-line robot programming. These 
errors and uncertainties can be due to a number of different error sources such as 
manufacturing defects, inertial loading, encoder errors, calibration errors, and 
mechanical wear within the robotic manipulator. All these error sources contribute 
to a deviation between the nominal base-tool transformation X T™ obtained via the 
nominal kinematic model of the robot and the actual base-tool transformation l Tf 
as illustrated in Fig. 1. The errors in Fig. 1 have been deliberately exaggerated to 
illustrate the difference in kinematic loops between the nominal and actual work 
cell. Another major source of errors is due to alignment and datuming issues 
between the robot base and the work object which can be exacerbated by additional 
motion systems, such as turn tables to move the work object or tracks and gantry 
systems to move the robot, within the work cell. This will result a deviation between 



Fig. i Nominal and actual transformation matrices 
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the nominal and actual robot base- work object transformation, represented by 1 T$ 
and l Tt respectively, as illustrated in Fig. 1. 

One approach for error compensation is to improve the kinematic model by 
accurately modeling the physical and dynamic properties of these error sources. 
Subsequently, the parameters within the model can be identified through the use of 
calibration methods which utilizes measurement pairs of the robot end-effector 
pose and robot joint angles. This approach has been proposed and developed in both 
the academia (Low 2006; Veitschegger and Wu 1988; Zhuang et al. 1994; Yang 
et al. 2002; Zhao et al. 2006; Chen et al. 2008; Mustafa et al. 2010; He et al. 2010) 
and commercially in the form of Calibware™ from ABB, ROCAL™ from Nikon 
Metrology, and DynaCal™ from Dynalog. 

Recently, a number of related works in the academia have focused on cost- 
effective methods to obtain data for robot calibration. In (Ha 2008), measurements 
from a laser point distance sensor mounted on the robot end effector and a grid plate 
placed in the robot work cell are used to determine the position of the robot end 
effector for calibration purposes. A laser line scanner attached to the end effector 
and an artifact positioned in the robot work cell are used to conduct robot calibra- 
tion in (Wang et al. 2009). In (Gatla et al. 2007), robot calibration is performed 
using cameras and a laser pointer attached to the robot end effector to obtain the 
required measurement information. 

Robot calibration solutions are also available commercially including, but not 
limited to, Absolute Accuracy from ABB (Low 2006; Gunnarsson et al. 2006), 
MotoCal (2011; MotoCalV EG 2011) from Yaskawa Motoman Robotics, and 
DynaCal™ (Cheng 2007) from Dynalog. 

The Absolute Accuracy from ABB is a robot option that is only available for 
ABB robots. A unique robot kinematic model is identified for each robot through 
the use of a PC-based software, Calibware™. The software is used with a 3D 
measurement system to obtain up to 100 robot end-effector poses that are distrib- 
uted throughout the robot workspace. These measurement information are then 
used in Calibware™ to fine-tune up to 40 parameters within the robot model (Low 
2006; Gunnarsson et al. 2006) to better reflect the actual robot behavior. Yaskawa 
Motoman Robotics also offers a similar calibration solution for their robots known 
as MotoCal (2011; MotoCalV EG 2011) with the purpose of increasing robot 
accuracy and facilitating the replication of robot work cell. 

A similar approach is adopted in the DynaCal™ (Cheng 2007) system from 
Dynalog. In addition to the calibration software, the company also produces a 
measurement system, the CompuGauge™ Robot Measurement and Performance 
Analysis System. This measurement system consists of four extensible measure- 
ment cables that are attached to a single point on the robot end effector. The 
end-effector position of the robot is then determined using optical encoders to 
measure the cable lengths. In comparison with the ABB Absolute Accuracy robot 
option which is only available for ABB robots, DynaCal™ can be used with 
industrial robots from different robot manufacturers. 

Instead of modeling and compensating all the errors directly, an alternative 
solution can be achieved through the use of measurement systems to identify the 
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relative poses of the robot tool center and work object in the sensor frame. If this 
measurement is accurate, the end-effector tool center point can be guided to follow 
the desired path on the work object, effectively reproducing the results in the virtual 
work cell and indirectly compensating for all the error sources within the robotic 
manipulator and the work cell. Some commercial solutions adopting this approach 
are the Adaptive Robot Control system from Nikon Metrology (Adaptive robot 
control), vision (True View 5.12™ Vision Guided Robotics 2009), and probing 
systems. 

In the Adaptive Robot Control system, an optical CMM consisting of three 
cameras is used to capture LEDs mounted on the work object and the robot end 
effector. The pose of the work object relative to the tool frame is established 
through the positional information of the LEDs obtained from the optical 
coordinate-measuring machine (CMM). With this information, the system guides 
the robot to move the tool through the desired waypoints on the work object. 
According to the manufacturer, this system can achieve an accuracy of up to 
200 pm. 

Vision systems are commonly used with robotic systems to establish the relative 
pose of the work objects to the robot system. These systems are generally limited to 
pick-and-place operations that do not require high levels of accuracy. For example, 
ABB’s vision systems such as TrueView (2009) and PickMaster integrated with 
ABB robots have an accuracy of 0.5-1 mm. 

Adept Technology provides a machine-vision processor, known as Adept 
SmartVision™ EX (Adept SmartVision™ EX), that can be used for tracking and 
robot guidance applications. The machine- vision system uses a camera for data 
acquisition and can be integrated with Adept’s systems or act as a standalone vision 
inspection system. 

FANUC robotics has developed its own vision systems, known as integrated 
Robot Vision (iRVision introduction; iRVision) (/RVision), which can be inte- 
grated with their robot controller. The vision system can be used as a 
two-dimensional (2D) vision system incorporating a single camera for pick-and- 
place operations on a planar surface. Alternatively, it can be used to capture 3D 
information by augmenting the camera system with a laser line scanner (iRVision 
introduction). 

Kawasaki Heavy Industries also offers similar vision systems (2012) that can be 
integrated with their robots. Similar to the /RVision from FANUC, the Kawasaki 
vision system can be used to capture and process both 2D information, using a 
single camera, and 3D information which can be obtained from either a stereo 
vision system, 3D laser sensor, or laser slit scan camera. In addition to vision 
systems that are developed by the robot manufacturers, third-party vision systems 
(Cognex vision systems in automation 2014) can also be integrated with robots to 
provide visual guidance. 

As an alternative to vision systems, contact-based sensors (White paper: Sur- 
vival of the fittest - the process control imperative 2011) such as touch probes can 
be mounted on the robot end effector to establish the relative pose of the tool and 
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work object. In this approach, the robot moves the probe to a few defined points 
around the work object and continue moving the probe until contact is registered by 
the probe. This allows the robot to identify the relative positions of identified 
features on the work object, thus establishing the relative pose of the work object 
to the robot base frame. This approach is dependent on using the robot as a 
measurement device and the resulting accuracy is affected by the accuracy of the 
robot. In addition, this approach requires the work object to have distinct surfaces or 
features that can be probed in order to establish the relative poses between the work 
object and the robot base or tool frames. 

A similar solution was developed by ABB specifically for arc welding opera- 
tions and was reported in a white paper (Sensor based adaptive arc welding ABB 
AB). Instead of using a touch sensor, the tip of the weld torch was used to register 
contact between the robot and work object. A similar feature is also available for 
FANUC robots, which is known as touch sensing, to compensate for errors in work 
object placement. Similar to the contact-based sensors, these solutions are limited 
to work objects with distinct features that can be used for localization such as planar 
surfaces. In addition to touch sensing, FANUC robots can also be equipped with the 
ability to compensate for work object warping under the high heat generated during 
welding. This feature, known as through arc seam tracking (Through arc seam 
tracking 2005), tracks the welding current during the weaving motion along the 
weld seam to determine the center of the weld seam. Subsequently, the system 
moves the robot accordingly so as to maintain the weaving motion about the center 
of the weld seam. 

In addition to sensor-based systems, a calibration procedure that combines the 
human-in-the-loop compensation of online programming and path generation from 
off-line programming is developed by ABB (Machining PowerPac 2010) where 
five distinct points on the work object were defined off-line and the operator moves 
the robot to these same five points in the actual work cell, establishing the 
relationship between the virtual and actual work cell and compensating for errors 
around the local region of the five distinct points. This process is similar to probing 
and suffers from the same disadvantages. In addition, this is a manual and time- 
consuming process. 


Mathematical Preliminaries 

In this section, some fundamental geometric concepts pertaining to the formulation 
of the calibration model are presented. 


Transformation Matrices and Twist Representation 

The special Euclidean group , denoted by SE( 3) C R 4x 4 , of rigid body motions 
consists of matrices of the general form 
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g SE( 3) C R 


4x4 



where R belongs to the special orthogonal group , denoted by SO(3) Cl , and 
describes a rigid body rotation and t — [£*, fy, uy G R describes a rigid body 
translation. 

The Lie algebra of SE( 3), denoted by se(3) C M 4x4 , consists of matrices with the 
following form 







G se(3) C R 4x4 


where co G so( 3) C 


R is a skew symmetric matrix with 






G so(3) c R 3x3 



Q 

is an element in the Lie algebra of SO(3), denoted by so( 3), and v G M . Note that 
an element d) G so(3) can also be represented in the corresponding vector form co 
given by 



CO x 1 CDy , 





with m defined in Eq. 3. An element y G se( 3) can thus be represented in 
corresponding vector form y given by 



j 

V CO 




termed a twist. The twist represents the line coordinate of the screw axis of a 
general rigid body motion, in which co and v are the direction and position vectors 
of the screw axis, respectively. 


Adjoint Representation for Lie Group and Lie Algebra 

An element of a Lie group can also be identified with a linear mapping between 
its Lie algebra via the adjoint representation. For an element in the Lie group, 
X G SE( 3) defined in Eq. 1, its adjoint map acting on an element in its Lie algebra, 
y G ^(3) defined in Eq. 2, is given by 

Ad^Cv) 4 XyX 1 G se( 3) (6) 

If J is the vector representation ofy, then the adjoint map of X G SE( 3) acting on 
an element y is given by 
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where t is given by 




t 


0 


t 


z 


t 


y 


t 


z 


0 


t 


X 


t 


y 

t 


X 


0 


and Adx(y) is the vector representation of Ad^Cy). 




Matrix Exponentials 

An important connection between a Lie group, SE(3), and its Lie algebra, se(3), is 
the exponential mapping, defined on each Lie algebra. A general definition of 
matrix exponential for y £ M 4x 4 is given by 






where I nxn £ R ,?x/? represents an identity matrix of dimension n. 

If y £ se(3 ) as defined in Eq. 2, an explicit formula for the exponential mapping 
of y is given by (Okamura and Park 1996), then 
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where 


denotes the Euclidean norm of •. 


Product of Exponentials 

In this chapter, the product-of-exponentials (POE) formula (Brockett 1984; Murray 
et al. 1994; Park and Brockett 1994) is adopted for expressing the forward kine- 
matics of an open chain robot work cell containing either revolute or prismatic 
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joints where the forward kinematics equation is expressed as a product of matrix 
exponentials. 

The zth frame, denoted by {K t }, is attached to the zth link and the consecutive 
(i — l)th and zth links are connected by the zth joint. The relative pose of the zth 
frame with respect to the (z — l)th frame under a joint displacement q t can be 
described by a 4 x 4 homogeneous matrix, an element of SE( 3), as follows: 

'-'j, ■(<?,) = 1 J/ (oy® 

where l ~ T t { 0) G SE( 3) denotes the relative pose of the frame [Ki) with respect to 
the frame {K t _i } for g, = 0, s t G se(3) is the twist associated with joint z and when 
expressed in {AT/ } is given by 







ese( 3) C M 4x4 








ej«(3) c R 3x3 


The twist s,- can be expressed as a vector through a mapping s ) i— » s , 


T T 

v‘,co‘ 


tT 


l 


f\ T ^ 

G R with cOi = [ <jo ix , oo iy , co iz ] G R and s f is a unit vector defined as the twist 

coordinates of joint z expressed in {/f z }. The definition of e Si G SE( 3), the exponen- 
tial of Sj G ^^(3), is given in section “Matrix Exponentials.” In this chapter, the 
symbol II is used to represent a series of noncommutative products of matrices 
where 


n 1 

n di = «i «2 • • • tt n , and n di = d n d n -\ . . . d\ 

i= 1 i=n 


For a robot with n joints, as illustrated in Fig. 2 for the case n = 6, the 
transformation describing the relative pose of the sensor frame, {K n+ i}, in the 
measurement frame, {Ki}, is given by 



r T 1 A • ^ 

where q = [qi, . . q n ] . There exists an element p t s .vc* ( 3 ) such that e p = 1-1 T, (0) 

eSE{ 3). 

/: 

Note that the element p t G i? is a vector which will be fine-tuned in the 
calibration process to enhance the accuracy of the robot kinematic model. As 
such, °T n+ i(q) in Eq. 11 can also be expressed in the following form: 
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]1 e Pi e Siqi 


/=! 


* Pn -\- 1 



More information on matrix exponentials and the product-of-exponential 
approach can be obtained from (Brockett 1984; Murray et al. 1994; Park and 
Brockett 1994). 


Calibration 

In this section, the model-based approach to improve robot work cell accuracy is 
presented. The first step of the calibration process is to acquire a set of measure- 
ments containing the pose of the end-effector frame relative to a fixed measurement 
frame and the corresponding joint angles at that pose. This set of measurement data 
will be used to optimize the set of parameters p t for / = 1 , • • • , n + 1 in the robot 
kinematic model (12), where n denotes the number of robot joints, to best describe 
the end-effector pose given a set of joint angles. The frame assignment used in this 
section is illustrated in Fig. 2 for a 6-link robot. The reference frame used in this 
section is the measurement frame { K 0 } . 



Fig. 2 Frame assignment for calibration 
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Methodology 

The nominal forward kinematics describing the pose of the end-effector frame 
{ K n+ ! } in the measurement frame { K 0 } , 


T = °T n+ i(q) 


(13) 


defined in Eq. 12, is a function of the local frame initial poses 


i - 1 


n o) 


e Pi for 


i 


1, 


, n+1, joint twists s = [sq, • • •, s n ] G R 6xn and joint displacements q. The 


equation relating infinitesimal variations in the end-effector pose in the measure- 
ment frame to infinitesimal variations in the variables within the kinematic model 
can be obtained by linearizing the forward kinematic equation such that 


d(T)T 
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'Pn + 1 


T 


-l 


(14) 


where dTT 1 G se(?>) denotes the infinitesimal changes in the end-effector pose, as 
a differential term, in the measurement frame resulting from infinitesimal changes 

in the variables e Pi , s and q within the kinematic model. 

Let T a be the actual pose of the end-effector frame, {K n+X }, in the measurement 
frame, {/f 0 }, obtained from the measurement system. Noting that d(T)T~ l G ^(3) 
is the difference between the actual and nominal poses of the end-effector frame in 
the measurement frame (Okamura and Park 1996; Yang 1999), the following 
approximation, assuming sufficiently small error between T a and J, can be obtained 


d(T)T~ l = (T a - T)T~ l (15) 

Together with Eq. 14 and the above definition of d(T)T~ l , the terms d(e p >) and 

d(e Sjq j) in Eq. 14 denote the changes required ine Pi , s , and q , respectively, to achieve 
T — > T a . Rearranging the terms in Eq. 15, the following is obtained: 


d(T)T~ l = T a T~ l -I 4x4 = \og(T a T l )ese(3) (16) 

by noting the definition of matrix exponential in Eq. 9 and ignoring the higher order 
terms for sufficiently small error between T a and T. 

By taking advantage of the local POE formula that allows arbitrary assignment 
of local frames, an assumption can be made with regard to the variables within the 
kinematic model which does not affect the solution of Eq. 14. 


Assumption 1 Geometric errors within the nominal kinematic model only exist in 

A 

the initial poses of the local frames e Pi for i = 1, • • •, n + 1 only , and the joint 


56 Robot Work Cell Calibration and Error Compensation 


2007 


displacements q and the joint twists s can retain their nominal values throughout 
the calibration process. 

Based on Assumption 1, the equation defined in Eq. 14 can be simplified as 
follows: 


d(T)T~ l 
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d(e p 


77+1 
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-l 


(17) 


Considering Assumption 1 and with a slight abuse of notation to simplify the 
presentation, Eq. 17 can be written as follows: 


where 





The purpose of the kinematic calibration is to identify optimal values for de Pi , for 
/ = 1, • • •, n+\, given a set of m measurement data at various robot postures, such 
that the least square pose error given by 



d(T{q t ))r l { qi ) - 




is minimized for a given set of d{eP^ for j = 1, • • •, n+1, where T(qj) = °T n+ j (q;) 

and q f is the set of joint angles at the ith measurement pose. 

Considering the definition of T in Eqs. 13 and 12 and the following property of 
non-singular square matrices 





V 5, A e R nxn and det(^) ^ 0, det(U) ^ 0 


Equation 17 can be expressed as follows: 


d(T)T~ l 



n <?• 

i=\ 





II C r e ' " 

i= 1 





and noting that the following definition for dpj is adopted (Yang 1999) 
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A 




Simplifying the expression in Eq. 20 using a different notation, the following is 
obtained 


9(7)7-! = f^°7y7} +1 (0)a^ +1 r^ 1 1 (0)°7;r 1 j (22) 

where °J 0 = ^ 4 x 4 - 

Noting the definition of the adjoint map of an element in SE( 3) acting on an 
element of ^(3) in Eq. 6 and using the adjoint representation for Eq. 22, the 
following model can be obtained 



Considering the definition of d(T)T 1 


in Eq. 16 leads to 



and adopting the vector representation for the above expression, 





is obtained, where log ( T a T l ) v 

(Tj- 1 ) e «>(3). 


/T 

G M denotes the vector representation of log 


Considering the definition of the adjoint map of an element in SE(3) acting on 
the vector representation of an element in se( 3) as illustrated in Eq. 7, Eq. 25 can be 
rewritten as 


log(7 a 7-y 



Jf j+ 1,0 «/+ 1,0 
JR j+ 1,0 



where 






In order to simplify the presentation, the following definition is adopted: 


56 Robot Work Cell Calibration and Error Compensation 


2009 


°Xj(9) °ij(9)°Rj 
0 3 x3 °Rj(q 

JR j+ 1,0 J tj+\,0 J Rj+l,0 

03x3 J Rj+ 1,0 

■ 

Defining T a as the ith measurement data at the ith pose with joint angles and 
considering the above definitions, Eq. 25 for the ith pose is given by 

log (T a T- l ( qi )) v = ^ Ado T . (#,- ) Ad, Tj+X (0) d (pj + 1 ) (27) 

i = o 

and can be expressed as a linear equation of the form 



Ado T .(q) A 
Adyj (0) = 



where 



\og(T a T- l (q,)) V 

9{Pi) T ,d{p 2 ) T ,---,d(p„ +l ) 






In Eq. 28, T 1 (q i ) and A z can be obtained from the nominal kinematic model, while 
T l a is obtained from the actual measurement data. As such, the kinematic errors within 
the nominal model of the robot, x , can be identified from Eq. 28. 

The identification process requires multiple measurements at different poses to 
achieve calibration accuracy with the objective defined in Eq. 19. Considering a set 
of m measurement data, where m > n+\ and n denotes the number of robot joints, 
that will be used in the calibration process, the measurement data can be consoli- 
dated and expressed in the following form 


Y= Ax 



where 



Equation (29) is a linear calibration model and with m > n+ 1, the linear least- 
square solution for x is given by 
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x = A*Y (30) 

where A denotes the pseudoinverse of A. 

The solution in Eq. 30 will be used to update the parameters in p h for / = 1, • • •, 
/i+l . Due to the assumption of sufficiently small errors in the formulation of the 
solution, the parameter updates must be limited or scaled by a factor y where 
y < 1 during the updating process in order for the solution to be valid. As such, 
an iterative process has to be adopted where the next round a calibration is 
performed with the updated parameters from the previous round until the errors 
||F|| < 6 where 6 is a sufficiently small value to be chosen for the calibration 
process. 


Implementation and Experimentation 

The calibration results from section “Methodology” have to be implemented in the 
industrial robot in order to increase its accuracy. A number of implementation 
approaches can be adopted but most will require the robot control architecture to 
be open or semi-open. This is a restrictive requirement as industrial robot 
manufacturers do not open up the robot controller due to safety and proprietary 
issues. 

In direct implementation, the robot model within the robot controller has to 
be modified with the calibrated parameters. This implementation is straightfor- 
ward in theory but this approach requires the robot controller to have an open 
architecture and thus difficult if not impossible to implement in commercial 
industrial robots. 

Another approach is to perform preprocessing of the robot targets off-line as 
illustrated in Fig. 3. The concept behind this implementation is to modify the robot 
targets in the controller such that the actual pose achieved by the robot in the task 
space matches with the desired pose fed into the preprocessing algorithm. These 
modifications are based on the calibration results from section “Methodology” 
which are implemented in the numerical inverse kinematics block as shown in 
Fig. 3. The numerical inverse kinematics block, which uses the calibrated param- 
eters, will determine the calibrated joint angles for achieving the desired pose. Note 
that a numerical solution for the inverse kinematics has to be used instead of an 


I Offline Pre-processing 

. P Numerical Inverse Forward Kinematics 

Desired ^ Kinematics (with 1 (with nominal 

Pose • I ^ ' I (CaHbraMri ' 

| calibrated parameters) parameters) 


Robot Controller 

(with nominal 
: Pose ' 

Parameters) 


Fig. 3 Off-line preprocessing implementation block diagram 
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analytical solution since the spherical wrist condition may not be satisfied after 
calibration. 

Subsequently, the modified pose is computed using the forward kinematics of 
the robot with the nominal parameters and the calibrated joint angles from the 
numerical inverse kinematics block. This is required if the robot controller does not 
allow direct joint inputs. Therefore, in this implementation, no modifications are 
required in the robot controller and can be easily implemented off-line on most 
industrial robots by modifying all the desired poses in the off-line program before 
uploading to the robot. 

Unlike direct implementation, this approach cannot be implemented online as 
computing the calibrated joint angles from the desired pose using numerical 
inverse kinematics with the calibrated parameters to solve for the calibrated 
joint angles can be computationally intensive if the initial estimated joint angles 
are very different from the actual solution. In order to implement this approach 
online, a possible solution is to exploit the inverse kinematics built into the 
robot controller which may be available depending on the robot manufacturer’s 
programming tools. With the nominal joint angles from the supplier’s inverse 
kinematics as an initial estimate for the numerical inverse kinematics, the 
calibrated joint angles can be quickly computed since the nominal joint angles 
will be close to the actual solution of the inverse kinematics except in special 
conditions, for example, when the robot configuration is close to singularity. 
The block diagram to implement preprocessing of the robot targets online is 
illustrated in Fig. 4. 

Experimental Setup 

In this section, the experimental calibration setup and results are presented. The 
industrial robot used in the experiments is the ABB IRB-4400 with the IRC5 
controller, which has an accuracy of about 1-2 mm across its entire workspace. 
The measurement system used is the Leica laser tracker system AT901-MR 
together with a Tracker-Machine control sensor (T-Mac) which has a positional 
accuracy of about 30 pm with 0.01° rotational accuracy based on our experimental 
setup. The experimental setup is illustrated in Fig. 5. The T-Mac is mounted on the 
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Fig. 4 Online preprocessing implementation block diagram 


2012 


P.Y. Tao et al. 



Fig. 5 Experimental setup 


end effector and the laser tracker is placed approximately 3 m away from the T-Mac 
when the robot is in its home position. 

There are two parts in this experimental study. In the first part, the calibrated 
matrices are derived using the calibration procedures detailed in section “Method- 
ology” from the nominal matrices using a set of measurements from the Leica laser 
tracker and the corresponding robot joint values. The resulting calibrated matrices 
are further verified using another set of measurements and corresponding robot joint 
values. In the second part of the experiment, the online preprocessing system 
(illustrated in Fig. 4) is implemented on the ABB IRB-4400 using the calibration 
results from the first part of the experiment. The effectiveness of the system in 
improving the robot work cell accuracy is verified by moving the robot with the 
online preprocessing system through a random set of desired poses as well as along 
a straight path. These results are then compared with the corresponding laser tracker 
measurement data. 

Experimental Results 

In this experiment, a total of 2 sets of 50 measurement poses randomly distrib- 
uted within the workspace of the robot were obtained. The first set of data is 
used solely for calibration and the second set of data is used for verifying the 
results of the calibration. The nominal transformation matrix, °Ti, is derived 
through measurements of the end-effector pose and pose information from the 
robot controller, while l T i+x for / = 1, . . ., 6 are obtained from the manufac- 
turer’s specifications. The values of the nominal transformation matrices are 
given by: 
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Applying the calibration procedures from section “Methodology,” the calibrated 
transformation matrices, denoted by the additional subscript c, are given by: 
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Table 1 Summary of calibration results 



v x (mm) 

v_y (mm) 

v z (mm) 

<*> x (°) 

<*>y (°) 

(°) 

$P\ 

-2.73 

4.92 

1.99 

-0.035 

-0.027 

0.007 

$P2 

2.91 

-0.248 

-0.108 

-0.023 

0.015 

0.006 

Sp 3 

-0.264 

-0.045 

-1.421 

-0.006 

-0.001 

-0.008 

dp A 

-0.999 

-0.062 

-0.662 

0.021 

0.017 

-0.03 

Sp 5 

-0.954 

0.753 

-0.437 

0.021 

-0.017 

-0.035 

SP6 

0.446 

-0.035 

-0.156 

0.025 

-0.018 

0.023 

<5 Pi 

0.446 

0.43 

-0.599 

0.025 

-0.045 

-0.03 


Table 2 Sensor frame poses before and after calibration 


Joint angles 

i 

-23.32, 11.89, 7.52, - 

-52.37, -29.99, 48.33} 

Nominal pose 


" -0.0041 

0.999 

0.0446 

0.1729" 




0.0096 

0.0446 

-0.999 

2.1254 




-0.9999 

-0.0037 

-0.0098 

0.3765 




0 

0 

0 

1 


Measured pose 


' -0.0053 

0.999 

0.0454 

0.1762" 




0.0114 

0.0455 

-0.9989 

2.1208 




-0.9999 

-0.0048 

-0.0116 

0.3775 




0 

0 

0 

1 


Calibrated pose 


" -0.0053 

0.999 

0.0454 

0.1762" 




0.0114 

0.0454 

-0.9989 

2.1209 




-0.9999 

-0.0047 

-0.0117 

0.3776 




0 

0 

0 

1 



The changes made to the nominal transformation matrices after calibration is 
summarized in Table 1. 

The effectiveness of the calibration algorithm can be seen in Table 2 and Fig. 6. 
From Table 2, the results for an arbitrarily selected robot pose show that the 
measured sensor frame pose can be precisely described using the calibrated param- 
eters. On the other hand, Fig. 6 presents the errors of the sensor frame pose in the 
measurement frame prior and after calibration. A summary of the position and 
orientation errors using the nominal and calibrated matrices, based on a separate set 
of 50 measurement poses for verifications, is given in Table 3. 

Online Preprocessing Implementation and Verification 

In order to verify the effectiveness of the calibration results and the proposed 
implementation procedure, the online preprocessing implementation proposed in 
section “Implementation and Experimentation” is applied to the ABB industrial 
robot. The robot is programmed to move to 70 randomly generated poses within a 
sphere with a diameter of 800 mm. The numerical inverse kinematics of the robotic 
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Fig. 6 Calibration results based on proposed calibration algorithm 


Table 3 Error between computed and measured poses (after verification) 



Position error (mm) 

Orientation error (°) 

Average 

Maximum 

Average 

Maximum 

Before calibration 

5.759 

6.444 

0.13 

0.168 

After calibration 

0.385 

0.974 

0.017 

0.04 


Table 4 Error between desired and measured poses (after implementation) 



Position error (mm) 

Orientation error (°) 

Average 

Maximum 

Average 

Maximum 

Without compensation 

5.705 

6.276 

0.123 

0.163 

With compensation 

0.293 

0.822 

0.023 

0.069 


arm used in the preprocessing module is based on the POE approach (Chen 
et al. 1999). The laser tracker measurements obtained from the sensor in these 
modified poses are used to compare with the desired poses. The experiment is 
conducted twice, once without compensation and the second time with compensa- 
tion through the online preprocessing. The positional and orientational errors of the 
robot with and without compensation are summarized in Table 4. 

From the experimental results in Table 4, it can be seen that the accuracy of the 
robot has improved significantly (from positional accuracy of 5.7-0.29 mm and 
orientational accuracy of 0.12-0.023°) when online preprocessing is carried out 
with the calibrated matrices. It was also noted that after using the online 
preprocessing compensation, the accuracy of the robot for more than 90 % of the 
randomly generated poses is less than 0.5 mm, whereas the errors for the last 10 % 
vary between 0.57 mm and 0.822 mm. It is noted that the poses with the large errors 
are spread out within the workspace as shown in Fig. 7. 
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Fig. 7 Randomly generated poses in 800 mm diameter sphere 


Compensation 

A comprehensive model-based calibration approach can accurately compensate for 
all errors within the robot workspace in theory. However, the effectiveness of these 
approaches is largely dependent on the model and calibration parameters used in 
the calibration. Any differences between the calibrated model and the actual model 
will limit the effectiveness of the model. If the calibration data are well spread out 
within the robot workspace while the model and calibration parameters are insuf- 
ficient to fully describe the actual robot behavior in the entire workspace, a 
convergence problem may be encountered during calibration. On the other hand, 
if the calibration data points are confined within a small working region, the 
resultant calibrated model will only be valid for that localized region. Even with 
an accurate model, these calibration results will degrade over time due to mechan- 
ical wear, and multiple calibrations may be required over the lifetime of the robot. 

An alternative to model-based calibration, an indirect compensation using online 
measurement data between the robot end effector and work objects have been used 
in the industry. A common solution used in the industry which adopts this approach 
is the use of probing systems (White paper: Survival of the fittest - the process 
control imperative 2011) such as the Renishaw probes to identify the relative 
positions of some key features/planes of the work object to the robot base or tool 
frame. This solution is commonly used in CNC machine centers to achieve high 
levels of accuracy and can be directly replicated in robotic work cell although the 
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accuracy maybe lower with the result valid only within a localized region as the 
inaccurate robot is used as a measuring tool. Another indirect compensation method 
has been developed by Nikon Metrology (Adaptive robot control) where a triple 
camera optical CMM is used to establish the relative poses of the robot end effector 
and work object. This measurement information is then used to compensate for the 
positioning errors online. 

A sensor-based error compensation method which adopts the indirect compen- 
sation approach is developed in this section. Using a sensor mounted on the robot 
end effector to capture a point cloud data of localized region of the work object, the 
relative poses of the tool and work object frame can be established that reduces the 
positioning error within that localized region. With the model of the work object, 
which is required for off-line programming in the first place, and the captured point 
cloud, the difference between the nominal and actual work cell can be established 
using point cloud registration techniques (Mitra et al. 2004). The proposed method 
to improve work cell accuracy fits nicely within the off-line programming 
framework. 


Methodology 


In this section, a sensor-based error compensation method is proposed where a 
sensor attached to the robot end effector is used to identify and compensate for the 
pose errors between the tool and work object. This method adopts the second 
approach for error compensation of robotic manipulators described in section 
“Literature Survey and Commercial Solutions.” 

The concept is further illustrated in Fig. 8, and adopting the notations used in 
section “Literature Survey and Commercial Solutions,” the actual tool, work object, 
and sensor frames are denoted by [Kf], {Kq}, and {A^ +1 }, respectively, while 


the nominal tool, work object, and sensor frames are denoted by {/ff }, {A^ v }, and 

A sensor mounted on the robot end effector is used to capture a point cloud of the 
work object. Sensors, such as laser scanners, light interferometry scanning systems, 
stereo vision systems, etc., can be used to capture the point cloud data with 
sufficient levels of accuracy. Although the point cloud is defined in and relative 


'N 


to the actual sensor frame {Kf 1+X }, the relative pose of {K „+\ } to the robot base 


A 


frame { K \ } (given by 1 T„ + x ) is unknown and the nominal transformation L T„ +X will 
be used instead. As such, even if the point cloud is accurately mapped to the work 
object model via point cloud registration thus establishing the actual relative pose 
' 7+1 Tq between the sensor {Kf l+X } and work object { Kq } frames, the resulting pose of 
the work object frame relative to the robot base frame that is obtained will be { K 0 } 
instead of {K^+i }. The difference between the actual and compensated work object 
frame reflects the error sources within the robotic manipulator and serves to 
indirectly compensate for these error sources. 

Unlike the calibration work in the previous section where {/f 0 } denotes the 
measurement frame, in this section, \K 0 ] denotes the work object frame which is 


\™A 


1 rrN 
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Fig. 8 Frames and transformation matrices for eye in hand setup 


rigidly attached to the work object. The work object model, assumed to be 
described as a mesh, is referenced to this frame. 

Since the tool and sensor are assumed to be rigidly attached, in relatively close 
proximity, and properly datumed, the errors between the ideal and actual trans- 
formations T T^ l+1 and r 7^ +1 between the tool frames [K ^ } and {Kf} and the sensor 
frames {/if^ +1 } and {A^ +1 } are assumed to be negligible compared to other error 
sources. 

This concept is similar in principle to other approaches which depend on touch-/ 
probing-based systems or human-in-the-loop feedback to establish the relationship 
between the tool and work object frames described in section “Literature Survey 
and Commercial Solutions.” 

The transformation P T 0 between the nominal } and compensated {/f^} work 
object frame in the sensor frame can be determined by using the captured point 
cloud data, the ideal model of the work object used in off-line programming and a 
best fitting point cloud registration. The main objective of improving work cell 
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accuracy is to guarantee that the robot can move in such a way that the actual tool 
center point, defined as the tool frame {Kf} carried by the robot, follows a 
predefined path, defined in the actual work object frame {/Co }. 

Using this compensation method, the above objective is still achieved via 
establishing the accurate relative poses of [Kf } and work object frame {Kq } at a 
given point using sensor information, even though the relative poses of the actual 
tool frame [Kf] and work object frame {Kq } to the actual robot base frame [K x } 
cannot be determined. However, due to the lack of an accurate robot kinematic 
model, this approach is only effective locally around the compensation pose. 

In robot programming, implementation of this compensation algorithm can be 
achieved by updating the work object frame in the robot frame with the new 

1 r 1 

compensated work object frame in base frame T 0 which can be computed as 
follows: 


IjC 

1 o 


1 tW t rpN n-\-\ >rC 
1 n+ 1 


(31) 


For the purpose of the point cloud registration algorithm, the CAD model of the 
work object is assumed to be made up of a number of triangulated surfaces. 


Point Cloud Registration 


The method proposed in (Mitra et al. 2004) for point cloud registration is adopted in 
this section. The objective of point cloud registration is to identify the transforma- 
tion to map a point cloud to another point cloud or in this case a model represented 
by a number of triangulated surfaces, such that the sum of distances between points 
in the point cloud to the model is minimized. By minimizing this distance, the 
localization problem is solved as the relationship between the sensor and the work 
object is established and via Eq. 31a more accurate positioning of the tool in the 
work object frame can be achieved locally around the region of compensation. 


The points in the point cloud P s 


p s v -,p s 


n 


p 


obtained via the sensors are 


initially defined in the sensor frame where n p is the total number of points captured 
by the sensor. To prepare for the point cloud registrations, these points are 
converted to the nominal work object frame via 





rpN t rpN 

1 n+l 





r T 1 

where /?, = [x ph y pi , z pi ] are the same points p\ defined in the nominal work object 
frame. All subsequent actions are in the nominal work object frame {A^}. This 
approach reduces the computational time, as the model pose is constant in {^}, as 
such all related parameters, to be defined in section “Defining Error Functions,” 
remain constant and thus need not be recomputed during the iterative registration 
process. 
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Defining Error Functions 


The work object model is assumed to be represented by a number of triangular 
surfaces, the distance, d ij9 between the point, p h and a surface in the model, s h 
formed by the three vertices, v jk = [x vjk , y VJ - k , z v j k ] T for k = 
expressed as follows: 


1, 


3, can be 


d * * 

Ulj 
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c, - Pi ■ n. 


p Uk x v jk 


min 

^ k= 1 , 2,3 
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Vik 


if p Yf G sfj nclosed 9 else 


if o < < 


for k 
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jkl 


(33) 


where 


is the Euclidean norm of •; n 
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klxj i Ylyj , kl z j 


is the unit normal vector to 


the surface sy q is the plane constant of the equation describing the plane defined by 
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is defined as the enclosed space by the three 


• i 

vertices v yl , v /2 , and v /3 ; and rfj 0 ' G is the point p { projected onto the plane Sj\ 


and Pj H jl = Pijk'Vjk • I n Eq. 33, if the condition 0 < 

for more than 1 value of k , the minimum value of = || x Vj k || is chosen. 

The squared distance d 2 j between each point pj in the point cloud and the model 
is given by d 2 p = min 7= i ? ?/7v djj, where n s is the number of surfaces in the model. 

The squared distance function defined in Eq. 33 satisfies the requirements in (Mitra 
et al. 2004) where the squared distance functions have to be in a quadratic form 
defined as follows: 


< 


V 


jkl 


is satisfied 



AiXpi + BiXpiy p i + C t y pj + D\Xp\Zp\ + Eiy pi Zpj 
+ Fi z 2 pi + GjXpj + H (y pi + 1 p p i + / ) 



Note that the quadratic form defined in (Mitra et al. 2004) does not contain the 
term but this does not affect the results as the term disappears in the partial 

E Yl 

P d 2 p which can be 

l X 

parameterized into the following form: 







as required by the point cloud registration technique proposed in (Mitra et al. 2004) 

rr-y A A 

where Qi = Q\ G R x is a symmetric matrix representing the approximation of 
the squared distance function to the model around the point p t . The matrix Q t for 
each point pj in the point cloud at the current iteration can be computed from Eq. 33 
and using the definition in Eq. 34 is given by 
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Solving the Least Square Problem 

The objective of the point cloud registration is to identify a transformation P T 0 such 
that d 2 defined in Eq. 35 is minimized when { p Tq} is applied to all the points in the 

I 1 

point cloud. Subsequently, the pose n Tq of the compensated work object frame 
{*ol in the sensor frame {K n+1 } can be obtained using 



which can be used to determine J 0 using Eq. 3E Note that the pose of this 
compensated work object frame {K 0 }, as seen in Fig. 8, is not the same as the 
actual work object frame {Kq }. This compensated work object frame {Kq } serves 
to reduce the tool to part error through indirect compensation of both the errors due 
to deviations between the actual and nominal kinematic model of the robot and 
other error sources within the work cell. 

The point cloud registration is an iterative process, where P T 0 is determined via a 
series of infinitesimal translations t = [t X9 t y , t z \ and infinitesimal rotations Rot(a, 
/?, y) in order to minimize d 2 where t X9 t y , and t z are respectively the infinitesimal 
displacements along the x, y 9 and z axes of the fixed nominal work object frame 
{#o 1 and a , /?, and y are respectively the infinitesimal rotations about the x 9 y 9 and 
z axes of the fixed nominal work object frame {A^q } . The rotation Rot(a, /?, y) can be 
approximated as follows: 









During each iteration, the new squared distance after the transformation is 
given by 


n p 

di = E] {Rpi + t) T Qi(Rpi + t) (39) 

i= 1 

where the matrices Q f are likely to change after each iteration. The objective of the 
least square problem is to identify the rotation R and translation t such that Eq. 39 is 
minimized. 

Taking the partial derivatives of Eq. 39 with respect to each of the parameters 
and setting the results to zero, the following is obtained: 
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where T, E R 6x 6 , A t E 7? 6x \ and 5 = [ a , /?, t x , t y , t z ] T . The definition of T, and A, 

is given in (Mitra et al. 2004) and repeated here for completeness. 
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The parameters in 5 can be determined using 
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Due to the assumption of infinitesimal transformations and the corresponding 
approximations (Eq. 38), the computed result will deviate from the actual solution 
to the least squared problem (39) if the computed 3 in Eq. 41 is large. In addition, 
the parameters of Q, in Eq. 36 are dependent on the current position of the point p f 
in the nominal work object frame. A significant change in the position of p\ will 
likely result in Q f changing, rendering the solution 3 invalid. 

In order to avoid the above issues, the computed transformation should be scaled 
by a factor k where k < 1 . The factor k can be a constant parameter or a variable 
dependent on 3. For example, to limit the rotational and translational change in each 
iteration to a maximum value of co max and t max , respectively, 


k = max 


t 


max 


Co tyi E 




where k co nst < 1 is a constant design parameter to be chosen by the user based on 
simulation results and the desired convergence rate. The corresponding scaled 
transformations R sca ied and t sca / e d can be determined as follows: 
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where w = k[a , /?, y] E R / 3x 3 E R ^ x is an identity matrix, and 
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(45) 


(46) 


At each iteration, the transformation P T 0 is updated using Eqs. 38 and 41 as 


follows: 



R 


scaled 





where T° p 1 ^, for the first iteration is an identity matrix. The same procedure is 
repeated until d 2 j — » 0 V / = 1, . . ., n p or when a threshold value for d 2 j , a design 
parameter to be selected by the user, is reached. 
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Data Preprocessing 


The point cloud data used for the registration process will likely represent a portion 
of the workpiece model which aggravates the problem of local minima in registra- 
tion due to increased likelihood of points being mapped to the wrong surfaces in the 
model. In order to reduce computation time and reduce the likelihood of local 
minima solutions, a model preprocessing method is proposed in this section where 
only the surfaces within the line of sight of the sensor are used in the registration 
process. By reducing the surfaces, the computation time for Eq. 33 will also be 
reduced. 

A number of points on each of the triangular surfaces in the model is first 
defined. These points are defined arbitrarily within the enclosed surface s f lclosed 
of each of the surfaces Sj within the model. In this section, a total of seven points are 
selected for each surface, for surface Sj the points are defined as \v ja , v jb ,. . ., 
Vj g \. These points are defined in the nominal work object frame, and used to 
determine if Sj, is within the line of sight of the sensor. Taking point v ja as an 

example, V C j a is first defined as a line segment formed by v ja and the point of capture 


of the sensor p c , assuming a single origin of scan. V c j a is defined in the nominal work 


object frame. Subsequently, two tests are conducted to determine if the point v ja is 


within the line of sight of the sensor. The first test checks if the vector, v ja — p c , is 
within the field of view of the sensor. If this condition is satisfied, it is further 
checked if this point v ja is not blocked by other surfaces from the sensor. A surface, 


s k for k 


1 , 


n 


and k ^ j, is deemed to be blocking v ja if the following 


condition is satisfied: p 


k 

cja 


ja 

G s e k nclosed where p k cja is the point of intersection between 


the plane formed by s k and the line segment V c j a , if any. The surface Sj will be used 
in the point cloud registration if any of the points, \v ja ,. . v jg \, satisfies both the 
above conditions. 

In order to account for errors between the nominal and actual relative poses of 
the sensor and work object frames, additional surfaces can be added by repeating 
the process while perturbing the pose of the sensor frame to account for possible 
errors . 


Simulation 

In this section, a simulation study to verify the effectiveness of the proposed 
approach is presented. A point cloud is first generated assuming an initial pose of 
a sensor which has a ±20° and ±15° field of view about the horizontal and vertical 
axes respectively and an angle resolution of 1° per scanning point for each of the 
axes. The points in the point cloud are also randomly perturbed by 20 pm to 
simulate sensor noise. A total of 947 points, i.e., n p = 947, were generated in this 
simulation. The objective is to verify the effectiveness of the proposed technique in 
identifying the correct P T 0 . With P T 0 identified, the work object localization can be 


56 Robot Work Cell Calibration and Error Compensation 


2025 



n Reduced Surfaces 
□ Initial Surfaces 


Direction and point of scan 


05 


Fig. 9 Initial and reduced model used in the simulation 


completed using Eqs. 31 and 37. The simulation is developed using C++ and 
conducted on a laptop with 1.87 GHz Centrino® Duo core, 1.5 Gb of RAM, and 
running Windows® XP. 

The complete model of the work object, consisting of 4,192 surfaces, is illus- 
trated in Fig. 9. Using the technique described in section “Data Preprocessing,” the 
model is now reduced to 2823 surfaces, a 33 % reduction, and shown in Fig. 9. The 
significance of the model reduction technique is dependent on the size and shape of 
the work object as well as the relative pose of the sensor and work object. 

In Fig. 10, the point cloud captured by the sensor defined in the nominal work 
object frame is shown on the left. The pose difference between the nominal 
transformation n+l T$ and actual transformation ,1+1 Tq between the sensor and 
work object frames is randomly determined. The point-to-point error defined as 
the error between the points in the point cloud and their actual positions on the 
model given by 










is unknown to the algorithm since °Tf l+ 1 is unknown. This error reflects the actual 
error between the nominal and actual relative poses of the work object and sensor 
frames and is used in this simulation to gauge the effectiveness of the proposed 
solution. At the start of this simulation, the maximum and average point-to-point 
error is 23.8 and 21.88 mm, respectively. 
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□ Reduced Surfaces 

□ Initial Surfaces 


+ Scanned points from initial pose 

* Points after transformation t r „ 


Fig. 10 Point cloud before and after transformation 


After applying point cloud registration using the reduce model, the maximum 
and average point-to-point error between the points in the point cloud and their 
respective positions on the model in the nominal work object frame is reduced to 
6 and 2 pm, respectively, whereas the maximum and average distance between the 
points in the point cloud and the model, which can be computed via Eq. 33, in the 
nominal work object frame is 27 and 10 pm, respectively. The differences can be 
attributed to the addition of random measurement noise of 20 pm to the points in the 
point cloud. In comparison, an accuracy level of around 0.3 mm was achieved for 
an ABB IRB 4400 robot within a limited spherical working region of 800 mm in a 
previous experimental study (Mustafa et al. 2010) using kinematic calibrations of 
the robot model. The Dynamic Motion Measurement system from Nikon Metrology 
has claimed an accuracy of 0.2 mm in a hole drilling experiment. 

In this simulation, there were a total of 9 iterations during the point cloud 
registration, i.e., total of 9 transformations p T l 0 for i = 1, . . ., 9. Total computation 
time for all iterations is 16.58 s and the time taken for each iteration varies between 
5.58 s for the first iteration and 0.88 s for the last iteration. The average and 
maximum distance between all the points in the point cloud to the surfaces as 
well as the average and maximum error between the point cloud and their ideal 
positions on the surface before each iteration are illustrated in Fig. 11. 


Calibration with Compensation Framework 
System Overview 

The objective of the proposed framework, illustrated in Fig. 12, is to replicate 
in the actual work cell, the desired tool to part motion defined in the virtual 
environment. 
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Fig. 1 2 System overview 


The following description of the system is based on Fig. 12. The desired tool to 
part motion is first sent to the system with a corresponding compensation pose that 
is near the desired motion. The robot is moved to the compensation pose based on 
the nominal work cell model in (T). Subsequently, the sensor captures information 
of the work object at the compensation pose in (2). This information is sent to the 
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compensation process as given in (3) and used to determine the best value for 
^(O) which minimizes the error between the sensor information and work object 
model. The updated °r 1 (0) is used in ® to compute the joint angle trajectories that 
will achieve the desired tool to part motion around the compensation pose. This 
process is repeated for every compensation pose and corresponding tool to part 
motion while the computed °T 1 (0) is stored and consolidated with the 
corresponding joint angles in (4). This information is used to calibrate the work 
cell model in (5) and replaces the nominal work cell model in ® after every 
work cycle. 

In the proposed framework, it is assumed that °T 1 (0) of the actual system 
remains constant for a single work cycle, that is, the relative pose of the work 
object and robot base frames remains unchanged once it is loaded into the work cell. 
The localization results within a single work cycle is consolidated and used as 
training data for the calibration process. A minimum of (n+ 1), where n is the 
number of robot joints, distinct compensation poses are required in order to provide 
sufficient training data to the calibration process for a nontrivial solution. In 
addition, there must be sufficient joint angle variations between compensation 
poses in order to obtain a good calibration result. Therefore, the proposed frame- 
work is targeted for large work objects where at least (n+ 1) distinct compensation 
poses are required. 

The proposed framework can also be extended to include external motion axes 
moving the work object or the robotic manipulator. The motivation for integrating 
these two approaches is to achieve high accuracy levels throughout the robotic 
work cell. 

The sensor-based approach is a localized solution, which have been described in 
section “Compensation,” where errors tend to increase as the robot moves away 
from the point of compensation due to the difference between the nominal 
kinematic model and actual robot behavior. This phenomenon is illustrated in 
the simulation study. The robot kinematic model must be improved in order to 
extend the accuracy levels beyond the limited region, hence the calibration 
process. 

The calibration process, which has been largely described in section “Calibra- 
tion,” is a model-based approach which depends on parametric variations of the 
nominal model to approximate the behavior of the actual robot. The effectiveness of 
calibration will be limited if parametric variations of the nominal model is insuf- 
ficient to describe the motion of the actual robot. The sensor-based localization 
approach is non-model based which serves to compensate for the residual errors 
resulting from the differences between the best-fitted nominal kinematic model and 
the actual model. 

In the proposed framework, the calibration process is easily incorporated into the 
robotic system where the localization results are used as training data. The main 
difference from the method described in section “Calibration” is that the error 
between the measured and computed pose given in Eq. 40 can be obtained directly 
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from the results of the Mi compensation transformation P T$ from Eq. 47. As such, 
Eq. 40 can be rewritten as 



where log(») K E R 6 denotes the mapping of log(») G se(3) C R 4yA to its 
corresponding six-dimensional vector representation. 

However, during the calibration process, the nominal kinematic parameters are 
being updated while this error does not reflect the updates. As such, Eq. 49 is only 
applicable for the first iteration of the calibration process. For subsequent iterations, 
we will require 



where °7 1 ^i ) denotes the sensor pose in the work object frame for the Mi compen- 
sation pose computed using the updated kinematic parameters during the yth 
iteration of the calibration process. 

Following the approach described in section “Calibration” and the model in 
Eq. 12, the error model can be expressed in the following form: 



\ 7 z=0 

Rearranging the terms in Eq. 50, the following expression is obtained: 



y = Ax 


(51) 
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Considering a total of m compensation poses where m > (n+ 1), all the 
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results can be combined in the following form: Y = Ax, where Y 
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•, A m with y z and A, the results for the zth localization result in the 


form described in Eq. 5 1 . The least square solution of x is given by 


x = A + y 


where A + denotes the pseudoinverse of A. The parameters for / = 1, . . ., n+\ are 
updated based on the results for x and the process is repeated until the error 
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is sufficiently small at the j th iteration. 


Simulation 

In this section, a simulation study is conducted to illustrate the motivations and the 
effectiveness of the proposed framework. The simulated work cell with the work 
object is illustrated in Fig. 13. The simulation software is developed in C++. 

The robot model used in the simulation study is based on an ABB IRB 4400 
industrial manipulator with 6° of freedom, i.e., n = 6. A robot model considering 
the effects of joint compliance is used in the simulation to determine the actual pose 
of the end effector while the nominal model used in calibration ignores the effects 
of joint compliance. This is to illustrate the drawbacks of calibration when the 
nominal model is unable to fully describe the actual robot behavior via parametric 
optimization. The joint compliances used in this simulation study are obtained 
experimentally using an actual IRB 4400 robot. In the simulation study, a payload 
of 10 Kg is attached to the robot end effector with the payload center of gravity 
located at [100, 0, 0] mm from {/f 7 [. 

The nominal parameters are randomly perturbed to emulate the uncertainties 
within the actual work cell. The parameter p x is randomly perturbed by ±30 mm 
and ±0.5° to reflect the uncertainty between the pose of the work object and robot 
base frames, the parameters p { for / = 2, . . ., 6 is randomly perturbed by ±0.1 mm 
and ±0.1° to reflect the uncertainties within the robot, and p 1 is randomly perturbed 
by ±0.1 mm and ±0.1° to reflect the uncertainties between the end-effector frame 
and sensor frames. 

A 3 -dimensional laser scanner provides the sensor data for localization and is 
simulated with the z-axis of the sensor frame being the forward direction of the 
sensor. The sensor has a field of view (FOV) of ±30° and ±25° in the x and y axes, 
respectively, with an angle resolution of 1° for both axes. The sensor has a 


Fig. 1 3 Work cell with work 
obj ect 
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maximum scanning range of 300 mm and random measurement noise of 0.1 mm 
which is simulated using a uniform distribution with a range of ±0.1 mm. 

First, the localization process is simulated using the virtual robot work cell 
where a total of 44 compensation poses are identified around the work object 
model. In order to simulate the actual environment, the robot joint angles to achieve 
the desired sensor frame poses are first derived using the nominal work cell model 
and the actual sensor frame pose is computed using these joint angles and the actual 
parameters and taking into consideration the effects of joint compliance. The point 
cloud is then generated based on the actual sensor frame pose, the work object 
model, and the sensor specifications. The localization results for the 44 compensa- 
tion poses are summarized in Table 5 which shows that errors are compensated after 
applying the localization technique. 

In Table 5, the results are based on the consolidated results at all 44 poses. The 
mean and max distances to model, defined as 



44 


and 



max • 



k=i 


44 


respectively, can be computed, while the point-to-point distances are only known in 
the simulation environment and was not used in the registration process. The point- 
to-point distances reflect the actual error which are used for illustrating the effec- 
tiveness of the localization process. In Table 5, the point-to- surface distance was 
larger than the point-to-point distance after registration. This was largely due to the 
simulated, measurement noise of 0.1 mm. 

While the results in Table 5 are promising, the error will generally increase as 
the robot moves away from the compensation pose due to errors sources within the 
robot manipulator. In order to illustrate this problem, a desired pose is defined at 
varying distances of 50, 200, and 500 mm from each of the 44 compensation poses. 
The desired pose is shifted in a randomly selected direction for each of the 
44 compensation poses, resulting in 152 new poses. The errors are derived using 
the same process and summarized in Table 6. The simulation results validate the 
notion that localization loses its effectiveness as the robot moves away from the 


Table 5 Consolidated 
registration results for all 
44 poses 


Errors (mm) 

Before registration 

After registration 

Mean 

Max 

Mean 

Max 

Distance to model 

13.977 

29.996 

0.056 

0.191 

Point to point 

33.072 

34.645 

0.010 

0.016 


Table 6 Error between compensated {K 7 } and actual {K 7 } in { AT 0 1 with payload of 10 kg 


Distance from compensation (mm) 

0 

50 

200 

500 

Mean position error (mm) 

0.038 

0.287 

1.150 

2.939 

Mean orientation error (°) 

0.003 

0.005 

0.018 

0.047 
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Table 7 Results from calibration simulation (no localization) for payload of 10 kg 


Calibration simulation 

Positional error (mm) 

Orientation error (°) 

Mean 

Max 

Mean 

Max 

Before calibration 

28.053 

37.972 

0.661 

0.885 

After calibration 

0.172 

0.343 

0.016 

0.040 

Verification 

0.286 

0.487 

0.025 

0.043 


Table 8 Error between compensated {K 7 } and actual {K 7 } in {A'q} after calibration with payload 
of 10 kg 


Distance from compensation (mm) 

0 

50 

200 

500 

Mean position error (mm) 

0.041 

0.045 

0.082 

0.186 

Mean orientation error (°) 

0.003 

0.003 

0.004 

0.009 


compensation pose. Calibration is thus required to improve the kinematic model of 
the robot to overcome this issue. 

Calibration is conducted using the results from localization. In this simulation, 
30 randomly selected poses out of the 44 poses are used in the calibration with 
the remaining 14 poses used for verification. The results are summarized in Table 7 
and the errors remain significant after calibration due to the inclusion of joint 
compliance effects in the actual model which was not considered in the nominal 
model. 

Both problems in the localization and calibration process can be solved 
using the proposed framework; Table 8 illustrates the effect of using localization 
at each compensation pose and the non-optimally calibrated parameters from the 
previous cycle. Although the error has increased after moving 100 mm from the 
compensation pose, it is reduced compared with the results without calibration in 
Table 6. The error is also much lesser compared with the calibration results in 
Table 7 . 


Summary 

In this chapter, a survey of existing methods in the academia and commercially 
available solutions to improve robot work cell accuracy is presented. Subsequently, 
two methods are presented in further details to illustrate different approaches that 
can be adopted to improve work cell accuracy. The first method uses a model-based 
calibration approach where work cell accuracy is improved through parametric 
identification of the robot kinematic model. The calibration process uses measure- 
ment pairs of the end-effector pose and corresponding joint angles to identify an 
improved set of model parameters. An online preprocessing method for 
implementing the calibration results is later introduced to overcome the closed 
control architecture of industrial robotic controllers. An experiment is conducted 
using an ABB industrial robot (IRB 4400) and a Leica laser measurement system to 
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verify the effectiveness of this calibration procedure. The second method adopts a 
non-model-based approach where sensors are used to establish the relative pose of 
the tool and work object, thus indirectly compensating for all errors. Finally, the 
two different approaches are integrated in a single robot accuracy enhancement 
framework. Simulation results are presented to illustrate the advantages of the 
framework. 
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Abstract 

Grippers and end effectors are devices through which the robot interacts with the 
environment around it. This chapter introduces basic concepts, mechanisms, and 
actuation of grippers used in industrial applications. The purpose is to provide 
readers with guidelines on design and selection of suitable grippers for their 
particular applications. 
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Introduction 

In robotics, an end effector is a device at the end of a robotic arm, designed to 
interact with the environment (http://www.gibsonengineering.com). The nature of 
the end effectors depends on the intended task. Although end effectors may be 
broadly described in two main categories: grippers and tools, in robotics they are 
more commonly referred to as grippers (FESTO Website, http://www.festo.com/; 
http : //ww w .piab .com/) . 

A robotic gripper can be classified either by its gripping principle or by the 
method of closing/opening. According to the principles, a gripper can be classified 
as mechanical, vacuum, or magnetic grippers (FESTO Website. http://www.festo. 
com/; http://www.piab.com/; http://www.liftrite.ie/; http://www.fipa.com/). In 
terms of the closing/opening methods, a gripper can be either an angular or parallel 
type. If considering the centering requirement, a gripper which can centralize an 
object during gripping is termed as a synchronous gripper, and conversely an 
asynchronous gripper does not have the feature of object centering. Generally, 
mechanical grippers with a parallel closing/opening manner are often synchronous 
and able to provide a good part centering function. In industrial applications the 
selection of grippers depends on the tasks, specific parts, and requirements (Wolf 
et al. 2005; Ceccarelli and Gradini 1992). 

At present most industrial robotic grippers are designed for specific purposes and 
must be redesigned for different tasks (Okada 1979; Chen 1982; Monkman 
et al. 2007; Crisman et al. 1996; Robotics Bible 2011). Generally, most grippers 
have to be modified by the gripper manufacturer to accommodate new object 
geometry. Shape- specific grippers only allow the gripper to grasp objects with 
very specific geometry. Some shape-specific grippers can be changed to accom- 
modate a variety of object geometries (Mock 2009). However the change of gripper 
often requires significant time, resulting in production delays. 

There is a growing demand for more generic, multi-usage robotic grippers 
(Laliberte and Gosselin 1998; Belter et al. 2011). A low-cost method is to utilize 
hard stops that can customize the stroke length of a gripper to provide flexible 
openings (Aukes et al. 2012). Modular or detachable fingers equipped with sensors 
are an alternative for engineers to design multi-usage flexibility in their devices. In 
addition, multi- fingered grippers are also popular for grasping items that need firm 
holding and dexterous manipulation. Multi-usage grippers allow robots to grip 
multiple parts with different geometries, thereby increasing the adaptability of the 
robotic work cell and reducing the time for the tool change. 

Usually the number of actuators present in a gripper determines its degrees of 
freedom. A greater number of actuators increase the versatility of the gripping 
device, but it also creates issues of larger size, complexity, and increased weight of 
the device. A class of devices called underactuated devices has fewer actuators 
than their degrees of freedom (Laliberte et al. 2002; Birglen et al. 2008). The 
concept of underactuation allows a robotic gripper to adjust itself to an irregular- 
shaped object without complex control strategy and numerous sensors (Siciliano 
and Khatib 2008; Peerdeman et al. 2013; Odhner and Dollar 2011; Herder and 
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Kragten 2010; Belter and Dollar 2011; Krut et al. 2010; Odhner et al. 2013; Aukes 
et al. 2011; Dollar and Howe 2010; Belter et al. 2013; Birglen et al. 2002; Air 
Muscles From Shadow Robot http://www.shadowrobot.com; Gengenbach 
et al. 1997). Therefore, underactuated grippers are able to provide a greater 
gripping versatility. They are more efficient, consume less power, simpler, and 
generally more reliable than their fully actuated counterparts (Odhner and Dollar 
2011; Gengenbach et al. 1997). 

The design of robotic grippers is a complex process (Zhao 2007; Physik 
Instrumente; Lexcellent 2013), which involves different aspects of mechanics, 
actuation, and control (Birglen and Gosselin 2004; Frecker et al. 1997; Chen and 
Lin 2008). In the initial design stage, mechanism types and actuation methods have 
to be considered in priority. Although the design depends on the specific grasping 
task, the design requirements are generally focused on operational reliability, 
ability to provide adequate (often adjustable) force, shape/force matched prehen- 
sion, as well as good adaptability to handling operations. 

This chapter is organized as follows. Basic concepts of robotic grippers are 
described in section “Basic Concepts.” Some typical mechanisms of two- or three- 
fingered grippers are introduced in section “Transmission Mechanisms.” In section 
“Actuators and Sensor,” some of actuator components and sensors used in robotic 
grippers are briefed. As a case study, the design process of a compliant gripper is 
described in section “Compliant Gripper Design: A Case Study.” Finally, the 
chapter is concluded in section “Conclusion.” 


Basic Concepts 

A gripper used in robots typically works by means of mechanical-, vacuum-, or 
magnetic-based grasping methods. This section will introduce these three types of 
grippers and their selection guideline. 


Mechanical Grippers 

Mechanical grippers grasp objects through the clamping forces exerted by fingers. 
According to the methods of actuation, the mechanical grippers can be further 
divided into electric grippers and pneumatic grippers. 

Electric grippers : Electric grippers use motors to make their fingers move. The 
advantages are that speed and position can be easily controlled. Adaptive electric 
end effectors are a good choice for many applications such as bin picking, handling, 
machine tending, and more. Some commercial electric grippers are shown in Fig. 1. 

Pneumatic grippers : Pneumatic grippers utilize air to actuate the fingers. Usu- 
ally, compressed air is forced through a piston in order to make it move. This 
movement can be parallel or angular (Fig. 2). 

The mechanisms for motion transmission of mechanical grippers will be 
discussed in detail in section “Transmission Mechanisms.” 
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Fig. 1 Commercial electric grippers (http://www.gibsonengineering.com) 



Fig. 2 Commercial pneumatic grippers (FESTO Website, http://www.festo.com/) 

Vacuum Grippers 

Vacuum grippers (Fig. 3) make use of the vacuum force to hold objects. This type 
of grippers will provide good handling if the object surface is smooth, flat, and 
clean. Its performance depends on the surface properties of the object being 
grasped. 

Vacuum cups, also commonly known as suction cups, are used as the gripping 
device. Generally, the vacuum cups (suction cups) are in the round shape. These 
cups are made of rubber or other elastomeric materials. Sometimes, it is also made 
of soft plastics. 
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Fig. 3 Vacuum gripper and suction cups (http://www.piab.com/) 



Fig. 4 Electromagnetic grippers (http://www.liftrite.ie/) 


Vacuum grippers are commonly used in heavy industries, automobile assembly 
lines, and compact disk manufacturing. It is also used in the packaging industry for 
labeling, sealing, bottling, and box manufacturing. 


Magnetic Gripper 

The magnetic gripper is most commonly used in an end effector for grasping 
ferrous materials. It can be classified into two common types, namely, magnetic 
grippers with electromagnets and permanent magnets. 

Electromagnets : Electromagnetic grippers (Fig. 4) include a controller unit and a 
DC power source for handling the materials. This type of grippers is easy to control 
and very effective in releasing the part at the end of the operation. In order to reduce 
the residual magnetism on the part, the polarity level is minimized by the controller 
unit before the electromagnet is turned off to release the part. 
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Fig. 5 Permanent magnetic grippers (http://www.liftrite.ie/) 


Permanent magnets : Unlike the electromagnet-based gripper, the permanent 
magnet-based grippers do not require any external power for handling the materials 
(Fig. 5). After the gripper grasps a part, an additional device called as stripper push- 
off pin is required to separate the part from the magnet. This device is incorporated 
at the side of the gripper. 

The advantage of this permanent magnet-based gripper is that it can be used in 
hazardous environments and explosion-proof apparatus because there is no electric 
supply and potential electric sparking. 

Apart from being simple and low maintenance, other advantages include no 
accidental release of part due to power failures. 


Selection of Grippers 

Grippers are a key integral part of robotic automation, and in order to ensure the 

overall success of the automation projects, choosing the right gripper is essential. 

From a process and part perspective, the following technical factors must be taken 

into account when choosing the right gripper. 

• Task: The tasks to be done by a robot often determine the type of gripper. A very 
fast loading/unloading requirement will favor vacuum cups or magnetic grip- 
pers, but a slower process will favor mechanical grippers for accuracy. 

• Cycle time : The cycle time for the process will determine the speed needed for 
opening and closing the gripper which in turn will determine the acceleration. 
Robot maximum acceleration possible is determined by the end payload, which 
is the total sum of the gripper weight and the part weight. More gripper weight 
means the robot has a reduced part carrying capacity. 

• Precision need : For assembly work which requires high precision, a mechanical 
gripper, actuated by servo electric motors, would be ideal. Part sorting processes 
will require gripper adaptability in terms of part size and positioning, which can 
be fulfilled by the choice of a mechanical gripper with servo control. 
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• Environmental need : There is a diverse range of industrial environments 
which the end effectors function and this affects the selection. In the food 
and pharmaceutical industries, for example, hydraulic-actuated grippers are 
forbidden since there is a risk of oil leakage and contamination. In many 
cleanroom industries vacuum and pneumatic grippers are not recommended 
since they can create turbulent airflow and stir up dust particles. Grippers 
used in harsh environments like foundries, machining, and welding are 
exposed to dirt and particles so they must be designed with protection fea- 
tures. Corrosive or toxic environments in nuclear or chemical industries also 
require special considerations to ensure the gripper’s reliability and safe 
usage. 

• Clamping force: Calculation of the clamping force must take account for part 
weight, maximum acceleration during the process, surface friction, and the 
maximum stresses sustained by the part and gripper. 

• Parts : Knowledge of the part to be manipulated is crucial in determining which 
gripper should be used. The main factors are: 

- Size : Except for vacuum and magnetic grippers, all other grippers need to 
grip parts with a parallel or angular closing motion. This means the bigger the 
part, the more reach is required by gripper fingers. The gripper must have 
enough reach to handle the parts but not in excess since longer fingers require 
more torque on the tool actuator and the robot. 

- Shape: The shape of an object will determine the type of grasping. Flat 
surfaces can be handled by vacuum- or magnet-type grippers, while other 
shapes will be handled by jaws, claws, or multiple fingers. When using 
encompassing or fingertip grasps, finger reach must be taken into account. 

- Weight: To ensure firm grasping, a gripper must have enough force to 
withstand the inertial and dynamic forces generated due to the weight of the 
part and the motion during the process. A designer cannot simply use max- 
imum clamping force since damage to the part or the gripper may occur. 

- Surface type: The type of surface texture and roughness is an important factor 
when estimating friction. In turn the friction is used as input for the grasping 
force. 

Transmission Mechanisms 

The design of a gripper can be performed according to many possible design 
concepts and methods (Wolf et al. 2005; Ceccarelli and Gradini 1992). One of 
the main issues is to determine a proper transmission mechanism. According to 
statistical studies, 60-70 % of objects with cylindrical, parallelepiped, and pyrami- 
dal shapes can be gripped using two or three fingers grasping (Monkman 
et al. 2007). This is why two- and three- fingered grasps are quite common in 
industrial applications. This section will therefore focus on introduction of trans- 
mission mechanisms of grippers with two or three fingers. 
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Mechanical Gripper Mechanisms 
Finger Movement Classification 

Generally, the opening and closing of gripper fingers can be actuated by pivoting, 
linear or translational movement. 

Pivoting Movement 

In this movement, the rotation of fingers is conducted with the fixed pivot points of 
the gripper for providing open and close actions. A linkage mechanism is often used 
for achieving this movement. Some of typical examples with the fixed pivoting 
points are shown in Fig. 6 (Chen 1982). 

Linear or Translational Movement 

Figure 7 shows some typical structures of grippers with linear or translational 
movement. During linear movement, the guide rails are used to move the fingers 
parallel to each other for closing and opening. In translational movement, the 
fingers are maintained in parallel orientation. As in pivoting movements, linkage 
mechanisms are used for actuation. 

Mechanisms for Actuation 

Actuation of finger motions can be accomplished by anyone of these kinematic 
types: linkage, screw, gear and rack, rope and pulley, or cam actuation. 

Linkage Actuation 

A linkage is an assembly of links connected by joints to manage forces 
and movement. The design of linkage actuation helps in finding out the 
conversion of gripper’s input force into the gripping force, the time taken to 
actuate the gripper, and the maximum capability to open the finger. There 
are many possible kinematic designs, and some of them are shown in Fig. 8 
(Chen 1982). 

Screw Actuation 

The screw-type actuated gripper (Fig. 9) consists of a screw connected with a 
threaded block. To rotate the screw, a motor is used along with a speed reduction 
device. If the screw is turned in one direction, the threaded block is moved in one 
direction. Similarly, the threaded block moves in the opposite direction if the screw 
is turned in the other direction. As the threaded block is attached with the gripper, it 
makes the fingers open and close. 

Gear and Rack Actuation 

In this actuation, the gear and rack are connected with a piston, which provides 
linear-type movement. The two partial pinion gears are driven when the rack is 
moved. As it is linked with gripper, the opening and closing of fingers are accom- 
plished. Some examples of this type mechanism are shown in Fig. 10. 
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Fig. 6 Kinematic schemes of some pivoting-type grippers (Chen 1982) 
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Fig. 7 Kinematic schemes of some parallel grasping grippers (Chen 1982) 
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Fig. 8 Grippers actuated by linkage mechanisms (Monkman et al. 2007) 
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Fig. 9 Grippers actuated by screw mechanisms (Chen 1982) 



Fig. 10 Grippers driven by gear and rack mechanisms (Chen 1982) 

Rope and Pulley Actuation 

In this actuation, a tension device is required to control the rope tension and 
keep it taut during movement in the pulley. When the pulley is activated in one 
direction for opening the gripper, the tension device will allow the change in 
rope length while maintaining the tension. Similarly, the gripper is closed by 
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Fig. 11 Grippers driven by rope and pulley mechanisms (Monkman et al. 2007; Crisman 
et al. 1996) 



Fig. 12 Gripper actuated by cam mechanisms (Crisman et al. 1996) 


activating the pulley on the other direction. Figure 11 illustrates two gripper 
mechanisms driven by rope and pulley. 

Cam Actuation 

As a form of linkage -actuated gripper, the cam actuation types have a wide range of 
designs for opening and closing the gripper fingers as shown in Fig. 12. One type 
(Fig. 12a) is briefly described here. A cam-actuated gripper with spring-loaded 
follower can be used to provide open and close actions of fingers. The spring 
function is to force the gripper to close if the cam is moved in one direction, 
while the movement of cam on the other direction causes the gripper to open. This 
type is useful for holding various sizes of work parts (Crisman et al. 1996). 

There are other miscellaneous mechanisms, for example, diaphragm or expand- 
able bladder mechanisms (Robotics Bible 2011; Mock 2009), which are inflated or 
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deflated to open and close the fingers. For more information of other actuation types 
of grippers, readers can refer to the book “Robot Grippers” (Monkman et al. 2007). 


Adaptive Gripper Mechanisms (Underactuated Grippers) 

Adaptive gripper design often adopts the principle of underactuated mechanisms. 
An underactuated mechanism is defined as one which has fewer actuators than its 
DOFs (Laliberte and Gosselin 1998; Belter et al. 2011; Odhner 2012; Aukes 
et al. 2012; Laliberte et al. 2002). The concept of underactuation allows grippers 
to envelope the objects to be grasped and automatically adapt to their shape, size, or 
position with even one actuator and without complex control strategies. While a 
finger is closing on an object, the configuration of the finger at any time is 
determined by the external constraints associated with the object. When the object 
is fully grasped, the force applied at the actuator is distributed among the finger 
joints (phalanges). 

There is a large diversity of mechanical devices which enable a hand to adapt to 
the geometry of an object. Generally, an underactuated motion can be implemented 
through the following types: (a) differential (Birglen et al. 2008; Siciliano and 
Khatib 2008), (b) compliant (Peerdeman et al. 2013; Odhner and Dollar 2011), 

(c) selective self-locking (Herder and Kragten 2010; Belter and Dollar 2011), or 

(d) a combination of these three properties (Krut et al. 2010; Odhner et al. 2013). 

Basic Principle of Adaptability 
Differential Mechanisms 

Figure 13 illustrates a concept of underactuation through a differential mechanism 
(Laliberte and Gosselin 1998; Laliberte et al. 2002) which has one input and two 
outputs. The finger is actuated through the lower link, as shown by the arrow in the 
figure. Since there are two DOFs and one actuator, one (two minus one) elastic 
element must be used. In the present example, an extension spring tends to maintain 
the finger fully extended. A mechanical limit is used to keep the phalanges aligned 
under the action of this spring when no external forces are applied on the phalanges. 
The sequence of the phalanges motion is as follows: 

(a) The finger is in its initial configuration and no external forces are present. The 
finger behaves as a single rigid body in rotation about a fixed pivot. 

(b) The proximal phalanx makes contact with the object. 

(c) The second phalanx is moving with respect to the first one - the second phalanx 
is moving away from the mechanical limit - and the finger is closing on the 
object since the proximal phalanx is constrained by the object. During this 
phase, the actuator has to produce the force required to extend the spring. 

(d) Finally, both phalanges are in contact with the object, and the finger has 
completed the shape adaptation phase. The actuator force is distributed 
among the two phalanges in contact with the object. 
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Fig. 13 Grasping sequence performed by an underactuated finger (Laliberte and Gosselin 1998) 
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Fig. 14 Underactuated finger with compliant element (Siciliano and Khatib 2008) 


Compliant Elements 

Figure 14 shows an example using the compliant element to achieve adaptive 
feature (Siciliano and Khatib 2008). The spring (compliant element) gives exten- 
sibility to the tendon. It allows the motion between the first and second link when an 
external force is applied to the distal one to be decoupled, providing the possibility 
to fit the shape of objects. A design task is the choice of the stiffness of the 
deformable element in order to achieve at the same time a strong grasp and a 
good shape adaptability. 

Selective Self-Locking Device 

In this strategy, the motion generated by only one motor is transmitted and 
distributed to several joints. Actuation and control of each joint is obtained by 
means of devices like self-acting clutch, brake, or self-locking mechanisms. Var- 
ious finger trajectories can be attained through selectively locking certain joint 
rotations. Figure 15 illustrates an underactuated finger with a friction pawl locking 
system (Peerdeman et al. 2013). 
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Fig. 15 An underactuated gripper by means of joint locking (Peerdeman et al. 2013) 


Examples of Underactuated Grippers (Hands) 

Due to increasing demands from industry, a few underactuated fingers have been 
proposed in research. Some of them are based on linkages, while others are based 
on tendon- actuated mechanisms. This section describes the design features of some 
typical underactuated (grippers) hands. 

Underactuated Gripper Adaptive to Position Offset 

The two-DOF prismatic gripper as shown in Fig. 16 is a simple underactuated 
gripper for objects that are not centered in the grasp. The system consists of two 
fingers constrained to move only in linear translation. Each finger has certain spring 
and a drive cable that exerts force on the fingers. The main actuator tendon pulls on 
the system with force Ta. In this setup, the drive tendons are decoupled to two 
single actuators T R and T L using a floating pulley system. The floating pulley acts as 
a simple differential to match the tension force in each output tendon (Peerdeman 
et al. 2013; Odhner and Dollar 2011). 

The floating pulley mechanism couples one to two output drive cables and the 
gripper is therefore underactuated. It also permits the system to adapt to object 
offsets, u , within the grasp. Figure 16c shows the finger displacements X L and X R as 
the actuator force increases, demonstrating this adaptability. 

Selectively Compliant Underactuated Hand 

The hand (Fig. 17), also called SRI hand (Belter and Dollar 2011; Krut et al. 2010), 
consists of three identical fingers each actuated by a twisted string actuator. Each 
finger has three phalanges connected in series with a cable running along pulleys in 
the joints, terminated at the fingertip (Fig. 18a). Since there are three independent 
degrees of freedom in the finger and only one actuator, springs are attached across 
each joint to provide return actuation and to balance the closing tendon force 
between each phalanx. Extending from each phalanx into the joints is an interleaved 
set of surfaces used to electrostatically lock each joint (Fig. 18b). Covering each 
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Fig. 16 Phase of grasp acquisition in the prismatic two-DOF gripper (Peerdeman et al. 2013) 

phalanx is a set of capacitive force sensors. Tendon tension is measured using load 
cells integrated into the tendon path, allowing for force control using the tendon 
tension. The palm has one additional motor to enable reconfiguration of the finger 
positions and orientations, allowing for opposed, spherical, and interlaced grasps. 

A Compliant Underactuated Hand 

This hand, also called iHY (iRobot-Harvard- Yale) hand, was designed to address 
the need for a durable, inexpensive, moderately dexterous hand. It has three fingers 
and five actuators (Fig. 19): Each of the three fingers has a single flexor tendon 
running the length of the finger, and the adduction/abduction degree of freedom on 
the finger pair is driven by a fourth. A fifth actuator powers an extensor tendon on 
the proximal joint of the thumb. This actuator allows the link angle of the thumb 
proximal and distal joints to be set independently and is particularly useful for tasks 
in which the tip of the thumb needs to move arbitrarily in the plane, such as the 
finger pivoting. 

The iHY fingers (Fig. 20) utilize a two-link modular design. The proximal pin 
joint connecting the finger to the hand is mounted on a circular magnetic base which 
serves as a modular attachment point and also as a high-force breakaway coupling 
to ensure that fingers will be minimally damaged in a catastrophic collision. Spring- 
loaded “pogo pin” electric contacts further simplify the modular fingers so that 
replacing a finger can be accomplished by snapping the finger in place, then 
attaching the flexor tendon. A tendon attachment point for the antagonistic tendon 
used on the thumb is included at the base of the proximal link on all fingers, so that 
any finger can be interchangeably used in any position on the hand. 
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Fig. 1 7 The SRI hand 
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Fig. 18 (a) Pulleys and cable routing, (b) Brake 
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Fig. 19 The iHY hand 
(Dollar and Howe 2010) 
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Fig. 20 Cross-sectional view of the iHY finger design (Dollar and Howe 2010) 


One of the more important features of the iHY finger is its high compliance, 
especially at the distal flexure. The distal flexure hinge allows out-of-plane motion. 
The proximal pin joint includes a torsion spring, which provides the proximal joint 
with some elasticity. This compliance serves several purposes: first, because the 
fingers do not have extensor tendons, the joint elasticity alone extends the fingers 
when the flexor tendons are relaxed. This is particularly useful when operating the 
hand in an unknown environment where collision with obstacles is likely, so that 
fingers merely deform in response to unplanned contact. The torsional compliance 
at the distal flexure joint provides a similar robustness to the fingertips for out-of- 
plane contact. The second purpose served by passive finger compliance is passive 
adaptation to the shape of the object grasped, which removes the need to detect and 
react to small variations in surface geometry. 

Linkage-Driven Underactuated Hand 

This hand was developed by the Laval University and also named as Self Adaptive 
Robotic Auxiliary Hand (SARAH). It comprises three identical underactuated 
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Fig. 21 CAD model of the 
Laval hand (Birglen 
et al. 2002, 2008) 



fingers and each has three DOFs. The hand includes ten DOFs in total but only has 
two actuators. One actuator is used to control the opening/closing of all three 
fingers, while another actuator is used to drive the orientation of the fingers. The 
overall system is illustrated in Fig. 21 (Birglen et al. 2002, 2008). 

Each of the three identical fingers is mounted on an additional revolute joint 
whose axis is located on the vertex of an equilateral triangle and oriented normal to 
the plane of the triangle. The orientation transmission uses one actuator to drive the 
two rotating fingers. This rotation is synchronized by the gearing system and the 
fingers rotate in opposite directions. If some fingers are blocked, the remaining 
fingers will continue to close until they properly grasp the object. The force is fully 
applied only when all the fingers have properly made contact with the object or the 
palm. With these additional re volute joints, the hands can be reconfigured by 
modifying the orientation of the fingers to adapt to the general geometry of the 
object to be grasped. 

The mechanism of the three-DOF underactuated finger is shown in Fig. 22. It is 
composed of two parallelograms mounted in series. Two mechanical limits with 
springs at the top and bottom ends of the mechanism allow precision grasps to be 
performed and the adaptation to power grasps if necessary. 
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Fig. 22 Grasp diagram of the finger mechanism: (a) Parallel, (b) Power (Birglen et al. 2002, 
2008) 


Actuators and Sensor 
Actuators 


Actuators are the “muscles” of a gripper, converting stored energy into movement. 
The most commonly used actuators for grippers include electric motors that spin a 
wheel or gear and linear actuators that provide linear motions. But the actual 
selection of actuators depends on the specific properties of each gripper design. 
There are many other alternative types of actuators, powered by electricity, 
chemicals, or compressed air, to suit different requirements. 


Electric Motors 

Electric motors convert electric energy into mechanical energy. They are the 
commonly used actuators in both robots and grippers. Stepping motors are used 
in low-cost proportional systems instead of pneumatic drives. Servo motors (syn- 
chronous motors) are often used for demanding applications in which sensitive 
force and position regulation are needed. These motors are often preferred in 
systems with lighter loads and where the predominant form of motion is rotational, 
for example, in screw actuation, gear and rack actuation, rope and pulley actuation, 
and cam actuation mechanisms. 
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In order to select the appropriate electric motors, the following factors should be 
calculated and evaluated: 

• Number of motor revolutions needed 

• Moment of load 

• Moment of inertia from moving masses 

• Acceleration torque 

• Moment of friction 

The suitable motor can be selected to meet the above torque and speed require- 
ments. Finally, based on the current and voltage requirement of the motor, a 
controller can be selected. 

Linear Actuators 

Various types of linear actuators move in and out instead of rotating and often have 
quicker direction changes, particularly when very large forces are needed such as 
with industrial robotics. 

The most frequently used linear actuators in grippers are pneumatic drives. 
Actuation is realized either by pneumatic cylinders integrated within the gripper 
housing or by externally mounted cylinders. Such prime movers are robust and 
resistant to overload. The actuation can be single direction cylinders with spring 
return or double direction. The actuation force F can be calculated as 

F = PAr/ (1) 

in which P is the effective pressure (pressure difference at two sides of the 
cylinder), A the cross-sectional area of the cylinder and r| the efficiency (typically 
between 0.7 and 0.9). 

Another interesting pneumatic drive is pneumatic artificial muscles, also known 
as air muscles (Air Muscles From Shadow Robot, http://www.shadowrobot.com) or 
fluidic muscles (FESTO Website, http://www.festo.com/). They are special tubes 
that contract (typically up to 40 %) when air is forced inside them. 

Vacuum Suction Cup 

Vacuum suction is one of the oldest astrictive gripping methods which can provide 
a continuous holding force without the application of compressive stress. It has 
been used extensively throughout industry. The key components are normally 
elastomeric suction cups or caps. The cups can be either active or passive. Active 
suction cups require typically vacuum about 70 % of atmosphere. The simplest 
implementation is to force a flexible suction cup against a surface. Air is expelled 
by a vacuum pump or vacuum suction nozzle as the flexible polymer cup is 
compressed. 

In passive suction caps the vacuum is produced simply by pressing a disk sucker 
having a relatively soft rim against a flat surface. Mechanical pressure can be 
applied by hand or machine. Therefore, the suckers do not require further 
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operational force and can also be applied to slightly curved surfaces. These grippers 
are not affected by failure in the power supply, though leakage losses cannot be 
compensated for. 

Grippers based on vacuum cups can be used for both large and heavy parts and 
very small components in the semiconductor industry and micro-assemblies 
(Gengenbach et al. 1997). 

Magnetic Actuator 

Similar to vacuum suction, magnetic actuators can exert attractive force on ferrous 
materials which can be used for gripper design. The implementation can be in both 
passive and active forms. 

A simple permanent magnet can be used to acquire ferrous objects. Under 
normal temperature, permanent magnets maintain their magnetic force almost 
indefinitely. External power supply is no needed; thus, the design can be extremely 
simple with low maintenance and well suited for flat materials, rods, and iron tubes. 

Electromagnetic actuator can be made from coils wound on cores of high 
magnetic permeability. The attractive force may be calculated and designed 
according to actual requirement. By controlling the current in the coil, the force 
can be actively controlled. 

Magnetic grippers have simple and compact construction with no moving parts. 
The energy supply is simple with high efficiency. They can be applied with a wide 
range of object sizes and shapes. However, there are several disadvantages and 
limitations associated with magnetic grippers. They can only be used with ferrous 
materials (iron, nickel, cobalt). Extended release time is needed due to magnetic 
reminisce. This can also lead to the collection of fine magnetic particles and dust 
over prolonged time. Due to permeating effects of the magnetic field, separation of 
thin sheet objects from one another is difficult, and the object can become slightly 
magnetized. 

Piezoelectric Actuators and Motors 

Piezoelectric actuators convert electric energy into mechanical force and motion 
based on piezoelectric effect. According to the electric drive method, they can be 
divided into two categories: the resonant-driven piezoelectric actuators and the 
nonresonant-driven piezoelectric actuators. The resonant-driven piezoelectric actu- 
ators include ultrasonic transducers and ultrasonic motors. Piezoelectric motors or 
ultrasonic motors are the recent alternatives to DC motors. They work on a 
fundamentally different principle, whereby tiny piezoceramic elements, vibrating 
in ultrasonic frequency range, cause linear or rotary motion. The advantages of 
these motors are nanometer resolution, available force for their size, and holding 
force when powered off (Zhao 2007). These motors are already available commer- 
cially; however, real industrial applications in gripper design are rarely seen. 

The nonresonant-driven piezoelectric actuators include various one-stroke actu- 
ators such as piezo-stacks and benders (Physik Instrumente) . Their operating 
frequency range is from quasistatic up to about half of the first resonant frequency 
of the mechanical system. These actuators are very compact in size but can generate 
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relatively large force up to several kilonewtons. However, typical strain levels that 
can be obtained by piezoelectric materials are in the range of 0.1 %. This strain 
occurs at field strength in the region of 1,000 V/mm. In order to increase the total 
displacement and to reduce the driving voltage, these actuators are often built in 
stack of very thin layers which are connected in parallel electrically. Typical stroke 
of multilayer piezoelectric actuators is from a few to several tens micrometers. 
Thus, they are mostly used in small grippers in which high precision and small 
strokes are needed. 

Beside the abovementioned actuators, many other types of actuator are avail- 
able, but less commonly used due to their limitations. These include muscle wire, 
also known as Shape Memory Alloy, Nitinol, or Flexinol Wire, which is a material 
that contracts slightly (typically under 5 %) when electricity runs through it. They 
have been used for some small robot applications (Lexcellent 2013). EAPs or 
EPAMs are a new plastic material that can contract substantially (up to 380 % 
activation strain) from electricity. 


Sensors 

Sensors allow the end effector or gripper to receive information about the environ- 
ment or internal components. This is essential for them to perform their tasks and 
act upon any changes in the environment to calculate the appropriate response. 
There are three types of sensing which are related to the end effector or gripper 
technology: 

• Proximity or presence detection of objects to be manipulated 

• Verification of gripping or release 

• Sensing of object position and orientation 

Numerous types of sensors exist which can be used to fulfill the abovementioned 
required sensing requirements. The selection of sensor is independent of the chosen 
control strategy, the environment, and the physical design of the end effector or 
gripper. The rest of this section will give a brief introduction of the most commonly 
used classes of sensors relevant to the task of sensing in grippers. 

Proximity Sensors 

Proximity sensors are noncontact sensors which deliver a signal from which 
information concerning the instantaneous distance to a given object can be derived. 
The signal coupling can be realized by inductive, capacitive, fluidic, optical, or 
acoustic means. 

Inductive Sensors 

Inductive sensors are also called as eddy current sensors. They are suitable for 
electrically conducting objects, usually metals. The sensor consists of an induction 
loop. Electric current generates a magnetic field, which collapses generating a 
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current that falls asymptotically toward zero from its initial level when the input 
electricity ceases. The inductance of the loop changes according to the material 
inside it, and since metals are much more effective inductors than other materials, 
the presence of metal increases the current flowing through the loop. This change 
can be detected by sensing circuitry, which can signal to some other device 
whenever metal is detected. The sensing range can be up to several centimeters, 
but recognition of some alloys and carbon fiber parts presents problems. The 
detection can also be dependent on the thickness of highly conducting objects. 

Capacitive Sensors 

Capacitive sensors use the electric property of “capacitance” to make measure- 
ments. Capacitance is a property that exists between any two conductive surfaces 
within some reasonable proximity. Changes in the distance between the surfaces 
change the capacitance. It is this change of capacitance that capacitive sensors use 
to indicate changes in position of a target. They are suitable for practically all 
materials with a few exceptions but limited to a few millimeters distance. Capac- 
itive sensors can achieve nanometer resolution which makes them indispensable in 
ultraprecision applications. 

Photoelectric or Optoelectronic Sensors 

Photoelectric or optoelectronic sensors offer noncontact sensing of almost any 
substance or object up to a range of 10 m. Photoelectric sensors consist of a light 
source (usually an LED, light emitting diode, in either infrared or visible light 
spectrum) and a detector (photodiode). Due to the high-intensity infrared energy 
beam, these sensors have major advantages over other optoelectronic systems when 
employed in dusty environments. With their focused beam and long range, opto- 
electronic sensors are increasingly used in applications where other sensing tech- 
niques are lacking in sensing distance or accuracy. 

Photoelectric sensors are available in a variety of modes including Infrared 
Proximity (Diffuse Reflective), Transmitted Beam (Thru-beam), Retroreflective 
(Reflex), Polarized Retroreflective (Polarized Reflex), Fiber Optic and Background 
Rejection. 

Ultrasonic Sensors 

Ultrasonic sensors utilize the reflection of high-frequency (above 20 kHz) sound 
waves to detect parts or distances to the parts. The two basic ultrasonic sensor types 
are electrostatic and piezoelectric. In general, ultrasonic sensors are the best choice 
for transparent targets. They can detect a sheet of transparent plastic film as easily as 
a wooden pallet, but not suitable for foamed plastics and some textile fiber products. 

Sensors for Contact 

This class of sensors responds to physical contact. When a contact happens the 
sensor will generate corresponding information which can be used to control the 
gripper, e.g., the motor of a gripper. The information being sensed is usually force, 
torch, or pressure. In the simplest case it is sufficient to have a binary sensor. 
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Binary sensors simply register thresholds for a given signal, e.g., contact between 
the gripper finger and an object. 

Force-Torque Sensors 

Force-torque sensors have been widely used in industrial grippers, e.g., load cells. 
Force feedback represents a fundamental requirement for the success of any task that 
involves physical interaction with the environment, but when trying to use it to assess 
the quality of a grasp, the force information coming from a wrist sensor is not that 
important. On the other hand, force sensors could be used to detect contact positions if 
they are placed on each finger and use that contact information to evaluate grasp 
stability. The demands of these systems by industries have driven the force-torque 
sensing technology to a great development and robustness. Force-torque sensors are 
normally located between the gripper and the robot flange. These are deformable 
elements, for which there are many designs. Usually force-torque sensors are 6D: 
force and torque. There are also 12D sensors that measure also accelerations. 

Tactile Sensors 

Tactile sensors are more straightforward to be used in grippers. Tactile sensors can give 
information of the surface in contact with the object and use it to determine the stability 
of the grasp, for example, using a classification method. The commonest forms of tactile 
sensing are piezoresistive, capacitive, and optical (whether infrared or visible light). 

Sensors for Orientation and Position 
Vision 

Naturally, vision is the most suitable sensory cue to locate objects in the workspace and 
identify obstacles on its surroundings. It can also be used to detect contacts and to track 
the relation between the gripper and the manipulated object. For gripping and manip- 
ulation tasks, the most useful information is the location and 3D stmcture of the objects 
and obstacles. The classical and common approach to obtain 3D information of the 
environment is the stereo reconstmction system. This method is based in the disparity 
between two camera images to extract depth information and has been widely 
addressed. Another approach is to actively project a pattern to the scene and observe 
its deformation to obtain the 3D stmcture. This method is used by the Kinect sensor. 

However, vision is not widely employed in industry gripper applications, 
although it has been extensively studied in the scientific community. The reason 
is that vision is not 100 % reliable due to environmental uncertainties, sensing 
errors, calibration errors, and so on. A simpler and more commonly used method is 
to use laser triangulation or acoustic sensors. 

Slip Sensor 

Slip of an object between the gripper jaws can be detected by specially designed 
slip sensors. The object may be retained using the smallest possible force, and upon 
detection of sliding (slip) this force can be automatically increased. 
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Compliant Gripper Design: A Case Study 

Compliant mechanisms, i.e., mechanisms built with elastic joints, offer several 
advantages compared to conventional counterparts: reduction of wear, clearance 
and backlash, compactness, no need for lubrication, simplified assembly, etc. The 
use of compliant joints is a very promising approach for development of compact 
grippers used for small part handling. Design of compliant mechanisms normally 
includes two methods: kinematostatic model (or pseudo-rigid-body model 
(PRBM)) (Birglen and Gosselin 2004) and topological optimization (Frecker 
et al. 1997). In this section, we demonstrate how to develop a compliant gripper 
for optical fiber handling by using kinematostatic model method. 


Design Considerations 

The tools used to handle optical fibers have a significant impact on the func- 
tionality and reliability of the optical fibers and related components (Chen and 
Lin 2004, 2008; Chen et al. 2013). In a typical fiber handling process, a gripper 
is normally used. For example, to assemble a laser diode, a gripper needs to 
grasp an optical fiber in a limited space and moves with submicron resolution in 
six DOFs to maximize the light coupling efficiency between the laser waveguide 
and the optical fiber. Since the core of optical fibers is glass based, the gripping 
force exerting on it must be delicately controlled. If the gripping force is 
too low, the fiber cannot be precisely handled. On the other hand, if the gripping 
force is too high, the stress may be introduced into the fiber core, leading to 
the distortion of the beam profile, which in turn affects the alignment accuracy. 
The excessive gripping force even results in flaws on the core surface, causing 
fiber fatigue or compressive damage. The grippers are therefore required to 
have the features of a zero-backlash motion and a controllable gripping 
force. According to analysis of the requirements of fiber handling, the following 
issues impose a great impact on the gripper design, needing to be considered in 
design. 

Millimeter-Scale Size 

In a typical automated pigtailing process, a gripper picks up a fiber with a clad 
diameter of 125 pm from an unloading mechanism and positions it in a cramped 
space inside the package housing. In such an application, it is desirable to use 
a millimeter- scale gripper with a submillimeter opening and slender yet rigid 
gripper tips. 

Conventional millimeter-scale grippers are assemblies of various metal parts. 
Although parts below 1 mm in size could be manufactured through various 
microfabrication processes such as micro-electro discharge machining and powder 
injection molding, there remains difficulty in assembling of miniature parts. More- 
over, the large size of standard components such as actuators and LM guides 
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imposes another challenge in integration with miniature gripper mechanisms. 
As such, a monolithic design may be the best way to meet the requirement of the 
miniature- sized gripper. 

High-Resolution Motion 

The accuracy of the gripping force is affected by the motion resolution of the 
gripper. To achieve a fine resolution motion for a motorized gripper, not only its 
mechanism for motion transmission needs to be backlash-free but also its actuator 
including the controller, and driver should have a high performance in terms of 
resolution and controllability. While a flexure-based structure, which is intrinsi- 
cally zero backlash in its joints, is chosen as the motion transmission mechanism, 
the motion resolution of the gripper will depend upon the actuator. Some actuators 
such as SMA, micro-stepper, or inchworm motor can provide a high-resolution 
performance, but either their controllability or their output push force cannot 
meet the requirement of fiber handling application. To this end, the PZT actuator 
is a good selection as it can offer a nanometer- scale resolution and a high push 
force. 

Passive Compliance 

Due to the relative high stiffness of fibers (a high impedance), the control of 
gripping forces requires the fingers to exhibit a low gripping impedance. Otherwise, 
small perturbations in motion may generate large changes in gripping forces. The 
impedance control method can be applied to the case, but the method needs 
accurately modeling and some combination of position and force control. Due to 
this reason, it is desirable that the gripper itself has a passive compliance when 
contacting with a fiber. 

Flexural Materials 

Two dominant concerns in selection of materials are protection of fibers, which 
requires the gripping ends to be made of a pliable material (small elastic module, 
E), and the mechanical performance of the gripper, which is defined as the opening 
range of fingers under a specified output stiffness. To have a large opening range, 
materials should allow a great elastic deformation without undergoing damaging 
plastic stress. This means that these materials have the greatest ratios between the 
elastic limit and the elastic module, a/E. A trade-off of selection between the 
materials with the low E value and the high a/E value should be conducted before 
a final material is chosen. 

Gripping Manners 

Generally, there are three types of gripping manners for a grip, namely, parallel 
gripping, angular griping, and vacuum suction that is normally used to pick objects 
with a flat surface. For cylindrical- shaped fibers, angular gripping cannot provide a 
force-close for grasp, and suction manner may not provide a sufficient suction force 
to hold the fiber in the position. Obviously, only the parallel gripping manner can 
offer a secured grip for the fiber handling. 
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Modeling and Analysis 
Topological Synthesis 

Based on the analysis of gripping manners and a study on mechanisms used in the 
conventional grippers, a gripper’s topological structure, which is able to meet the 
requirements of the fiber handling, is obtained. Its kinematic scheme is illustrated in 
Fig. 23, in which the circles represent the hinge joints. Each finger is driven through 
a 4-bar parallel-configured mechanism, which also gives motion amplification. The 
transmission ratio K can be expressed as 



The parallel four-bar mechanism allows the finger to move predominantly in 
translation instead of rotation. When an input motion is exerted with a force P, two 
fingertips will close and open in a parallel manner, providing a secure grip for the 
cylindrical fibers. The link lengths of the mechanism are determined by the 
requirement of the motion amplification. Based on Eq. 2, for the design specifica- 
tions of the finger opening 2 s, 800 pm, and the actuator travel x, 40 pm, or the gain 
K set at 20, the ratio of a to b (i.e., a/b) is found to be 10. 


Generation of Compliant Mechanisms 

To eliminate the backlash existing in each joint of the gripper, the linkage mech- 
anism obtained needs to be converted to a flexure-based mechanism. In this process, 
the design of flexure joints plays a critical role. 

Notches in the compliant members have a much lower bending stiffness than 
other portions of the material; therefore they can easily provide a compliant 
rotational motion. These notches are defined as so-called flexure joints. Under 
small deformation, a flexure joint can be modeled as a rigid hinge joint attached 
by a linear torsional spring as shown in Fig. 24. The spring constant k is equal to the 
bending stiffness of the notch, which is generally expressed as 


k = y(t, l) — , (3) 

where, E is Young’s modulus, I is bending moment of the notch, y is correctional 
coefficient for the different shape of notches, and t and / are the notch’s thickness 
and length, respectively. The values of y in the three typical types of flexure joints 
are shown in Fig. 25. 

Since flexure joints produce rotational motion by means of material deformation 
within elastic range, they are able to offer the following several significant advan- 
tages compared to conventional joints: zero backlash, free wear, smooth and 
continuous displacement, assembly free, embedded return springs, and so 
on. Flexure joints may have some so-called parasitic deficiencies, but they can be 
eliminated or minimized through proper design. 
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Fig. 25 Three types of 
flexure joints 
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Fig. 26 Flexure-based 
gripper 



Based on the concept of the flexure joints, the linkage model of the 
gripper obtained in Fig. 23 can now be easily translated into a flexure -based gripper 
as illustrated in Fig. 26. The notch shape with a half circle is selected. It is because 
such a notch can be easily fabricated. To ensure the mechanism to produce a good 
performance for motion transmission, the appropriate sizes of flexure joints have to 
be determined. This work can be done by using methods of optimization. 
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Optimization of Flexural Structure 
Formulation of Optimization 

The objective is to ensure the gripper able to produce a maximum opening in tips 
under a given input force. In detail, we hope the flexure joints have a high axial 
stiffness and a high bending flexibility. Also, the gripper should satisfy potential 
constraints in terms of motion requirements, stress limits, bulking of compression 
members, and manufacturing limits. Based on the mechanics of material, the 
optimization problem of the gripper can be expressed mathematically as follows: 
Find a set of design variables q, p which lead to the objective function mini- 
mized, i.e., 


min 


a 


max 


(5 


max 


f{q,p) 



while subject to constraints 
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(5) 
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<r 
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0.1 < ti 

8i+i (p)> 

i 1,..6 

(7) 

k 8i+i(p) < 3, 

i 1,..6 

(8) 


where 

<7 max : the maximum stress of the whole gripper 
<5 max : maximum output displacement 
A: factor of safety 
o y : material yield strength 

q\ the shape parameters of the gripper such as a, b 

p\ the parameter of size of flexure joints such as t if /,./ = 1, . . ., 6. 

Equations 5-8 reflect the constraints of motion requirements, stress limits, 
minimum thickness (manufacturing limitation), and maximum length (bulking 
limitation) of flexure pivots, respectively. 

Numerical optimization algorithms such as FEA can be employed to calculate 
the shape parameters and the flexure joint sizes. However, direct use of these 
techniques in this phase will be time-consuming and difficult because solving 
Eqs. 4-8 belongs to a continuum mechanics problem and needs a good understand- 
ing of optimization techniques. To relieve the complication of the problem, a hybrid 
approach is used, which integrates pseudo-rigid-body model, trail-and-error 
method, and FEA simulation. In this way, a set of initial design variables is first 
determined by means of the pseudo-rigid-body model. Starting from these values, 
the design is modified by using the trail-and-error method and the FEA tool. 
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Fig. 27 A pseudo-rigid-body model and its free-body diagrams for links 


Pseudo-Rigid-Body Analysis 

The pseudo-rigid-body model (Chen and Lin 2008) treats the flexure joint as a 
hinge attaching a torsional spring and other segments of material as rigid links. As 
such, the gripper’s full compliant structure can be modeled as a conventional model 
with finite number of rigid linkages, as shown in Fig. 27, where kj stands for the 
spring constant of the hinge i. The motion and forces of each hinge can be solved 
through the conventional kinematic loop closure and the force-equilibrium 
equations. 

By virtue of the principle of virtual work, the relationship of the input (F in ) and 
output (F out ) force can be written as 

Pin = I 'Pout + - (&2 + ^3 + £4 + £5 + k§). (9) 

b b 

Using the static equilibrium equations, the unknown reaction forces ( fi X ,fi y ) and 
torque (M z ) in Fig. 27 can be found: 



( 12 ) 
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(13) 

(14) 

(15) 


The maximum output displacement, <5 max , and the maximum stress, <r max , in each 
hinge can be expressed as 




max 


i 





(16) 

(17) 


(^Ynax 





where w is the width of the notch. 

It can be found that the global maximum stress of the whole gripper occurs at the 
Joint 3 or Joint 2. Substituting Eqs. 16 and 17 into Eqs. 4 and 6, we can obtain a set 
of initial design variables by means of normal optimization techniques. 


Finite Element Simulation 

After obtaining the initial design values, we can employ FEA tool to simulate the 
performance of the gripper and then use the trial-and-error approach to do modifi- 
cation. The FEA software package used for this task is ANSYS (Release 5.7). In the 
simulation, the gripper was analyzed as a 2D model with the material of AL 7075- 
T6. Due to symmetry of the two fingers, only one side of the gripper needs to be 
analyzed. The meshed model is shown in Fig. 28. 

After a few design iterations are performed, the final version of the flexure-based 
gripper is achieved. Some main parameters are a = 16 mm, b = 0.8 mm, the 
thickness of flexure joints t = 0.3 mm, / = 0.5 or 3 mm, and the gain K = 20.6 
(Note: K = 40 if no elastic deformation). The maximum stress occurs at point 
A. When the tip range is equal to 1,000 pm, the maximum stress has a value of 
0.035 GPa. It is far less than the elastic limit (0.48 GPa) of the material, therefore 
ensuring the motion of the flexure joins within the elastic range. The ratio of the 
maximum stress to the yield strength is 7.3 %. According to the theory of fatigue 
failure, when this ratio is less than 10-15 %, flexure joints can remain dimension- 
ally stable for a long period of time (without fatigue failure). 

In addition, the simulation results also confirm that the fingertips exhibit excel- 
lent parallel opening and closing motion. The typical drawback of flexural j oints in 
terms of parasitic motion did not undermine the overall design requirements. 
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Fig. 28 Meshed model of 
gripper 
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Results 

Prototype of Gripper 

A close-up view of the prototype of the gripper is shown in Fig. 29. The gripper 
mechanism was fabricated from a monolithic aluminum plate AL-7075 through a 
wire electric discharge machine process. The overall dimensions of the gripper are 
42 mm long, 22 mm wide, and 4 mm thick. It is detachable from the actuator 
bracket housing, making it adaptable for handling different miniature optical 
components. The gain of motion amplification of the gripper mechanism is 20.6. 

The actuator is a high voltage stack PZT actuator with a rational resolution of 
3 nm. It can produce a maximum push force of 2,800 N and provides a stroke of 
40 pm. The stroke can offer an opening range of 800 pm in the fingertips through 
motion amplification of the mechanism. 

The gripping force is sensed through a micron strain gauge (Tokyo Sokki 
Kenkyujo) adhered on the outside surface of a fingertip. Thanks to the flexure joints 
and the high resolution of the PZT actuator, the gripper is capable of achieving 
backlash-free motion and has a high motion resolution (0. 1 pm in finger tips). These 
performances also lead to a fine gripping force increment (force resolution) of 

0.01 N. 
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Fig. 29 Gripper prototype 
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Fig. 30 Optical fiber 
handling system 



Experiment of Fiber Handling 

Fiber handling was carried out by using an optical fiber handling system shown in 
Fig. 30, in which the flexure -based gripper was mounted on a vertical X-Y stage. A 
single-mode fiber with a diameter of 1 25 pm was gripped from a V -groove and then 
placed on another. In the experiment, the threshold value of entailed force of the 
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fiber, viz., the gripping force was set at 1.40 N. With such settings, the system 
performed more than 50,000 cycles without failures. No distortion in the beam 
profile was detected under the gripping force. 


Summary 

End effectors such as grippers have been playing a crucial role in industry robotics. 
The design and selection of grippers need a comprehensive consideration in terms 
of task, environment, and technological limits. In the industry, simplicity and cost 
are often the main guideline for the design of grippers. This situation has led over 
the years to the development of a number of special-purpose grippers optimized for 
single specialized operations but not suitable for other task. In recent years, since 
applications with high-mix and low-volume parts are more and more popular, 
adaptive grippers have become a major topic in research community and industry. 
It is expected that constant research activities in this field with developments at 
technological (mechanism, actuator, sensor, etc.) and methodological (design and 
control) levels will be carried on in the future. 
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Abstract 

Although noncontact-type operations, such as pick-and-place or spot welding, 
are still commonly being used nowadays, there is an increasing interest on 
developing and applying “compliant motion” (i.e., motion and force) control 
on industrial robots for contact-type operations. For these operations, the 
required robot motion can be complex depending on the workpiece geometry. 
As a result, realizing these tasks using online programming methods is usually 
inadequate in terms of productivity in practice. The focus of this chapter is on the 
simulation and off-line programming process for contact-type operations, where 
some level of interaction between the robot and the workpiece/environment are 
required. Due to this interaction, some additional issues arise during the pro- 
gramming process. Solutions for these problems will be brought up, and readers 
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can refer to the cited references for detailed discussions. A case study on 
robotized surface grinding systems based on the experience on industrial pro- 
jects will also be presented to further assist readers on the actual implementation. 


Introduction 

Current industrial robots are essentially complex, position-controlled machines. 
These robots can usually take in commands in a text-based form called robot 
programs. Since most industrial robots nowadays can only understand very basic 
instructions, such as moving from one configuration to another or apply a 
predefined force in a specific direction, to program robots for a given task, users 
usually need to develop a very detailed sequential instruction set for the robot 
controller. Note that users typically need to redo this programming process every 
time when there is a change in the working environment. For example, one typical 
application of robots in the industrial environment is the pick-and-place task, i.e., 
robots are commanded to pick up an object from one location and move it to another 
place along a predefined trajectory. Due to the robot’s good repeatability perfor- 
mance, once taught, it is able to duplicate its task faithfully as long as all the initial 
work-cell setup remains valid. If the location of the object/workpiece is changed, 
users are required to modify the existing robot program to accommodate for this 
change. In order to specify the location of the workpiece (to be picked or placed), 
users can either physically move the robot end effector/gripper to the target location 
(walk-through programming) or manually jog the robot gripper using the provided 
teach pendant (jogging programming) as illustrated in Fig. 1. Although the 
noncontact type of tasks (pick-and-place or spot welding for instance) are still 
commonly being used nowadays, there is an increasing interest on developing and 
applying the “compliant motion” (i.e., motion and force) control for industrial 
robots (Lim and Tao 2010). This can be observed from the fact that several major 
robot manufacturing companies such as ABB and KUKA have incorporated the 
motion/force control capability into their new product line in the past few years 
(Bayegan and Elisson 2010; The ABB group 2003; KUKA automatisering + robots 
N.V 2003). With this additional capability, robots are now able to handle more 
complicated contact-type operations, such as polishing, grinding, and so on. 

Note that to realize these machining tasks using industrial robots, two types of 
information are needed. The first set of information is called geometric information 
which describes the geometric constraints that the robot tool center point must 
follow. For example, if a robot is required to drill a hole at a particular location on 
the workpiece, the hole’s location with respect to the robot is the geometric 
constraint that the tip of the drill bit (tool center point) must go to. The second 
information that is required for the robotized machining operation is called process 
information. This information includes a set of process parameters that can be used 
to achieve certain machining quality. In the above drilling example, the process 
parameters can be the drilling speed, the rotating velocity of the drill bit, the drilling 
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Fig. 1 Lead-through programming (left), walk-through programming (right) (www.osha.gov) 


strategy, and so on. Generally, for contact operations, the required robot motion can 
be complex depending on the workpiece geometry and the robot work-cell arrange- 
ment. Due to this complexity, large number of target points (where the robot’s tool 
center point needs to interact with the workpiece/environment) is expected 
during the programming process. As a result, the aforementioned walk-through 
and lead-through programming (or online programming methods) now become 
less viable. The reason is that the online teaching process will require a lot of 
time due to the large number of target points required to be taught. In addition, 
online teaching methods require robots to be out of production during the teaching 
process; thus, it can significantly reduce the utilization of robotic systems. This 
observation is also true for many noncontact tasks such as arc welding or 
spray painting especially in the case where the required robot motion is complex. 
In other words, realizing a robotic task, especially contact-type machining opera- 
tions, using online programming methods is usually inadequate in terms of produc- 
tivity in practice. 


Simulation and Off-Line Programming 

To resolve the aforementioned productivity issues, simulation and off-line pro- 
gramming was introduced in the early 1980s (Quinet 1995). Off-line programming 
is essentially a technology that can generate the text-based robot program by 
visualizing the robotic work cell using computer technology (see Fig. 2). The 
geometric information of the robot program can be automatically generated based 
on the 3D model information using CAD/CAM software packages (Siemens: 
process simulate for robotics and automation 2010; Robotmaster 2013). 
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Fig. 2 Off-line programming 
(www.osha.gov) 
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This information is then combined with the robot kinematic/dynamic model to form 
the completed robot program to be executed by the robot controller. Process 
parameters can be later integrated into the machine code via a provided code editor. 
The generated robot program can be simulated in the virtual environment for 
further verification before sending out to the robot controller. The advantages of 
off-line programming can be summarized as follows (Nof 1999; Biggs and 
MacDonald 2003): 

• Reduce downtime 

• Improve safety 

• Utilize CAD/CAM software for complicated problems 

• Enhance the work-cell setup through simulation 

The subject of off-line programming and simulation has been well studied in the 
literature (Biggs and MacDonald 2003; Pan and Zhang 2008; Nagao et al. 2007). 
However, the focus of this chapter is on the emerging contact-type machining 
operations, where some level of interaction between the robot and the workpiece/ 
environment is required, from a practical point of view. 
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While off-line programming tries to replicate the actual environment in the 
simulated world to the best possible, nevertheless, there are always some differ- 
ences, and in contact operations, these differences will lead to issues such as 
collisions, vibration due to poor path planning, and so on. In addition, procedures 
like work-cell calibrations to get better robot base frame to workpiece frame and 
tool center point to tool frame are required before the machining operation can be 
successfully executed. As a result, users are required to have much knowledge on 
other fields such as workspace digitalization, work-cell calibration, and so on. This 
chapter aims to provide readers an overview of the state of the art of the current 
industrial robot off-line programming. The focus will be on off-line programming 
for contact-type operations where the required robot/workpiece interaction usually 
results in complex robot motion during the execution. Practical considerations will 
also be provided throughout the case study section to further assist readers during 
the implementation phase. 

In order to have an overview of these state-of-the-art robot programming 
methods, let us review on how a manufacturing task can be realized using industrial 
robotic systems. Generally, the robot programming process for a given task can be 
divided into the following steps: 

1 . Decomposition of the task into subtask in such a way that each subtask can be 
carried out by the robot with a fixed robot- workpiece setup. This step is currently 
carried out by human due to the lack of robot intellectuality. 

2. Realization of each subtask by breaking it down into simple instructions/com- 
mands that can be executable by the robot controller. For example, if the subtask 
is to drill a hole at a specified location, a possible set of instructions can be: 

• Pick up an appropriate drill bit from the tool change station 

• Moving from the current position to a position that suits the drilling operation 
without colliding into the environment 

• Perform the drilling operation at certain predefined process parameters 

• Withdraw from the drilled hole to a safe position 

• Move back to a home position if necessary 

This task is also carried out by human as in the previous step. 

3. Obtain feasible geometric information for all the moving instructions. For 
off-line programming, this geometric information can be obtained from the 
workpiece CAD and the robotic work-cell model. Reverse engineering to get 
digitalized data (costly and time-consuming) is required if workpiece’s CAD 
models are not available. Figure 3 presents the standard work flow of how to 
realize a contact-type operation in practice. The executability of the robot 
program mainly depends on the geometric information in this step. 

4. Obtain feasible process parameters to achieve certain requirements. These 
process parameters depend on the machining process, the interaction of work- 
piece material and tool, user experiences, etc. For instance, for the above drilling 
process, the process parameters can be the rotation speed of the drill bit, how fast 
the drilling process will be carried out, what is the drilling strategy for the in-use 
material, etc. 
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5. Combine the above process parameters with the geometric information to form 
the robot instruction code in an ordered sequence. 

Since steps 1 and 2 require significant intellectual capability and human expe- 
rience, research on how to improve the robot programming process mostly focus on 
the last three steps, especially steps 3 and 4. In fact, research in step 3 usually results 
in a so-called intuitive teaching method (i.e., how to intuitively obtain the geomet- 
ric information), while research in step 4 mainly focuses on how to obtain the 
optimal process parameters. Although the ideal situation is to combine steps 3, 4, 
and 5 into one and execute this step in a real-time manner (i.e., the robot can 
measure the in situ geometric information of the workpiece to be processed, plan 
the trajectory based on the obtained information, and automatically adjust the 
process parameters during the machining process to achieve the desired specifica- 
tions, all in a real-time manner), the above steps are still independently executed in 
practice. As a result of this decoupling, time and effort for realizing a machining 
task using a robot can be very significant. For example, Pan et al. (2012) pointed out 
that the programming time of a robotic arc welding system for the manufacture of a 
large vehicle hull is approximately 360 times the execution time. This observation 
could be one of the main hurdles that prevents the widely use of industrial robots in 
small and medium enterprises (SMEs) where high-mix low-volume scenario is 
usually required. Since using robot to improve the productivity and to gradually 
replace the fading-out skilled workforce (due to ageing problem) is a trend, it is 
reasonable to look for a method that can speed up the robot programming process 
and at the same time can make use of the experience of the skilled worker, i.e., 
intuitive robot programming by demonstration. The topic of robot programming by 
demonstration is out of the scope of this work. Readers can refer to Argali 
et al. (2009) for a comprehensive review on state of the art of this teaching method. 
An outline of industrial robot programming methods is illustrated in Fig. 4. 

As mentioned, the focus of this work is on the programming process for contact- 
type operations, where some level of interaction between the robot and the work- 
piece/environment are required. Due to this interaction, the 3D CAD model of the 
robot, its end-effector tools, the workpiece, and the work station are now critical 
components as depicted in Fig. 5. For example, the workpiece CAD model can be 
used to generate targets (T target ) along the desired path in such a way that the robot’s 
tool ( T tcp ) can approach the workpiece surfaces at a certain angle depending on the 
process requirements. 3D models of the robot and the environment are mainly used 
for collision checking during the path execution (in the simulation environment). 
With current CAD/CAM software, the desired robot paths/targets can be automat- 
ically generated with respect to the workpiece local frame (T wob J). The remaining 
issues for contact-type operations using industrial robots are mainly: 

1 . Optimal tool path generation and workpiece placement: these two problems are 
in fact coupled since some machining processes such as drilling or chamfering 
can have an “extra” degree of freedom (DOF) of the tool path. Individual 
description of each issue is as follows: 
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Fig. 4 Industrial robot programming methods 


Fig. 5 Common frame 
assignments in industrial 
robot programming 



(a) “Extra” degree of freedom of the tool path: the easiest way to see this 
problem is through the drilling process. For a required hole location, it is 
easy to observe that the rotation along the drilling bit is free. In other words, 
there is an extra DOF while assigning the target frame to any drilling holes. 
As a result, depending on how users assign this extra rotation, the robot 
internal posture during the drilling process can be changed (for a fixed robot 
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Fig. 6 Extra DOF during 
path planning 



and workpiece setup). For instance, Fig. 6 depicted that two different 
assignments of the last rotation (about z target ) resulted in two different 
solutions of the robot base frame (T base _i and T base _ 2 ). For a trajectory, it is 
clear that there should be a policy for selecting this “extra” DOF for all 
targets along the path to ensure the path executability and maybe to optimize 
the robot internal posture during the path execution. 

Currently, there are some software packages that can assist users to 
automatically assign this extra DOF such as the configurator option of 
Robotmaster (Robotmaster 2013). By plotting the configuration map for all 
joint angles (from —180 °C to 180 °C) of target points along the trajectory, 
users can have a good visualization of how the remaining DOF affects the 
path executability. Users can use this configuration map to optimize the 
generated tool path from CAD/CAM software. 

(b) Workpiece placement problem: since the robot’s TCP needs to come into 
contact with the workpiece during the machining process, the robot’s inter- 
nal posture can play a critical role during the machining process. For 
example, if the robot happens to go through a near singularity configuration 
during the machining process, external vibration (caused by the rotating 
spindle for instance) can be amplified and thus degraded the machined 
surface quality. In addition, since robot’s TCP and workpiece are in contact, 
the probability of collision between robot’s links and the workpiece or work 
cell is high compared to the noncontact-type operations. As a result, it is 
sometimes not obvious for the users to decide on the location of the 
workpiece with respect to the robot to guarantee the path executability, 
i.e., within the robot reach, singularity free, collision free, and within the 
robot joint limit. Figure 7 illustrates the described issues in the 2D form. 
The subject of robot/workpiece placement has been considered by several 
authors (Feddema 1996; Pamanes and Zeghloul 1991; Lopes and Pires 2011; dos 
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Fig. 7 Unreachable target (left), reachable target with collision (center), reachable target near 
singularity (right) 

Santos et al. 2010; Vosniakos and Matsas 2010; Yang et al. 2009) in the 
literature. However, most of the research focused on developing a placement 
method for noncontact operations where there is no interaction/constraint 
between the robot and the workpiece and/or the working environment. For 
instance, Feddema (1996) introduced a searching method of the robot base 
frame subjected to a minimum time coordinated motion. In this work, the author 
assumed that the solution of the robot inverse kinematic problem is available. If 
the robot dynamic model is also available, one can further optimize the place- 
ment solution of a serial robot manipulator by taking into account the manipu- 
lability and mechanical power during the optimization process (dos Santos 
et al. 2010). To reduce the computation of the inverse kinematics, Yang 
et al. (2009) proposed a searching method based on impelling the workspace 
toward the target points with a set of constraints to assure the reachability of the 
end effector. Although this is a promising approach, the complexity of the 
algorithm can prevent its wide use in practice, especially in the small and 
medium enterprises (SMEs) context, where the shortage of skill workers is 
usually the major problem (Lim and Tao 2010). 

As the above discussion, most of the proposals from the literature are only 
applicable for noncontact operations where there is no interaction between the 
robot and the working environment. In addition, the closed-form solution of the 
inverse kinematic problem is usually required during the searching process. For 
contact-type operations, the likelihood of collision between the robot and the 
workpiece and/or the environment is much higher. To tackle the aforementioned 
problems, an algorithm for the workpiece placement problem was proposed in 
Vuong et al. (2013). The problem formulation can be stated as follows: 

Given: 

^targets 

• T wobj 

• The robot kinematics and dynamics 

• Robot joint limits 

• The 3D model of the robot, workpiece, and working environment 
DetermineTrob^t in such a way that the desired task executability is 

guaranteed. The proposed method can be summarized as follows (Vuong 
et al. 2013): 
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Fig. 8 Step 1, problem transformation (left); step 2, virtual pulling force (right) 


• Step 1: transform the workpiece placement problem into a control problem 
which is to bring all the robot base frames to a common location while staying 
within the work-cell constraints, such as joint limits, singularity free, collision 
free, and so on. 

• Step 2: use the operational space control framework (Khatib 1987; Nakanishi 
et al. 2008) to pull the robot base frames together (via virtual force). Human 
input can be used during this step to accelerate the convergence rate. 

• Step 3: the additional work-cell constraints, such as robot joint limits and so 
on, are incorporated into the null-space controller. By doing this, the null- 
space controller will try to adjust the robot configuration to suite the con- 
straints while maintaining all robot base frames at one common location. 

For example, Fig. 8 illustrates the above steps for the 2D case where only 

03 targets are considered. Figure 9 shows snapshots of the simulation using the 

above approach. 

The key contributions of the proposed method are: 

• No closed-form solution of the inverse kinematic problem is required. This is 
because the searching process is performed in the task space rather than in the 
joint space. Moreover, instead of approaching the problem from the optimi- 
zation point of view, the framework proposed in this work allows the 
searching process to be carried out with physical meaning. 

• Physical constraints such as joint limits and singularity and collision avoid- 
ance can be incorporated as artificial constraints. By making use of the 
operational space control framework (Khatib 1987), this paper introduces a 
unified framework to assist the searching of the workpiece location under 
user-specified constraints. 

• User experiences can be incorporated during the searching process (hybrid 
searching). Since the operational space control framework in Khatib (1987) is 
the force-based control framework (Nakanishi et al. 2008), user inputs can 
easily be integrated as artificial force fields without alerting the searching 
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Fig. 9 t = 0(s) ( upper left), t = 0.03(s) ( upper right), t = 0.25(s) ( bottom left), t = 1.15(s) ( bottom 
right) 


procedure. This is an important feature since other methods using coefficients 
to guide the optimization may fail and there is no clear guidance of how users 
can adjust the coefficients (which do not have a clear physical interpretation) 
to obtain better results. 

2. Robotic work-cell calibration: since relative position and orientation between 
the robot (robot base frame) and the workpiece (world object frame) are 
unknown, these location information need to be identified through a proper 
calibration process. For machining operations using robotic systems, two cali- 
bration processes need to be carried out: 

(a) Workpiece location calibration: this is to find out the relationship between 
the T wob j and r robot as depicted in Fig. 5. 

(b) Tool center point (TCP) calibration: depending on the tool used, i.e., the drill 
bit in the above example, the contacting point between the tool and the 
workpiece needs to be identified. Usually, this relationship can be 

r r/^ P 

represented by T tf as shown in Fig. 5. 
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External sensors may be needed in the calibration process and such equip- 
ment can be very costly. Human input is also sometimes involved in the process. 
Calibration can be a tedious process and sometimes need to be performed several 
times in order to achieve the required accuracy, and regular calibration is 
required to ensure consistent performance. How this calibration can be carried 
out is out of the scope of this work. Readers can refer to the robot calibration 
chapter in this handbook for a detailed discussion on this topic. 

3. Process study: as mentioned, process parameters are meant to achieve certain 
finishing quality. To obtain these parameters, a proper process study is 
needed. This process study can be designed using design of experiment method 
(Montgomery 1984). Note that this process study usually requires the assist from 
domain experts. 

As is seen, off-line programming for machining operations usually results in 
additional issues especially during the work-cell setup. To further assist readers 
in the implementation of machining processes using robotic systems, a case study 
on automated surface grinding process is presented in the below section. 


Off-Line Programming Case Study: Automated Surface Grinding 
Background 

In the manufacturing context, surface grinding is often used to produce a finish of 
certain surface roughness on an unfinished surface, which is often milled finish. It is 
a widely used abrasive machining process which utilizes a spindle with an abrasive 
medium containing rough particles which cuts chips of metallic or nonmetallic 
substance from a workpiece to achieve a required surface roughness. In particular, 
burrs and other surface flaws are removed. Typically, a targeted surface 
roughness, quantified by the vertical deviations of a real surface from its ideal 
form, is obtained based on the application’s functional purpose. Careful selection of 
the abrasive material based on the surface to be machined is essential for the 
process to work effectively and progressively, and finer abrasive grid sizes should 
be used with each grinding pass. Suppliers of such abrasives have recommended 
strategies and products for end users to consider in order to meet the end product 
requirements. 

Many manual machining processes today employ many older skilled workers 
with years of experience. Their knowledge is hard to duplicate and be transferred, 
and it usually requires a period of apprenticeship to identify a candidate who is up to 
the task. This laborious activity does not appeal to new job seekers and often results 
in a manpower shortage as attrition in the workforce takes its toll. There are health 
risks involved as the hours of handling a tool which creates significant vibrations 
can result in hand injuries. Depending on the location, continuous operating 
duration and daily allowable working hours for such workers are subjected to 
local legislative regulation which has to be abide to. 
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With so many constrains to the manufactures requiring grinding process, it 
makes sense to automate this task. Its immediate benefits include: 

• De-skilling of workers 

• Consistent finishing quality 

• Reliable throughout 

• Continuous operation 

• Scalable operation 

However, in addition to the aforementioned issues during the off-line program- 
ming process, there are other hurdles to address. Such issues include: 

• The viability of replicating worker’s skills on a machine to achieve similar or 
better output 

• The skill-upgrading issues because existing workers are required to equip with 
new skill set for robot operators 

• The return of investment 

The above considerations are out of the scope of this work. As mentioned in the 
previous section, absolute accuracy is a necessary evil when it comes to off-line 
programming as it is vital that the virtual environment is a true representation of the 
actual setup in order to have a meaningful simulation. However, when it comes to 
contact operations like grinding, another set of challenges are presented which 
poses greater issues for consideration. 


Considerations for Programming Robots for Grinding Operations 

Recent advances in robotic activities have led to the introduction of industrial 
robots with force control capabilities. This feature opens up the feasibility of 
utilizing such robots for contact operations like grinding. For simple task like 
performing contact task involving short straight motion with a constant orientation 
within the robot’s preferred working envelope, it is relatively easy and can be 
accomplished by a robot operator within minutes. However, such ideal scenarios 
are hard to come by in the real world. 

Many workpieces come in various shapes and sizes, and quite often, they possess 
complex geometry. For manual operations, the human is adequately adapted to 
handle such situations as the human limbs are naturally compliant, and for contact 
task like grinding, the downward force to be applied is actually quite low. In fact in 
many cases, the weight of the grinding tool acting in the downward direction 
normal to the contact surface is enough. In cases where more force is required, 
the operator simply adds additional passes as and when it is required. In contrast, an 
industrial robot is naturally stiff, and getting it to perform complaint task like 
grinding is something that the robot is initially not designed for. Manufacturers 
overcome these inherent limitations by manipulating the robot’s stiffness by 



58 Simulation and Offline Programming for Contact Operations 


2085 



applied force 


fiber-disc 


Applied Force 




Fig. 10 Deformation due to applying force 


altering the internal controls and introducing force torque sensors at the robot’s end 
effectors to provide force feedback. They may even tweak some of the safety 
boundaries to allow force control capabilities to coexist with their existing software. 

While off-line programming tries to replicate the actual environment in the 
simulated world to the best possible, nevertheless, there are always some differ- 
ences, and in contact operations, these differences will lead to collisions. In fact, for 
contact operations, collision is not a bad thing but is in fact a requirement but in a 
controlled fashion. Typically for contact operations, the abrasive tool will be 
subjected to deformation as it is pressed on the surface to be ground as depicted 
in Fig. 10. It is this deformation that creates the required force normal to the contact 
surface for the spinning abrasive tool to function. Sometimes a tool may work best 
if it is angled during use. With this in mind, the flexibility of the abrasive tool with 
any supports used has to be selected to provide the necessary deformation such that 
the final contact area between the tool and surface is constant and parallel to each 
other. The ultimate goal is to attain constant pressure throughout the surface to be 
processed. This requirement in reality is not always possible due to the various 
approach angles the tool needs to come in contact with the surface which results in 
varying contact area and thus a changing contact pressure. Reasons for this varying 
approach angle may include: 


• Default angle interfering with the workpiece. 

• Available space cannot accommodate the tool. 

• Concave surface is too small resulting in multiple contact areas as shown in Fig. 1 1. 

Should such situation arise, the applied force should be changed accordingly to 
ensure a consistent applied pressure. Separate trials have to be conducted to better 
characterize the tool’s abrasive performance at various angles, applied force, and 
feed rate so as to determine material removal rate. The range of these parameters 
which will not adversely affect the removal performance also needs to be identified. 
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Fig. 11 Collision issues due to tool deformation during machining process 

These numbers will help in determining how much material can be removed for 
each tool configuration and allow for adjustments. 


Off-Line Programming for Contact Operations 

Like any other applications requiring off-line programming, the process steps are 
similar: 

• Obtain CAD of workpiece and environment 

• Identify the tool path 

• Get the tool center point 

• Get the work object frame 

• Simulate the process 

• Generate robot program 

The key difference is in the simulation studies. Typically for such studies, we 
will look out for the following: 

• Reachability 

• Singularity 

• Collision 

• Joint limits 

For contact operations, it is not possible to simulate the deformation of the tool 
when it comes in contact with the surface in a fast and efficient manner. Hence, the 
tool is included in the kinematic simulation in its deformed state based on the 
studies conducted for better visualization. In the actual process, in the actual 
process, with the force applied, the TCP can be taken as the center of the contact 
area and it is identified based on this configuration. However, interference may 
result as the tool collides with the workpiece. Hence, an intentional offset is 
included to ensure that with a default deformation, no collision should occur during 
the simulation studies. If a collision does occur, it is then necessary to modify the 
tool approach angle such that the interference is resolved. If there is no possible 
solution, the process angle can be modified next such that it is within the acceptable 
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Fig. 1 2 Industrial robot 
programming process 


range. However, if there is still no acceptable solution, it is likely that the tool used 
is not suitable for the task and modifications to the tool may be needed. 

If the robot is able to reach each designated target point without problems, it 
does not mean that the entire path can be executed in one continuous move. Take, 
for example, the case when there is an overhead obstruction along a path as 
illustrated in Fig. 11. While teaching each point along the path, the user may 
have identified suitable approach angles for each of them. However, during the 
path execution, the system has to create interpolation between 2 via points, and this 
may result in collision along the way as shown in Fig. 12. Without a suitable 
configuration to allow a continuous path motion, the existing path may need to be 
separated into two or more parts each with its approach and departure strategies 
designed to avoid the obstruction. It is important to note that breaking a path into 
several parts may result in a visible difference in surface output which may not be 
acceptable for certain applications. Placement of the workpiece or changing the 
extra DOF during path planning may help to make a single pass possible as the 
robot configuration is changed. Another problem with breaking up a finish path is 
the quality between adjacent segments. The exit of the last segment and the entry of 
the next need careful tuning of the parameters to ensure that there is no over 
machining. At times this is hard to predict as the online force search and depart 
algorithms may take longer than usual to reach the targeted force threshold values 
in order to start or stop subsequent force control routines. Causes of these fluctu- 
ations could be due to the following: 

• Unnecessary vibrations introduced due to slight variations in new abrasive tools 
which affect the switching between different phases of the force control routine. 

• Changing vibration output from tool due to wear and tear which affects the 
switching between different phases of the force control routine. 

• Permanent deformation of abrasive tools which are reused as they still have 
abrasive capabilities. This deformation may shift the TCP slightly resulting in 
slight increases in search time causing extra machining. 

At times, simple issues may cause the variation in performance. For example, 
take the case where the grinding is done in the gravity direction and against. If the 
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sensor’s zeroing and calibration is done correctly, the system should compensate 
for gravity effects in its force control routine. However, in cases where there are 
improperly secured items like cables and the connection is secured after the force 
sensor, it will end up affecting the final outcome as the connection affects the sensor 
readings. 


Summary 

The above sections presented a case study on surface grinding using industrial 
robots. As is seen, much considerations are needed to automate a machining process 
using industrial robots. As a result, the decision on whether to robotize a particular 
machining process or not is often tough for decision makers. Generally, a robotized 
machining process will typically expect to have the following benefits: 

• Minimum operators 

• Increased throughput 

• Continuous operation 

• Cost-effective 

However, such benefits are only possible if robotic implementations are intro- 
duced in phases rather than all at once. 

In the beginning, feasibility studies should be done to determine if the current 
process is suitable for robotization. A stable process that is robust will ensure a 
higher chance of success, and focus can be put on the other development work. 
However, if the current process is not suitable for robotization, a new process study 
will be required. Cases like the absence of suitable tools for robots can be show- 
stoppers. Other issues like alternative automation solutions, return of investments, 
as well as legislative requirements may also affect the overall implementation 
decision. 

Existing manual labors are usually apprehensive to adopting new technologies 
that require new skill set. In particular, having to operate in a virtual environment 
rather than the physical one takes getting used to. Such people are important in the 
initial development as they know the process requirements and the required output 
and their inputs are essential to improving the overall implementation. Getting them 
to be the users of such machines will be ideal, but there is a need to ensure that they 
are comfortable to operate the system to a level where they are comfortable 
imparting their process experience as motions of the robot. 

Only when a small-scale implementation is completed can an overall assessment 
of the robotization success be determined. If the automation is able to de-skill 
a worker, it makes sense to progress forward to consider larger-scale 
implementations . 

In summary, this chapter first presented an overview of the state of the art of 
robot programming methods for modem industrial robots. Off-line programming 
with the focus on contact-type operations was then discussed in detail. To assist 
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readers during the implementation of robotic systems for machining processes, 
practical issues and some highlighted solutions were presented. We believe that the 
work presented here is very useful for those who are trying to automate existing 
machining processes using modern industrial robotic systems. 
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Abstract 

Parallel robot (PR) is a mechanical system that utilized multiple computer- 
controlled limbs to support one common platform or end effector. Comparing 
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to a serial robot, a PR generally has higher precision and dynamic performance 
and, therefore, can be applied to many applications. The PR research has 
attracted a lot of attention in the last three decades, but there are still many 
challenging issues to be solved before achieving PRs’ full potential. This chapter 
introduces the state-of-the-art PRs in the aspects of synthesis, design, analysis, 
and control. The future directions will also be discussed at the end. 


Introduction 

A parallel robot (also called parallel manipulator, or parallel kinematic machine, 
when it is used as a machine tool) is a closed-loop kinematic chain mechanism of 
which the end effector is connected to the base by several independent kinematic 
chains (legs) in parallel. Comparing to a serial robot, each limb of a parallel robot 
(PR) tends to be simpler and shorter therefore more rigid, and the motors are 
purposely mounted close to the base in order to reduce the moving mass. Therefore, 
PRs have the following advantages: high stiffness because the external load is 
shared by multiple legs, high acceleration and high speed due to low moment of 
inertia of the moving structure, higher accuracy since the joint manufacturing and 
assembly errors will be even out rather than accumulated at the end effector, and 
high force capabilities as the output force is contributed by multiple actuators 
working in parallel. The disadvantage of a PR lies in its limited workspace volume 
due to the parallel architecture, and there are often singularities in workspace. Due 
to these advantages, PRs have received wide applications, e.g., flight simulators, 
high-precision positioning devices, pick-and-place manipulation, metrology and 
coordinate machine, reconfigurable tooling, and high-speed machining. 

The earliest parallel manipulator is the Gough platform invented by Gough in 
1947, which allows the positioning and the orientation of a moving platform so as to 
test tire wear and tear (Fig. 1). Stewart (1965) designed a parallel manipulator 
(Fig. 2) for use as a flight simulator in 1965. The research area has not attracted 
much attention until the 1980s. In the last three decades, many types of PRs have 
been designed and prototyped, and a few of them have been successfully 
commercialized. 

Based on the Stewart platform structure, the Hexcel Corporation developed the 
R series robots (Fig. 3a) for biomedical applications and P series robots (Fig. 3b) for 
industrial applications, such as welding, deburring, and milling. Prof. Clavel pat- 
ented the groundbreaking Delta robot in 1990 (Clavel and Sogeva 1990). As an 
extension of Delta robots, the IRB340 FlexPicker was introduced by ABB Flexible 
Automation in 1999, as shown in Fig. 4a. It is equipped with an integrated vacuum 
system capable of rapid pick and place of objects. Many applications related to 
Delta robots in the industry have been developed, and one of them is used as Line- 
Placer as shown in Fig. 4b. Based on the Delta robot, a number of variants of pick- 
and-place PRs have been designed and developed. Figure 5 shows two examples: 
the Quattro robot from Adept Technology (Pierrot et al. 2009) and the Diamond 
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Fig. 1 Gough platform 
(Adapted from http://www. 
parallemic . org/Re vie ws/ 
Revie w007 .html) 



Fig. 2 Stewart platform 


End-effector 



robot from Tianjin University, China (Huang et al. 2003). For high-precision and 
high stiffness machining operations, Tricept from LOXIN, Sprint Z3 head from 
ECOSPEED, and Exechon machine from Exechon corporation are some successful 
examples of PRs on the market today (Fig. 6). For more information, readers are 
referred to www.parallemic.org 
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Fig. 3 


Applications of the Stewart platform by Hexcel Corporation (a) R series (b) P series 




Fig. 4 Applications of the Delta robot for high-speed pick-and-place tasks (a) IRB340 
FlexPicker, (b) Demaurex’s Line-Placer installation 


Classification Method 

According to the end-effector mobility, PRs are commonly classified into those 
with complete mobility (six degree of freedom) and those with lower mobility (less 
than 6-DOF). Generally speaking, lower mobility PRs consist of less number of 
limbs and therefore simpler and cheaper, comparing to those with complete mobil- 
ity. Since many applications do not need a complete 6-DOF motion, low-mobility 
PRs have recently attracted a lot of attention in the robotics research community. 

Based on the motion transmission characteristics, PRs are categorized into 
decoupled and un-decoupled ones. A decoupled PR has an architecture allowing 
specific motion axes to be driven in a certain order by the designated actuators 
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Fig. 5 Parallel robots for high-speed pick-and-place tasks (a) Quattro robot from Adept Tech- 
nology (b) Diamond robot from Tianjin University 



Fig. 6 High stiffness and high-precision machining applications of PRs (a) Tricept 600 (b) Sprint 
Z3 head from DS technologie (c) Exechon X700 
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independently. The decoupled motion will result in simpler displacement calcula- 
tion, singularity analysis, workspace evaluation, static analysis, and motion plan- 
ning and control. For an un-decoupled PR, its output motion DOF is always coupled 
together, driven by one or multiple actuators. Therefore, its kinematic analysis and 
static analysis tend to be more difficult, so are the motion planning and control. 

For successful applications, the determination of a PR has to be depending on the 
specific task requirements, aiming to maximize its advantage aspects while mini- 
mizing disadvantage aspects. 


Design Methodology 
Robot Components 

A PR has its inherent modular feature as all limbs of a PR often have the same or similar 
architecture, constmcted by the same type of actuators, joints, and links. Therefore, PRs 
have excellent reconfigurability, which is identified as a key nature of the future 
manufacturing system. Figure 7 shows some common building blocks of parallel robots, 
including revolute and prismatic actuator modules; three types of passive joint modules 
(without actuators) including the rotary joint, the pivot joint, and the spherical joint; a set 
of links with various geometrical shapes and dimensions; and a mobile platform. Two 
exemplar PRs constmcted with these components are shown in Fig. 8. 


Type Synthesis 

Type synthesis, or topological synthesis, is the initial step in PR design, and it is the 
study of the nature of connection among the members of a mechanism and its 
mobility. It is concerned primarily with the fundamental relationships among the 
degrees of freedom, the number of links, the number of joints, and the type of 
joints used in a mechanism. Type synthesis usually only deals with the general 
functional characteristics of a mechanism and not with the physical dimensions 
of the links. This part of the work is difficult because it originates from the 
creativity of the designer and mostly depends on their intuitiveness, intelligence, 
and experiences. 

In the literature, most of the PMs have identical limbs to simplify the design and 
fabrication (Gosselin et al. 1996; Huang and Li 2002; Jin et al. 2006a; Tsai 1999, 2001) 
introduced an enumeration scheme to guide synthesis of PMs with 3-6 DOFs based on 
Gmbler criterion and loop mobility criterion. The Grubler criterion is as follows: 

F = X{n-j-\) + Y j n i=x fi CD 

where A is the motion parameter; for spatial mechanisms, 2 = 6; and, for planar and 
spherical mechanisms, A = 3. n represents the total number of links (including the 
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Fig. 7 Building components of parallel robots (a) Prismatic and revolute actuators (b) Passive 
joints (c) Links and connectors (d) Moving platform (Chen 2001) 


fixed link), j the total number of joints, and /■ the number of DOF of the zth joint. 
Note that the Grubler criterion is valid provided that the constraints imposed by the 
joints are independent of one another and do not introduce redundant degrees of 
freedom. The loop mobility criterion is as follows: 

Yl=f k=F+M (2) 

where CV is the connectivity of the Id h limb (the DOF associated with all the joints), 
m the total number of limbs, and / the total number of loops. A > C k > F ensure 
proper mobility and controllability of the end effector. Based on Eqs. 1 and 2, the 
kinematic architectures of PRs can be enumerated according to their nature of 
motion and DOF, as shown in Table 1. 
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Fig. 8 Planar and spatial parallel robots constructed by the modules in Fig. 7 


Table 1 Enumeration of 
nonredundant PRs 


F 

m 

^fi 

C h • • •? 

3 

2 

9 

5,4 

6, 3 

3 

15 

5,5,5 

6,5,4 

6, 6, 3 

4 

2 

10 

5, 5 

6,4 

3 

16 

6,5,5 

6, 6,4 

4 

22 

6, 6, 5, 5 

6, 6, 6, 4 

5 

2 

11 

6, 5 

3 

17 

6, 6, 5 

4 

23 

6, 6, 6, 5 

5 

29 

6, 6, 6, 6, 5 

6 

2 

12 

6, 6 

3 

18 

6, 6, 6 

4 

24 

6, 6, 6, 6 

5 

30 

6, 6, 6, 6, 6 

6 

36 

6, 6, 6, 6, 6, 6 


Although the enumeration provides a nice guideline of the required connectivity 
to achieve certain DOF, this approach cannot specify the specific output motion, e. 
g., 3-DOF translations. For synthesizing PRs with specified DOF, there are two 
major methods: constraint synthesis approach (Huang and Li 2003; Fang and Tsai 
2004; Kong and Gosselin 2004, 2007) based on screw theory and Lie subgroup 
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approach (Herve 1991; Li et al. 2004; Li and Herve 2010) based on the algebraic 
properties of a Lie group of the Euclidean displacement set. 

A screw is a mathematical notion which is modeled on a physical screw. It is a 
quantity associated with a line in the three-dimensional space and a scalar called 
pitch. According to Chasles’s theorem, any displacement of a rigid body can be 
effected by a single rotation about a unique axis combined with a unique translation 
parallel to that axis, so any displacement can be described as a “screw displace- 
ment,” where the unique rotation axis, which is also the translation direction, is 
referred to as the screw axis of the displacement. In this sense, a revolute joint can 
be represented by a unit screw of zero pitch pointing along the joint axis and a 
prismatic joint represented by a unit screw of infinite pitch along the direction of the 
linear joint axis. 

Screw theory is a way to study velocities and forces in a three-dimensional 
space, combining both rotational and translational components. When a screw is 
used, it is often called a twist. When a screw is used to describe the force system 
acting on a rigid body, it is called a wrench. If the work done by a wrench $1 acting 
on a rigid body having a twist $2 motion vanishes, it is said the two screws $1 and 
$2 are reciprocal. A screw system is a set of linear combination of independent 
screws. The order of a screw system is equal to the number of basis (independent) 
screws in the system. A d order screw system is associated with a reciprocal (6 -d) 
screw system. The twist system of a kinematic connection represents all motions 
permitted by that connection in a given position. The wrench system of the 
connection represents all constraints that can be transmitted by the connection. 
The twist system and the wrench system of the connection are uniquely related by 
reciprocity. The duality and the reciprocality of screws make it a suitable tool for 
synthesis and analysis of parallel manipulators. 

In screw-based method (Huang and Li 2003; Jin et al. 2009), the wrench system 
of PRs is firstly obtained according to mobility requirement of the end effector. 
Then the standard base of the mechanism twist system can be formulated and is 
treated as the standard base of the limb twist system. Subsequently, some other 
necessary twists are added to obtain the limb twist system. The process of adding 
other twists mainly depends on the experiences of the designer. Then, all the limbs 
are connected with the base and the mobile platform according to the geometrical 
conditions. Lastly, it requires checking if the PMs can work in finite motion as this 
approach is based on the instantaneous kinematics. 

In Lie subset method (Herve 1991), a set of operators is used to describe all 
possible finite displacements, and each kind of motion is represented by a set of 
operators. A mechanical bond is represented by the set of allowed displacements of 
one body with respect to the other. A serial arrangement of kinematic pairs becomes 
the composition of mechanical bonds, and a parallel arrangement of kinematic pairs 
is represented by the intersection of mechanical bonds. The first step in type 
synthesis is to get a mechanical generator of a limb according to the motion 
characteristics of the end effector. Second is to analyze the mechanical generator 
to obtain a feasible configuration for a limb. Third is to combine several limbs to 
form the PMs. And last, checking finite motion on the mechanism is required. 
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Similar to the screw-based method, the second step of this approach very much 
depends on the experiences of the designer. 

So far, a large number of types of PRs have been synthesized, but the advantages 
of various PR types have not been fully exploited. There is currently still lack of 
tools and metrics to effectively compare different PR architectures. 


Displacement Modeling 

Displacement modeling is to find the relationship between the position (posture) of 
the end effector and the displacements of the actuators. It is the basis of workspace 
analysis, dimensional optimization, and motion planning and control. There are two 
types of displacement models: (1) forward displacement model, which is to find the 
end-effector pose with given displacements of actuators, and (2) inverse displacement 
model, in which the displacements of actuators are to be found with given 
end-effector pose. For a serial manipulator, the forward displacement is straightfor- 
ward, but the inverse displacement is difficult due to multiple solutions. For the 
parallel manipulators, the inverse displacement is easy because each limb is much 
simpler than a serial manipulator. But the forward displacement is generally rather 
complicated as it requires solving higher-order nonlinear Equations. A geometrical 
method is commonly used to put down a vector loop equation f(X, q, p) = 0 for each 
loop while taking the geometrical constraints into account (Altuzarra et al. 2009). 
Then the passive variables p are to be removed to achieve an equation with only 
actuators’ inputs q and end effector outputs X. However, the resultant system of 
equations is nonlinear and often including sine-cosine polynomials. Therefore, it is 
hard to arrive at closed form solutions; at best a high-order univariate polynomial 
equation can be obtained and then solved by a numerical method, such as Newton- 
Raphson’s procedure, but only one solution can be obtained each time (Raghavan 
1993; Wang and Chen 1993). Some other researchers studied the forward kinematics 
of PMs with the help of additional sensors (Notash and Podhorodeski 1995; Chiu and 
Pemg 2001; Baron and Angeles 2000), by which the difficulty is significantly reduced 
so that the kinematic algorithm is suitable for real-time applications. 


Jacobian Analysis 


An important limitation of a parallel manipulator is that there may exist singular 
configurations in its workspace where the manipulator gains or loses one or more 
DOFs so that the output motion cannot be properly controlled. Jacobian analysis is 
an important section for checking the motion characteristics of the mechanism. The 
relationship of the input joint rates and the end-effector output velocity of a PR can 


be expressed as 


J x X J q Q 


where J x is called direct Jacobian matrix and J q is called inverse Jacobian matrix. 
Due to the existence of two Jacobian matrices J x and J q , a PR is said to be a singular 
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configuration when either J x , J q , or both are singular. According to (Gosselin and 
Angeles 1990), three different types of singularities are classified, i.e., inverse 
kinematic singularity (\J q \ = 0), forward kinematic singularity (L J x \ = 0), and 
combined singularity (L T x \ = \J q \ = 0). The inverse singularity occurs in the 
manipulator configuration when the moving platform loses one or more degrees 
of freedom instantaneously. The actuator velocities cannot be determined by the 
given moving platform velocity. In other words, the moving platform will be 
constrained by more wrenches instantaneously in the inverse singular configura- 
tion. Forward singularity occurs when the moving platform gains one or more 
degrees of freedom instantaneously. In this configuration, the moving platform 
can move even though all actuators are locked. 

Two common methods are often used to formulate the Jacobian matrix of a PR, i. 
e., vector loop equation approach and screw theory approach. In the first approach, a 
loop closure equation is built for each limb, and then the loop closure equation is 
differentiated to obtain the relationships between the input and output parameters. 
The input parameters are the rotation velocity or linear velocity of actuators, and the 
output parameters are the velocity of the end effector. In the second approach, one 
can always find a screw which is reciprocal to all the twists associated with the 
passive joints because the instantaneous twist of the moving platform is expressed as 
a linear combination of all the twists of a limb. Hence, the relationships between the 
input joint rates and the output velocity can be obtained by the orthogonal product of 
the screw and twists. Each row of the Jacobian matrix is in fact represented with a 
twist, and each matrix in essence forms a system of all twists. The singularity will 
occur when the system is not in its full row rank. Therefore, screw-based line 
geometry method is usually employed for analyzing singular configurations. For 
example, the forward singularity of a 3RPR planar PM (Jin et al. 2009; Bonev 
et al. 2003) occurs when the 3 -system of wrenches (//, f 2 \ f 2 ) of J x degenerates. 
Geometrically, the forward singular configurations occur if and only if the three lines 
are in parallel or intersect at one common point as shown in Fig. 9. With the three 
lines in parallel as shown in Fig. 9a, the moving platform will produce an infinites- 
imal translation along the line direction even though all actuators are locked. With the 
three lines intersection at a common point as shown in Fig. 9b, the moving platform 
can produce rotations about that point even though all actuators are locked. There- 
fore, the end-effector motion path should be purposely designed away from singular 
postures in motion planning and control. 


Workspace Analysis 

Workspace analysis is a challenging problem for PRs. The solution of this problem is 
critical in design and motion planning of the robot. As the complete workspace of a 
6-DOF PR is embedded in a 6-dimensional space, there is no simple way to visualize 
it in a human-readable way. Therefore, the workspace of PR is usually visualized in 
various 3D subspaces (Merlet 2000), such as the constant-orientation workspace, the 
reachable orientation workspace, and the reachable positional workspace. As the 
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Fig. 9 Forward singularities of a 3RPR planar PR 


major drawback of PRs is their limited workspace, workspace volume becomes an 
important performance index of PRs (Monsarrat and Gosselin 2003; Merlet 
et al. 1998; Gosselin 1990; Kumar 1992; Jin et al. 2011). Some researchers made 
use of geometrical methods to calculate the workspace of PMs (Merlet et al. 1998; 
Gosselin 1990). Other researchers studied the design for the desired workspace by 
optimization (Monsarrat and Gosselin 2003; Jin et al. 2011; Lou et al. 2004). 

In geometrical method, the reachable positional workspace of each limb is firstly 
obtained based on the geometrical relationships of joints and links of the limb. The 
workspace boundaries are then obtained by the intersection of the reachable 
workspace generated by each limb. With numerical method, a large workspace 
envelope through the extreme reach of each limb is firstly assumed, and then the 
workspace envelope will be parametrically and proportionally divided into a 
number of finite elements of the same volume. When the number of the finite 
elements is sufficiently large, each element can be kinematically represented by a 
point located inside the element. And inverse kinematics will then be used to check 
if the point is inside or outside the workspace by checking all constraints including 
joint angles and actuators’ displacements. And the workspace volume is converted 
into a simple summation of all volume elements in the workspace. 


Dimensional Synthesis 

A PR topology can only give its best performance when its geometrical parameters 
are properly assigned. Dimensional synthesis is used to find the optimal dimensions 
of every link (L) for achieving the desired performance through certain cost 
functions f(L), subject to a number of constraints. So the problem is often formu- 
lated as an optimization question as follows: 


59 Parallel Robots 


2103 


Min : f(L) subject to constraints. 

However, constructing the optimization model and solving the optimization 
problem are rather challenging because of the nonlinear function and often con- 
flicted design criteria (e.g., use of stiffness and weight, workspace, and/or accuracy) 
(Cardou et al. 2010; Merlet 2006). A number of performance metrics have been 
introduced for designing PRs, and most of them are related to dexterity/condition- 
ing or workspace volume of the end effector (Gosselin and Angeles 1991; Castelli 
et al. 2008; Jin et al. 2006b; Carbone et al. 2007). Gosselin and Angeles proposed 
the global conditioning index (GCI) for measuring the global dexterity of manip- 
ulators over their entire workspace in design optimization (Gosselin and Angeles 
1991). Liu and Gao (2000) studied optimum design of 3-DOF spherical parallel 
manipulators with respect to the conditioning and stiffness indices. Chablat and 
Wenger (2003) studied architecture optimization of the Orthoglide with prescribed 
kinetostatic performances in a prescribed workspace. Huang et al. (2005a) intro- 
duced dimension optimization for the Tri Variant PR with two performance indices, 
i.e., global and comprehensive conditioning indices. Li and Xu (2006) introduced a 
new optimization approach which utilized both the global dexterity index and space 
utility ratio for design a 3-PUU translational parallel mechanism. Pierrot 
et al. (2009) introduced optimal design of a 4-DOF parallel manipulator with the 
cost of links as an objective while keeping machine cycle time and dexterity as 
optimization constraints. Liu et al. (2007) designed a hybrid PR with large 
workspace/limb- stroke ratio, and a global conditioning index based on the mini- 
mum singular value of Jacobian is defined for dimension optimization. Ottaviano 
and Ceccarelli (2002) used specified workspace volume as the design objective for 
synthesizing the design parameters of the CaPaMan. Altuzarra et al. (201 1) studied 
dimensional synthesis using Pareto optimization with three design objectives, 
including workspace volume, dexterity, and energy consumption. The current 
trend is to incorporate practical dynamic performance indices in design optimiza- 
tion as dynamic performance is the key advantage of PRs comparing to serial robots 
(Huang et al. 2013). Apart from the objective function-based optimal design, 
another method based on the performance chart is emerging recently (Liu and 
Wang 2007; Liu 2006). In this method, the performance maps of various measures 
are drawn in the entire or desired workspace of PRs, so that the performance can be 
evaluated through visual analytics so as to achieve the global optimization. But the 
approach is limited by the number of parameters. 


Performance Evaluation, Calibration, and Control 
Static and Dynamic Analysis 


Static analysis ensures to model the structure deflection with regard to static 
loads due to gravity or forces introduced by task. Dynamic analysis allows 
computing the necessary actuator torques to realize a task with the kinematic 
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and/or force requirement. In this case, the robot structure is considered to be 
composed of rigid elements. To analyze the influence of flexibilities on the 
end-effector movement, static and dynamic models are mixed together. Thus, 
the proposed dynamic analysis takes into account the structure flexibility and the 
inertial forces. 

During a task, such as machining, process forces stress the structure and cause 
non-negligible tool pose defects (Pritschow et al. 2002; Week and Staimer 2002). 
Moreover, critical issue with PRs is their nonconstant behavior in their geometrical 
workspace. So, static and dynamic behavior causes different defects, depending on 
tool pose in the workspace (Chanal et al. 2006). Taking the PR in machining 
operations as an example, this section will introduce the static and dynamic 
analyses of PKMs (parallel kinematic machine tools). 

In order to simulate those pose defects, many authors propose different modeling 
strategies for PKM (Chanal et al. 2006; Cobet 2002; Wang et al. 2006; Rizk 
et al. 2006; Deblaise and Maurine 2006; Wang and Mills 2006; Bouzgarrou 
et al. 2002; Katz and Li 2004; Cosson et al. 2006). Static models ensure to compute 
tool pose defects induced by given static forces which are exerted on PKM structure 
(Chanal et al. 2006). They are based on parameters which model flexibility of PKM 
legs and joints. Flexible dynamic models allow simulating the influence of stiffness 
characteristics and natural frequencies of PKM structure on machining quality 
(Bouzgarrou et al. 2002). They take into account component weight and inertia of 
PKM structure. 

Static and dynamic models of PKM can be sorted into two main categories: 

• Local models: representation of PKM structure around a reference pose 

• Global models: representation of PKM structure and its movements 

Local and global terms are defined according to kinematic description used in 
the model. In the case of local model, results given by model simulation are valid 
for a given PKM pose. For global model, kinematic description ensures to modify 
PKM pose without developing a new model. 

Local Models 

Local models impose, by definition, to define a new model for each studied PKM 
pose. Thus, the time for conduct static or dynamic behavior analysis of PKM 
structure can be significant if several PKM poses must be studied. Most local 
modeling methods are based on structural transfer matrix or on finite element 
(FE) methods. 

(a) Structural Transfer Matrix 

The aim of this method is to determine a transfer function which can link load 
parameters to displacements of PKM components. For example, relation between 
cutting forces f (&>) and tool tip acceleration x(&>) with regard to part can be 
expressed as 
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where co represents the spindle rotational speed. 

Transfer function defined from matrix G(&>) is identified from experimental 
measures (Tounsi and Otho 2000). In case of PKM, the behavior can vary highly 
according to the tool pose; the transfer function is then different for each tool pose. 
The behavior modeling in all workspace leads to a large number of measures and to 
an extrapolation of every measured point. Thus, the use of this kind of models is not 
relevant for PKM (Week and Staimer 2002). 

(b) FE Methods 

FE methods have been widely used for designing or simulating mechanism or 
structure behavior since the 1980s. These methods are generally used for designing 
(Bouzgarrou et al. 2002) or computing stiffness (Deblaise and Maurine 2006) or 
natural frequencies (Cobet 2002) of PKM. FE modeling applied to PKM encom- 
passes two main kinds of models: models with volumic elements and models with 
beam elements. However, a model with a part discretized with FE can be consid- 
ered as a local model or a global model. 

The most accurate models seem to be those using volumic elements to represent 
the machine components as they rely on fewer hypotheses than the others (Wang 
and Mills 2006). The main problem of these models is their computation time for 
calculating stiffness for a given position of the machine. As an example, a complete 
FE model with volumic elements and nonlinear joint models needs several hours to 
determine the stiffness in a given direction for one pose of the mobile platform 
(Bouzgarrou et al. 2002). Moreover, the computation of the stiffness in another 
position requires remeshing the model. The development of a parameterized FE 
model like in Bouzgarrou’ s work (Bouzgarrou et al. 2002) ensures to avoid 
remeshing all parts during a PKM pose modification. Moreover, PKM is designed 
with a large number of joints associated with nonlinear static behavior (Huang and 
Lee 2001). Thus, complex elements have to be used for modeling joints behavior, 
which will increase computation time but allow accounting joints and legs flexibil- 
ity (Wang and Mills 2006). 

To conclude, significant computation and modeling time of this kind of FE 
models make them difficult to use during the process optimization. However, it 
allows determining accurate PKM structure stiffness in the design stage without the 
need of real tests. 

Global Models 

In order to determine PKM behavior in the entire workspace without having to 
develop multiple models, some researchers propose global models which take into 
account PKM kinematic behavior (Paccot et al. 2009a; Nabat et al. 2008; Ramdani 
et al. 2008; Bonnemains et al. 2013). These models can be grouped under the term 
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multibody models. Multibody systems are defined by Shabana as a set of bodies, 
rigid or flexible, linked by joints (Shabana 2005). For defining a dynamic multibody 
model of PKM, three formalisms can be used as follows (Dwivedy and Eberhard 
2006): 

• Lagrangian formulation. It describes the behavior of a dynamic system in terms 
of work and energy stored in the system. The Lagrange equations are used to 
compute dynamic equations. 

• Newton-Euler formulation. It describes the behavior of a dynamic system in 
terms of forces and moments acting on each link. 

• Hamilton principle. It ensures to link the kinetic and the potential energy. 

In case of PKM, Lagrangian formulation is generally preferred due to the ability 
of the method to take into account different kinds of mechanical parameters such as 
friction or flexibility (Bonnemains et al. 2013). However, in a control context, 
Newton-Euler algorithm is often employed due to its efficiency for real-time 
application (Paccot et al. 2009a). 

In case of machining simulation, the use of direct dynamic model ensures to 
express tool pose at each time. Indeed, this model defines active joints acceleration 
q with regard to active joint speed q, position q, motor torques or forces x, and 
exterior actions fext : 


Q = DDM(q, q,x, f ext ) 


However, most of the work about PR modeling defines inverse dynamic models 
(Book 1984; Bayo et al. 1988; Boyer and Coiffet 1996). Generally, the aim of 
inverse model is used to choose motors or to develop an adapted command strategy: 

t = IDM( q,q,q,f ext ) 

The main assumptions made on multibody models of PKM concern the compli- 
ances of the legs and the joints (Dwivedy and Eberhard 2006). 

(a) Rigid Body Model 

The first way to model a PKM is to consider the structure as bodies linked with 
joints. This hypothesis is generally used to define a new command strategy (Paccot 
et al. 2009a), to compute dynamic machine capacity, to optimize dynamic behavior 
of PKM (Nabat et al. 2008), or to dimension PKM elements with regard to the 
application (Ramdani et al. 2008). The rigid body model can also be used for 
identifying joints preloads and viscous friction, reduction ratio and mass, and 
moment of inertia of each solid composing PKM structure (Bonnemains 
et al. 2013). 
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Fig. 10 Exechon parallel 
unit parametrization 
(Bonnemains et al. 2013) 



For defining the dynamic model, the mechanism has to be parameterized. 
Indeed, the Jacobian matrices of each part of the mechanism have to be computed. 
The Jacobian matrix gives the relation between the end-effector operational veloc- 
ity and the articular velocity: 



where p v is the end-effector operational velocity, p (o is the end-effector operational 
rotational velocity, q is the articular velocity, p i h is the Jacobian matrix, h is the 
reference point for the end-effector velocity, and p is the frame where the velocity is 
expressed. 

The main difficulty with PKMs is the choice of the parameters in the array q. It is 
often simpler to consider more parameters than only those of the motorized joints. 
Some constraint equations have to be written for modeling the mechanism kine- 
matic behavior (Bouzgarrou et al. 2002). 

Thus, for modeling the parallel unit of Exechon PKM, Bonnemains introduces 
12 parameters with only 3 independent (Bonnemains et al. 2013; Fig. 10). Indeed, if 
only three parameters (the leg lengths) are kept, the equations of motion are too 
large for any further computation. Finally, motion equations are deduced from 
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Lagrange’s equations. However, this method does not ensure to take into account 
structure flexibility. 

(b) Model with Flexible Legs or Joints 


Authors often choose to neglect the stiffness of the joints with regard to the 
stiffness of the legs or vice versa (Chanal et al. 2006; Cobet 2002; Bouzgarrou 
et al. 2002; Katz and Li 2004; Cosson et al. 2006). Cobet and Bouzgarrou consid- 
ered in their studies that legs are rigid and flexibilities are located in joints to 
perform a dynamic analysis or natural frequencies computation of a PKM (Cobet 
2002; Bouzgarrou et al. 2002). Chanal, Katz, and Cosson considered that joints are 
perfect and that only legs can warp (Chanal et al. 2006; Katz and Li 2004; Cosson 
et al. 2006). Legs are here modeled by Euler-Bernoulli or Timoshenko beams. To 
improve the accuracy of such model, flexible parts can be modeled with FE method 
by using volumic elements (Bouzgarrou et al. 2002). 


(c) Model with Flexible Legs and Joints 


Recent studies have shown that both joints and legs compliances have an impact 
on tool pose defects (Shabana 2005; Dwivedy and Eberhard 2006; Majou 
et al. 2007). Majou proposed a model considering legs and joints compliances 
(Majou et al. 2007). Virtual joints are added with a given stiffness in order to 
represent strains of the joints or elongation of the legs. 

Bonnemains proposed on his work a generic model of PKM by computing strain 
energy E D of a PKM structure consisting both joint and link compliances 
(Bonnemains et al. 2013): 



where q m is leg m length and N m , Mf xm , M fym , and M tm are, respectively, the 
compression force, bending moments around y m and z m , and torsional moment in 
leg m. f p is the effort transmitted by joint p and K p is the stiffness matrix of joint 
p. S m , I Gxm , I Gym, and Iom are, respectively, the leg section, the moments of inertia 
about x m and y m axes, and the polar moment of inertia of leg m. E and G are 
Young’s and shear moduli of the material. The most time-consuming step in the 
computing of the E D is the determination of the efforts N m , Mf ym , Mf zm , M tm , and f p . 
Their expressions rely on the kinematic model of the given architecture. The 
displacements of the mobile platform for a given effort F e applied on it at O e can 
then be computed using Castigliano ’ s theorem: 



dE D 

dF e 


where S e is the displacement in the direction of the applied effort F e . 
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To determine the displacement in a different direction than the applied effort, a 
fiction effort can be applied to the platform in the desired direction. The strain 
energy is then differentiated with respect to this fictitious effort, and this effort is 
finally set to zero in order to obtain the displacement in the desired direction. 

Modeling Method 

Many authors propose models of PKM in order to determine their stiffness in a 
design point of view (Bouzgarrou et al. 2002) (Pashkevich et al. 2009). Their goal is 
only to optimize the architecture design. However, according to the simulation goal 
and studied machining operation, it can be relevant to develop a model of PKM 
adaptable to different given architectures and compact enough to determine quickly 
(less than 1 h) the tool displacements for a given machining operation (efforts 
known along a given tool path). The interest of this model arises after the design 
stage; this can be an element of machining simulation like the detection of collision 
before the real machining of the part. 

So, FE models with beam elements or multibody models can be good answers to 
this problem. Indeed, they are easily adaptable to different architectures, and they 
can be computed in a lower computation time than FE models with volumic 
elements. Nevertheless, it seems necessary to simplify again legs’ geometry by 
discretizing them. Using material parameter description in a multibody model does 
not require longer computation times or give more accurate results (Bonnemains 
et al. 2013). 


Calibration 

The description of PKM mechanical behavior implies the definition of geometrical, 
static, and dynamic models. Thus, geometrical, static, and dynamic parameters 
have to be identified. In the following paragraph, calibration methods are intro- 
duced for these tree kinds of parameters. 

Geometrical Calibration 

The geometrical calibration or identification of a PKM consists of determining the 
best geometrical model which describes the mechanism. Three principal steps have 
to be verified during PKM calibration (Renaud et al. 2006): 

• Geometrical modeling of the mechanism 

• Parameter identification method based on the optimization of a dedicated cost 
function 

• Choice of the experimentation approach and associated measurement method 

(i) PKM Modeling and Definition of Identified Parameters 

The geometrical model dedicated to calibration is generally the inverse kine- 
matics model (Week and Staimer 2002; Fan et al. 2003). For PKM, the inverse 
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Fig. 1 1 Definition of the inverse kinematics model 


kinematics model (IKM) expresses motor displacements with regard to end-effector 
posture and geometrical dimensions of joints and links (Bi and Jin 2011; Fig. 11). 

(ii) Cost Function Definition 

The geometrical parameters are computed from a comparison of experimental 
measurements with theoretical estimation of the same machine pose (Renaud 
et al. 2006). In the general case, the geometrical parameters are identified by 
minimizing a cost function which compares these two estimations. This cost 
function depends on the means used to obtain measurement redundancy. 

In the case of external measurements, Renaud states that the machine should be 
compared to the machine coordinates system in order to decrease the influence of 
measurement noise (Renaud et al. 2006). Indeed, the cost function can be generally 
expressed analytically. 

In the case of PKM, IKM computes the motor positions V in the machine 
coordinates system as a function of tool position and orientation X in the Cartesian 
coordinates system and the identified geometrical parameters 

V = IKM(X£) 

To compare machine poses in the machine coordinates system, the theoretical 
motor positions Vi = IKM(X di , § d ) have to be compared with the real position V mi 
= 7KM(X mi , §) computed from the tool pose measurements (Fig. 12). The theoret- 
ical motor positions Vi are computed for given nominal geometrical parameter 
values § d . The identification of the geometrical parameters £, is successful if V mi is 
equal to Vi. 

These geometrical parameters § are generally identified by using a least square 
function (Khalil and Dombre 2004): 
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where X mi is the measured tool pose in the Cartesian coordinates system, X di is the 
theoretical tool pose, and n is the number of measured poses. The identified 
geometrical parameters § are the parameters which minimize the cost function. 


(iii) External Measurement Methods 


Several experimental methods exist to identify PKM geometrical parameters. 
Some of them use a one-off measurement performed by measuring a machined 
part by using external measuring equipment (Pritschow et al. 2002; Week 
and Staimer 2002; Terrier et al. 2004; Song et al. 1999). Others are carried out 
with integrated measuring equipment inside the PKM structure (Pritschow 
et al. 2002; Patel and Ehmann 2000). These later methods ensure to control 
geometric machine defect during machining in all the workspace. However, this 
kind of methods seems to be less accurate if measured joints are far away from the 
effector and can be difficult to realize according to the joint nature (Renaud 
et al. 2006; Merlet 2002). In the following, the external measurement methods 
are discussed. 

A first external measurement method consists of adding mechanical constraints 
to the tool movement in order to measure tool pose errors. For example, 
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Fig. 13 3D ball-bar 
(Martinez and Collado 2004) 



Week (2002) designs a redundant leg consisting of a linear guide way to connect 
two ball bearings which are attached to the rotary table and an HSK interface to the 
spindle. This mechanism is like the solution introduced by Patel, Chen, and 
Martinez (Patel and Ehmann 2000; Chen and Hsu 2004; Martinez and Collado 
2004; Fig. 13). Huang and Neugebauer have developed a specific calibration 
method for PKM based on the measure of parallelism, straightness, and orthogo- 
nality of virtual axis of machining coordinate system (Huang et al. 2005b; 
Neugebauer et al. 1999). A probe is put on the machine spindle in order to “scan” 
standard parts. To limit the spindle movement of the machine tool, Bleicher has 
developed a serial mechanical system which ensures to measure tool pose defect in 
all the workspace (Bleicher and Gunther 2004). 

A second method consists of using exteroceptive sensors. Song uses a 5D laser 
interferometer system which simultaneously provides five measurements (three 
linear displacements and two rotational displacements) (Song et al. 1999). These 
methods measure only some of the tool pose defects. However, the relation between 
the tool pose defects and the machined surface quality is not direct (Chanal 
et al. 2006), and the quality of the machined part is therefore not guaranteed, 
even if calibration is well performed. 

Thus, the measurement of a machined part is relevant. Pritschow designed a 
specific part which is machined on the PKM to be calibrated (Pritschow et al. 2002). 
The part is milled with a ball end mill with changing spindle orientation (Fig. 14). 
Measurement of the ball cups provided the positions of the ball end mill centers. 
However, ball cups do not represent usual machined surfaces. Indeed during 
machining, several continuous surfaces are generally created by the motion of a 
tool along a trajectory. So it is relevant to use a machined part which allows the real 
tool path to be compared with the programmed one as in Chanal’ s work (Chanal 
et al. 2007; Fig. 15). 

Standard identification methods ensure that tool pose defects are minimized, 
which then may enable the part to be machined with the required quality. However, 
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Fig. 14 Pritschow’s part for calibration (Pritschow et al. 2002) 
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Fig. 15 Chanal’s part for calibration (Chanal et al. 2007) 


usual methods need a certain level of mastering to implement and can take a long 
time. In a machining context, a method which identifies the geometrical parameters 
in a part of the workspace for a given part shape may increase the accuracy and 
decrease the cost of identification by minimizing experimentation time. 
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Fig. 16 Measurement means for static calibration 


Static Calibration 

Static calibration allows identifying flexibility parameters of PKM structure. Thus, 
the experimental procedure has to ensure the measurement of the structure defor- 
mation with regard to an effort applied on the PKM end effector. 

To identify the static behavior of a Tricept, Robin loads the structure with known 
masses and measures induced tool pose movement with a comparator (Robin 
et al. 2007). However, with this method, the structure can only be loaded in one 
direction; thus, Bonnemains uses a stroke combined with a LVDT and force sensors 
(Bonnemains et al. 2013; Fig. 16). 

In the case of PKM, the observed stiffness depends on the tool pose in the 
workspace and can highly vary. For example, Bonnemains observes an increase of 
the stiffness with a factor of 3 between the center and edge of workspace. A 
nonlinear stiffness behavior can appear due to the influence of bearing stiffness 
with regard to the legs (Fig. 17). 

To conclude, static calibration should be achieved with regard to the task load in 
order to ensure a local linearization of the stiffness model. Thus, the numerical 
minimization can be expressed as a linear problem. 


Dynamic Calibration 

The accuracy of a dynamic behavior simulation is linked on the accuracy of the 
parameters that describe the dynamic model (Khalil and Dombre 2004). The 
dynamic parameters of a PKM are generally inertial parameters, friction 
parameters, and preloads (Bonnemains et al. 2013). The identification 
procedure requires the joint torque measurement during the execution of a tool 
path. The followed tool paths should be sufficiently exciting in order to obtain a 
unit conditioning of the minimization problem (Khalil and Dombre 2004). In fact, 
tool paths used for dynamic identification are composed of (Bonnemains 
et al. 2013): 
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Fig. 17 Stiffness measure of Tripteor PKM 


• Several acceleration and speed. Low acceleration and speed are used in order to 
identify friction and preload parameters. Various high acceleration and speed 
ensure to identify inertial parameters. 

• Movements throughout the task workspace in order to deal with the nonconstant 
behavior of PKM. 

Figure 18 illustrates a tool path used for dynamic calibration. In this case, joint 
torque measurements are applied when the PKM follows the tool path in low speed and 
high speed respectively compared to the speed required for the machining operation. 


Control 

The control of a machine is the last stage before realizing the task. In fact, the aim of 
the control is to ensure the best accuracy with regard to the task requirement in 
terms of static and dynamic errors. Moreover, the control strategy should ensure 
perturbations reject. Therefore, lots of control schemes have been developed from 
simple linear control to complex robust and adaptive one. The following sections 
deal with control strategies used for parallel kinematic machines (PKM). 
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Fig. 18 Example of tool path 
used for dynamic calibration 
(Bonnemains et al. 2013) 



Linear Control 

Most of the industrial machine motion controllers are based on linear single-axis 
control. There are two kinds of controls strategies: classical speed control and 
torque control. 

(a) Joint Speed Control 

The joint speed linear control is the most developed control strategy in an 
industrial context because of a simple architecture implementation and well- 
known tuning (Franck et al. 2004; Kelly et al. 2005; Khalil and Dombre 2002; 
Spong et al. 2006; Sun et al. 2007; Tournier 2010). The control scheme is based on 
the proportional-integral (PI) joint speed controller and proportional (P) position 
loop (see Fig. 19). Hence, the speed control allows compensating load variations, 
and the position loop deals with the dynamic answer (i.e., settling time). However, 
such a control strategy is not clearly relevant when dynamic accuracy is required 
especially when the dynamic solicitations are great. Indeed, the speed controller 
imposes the joint behavior during transitional period with relatively slow time 
response. Moreover, the classical tuning rules do not take into account the dynamic 
behavior of the mechanical structure under task solicitations. 

(b) Joint Torque Control 

An improvement of the dynamic accuracy during transitional period comes 
from using a linear proportional-derivative (PD) torque control, sometimes called 
linear computed torque control (Franck et al. 2004; Kelly et al. 2005; Khalil and 
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Fig. 19 Joint speed control 



Fig. 20 Linear single-axis torque control 


Dombre 2002; Spong et al. 2006). The control scheme is based on a PD position 
loop imposing each actuator torque (see Fig. 20). The tuning of the PD controller is 
linked to the simplified dynamic of the mechanical structure: 

K p = Ja> 1 2 d 
K d = 2 jQco d -f 

where K p is the proportional gain, K d is the derivative gain, J is the inertia matrix 
considered as diagonal, co d is the loop desired undamped natural frequency, g is the 
desired damping ratio, and / is the viscous friction vector. co d and g are chosen to 
impose a damped transitional response with fixed five percent settling time: 

1 

5 

tr 5 % 




By using an adequate speed and acceleration feedforward Jq d +fq d , the error 
signal e has a second-order exponential decreasing: 

e + 2 <;co d e + m 2 d = 0 

The above assumption is verified when dynamic solicitations allow neglecting 
nonlinearity of the dynamic behavior, such as dynamic coupling between legs or 
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Fig. 21 Joint space computed torque control 

Coriolis and centrifugal forces. The static accuracy is thus assured in the joint space 
by the control loop and in the task space by geometrical identification. Dynamic 
accuracy is assured by controller tuning and dynamic identification of J and /. In 
this case such a control scheme could be relevant in an industrial context because of 
simple implementation, simple tuning, and relevant accuracy. 

However, the disturbance on torque input such as gravity, dry friction, and task 
solicitations (e.g., such as cutting forces) can degrade the accuracy. One may find some 
solutions in gravity and friction compensation (Franck et al. 2004) or identification 
under task solicitations (Bonnemains et al. 2009). To improve accuracy performance, 
specific tuning can be employed such as fuzzy logic or neural network (Sun et al. 2007; 
Bingiil and Karahan 2011; Carmona Rodriguez et al. 2012; D’Emilia et al. 2007). 

Nevertheless, due to the nonlinearity of the dynamic behavior in the PKM case, 
linear control is not always sufficient to ensure a constant accuracy in the whole 

o 

workspace (Astrom and Hagglund 2001; Paccot et al. 2009b). To solve this 
problem, nonlinear control can be envisaged. 

Nonlinear Control 

In the machine tool or robot case, nonlinear control consists in employing a suitable 
model of the machine structure behavior in the control loop. The motion controller 
could be based on linear PID controller coupled with predictive, adaptive, or 
intelligent control strategies. 

(a) Computed Torque Control 

When the dynamic behavior of the mechanical structure cannot be considered as 
linear under the task dynamic solicitations, a classical solution is the well-known 
computed torque control (CTC) (Luh et al. 1980). The control schemes include the 
inverse dynamic model of the machine to compute the joint torque reference (see 
Fig. 21). This allows linearizing the dynamic behavior of the structure; hence, the 
control signal w ctrl is 


Mctrl 


q 


Therefore, the controlled system (IDM and joints) can be seen as a linear double 
integrator. Moreover, the tuning of the PD controller is quite simple: 
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Fig. 22 Joint space computed torque control with predictive controller 


K P = (o 2 d 

K d = 2t;m d 


By using an adequate acceleration feedforward q d , the error signal has a second- 
order exponential decreasing: 


e + 2 qcode + co 2 d = 0 

Nevertheless, this assumption can be verified when dynamic modeling and 
sensing errors are minimized with a relevant dynamic identification process (Khalil 
and Dombre 2002; Lammerts 1993; Paccot et al. 2007). Furthermore, the external 
perturbing forces (such as load variation or cutting forces) could be taken into 
account to increase the control accuracy and stability. 

(b) Predictive Control 

When the classical CTC cannot deal with modeling errors and perturbing forces, 
strategies likes model predictive control (MPC) or generalized predictive control 
(GPC) can be employed (Belda and Bohm 2006; Belda et al. 2003; Clarke 
et al. 1987; Company et al. 2003; Cuvillon et al. 2005; Ginhoux et al. 2005; Richalet 
1993; Vivas and Poignet 2005). These methods are based on a minimization of the 
predicted error with regard to the desired path on a finished horizon. The definition 
of the cost function is generally made from the dynamic modeling under classical 
statistic errors and an observation of the real robot behavior. The minimization 
leads to a modification of the control signal (see Fig. 22). Such a control strategy 
allows for reducing external perturbation impact on accuracy and transitional 
period by computing an adapted control signal. However, the modeling of the 
mechanical structure is fixed and still subjected to modeling errors. Therefore, the 
robustness to this kind of perturbations is not clearly established. 

(c) Adaptive Control 

Another way to improve the CTC robustness is the adaptive control (AC) 
associated with a robust control strategy (Lammerts 1993; Fernandez 2004; 


2120 


Y. Jin et al. 



Fig. 23 Joint space computed torque control with adaptive controller 


Honegger et al. 2000). The general control scheme is based on a modification of the 
model parameters (or sometimes controller tuning) (see Fig. 23). The model 
parameters come from the minimization of the error between modeled and real 
behavior. Therefore, the dynamic model of the structure can change thus improving 
the control robustness to modeling errors. 

(d) Intelligent Control 

Last but not least, intelligent control such as fuzzy logic or neural networks can 
also be employed to robustify the CTC behavior (Dorato et al. 1992; Oh et al. 2004; 
Song et al. 2005). The controller is based on the dynamic modeling of the structure 
and trained on several trajectories. The control action leads to a modification of the 
control signal or model parameters. 

Joint Space Control Versus Cartesian Space Control 

In the PKM case, the concern in the space control is primordial (Paccot et al. 2009b; 
Beji et al. 1998; Bruyninckx 1999; Kim et al. 2005). Hence, because of the duality 
between serial and parallel kinematic machine, there is an ambiguity on using joint 
space control. Indeed, contrary to SKM, ensuring the tracking in the joint space 
does not ensure the tracking in the Cartesian space. Moreover, by controlling each 
joint separately, the disparate tracking errors can lead to internal torques, especially 
in an overconstrained structure case. However, most of the works on PKM control 
in the literature reuse the joint space control theory leading to inefficient use of 
these structures (Merlet 2002; Paccot et al. 2008). To improve the structure 
behavior control, Cartesian space control associated with Cartesian models should 
be employed (Paccot et al. 2009b; Beji et al. 1998; Paccot et al. 2008; Khalil and 
Ibrahim 2007). 

Nevertheless, by using, a Cartesian space control, one should resolve the 
end-effector pose perception. A first classical solution is the use of the forward 
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Fig. 24 Cartesian space computed torque control with forward kinematics 



Fig. 25 Cartesian space computed torque control with exteroceptive sensor 


kinematic model (FKM) in the feedback loop (Fig. 24). Indeed, the FKM is not 
trivial to compute and can degrade the expected performance of a Cartesian space 
control (Lou et al. 2004). Therefore, a direct measure of the end-effector pose with 
an exteroceptive sensor is a relevant solution (Fig. 25). The use of 3D laser tracker, 
computer vision, high-speed vision, or passive structure could be envisaged (Paccot 
et al. 2008; Dallej et al. 2006a, b; Corbel et al. 2008). In these cases, the perception 
of the end-effector motion under the task solicitations is accurately captured and 
thus allows for the improvement of the task quality. 

In this section, the focus is on the main control strategies used for PKM. To 
improve the task achievement, several points should be taken into account: influ- 
ence of dynamic solicitation of the task on the mechanical structure, dynamic 
behavior of the machine, and specificities of the PKM. The future direction could 
come from a control strategy which presents an adaptation of the control architec- 
ture, the controller settings, and the used models with regard to the structure 
behavior under the task solicitations. This online adaptation requires a pertinent 
sensing method. The fusion of fast proprioceptive sensors (active and passive joint 
variables) and accurate exteroceptive ones could be an efficient solution. 
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Summary and Future Directions 

Parallel robots are emerging technology which recently receives more and more 
attention. Although there are several successful examples, there is still a long way 
to discover the full potential of each type of PR for certain applications. In this 
sense, the design has to be task specific, to maximize the theoretical advantages 
while minimizing the disadvantages. Numerous PR prototypes have been built in 
laboratory in the last three decades, but most of them cannot offer the expected 
performance. In fact, making a workable PR is a system engineering process, which 
involves synthesis, component design/selection, analysis, optimization, control, 
and performance evaluation. However, such a system engineering process is not 
easy, and there is no effective tool yet to support it. Therefore, a digital tool is 
urgently required to conduct the design process systematically in a virtual environ- 
ment, so as to combine the multidisciplinary knowledge together and minimize the 
cost of prototyping. 

Regarding the current trend in PR research, hybrid parallel robots received lots 
of attention to combine the advantages of both serial and parallel robots. Micro PRs 
using compliant joints are also a hot topic, targeting at some high-precision 
microlevel applications, such as fiber alignment. Low-mobility PRs have received 
continuous investigation, but there are still a number of unsolved challenges, such 
as assurance of dynamic performance. PRs have been identified as one of the key 
smart devices for the next manufacturing system; therefore, the integration and 
application methodologies are also required to be explored. 
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Abstract 

Modularity is an important design concept in engineering to cope with complex 
systems. For robots used in the industrial environment, the complexity resides in 
the robot system as well as the tasks given to the robot. This chapter presents an 
up-to-date development in modular reconfigurable robots for the industry based 
on modular design principles. The scopes of the chapter cover the definition and 
classifications of modular robots; past and present research efforts in modular 
reconfigurable robots for the industry; basic modular design method including 
mechanical and interface issues; modular robot representation schemes for 
classifications and modeling; automatic model-generation techniques, kinemat- 
ics, dynamics, and calibration; task-based configuration optimization; modular 
robot software; and a demonstration workcell based on reconfigurable modular 
robot for adaptability. In the concluding section, future perspective of modular 
robots for industrial applications is discussed. 


Introduction 

Modularity in engineering design refers to a compartmentalization of functional 
elements. In a complex system, modularity helps reducing the complexity by 
dividing the system into pieces of specific functions in order to understand the 
relationships and to facilitate interaction among the basic elements. Modularity also 
allows the replacing of elements either for repair or upgrading new functionality. 
The alternative to a modular approach is an integrated approach where systems are 
designed as whole. While integrated approaches tend not to be as easy to repair, 
upgrade, or reconfigure, they have fewer constraints on element design and, there- 
fore, can be made optimal, for example, low cost and high performance, etc. 

A modular robot refers to a robotic system equipped with elements and modules 
that possess the necessary basic functions of robots that can be put together and 
reconfigured as an integrated system to carry out robotic tasks. The rationale for 
pursuing modularity in robot design is to contain the complexity of designing a 
robot for tasks with high sophistication and variety by offering recombination and 
reorganization of standardized robotic modules while keeping the maintenance of 
the robotic system simple and easy. As robotic modules possess more and more 
electronic and computational elements with very short life cycle, upgradability of 
the functional robot units also becomes a critical issue to be addressed in modern 
robot designs. 

There are two schools of thought on the modularity of a robot system: one is 
reconfigurable modular manipulators with a finite set of modules of different 
functions and the other is self-reconfigurable modular robots with uniform type of 
modules. A reconfigurable modular manipulator is a modularly designed robotic 
system, which adopts a common slot- and bus-modularity design approach (Ulrich 
1995) for its internal structure and architecture, not the external configuration. Such 
a robot may have a unified and integrated configuration that cannot be changed 
from the outside. A self-reconfigurable modular robot is a system that adopts 
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a bus- and sectional-modularity design approach (Ulrich 1995) for both internal 
structure and external configurations. The users can reconfigure the compartmen- 
talization and interchange functional modules like toy building blocks. 

The reconfigurable modular manipulator type of robotic systems were natural 
evolution of industrial robot manipulators that consist of a number of specific 
functional modules, such as actuator modules, link modules, end-effector modules. 
Subsequently, robots with the serial and branching topology, such as humanoid 
robots, legged robots, and mobile manipulators, adopt similar approach for modu- 
larity, as these functional modules indeed form the basis of a robotic system. The 
self-reconfigurable modular robots grew out of the concept of self-evolution and 
self-configuration of biological cells with identical units. Such a robot normally 
consists of a large number of identical mechatronic units that possess actuation, 
connection, communication, and computing capability that can be assembled 
together in any arbitrary form and configuration by itself. Although the two types 
of modular robots are originated from different fundamental thoughts, the goals to 
provide a large number of possible robot configurations for different tasks with the 
same set of basic robot modules are the same. 

In this chapter, the focus is on the design, modeling, and implementation of 
reconfigurable modular manipulators as the concept of such modular robot systems 
has been accepted by the industry in creating intelligent and adaptive manufacturing 
systems for the fluidic and fast-changing business and manufacturing environments. 


Past Efforts in Reconfigurable Modular Manipulators 

In the modularization of industrial robots, the granularity of the components is 
usually based on their basic functions, i.e., motion actuation and tooling. Thus, the 
design of modules is highly differentiated into actuator modules, passive joint 
modules, and tooling modules, etc. Several prototype modular robotic systems 
have been built and demonstrated, including the Reconfigurable Modular Manipu- 
lator System (RMMS) (Paredis et al. 1996), several generations of the cellular 
robotic system (CEBOT) (Fukuda and Nakagawa 1988), and modular manipulator 
systems developed by the University of Toronto (Cohen et al. 1992), the University 
of Stuttgart (Wurst 1986), the University of Texas at Austin (Tesar and Butler 
1989), Toshiba Corp. (Matsumaru 1995), more recent iMobot (Barobo 2013), and 
Johns Hopkins University (Kutzer et al. 2010). Basically, these systems have serial- 
typed (or open-chain) geometry with large working envelopes. These serial-typed 
modular robots are suitable for assembly, trajectory tracking, welding, and hazard- 
ous material handling. Parallel modular robots are also developed for light- 
machining tasks (Yang et al. 2001). As indicated in (Yang et al. 2001), modular 
design can reduce the development cycle of the parallel robot significantly. 
Furthermore, it allows a trial- and-error approach to construct a parallel robot that 
is impossible with the integrated design approach. 

Applications of modular systems have been proposed in rapid deployable 
robot systems for hazardous material handling (Paredis and Khosla 1995), 
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in space -stationed autonomous systems (Ambrose 1995), and in manufacturing sys- 
tems (Chen 2000, 2001). There are a few modular industrial robot arms commercially 
available such as the Universal Robots (Universal 2013), the Schunk Powerball Robot 
(Schunk Modular 2013), the highly dexterous WAM Arm from Barrett Technology 
(Barrett 2013), the JACO robot arm developed by Kinova (Kinova Robotics-The 
JACO robot arm 2013), and the Robotnik Modular Robot Arm (Robotnik 2013). The 
modular robot concept also proliferated in the hobby and educational robot sectors 
around 2000 by introducing well-packaged self-contained servo motor modules into 
inexpensive robotic devices, such as Robotis (Korea) and Kondo (Japan), as well as 
Lego (Denmark) and other toy companies making educational robots. 

Mobile robots with legs, wheels, and tracks also move into modularity to configure 
for different task requirements as used in disaster relief, rescue, and surveillance 
purposes. Two tracked modular mobile robots designed with multiple track segments 
(Wang et al. 2010) and reconfigurable tracks allowing serial and parallel connections 
(Li et al. 2009) are demonstrated. The work in (Moubarak and Ben-Tzvi 2012) 
contains an in-depth review of the development in modular mobile robots. 


Modular Robot Design 
Module Design Issues 

A modular robot would consist of the two main features founded in a modular 
product: (1) a one-to-one mapping from functional elements to the physical com- 
ponents of the product and (2) decoupled interfaces between the components of 
different modules (Ulrich 1995). For modular manipulators, the essential compo- 
nents are the base, regional, and orienting mechanisms composed by actuator 
modules and link modules of different dimensions and geometry and the 
end-effector module. For legged and wheeled mobile robotic systems, the motion 
generation mechanism modules are essential. 

The actuator modules normally adopts DC or AC motors as 1-DOF “rotate” or 
“pivot” joint module with compact high reduction ratio transmission mechanisms 
(Paredis et al. 1996; Cohen et al. 1992; Tesar and Butler 1989; Matsumaru 1995; 
Chen 2001). Some modular systems also adopt 1-DOF linear modules for large 
motion envelope (Cohen et al. 1992; Chen 2001) and 2-DOF joint modules for 
compact dexterous motion generation (Schunk Modular 2013). The actuator mod- 
ules are designed with similar geometry but of different dimensions and power 
ratings for different application requirements. 

The link modules connecting units in between the actuators function as reach- 
able workspace extenders. Some systems adopt a standard fixed-dimension con- 
nection module (Fukuda and Nakagawa 1988; Cohen et al. 1992; Tesar and Butler 
1989), and some are variable dimension modules that can be customarily fabricated 
due to additional design constraints and requirements (Chen 2001). In some system 
(Paredis et al. 1996), the link module becomes part of the actuator modules so that 
the module acts as an actuator as well as the connecting structure. 
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Interface Design Issues 

The mechanical connecting interface in between the modules in a modular manip- 
ulator needs to meet the basic requirements of (1) stiffness, (2) fast reconfiguration, 
and (3) interchangeability. Thus, the design of the mechanical connection or 
docking mechanism for the robot modules becomes a critical issue. In a fully or 
semi-supervised robotic system, like all the modular manipulator systems (Paredis 
et al. 1996; Fukuda and Nakagawa 1988; Cohen et al. 1992; Wurst 1986; Tesar and 
Butler 1989; Matsumaru 1995; Yang et al. 2001; Chen 2001), the connecting 
mechanism is designed to be manually operated for reliability and safety reasons. 
In a fully autonomous system, the connecting mechanism needs to be designed with 
an extra actuator and the locking mechanism for carrying out the connection 
automatically. This is the case for most of the self-reconfigurable modular robots. 

In order to meet the requirement of commonality on control and power with 
changing robot configurations, the electronic and communication interface for the 
modular system normally adopts communication network architecture with plug- 
and-play capability similar to Local Area Network (LAN). There are a number of 
existing industrial standard network protocols for real-time robot control suitable 
for such applications, like CAN-bus and IEEE 1394. The progressive development 
of industrial automation protocols will facilitate the implementation of modular 
robot communications. 


Modular Robot Representation 

A graph-based technique, termed the kinematic graph , is introduced to represent the 
module-assembly sequence and robot geometry. In this graph, a node represents a 
connected joint module and an edge represents a connected link module. Modules 
attached to or detached from the robot can be indicated by adding or removing 
nodes or edges from the graph. The realization of this graph is through an Assembly 
Incidence Matrix (AIM) (Chen 1994; Chen and Burdick 1998). A modular robot 
can be conceived according to the given AIM without knowing the other parame- 
ters, such as joint angles and initial positions. Here, we assume the generic structure 
of a modular robot is branch type. The serial-type modular robot is a special case of 
the branch- type structure. 


Module Representation 

To make the automatic model-generation algorithms work on a variety of module 
components, we introduce a conceptual set of modules whose features are extracted 
from those of real implementations. The modular systems developed to date have 
several common mechanical and structural features: (1) only 1-DOF revolute and 
1-DOF prismatic joints, (2) symmetric link geometries for interchangeability, and 
(3) multiple connection ports on a link. 
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Fig. 1 Modular robot components 

Joint Modules 

A modular robot joint module is an “active” joint, which allows the generation of a 
prescribed motion between connected links. Two types of joint modules, the 
revolute joints (rotary motion) and the prismatic joints (linear or translational 
motion), are considered. Rotary and linear actuators must reside in the modules 
to produce the required motions and maintain the modularity of the system. Multi- 
DOF motions can be synthesized with several 1-DOF joints. Joint modules are 
attached to link modules through standardized connecting interfaces for mechani- 
cal, power, and control connections. 

Link Modules 

The place on a link module where the joint is connected is called a connecting port. 
Without loss of generality, we assume that a link module is capable of multiple joint 
connections, and the link module has symmetrical geometry. Such a design allows 
modules to be attached in various orientations and the robot geometry to be altered 
by simple reassembling. The modular robot components developed in our univer- 
sity are shown in Fig. 1. This design follows the building -block principle whereby 
modules can be stacked together in various orientations through connecting points 
on all six faces of the cubes. 
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Fig. 2 (a) A branching modular robot, (b) kinematic graphs of the robot 


Assembly Incidence Matrix 

Definition 1 (Graph) A graph Q = (V, £) consists of a vertex set , V = {vo,* • *, v n }, 
and an edge set , £ = {e 0 , . . e m }, such that every edge in £ is associated with a pair 
of vertices, i.e ., e ; - = { Vy, . . v^}. 

In mechanism design theory, a kinematic chain of links and joints is often 
represented by a graph, termed a kinematic graph (Dobrjanskyj and Freudenstein 
1967), in which vertices represent the links and edges represent the joints. 
Using this graph representation, we are able to categorize the underlying 
structure (or geometry) of a linkage mechanism and apply the result from the 
graph theory to enumerate and classify linkage mechanisms. A robot manipulator 
is also a kinematic chain, thus, admitting a kinematic graph representation. 
For example, an 8 -module 7-DOF branch-type modular robot and its kinematic 
graphs are shown in Fig. 2a, b (in the digraph, the arrow on each edges point from 
the parent vertex to the children vertex). It is also known that a graph can 
be represented numerically as a vertex-edge incidence matrix in which the 
entries contain only Os and Is (Deo 1974). Entry (/, j) is equal to 1 if edge e 7 is 
incident on vertex vg otherwise, it is equal to zero. This incidence relationship 
defines the connectivity of the link and joint modules. Because link modules may 
have multiple connecting points, we can assign labels to the connecting points 
to identify the module connection. To further identify those connections in the 
incidence matrix, we can replace those entries of 1 by the labels of the connected 
ports being identified on the link modules and keep those entries of 0 unchanged. 
This modified matrix, termed an assembly incidence matrix , provides us the 
necessary connection information of the modules and also the basic geometry of 
the modular robot. 

Definition 2 (Assembly Incidence Matrix) Let Q be a kinematic graph of a 
modular robot and A4(Q) be its incidence matrix. Let port be the set of labels 
assigned to the connecting ports on the link modules. The assembly incidence 
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matrix of the robot A(G) is formed by substituting the Is in Ai(G) with labels in 
port on respective modules . One extra column and row are augmented to A{G) to 
show the types of link and joint modules . 

Note that the representation and assignment of the labels are nonunique. The 
labels of the connecting ports may be numerical values (Chen 1994) or may be 
derived from the module coordinates (Chen and Yang 1996). In this case, the 
module-component database should use consistent bookkeeping for this informa- 
tion. The AIM of the modular robot (8 link modules and 7 joint modules) shown in 


Fig. 4 is a 9 x 8 matrix: 
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Note that there are three types of link modules in the robot: the base ( B ), the large 
cubic module (Cl), and the small cubic module (C2). Cubic modules have six 
connecting interfaces labeled 1-6, i.e., port = { 1 ,. . .,6 }, which follows the labeling 
scheme on dice. The revolute joints and prismatic joints are denoted by R and P, 
respectively. 

Accessibility Matrix and Path Matrix 

Two matrices, namely, the accessibility matrix and the path matrix, derived from a 
given AIM are defined in this section to provide the accessibility information from 
the base module to every pendant module in a branch-type modular robot. The 
accessibility information enables us to formulate the kinematics and dynamics of a 
general branch-type robot in a uniform way. 

Module Traversing Order 

The links and joints of a serial-type robot can follow a natural order from the base to 
the tip. A branch-type robot has more than one tip and no loops. Therefore, the order 
of the links of a branch-type robot depends on the graph-traversing algorithms 
(Cormen et al. 1990). Let Q = (V, £) represent the kinematic graph of a branch-type 
modular robot with n + 1 link modules, where V = {vo, Vi,- • •, \ n } represents the set 
of modules. The fixed-base module is denoted by v 0 and is always the starting point 
for the traversing algorithm. The rest of the modules are labeled by their traversing 
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orders i. The traversing orders of the links in the robot of Fig. 2a are indicated by 
the numbers on the vertices of the graph of Fig. 2b. This order is obtained by the 
depth- first- search algorithm. Note that the farther the module is away from the base, 
the larger its traversing order. 

Directed Graphs 

A branch-type robot with n + 1 modules has n joints. Let £ — {e 1? • • •, e„} represent 
the set of joints, where joint e z is designated as the connector preceding link module 
v z . With a given traversing order, the robot graph Q can be converted to a directed 

graph (or digraph) Q, which is an outward tree for a branch-type manipulator in the 

following manner. Let e y = (v z , Vy) be an edge of the graph Q and i < j. An arrow is 
drawn from v, to Vy as edge e 7 leaves vertex v, and enters vertex Vy. Suffice to say, 
link v ? - precedes link Vy. An example of the directed graph is shown in Fig. 2b. From 
an outward tree with n vertices, an n x n accessibility matrix can be defined to show 
the accessibility among the vertices. 

Definition 3 (Accessibility Matrix) The accessibility matrix of a directed kinematic 
graph Q of a modular robot with n + 1 modules ( vertices ) is an (n + 1) x (n + 1) 

( i , j = 0, • • •, n), such that r t j = 1, if there is a directed path of 

length one or more from v, to v 7 -; r tj = 0, otherwise. 

The accessibility matrix can be derived from the AIM once the traversing order 

on the link modules is determined. For example, the accessibility matrix of Q in 
Fig. 2b is 


matrix , 1Z ( Q ) 


r a 



vo 

Vl 

V 2 

V 3 

V 4 

V 5 

V 6 

v 7 


Vo Vl v 2 v 3 v 4 v 5 
0 1111 1 

0 0 1 10 0 

0 0 0 1 0 0 

0 0 0 0 0 0 

0 0 0 0 0 0 

0 0 0 0 0 0 

0 0 0 0 0 0 

0 0 0 0 0 0 





From 1Z yQ j , we can obtain the shortest route from the base to the pendant link. 

This route is called a path. The pendant links are the rows of ^z(s^j with all 0s. The 

number of paths in a branching robot is equal to the number of pendant links. Let 
link v, be a pendant link. All link modules on the path from the base to v, are shown 
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in the nonzero entries of column i of yJZyQj + 1 ( n +i) x (n+i) j • Collecting all the 
paths, we obtain the path matrix. 


Definition 4 (Path Matrix) The path matrix V Q 


of a directed kinematic graph 



of a branch-type robot with n + 1 link modules ( vertices ) and m paths is an m x 


(n + 1) matrix , 1Z 




(/= 1,2, . . . ,m;j= 0, 1, . . . ,n), such that p f j = 1, if 


path i contains vertex j , and pij — 0 otherwise. 

For instance, the robot of Fig. 2a contains three branches (paths). The three paths 
can be represented as a 3 x 8 path matrix: 



V 0 Vi v 2 V 3 v 4 v 5 v 6 v 7 
111 1 0 0 0 0 " 

1 0 0 0 1 0 0 0 

1 0 0 0 0 1 1 1 



Row 1 represents the branch of the robot containing link modules v 0 , v l9 v 2 , and 
v 3 ; Row 2 represents the branch of v 0 and v 4 ; Row 3 represents the branch of v 0 , v 5 , 

v 6 , and v 7 . It can be seen that the rows ofV^Q^ are identical to Columns 3, 4, and 
7 of (k(q) +/( w +i)x(/i+i)), respectively. 


Geometry-Independent Models 


In the control and simulation of a modular reconfigurable robot system, precise 
kinematic and dynamic models of the robot are necessary. However, classical kine- 
matic and dynamic modeling techniques for robot manipulators are meant for robot 
with fixed geometry. These models have to be derived manually and individually 
stored in the robot controller prior to simulating and controlling the robot. Commer- 
cial robot simulation software usually provides end users with a library of predefined 
models of existing robots. The models of any new robot not in the library have to be 
derived exclusively from the given parameters and commands in the package. For a 
modular robot system built upon standard modular components, the possible robot 
geometries and degrees of freedom become huge. As shown by Chen (Chen 1994), 
the number of robot- assembly configurations grows exponentially when the module 
set becomes large and the module design becomes complicated. To derive all of these 
models and store them as library functions require not only tremendous effort but also 
very large amount of disk storage space. In such cases, it is impractical and almost 
impossible to obtain the kinematic and dynamic models of a robot based on the fixed- 
geometry approach. Hence, there is a need to develop an automatic model-generation 
technique for modular robot applications. 

Previous attempt to deal with automatic model generation for modular robots 
employed Denavit-Hartenburg (D-H) parameterization of the robot (Kelmar and 
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Khosla 1988; Benhabib et al. 1989). However, the D-H method does not provide a 
clear distinction between the arranging sequence of the modules in the robot chain 
and their spatial relationship. Also, it depends on the initial position of the robot: the 
same robot may have different sets of D-H parameters just because of the different 
initial or zero positions. When evaluating the task performance of a modular robot 
with respect to its corresponding geometry, complicated equivalence relationships 
must be defined on the sets of parameters to identify the uniqueness of the robot 
geometry (Chen and Burdick 1998). 

The formulation of the kinematics and dynamics is based on the theory of Lie 
groups and Lie algebras. The robot kinematics follows a local representation of the 
product-of-exponential (POE) formula, in which the joints, regardless of the types, 
are defined as members of se(3 ), the Lie algebra of the Euclidean group SE(3). 
The associated Lie algebraic structure can simplify the construction of the differ- 
entials of the forward-kinematics function required for numerical inverse solutions. 
The POE representation can also avoid the singularity conditions that frequently 
occur in the kinematic calibration formulated by the D-H method (Chen and Yang 
1997; Chen et al. 2001). Thus, it provides us with a uniform and well-behaved 
method for handling the inverse kinematics of both calibrated and uncalibrated 
robot systems. Since the joint axes are described in the local module (body) 
coordinate systems, it is convenient for progressive construction of the kinematic 
models of a modular robot, as it resembles the assembling action of the 
physical modular robot components. The formulation of the dynamic model is 
started with a recursive Newton-Euler algorithm (Hollerbach 1980; Rodriguze 
et al. 1991). The generalized velocity, acceleration, and forces are expressed 
in terms of linear operations on se(3) (Murray et al. 1994). Based on the 
relationship between the recursive formulation and the closed-form Lagrangian 
formulation for serial-robot dynamics discussed in (Featherstone 1987; Park 
et al. 1995), we use an accessibility matrix (Deo 1974) to assist in the construction 
of the closed-form equation of motion of a branch-type modular robot, which we 
assume is the generic topology of a modular robot. Note that all the proposed 
modeling techniques can contend with redundant and non-redundant modular 
robot configurations. 


Forward Kinematics 

The forward kinematics of a general branch-type modular robot starts with a given 
AIM and a dyad kinematic model that relates the motion of two connected modules 
under a joint displacement. A dyad is a pair of connected links in a kinematic 
chain. Using dyad kinematics recursively with a prescribed graph-traversing 
order assigned to the robot modules, we may obtain the forward transformation 
of every branch with respect to the base frame, having a prescribed set of 
joint displacements. Note that a branch-type robot is one without any closed-loop 
geometry. The kinematics of a closed-loop-type robot mechanism requires addi- 
tional constraints and is not considered here. 
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Fig. 3 Link assembly j 
connected to link i 



Dyad Kinematics 

Let v ? - and Vy be two adjacent links connected by a joint e 7 , as shown in Fig. 3. 
Denote joint e y and link v 7 as link assembly j and the module-coordinate frame 
on link v, as frame i. The relative position (including the orientation) of the 
dyad, v, and Vy, with respect to frame i with a joint angle qj , can be described by a 
4x4 homogeneous matrix: 



where sj G se( 3) is the twist of joint e 7 expressed in frame j , T/y(g/y) and T,y(0) G 
S£(3). T ? y(0) is the initial pose of frame j relative to frame i. Note that in the 
following context, the pose of a coordinate frame is referred to the 4 x 4 homogeneous 
matrix of the orientation and position of a coordinate frame: 



o 

where R,y(0) G SO (3) and d,y(0) G R are the initial orientation and position of link 
frame j relative to frame /, respectively. The twist Sj of link assembly j is the skew- 
symmetric matrix representation of the 6- vector line coordinate of the joint axis, 

o 

Sj = (qy, py); py, q y G R . py = ( Pj X , pjy , pj z ) is the unit-directional vector of the joint 
axis relative to frame j, and q 7 = q jy , q jz ) = p 7 x r 7 , where r 7 is the position 
vector of a point along the joint axis relative to frame j. For revolute joints, 
Sj = (O’ Py)’ and for prismatic joints, Sy = (qy, 0). 


Recursive Forward Kinematics 

Based on Eq. 4, we propose a recursive algorithm for a general branch-type 
modular robot, termed TreeRobotKinematics. This algorithm can derive the for- 
ward transformations of the base link to all pendant links based on graph-traversing 
algorithms. The procedure is illustrated in Fig. 4. Implementation details can be 
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Fig. 4 The TreeRobotKinematics algorithm 


found in an earlier work (Chen and Yang 1996). The algorithm takes three inputs: 
the AIM of the robot A(Q), the base link location T 0 , and a set of joint angles {q). 
The forward-kinematics calculation follows the breath-first-search (BFS) traversing 
algorithm to travel on the connected robot modules. 


Path-by-Path Forward Kinematics 

A tree-type robot consists of several paths that give the shortest routes from the base 
to the respective pendant links. Each path can be considered as a serially connected 
submanipulator so that the forward transformation can be derived as conventional 
industrial manipulator. The sequence of the connected modules in a path is indicated 

in a row of the path matrix Let a = {a 0 , ai, a 2 , a n } represent the links 

of path k. The base is a {) = 0 and the number of links in the path k is defined to be 
\a\ = n + 1. For instance, path 1 of the robot in Fig. 2a is a = {0, 1, 2, 3 }. The forward 
kinematics from the base to the pendant link a n of path k is given by 



For a branch-type modular robot with several paths, the forward kinematics is 



" r 

6t()(2 n 


r njLj (T ai _ iai (0)e s °> q °i) 1 

T(<7i , *72’ • • • W/?) 

1 

• • • o 
1 


n? =l (T bi _ lb xoy^) 

• 

• 

• 



where T(q u q 2 , . . q n ) represents the vector of 4 x 4 homogeneous matrices of the 
poses of all the pendant end-effectors. Since many paths in the branch-type robot 
share many common modules, there will be repetitive calculations using the model 
of Eq. 7. In actual implementation, we prefer the recursive approach, which 
introduces no repetitive calculations. 


Inverse Kinematics 

The purpose of an inverse kinematics algorithm is to determine the joint angles 
that cause the end-effector of a manipulator to reach a desired pose. Current robot 
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inverse kinematics algorithms can be categorized into two types: closed form 
and numerical. Closed-form-type inverse kinematics requires a complete param- 
eterization of the solution space, usually in terms of simultaneous polynomial 
equations. Solutions to such a set of simultaneous polynomial solutions exist 
for a few types of robots with revolute joints or simple geometry. It is very 
difficult to obtain the inverse kinematics for an arbitrary modular reconfigurable 
robot in this manner. Here we adopt the numerical approach to solve the 
inverse kinematics of modular robots. The inverse kinematics algorithm will 
construct the differential kinematic model using the local POE formula. 
The differential kinematic equation of a single branch of a branch-type 
robot is considered first. Based on the AIM of the robot, one can extend this 
differential kinematics model to include multiple branch structures. Then the 
Newton-Raphson iteration method is used to obtain the numerical inverse 
kinematics solutions. The differential kinematic model can be easily modified 
to solve the pure position, pure orientation, and hybrid inverse kinematics 
problems (Chen et al. 1999a). 

A Single Branch 

Let T aoan be the forward transformation of path k as indicated in Eq. 6. The 
differential change in the position of the end-link a n can be given by 



Left multiplying r ^ , Eq. 8 becomes 







Equation 9 is the differential kinematic equation of a path. Let 7^ denote the 

desired position of the end-effector. When it is in the neighborhood of a nominal 
position of the end-effector T aoan , we have 

r JT — T d — T 

U1 a 0 a n 1 ao a n 1 a n - 

Left multiplying T~^ to Eq. 9, and using the matrix logarithm, 


( 10 ) 
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We can obtain the following equation by first-order approximation: 





Substituting Eq. 12 into Eq. 9, we obtain 




Explicit formulae for calculating the logarithm of elements of SO( 3) and SE(3) 
were derived by Park and Bobrow (Park and Bobrow 1994). Definitely, 

log (r- o l a n T d aoan ) is an element of se(3) so that it can be identified by a 6 x 1 vector 

denoted by log {r~^ an T d ()an ^ in which the first and later three elements represent the 

positional and orientational differences between T aoan and T d a()Cln . Converting Eq. 13 
into the adjoint representation, we get 


iog(r 


— 1 rjid 
Ct^Ctyy &0d n 


V 


a 


-1 


Adr-i 


T 


&0 a n 


'E AdT %“ i Sa ‘ d ^r 


(14) 


i=\ 


Conveniently, Eq. 14 can also be expressed as the following form 


D 


T 


k 


Jkdqk, 


(15) 


where 


D 


T k 


io g (r 


— 1 

&{)Cl n Cltydyi 
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G R 


6x1 


is referred to as the pose difference vector for 


path k 


J 


k 


AkPkSk E R 


6 x (I a\ 


1) 


is termed as body manipulator Jacobian matrix 


(Murray et al. 1994) 


A k — Adj - 1 G R 


6x6 


a oa f 7 


B k = row 


Adr ,Adr , . . . , Adj 

1 1 1 ^ 0^2 1 a 0 a n 
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S k = diag . .,s a „] e/? 6(|a| 1)x(|fl| 1} 


dq k = column [dq ,dq . . . ,dq a 1 € R 


a 


-l)xl 


Equation 15 defines the differential kinematics for path k. It can be utilized in the 
Newton-Raphson iteration to obtain an inverse kinematics solution for a given pose. 


Entire Manipulator 

The paths of a branch-type manipulator may not be independently driven, because 
of the common sharing modules. This forbids us to treat each path as independent 
serial-type manipulators. Hence, with a given set of the pendant end-effectors’ 
poses for all branches, the inverse kinematics must be solved simultaneously. With 
the assistance of the path matrix, we are able to identify the connected and related 
modules in a path. Then, we can orderly combine the differential kinematic 
equations (Eq. 15) of all constituting paths into a single matrix equation of the 
following form: 



where 

D t = column [D Tl ,D Tl , . . ,,D Tm 
difference vector 


^ j^6mx 1 


is termed the generalized pose 


J = ABS € R 6mxn 

and 


is termed the generalized body manipulator Jacobian matrix ; 


A = diag [A i , A 2 , 




6m x 6n 



P\\ Ad T m 

P2l Ad T m 


P\2 Ad T 0 i 

P22 Ad T m 


Pln Ad T 0n 

P2n Ad T 0 „ 



6m x 6 n 


_P ml^dr oi P mZ^dr Q2 • • • Pmn^^To n _ 

The coefficient pdi = 1, 2, . . ., m\ j = 0, 1, 2, . . ., n) is entry (/, j) of the 
path matrix p , and m is the total number of paths; S = diagC^, s 2 , . . s n ] G R bnxn ; 
dq = column^!, dq 2 , . . dq n ] G R nxl . 

Rewriting this equation in an iterative form, we get 


dq i+l = J*D t , (17) 

q i+l = cf + dq i+l , (18) 

where i represents the number of iterations and J is the Moore-Penrose 
pseudoinverse of /. Using the Newton-Raphson method, a closed-loop iterative 
algorithm similar to that of Khosla, Newman, and Prinz (Khosla et al. 1985) is 
employed (Fig. 7). The iterative algorithm determines the necessary changes in the 
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Fig. 5 Inverse kinematics algorithm 


joint angles to achieve a differential change in the position and orientation of the 
end-effector. Given a complete robot assembly (or the AIM) and a set of desired 
poses T 1 , this algorithm starts from an initial guess, q°, somewhere in the neigh- 
borhood of the desired solution. It is terminated when a prescribed termination 
criteria is reached. As one can see, the structure of J depends on the path matrix, 
which is implied in the kinematic graph of the robot. Therefore, once the assembly 
configuration of a modular robot is determined and all module parameters are 
obtained, the differential kinematic model (Eq. 16) can be generated automatically. 

Computational examples of the inverse kinematics algorithms for branch-type and 
serial modular robots are given by Chen and Yang (Chen et al. 1999a) to illustrate the 
algorithm’s applicability and effectiveness. When compared to the other numerical 
inverse kinematics algorithm using D-H parameters, our method always use less 
number of iterations and computing time for the same given pose. This is due to 
the use of the pose difference vector computed from the matrix logarithm in Eq. 1 6, 
and not the difference of homogeneous transformation matrices. Actual implementa- 
tion of the algorithm using C++ codes shows that the computation time for each 
solution can take less than 20 ms on a Pentium II 300 MHz PC, which satisfies the 
basic requirement for real-time control and simulation (Fig. 5). 


Kinematic Calibration 

The machining tolerance, compliance, and wear of the connected mechanism and 
misalignment of the connected module components may introduce errors in posi- 
tioning the end-effector of a modular robot. Hence, calibrating the kinematic 
parameters of a modular robot to enhance its positioning accuracy is important, 
especially in high-precision application such as hard-disk assembly. 

Current kinematic calibration algorithms for industrial robots that are designed 
for certain types of serial manipulators are not suitable for modular robots with 
arbitrary geometry. Here we propose a general singularity -free calibration-model- 
ing method for modular reconfigurable robots, based on the forward kinematics 
discussed in the previous section. This method follows local POE formulae. 
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The robot errors are assumed to be in the initial positions of the consecutive 
modules. Based on linear superposition and differential transformation, a six- 
parameter model is derived. This model can be generated automatically once the 
AIM of the robot is given. An iterative least-square algorithm is then employed to 
find the error parameters to be corrected. 

The calibration starts with a serial-type manipulator kinematics model: 

Ton(q) = T{)i(q ] )Ti2(q 2 )- "T n -\ n {q n ) (19) 


Thus, 


T 0n ( q) = Toi(0)e Siqi Tn(0)e S2q2 - • T w _ 1? n (0)e Snqn (20) 

Extension to a general branch-type modular robot is similar to the treatment 
of the inverse kinematics model in previous section. Basically, Eq. 20 can be 
treated as a function of the joint angles, q = (q u • • •, q n ) 9 locations of the joint 
axes, s = (s\ 9 - • •, s n ) 9 and the relative initial positions of the dyads, T 0 = (T 01 ( 0), • • •, 
Tn- 1, n(0)): 


Ton =f(T o,s,q). 


Written in differential form, we have 




The differential dT 0n can be interpreted as the difference between the nominal 
position and the measured position. 


Error Model of a Dyad 

Our kinematic calibration is based on the local frame representation of a dyad 
described in Eq. 4. Two assumptions are made in the dyad of link Vy_ i and v, of 
a modular robot chain: first, small geometric errors only exist in the initial position 
Tf _ i ,(0) and, second, the twist and joint angle q t assumes the nominal values 
throughout the calibration analysis. Hence, instead of identifying the module’s 
actual initial positions, joint twists, and angle offsets, we look for a new set of 
local initial positions (local frames, called calibrated initial positions), in the 
calibration model, so that the twist of the joint remains the nominal value. In 
other words, the errors in a dyad are lumped with the initial position. Therefore, 
ds and dq can be set to 0. Because SE(3) has the dimension of six - three for 
positions and three for orientations - there can be only six independent quantities in 
T t _ i z (0), and there will be six independent error parameters in a dyad. Denote the 
small error in the initial position of dyad (v, _ 1? v,) as dTf _ 1?/ (0), then 
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where dx { , dy h and dz t are infinitesimal displacements along x- , y- , and z-axes of 
link frame /, respectively, and dx h dy h and Szj are infinitesimal rotations about x-, y-, 
and z-axes of link frame i , respectively. 


Gross Error Model of a Robot 

Similar to the error model of a dyad, the gross-geometric error dT 0n between the 
actual end-effector position and the nominal position can be described as 



dT 
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(25) 

(26) 


where dx 0n , dy Gn , and 8zq h are the rotations about the axes of the base frame and 
dx {)n , dy 0n , and dz 0n are the displacements along the axes of the base frame, 
respectively. Note that the gross error, dT 0n , is expressed in the base frame. 
Equation 25 follows the left multiplicative differential transformation of T 0n . The 
calibrated position of the end-effector becomes 

Ton'(d) = Ton + dTo n . (27) 


Linear Superposition 

Based on the assumptions, the errors in the dyads will contribute to the gross error 
in the end-effector’s position dT 0n . Since the geometric errors are all very small, the 
principle of linear superposition can be applied. We assume that the gross errors 
dT 0n are the linear combination of the errors in the dyads dT t _ i,/(0), (/ = 1, 2, . . ., n)\ 
then 




Y] T 0 , i 



i— 1 , i 




1 


(28) 
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Equation 28 converts and sums the initial position errors of the dyads in the 
base-frame coordinates. The forward kinematics of link frame j relative to link 
frame i (/ < j) is represented by Ty. Especially, Ty = / 4 x 4 when i = j. Substituting 
Eq. 23 into Eq. 28, and right multiplying T 0n \ 


dTo n T 0f l — Aq« 


n 




i - 1 


From the first-order approximation, we have 


A 


0 n 


dTo„T 0l l « iog(r 0 „'r 


-1 
0/7 J ’ 


Converting Eq. 31 into the adjoint representation, we have 


log v (7’o„'7’o„ 1 ) = Y.Adr^ Ud Ti _ v(0) ( A,- 


7-1 


Equation 32 can also be expressed in the following matrix form 


(29) 

(30) 





where 

y = log v {Ton'T^) G/? 6x1 ; 

x = column[A!, A 2 , • • •, A„] G R 6 " xl ; and 

^ ^6x6/7 

In Equation 33, x represents the error parameters to be identified in a modular 
robot assembly. The quantities in matrix A and T 0n l are determined from the nominal 
model. T 0n ' comes from the actual measured data. To improve the accuracy of the 
calibration model, the kinematic calibration procedure usually requires the position of 
the end-effector to be measured in several different robot postures. For the I th 
measurement, we obtain y L and A, After taking m measurements, 


A 


row 


Ad 


T 0 


,i(0 )’ Ad 
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where 
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6m x 1 
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A = column [A i,A 2 ,* • *, A m 



6 m x 6 n 


The least-squares solution for x can be obtained by 



where A is the pseudoinverse of A and A = 

/ T\ ~ l ~ t 1 

AA for m < n\ and A = A for m 


f ~t ~\ _ i ~t ~t ~t 

(A Aj A for m > n; A = A 

= n. The calibration procedure is 


illustrated in the diagram of Fig. 6a. Computer simulation and actual experiment on 
the modular robot systems described by Chen and Yang (Chen and Yang 1997) and 
Chen et al. (Chen et al. 2001) have shown that this calibration method can improve 
the accuracy in end-effector positioning by up to two orders of magnitudes. 


Dynamics 


The dynamic model of a robot can be formulated with an iterative method through a 
recursive Newton-Euler equation. This method can be generally applied to branch- 
type robots without modification. Here we present a method to generate the closed- 
form dynamic models of modular robots using the AIM and the recursive algorithm. 


Newton-Euler Equation for Link Assembly 

Assume that the mass center of link assembly j is coincident with the origin of the 
link module frame j. The Newton-Euler equation of this rigid link assembly with 
respect to frame j is 



fj 



0" 


Vj 
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" w i x m J v j " 

IL 
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Wj X JjWj 



fS x 1 

where F ) E R is the resultant wrench applied to the center of mass relative to 
frame j. The total mass of link assembly j is rrij (which is equal to the sum of link v y - 
and joint e ; ) . The inertia tensor of the link assembly about frame j is J r 
Transforming Eq. 36 into the adjoint representation, we have 
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MjVj 


adl (MjVj) . 


(37) 


The following notations are adopted: 
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is the generalized mass matrix; 
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is the generalized body velocity, where v, and Wj are 3 x 1 


vectors defining body translational velocity, Vy 

j 1 

velocity, w, = (w x , w y , w z ) , respectively; 


j 
T 


(v x , v y , v z ) , and the angular 
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Fig. 6 (a) Calibration algorithm for modular robots; (b) Dynamic model generation 


ady. G R oxo is the transpose of adjoint matrix ady j related to Vj 
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( 38 ) 


Vj and Wj G7? 3x3 are skew- symmetric matrices related to Vj and Wj, respectively; 


and Vj 


V 

Wj 


G R 


6x1 


is the generalized body acceleration. 


Recursive Newton-Euler Algorithm 

The recursive algorithm is a two-step iteration process. For a branch-type robot, the 
generalized velocity and acceleration of each link are propagated from the base to 
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the tips of all branches. The generalized force of each link is propagated backward 
from the tips of the branches to the base. At the branching module, generalized 
forces transmitted back from all branches are summed. 

Forward Iteration 

The generalized velocity and acceleration of the base link are given initially: 


V h = 

= v 0 = 

= (0,0,0,0,0,0) r 

(39) 

V h = 

= V 0 = 

= (0,0,2,0,0,0) r 

(40) 


where V b and V b are expressed in the base frame 0. We assume that the base frame 
coincides with the spatial reference frame. The generalized acceleration (Eq. 40) is 
initialized with the gravitation acceleration g to compensate for the effect of 
gravity. Referring to Fig. 4, the recursive body velocity and acceleration equations 
can be written as 


Vj = Ad T l (Vi) + Sj q j 
Vj = Adj— i (Vj) + ad A< j r _ , (l /,) (s#;) + sjqj 

J ij 

where all the quantities, if not specified, are expressed in link frame j. 

Vj and Vj are the generalized velocity and acceleration of link assembly j ; 
qj and cjj are the velocity and acceleration of joint e 7 , respectively; 

Ad T - 1 is the adjoint representation of TJj l (qj), where Tyiqj) G SE( 3) is the 

J 

position of frame j relative to frame i with joint angle qj and Ad T -\ = ( Adp , y ) 1 ; and 

6x1 

Sj G R is the twist coordinates of joint e y . 


(41) 

(42) 


Backward Iteration 

The backward iteration of the branch-type robot starts simultaneously from all the 
pendant link assembly. Let Vpo C V be set of the pendant links of the branch-type 

robot. For every pendant link assembly d, (vj. G Vpo), the Newton-Euler equation 
(Eq. 37) can be written as 



F% + M dl V di 



where Fj. is the wrench exerted on link assembly v^. by its parent (preceding) link 
relative to frame d, and F e d . is the external wrench exerted on vj. . Note that the total 

wrench is Fj. = F di + F e dj . Now traverse the links in the robot backward from the 

pendant links. Let Vm be the set of successors of link v,. For every link assembly /, 
the Newton-Euler equation (Eq. 37) can be written in the following form: 



( 44 ) 
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where all quantities, if not specified, are expressed in link frame /; F t G R 6xl is the 

6x1 

wrench exerted to link assembly i by its predecessor; F, G R is the wrench 


exerted by link assembly i to the successor y/e V Hi expressed in link frame j; and Ff 
is the external wrench applied to link assembly i. The total wrench is 


F 


l 


F; 


l 



j e v Hi 


AU'r ,(/■)) +r;. 


The applied torque/force to link assembly i by the actuator at its input joint e z can 


be calculated by 





Closed-Form Equations of Motion 

By iteratively expanding the recursive Newton-Euler equations (Eqs. 39—44) in the 
body coordinates, we obtain the generalized velocity, generalized acceleration, and 
generalized force equations in matrix form: 


V - GSq 

(46) 

V - G Xo v 0 + GSq + GAiV 

(47) 

F - G r F E + G t MV + G t A 2 MV 

(48) 

t - S r F 
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where 


n 


V = column! Vi, V 2 , ■ ■ ; V n ] G R 

• pi • • 

V = column [Vi , V 2 » * * •» V 

F = column[F 1 , F 2 , • • 
t = column^!, t 2 , • • •, 
q = column qi 9 q 2 ,’ * -,q 

q = column [4r 1? q 2 ,• • •, q 


6 nx 1 


is the generalized body velocity vector 


G R 6n x 1 is the generalized body acceleration vector 

6 nx 1 


is the body wrench vector 
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r n ] E R nxl is the applied joint torque/force vector 
G R nxl is the joint velocity vector 

G R nxl is the joint acceleration vector 
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is the generalized acceleration of the base link 
6 " x " is the joint twist matrix in the respective body 
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is the total generalized mass matrix 
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is the external wrench vector 
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Note that 7 Z\Q 


r ij 


G T?( /?+1 ) x ( /7+ i) is the accessibility matrix. The matrix 


G is called a transmission matrix. Substituting Eqs. 46-48 into Eq. 49, we obtain 
the closed-form equation of motion for a branch-type modular robot with n + 1 
modules (including the base module): 


M(q)q + C(q,q)q + N(q) 


t 


(50) 


where 


C(q,q 


N(q) 


M(q) 


TrxT 


S' G' MGS 


(51) 


S r G r (MGAi + A 2 M)GS 


(52) 


r m i r m i w rp rji 

S r G r MGr„Vn + S T G T F E 


T 0 *0 


(53) 


The mass matrix is M(q); C(q, q) represents the centrifugal and Coriolis 
accelerations; N(q) represents the gravitational force and external forces. The 
procedure for obtaining the closed-form equation (Eq. 50) is summarized in 
Fig. 6b. It has been successfully implemented in Mathematica code. 


Configuration Optimization 

Introducing modularity in a robotic system implies that the system performance 
can be optimized through proper selection and reconfiguration of module 
components. The task planner for the modular robotic workcell will be able to 
determine the optimal robot configuration and geometry for a given task from an 
inventory of robot modules. Figure 7 depicts the general approach for determin- 
ing the optimal assembly configuration. Shaded blocks represent the basic for- 
mulation of the optimization problem. With a given set of modules selected from 
the component database, all possible and unique assembly configurations can be 
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Fig. 7 Determination of a task-optimal configuration 


generated and identified through an enumeration algorithm (Chen and Burdick 
1998). In the second step, an objective function is formulated to evaluate the 
performance of every assembly configuration, based on the task specifications. 
A basic robot task contains task specifications that are provided by the task 
planner (the goal positions/orientations, force application, accuracy, and dexter- 
ity of the end-effectors) and constraints to be overcome (obstacle avoidance, 
workspace limit, singularity, and kinematic redundancy) (Chen and Burdick 
1995; Yang and Chen 2000). A search/optimization procedure is employed in 
the last step to find the optimal assembly configuration. 

Note that all the dimensions of the modules have been previously designed and 
fixed at the selection stage. With a given set of modules, the possible combination of 
robot-assembly configurations is always a finite number. Therefore, the parameter 
space for the optimization is discrete, and combinatorial optimization methods can be 
applied. Exhaustive search algorithms can be used to find the exact optimal solution, 
but the exponential growth of the data set impedes the efficient implementation of 
such an algorithm. Random-search techniques such as genetic algorithms 
(GA) (Chen 1994) and simulated annealing (SA) (Paredis et al. 1997) are more 
suitable for such problems. Transition rules for data points required in GA and SA 
can be easily implemented based on a data-representation scheme such as AIM. 


Task-Oriented Objective Function 

The crucial point in determining the optimal robot configuration is formulating an 
objective function that will assign a “goodness” value to every assembly configu- 
ration accomplishing a specified task. The form of the objective function should 
be general enough so that it is applicable to a wide variety of task requirements. 
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Fig. 8 The ACEF for serial modular robots 


Two components of a robot task - task specifications and constraints - must be 
considered in formulating the objective function. We call this function an assembly 
configuration evaluation function (ACEF). The assembly configuration with the 
greatest ACEF value is deemed optimal. It is also important to note that from a 
given set of modules, it is possible to construct robots with various topologies, such 
as serial or parallel kinematic structures. Even with a fixed robot-topology class, the 
number of degrees of freedom (DOF) can alter the kinematic functionality of the 
system. Here we propose a solution strategy for modular robot with a fixed topology 
and a fixed number of DOF. 

The structure of the ACEF for a serial modular robot is shown in Fig. 8. The input 
is an AIM with a predefined number of DOFs and predefined topology. The output 
is the “goodness” of the AIM in terms of a nonnegative real number. An AIM with a 
large ACEF value represents a good assembly configuration. The ACEF consists of 
two parts: task and structure evaluations. Task evaluation is performed according to 
the given task specifications: the task points (or the positions of the end-effector) 
and a designated criteria measure, such as the dexterity or the manipulability. 
A workspace check on the task points is executed before computing the measures 
for filtering out inaccessible points. Structure evaluation assesses the kinematic 
constraints (joint singularity and redundancy, link interference) and environmental 
constraints (workspace obstacles) imposed on the robot in accomplishing 
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Fig. 9 The evolution 
algorithm 



the assigned task. The proposed ACEF assumes the modular robot is operated in a 
structured environment and that there are no obstacles in the workspace. 
An auxiliary function, termed the module -assembly preference (MAP) is defined 
on the AIM to exclude undesirable kinematic features. Detailed implementation of 
the task and structure evaluation can be obtained from Chen (Chen 1996). 


Evolutionary Algorithms 


An evolutionary algorithm is a probabilistic search/optimization method based on the 
principle of evolution and hereditary of nature systems (Michalewicz 1994). In this 
algorithm, a population of individuals for each generation is maintained. The indi- 
vidual is implemented with some data structure and is evaluated by a “fitness 
function” to give a measure of its “fitness.” A new population is formed by selecting 
the more suitable individuals. Members in the new population undergo transforma- 
tions by the “genetic operators” to form new solutions. Through structured random 
information changes, the new generation is more “fit” than the previous generation. 
After a number of iterations, the individuals will converge to an optimal or near- 
optimal solution. Here we attempt to use the AIMs as the data stmcture of the solution 
and define AIM-related genetic operators (Chen 1996) as solving the task-optimal 
problem in an evolutionary approach, because AIM is a natural representation of the 
modular robot and is topologically independent. 
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Fig. 10 The initial generation 



b 

Fitness 



Fig. 1 1 (a) Optimal assembly configuration, (b) average and maximum fitness in each generation 

Figure 9 depicts the application of the evolutionary algorithm in solving the task- 
optimal configuration problem. An example of optimizing the configuration of a 
4-DOF modular robot is provided in the following example. Suppose we wish to 
find a 4-DOF fixed-base serial robot with revolute joints that passes through two 
task points Pi and p 2 . Also suppose that we require there be no redundant j oints and 
minimum link interference. Let the performance measure of the robot be the 
manipul ability. The initial set of AIMs randomly generated is shown in Fig. 10. 
The population size is 8, and the evolution stopped after 30 generations. 
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The assembly configuration in the target generation that has the highest fitness 
value is chosen as the optimal one (Fig. 11a). The average and maximum fitness 
values in every generation are shown in Fig. lib. As can be seen, the evolutionary 
algorithm does increase the fitness values generation by generation. Although the 
best solution may not be guaranteed, a suboptimal solution can always be found, 
and in return, the efficiency of finding the solution is increased. 


Simulation Software for Modular Robots 

To visualize and simulate the performance of an assembled robot, such as 
reachability and workspace, a robot simulation software application is necessary. 
The Simulation Environment for MOdular Robot System (a.k.a. SEMORS ) is a 
Windows-based object-oriented software application developed for this purpose. 
Based on the proposed local POE models and AIM data structures, SEMORS 
offers uniform and automatic model construction effort (kinematics, dynamics, 
and calibration) across computer simulation and real-time control of arbitrary 
robot configurations (Chen et al. 1999b). The basic graphical user interface of 
SEMORS is illustrated in Fig. 12. SEMORS is intended to be a uniform interface for 
all modular robots and is portable to modular robot systems from different vendors. 
It will be used both for simulation and for online execution of a task, regardless 
of whether the robot is executing (or is simulated to be executing) the task as 
a stand-alone application or as part of a workcell process. Thus, it allows 
the user to quickly integrate the hardware components into modular robots and 
to manage their operations in the reconfigurable workcell. Key features of SEMORS 
include: 

• Module and robot builder 

• 3D graphical task simulation 

• “Universal” inverse kinematics 

• Full dynamics models 

• Trajectory and task planning 

• Transparent workcell network connectivity 

In addition to the simulation of modular robots, extended features like robot 
configuration planning/optimization and module database management are 
implemented as separate application packages to be used along with SEMORS. 
The task-based robot configuration optimization mentioned in section “Geometry- 
Independent Models” is a generic and platform-independent methodology. With the 
capability of task-based robot configuration optimization, designing the modular 
robot configuration using SEMORS becomes no longer an ad hoc approach. The 
software system will provide the end user an optimized robot configuration 
according to the input task requirements. The user does not need to start the design 
work from scratch. Rather, based on the result of optimization, he can fine-tune the 
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Fig. 12 User interface of SEMORS 

suggested robot design or layout. The development effort and time for the workcell 
can be greatly reduced. 

Realization of a Reconfigurable Robotic Workcell 
System Architecture 

Figure 13 illustrates the system layout of a reconfigurable robotic workcell jointly 
developed by Nanyang Technological University and Singapore Institute of 
Manufacturing Technology (Chen 2001). The purpose is to demonstrate the adapt- 
ability of such a system to perform a variety of tasks, such as part assembly, 
material transfer, and light machining (grinding, polishing, and deburring), through 
rapid change of reusable workcell components. In this system, workcells are made 
of standard interchangeable modular components, such as actuators, rigid links, 
end-of-arm tooling, fixtures, and sensors. These components can be rapidly assem- 
bled and configured to form robots with various structures and degrees of freedom. 
The robots, together with other peripheral devices, will form a complete robotic 
workcell to execute a specific manufacturing task or process. The corresponding 
intelligent control and simulation software components are then reconfigured 
according to the change of the workcell configuration. The maintenance and 
upgrade of the system are simplified by replacing the malfunctioned or outdated 
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V / 


Fig. 13 Deployment of a reconfigurable robotic workcell 

components. Converting a manufacturing line from one product to another can be 
very fast in order to keep up with the rapidly changing marketplace. 

In this system, the workcell software is designed in reusable- and reconfigurable- 
object fashion for ease of maintenance and development. Figure 14 illustrates the 
overall software architecture of the modular workcell. The user environment will 
provide all the necessary functions to facilitate the end user in controlling, monitoring, 
and simulating the workcell. It consists of the following parts: 

Component browser - for viewing and editing the components available in the 
component database. 

Simulator - for generating a computer simulation model of a modular robot and the 
entire workcell; additionally, the simulator may be employed as the core func- 
tion for future virtual manufacturing capabilities. 

Task level planner - for determining the optimal geometry of a modular robot for a 
given task and the overall layout of the workcell for a particular manufacturing 
process. 

Programming interface - for providing command and control of the system. 
Controller - for commanding the low-level individual controllers located in the 
components and identifying the robot’s geometry from the local component 
controllers. 

The system kernel, which is hidden from the user, provides automated model- 
generation functions and the configuration-optimization function (a component 
database is also associated with it): 
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Fig. 14 Software architecture for reconfigurable workcell 


Object-oriented component database - manages the specification of all the com- 
ponents, such as the dimensions and weights of the links, maximum kinematic 
and dynamic performance of the actuators, etc. It can be accessed by the user for 
browsing and editing purposes. 

Geometry -independent kernal functions - generates kinematic and dynamic models 
of the robots shared by the simulators and the controller. Using identical models in 
the simulation and control of the workcell insures the reliability and integration of 
the system and enables physically based simulations through the workcell controller. 
The configuration- optimization function can enumerate all possible robot geometry 
from an inventory of module components in the database and select the most suitable 
one for a prescribed task. This information will pass back to the task level planner to 
determine the optimal layout and locations of the robots in the workcell. 

The information passing from the component database to the modeling functions 
is through the assembly incidence matrix. Robot geometries (serial, branch, or 
hybrid) and detailed connection information, such as the connecting orientation 
and the types of adjacent modules, are all indicated in the matrix. This matrix is 
then passed to the geometry-independent functions for model generation. 

In such a system, the need to maintain a huge library of robot models is 
eliminated; instead, we maintain a small selection of the component database and 
kernel functions for automated model generation, reducing the overall footprint of 
the system software. 
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Fig. 15 A light-machining 
workcell with modular robot 
components 



Workcell Prototype 

We have constructed a prototype workcell for light-machining tasks in an industrial 
exhibition in 1999 (Fig. 15). This workcell was built with multiple reconfigurable 
robots along with other supporting devices under a unified modular approach. 

• Preliminary Design Stage 


To make use of the advantages of both parallel-typed and serial-typed robots, we 
intend to make the workcell to perform a complete milling operation of a workpiece, 
starting from picking up the object, transferring the object to a milling robot, starting 
the milling process, and returning the workpiece back to a storage rack. Based on this 
preliminary concept, we decide to use two reconfigurable robots in this workcell: one 
is a serial-typed robot for the pick-and-place operation and the other is a parallel-typed 
robot for the milling operation because of its structural rigidity. The task is to perform 
milling operation on a dome-shaped top of a cylindrical workpiece with 15 cm in 
diameter. A workpiece transfer system should be used in between the two robots. 

• Robot Configuration Selection and Construction 

Based on the preliminary task description, the workcell is configured with a 7-DOF 
redundant serial-type robot, a 6-DOF articulate RRRS parallel robot, and a 1-DOF 
linear motion stage. From the robot configuration optimization, a 4-DOF SCARA- 
type robot is sufficient to perform the task. Deploying a redundant robot here is to 
demonstrate that the proposed model-generation algorithms used in SEMORS and in 
robot control are universally applicable for any configuration. 

The configuration design of the parallel robot follows a systematic approach 
(Yang et al. 1999). In principle, a 3-branch parallel structure is used because of the 
structure stiffness and dexterity. Each branch consists of three rotary joints (two are 
active and one is passive) and a passive spherical joint. Once the geometry is 
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Table 1 Specifications of 
the light-machining 
workcell 


Light-machining workcell 
7 -DOF redundant serial robot 


Work envelope 

Approx, sphere, SR = 1,200 mm 

Max speed 

750 mm/s 

Repeatability 

+/— 0.10 mm 

Max payload 

5 kg (excluding end-effector) 

Weight 

16 kg (excluding base) 


6-DOF RRRS articulate parallel robot 


Work envelope 

Approx, hemisphere, SR = 500 mm 

Max speed 

500 mm/s 

Repeatability 

+/— 0.05 mm 

Max payload 

25 kg (excluding end-effector) 

Weight 

30 kg (excluding base) 


1 -DOF linear motion stage 


Effective stroke 

L = 1,500 mm 

Max speed 

500 mm/s 

Repeatability 

+/— 0.025 mm 

Max payload 

45 kg (excluding fixture) 

Weight 

35 kg 


determined, the workspace analysis is performed. From the result of this analysis, 
the lengths of the rigid links and connectors are determined. Because of the modular 
design, the actuator modules can be freely located at the nine revolute joints. The 
workspace of the robot changes according to the locations of the actuator modules. 
A disk- shaped moving platform is attached to the three branches. An end-mill tool 
actuated by an intelligent motor is mounted at the center of the platform. This motor 
uses the same control interface as the standard actuator modules. Because of the 
lack of the force sensor, the task is only carried out in simulated manner, i.e., 
the end-mill tool only goes through the milling path without touching the surface of 
the workpiece. 

The 1-DOF linear motion stage uses two standard modules: one rotary module to 
drive the linear slide and one gripper module to hold the workpiece, to ensure 
uniformity in the workcell control. The specifications of the robots and the motion 
stage are listed in Table 1. 

• Workcell Construction and Fine-Tuning 

After the robots and motion stage are constmcted, the robot controllers are 
connected to the robots. Two Pentium II-based industrial PC robot controllers are 
used to perform high-level trajectory control of the serial robot and the parallel robot, 
respectively. The kinematic models of both serial and parallel robots are generated 
automatically in SEMORS and stored in the robot controllers. Kinematic calibration 
of both robots is performed before the operation. The kinematic calibration 
is conducted by using articulate-typed coordinate measuring equipment, called 
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Table 2 Task sequence 


Task sequence 


1 

Robot A a picks up workpiece from fixture 

2 

Robot A places workpiece on the motion stage 

3 

Motion stage moves workpiece under Robot B a 

4 

Robot B performs milling task 

5 

Robot A shifts locations of unprocessed workpieces 

6 

Robot B finishes milling task 

7 

Motion stage moves processed workpiece back 

8 

Robot A picks up processed workpiece from motion stage 

9 

Robot A places processed workpiece to the fixture 


a Robot A: 7-DOF serial robot, Robot B: 6-DOF parallel robot 


“Spin Arm.” The obtained calibration data is transferred to the robot controller and 
then SEMORS computes and updates the corrected kinematic models of the robots 
automatically. Because of its simplicity, the control of the motion stage is done by one 
of the robot controllers for this implementation. 

• Finalize Task Sequence and Control of the Workcell Actions 

With updated kinematic models, the detailed task sequence of all robots (Table 2) 
is laid out. The tasks are then programmed into the respective robot controllers. The 
two robot controllers are connected to a closed-loop workcell LAN running at 
10 MB/s. A separate notebook computer is also connected to the workcell network 
performing supervisory control of the workcell through SEMORS running on the 
individual robot controllers. The task sequence of the workcell is monitored and 
supervised by the notebook supervisor. 

Based on the actual construction, to assemble the described 7-DOF serial-typed 
robot takes two users about 30 min. The time to construct the parallel robot requires 
two persons about 2 h because of the complexity of the structure. Adding the time to 
install the motion stage, calibrate the robots, and fine-tune the workcell hardware, it 
will take about 4 h in total to complete the entire workcell setup excluding the time 
spent on the preliminary design stage. 


Summary and Future Aspects of Modular Robots 

In this chapter, we have presented the design principle, theoretical foundations, and 
also the implementation of modular reconfigurable robots for industrial applica- 
tions. On the module design, we pointed out that the robot design would follow the 
classification principle of modular product architecture that is commonly seen in 
engineering product design. On the modular robot theory, we have presented a 
generic method to automate the model generation for modular reconfigurable robots 
based on a graph representation of robot geometry, called an assembly incidence 
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matrix, and geometry-independent model building algorithms for the kinematics, 
dynamics, and error models of a robot. The AIMs of the assembly configuration of 
modular robots facilitate the determination of optimal robot configuration for a 
specific task using combinatorial optimization techniques. We also presented here 
an approach to solve the task-optimal problem using evolutionary algorithms with 
customized genetic operators based on the AIM of the robot. On the practice and 
implementation of modular robots for industrial applications, we have developed a 
modular robot control and simulation software application, SEMORS (Simulation 
Environment for MOdular Robot Systems). In this software, only a small set of 
component database and kernel functions are kept in the robot controller, and the 
required robot models can be generated automatically. From the demonstration 
modular robot prototype, we confirm the advantage of using modular components 
in constructing the complex robotic workcell with different configurations. The 
plug-and-play kinematics, dynamics, and calibration robot models are also verified 
through the actual implementation in the robot controller and the simulation 
software. 


Future Perspective 

Due to the improvement of living standard in developing countries and ageing 
population in well-developed countries, manpower shortage in every aspect of the 
industry (manufacturing and non-manufacturing) is pervasive. Deploying adaptable 
robotic technology for productivity improvement and automation is imperative. 
For more than 20 years of R&D in modular robotics, a number of modular robotic 
systems have successfully entered the industry automation, education and enter- 
tainment markets. Schunk robotics represents a very good example of a highly 
integrated modular robot system. Dynamixel and DARwIn-Op Humanoid 
Robot from Robotis (Korea) represent novel robot actuator module design for 
highly integrated humanoid robot as well as a key robotic component for diverse 
robotic application development. Fego Mindstorms Robot Development Kit repre- 
sents the integration of modular robots and modular product design for creative 
education applications. SmartMotor is a successful example of making robot 
actuator module as a stand-alone intelligent motor module for wide industrial 
applications without integrated mechanical structures. These successful modular 
robotic systems indicate that modular design offers clear advantage in the areas of 
product variety, application variety, and creativity. 

However, the cost of developing and manufacturing such systems, no matter at 
the module level or at the system level, can still be a main concern from the system 
providers to the end users. The cost structure of a modular robot is highly associated 
with the design of the individual modules and the possible types of modular robot 
systems to be conceived. From the development history of FEGO bricks and 
Schunk PowerCube modules, it is clearly shown that the module design evolves 
from possessing complex to simple functions (cost down) and increasing module 
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varieties (enlarging user base). The module design evolution results in market 
expansion and wide adoption. Hence, for reconfigurable modular manipulator and 
mobile systems, how to make the module and system design meet the expectation of 
the end users is the most challenging R&D topic in the near future. 
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Abstract 

Cable-driven robots (CDRs) are a special class of parallel mechanisms in which 
the end-effector is actuated by cables, instead of rigid-linked actuators. They are 
characterized by lightweight structures with low moving inertia and large 
workspace, due to the location of the cable winching actuators at the fixed 
base of the structure, and thereby reducing the mass and inertia of the moving 
platform. CDRs also possess an intrinsically safe feature due to the cables’ 
flexibility, which allows CDRs to provide safe manipulation in close proximity 
to their human counterparts. This chapter will highlight the various research 
endeavors in the performance analysis of CDRs such as force-closure analysis, 
stiffness analysis, workspace analysis, and cable tension planning. Several case 
studies will also be presented to serve as illustrations on the application of the 
proposed performance analysis tools. 


Introduction 


Cable-driven robots (CDRs) are a special class of parallel mechanisms in which the 
end-effector is actuated by cables, instead of rigid-linked actuators (see Fig. 1). 
Alternatively known as “tendon-driven” or “wire -driven,” these CDRs are character- 
ized by lightweight structures with low moving inertia and large workspace. These 
are attributed to the location of the cable winching actuators at the fixed base of the 
structure (instead of being located at the moving joints, as typically found in rigid- 
linked mechanisms). In addition, cables are spooled onto winches which makes the 
workspace virtually unlimited. The cable winching system generally consists of a 
rotary actuation unit that is coupled to a spooling drum which winds/unwinds the 
cable (see Fig. 2a). Other forms of cable winching systems include actuators coupled 
to leadscrews that directly change the cable lengths without the use of a spooling 
dmm (see Fig. 2b). These advantages make the CDRs promising candidates for 
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Fig. 1 Typical cable-driven 
robot ( CDR ) configurations 



Cable 

Winch 


Single-bodied CDR 



Cable 

Winch 



End-Effector 


Multi-bodied CDR 



Rotational actuation using spooling drums 



Actuator 


Leadscrews 


Translational actuation using leadscrews 
(Adapted from Buckingham et al 2007) 


Fig. 2 Typical cable winching systems 


applications requiring high speed, high acceleration, and high payload. However, the 
flexibility of the cables limits CDRs to have lower stiffness and accuracy. On the plus 
side, this flexibility of the cables allows CDRs to have an intrinsically safe feature in 
the event of any collisions and allows them to be ideal candidates in providing safe 
manipulation within the human environment. As such, CDRs have received signif- 
icant interest in the past two decades, with applications ranging from the factory floor 
to offices and homes with close proximity to their human counterparts. 
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One of the earliest applications of CDRs was in the form of cable-driven 
articulated fingers (Salisbury and Craig 1982). This cable-driven concept was 
then extended to numerous applications such as load lifting and positioning 
(Albus et al. 1993; Dallej et al. 2011), coordinate measurement (Jeong 
et al. 1998; Thomas et al. 2005), aircraft testing (Lafourcade and Llibre 2002), 
paint stripping of aircrafts (Nist robocrane cuts aircraft maintenance costs), video 
capturing over large areas (http://www.skycam.tv), slim dexterous manipulators for 
confined space applications (Buckingham et al. 2007), humanoid arms (Yang 
et al. 2011; Chen et al. 2013), haptics (Morizono et al. 1997; Williams 1998), 
and neurorehabilitation (Homma et al. 2003; Mustafa et al. 2006; Rosati 
et al. 2009; Mao and Agrawal 2012) (see Fig. 3). Typically, CDRs can be generally 
classified into two main categories: (i) under-constrained and (ii) fully constrained. 
For an ft-DOF CDR driven by m cables, it is classified as under-constrained if m < n 
and classified as fully constrained if m > n. Under-constrained CDRs rely on 
additional constraining wrenches, such as gravity (Dallej et al. 2011; Pusey 
et al. 2004) or springs (Behzadipour 2009; Liu and Gosselin 2011; Mustafa and 
Agrawal 2012a), to control the specified end-effector’s degrees-of-freedom (DOF). 
However, a fully constrained CDR can control all its end-effector’s DOFs using the 
cables only. 

Unlike rigid links, cables exhibit a unilateral driving property, i.e., it can only 
exert a tensile force. Hence, analysis approaches obtained for rigid-linked robots 
cannot be directly applied, and such cable-driven robots require additional force 
considerations. As such, one of the earliest work to address this was by Ming and 
Higuchi (1994). They had mathematically proven that a single-bodied 6-DOF CDR 
required a minimum of seven cables in positive tension for it to be fully constrained. 
This paved the way for extensive efforts in force-closure analysis (i.e., achieving 
equilibrium with a set of positive cable tension under the influence of external 
wrenches) of planar and spatial CDRs with both under-constrained and fully 
constrained configurations (Pham et al. 2006a; Gouttefarde et al. 2006; Stump 
and Kumar 2006; Ferraresi et al. 2007; Diao and Ma 2008; Gouttefarde 2008; 
Hassan and Khajepour 2011). In addition to single-bodied CDRs, Mustafa and 
Agrawal (2012b) had also mathematically proved that an ft-DOF multi-bodied 
open chain CDR requires a minimum of n + 1 cables for the entire chain to be 
fully constrained. Force-closure analysis is aimed at evaluating the workspace, 
optimizing it and minimizing the number of cables. As one would realize, too 
many cables will result in control complexity, increase in system setup costs, 
and possible cable interference. As such, due to the unilateral driving property 
of cables, two classical theorems in convex analysis (i.e., Caratheodory and 
Steinitz Theorems) can be employed as guidelines to determine the lower and 
upper bounds of the minimal number of cables in order to effectively drive the 
CDRs (Murray et al. 1994). For an w-DOF CDR, Caratheodory ’s theorem 
implies that a minimum of n + 1 cables in positive tension is required, while 
Steinitz ’s theorem implies that at most 2 n cables in positive tension are 
required. In short, the most effective number of the driving cables ranges from 
n + 1 to In. If the number of cables is lower than n + 1, positive cable tension 
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For leg rehabilitation Homma 2003 For arm rehabilitation Mao 2012 

Fig. 3 Typical CDR applications 


cannot be achieved, unless with the aid of external constraining wrenches, like 
gravity or springs (in the case of under-constrained CDRs). If the number of 
cables is greater than In , too many redundant cables will make the driving 
scheme complicated and inefficient. 

This chapter will focus on the performance analysis tools for the design of CDRs 
and is organized as follows: Section “Mathematical Preliminaries” presents the 
mathematical preliminaries required in the kinetostatic analysis of CDRs. 
Section “Force-Closure Analysis” presents an overview of the research endeavors 
in force-closure algorithms, as well as generic force-closure algorithms for both 
single-bodied and multi-bodied CDRs. Section “Stiffness Analysis” presents a 
literature survey on the stiffness evaluation for CDRs, including stiffness modeling 
algorithms which are vital for CDRs as they suffer from rigidity issues due to the 
flexibilities of the cables. Section “Workspace Analysis” presents a literature 
survey on the workspace analysis for CDRs, as well as workspace representations 
and volume determination for CDRs with orientation capabilities. Several perfor- 
mance measures are also presented to evaluate the workspace of CDRs, which can 
then be utilized to optimize the design. Section “Cable Tension Planning” presents 
the literature survey and an algorithm to optimally generate the cable tensions 
during a trajectory. Finally, section “Summary” concludes with the salient points of 
this chapter. 


Mathematical Preliminaries 

This section presents the mathematical preliminaries for the modeling of 
CDRs. These preliminaries include the general kinetostatic analysis for both 
single-bodied (henceforth termed as “modular”) CDRs, as well as multi-bodied 
open chain CDRs (consisting of modular CDRs attached in series). Various 
theories in convex analysis and reciprocal screw are also presented to 
allow the readers to follow the proposed force-closure analyses in section 
“Mathematical Preliminaries.” 
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Kinetostatic Analysis for Modular CDRs 


This subsection presents a brief overview of kinematic analysis for modular CDRs 
and highlights the relevant theories in convex analysis which will allow the readers 
to clearly follow the proposed force-closure analysis of CDRs in section “Force- 
Closure Analysis of Modular CDRs.” The kinematic diagram of an ft-DOF modular 
CDR, controlled by m cables, is shown in Fig. 4. The moving platform is connected 
to the base through m cables. {K B } represents the fixed base coordinate frame, and 
{/C^} represents the moving platform frame. All quantities are represented with 
respect to frame { K B } unless there is a trailing superscript “S'” indicating that it is 
written in frame { } . The origin of frame {A^} with respect to frame { K B } is 
represented by position vector s, while the orientation of frame {K s } with respect to 
frame { K B } is represented by the rotation matrix R. 

cj 

Vectors (/ = 1, 2, . . ., m) represent the position vectors of the cable attachment 
points on the moving platform with respect to frame {K s }. Vectors bi denote the 
location of the cable exit points at the base. f e and m e are the external forces and 
moments, respectively. 

From the kinetostatic equilibrium analysis, the following is obtained: 


W 


AT 


( 1 ) 


where W 


f 


m 


G W is the external wrench, T 


[h 


t 


the cable tension vector, A 


[ai 


a 


n 


a inl C 9^ 


n 


nxm 


■ ■ t m ] T e 9T" is 
is the structure 


matrix associated with the driving cables, ai 


ui 


Rif x Uj 


is the cable vector 


O’ 


1,2, ... , m), and Uj 


bj-Rif-s 
bi— Rr?— s 


is the directional unit vector of the ith cable. 


Fig. 4 Kinematic diagram of 
an ft-DOF modular CDR 
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The structure matrix also relates the cable spooling velocity (i E 9T”) and the 

instantaneous velocity of the moving platform (X E DT ? ) which is described as 
follows: 


• rrr • 

I = -A r X 

Based on Fig. 4, the external wrench can be expressed as follows: 




m p \ - m p g - fe 
I p (b + co x (I p co) — m e 



where m p is the mass of the moving platform, g (=9.81 m/s ) is the gravitational 
acceleration, I p is the inertial tensor of the moving platform at the centroid, 
while v and co are the velocity and angular velocity at the centroid, 
respectively. 

A pose of a CDR is said to satisfy the force-closure condition if all the cable 
tensions can be made positive regardless of the external wrench W. The mathe- 
matical description is as follows: 


VW E 9T, 3(t u • • • , t m ) E [0, oo] : W = AT 



The necessary conditions to attain force closure are Rank[A] = n and m>n+ 1. 
The following statements are equivalent (Murray et al. 1994): 


• The pose is under force-closure. 

• The convex hull formed by the column vectors of A contains a neighborhood of 
the origin. 

• There does not exist a vector v E v ^ 0, such that v • a, > 0 for i = 1, . . . , m. 

• The column vectors of A are able to positively span 9\'\ 


The proposed methodology will utilize the condition “The column vectors of 
A are able to positively span for the force-closure analysis. In this chapter, a 
force-closure algorithm will be proposed using two propositions, which are 
related to force-closure analysis. The first proposition provides the requirement 
for force closure to be satisfied. The second proposition indicates the total 
number of checks that has to be performed to identify if a pose satisfies force- 
closure condition, and it also indicates the number of column vectors of 
A required for each check. 


Proposition 1.1 Let ai, a 2 , . . . , a„ be n linearly independent vectors obtained from 
column vectors of matrix A E 9\ nxm (m >n + 1) and a t = — (ai + a 2 + . . . + a w ). The 
force-closure condition is satisfied if and only if the column vectors a, = (/ = 1,2, 
. . . , m) can positively express a t such that: 
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m ( m \ 

y, A,a, = a , I A, > 0 and ^^0 (5) 

i=i V 1=1 / 

Sufficiency 

It is to prove that if a t can be positively expressed by a z = (/ = 1,2,..., m), then 
the force-closure condition will be satisfied. As a 1? a 2 , . . ., a zz are linearly indepen- 
dent, an arbitrary vector x E 91 /7 (x 7^ 0) can always be expressed as a linear 
combination of these vectors, such that 

x = viai + v 2 a 2 + . . . + v z7 a zz (6) 

By introducing a t into Eq. 6, the following is obtained: 

x = viai + v 2 a 2 + . . . + v n a w H- wa^ — wa^ 

= Vi&i + v 2 a 2 + . . . + v Z7 a Z7 + wa ? — w(ai + a 2 + . . . + a w ) (7) 

= (vi-fw)ai + (v 2 +w)a 2 + . . . + (v n +w)a n + wa^ 

where w is a positive scalar. By choosing a sufficiently large w (w > 0), all (v/ + w) 
> 0 (/= 1, • • •, n). Therefore, an arbitrary vector x can always be expressed as a 
positive combination of the column vectors of A. In this case, vectors a 1? a 2 , . . ., a n 
and a t are the minimum basis that positively span DT 7 . The sufficiency requirement 
is proven. 

Necessity 

Obviously, if a t E 9V 7 cannot be positively expressed by column vectors 
a, = (/ = 1, 2, . . ., m), then the vectors a z will not be able to positively span 9\ n . 
Therefore force closure cannot be attained. 

Proposition 1.2 Let a z (/ = 1, 2,. . ., m ) be the column vectors of A. Then the 
vectors a z (/ = 1,2,..., m ) positively span 9f 77 if and only if any vector y E 9V 7 can 
be positively expressed as a linear combination of at least one set of n linearly 
independent vectors from column vectors a z (/ = 1, 2,. . ., m). 

Sufficiency 

It is apparent that if any vector y can be positively expressed by at least one set 
of n linearly independent vectors from a z , then 91 77 can be positively spanned by a z 
(/ = 1, 2,. . .,m). 

Necessity 

For the necessary condition, it is to prove that if column vectors a z can positively 
span 9T 7 , then any vector y E 9T 7 can be positively expressed by at least one set of 
n linearly independent vectors from column vectors a z (/ = 1, 2,. . ., m). 

In order to prove the necessary condition, Caratheodory’s theorem for cones 
is employed (Stoer and Witzgall 1970). Let D = { d lv . ., d m } be a set of vectors in 
9T 7 . Their conical nonnegative linear combination is represented as vector z as 
follows: 
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m 

z =y>d„ 0 < A, efH (8) 

i=i 

The conical hull of D is the collection of all conical combinations of vectors 
from D such that 


coni(D ) 



m 

'y ] ^/Z/, z / G DO < G 93 
/'=! 




Note that the space positively spanned by a ? - (/ = 1,2,..., m) is the conical hull 
formed by i.e., com (A). 

Caratheodory’s theorem for cones (Stoer and Witzgall 1970): Let D be any 
subset of 93 ,? with dim (coni( D)) — r. If z G coni (D), then there exists a set of r 
linearly independent vectors [d / 9 . . ., d r ) in D such that z — coni(d \,. . ., dji). 

If the vectors a i(i = 1,2,..., m) can positively span 9V\ then dim(coni( A)) = w 
(i.e., the rank of matrix A must be equal to n). Therefore, the above theorem states 
that any vector y G D3 ,z can be positively expressed by at least one set of n linearly 
independent vectors obtained from the column vectors of A. The necessity require- 
ment is proven. 


Kinetostatic Analysis for Multi-Bodied CDRs 

This subsection presents a brief overview of kinematic analysis using reciprocal 
screw theory and highlights relevant theories which will allow the readers to clearly 
follow the proposed force-closure analysis of multi-bodied open chain CDRs (i.e., 
consisting of modular CDRs attached in series) in section “Force-Closure Analysis 
of Multi-Bodied CDRs.” 

Ball (1900) formally established the theory of screws in 1900 and applied it to 
the analysis of rigid-body motions of multiple DOFs. A screw motion was 
established to represent any instantaneous motion in 3-D space, in which the 
moving body simultaneously moves with both an angular and linear velocity, 
about and along the screw axis, respectively. Any system of forces and moments 
acting on a body can analogously be represented by a force and couple along the 
same axis. This screw axis has a pitch p, which is the ratio of linear to angular 

velocity (or moment to force). A unit screw, $, = (n; r x n + pn), is then defined by 

combining the pitch, the unit vector n along the screw axis, and the vector r from the 

origin to a reference point on the screw axis, which forms six scalar components. 

• 

The instantaneous motion of a body due to a twist 0j about a unit screw $ f is 

• /\ 

described by 0,$, = (&>, ; v, ), where is the angular velocity of the body and v, is 
the velocity of a point on the body that instantaneously coincides with the origin of 
the reference frame (Woo and Freudenstein 1970). Similarly, a wrench of magni- 

tude f applied about a unit screw on a body is described by /$, = (F;M), where 
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F is the resultant force and M is the resultant moment on the body. A zero pitch 
twist corresponds to a pure rotation of the body about the screw axis, while an 
infinite pitch twist corresponds to a pure translation of the body along the screw 
axis. Similarly, a zero pitch wrench corresponds to a pure force on the body along 
the screw axis, while an infinite pitch wrench corresponds to a pure moment about 
the screw axis. 

From the viewpoint of twists and wrenches, two screws, $i = (itm; n lx ) and 
$2 = (n 2 n; n 2x ), are considered reciprocal if every wrench along $ x has zero work 
done when applied to a rigid body constrained to a twist about $ 2 . This is mathe- 
matically described as follows: 


$1 O $2 — n l 


n 2 x +n 2 |i • n lx 




where © represents the reciprocal product of two screws (also known as the Klein 
quadratic form (Karger and Novak 1985)). 

Reciprocal screw theory became one of the most important and widely used 
concepts in the kinematic analysis and synthesis of mechanisms (Hunt 1978; 
Sugimoto and Duffy 1982; Roth 1984; Agrawal and Roth 1992) due to its ability 
to gain geometric insights of mechanisms with the duality between sets of con- 
straint forces and sets of motion. There have also been numerous efforts in 
determining reciprocal screws, both analytically and numerically. Obtaining a 
reciprocal screw to a system of screws is equivalent to finding the null space of a 
matrix composed of the screw system. Numerical approaches to determine the null 
space are typically based on the Gauss-Seidel elimination approach (Strang 1976), 
which are applicable to any type of screw systems. Analytical approaches include 
using geometric relations (Tsai 1999), which are valid for screw systems with 
simple geometric relations, such as planar mechanisms with revolute or prismatic 
joints. However, for more complex screw systems such as spatial mechanisms, it is 
challenging to obtain geometric relations. Instead, approaches have been proposed 
to determine the null space analytically, such as the use of cofactors (Dai and Jones 
2002) or solving a set of algebraic equations of the corresponding system of screws 
(Kim and Chung 2003). 

In the context of kinematic analysis of an open chain, numerous theorems have 
been established for different types of open chain, i.e., proper (= d-DOF), deficient 
(< d-DOF), and redundant (> d-DOF), where d is the dimension of the end-effector 
space (Agrawal and Roth 1992). However, in view of brevity, this section will 
present the theorems for “proper” open chains, i.e., chains with just enough joints to 
fully position and orient the end-effector. This means that there will be only three 
DOFs for a planar open chain and only six DOFs for a spatial open chain (see 
Fig. 5). Readers may refer to (Mustafa and Agrawal 2012b) for details on the 
complete approach for all types of open chains. 

Based on the theoretical contributions by Agrawal (1990), the following presents 
the main theorem along with several propositions that will be used in section 
“Force-Closure Analysis of Multi-Bodied CDRs” for the force-closure analysis of 
multi-bodied CDRs: 
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Fig. 5 Multi-bodied CDRs 
with “proper” open chains 
(Mustafa and Agrawal 2012b) 




Theorem 1 For a “ proper ” open chain, the reciprocal screws , $ r , are unique and 

form a linearly independent set that spans the screws of the relevant wrench system. 

/\ 

Note: $*> is a special screw such that it is reciprocal to all screws of the chain except 
the ith one. 

Proposition 2.1 Any external wrench, $ ext , can be uniquely expressed as a 

/N /\ 

linear combination of the reciprocal screws, i.e., $ ex t = >vi$i r + . . . + 
where d = 3 for planar and six for spatial motions. 

Proposition 2.2 When an external wrench, $ ex t, acts on the end-effector about a 
reciprocal screw $ /r , the entire chain can be kept in equilibrium with actuator torque 

at joint i with magnitude t z - = — Wj ^$ zr © $,^ . This establishes a one-to-one corre- 
spondence between a wrench along $ ir on the end-effector and joint actuator 
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torque t z . The chain can then equilibrate linear combinations of only those wrenches 
for which there are corresponding actuators present at the joints. 

Proposition 2.3 When an $ ext acts on an intermediate link j in the chain, the 

wrench can be resolved according to Proposition 2.2, and joint torques are only 

/\ /\ 

+1 y, . . . , $dr do not require actuator 

torques to equilibrate across joints 1 to j. 

Proposition 2.4 In order to equilibrate a “proper” chain under the influence of 
external wrenches on every link, the net actuator torque at joint i is basically a 
superposition of torques required to equilibrate wrenches on the individual links j, i. 

e> ’ T/ = y;_i ~ w ji ($*> © • 

Proposition 2.5 Similar to external wrenches, cable forces acting on the various 
links can be resolved, and their net actuator torques can be determined according to 
Propositions 2. 1-2.4. However, for cable-driven open chains, cables act as joint 
actuators, and hence their net actuator torques at the various joints can be used to 
equilibrate the joint torques due to the external wrenches. 


required at joints 1 to j, since wrenches along 


Force-Closure Analysis 

This section presents the algorithms developed to analyze force-closure for CDRs. 
These algorithms will include for both modular CDRs, as well as multi-bodied open 
chain configurations of the CDRs. 


Force-Closure Analysis of Modular CDRs 

A variety of force-closure algorithms have been proposed for checking the positive 
cable tension status of n-DOF fully constrained CDRs driven by m cables (where 
m > n + 1). Ming and Higuchi (1994) showed that a pose of a CDR satisfies the 
force-closure condition when at least one solution in the null space of A is strictly 
positive. This method becomes complex for highly redundant CDRs. By checking 
the solutions in the null space of A, Ferraresi et al. (2007) derived a method to 
determine if a given pose is under force closure for CDRs with up to three 
redundant driving cables, i.e., n+l<m<n + 3. Gouttefarde et al. (2006) 
determined the force-closure workspace for a 6-DOF manipulator by delineating 
the workspace boundary. Murray et al. (1994), Stump and Kumar (2006), and Diao 
and Ma (2008) checked the existence of supporting/separating hyperplane to verify 
if a pose satisfied the force-closure condition. Diao and Ma (2008) modified 
Murray’s method (1994) by using the generalized cross product so that the method 
can be used for CDRs of up to six DOFs. Their methods had difficulties to deal with 
degeneracies, i.e., when the rank of the vectors, which were used to form the 
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hyperplane, is not equal to n— 1. However, Gouttefarde (2008) proved that the 
degeneracies can be ignored while checking for force-closure condition. Pham 
et al. (2006a) developed the recursive dimension reduction algorithm for force- 
closure analysis. The algorithm was not computationally efficient as it had to 
decompose the ^-dimensional space into numerous one-dimensional subspaces to 
check the tension status. Hassan and Khajepour (2011) employed Dykstra’s alter- 
nating projection algorithm to determine the workspace that is able to counter a 
particular wrench. 

In order to address the abovementioned shortcomings, the authors (Lim 
et al. 2011) proposed an alternative approach for force-closure analysis that is 
systematic, easy to implement, and satisfies both the necessary and sufficient 
conditions for force closure. 

Proposed Algorithm 

Let G h = {gx, • • •, g n } ( h = 1, 2, • • •, ,n C n ) be the sets of n linearly independent 
vectors formed by the column vectors a z (i = 1, 2, • • •, m). Based on Proposition 1.1 
and Proposition 1 .2, the force-closure condition is satisfied if and only if there exists 
at least one set of G /z that satisfies the following: 


n I n \ 

a, = kfgf { g f g G ; , , kf > Oandy^fc/ ^0 I (11) 

/= i V /= i / 

From Eq. 11, it can be stated that for a CDR’s pose with an n x m structure 
matrix A, force closure is achieved if and only if there exists a vector a t = — (ai + a 2 
+ • • • + a„) which can be positively expressed by at least one set of n independent 
column vectors from a z (i = 1, 2, • • •, m). Based on the flowchart shown in Fig. 6, the 
procedure of the proposed algorithm is as follows: 

1. Determine the structure matrix A E y\ uxm based on the pose of the moving 
platform. 

. T’ 

2. Perform rank analysis for matrix A. If the determinant of A A is very close to 
zero, the force-closure condition will not be satisfied. 

3. Choose a set of n linearly independent vectors from the column vectors of A. 
Since matrix A has rank n , this procedure will always be possible. Without loss 
of generality, let the n linearly independent vectors be a l9 a 2 , • • •, a w . 

4. Formulate a t with this set of vectors such that a t = — + a 2 + • • • + a n ). 

5. Obtain a combination of n vectors from the column vectors of A to form an 
n x n matrix G such that the columns of G are the n selected column vectors of A. 

6. Ensure that rank(G) = n ; Else skip Steps 7 and 8. 

'T' 

7. Determine vector k(k = [kik 2 • • • k n ] ) as follows: 

= G'a, 
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Input : 

Structural matrix (A) 



Choose a set of n linearly 
independent vectors [a 1 ? a 2 ,... a„] 

from A 


I 

Let 

*L = -(a i + a 2 + --- + a «) 


I 

Choose a combination of « 
column vectors from A 

l 

Use the n column vectors to 
form the n x n matrix G 



Fig. 6 Flow chart of the proposed force-closure analysis algorithm for modular CDRs (Lim 
etal. 2011) 


2184 


S.K. Mustafa et al. 


8. Check whether k satisfies Eq. 1 1. If kf > 0 and X &/ 7^ 0 for f = 1,2, • • •, n, then 
the pose satisfies the force-closure condition, and the check for this moving- 
platform pose stops here. Else, proceed to the Step 9. 

9. Go to Step 3 until all the m C n combinations are examined. If none of the m C n 
combinations passes Step 8, then the moving-platform pose is not a force-closure 
pose. Note: there are ni C n — 1 combinations because the set of vectors that forms 
a t need not be tested. 

It should be noted that this algorithm ignores those n x n matrices without full 
rank, and the computation stops once the force-closure condition is satisfied. The 
check on the set of vectors that forms a t can be skipped because this set will not 
satisfy Eq. 1 1 . In this case, the number of n x n linear systems to be analyzed for 
force-closure condition is at most ( n C n — 1). 

Evaluation of Computational Complexity 

The efficiency of the algorithm is verified by calculating the maximum number of 
operations required to check for force-closure condition at a moving-platform pose. 
In particular, the number of operations required by three generic force-closure 
algorithms, i.e., the proposed algorithm, the recursive dimension reduction algo- 
rithm (Pham et al. 2006a), and the method proposed in (Diao and Ma 2008), is 
determined and compared. In order to verify if the pose satisfies the force-closure 
condition, Diao and Ma (2008) checked the existence of supporting/separating 
hyperplane, while Pham et al. (2006a) checked the decomposed one-dimensional 
subspaces. The recursive dimension reduction algorithm and the method proposed 
in (Diao and Ma 2008) stop the computation once the pose is deemed not a force- 
closure pose. On the other hand, the proposed method stops the computation once 
the pose is proved to satisfy force-closure condition. Therefore, the recursive 
dimension reduction algorithm and the method proposed in (Diao and Ma 2008) 
are more efficient for poses that do not satisfy the force-closure condition, while the 
proposed method is more efficient for force-closure poses. When checking for the 
force-closure condition of a given pose, it is unknown whether the pose satisfies 
force-closure condition. Therefore, the maximum number of operations required by 
each algorithm is used to evaluate the algorithms ’ efficiency so that the comparison 
does not favor any algorithm. 

The recursive dimension reduction algorithm uses Gaussian elimination to 
reduce the dimension of matrix A. The method proposed in (Diao and Ma 2008) 
uses matrix determinants to generate the generalized cross product. The proposed 
method solves a system of linear equations. In fact, deriving matrix determinants 
and solving linear equations can all be done using Gaussian elimination. Strang 
(1976) provided the number of operations required to solve the matrix determinants 
and linear equations using Gaussian elimination. For Gaussian elimination, each 
division or each multiplication/subtraction is considered as an operation. 

Proposed Algorithm: Each check solves a linear system which consists of an 
n by n matrix. Solving this linear system using Gaussian elimination requires (1/3) 

q o 

(n —n) + n operations. There are a maximum of m C n — 1 checks. 
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Algorithm proposed in (Diao and Ma 2008): Each check generates a normal 
vector using n— 1 column vectors from matrix A. The dot products between the 
normal vector and the remaining vectors are verified for force-closure condition. 
Each normal vector is derived from the matrix determinants of n sets of (n— 1) by 
(n— 1) matrices. To obtain each matrix determinant, it will require (1/3) 

Q 

[(n— 1) — (n— 1)] + {n-2) operations. The normal vector will undergo dot product 
with ( m—n ) vectors which requires n(m—n) operations. There are a maximum of 
m C n -x checks. 

Recursive dimension reduction algorithm (Pham et al. 2006a): It is needed to 
reduce the dimension of an n by m matrix through Gaussian elimination. After each 
reduction, there are [(m\)/(m— n + r— 1)] new matrices in the (r— 1) dimension 
where r is the dimension before the reduction process. For every matrix, the 
dimension reduction process requires (m—n + r) (r— 1) operations. 

At a given pose of an n-DOF modular CDR controlled by m cables, the 
maximum number of operations required by each algorithm to check for force- 
closure condition is shown in Table 1. A 6-DOF modular CDR needs a minimum of 
seven cables (Murray et al. 1994). The maximum number of operations required by 
the three algorithms to check each pose of the 6-DOF CDR with 7-12 cables is 
shown in Fig. 7. The results show that the maximum number of operations required 
by the proposed method is always less than the other two algorithms. 


Table 1 Maximum 
number of operations 
per pose 



Number of operations per pose 

Proposed 

CC n -mm)(n 3 -n)+n 2 ] 

(Diao and Ma 

m C n ,[(n/3)[(/j— l) 3 — («— 1) + ( n-2 )] + n 

2008) 

(m—n)] 

(Pham 
et al. 2006a) 

X r 2 { Km ! )/(m-n + r- 1 ) ! ] [(m-n + r) (r- 1 )] } 


Fig. 7 Maximum number of 
operations per pose for a 
6-DOF modular CDR (Lim 
et al. 2011) 
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Simulation Studies and Results 

The force-closure workspace consists of all the moving platform poses which 
satisfy force-closure condition. Figure 8 shows a 6-DOF spatial CDR driven by 
eight cables. The dimension of the base is 1 x 1 x 1 m, and the moving platform is a 
0.3 x 0.2 x 0.1 m block. The Z-Y-X Euler angles, represented by a, /?, and y, 
respectively, are employed to define the orientation of the platform frame with 
respect to the base frame. Figures 9 and 10 show examples of the translational 
workspace (with fixed orientation) and the orientation workspace (with fixed 
position) generated, respectively. The translational workspace is computed with a 
step size of 0.01 m. The orientation workspace uses a step size of 5° to find the 
limits of the Euler angles. Within the limits, the workspace is computed with a step 
size of 0.5°. 

The simulation result produced by the proposed algorithm is the same as those 
by (Pham et al. 2006a) and (Diao and Ma 2008). The algorithms are implemented in 
Matlab on an Intel P4 3.20 GHz with 512 MB RAM. The computational time for the 
workspace analysis is compared among the various algorithms (see Table 2). The 
table shows that the computation time required by the proposed algorithm is always 
less than the recursive dimension reduction algorithm (Pham et al. 2006a) but is 
comparable to the algorithm proposed in (Diao and Ma 2008). This algorithm can 
also be extended to generate the static workspace of under-constrained CDRs, and 
readers may refer to (Lim et al. 2011) for more details. 


Force-Closure Analysis of Multi-Bodied CDRs 

In the previous section, a force-closure algorithm was presented for generic ft-DOF 
modular CDRs. It has also been proven that it requires a minimum of n + 1 cables 
with positive tension to fully constrain it. However, force-closure analysis of open 
chains driven by cables is still an open question. This subsection will present the 
force-closure algorithm for the case of an w-DOF multi-bodied open chain CDR. 


Fig. 8 A 6-DOF fully 
constrained CDR driven by 
eight cables 
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Fig. 9 Force-closure translational workspace for a fully constrained CDR with a = 2, j3 
y = 2 (Lim et al. 2011) 


1 , 



2188 


S.K. Mustafa et al. 



-10 -8 -6 -4 -2 0 2 4 6 8 10 -10 -8 -6 -4 -2 0 2 4 6 8 10 -10 -8 -6 -4 -2 0 2 4 6 8 10 

a 


Fig. 10 Force-closure orientation workspace for a fully constrained CDR with x = 0.5, y = 0.5, 
z = 0.5 (Lim et al. 2011) 
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Table 2 Computational time for force-closure analysis using the various algorithms 


Translational 

workspace 

Limits 

Time (s) 

«(°) 

PC) 

rC) 

x(m) 

y(m) 

z(m) 

(Pham 
et al. 2006a) 

(Diao and 
Ma 2008) 

Proposed 

0 

0 

0 

0-1 

0-1 

0-1 

11539 

246 

184 

2 

-1 

2 

0-1 

0-1 

0-1 

8120 

206 

185 

Orientation 

workspace 

Limits 

Time (s) 

x(m) 

yim) 

z(m ) 

a(°) 

PC) 

K°) 

(Pham 
et al. 2006a) 

(Diao and 
Ma 2008) 

Proposed 

0.5 

0.5 

0.5 

-15 to 15 

-30 
to 30 

-20 
to 20 

16307 

511 

495 

0.6 

0.8 

0.4 

—5 to 5 

-20 
to 30 

-15 
to 15 

4641 

146 

126 


As explained in section “Kinetostatic Analysis for Multi-Bodied CDRs,” the 
basic idea for the kinematic analysis of multi-bodied “proper” open chain CDRs is 
to express individual wrench screws as linear combinations of the reciprocal screws 
and determine the individual joint torques. The net required torques at each joint is 
then calculated based on the superposition of the scalar components in their 
respective reciprocal screws. The key distinction for force-closure analysis of 
cable-driven open chains is to treat external wrenches as requiring joint torques, 
while cable wrenches as providing joint torques (i.e., Proposition 2.5). In other 
words, under static equilibrium condition, the joint torques required by external 
wrenches must be equal to the joint torques provided by the cable forces. Force 
closure is achieved if the set of cable tensions is determined to be positive. 

Proposed Algorithm 

The proposed algorithm consists of five major steps (Mustafa and Agrawal 2012b): 

1. Determine the various screws (i.e., twists, reciprocal wrenches, external 
wrenches, and cable wrenches). 

2. Determine the joint torques required to sustain the external wrenches by expressing 
these external wrenches as linear combinations of the reciprocal wrenches. 

3. Determine the joint torques provided by the cable forces by expressing these 
cable forces as linear combinations of the reciprocal wrenches. 

4. Equilibrate the joint torques between the external wrenches and the cable forces. 

5. Determine force closure at a particular pose by checking if there exists a set of 
positive cable tension. 

As shown in Fig. 11, the following notations will be used in this section: 

• represents the unit screw for the twist at the jth joint. 

• $j r represents the unit screw for the reciprocal wrench of the jth joint. 
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Note: ‘0’ represents the base. 


Fig. 11 Screw notations of an ^-linked open chain CDR (Mustafa and Agrawal 2012b) 


• $tj-k represents the unit screw for the ith cable force acting from the jth link to 
the kth link and j ^ k. 

• $ext j represents the screw for the external wrench acting on the jth link. 

• tf represents the tension scalar of the ith cable. 

The proposed algorithm is now illustrated using a multi-bodied CDR with a 
“proper” open chain. As shown in Fig. 12, the spatial chain example is of an S-R-U 
configuration, driven by seven cables (Note: S, R, and U indicate a spherical, 
revolute, and universal joint, respectively). 

Step A: Determining the Various Screw Representations 

From Fig. 12, the unit screws for the various pure rotation twists (i.e., zero pitch) 
with respect to the inertial frame are described as follows: 




n x ni 



where h[ is the unit direction vector of the ith screw axis with respect to the inertial 
frame, and rj is the positional vector from the origin of the inertial frame to a point 
on the ith screw axis (which is taken at the joint center locations). 

For a “proper” spatial cable-driven open chain, it will have six linearly indepen- 
dent reciprocal wrenches (based on Theorem 1). Unlike the planar case, it is 
challenging to identify the reciprocal wrenches of spatial open chains through 
geometric observations. For the seven cable S-R-U spatial open chain, the zero- 
pitch wrenches of Twists 4—5 (i.e., $ 4r — $ 6r ) can be easily identified through 
geometric observations. Readers can refer to a publication by Tsai (Tsai 1998), 
which gives a clear explanation of geometrically identifying reciprocal screws of 
basic kinematic pairs and common kinematic chains. As shown in Fig. 13, $ 4r 
passes through the centers of the spherical and universal joints. For $ 5r , it passes 
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Fig. 12 


Screw representations of a spatial S-R-U open chain CDR (Mustafa and Agrawal 2012b) 


/\ 



Fig. 1 3 Determining the reciprocal wrench for Twist $ 4 through geometric observations (Mustafa 
and Agrawal 2012b) 


through the center of the spherical joint and intersects screw axes $4 and $ 5 , while 
$ 6r passes through the center of the spherical joint and intersects screw axes $4 and 
$ 5 , as shown in Fig. 14. 

For the remaining reciprocal wrenches (i.e., $ lr — $ 3r ), it will be determined 
numerically (Strang 1976), as it is challenging to find it geometrically. Numeri- 
cally, this is a simple problem of determining the single column null vector space. 
Based on the reciprocal product of two screws (i.e., Eq. 10), for each $ /r , five 
equations with six unknowns are formed as follows: 

$j O Sir = 0 (13) 

where © represents the reciprocal product of two screws as defined in Eq. 10, 
j = {1, 2, 3, 4, 5, 6 } and 7 ^ i. 
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This is the intersection point of 
the plane with screw axis $ 6 . 

This plane contains the centre of the 
spherical joint and screw axis $ 4 . 



End- 

Effector 


This is the intersection point of 
the plane with screw axis $ 5 . 


Fig. 14 Determining the reciprocal wrenches for Twists $ 5 and $ 6 through geometric observa- 
tions (Mustafa and Agrawal 2012b) 

Rearranging the five equations based on Eq. 13, the following matrix represen- 
tation is obtained: 


$4 Sir = 0 (14) 

where $ E 91 5x6 consists of an assembly of the five joint twists as its row vectors 
(except $/) and A represents the elliptical polar operator (Lipkin and Duffy 1985). It 
is a transformation matrix that interchanges the primary part (i.e., the first three 
elements) with the secondary part (i.e., the last three elements) of a screw. 

From Eq. 14, the solution for $ ir is obtained as follows: 

A Si r = N(S) (15) 

where N($) E 91 6xl is the null space of $ as Eq. 14 is an under-constrained system 
(i.e., five equations with six unknowns). Note: The solution for Eq. 15 will require 
the interchanging of the last three elements with the first three elements to obtain $ /r 
in its original form. 

Step B: Determining the Joint Torques Required to Sustain the External 
Wrenches 

Based on Proposition 2.1 , the external wrenches $ ext / for each link can be uniquely 

expressed as a linear combination of the six reciprocal screws, $i r to $6 r , and are 
described as follows: 


AtLinkl : $ ex ti 


H^$lr + 


• + H^$6, 


At Link 2 : $ ex t 2 


r + 


• + W^ e 2 $ 6 / 


+ 


i (6) 

' W e3 ^ 6 / 


$ 


(16) 

(17) 


At Link 3 : $ ex t 3 


(18) 
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or expressed in its equivalent matrix representation form 


$ext j 





where $ r 



G 6 x 6 and w ej = 



• • ; wfy] G 6 x 1 are the scalar 


components of $ ex t y on the various reciprocal screws. 

Since $ r is a square matrix, its inverse is easily obtained (provided rank ($ r ) = 6, 
i.e., non-singular pose), and the solution for w ej is obtained from Eq. 19 as follows: 



Based on Propositions 2.3 and 2.4 , the net joint torques required are determined 
as follows: 


Tl = - ( 'w ( e ? + + W^) ($lr © $i) 

r 2 = - + W { £ + W*3 } ) ($2r © $ 2 ) 

73 = - (wfj + W { $ + tt^) ($ 3 , © $ 3 ) 

T 4 = - (w^ + H$) ($ 4j - © $ 4 ) 

T5 = — ($5,- 0 $5) 

T6 = — ($6 r © $g) 


( 21 ) 

(22) 

(23) 

(24) 

(25) 

(26) 


Step C: Determining the Joint Torques Provided by the Cable Wrenches 

Based on Proposition 2.1, the ith cable forces acting from the jth link to the kth link 

(i.e., $i-j-k ) are a zero pitch wrench and can be uniquely expressed as a linear 
combination of the six reciprocal screws. The linear combinations of the various 

$i:j-k acting on each link are described as follows: 

AtLink 1 : (For i = 1, . . . , 7; k = 0 and 2) 

U • $/: 1 -k = P[l k %lr + • • • + fil\_k$6r 


( 27 ) 
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At Link 2 : (For i = 1,2,4, 5,7;/: = 1 and 3) 


U ' S/;2— /c — Pu2-k$\r + • • • + P^l-k^r 


At Link 3 : (For i = 2,5,7 ; jfc = 2) 

U ' %: 3 -k = /^ : 3-fc$lr + • • • + PrJ-k^r 


or expressed in its equivalent matrix representation form 


(28) 

(29) 


U ' $/:/-* — $r -ft i:j-k (20) 

where f) hj _ k = [Ptj- k , . . G lK 6xl are the scalar components of (t, ■ $ r ,j-k ) 

on the various reciprocal screws. 

The solution for Ptj-k is obtained from Eq. 30 as follows: 



Based on Propositions 2. 3-2. 5, the net torque on joint 1 consists of the super- 
position of all Prj-k s from all three links. This is the same for joints 2 and 3, except 
it is p\ : j- k an d respectively. The net torque on joint 4 consists of the 

superposition of all p\. y^s from links 2 and 3. Lastly, the net torque on joint 
5 (and 6) consists of the superposition of all pf] j _ k s (and pf\ j_ k s) from link 

3. Replacing $~ x by b i: j_ k (=[b^jL k ; . . .; b\fjL k ]) 9 the net joint torques provided 
by the cables are given as follows: 


/ 



(t\ ' a\\ + t2 • Cl 12 + • • • + tq • ay]) 



$1 r © $1 




/ 



(t\ • (221 + h • ^22 + • • • + tl ' Clll) 



$2r O $2 




T 3 — (t\ • (231 + t2 ' (232 + • • • + t'j • (237) ^$3,- © $3 J 



/ 



(t2 ' ^42 + ' ^44 + t5 * (245 + © 





$4r © $4 



/ 



(h ' ®52 + ^5 ' ^55 H~ ^7 ' ^57) 



$5r © $5 





(^2 ' <262 + ts * <265 + tl ' < 267 ) 



$6r © $6 



(37) 
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where 


a n = 

jSX) n 

b 1 : 1 — 0 ’ <212 

^2:1-0’ ^13 

^3:1-0’ ^14 

O 

- — " •'3" 
II 

<215 

hi i) n 

^5:1-0’ 

z,(i) ^ 

^ 6 : 1 - 0 ’ ^17 

o (1) 

^7:1-0 


<221 

J 2 ) „ - 

Ol:l _ 0 .«22 

J 2 ) „ - 

^ 2 : 1 — 0 ’ ^23 

4:1-0.024 = 

b {2) 

u 4:l-0 

<225 

1 ,( 2 ) _ _ 
^5:1-0’ ^26 

hi 2 ) ~ 

O 6:1 _ 0 ,a27 - 

b i2) 

U 7:l-0 


<231 

/, (3) „ - 
01:1-0.032 - 

II 

£2 

0 

LO 

OJ 

1 1 

II 

Ld 

O 

Ld 

-P^ 

II 

b o) 

u 4:l-0 

<235 

J3) „ - 

^5:1-0’ ^36 

II 

O 

Ld 

-d 

1 1 

b {3} 

U 7:l-0 


<242 

1,(4) n 

^ 2 * 2—1 ’ ^44 

1,(4) 

^ 4 - 2 — 1 ’ ^45 

^ 5 : 2 — 1 ’ 047 = 

^(4) 

^7:2—1 

<252 

J5) - . 

^2:3-2’ a 55 

b 5 : 3 — 2 ’ ^57 

b i5) 

u 7:3-2 


<262 

r ( 6 ) 

^ 2 : 3 - 2 ’ ^65 

r ( 6 ) 

^ 5 : 3 - 2 ’ ^67 

b {6) 

u 7:3-2 



Note : the terms above have been reduced to their simplified form as b i:j _ k = —b i:k _j 
and negate each other. 

Step D: Equilibrating the Joint Torques 

As mentioned in Proposition 2.5 , external wrenches require joint torques, while 
cable wrenches provide joint torques. By equilibrating Eqs. 21—26 with 
Eqs. 32—37, respectively, the resultant equations can be combined and expressed 
in a matrix representation form as follows: 


AT 



where 


W 


A 


w 


w 


w 


( 1 ) 

el 

(2) 

el 

( 3 ) 

el 


+ w[ 2 * + W 
+ W ( e 2 } + W 

+ wf 2 } + W 


w 


( 4 ) 


e2 


+ W 


( 1 ) 

(2) 

( 3 ) 

e3 


( 4 ) 

e3 


W 


W 


( 5 ) 

e3 

( 6 ) 

e3 


ai,a2,a3,a4a5,a6,a7 


G93 


6 x 1 


T 


~a n 

<212 

< 2 1 3 

<214 

<2l5 

< 2 l 6 

<217 

<221 

<222 

<223 

<224 

<225 

<226 

<227 

<231 

<232 

<233 

<234 

<235 

<236 

<237 

0 

<242 

0 

(244 

<245 

0 

<247 

0 

<252 

0 

0 

<255 

0 

<257 

_ 0 

<246 

0 

0 

<265 

0 

<267 _ 

> 1 ; t 2 ; 



1,1 G sk 7x1 




E93 


6x7 
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Step E: Determining Force Closure at a Particular Pose 

A single rigid-bodied CDR is said to achieve force closure at a particular pose if and 
only if any arbitrary wrench applied to the moving platform can be sustained by a 
set of positive cable forces. For the case of multi-bodied open chain CDRs, external 
forces will act not only at its end-effector but also at its intermediate links. So force 
closure for open chain CDRs is achieved if and only if any arbitrary external wrench 
applied to its end-effector and its intermediate links can be sustained by a set of 
positive cable forces. This relationship between the external wrenches and cable 
forces acting on the spatial S-R-U cable-driven open chain is reflected in Eq. 38, 
and W represents the 6-D orthogonal space of force and moments, which is spanned 
by the column vectors a z . The force closure for the spatial case can be determined 
by rearranging Eq. 38 as follows: 

T = A # .W + N(A).X (39) 

±L rp rji 1 

where A (=A . (A. A ) _ ) is the pseudoinverse of A, N{ A) represents the null 
space of A, and A represents a 1 -D vector. 

Due to the positive cable tension constraint, Eq. 39 is now represented as seven 
linear inequalities. 


A # .W + A(A).A > 0 (40) 

The spatial S-R-U open chain achieves force closure at a particular pose, if there 
exists a common interval, F A , bounded by these seven linear inequalities. 

Simulation Studies and Results 

Figure 15 presents the geometric parameters and the external wrenches for the 
spatial S-R-U “proper” open chain CDR. The external wrenches are just the weight 
of the individual links, i.e., 1 kgf, acting from the center of each link. Figure 16 


All Length dimensions are in millimeters. 



030 


End- 

Effector 


1 kgf 


1 kgf 


1 kgf 


Fig. 15 Geometric parameters of the spatial S-R-U open chain CDR at home pose (Mustafa and 
Agrawal 2012b) 
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Configuration S3 Configuration S4 


Fig. 16 Various cable routings of the spatial S-R-U open chain CDR 


Table 3 Force-closure results of spatial S-R-U open chain CDR with various cable routings 
(Mustafa and Agrawal 2012b) 


Configuration 

Pose a 

Determine T using linprog 


\0\, 02, 03, 9 A, 0 5 , $6l 

[h;t 2 ;t 3 ;t 4 ;t 5 ;t 6 ;t 7 ] 

E 

[20°,15 o ,3 o ,18 o ,5°,20 o ] 

[22.6; 52.3; 29.6; 1.0; 23.2; 33.3; 37.9] 

F 

[-30°, -15° -15°, 25°, 30°, 10°] 

[48.3; 8.8; 7.1; 128.1; 202.3; 115.6; 1.0] 

G 

[0° ,8° ,0° ,4° ,4° , — 1 6° ] 

[1.0; 202.6; 72.5; 279.9; 236.3; 113.7; 
176.3] 


d 0\—0 3 are based on the moving ZYX Euler Angles at the spherical joint, 0 4 is about the y axis at 
the revolute joint, while 0 5 and 0 6 are about the moving z and y axes at the universal joint 


presents the various cable routing configurations of the spatial S-R-U open 
chain CDR. 

Configurations S1-S4 were selected to represent several general cases one may 
encounter. For each pose of the various configurations, force closure was analyzed 
based on the linear inequality approach (Oh and Agrawal 2005). The lower and 
upper cable tension bounds were also set as 1 N and 300 N, respectively. 

Table 3 presents the force-closure results of the spatial S-R-U open chain CDR. 
Randomly chosen poses (within the joint motion constraints) for Configurations SI, 
S2, and S3 were able to achieve force closure. However, for Configuration S4, it 
was unable to achieve force closure in its entire range of motion. This is intuitively 
expected as n cables attached to the n -DOF segments were definitely unable to 
provide the bilateral force/moment. This configuration was deliberately chosen to 
demonstrate the ability of the proposed methodology to analyze such infeasible 
configurations. 
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Stiffness Analysis 

The flexible nature of cables causes CDRs to have relatively low stiffness and 
thereby significantly affecting the positional accuracy. For stiffness analysis, the 
pre-tension of cables is assumed to be large enough for the cables to be in tension 
and in the linear region of the stress-strain curve. In principle, a finite-element 
analysis will reveal more or less the complete information about the mechanism’s 
elastic properties. However, such finite-element analyses are not only computation- 
ally expensive and time-consuming, but they are also not suitable for design 
iteration and optimization. Hence, kinematic approaches are preferred without 
resorting to computationally intensive procedures. One of the first few works in 
the field of CDRs to address stiffness analysis was by Kawamura et al. (1995), 
(Morizono et al. 1998). These works mainly investigated the influence of cable 
pretension on the stiffness of a CDR with the aim of addressing the issue of 
vibration. Choe et al. (1996) classified the stiffness of a cable in tension into two 
aspects, one along the cable direction and another orthogonal to the cable direction. 
The stiffness along the cable direction is contributed solely by the cable stiffness, 
while the stiffness orthogonal to the cable direction is a function of the cable 
tension. The stiffness in these two directions is independent of each other. 
Verhoeven et al. (1998) formulated a simplified stiffness matrix which was a 
function of the cable stiffness and cable length and analyzed its eigenvalues to 
determine the nearness to singularity poses. Behzadipour and Khajepour (2006) 
proposed a four-spring model to represent the stiffness of each cable and calculate 
the total stiffness matrix of a CDR. This was then subsequently used to determine 
the stability of the moving platform (adapted from the work done by Svinin 
et al. (2000; 2001) on rigid-link gough-stewart platforms). Yu et al. (2010) 
performed stiffness control on cable-driven robots by optimizing the tension dis- 
tribution of the cables. The stiffness of a CDR is contributed by both the cable 
stiffness and the internal forces in the system. The internal forces, which are the 
cable tensions, are primarily used to resist the external wrench. As the CDRs are 
redundantly actuated closed-loop manipulators, the tension solution will contain a 
set of homogeneous forces that produce zero resultant wrenches on the moving 
platform. Hence, the cable tensions can be manipulated to regulate the stiffness of 
the CDRs. 

The stiffness matrix of the CDR, K E 9V ZXZZ , provides a linear relationship 
between the external wrench <5W E 9\ n and the infinitesimal deflection <5X E 9V Z , 
which is given as: 

— <5W = K<5X (41) 

T 

In Eq. 41, — <5W is used because W = — [F, M] . The stiffness of the cables can 
now be represented as follows: 





( 42 ) 
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where Kdi ag = diagf^, . . K % ] G 9f mxm and K\ is the ith cable stiffness. The cable 

K ' 

stiffness K[ = where /Tc ab i e is the cable stiffness per unit meter and /, is the 
length of the ith cable. Substituting Eqs. 1 and 42 into Eq. 41 yields 





<5W 

~~SX 

ST 

-A 

<5X 

AK C \ T 




T 


where AKdi ag A J and are the stiffness induced by the cable stiffness and the 
cable tensions, respectively. 


The term 


<5A 

<5X 


T) can be computed by (Yi et al. 1989, 1990) 



where t t is the ith cable tension, H 2,?J G is the second-order influence 

coefficient array, H 2,?J G 9T ZX/2 is the ith ( n x n) matrix of H 2 " J , and the operator (g) 
represents the tensor product (Rudich and Wigderson 2004). Therefore, Eq. 43 
becomes 


K = AKj iag A r + Y, Hi) ® (45) 

i= 1 

When high stiffness cables are used, the stiffness of the cables K c -(i = 1 , . . . , m) 
is much higher than the cable tensions, i.e., /C- t f . In this case, the stiffness 
induced by the cable tensions is relatively insignificant as compared to the stiffness 
induced by the cable stiffness. For fully constrained CDRs, it is usually not possible 
to achieve significant stiffness regulation by manipulating the cable tensions. 


CDR with Variable Stiffness Characteristics 

In order to significantly regulate the stiffness of the CDR, a variable stiffness device 
should be attached along each cable of the CDR. The variable stiffness device 
(VSD) refers to an add-on unit with nonlinear stiffness characteristics. As such, the 
CDR is able to exhibit variable stiffness characteristics. As shown in Fig. 17, the 
stiffness of the VSD-cable combination is a function of the stiffness of both the 
cable and the VSD. When the stiffness of the VSD is a function of the cable tension, 
the stiffness of the combination can thus be controlled by manipulating the cable 
tensions of the CDR. 
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K : Stiffness 

K c : Stiffness of the cable 
K VSD (t)\ Stiffness of the VSD 

t : Cable tension 



K=K C 

Without VSD 


b 



i i i 

K K c K VSD (t) 


With VSD 

(VSD-cable combination) 


Fig. 17 A cable with and without the VSD 


The VSDs regulate the stiffness of the CDR by adjusting the stiffness of each 
cable- VSD combination. As the VSD and the cable are placed in series, the stiffness 
of all the cable- VSD combinations is now computed as follows: 




\ K \ 

0 

0 " 

-i 

~K\ SD (t ) 

0 

0 

-in 


0 

• 

• 

• 

0 

+ 

0 

• 

• 

• 

0 



_ 0 

0 

K_ 


0 

0 

K V m SD (t)_ 




where K\ SD (t), for / = 1, . . . , m, is the stiffness of the ith VSD, which is a function 
of the cable tension. With (K^g)? the stiffness of the CDR (K) can be computed 
with Eq. 43. Note that the stiffness contributed by the various cable tensions 
remains the same with or without the VSDs. 


Case Study Using a 2-DOF Planar CDR 


A 2-DOF planar CDR prototype, as shown in Fig. 18, is used for the experimental 
studies to verify the ability of the CDR to significantly regulate its stiffness after the 
VSDs are employed. The moving platform is actuated using four driving cables. 
Any pose of the 2-DOF planar CDR is represented using the position (in meters) 
along the X and Y axes, i.e., [P x , Py\- 

A VSD with the parameters given in Table 4 is designed and built. This VSD is 
designed to sustain a tension limit of 80 N. The VSD prototype is shown in Fig. 19. 
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To 

Actuator 1 


To 

Actuator 4 



Actuator 2 


Actuator 3 


Dial indicator 


Load cell 


Moving -platform 


Fig. 18 Experimental setup of the 2-DOF Planar CDR with VSDs (Lim et al. 2011) 


Table 4 Parameters for 
the VSD prototype 


Parameters 

Values 

^rnax 

80 N 


0.00435 

4* max 

§ rad 

<p0 

0 rad 

d 

0.01 m 


Computer Simulations 

The simulations are first performed on the 2-DOF planar CDR with and without the 
VSDs. For the CDR with VSDs, a VSD is placed on each cable so that the stiffness 
of the CDR can be significantly regulated by manipulating the cable tensions. The 
determinant of the stiffness matrix, det(K), is employed to qualify the stiffness of 
the 2-DOF planar CDR. There are a number of criteria to evaluate the stiffness of a 
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Shaft to support torsion springs 


Set screw to fasten steel cable 


Circlip 



Torsion spring 
(Counter-clockwise) 


Torsion spring 
(Clockwise) 


End of torsion 
spring hooked 

onto shaft Modified design to hold the end of the springs 


Hole to insert steel cable 



Nylon-coated steel cable 


Fig. 19 VSD prototype using torsion springs 


system, such as the trace of the stiffness matrix (Zhang 2009), the eigenvalues of 
the stiffness matrix (El-Khasawneh and Ferreira 1999), and the determinant of the 
stiffness matrix (Ceccarelli and Carbone 2002). In this work, the determinant of the 


stiffness matrix is employed. Let Arable =13, OOON/m, t 


max 


80 N, and t 


mm 


= 3 N, 

in which t max and t min are the upper and lower tension limits. The upper tension 
limit t max is restricted by the force/torque capability of the actuators or the loading 
limit of cables, whichever is lower. The lower tension limit t min is a positive value 
that is needed to withstand disturbances. 

At a particular pose [0.160, 0.465], three sets of tension solutions, with different 
levels of cable tensions, are given in Table 5. Different levels of cable tensions help 
to demonstrate the change in the stiffness (K) of the 2-DOF planar CDR when the 
tension solutions are manipulated. 

Table 5 presents the simulation results on the stiffness analysis of the CDR with 
and without the VSDs. When the VSDs are not employed, it is observed that the 
change in the stiffness of the CDR is insignificant at different tension levels. When 
the VSDs are employed, the determinant of the stiffness matrices has much larger 
variation, implying that CDR with VSDs is able to produce significant changes to 
its stiffness. With the VSDs employed, the same tension limit can now produce 
significant changes to the stiffness of the CDR. As the cable and VSD are connected 
in series, the net stiffness of the cable- VSD combination is bounded below the 
stiffness of the cable alone. For applications that demand low stiffness at certain 
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Table 5 Simulation results for CDR at [0.160, 0.465] 


Tension 

level 

Cable 

tensions (N) 

Without VSD 

With VSD 

K 

det(K) 

K 

det(K) 

Low 


'4.88" 

3.53 

3.46 

5.96 



'36249 -4196' 

-4196 84137 


3.03 x 10 9 


'107.8 -18.5' 

-18.5 225.5 


2.40 x 10 4 

Average 


"44.15" 

30.14 

31.51 

_52.18_ 



'36492 -4172' 

-4172 84230 


3.06 x 10 9 


' 6106 -1900 

-1900 17165 


1.01 x 10 s 

High 


' 73.40 
42.95 
52.94 
79.490 



'36648 -4169' 

-4169 84291 


3.07 x 10 9 


' 13749 -2010 

-2010 36774 


5.02 x 10 s 


times and high stiffness at other times, such CDRs with variable stiffness can be 
very useful. If the CDR requires higher stiffness, cables with higher stiffness can 
always be used. 

Experimental Studies 

A 2-DOF planar CDR prototype, as shown in Fig. 18, is used for the experimental 
studies. As the stiffness matrix is a local measure, it should not be employed to 
calculate deflections for mechanisms or manipulators with large deflections. Hence, 
the unit load method (Bauchau and Craig 2009) is used to determine the deflection 
of the moving platform as a result of the external force. The unit load method is a 
simple and elegant method that can cater to the stiffness nonlinearity and the large 
deflection of the moving platform. Based on the principle of complementary virtual 
work, the unit load method applies a virtual unit force along an axis and then 
determines the deflection caused by the external force. Let the external force along 
the X axis be F x . Employing the principle of complementary virtual work yields the 
following: 


m 

<5F x ^x = ^ efiti (47) 

i= 1 

where <5F X is the virtual force that is applied on the moving platform along the X 
axis, A x is the actual deflection of the moving platform along the X axis as a result 
of F x , <5ti is the tension acting on the ith cable under the action of the virtual force, e t 
is the actual elongation for the ith cable- VSD combination as a result of F x , and 
m is the number of cables. For the unit load method, <5F X = 1. Equation 47 can also 
be applied to determine the deflection along the Y axis or A Y . Using Eq. 47, the 
deflections A x and A Y can be determined analytically. The deflections along both 
axes are then used to compute the deflection of the moving platform. 
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Table 6 Experimental results 


Pose 

Tension 

level 

Cable tensions(N) 

Computed 

det(K) 

Displacement (mm) 

Analytical 

Experimental 

[0.235, 

0.375] 

Low 

[18.4, 18.4, 18.4, 18.4f 

3.96 x 10 s 

0.66 

0.42 

Average 

[33.8, 33.8, 33.8, 33.8f 

5.22 x 10 7 

0.28 

0.21 

High 

[56.9, 56.9, 56.9, 56.9] r 

3.78 x 10 8 

0.13 

0.09 

[0.160, 

0.465] 

Low 

[20.5, 14.7, 14.5, 24.9f 

4.43 x 10 6 

0.71 

0.51 

Average 

[36.6, 25.5, 26.0, 43.7f 

4.87 x 10 7 

0.33 

0.25 

High 

[58.3, 37.0,41.8, 66.lf 

2.61 x 10 8 

0.17 

0.13 


The 2-DOF planar CDR is set at different cable tensions to induce a change in 
the stiffness of the VSDs, which in turn changes the stiffness of the CDR. An 

T 

external force of F = [0 N, 5 N] is applied at different stiffness levels, and the 
displacements are measured using dial indicators. The deflections along the X and 
Y axes are measured individually and are then used to compute the displacements 
of the moving platform. Load cells are placed along the cables to measure the cable 
tensions. The nylon-coated steel cables used in the experiments have a stiffness of 
13,000 N/m per unit meter. The experiments are performed on two poses. The cable 
tensions, the computed stiffness determinants, and the analytical and experimental 
results are given in Table 6. From the analytical and experimental results, it is 
observed that the displacements reduce as the cable tensions increase. This implies 
that the CDR with VSDs is able to significantly regulate its stiffness by manipu- 
lating the cable tensions. Readers may refer to (Lim et al. 2013) on details of a joint- 
based hybrid control scheme that can be employed to simultaneously control both 
the position and the cable tensions of a redundantly actuated CDR, which will allow 
the stiffness to be regulated via the VSD. 

Workspace Analysis 

Performance analysis plays a vital role in obtaining the fundamental tools to 
evaluate and carry out optimal designing of a mechanism. In literature, this has 
been generally done by analyzing certain workspace-related performance measures 
based on the kinematic properties of mechanisms. Hence, workspace analysis is a 
pertinent issue to address, for subsequent development of performance measures 
and design optimization. In general, workspace analysis consists of three main 
components, namely, (i) workspace definition, (ii) workspace evaluation 
approaches, and (iii) workspace generation. While the first two components have 
been addressed in the previous sections, this section will address the workspace 
representation and volume determination, especially for CDRs with orientation 
capabilities. Several performance measures based on workspace quantity and 
quality will also be presented to evaluate the design of CDRs. 
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Workspace Representation 

In literature, there have been two main approaches for the workspace representation 
of CDRs, namely, (i) analytical and (ii) numerical approaches. The analytical 
approach derives the workspace from the closed-form solutions of the kinematic 
constraints, and the workspace boundary is generally characterized by the occur- 
rence of the loss of force-closure condition. This is usually considered as a 
singularity pose and is based on the conventional singularity definition as a pose 
where the moving platform has uncontrollable DOFs instantaneously. Bosscher and 
Ebert-Uphoff (2004a) characterized the wrench-feasible workspace boundary 
based on geometric properties of the available net wrench set. Stump and Kumar 
(2004) derived the closed-form expression for the workspace boundary based on 
convex analysis and using tools from semi-definite programming. Verhoeven and 
Hiller (2000) developed a method to find closed-form expressions of the conserva- 
tive bounds for the controllable workspace. Oh and Agrawal (2006a, b) proposed an 
interesting approach to analytically generate feasible workspace through the choice 
of a control law, which considered input constraints and disturbances. Based on a 
given initial condition, this method allowed recursive calculations of subsequent 
feasible domains, which is useful in extending the end-effector motion range. 
Although analytical methods have an explicit expression and high computational 
efficiency, they are more suited for lower DOF CDRs with simpler kinematic 
constraints and architectures. Such approaches become complex and computation- 
ally challenging when incorporating actual implementation issues (such as mechan- 
ical interferences, task-specific kinematic constraints, cable tension bounds, and 
CDR configurations with redundant cables) due to the increasing number of 
inequalities to consider. Hence numerical approaches are generally more effective 
in coping with such cases and are demonstrated in works by Ceccarelli (2004), 
Pusey et al. (2003), Alp and Agrawal (2002), Gosselin and Wang (2004), and Pham 
et al. (2005). 

The numerical approach generally involves using of discretization algorithms to 
quantify the workspace, solving the inverse kinematics at each pose, and verifying 
the constraints that limit the workspace. For CDRs, this involves force-closure 
analysis. Such numerical approaches can be applied to any type of CDR architec- 
tures. However, this approach requires proper representation of the pose parameters 
in the workspace with an equi-volumetric finite partitioning scheme. Ceccarelli 
(2004) proposed such an equi-volumetric finite partition scheme based on a binary 
matrix formulation. This formulation divided the cross section of the workspace 
into small rectangles, represented by a feature point. An efficient workspace 
boundary determination algorithm was also developed, which is based on the 
geometry of the grid and the number of surrounding feature points. 

There are two main categories of pose parameters, namely, translational and 
orientation pose parameters (Merlet 2000). Translational pose parameters can be 
easily visualized in 3 -dimensional space, and the visualization in R 3 represents the 
actual task space. However, is it not so trivial for the case of orientation pose 
parameters, especially the Special Orthogonal Group, i.e., SO(3) orientations. There 
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are several conventions that have been developed to represent and parameterize 
rigid body rotations, such as the Cayley-Rodrigues parameters, axis-angle param- 
eters, unit quaternions, Euler angles, T&T angles, and exponential coordinates 
(Gosselin and Wang 2004; Bonev and Ryu 2001; MacCarthy 1990; Park and 
Ravani 1997). Hence, there are two basic requirements to be considered in the 
selection of the appropriate parameterization method: (i) the parametric domain 
must have a closed boundary so as to make finite partition possible, and (ii) the 
parameterization needs at most three parameters so that the embodied orientation 
workspace can be readily visualized in three-dimensional space. 

There are three parameterization methods (among the various mentioned previ- 
ously) which fulfill these two requirements, i.e., the Euler angles, the T-&-T angles, 
and the exponential coordinates. The rigid body rotation group SO(3) can be 
represented as a rectangular parallelepiped (Chirikjian and Kyatkin 2000), a solid 
cylinder (Bonev and Ryu 2001), and a solid sphere (Park and Ravani 1997) when 
using the Euler angles, the T&T angles, and exponential parameterizations, respec- 
tively. Various partition schemes have been proposed for these three parameteri- 
zation methods based on their geometries. Readers may refer to a related paper by 
the author (Yang et al. 2006) for details on the partition schemes proposed and the 
workspace analysis of the different parameterization methods. Nevertheless, these 
three partition schemes possess the same essential features, i.e., they are all equi- 
volumetric, parametric, proportional, and exact partitions of their corresponding 
geometric identities. However, as a result of the parameterization, integration 
measures have to be introduced when computing the workspace volume from its 
parametric domain. Consequently, the orientation workspace volume is numeri- 
cally computed as a weighted sum of the equi-volumetric elements in which the 
weightages are the element-associated integration measures (Chirikjian and 
Kyatkin 2000). Following this approach, various global performance measures of 
the orientation workspace can also be readily computed. 

The following subsections will present the workspace representation for two 
common orientation configuration of modular CDRs, i.e., 2-DOF and 3 -DOF 
orientation. 

CDR with 2-DOF Orientation 

A CDR with 2-DOF orientation has a passive 2-DOF universal joint connected 
between the base and moving platform so as to limit the motion of the moving 
platform to the two rotating axes. The total orientation workspace of the 2-DOF 
CDR belongs to a subset of the Special Orthogonal Group SO( 3) with a complex 
boundary. Thus, the workspace is represented using the parametric domains, and 
0 2 , which are the angular displacements about the rotational axes of the 2-DOF 
CDR. In the parametric space, the orientation workspace of the 2-DOF CDR can be 
represented by a plane in R~, in which a one-to-one mapping between 6\ and 0 2 and 
the orientation of the 2-DOF CDR can be established. The parametric domain is 
partitioned into finite elements with equal volume by dividing each axis in the 
parametric domain into h identical segments (see Fig. 20). There are h 2 equi- 
volumetric elements in the planar workspace, and each element is specified by its 
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Fig. 20 Workspace 
representation for the 


2-DOF CDR 



feature point f These equi-volumetric elements have dimensions which are 
inversely proportional to the number of partitions h, implying that all the elements 
in this partition scheme converge uniformly when h is sufficiently large. When the 
elements are small, the local property of the entire element can be represented by f^ 
anywhere inside the element (Ceccarelli 2004). The use of feature points will 
facilitate numerical calculation of the orientation workspace volume where each 
fij represents a volumetric element, ldet(J I )\d0id0 2 - 


CDR with 3-DOF Orientation 

These 3-DOF CDRs are in fact cable-driven spherical mechanisms. Hence, the SO 
(3) orientation workspace analysis is not as straightforward. While quaternion 
seems to be used as the standard method for representing motion in computer 
animations and has the advantages in terms of lack of gimbal lock and insensitivity 
to round off errors, but they suffer from problems of interpretation. Euler angles, on 
the other hand, allow some degree of intuition for the rotation angles, but suffer 
from singularities. Representations using equivalent angle-axis have also been 
proposed (Korein 1984), but the main issue is that interpretation of joint motions 
must have an intuitive feel while being mathematically tractable at the same time. 
Hence, there is a great debate as to the use of Euler angles with quaternion and 
equivalent angle-axis for workspace representations. 

For the 3-DOF CDR, the Tilt-&-Torsion angles parameterization is adopted to 
represent its workspace. The advantage of this representation is that it is able to 
separate the 3-DOF orientation into two components: the 2-DOF tilt and the 1-DOF 
twist. The tilt component (</> and 0) determines the direction of the axis, while the 
twist component (<r) determines the rotation about the axis itself, as shown in 
Fig. 21. This representation is intuitive, and the joint motion limit boundaries can 
be easily identified using this representation. 
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Moving Platform 
(At the initial pose) 


x 



Moving Platform 
(At the final pose) 


Fig. 21 Tilt-&-Torsion angles representation (Bonev and Ryu 2001) 


Gosselin and Wang (2004) proposed an intuitive way to display orientation 
workspace using a cylindrical coordinate system. Adopting a similar approach, 
the orientation workspace of the 3 -DOF CDR is displayed using a system of 
cylindrical coordinates {r, y, h}. </>, 0, and o will be the polar angle y, the polar 
radius r, and the height h, respectively. In this section, an equi- volumetric partition 
scheme of the cylindrical workspace of radius n and height In (with (j) 
E [—180°, 180°], 6 E [0°,180°] and o E [—180°, 180°]) is proposed. The following 
is the partition scheme of the top half of the cylinder (the partition of the bottom half 
of the cylinder will be similar to that of the top half as presented here): 


Step 1: As shown in Fig. 22a, the solid cylinder is divided along z-axis into 
p identical circular discs of height Sh = Starting from the bottom, the discs are 

then labeled from 1 to p , i.e., i = 1, . . . , p. This results in p circular discs. 

Step 2: As shown in Fig. 22b, each circular disc i is then divided into p circular 
bands of thickness Sr = ^ along the radial direction. Note that the first circular 

band is actually a circular disc of radius ^ and height Starting from this central 

circular disc, the circular bands are labeled from 1 to p , i.e.,y = 1, ... , p. This 
results in p circular bands. 

Step 3: As shown in Fig. 22c, the central circular disc (i.e.,y = 1) is divided into 
four quadrants, where each quadrant is regarded as the basic element of volume 

. For each circular band j (i.e., j = 2, ... , p), it is circumfer- 


SV 


Cartesian 


4/7 


entially divided into 4(2 j — 1) equi-volumetric element of volume <5F Cartesian 
with the same spacing angle of 2 { 2 j-\) > as shown in Fig. 22d. Starting from 


the x-axis, the elements are labeled counterclockwise from 1 to 4(2 j 


1 ), 


i.e., k 


1, ... , 4(2 j — 1). This results in X 1 - 


i 


4(2 j 


1 ) 


4 p elements for 


each circular disc. 


Q 

In total, there are 4 p elements in the cylindrical workspace, and each element is 
specified by its feature point f ijk using a triplet index (/, j, k). f ijk is selected as the 
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X 



l = p 


>i = 2 


Partition scheme of a solid cylinder 



Partition scheme of z th circular disc 



k = 4(2j-1) 


Partition scheme of j th circular band in / th disc 


d 

Feature Point 



Element in the partition scheme 


Fig. 22 Equi -volumetric partition scheme of a solid cylinder 


intersection point of the three neutral planes of the element and is given in 
cylindrical coordinates as 



where /z( 


( 2/ - 1)”- J 

2 p ’ r ij 


2 P 


and 


ijk 


/f Qr l 
4(2/— 1) V lur 1 


1, . . . ,p;j = 1, . . . ,/?; and 




1 , 


4(2/ 


1)). When the number of elements is sufficiently large, each 


element can be locally represented by f ijk . The advantage of this division scheme is 
that the singularity points for the T&T angles at 6 = 0 and n are avoided. It has a 


“one-to-one” mapping between the points in the cylindrical plot and the poses in the 
task space. In addition, all elements are geometrically identical, with simplified 
analysis and computational effort. 
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Workspace Volume 

Workspace volume is normally employed as the quantitative measure for 
workspace analysis. However, for mechanisms with orientation capabilities, its 
orientation workspace volume is not equal to its geometric volume in its parametric 
space due to the nonlinear parameterization. Instead, the workspace volume can be 
numerically computed as a weighted volume sum of its constituent volumetric 
elements in its parametric space, in which the weightages are the element- 
associated integration measures. During the computation of the volume of SO(3), 
an integration measure has to be introduced in order for the integral of the function 
to be independent of the parameterization. This integration measure is the absolute 
value of the determinant of the Jacobian, J 7 , which relates the angular velocity, 
co G 93 , to the rate of change of the parameters 6 G 93 : 

co = J/(R(0))0 (49) 

The following subsections will present the workspace volume determination for 
two common orientation configuration of modular CDRs, i.e., 2-DOF and 3 -DOF 
orientation. 

CDR with 2-DOF Orientation 

The orientation workspace volume of the 2-DOF CDR is not the geometric volume 
in its parametric space due to nonlinear parameterization. The orientation 
workspace volume is obtained by defining a volumetric element, \dct(J j)\d0 id 02 , 
and finding the integral of the function. The term ldet(J/)l is the integration measure, 
I v . For the 2-DOF CDR, the Jacobian matrix J 7 G 93 is rectangular, but a matrix 
has to be square for the determinant to exist. Note that the rotation of the 2-DOF 
CDR can also be represented using the XYZ Euler angles parameterization, in 
which the rotation matrix can then be represented by 

R = e S X 0i e Sy6>2gS z 6>3 ( 50 ) 

'T 

where s z = [00000 1] and 0 3 = 0. Using the XYZ Euler angles parameterization, 
the integration measure is derived in Appendix “Formulation of the Jacobian and 
Integration Measure Associated with 2-DOF CDR’s Parameterized Rotations” and 
is given as 


I v = | det ( J/ ) 




The orientation workspace volume of the 2-DOF CDR, V S o{ 2 > is the integration 
of the volumetric element over the parametric domain, defined by the two rotational 
angles, and is given as follows: 



1 

dV R 

* 

* 

cos (0 2 ) 

JVsOi 2) J 

1 

{Si ,e 2 ) 



dOid02 
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Employing the equi-volumetric partition scheme, the orientation workspace 
volume of the 2-DOF CDR is numerically computed by 



where h tota[ is the total number of poses (i.e., feature point) and O^i) is the 0 2 angle 
at the ith feature point. 


CDR with 3-DOF Orientation 

When quantifying the workspace volume of SO( 3), V S oo) * s not ec l ua l to the 
geometric volume of the cylindrical workspace (i.e., Vcartesian) due to the nonlinear 
parameterization. Hence, a volume-associated integration measure (7 V ) has to be 
introduced to accurately determine V S o( 3 ) from V Cartesian . From the concept of 
metrics, 7 V is defined as the absolute value of the Jacobian matrix which relates 
the rate of change of the orientation parameters to describe the rotation to the 
angular velocity of the rigid body. When computing the volume of SO(3) from the 
cylindrical coordinate representation of T&T angles, an equivalent 7 V is determined 

as by adopting a similar approach presented by Chirikjian and Kyatkin (2000) 

(refer to Appendix “Formulation of the Integration Measure for SO(3) Representa- 
tion in Cylindrical Coordinates” for formulation details.). The volume for the 
discretized SO( 3) workspace is now given as 



where /z total is the total number of poses (i.e., feature point), 0, is the 6 angle at the 
ith feature point, and p is the number of circular bands, with respect to the 
cylindrical workspace as shown in Fig. 22. 


Workspace Performance Measures 

The most widely studied performance measures are based on workspace quantity 
(i.e., volume) and workspace quality (i.e., manipulability, condition index, dexter- 
ity, stiffness). Although majority of design optimization approaches utilize 
workspace volume as a performance measure, a mechanism designed for maximum 
workspace may lead to undesirable kinematic characteristics such as poor dexterity 
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or manipulability (Tsai and Joshi 2002; Yang et al. 2003). Hence, it is also 
important to determine the quality of the workspace in addition to workspace 
quantization. A variety of workspace quality measures have been proposed such 
as manipulability, dexterity, condition index, and stiffness, but most of them are 
equivalent to one another as they all depend on the conditioning of the Jacobian 
matrix (Merlet 2000). A more commonly used workspace quality measure is the 
global condition index (GC/), which is a measure of the kinematic dexterity of the 
robot over the whole workspace (Gosselin and Angeles 1991). Large values for GCI 
(close to one) ensure good performance with respect to force and velocity trans- 
mission. Regardless of the quality measures used, a mechanism far from singular- 
ities will always have a high workspace quality measure. For CDRs, GCI has also 
been employed to study the workspace quality (Pusey et al. 2003; Fattah and 
Agrawal 2002). However, Jacobian-based measures do not reflect the stiffness of 
the cables. Hence, several other measures have been proposed. Bosscher and Ebert- 
Uphoff (2004b) proposed a slope-based measure to analyze the stability of an 
under-constrained CDR (i.e., no. of cables, m < n -DOF), while Verhoeven and 
Hiller (2002) proposed a tension factor which reflects the tension distribution 
among the cables. Pham et al. (2006b) proposed a similar tension factor, but it 
utilized a linear programming approach. 

The following subsections will present several performance measures based on 
the workspace volume and workspace quality (using the stiffness matrix). 

Quantitative Measures 

Given a specific task workspace, the main objective in the dimension optimization 
of a CDR is to achieve a workspace that closely matches the specified 
workspace. Hence, a performance evaluation index is required that will quantita- 
tively determine how much workspace volume of the specified workspace 
has been matched by the designed CDR. This proposed performance quantity 
measure, known as the Workspace Matching Index ( WMI ), is described as follows: 


WMI 


Maneuverable workspace volume of a CDR 
Total Volume of the specified workspace 



where the maneuverable workspace refers to the poses satisfying the force-closure 
conditions, as well as other requirements such as minimum stiffness. 

For the 2-DOF CDR, Eq. 55 is numerically given as 


WMI 




COS 0 2 (i) 

COS 0 2 (i) 



For the 3 -DOF CDR, Eq. 55 is numerically given as 
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WMI 





where h a is the number of poses achievable by the CDR and is a subset of h to tal , 
while /z tota i is the total number of poses in the specified task workspace. WMI ranges 
from 0 to l.A WMI value 1 (ideal) indicates that the CDR is able to achieve all the 
poses in its corresponding specified workspace. 

The proposed performance quantity measure ( WMI) is illustrated using a 3 -DOF 
modular CDR. This modular CDR is part of a 7-DOF humanoid arm (Yang 
et al. 2011) whose objective is to achieve the same workspace as the human arm. 
In this illustration, the 3 -DOF CDR represents the human shoulder joint with 
their typical range of motion (Hamilton and Luttgens 2002), and the 
approximated maximum range of the tilt-&-torsion angles are e [—180°, 180°], 
6 e [0°, 140°], and o e [—90°, 90° ], respectively. Figure 23a-c show the workspace 
plot based on the shoulder range of motion, the optimized workspace plot, and the 
corresponding optimized 3 -DOF CDR configuration, respectively (readers may 
refer to (Yang et al. 2011) for the detailed design optimization of the 7-DOF 
humanoid arm). 


Qualitative Measures 

Besides workspace quantity, it is also important to determine the quality of the 
workspace. Especially for CDRs, it utilizes flexible cables which greatly affect the 
moving platform’s stiffness. Hence it is very critical to maintain sufficient stiffness. 
As shown by Strang (1976), the condition number of a matrix can be used in 
numerical analysis to estimate the error generated in the solution of a linear system 
of equations by the error on the data. When applied to the Jacobian matrix, the 
condition number gives a measure of the accuracy of the Cartesian velocity and the 
static load acting on the end of the moving platform. Similarly, the condition 
number of the stiffness matrix K (determined in section “Stiffness Analysis”) can 
also be used to give a measure of accuracy for CDRs, which includes the effect of 
cable stiffness. The Stiffness Condition Index, i.e., SCI , (which is the reciprocal of 
the condition number) is given as the ratio of the minimum to the maximum 
singular value of K at a given pose. SCI ranges from 0 to 1. Large values for SCI 
(close to 1) ensure good stiffness with respect to force and velocity transmission, 
while SCI values close to 0 indicate poor stiffness. 

Adopting a similar approach as the Global Conditioning Index proposed by 
Yang et al. (2003), the Global Stiffness Conditioning Index ( GSCI) is proposed 
as a performance quality measure. GSCI is the integration of SCI over the whole 
workspace and is given by 
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3D view of the right-arm shoulder joint Optimized workspace plot with WMI = 0.686 

workspace based on its typical range of 

motion 


C 


Spherical 

Joint 


Moving 

Platform 



Base 


Optimized 3 -DOF Modular CDR with 3-3 six-cable configuration, 
where — 129 mm, r 2 — 49 mm, d\ — 12mm and d 2 — 51 mm 


Fig. 23 Illustration of proposed performance quantity index ( WMI) using a 3-DOF Modular CDR 
based on the human shoulder joint 


GSCI 


r* 


SCI(K)dV 





where V represents the workspace volume of the CDR. GSCI ranges from 0 to 
1. Large values for GSCI (close to 1) ensure good overall stiffness with respect to 
force and velocity transmission, while GSCI values close to 0 indicate poor overall 
stiffness. 
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Fig. 24 Topological synthesis of symmetric six-cable 3-DOF CDRs 


Table 7 Optimization 
results for the various 
symmetric six-cable 
3-DOF CDR topologies 


Configuration 

WMI (0-1) 

GSCI (0-1) 

6-6 

0.181 

0.054 

6-3 

0.360 

0.137 

6-2 

0.433 

0.114 

3-6 

0.352 

0.199 

3-3 

0.686 

0.358 

2-6 

0.225 

0.081 


The proposed performance quality measure GSCI is illustrated using the 3-DOF 
modular CDR. In this illustration, dimension optimization is carried out on sym- 
metric six-cable 3-DOF CDRs with the topologies shown in Fig. 24, and Table 7 
shows the optimization results based on both GSCI and WMI. 


Cable Tension Planning 

For CDRs, it is important to ensure that positive cable tensions are maintained 
because of their unilateral driving property. In addition, since CDRs are redun- 
dantly actuated, there exist an infinite number of cable tension solutions for a 
particular pose. Adjustable cable tensions at a given pose can be useful in many 
CDR applications. For instance, if the user wishes to have lower energy consump- 
tion, then lower cable tensions are preferred. If the user wishes to regulate the 
stiffness, say for instance, achieve higher stiffness, then higher cable tensions are 
preferred. Hence, planning through optimization of the cable tension solution is an 
important aspect as well. 


Cable Tension Planning of CDRs Without VSDs 

For CDRs, the minimum 1-norm and 2-norm tension solutions are commonly used 
as the performance criteria for tension optimization. Pham et al. (2009) employed 
linear programming method to determine the minimum 1-norm tension solutions. 
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Fang et al. (2004) derived an analytical method to obtain the minimum 1-norm 
tension solutions for cable-driven manipulators with one redundant cable. Oh and 
Agrawal (2005) designed a controller that guarantees positive tension solutions and 
implemented it on a planar CDR. The controller is able to use linear programming 
method and quadratic programming method to generate the minimum 1-norm and 
2-norm tension solutions, respectively. The minimum 1-norm tension solutions can 
be efficiently generated by linear programming method, but discontinuity may exist 
in the tension solutions despite having continuous input variables. Such disconti- 
nuity will not occur for the minimum 2-norm solutions (Verhoeven and Hiller 
2002). The tension solutions based on minimum 1-norm and 2-norm performance 
criteria tend to have tensions that are near to the lower tension limit. Hence, it is 
possible for the cable to lose tension when the CDR is subjected to disturbance. 
Mikelsons et al. (2008) addressed this issue by generating the safe-zone tension 
solutions that are away from the lower and upper tension limits. The non-iterative 
algorithm is shown to produce continuous tension solutions, but the computational 
efficiency is low when there are many redundant cables. Brogstorm et al. (2009) 
employed linear programming method to generate the optimally safe tension 
solutions which are similar to the safe-zone solutions. It allows the users to 
adjust the tension of all the cables between the tension limits. However, the 
tension solutions also suffer from possible discontinuity because of the linear 
programming approach employed. Pott et al. (2009) generated closed-form 
tension solutions which have tension values closest to the mean feasible 
tension, i.e., (/ min + / max )/2. However, the algorithm may miss solutions at some 
of the poses. 

When at least one feasible tension solution exists, a tension optimization scheme 
can be used to find the optimal feasible solution based on a performance criterion H 
(T). The tension optimization problem can be written as 

Obj ecti ve function : Optimize H ( T ) 

subjected to: AT = W (59) 

T m in < T < T max 

where H(T) can be maximized or minimized. 

The quadratic programming is employed to generate adjustable tension solution, 
and the objective function is given as 


H(T) = Y, C ti - td ) 2 

7=1 



where t d is the desired cable tension. 

As an illustration, Fig. 25 shows the distribution of H(T) of the ith cable when 
/min = 3 N and / max = 200 N. It is observed that the objective function can be 
manipulated by changing the desired cable tension t d . Using this objective function, 
the optimization process will be able to generate the tension solution that is close to 
the desired cable tension. 
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Fig. 25 Performance 
criterion for the ith cable 


Performance criterion, H Z (T) 



t'min = 3N Tension (N) /' =200N 


Performance criterion, H Z (T) 



Cable Tension Planning of CDRs with VSDs 

When the VSDs are employed, the stiffness of the CDR can be regulated by 
manipulating the cable tensions. Therefore, a stiffness-oriented tension resolution 
algorithm is proposed. At a particular pose, the algorithm determines the required 
cable tensions so as to fulfill the stiffness requirement. In literature, Yu et al. (2010) 
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Fig. 26 Distribution of the 
performance measure based 
on eigenvalues of the stiffness 
matrix 



maximized the smallest eigenvalue of the stiffness matrix by optimizing the tension 
distribution. This method optimizes the magnitude of the stiffness matrix. Liu 
et al. (2011) optimized the tension distribution to maximize the sum of some 
diagonal components of the stiffness matrix. For this method, the actual stiffness 
is affected by the non-diagonal terms. The determinant of the stiffness matrix 
generally is used to show the change in stiffness when the tension distribution 
changes. However, the determinant only reflects the magnitude of the stiffness 
without any information on the direction. For the 2-DOF CDR shown in Fig. 18, a 
performance measure is proposed to optimize the tension distribution to achieve an 
isotropic stiffness matrix with high stiffness magnitude. Let the eigenvalues of the 
stiffness matrix of the 2-DOF CDR be o x and <r 2 . For the isotropic stiffness matrix, 
o i equals to <r 2 , but this is usually not achievable. Hence, the performance measure 
will aim to attain eigenvalues that are close to each other. High stiffness magnitude 
can be achieved by having high eigenvalues. The performance measure to achieve 
isotropic stiffness matrix with high stiffness magnitude is given as 


(jj + al 



Figure 26 shows the relationship between the performance measure and the two 
eigenvalues. It is observed that the performance measure forms a ridge (shown 
using a dotted line) when the two eigenvalues are equal, and this ridge gets higher 
when the eigenvalues increase. 

Maximizing such performance measure in the objective function allows the 
optimization process to select a tension distribution that is able to achieve isotropic 
stiffness matrix with high stiffness magnitude. The optimization problem for the 
stiffness-oriented tension resolution algorithm becomes as follows: 
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Obj ecti ve function : 
subjected to: 


Maximize 

AT = W 


<A- a \ 


T 


min 


< T < T 


max 



The “fmincon” function in Matlab can be used to solve the nonlinear optimiza- 
tion problem in Eq. 62. With the proposed stiffness -oriented tension resolution 
algorithm, users will be able to generate cable tensions that are able to achieve 
isotropic stiffness matrix with high stiffness magnitude. 


Case Study Using a CDR with 2-DOF Orientation 

A 2-DOF CDR (without any VSDs), as shown in Fig. 27, is used as a computer- 
simulated case study to illustrate the proposed cable tension planning algorithm in 
section “Cable Tension Planning of CDRs Without VSDs.” It has a passive 
universal joint that connects the moving platform to the base and is actuated 
using four driving cables. Any pose of this 2-DOF CDR can be represented using 
two angles 6 \ and 0 2 . The rotation matrix is defined as R = which is 

defined with respect to the fixed X and Y axes. 

At the pose [20°, 20° ], the cable lengths and tensions are given in Table 8. 
Figure 28 shows the tension distribution at [20°, 20°] when the desired cable 



Dimension 

Length (mm) 

D1 

38.3 

D2 

92.4 

D3 

92.4 

D4 

38.3 

D5 

25.0 

D6 

25.0 


Fig. 27 Dimension details of the 2-DOF CDR used for cable tension planning simulation 


Table 8 Tension distribution for the 2-DOF universal joint CDR 


Pose 

Cable lengths (mm) 

UN) 

Computed tension distribution (N) 

e x - 20 ° 


10 

[8.2, 14.0, 8.8, 8.4] 

6b - 20° 

[58.1, 80.8,59.2, 48.4] 

100 

[93.1, 113.8, 96.5,94.3] 



180 

[166.4, 200.0, 172.2, 168.4] 
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200 


150 
o 100 

GO 

a 

CD 

H 

50 


0 



t d = ION t d = 1 00N 


Fig. 28 Tension distribution at [20°, 20°] when t d 



■ Cable tension 1 
Cable tension 2 

■ Cable tension 3 

■ Cable tension 4 


t d = 180N 

10 N, t d = 100 N and t d = 180 N 


tensions t d are 10 N, 100 N, and 180 N. It is observed that the proposed tension 
optimization method is able to generate adjustable tension solutions. 

With the cable lengths and tensions computed, a hybrid control scheme can be 
employed to simultaneously control both the position and the cable tensions. For an 
n -DOF CDR with m cables, the pose of the moving platform can be uniquely 
determined using n cable lengths. Hence, n cables are placed under displacement 
control mode to maintain the position of the moving platform. The remaining 
0 m—n ) cables are then placed under tension control. In this way, the control scheme 
can simultaneously control both the position and the cable tensions of the CDR. For 
the 2-DOF 4-cable CDR, the pose of the moving platform can be uniquely deter- 
mined by two cable lengths. Hence, if cables 1 and 2 are set in displacement control 
mode, then cables 3 and 4 will be under tension control mode. In displacement 
control mode, the cable length is controlled using the encoder feedback from the 
motor. In tension control mode, the cable tension is controlled using the feedback 
from a force sensor (or a load cell). In this way, while the moving platform is under 
position control, the cable tensions can also be changed at the same time, which will 
allow the stiffness to be regulated (readers may refer to (Lim et al. 2013) for more 
details on the hybrid control scheme). 


Summary 

This chapter focused on the performance analysis tools such as force closure, 
stiffness, workspace, and cable tension planning, required in the design of cable- 
driven robots (CDRs). CDRs are a special class of parallel mechanisms in which the 
end-effector is actuated by cables, instead of rigid-linked actuators. They are 
characterized by lightweight structures with low moving inertia and large 
workspace, due to the location of the cable winching actuators at the fixed base 
of the structure and thereby reducing the mass and inertia of the moving platform. 
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However, unlike rigid links, cables exhibit a unilateral driving property, i.e., it can 
only exert a tensile force. Hence, analysis approaches obtained for rigid-linked 
robots cannot be directly applied, and such CDRs require additional force consid- 
erations. As such, force-closure algorithms which focused on determining if equi- 
librium can be achieved with a set of positive cable tension under the influence of 
external wrenches were presented for both single-bodied CDRs as well as multi- 
bodied CDRs. Case studies were presented to illustrate the application of the 
proposed algorithms. 

In addition, CDRs also possess an intrinsically safe feature due to the cables’ 
flexibility, which allow CDRs to provide safe manipulation in close proximity to 
their human counterparts. On the other hand, this flexibility causes CDRs to have 
relatively low stiffness and thereby significantly affecting the positional accuracy. 
Furthermore, it is not possible to achieve significant stiffness regulation by manip- 
ulating the cable tensions for fully constrained CDRs. As such, the concept of CDR 
with variable stiffness characteristics (through the use of variable stiffness devices 
(VSDs) attached in series to cables) was introduced. A case study was presented to 
verify the ability of the CDR to significantly regulate its stiffness after the VSDs 
were employed. 

Performance analysis plays a vital role in obtaining the fundamental tools to 
evaluate and carry out optimal designing of a CDR. In literature, this has been 
generally done by analyzing the workspace. Workspace representation techniques 
such as equi- volumetric workspace partitioning for CDRs with orientation capabilities 
were presented. For these CDRs, mathematically tractable workspace volumes were 
also presented as the orientation workspace volume was not equal to the geometric 
volume in the parametric space due to the nonlinear parameterization. Both quantita- 
tive and qualitative performance measures based on workspace volume and eigen- 
values of the stiffness matrix, respectively, were presented together with case study 
examples to illustrate the application of the proposed performance measures. 

Planning through optimization of the cable tension solution for a given pose is an 
important aspect for CDRs. This is due to the unilateral driving property of cables, 
which also makes CDRs to be redundantly actuated. Hence, there exist an infinite 
number of cable tension solutions for a particular pose. A quadratic programming- 
based tension optimization method was proposed to obtain adjustable tension 
solution for CDRs. The tension solution can be manipulated by selecting a desired 
cable tension, in which the tension optimization will aim to achieve. Adjustable 
cable tensions allow CDRs to regulate its stiffness and also enable the CDR to avoid 
tension limits and allow the stiffness of the CDR to be regulated at a particular pose. 
A case study was also presented to illustrate the application of the proposed cable 
tension planning algorithm. 

In summary, various performance analysis tools for CDRs were presented in this 
chapter with the aim of allowing readers to get a quick overview and a head start in 
the design of CDRs. 
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Appendix 1: Formulation of the Jacobian and Integration 
Measure Associated with 2-DOF CDR's Parameterized Rotations 

Jacobian Associated with 2-DOF CDR's Parameterized Rotations 


For the 2-DOF CDR, the time-varying rotation matrix is parameterized as 


R (0 


n(h l (t),h 2 (t)) = n(h(t)) 


(63) 


Using chain rule from calculus, Eq. 63 becomes 


„ OR • OR • 
R = - — h\ + — — h 2 


dh\ 


dh 2 


(64) 


T 


Right multiply Eq. 64 by R and extracting the dual vectors yields 


oo 


Q(H(h))h 


(65) 


where oo s = vect ( RR r ) is the angular velocity with respect to the spatial reference 


frame and Q(H(h 



vect 


( d\\ 3 iji 


vect 


(f)« 


T 


is the Jacobian associated 


with 2-DOF CDR’s parameterized rotations. 


For the 2-DOF CDR, H W u 0 2 ) = Rx^ORyC^)- Hence, 


vect 


fd H 

\dhi 


H 


T 


/?' (0i)J?yO%)Ry(-02)Rx(-0i) 


RUd^Rxi-e 


i 


vect 


fdW 

\dh 2 


H 


T 


Rx{@i)Ry{@ 2 )Ry{—@ 2 )Rx{—@i) 


/ r>T 


T 


Since vect (RjRi ) = £ z regardless of the value of the parameter and vect (RjXRi ) 
Rvect(X), the Jacobian is given as 


Q(H(h 



1 


0 


0 cos^i 
0 sin 6 1 


(66) 


Integration Measure Associated with 2-DOF CDR's Parameterized 
Rotations 

Using the XYZ Euler angles parameterization, the rotation matrix is given as 

H(01,02) = Rx(0i)Ry(02)Rz(#3) (67) 

Employing the same approach the Jacobian matrix is derived as 
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Q(H (0 U 0 2 ,0 3 )) 


1 0 sin 02 

0 cos 6 1 — sin 0 \ cos 62 

0 sin 0 \ cos 6 \ cos 62 


The determinant of Q(H(#!, 0 2 , 0 3 )) is: 


det(Q(H(6> 1 ,6' 2 ,6' 3 )) = cos(6» 2 ) 

Therefore, the integration measure is determined as follows: 


det (Q(H(0! , # 2 , #3)) 






Appendix 2: Formulation of the Integration Measure for SO(3) 
Representation in Cylindrical Coordinates 


It has been addressed in (Bonev and Go s selin 2005) that the entire rigid body group 
50(3) can be visualized as a solid cylinder when the cylindrical coordinates are 
employed to represent the Tilt- & -Torsion angles, as shown in Fig. 22. However, 
integration measures need to be introduced when computing the volume of the 
orientation workspace of rigid body rotations from the T&T angles domain. It can 
be verified that if Cartesian coordinates are used to represent the T&T angles, the 
integration measure will be same as that of the Euler angle representation, which is 
given by sin 0. In this case, the volume of the entire rigid body rotation group is 
given as 


* 

dV R 

*: 2k 

• pi 

JSO{ 3 ) J 

0 J 

0 J 


2 n 


sin 0 d(j)d 0 do = 8 n 


o 



Equation 71 has the same form as when the T&T angles are represented with 
Cartesian coordinates, i.e., x = </>, y = 0, and z = o. However, since the T&T angles 
are normally represented with cylindrical coordinates, i.e., x = 0 cos <p,y = 0 sin 
and z = t 7, an additional integration measure needs to be included for the change of 
the coordinate representation. Geometrically, such a transformation of the coordi- 
nate representations maps the parametric domains of the T&T angles from a 
rectangular parallelepiped to a solid cylinder. It can be further verified that deter- 
minant of the Jacobian (i.e., the additional integration measure) for the transforma- 
tion of the coordinate representations is given by The resultant integration 

measure becomes pp . It follows that the volume of the entire 50(3) under 
cylindrical coordinate representation of the T&T angles is given by 


II 

(• 


* 

sin 0 


V 2 x1Z 

0 

JSO( 3) 


dxdydz 


(72) 


2224 


S.K. Mustafa et al. 


r\ 

where V x 71 represents a solid cylinder. If the integration is computed using 
cylindrical coordinates, Eq. 72 can be rewritten as 


dV R 


SO( 3 ) 


2 n 


0 


"P l 


0 


2 n 


0 


sin# 

~ 6 ~ 


6d(j)d0do = 8 n 


(73) 


Although Eqs. 71 and 73 are equivalent for the volume computation of SO( 3), 
they possess different geometrical meanings. Equation 71 is associated with the 
Cartesian coordinate representation of the T&T angles, while Eq. 73 is associated 
with the cylindrical coordinate representation of the T&T angles. In Eq. 73, the 

terms and 6d(j)d6do represent the integration measure and the differential 

volume element, respectively. 

Equations 72 and 73 also indicate that the integration measure becomes singular 
when 6 approaches 0 or n. However, with strategic selection of the cylindrical 
coordinate representation for T&T angles, singularity point at 6 = 0 can be avoided. 
This is a significant feature for the numerical volume computation of SO(3) through 
its parametric domains. 

With the equivalent integration measure derived for the cylindrical coordinate 
representation of T-&-T angles, the integration or convolution of a rotation- 
dependent function / ( R ) over a set of rotations S E SO( 3) in the T&T angles 
domain is given by 




f(R)dV R 




ReS 


(p,0,eeQ t 


sin# 

~6~ 


0d(j)d0do 



where Q t denotes the parameter space of the T&T angles (with cylindrical coordi- 
nate representation), i.e., a subset of the solid cylinder. 

After the finite partition of the solid cylinder, the orientation workspace for a set 
of rotations S E SO (3) can be numerically computed as 



where v t is the unit volume of the equi-volumetric partition scheme in the cylin- 
drical coordinate representation of the T&T angles. Consequently, Eq. 74 can be 
written as 



where (< i^ijk ? ®ijfa ^ ijk ) E Qt- 
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Abstract 

Compliant manipulators are advanced robotic systems articulated by the flexure 
joints to deliver highly repeatable motion. Using the advantage of elastic deflec- 
tion, these flexure joints overcome the limitations of conventional bearing-based 
joints such as dry friction, backlash, and wear and tear. Together with high- 
resolution positioning actuators and encoders, the compliant manipulators are 
suitable ideal candidates for micro-/nanoscale positioning tasks. This chapter 
presents the relevant knowledge of several fundamental topics associated with 
this advanced technology. After reviewing its evolution and applications, the 
principal of mechanics is used to explain the limitations of these manipulators. 
Subsequent topic covers various theoretical modeling approaches that are gen- 
erally used to predict the deflection stiffness of flexure joints and stiffness 
characteristics of compliant manipulators. Next, various fundamental design 
concepts for synthesizing the compliant mechanism will be introduced and 
several examples are used to demonstrate the effectiveness of these concepts. 
The topic on actuation, sensing, and control summarizes the types of high- 
resolution actuators and sensors which the compliant manipulators use to 
achieve high-precision positioning performance. Performance trade-offs 
between various actuators and among different sensors are discussed in detail. 
With this relevant knowledge, this chapter serves as a guide and reference for 
designing, analyzing, and developing a compliant manipulator. 


Introduction 

A compliant manipulator is a motion system that consists of a compliant mecha- 
nism driven by high-resolution positioning actuators. Unlike traditional rigid-link 
mechanisms, a compliant mechanism gains its mobility from the deflection of 
flexible members (Howell 2001), which is termed as the flexure joints. Using the 
advantages of elastic deflection, a flexure joint overcomes the limitations of a 
conventional bearing-based joint such as dry friction, backlash, and wear and tear 
(Smith 2000). Consequently, a compliant mechanism offers high repeatable motion 
due to the frictionless characteristics of these flexure joints. Driven by high- 
resolution positioning actuators such as the piezoelectric (PZT) solid-state actuators 
and the electromagnetic (EM) actuators, the compliant mechanisms become high- 
precision manipulators that are ideal candidates for micro-/nanoscale positioning 
tasks (Teo et al. 2010a; Yang et al. 2008; Ryu et al. 1997; Lee and Kim 1997; 
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Fig.l (a) A 3 -DOF out-of-plane motion compliant manipulator and (b) a 3 -DOF in-plane motion 
compliant mechanism developed by SIMTech 

Jywe et al. 2008). Figure la shows an example of a compliant manipulator 
developed by the Singapore Institute of Manufacturing Technology (SIMTech) to 
automate the out-of-plane alignment and imprinting tasks for a Nanoimprint 
Lithography (NIL) process. This three degree-of-freedom (DOF) compliant manip- 
ulator is driven by Lorentz-force EM actuators and has achieved a positioning and 

z 

angular resolution of ±10 nm and ±0.05 arcsec (±0.242 x 10 rad), respectively, 
throughout a workspace of 5° x 5° x 5 mm. 

Compliant manipulator can be classified as a partially or fully compliant manip- 
ulator. A partially compliant manipulator has limbs that consist of both rigid bodies 
and flexure joints, while a fully compliant manipulator has continuous flexible 
limbs. For example, the manipulator shown in Fig. la is considered as a partially 
compliant manipulator since each limb is articulated by a combination of rigid 
bodies and flexure joints. On the other hand, the mechanism shown in Fig. lb is 
considered as a fully compliant manipulator because each limb is formed by a 
continuous flexure joint without the need of rigid bodies. Other than gaining 
performance, these compliant manipulators also benefit from the simple construc- 
tions of the flexure joints that lead to reduction of parts and assembly time. Taking 
the spatial joint compliant module shown in Fig. 2a as an example, it is a single 
monolithic-cut joint that has significantly less parts as compared to a bearing-based 
spherical joint as shown in Fig. 2b. Having less parts may shorten the manufactur- 
ing and assembly time and eventually reduce cost. In addition, parts reduction may 
also reduce assembly errors, which often affect the accuracy and motion repeat- 
ability of a manipulator. 

Among various motion systems that utilize noncontact bearings such as the 
compliant, air, magnetic, fluid, and ultrasonic bearings, the compliant manipulators 
are cost-effective and maintenance-free because the flexure joints do not require 
any electrical/fluid/air source, actuation, sensor, and complex control system to 
function as a noncontact bearing. Being maintenance-free is a significant merit 
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Fig. 2 (a) A spatial joint compliant module developed by SIMTech and (b) a bearing-based 
spherical joint from HEPHAIST SEIKO (S. HEPHAIST 2014) 

because the compliant manipulators could operate in harsh environments that may 
damage or degrade the joints. For example, flexure joints made of Teflon could be 
used in chemical solutions and even space exploration systems since no lubrication 
is required. The frictionless characteristics of a compliant manipulator also suit the 
clean vacuum environment since no particles will be generated through friction. 
Most importantly, reduction in part counts coupled with the simple constructions of 
flexure joints becomes an attractive solution for developing microscale compliant 
manipulators such as Micro-Electro-Mechanical Systems (MEMS) and macro-/ 
microscale nanopositioner (Chen and Culpepper 2006). Figure lb shows a 3 -DOF 
macroscale fully compliant mechanism that benefitted from the simple construction 
of the flexure joints, which allow the mechanism to be fabricated using polymer 
material for harsh environment usage. 

Although a compliant manipulator has an abundance of benefits, the usage of 
elastic deflection from the flexure joints is also accompanied by limitations. As all 
flexure joints are required to deflect or bend within the elastic region of the 
materials, the deflection of these flexible members limits the motion of the com- 
pliant manipulator. For example, a flexure joint cannot produce the continuous 
rotation motion of a ball-bearing rotary joint. In general, the workspace of a typical 
compliant manipulator is limited to a few millimeters and degrees as the stress 
concentration of each flexure joint must not exceed the yield strength of the 
material. Other than stress-strain characteristics of the materials, the force- 
displacement (or stiffness) characteristics of a flexure joint are also crucial to the 
development of a compliant manipulator. Accurate prediction of the stiffness 
characteristics requires in-depth knowledge of the principal of mechanics, deflec- 
tion theory, mechanism synthesis, and synthesis methods. Yet, from the recent 
advancements in compliant manipulator, flexure joints are tasked to produce larger 
deflections, which exhibit nonlinear behavior. Hence, classical beam equations 
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derived from the small deflection theory are no longer valid. Even with sufficient 
knowledge, manufacturing tolerances can easily result in uncertainties to the actual 
performance of the compliant manipulator. 

Dealing with flexure-based (or compliant) bearing involves the transfer or 
transformation of stored energy from input to output. To overcome the stored 
energy, the energy used to create a displacement tends to be higher as compared 
to other noncontact bearings. Under significant stress, prolonged stress, or constant 
exposure to high temperature, the stored energy will also cause a certain degree of 
hysteresis in the stress-strain characteristics resulting in creep effects. Although 
stored energy can be reduced by lowering the displacement stiffness, this approach 
further reduces the off-axis stiffness of the compliant manipulator, which is rela- 
tively lower as compared to motion systems using other noncontact bearings. Thus, 
the compliant manipulators are not suitable for carrying high payloads and any 
accidental overload may lead to instabilities, e.g., buckling. Fatigue is another 
crucial factor that determines the reliability of a compliant manipulator since the 
flexure joints are deflecting cyclically with constant load. Unfortunately, existing 
theoretical model, which predicts the fatigue life, is only applicable to standard 
specimen shapes, e.g., rectangular and circular. In addition, the material properties, 
geometries/dimensions of the flexure joints, and types/amount of loadings have 
significant effects on the fatigue life. As a result, tedious and meticulous evaluations 
are required to determine the life span of any compliant bearing in performing its 
prescribed tasks. 

This chapter introduces the essence of compliant manipulators by presenting 
several fundamental topics that are associated with this advanced technology. After 
the brief history on its evolution and applications, the first topic covers the principal 
of mechanics, which is used to explain the limitations of these manipulators. The 
next topic introduces the fundamental design concepts that are typically used to 
synthesize the compliant mechanism. Subsequent topic covers various theoretical 
modeling approaches that are generally used to predict the deflection stiffness of 
flexure joints and the stiffness characteristics of the compliant manipulators. Fol- 
lowing these fundamental topics, the material properties, the types of fabrications, 
and the manufacturing limitations are discussed. The topic on actuation, sensing, 
and control summarizes the types of high-resolution actuators and sensors which 
the compliant manipulators use to achieve high-precision positioning performance. 
Fastly, the future advancement of the compliant manipulator technology is 
presented. 

Brief History 

Implementation of flexure joints into precision machines can be dated as early as 
1 826, when metal strips were first used to replace torsional members of a classical 
torsion balance to increase its precision of measuring fine torque when subjected to 
mechanical or electrical loads (Jones 1962). Absence of the “sticking” effect made 
it possible to register very small changes in torque with meaningful observations of 
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Fig. 3 H. A. Roland with his 
ruling engine for diffraction 
gratings (Jones 1988) 



10 -9 rad change in orientations. A cross-strip hinge, which was later introduced to 
increase the stiffness of non-actuating directions, was well adopted by subsequent 
torsion balances. In 1902, such slender strips were used to support the ruling engine 
for grating diffraction (Fig. 3) so as to avoid the effects of friction (Jones 1988). 
This slender strip, which is termed as a leaf spring, is considered as the earliest form 
of a flexure joint. By the dawn of World War II, these shock-proof torsional leaf 
springs have been increasingly used in electrical instruments to replace jeweled 
pivots (Jones 1962). During the war, the flexure joints became widely used in 
developing highly sensitive measurement instruments such as highly accurate 
load cells for force measurement and the pendulum pivots of miniaturized force- 
balance accelerometers (Motsinger 1964; Tuttle 1967). 

After the war, the compliant mechanisms were widely used to develop metrol- 
ogy systems when precise positioning of optical lens or mirrors was needed (Jones 
1951, 1952, 1955, 1956). For example, the optical slit mechanism of an infrared 
spectrometer shown in Fig. 4a is a compliant mechanism. Using leaf springs to 
support the slit jaws, the resolution of the micrometer is directly translated onto the 
optical slit mechanism since there is no backlash between the micrometer and the 
jaws. Due to their ability to provide direct transmission between the input and 
the output, leaf springs were gaining popularity in the development of highly 
sensitive metrology devices. For example, Fig. 4b illustrates a Michelson interfer- 
ometer mirror positioner that is supported by a pair of leaf springs and Fig. 4c shows 
a seismograph that used these compliant bearings to achieve direct measurement of 
the amplitude of earthquakes without losses through friction. Using leaf springs as 
flexure joints requires additional assemblies that generally affect the precision of a 
mechanism. In addition, the leaf springs have poor stiffness in other non-actuation 
directions, which will further deteriorate precision when subjected to off-axis 
external loading. 

To address these issues, the notch hinge was introduced as an alternative solution 
for creating the compliant mechanism. A monolithic-cut approach was used to 
produce such notch hinges and the entire mechanism could be fabricated from a 
single piece of workpiece without involving any assembly. Figure 5 illustrates two 
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Fig. 4 (a) An optical slit mechanism, (b) a parallel spring mechanism for positioning the 
Michelson interferometer mirror, and (c) a gravimeter-vertical seismograph (Jones 1952) 
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Fig. 5 A leaf- spring compound linear spring mechanism versus a monolithic compound linear 
spring mechanism (Jones 1988) 


compound linear spring compliant mechanisms constructed by the leaf springs and 
notch hinges, respectively. The notch hinges were localized flexure joints that 
formed only a small portion of the entire mechanism. As a result, the off-axis 
(or non-driving directions) stiffness of the monolithic compound linear spring 
compliant mechanism is more superior than its leaf spring counterpart. In addition, 
the notch hinges possess limited deflections, which exhibit linear characteristics. 
Hence, well-established mechanics theory was used to predict the deflection 
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Fig. 6 (a) A 1-DOF compliant stage developed by SIMTech (Ho et al. 2004) and (b) a 2-DOF 
nested compliant stage developed by NIST (Boone et al. 2002) 

stiffness of these notch hinges. Monolithic-cut fabrication also reduced the accu- 
mulative assembly errors that could potentially cause the motion of the compliant 
mechanism to be indeterministic. Thus, compliant mechanisms that used notch 
hinges were more predictable than those with leaf springs. 

Between the late 1960s and the early 1990s, compliant mechanisms were mainly 
developed via the notch hinges and many were employed as subnanometer posi- 
tioning stages for laboratory usage (Paros and Weisbord 1965; Deslattes 1969; 
Haberland 1978; Becker et al. 1987; Smith et al. 1987; Nishimura 1991). The 
architectures of all these compliant stages mainly evolved from a parallel linear 
spring concept that resembles the classical four-bar linkage mechanism. To double 
the displacement range, a compound linear spring concept, which is formed by 
connecting a pair of linear spring in series, was introduced. Examples of the 
compound linear spring compliant mechanisms were illustrated in Fig. 5. To further 
enhance the robustness of these stages towards external disturbance, a symmetrical 
double compound linear spring concept was introduced. Figure 6a shows an 
example of a single DOF compliant stage that was developed via a double com- 
pound linear spring concept and notch hinges. Driven by a PZT actuator, it has a 
positioning resolution of 50 nm over a traveling range of 100 pm and was used for 
wafer-bump inspections (Ho et al. 2004). 

To obtain a 2-DOF motion, the simplest approach was to stack a 1-DOF 
compliant stage on top of another in series. Another approach was to nest a 
1-DOF compliant stage within another stage. This approach was used to develop 
several multi-DOF nanopositioning stages (Her and Chang 1994; Gao et al. 1999). 
Figure 6b shows an example of an X-Y translational optical lens steering stage 
developed by the US National Institute of Standard and Technology (NIST) for 
space communication purposes (Boone et al. 2002). This 2-DOF PZT-driven 
compliant stage was developed based on a nested approach where the y-axis 
stage is embedded inside the x-axis stage. Precisions of these stacked and nested 
compliant stages are usually affected by the accumulative positioning errors. The 
responses of these stages are also slow because the lower stage needs to carry the 
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Fig. 7 Planar motion compliant manipulators that were used as (a) a wafer positioning stage (Ryu 
et al. 1997) and (b) a positioner of SEM (Yong and Lu 2009) 


inertial mass of the upper stage via the stacked approach and the outer stage carries 
the inertial mass of embedded stage for the nested approach. Inertial masses of the 
upper and embedded stages from both approaches also caused these 2-DOF com- 
pliant stages to have nonsymmetric natural frequencies. 

Towards the late 1990s, the parallel-kinematic architectures were widely 
adopted by modern compliant manipulators to achieve a higher precision and better 
performance in multiple DOF manipulations. This architecture plays an important 
role in the success of these manipulators due to its advantages of a lower inertia, 
programmable centers of rotations, superior dynamic behavior, and insensitivity to 
external disturbances, e.g., thermal expansion. Most importantly, the limited deflec- 
tions of flexure joints also suit the limited workspace characteristics of a parallel- 
kinematic architecture. Figure 7a shows an example of a planar motion compliant 
manipulator developed based on a parallel -kinematic architecture. Being used as an 
X-Y-0 z wafer positioning stage, this compliant manipulator delivers a positioning 
resolution of 8 nm along the x- and y-axes and a rotational resolution of 0.057 
arcsec about the z-axis throughout a workspace of 41.5 pm x 47.8 pm x 322.8 
arcsec (Ryu et al. 1997). Another form of planar motion compliant manipulator was 
developed based on a three-legged revolute-revolute-revolute (3RRR) parallel- 
kinematic architecture (Yi et al. 2003). This compliant manipulator was used for 
positioning the wafer and delivers an X-Y translational workspace of 100 pnT with 
a rotational range of 0.1°. A similar concept was found in another X-Y-6 Z compliant 
manipulator as shown in Fig. 7b, which was used as a precision stage for position- 
ing the samples within a Scanning-Electron-Microscope (SEM) machine (Yong and 

Lu 2009). 

With the advancement of electrical discharged machine (EDM) in the 1990s, the 
dimensions of flexure joints can be fabricated more precisely. Hence, the modern 
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Fig. 8 (a) Delta compliant manipulator developed by EPFL (Henein 2000) and (b) a 3 -axes 
translational motion compliant manipulator developed by NTU (Pham and Chen 2005) 


flexure joints played an important role in realizing various types of spatial motion 
compliant manipulators (Han et al. 1991; Hudgens and Tesar 1988; Seugling 
et al. 2002; Portman et al. 2000; Canfield et al. 2002; Wang et al. 2003a; Mclnroy 
et al. 1999; Mclnroy and Hamann 2000). One example of these manipulators is the 

o / 

Delta developed by Ecole Poly technique Federale de Lausanne (EPFL) as shown 
in Fig. 8a. This X-Y-Z spatial motion compliant manipulator was constructed based 
on the kinematics of a Delta robot where each limb is formed by three rigid links 
coupled together via the universal flexure joints (Henein 2000). Driven by Lorentz- 
force EM actuators, this 3-DOF compliant manipulator delivers a positioning 

Q 

repeatability of 100 nm within a workspace of 1 cm . Figure 8b shows another 
interesting 3 -axes translational motion compliant manipulator that was developed 
by Nanyang Technological University (NTU). Other than using a three-limbed 
parallel configuration to construct this compliant manipulator, each macroscale 
compliant limb is articulated by a 3RRR parallel-kinematic architecture. As a 
result, all three translational axes are kinematically decoupled and hence each 
PZT actuator will only generate a single-axis motion. 

Using six sets of spherical-prismatic-spherical (SPS) serially connected compli- 
ant limbs, a compliant Steward platform was developed by Shizuoka University 
(SU) as shown in Fig. 9a. This 6-DOF compliant manipulator obtained a transla- 
tional accuracy of 160 nm and rotational accuracy of 2 p rad (Oiwa and Hirano 
1999). Due to the limited displacement stroke of PZT actuators, it could only 
achieve 10 pm in all translational directions, 200 prad about the x- and y-axes, 
and 100 prad about the z-axis. Figure 9b shows another example of a 6-DOF 
compliant manipulator developed by Massachusetts Institute of Technology 
(MIT). It was constructed based on a hybrid concept by stacking an X-Y-6 Z 
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Fig. 9 (a) A compliant Steward platform developed by SU (Oiwa and Hirano 1999) and (b) 
a hybrid 6-DOF compliant manipulator developed by MIT (Zuo et al. 2003) 
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Fig. 10 (a) A MEMS-based micro-actuator from MEMS and Nanotechnology Exchange and (b) 
a 6-axes microscale nanopositioner developed by MIT (Zuo et al. 2003) 

compliant mechanism on top of a Z-6 x -6 y compliant mechanism (Zuo et al. 2003). 
As a result, the planar motion is decoupled with the out-of-plane motion. Driven by 
three PZT actuators, the planar motion workspace is 140 pm x 140 pm x 7.6°. The 
out-of-plane workspace was generated by three hybrid PZT-EM stepper actuators 
and achieved a workspace of 5 mm x 2.4° x 2.4°. 

Today, the compliant manipulators can be fabricated in the micron level through 
the semiconductor photolithography process. Driven by electrostatic actuation, 
thermal actuation, and even microscale gears, these compliant manipulators 
become MEMS devices such as microsensors and micro -actuators. Figure 10a 
shows a MEMS-based micro-actuator developed by MEMS and Nanotechnology 
Exchange. The compliant mechanism adopts a double compound linear spring 
concept with slender hinges with the translator driven by a pair of electrostatic 
actuators. The latest and most noticeable development of microscale compliant 
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manipulator comes from MIT, Precision Systems Laboratory. The research group 
presented a 6-axes microscale nanopositioner, termed //HexFlex, based on a fully 
compliant design concept as shown in Fig. 10b. The //HexFlex consists of a central 
stage that is connected to the base via three parallel flexure joints where each joint is 
driven by two microscale thermal mechanical actuators. As each thermal actuator 
has 2-DOF actuation, i.e., in-plane and out-of-plane deflections, a combination of 
six actuators delivers a 6-DOF motion to the system. //HexFlex registered a 

Q 

positioning error of ±10 nm over a workspace volume of 8.4 x 12.8 x 8.8 pm 
and 19.2 x 17.5 x 33.2 prad for the x-y-z axes and the Ox-Oy-Oz , respectively. 


Principles of Solid Mechanics 

A compliant manipulator is articulated by the flexure joints that are considered as 
“springs” with high stiffness ratios. The basic working principles of these flexure 
joints are elastic bending and torsion. The advantages of elastic bending or torsional 
motion include frictionless, contactless, and non-hysteresis characteristics. The 
disadvantages include limited deflection, limited load capacity, and fatigue. To 
give a better understanding of the limitations of the flexure joints, this section 
reviews the characteristics of the elastic bending and torsional motion based on the 
principal of solid mechanics. 


Strength and Stiffness 

The strength and stiffness offer different insights to the deflection of the flexure 
joint. Strength determines the stress a deflected flexure joint can withstand before 
failure and is associated with the property of the material. On the other hand, 
stiffness determines how much a flexure joint deflects due to a load. Based on the 
Bemoulli-Euler law, the bending moment is proportional to the beam curvature 

dO 

M = El — (1) 

ds 

where M represents the bending moment, dOlds is the rate of change in the 
deflection angle along the curvature of the beam, and El represents the bending 
rigidity with E representing the Young’s modulus (modulus of elasticity) of the 
material and I is the cross-sectional moment of inertia (the second moment of area). 

The bending rigidity is a function of its material properties and the geometries 
due to the presence of the Young’s modulus and the cross-sectional moment of 
inertia. Figure 1 1 illustrates a cantilever that is made of an isotropic material with 
equal Young’s modulus and strength in all directions. When subjected to a load 
along the x-axis, E x , the bending rigidity of the cantilever beam is governed by EI yy . 
When subjected to a load along the y-axis, E y , the bending rigidity will be governed 
by EI XX . As a result, the bending rigidity along the x-axis is much stiffer as 
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Fig. 1 1 A cantilever beam 
subjected to two independent 
loads along the x- and y-axes 



b - width 
h = thickness 
l = length 



compared to the bending rigidity along the y-axis. This example explains that even 
with similar Young’s modulus and strength in both directions, the stiffness between 
both directions may not necessarily be the same. It also highlights that geometry has 
crucial influence on the stiffness characteristics of a flexure joint. In addition, 
different materials will affect its flexibility due to the variations in the Young’s 
modulus. For example, an aluminum (E = 71 GPa) beam will be approximately 
3 times more flexible than a steel (E = 210 GPa) beam, while the flexibility of a 
Teflon (E = 0.5 GPa) beam will be 142 times higher than the aluminum beam. 
These comparisons also highlight the importance of material selection in the flexure 
joint design. 


Stress Failure 

In general, the deflection of a flexure joint is limited by the bending or torsional 
stress. As these joints only operate within the elastic region, the yield strength of the 
material becomes the maximum allowable stress and the maximum stress generated 
via the deflection must be kept below it. In theory, the maximum stress, o max , due to 
bending is given as 


Gnax 




where c represents the location of the neutral axis from the loading point. By 
substituting Eq. 2 into Eq. 1 and let c — /?/ 2, the maximum stress due to bending 
moment about the x-axis is expressed as 


d0 x 

ds 

By referring to Eqs. 2 and 3, the maximum bending stress is proportional to the 
maximum bending moment and geometry also plays an important role in reducing 
the stress. In addition, selecting materials with low Young’s modulus could also 
reduce the stress. Yet, materials with low Young’s modulus would have lower yield 
strength. Based on a stress-strain curve shown in Fig. 12, the Young’s modulus of a 
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Fig. 12 Stress-strain curve of a ductile material 


material is calculated based on the linear stress-strain relationship, while the yield 
strength is beyond the proportional limit. Hence, choosing a material with lower 
Young’s modulus will effectively lower the maximum allowable stress. Yet in 
some cases, higher Young’s modulus is desirable for achieving higher stiffness 
characteristics. Consequently, the major challenge in designing a flexure joint is to 
obtain the desired deflection that could fulfill the function of the compliant mech- 
anism while maintaining the stresses well below the yield strength of the material. 
Table 1 summarizes this section by listing some materials that are commonly used 
to develop the compliant mechanism. 


Fatigue Failure 

Any flexure joint that flexes to deliver motion is subjected to fatigue failure. The 
fatigue life of any material is usually presented in an S-N diagram (Woehler 
strength-life diagram) as shown in Fig. 13. From this S-N diagram, the number of 
cycle can be classified into three regions, i.e., low cycle, high cycle, and infinite life 
(Howell 2001). For the low cycle category, the fatigue failure usually occurs 
between 1 and 1,000 cycles. As for the high cycle category, the fatigue failure 
typically occurs beyond 1,000 cycles. The infinite life region is for the flexure joints 
that required to flex constantly and only applies to some materials that do not fail 
regardless of the number of cycle. 

The ultimate strength of the material is represented by S ut and Sf represents the 
fatigue strength of the material. The first limit that bounds the low cycle region is 
represented by S L , while the second limit that bounds the finite life is known as the 
endurance limit, S e . This limit is common in many low-strength carbon and alloy 
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Number 
of cycle 


Fig. 13 S-N curve of steel material 


steels, some stainless steels, irons, molybdenum alloys and titanium alloys, and 
certain polymers (Dowling 1993). If the stress is kept below the endurance limit, 
continuous cycle without fatigue failure is possible and the flexure joint has infinite 
life. From past literature (Juvinall 1967), S L can be approximated as 


where 


$L Cf $iit 

( 0.9 bending 
(0.75 axial loading 


( 4 ) 

( 5 ) 


For low cycle fatigue estimation, the maximum stress must not exceed S L . For 
high cycle fatigue estimation, Sf can be approximated as 



where 



Based on Eq. 6, the number of cycle can be estimated by assuming that o max = Sf 
For materials without endurance limit, af and bf are expressed as 



(log Nf (log CfS ut ) - 3 log % 

log N 2 — 3 



1 

3 - log N 2 




( 8 ) 
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Table 1 Initial approximation values of S' e and Sf for some materials 


Material 

Classification 

Values 

Conditions 

Steel 

Endurance limit 

S'e ~ 0.5S m? 

Sut < 1 ,400 MPa 



S'e « 700 MPa 

Sut > 1 ,400 MPa 

Iron 

Endurance limit 

S'e ~ 0.4 S ut 

Sut < 400 MPa 



S'e ^ 160 MPa 

S ut > 400 MPa 

Aluminum 

No endurance limit, fatigue estimation 
for 5 x 10 8 cycles 

S} « 0.4S ut 

S ut < 330 MPa 



Sf w 130 MPa 

S ut > 330 MPa 


Table 2 Curve-fitted 
a and b parameters for 
surface finishing 

Surface finish 

a (MPa) 

b 

Ground 

1.58 

-0.086 

Machined, cold rolled 

4.45 

-0.265 


Hot rolled 

58.1 

-0.719 


As forged 

271 

-0.995 

where S fl - CjS ut , /V, - 

= 1 X 10 3 , Sf 2 - S e , and N 2 ~ 1 

x 10 . For materials with 


endurance limit, S e obtained through these specimen tests are termed as uncorrected 
endurance limit, S' e . For materials without endurance limit, the Sf obtained via the 
specimen tests are known as uncorrected fatigue strength, Sf. As information is 
often unavailable, Table 1 listed some initial approximations of S' e and Sf for some 
materials that are useful for estimating the fatigue life (Norton 2000; Shigley and 
Mitchell 1983; Forrest 1962). 

The initial approximation values can be used to predict S e and Sf (Shigley and 
Mischke 2001; Marin 1962) using 



Gur fC size Goad Geliab 5^ 




C sur f C size £ load Geliab Sf 



where c sur f represents the Marin correction factor for surface finishing, c size for 
size, Ci oad for loading, and c re ii a b for reliability. c sur ^ can be approximated as 



aS h ut if aS b ut < l 
1 if aS b ut > 1 



where the values of a and b are listed in Table 2. c size and c reliab are listed in Tables 3 
and 4, respectively. From past literature (Norton 2000), c load = 1 for bending load, 
Cioad = 0-7 for axial load, and c load = 0.577 for torsion and shear load. 
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Table 3 Marin correction factors for size 


r 

^size 

a Conditions dependent on the diameter of the sample, d 

l 

For d < 2.79 mm 

( d \ —0.1133 
V0.3/ 

If d is in inches and 0.1 1 < d < 2 in. 

( d \ -O.H33 
V7.62/ 

If d is in millimeters and 2.79 < d < 51 mm 

0.6 

For d > 51 mm 


a For rectangular shape subjected to zero-torsional bending, d = 0.808 y/bh 


Table 4 Marin correction factors for reliability for steel material 


Reliability (%) 

50 

90 

99 

99.9 

99.99 

99.9999999 

^reliab 

1.000 

0.897 

0.814 

0.753 

0.702 

0.520 


"'Assuming a standard deviation of 8 % 


Theoretical Modeling Approaches 

Many theoretical models and modeling methods were introduced over the past 
50 years due to the continuous evolution of the flexure joints and the applications of 
the compliant manipulators. From the very beginning, modeling of the flexure 
joints from classical bending-moment equation was sufficient when the flexure 
joints were expected to deliver small deflection motions. As the desire for large 
deflection increased, analytical models focusing on nonlinear deflection behavior of 
the flexure j oints were introduced by the late 1960s. When high-precision compliant 
manipulators were needed in the semiconductor industry in the 1990s, the evolution 
of notch-hinge flexure joints spurred the efforts in finding more accurate modeling 
approaches. These efforts continue till today due to new beam-based flexure joints 
that were introduced for delivering large deflection motions. After understanding 
the limitations of the flexure joints and the design constraints, this section presents a 
comprehensive library of theoretical models and modeling methods that can be 
useful for modeling different types of flexure joints. 


Small Deflection Theorems 


Leaf spring is considered as the earliest form of a flexure joint. It can be modeled as 
a cantilever beam based on the Bernoulli-Euler law. From Eq. 1, the beam curvature 
due to a bending moment can be represented in rectangle coordinates 




d 2 y / dx 2 
f{x,y) 


( 12 ) 
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Fig. 14 (a) A beam that is subjected to an end load and (b) delivers a pure translation motion 


o Q /O 

where /(x, y) = [1 + ( dy/dx ) ] . Based on small deflection assumption, the square 

of slope, (dy/dx) 2 , is approximated as zero. This assumption allows /(x, y) = 1 and 
leads to the classical beam-moment-curvature equation given as 

d 2 y 

M = £/-| (13) 


For a cantilever beam subjected to an end load shown in Fig. 14a, the summation 
of moment gives M = P(l — x). By solving Eq. 13 with M = P(l — x), the maximum 
deflection along the y-axis occurs at x = / and is expressed as: 



max 


pi 3 

3£7 



By substituting Eq. 2 into Eq. 14 with M = PI and c = hi 2, the maximum stress is 
given as 


^max 


3E7) h 


21 



For pure translation motion as shown in Fig. 14b, the summation of moment gives 
M = P(l — s — x). By solving Eq. 13 with M = P(l — s — x), the maximum translation 
motion along the y-axis, which occurs at x = / and s = // 2, is expressed as 



max 


pi 3 

12EI 



Using Eq. 2 with M = PI and c = h[ 2, the maximum stress is given as 


^rnax 


3 Ffi h 

u mdiX n 


i 



The pure translation motion equation expressed in Eq. 16 is useful for finding the 
translation stiffness of a parallel linear spring mechanism shown in Fig. 15a. To 
avoid parasitic torsional motion, this compliant mechanism employs two parallel 
leaf springs to achieve a pure translation (or prismatic) motion along the y-axis. 
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Fig. 15 (a) Parallel linear spring mechanism and its (b) front view and (c) side view 


Based on the same notations used to define the geometries of a cantilever beam 
shown in Fig. 11, the translation stiffness along the y-axis, K$ p 9 which is twice of 

Eq. 16, is expressed as 



To achieve maximum translation motion from a given load, P y , the loading point 
must be located at // 2 away from the base as illustrated in Fig. 15b. Although the 
stiffness is doubled due to the parallel configuration, the amount of deflection 
remains unchanged and the maximum stress is similar to a spring leaf-spring 
configuration, which is given in Eq. 17. Based on Eq. 18 with / vv , the translation 
along the x-axis, K §x p x , is expressed as 



The torsional stiffness about the x- and z-axes is expressed as 




( 20 ) 

( 21 ) 


Any translation motion provided by the linear spring mechanism will be accom- 
panied by a parasitic height variation, X. Figure 16a illustrates a compound linear 
spring mechanism that can eliminate or minimize this error. Using two linear spring 
mechanisms connected in series, the parasitic height variation of the moving 
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Fig. 16 (a) Compound linear spring mechanism and (b) double compound linear spring 
mechanism 


Table 5 Translation stiffness along the y-axis and maximum stress for different types of 
compound linear spring mechanisms 


Type 

Translation stiffness, K§ p 

Maximum stress, cr max 

Single 

12 El 

P 

3 E8 max h 

2 P 

Double 

24 El 

P 

3 ES max h 

2 P 


platform can be canceled out by the parasitic height variation of the intermediate 
platform. Due to the series connection, the translation stiffness is half of the linear 
spring mechanism. Figure 16b shows a double compound linear spring mechanism, 
which is a symmetrical concept of a compound linear spring mechanism. It was 
introduced to obtain superior rectilinear motion and reduce sensitivity to external 
disturbance via the symmetrical concept. Hence, the translation stiffness is twice of 
the single compound linear spring mechanism. Table 5 lists the translation stiffness 
along the y-axis and maximum stress for both types of compound linear spring 
mechanisms. 

The discovery of notch hinge allows the compliant mechanisms to be fabricated 
in the monolithic (single piece) forms where no assembly is required. Hence, 
assembly errors can be minimized or avoided to make the compliant manipulators 
more deterministic. The simplest form of a notch hinge shown Fig. 17a has a 
circular shape profile, which incorporates a circular cutout on both side of a blank 
to form a necked-down section. This necked-down section, which serves as a fixed 
center of rotation, exhibits a pure rotational motion within a small dedicated range. 
In 1965, Paras and Weisbord (1965) presented a complete analysis of such notch 
hinges. 

Assuming that the ratio h/(2R + t) is near to unity, which makes the notches 
nearly semicircular, the angular stiffness of the notch hinge is expressed as 
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Fig. 17 Three types of notch flexure joints: (a) a circular shape, (b) an elliptical shape, and (c) a 
corner-filleted shape 



M- _ 2Ebf! 2 
0- ~ 9 7iR l/z 



For circular shapes defined by t < R < 5 t, the angular stiffness is expressed as 



where the correction factor, k , is given as 





+ 0.166 


The maximum stress is determine by 



+nax 


k t E t 
4 kR 


0 


max 


where the stress correction factor, k h is given as 




2.1t + 5AR 
8 R + 1 


+ 0.325 



The monolithic translation motion compliant mechanisms shown in Fig. 18 are 
constructed via the circular notch-hinge flexure joints. Unlike the leaf-spring 
versions, each limb is formed by a pair of notch-hinge joints connected in series. 
Table 6 lists the translation stiffness and the maximum stress of various forms of 
notch-hinge type of linear mechanisms. 

The circular- shaped notch hinges usually lead to high stress concentrations 
during operations. Subsequently, various kinds of shapes were explored to avoid 
such high bending stresses (Xu and King 1996; Tseytlin 2002; Lobontiu et al. 2002; 
Lobontiu et al. 2004; Yong et al. 2008). Examples of these flexure joints include the 
elliptical shape shown in Fig. 17b and the comer-filleted shape hinge shown in 
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Fig. 18 Notch-hinge types: (a) linear spring mechanism and (b) single and (c) double compound 
linear spring mechanisms 


Table 6 Translation stiffness and maximum stress for different notch-hinge type mechanisms 


Type 

Translation stiffness, Kg v p 

Maximum stress, c max 

Linear spring mechanism 

Ebt 3 
6 kX 2 R 

kfEtS max 
4 kRX 

Single compound linear 
spring mechanism 

Ebt 3 

kfEt8 max 

\2k£ 2 R 

8 kRX 

Double compound linear 
spring mechanism 

Ebt 3 
6 kX 2 R 

kfEtS max 
8 kRX 


Fig. 17c. Based on past literature (Lobontiu 1962), the angular compliance of the 
elliptical notch hinge can be expressed as 



Q z _ 12 / 

mT _ Ebt 2 {2c + t)(8c 2 + t 2 ) 


6c 2 + 4ct + t 2 + 


6c(2c + 1) 2 I 4c 

— . arctan \ l -\ 

y/t(4c + T) V t 



For the comer-filleted notch hinge, the angular compliance is given as 

12 f 2r 1 

Cqm, — _, T o \ 1 — 2r -| t / 

Ebt y (2r + t)(4r + t) 3 J 

t(4r + o ( 6 >- 2 4- 4 rt + / 2 ) + 6r(2r + t) 2 \J /(4r + t) arctan 

(28) 

Many studies were conducted to find the optimal elliptical and comer-filleted 
shapes. Figure 19 plots the results obtained from one investigation (Henein 2006) 
that presented one graph that plots the stresses obtained from the elliptical shape 
hinge and another from the comer-filleted shape hinge, respectively. To avoid stress 
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Stress from elliptical shape 



Stress from corner-filleted shape 




Fig. 19 


Stress concentration studies conducted on various forms of notch hinges (Henein 2006) 


concentration, the results from the investigation suggest that the ellipse ration, r y /r x , 
of the elliptical shape hinge must not exceed 0.025. Investigation results also 
suggest that the fillet radius, r/f min , must maintain below two. By comparing the 
normalized stress level, o/Ea , between both graphs, the stress level of the optimized 
corner-filleted shape hinge is 10 % lower than the elliptical shape hinge of R = 2 t min 
and 5 times lower than a circular shape hinge. Yet, it is still 13 % higher than the 
ideal prismatic beam. 


Nonlinear Large Deflection Theorems 

As compliant manipulators progressed to larger displacement, the ideal beam shape 
became a promising solution due to its low stress but large deflection characteris- 
tics. However, delivering large deflection means that these flexure joints will 
experience parasitic shift in the pivot point and exhibit nonlinear characteristics. 
Figure 20a shows that with a fixed pivot point, the ideal deflection path is a 
concentric arc about the pivot point. Once there is a shift in the pivot point, the 
deflection path is altered causing a variation between the actual and targeted 
deflected position. This shift is commonly known as the parasitic shift. 

The effects of parasitic shift are accounted by the Bernoulli-Euler law expressed 
in Eq. 12. In small deflection theory, the slope dy/dx is assumed to be zero resulting 
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Fig. 20 (a) Concept of parasitic shift of center of rotation and (b) its effect on a cantilever beam 


Table 7 Effects of slope 
on f{x, y) 


dy/dx 

0 (deg) 

fU, y) 

0.01 

0.6 

1.0001 

0.10 

5.7 

1.0150 

0.50 

26.6 

1.3976 

1.00 

45 

2.8281 


in the derivation of the classical bending-moment equation. However, this assump- 
tion is invalid for large deflection analysis. Considering the/(x, y) within Eq. 12, 
Table 7 lists the values from fix, y) due to the changes of slope. When the deflection 
angle is small, the effects of fix, y) are negligible. At 26.6°, the value of fix, y) 
increases to 40 %. By 45°, the value of fix, y) increases up to 2.8 times higher than 
the initial value. Consequently, this investigation shows the importance of fix, y) as 
it will account for the nonlinearity behavior of the large deflection. 

For large deflection analysis, the angular compliance of a cantilever beam 
subjected to a moment loading at free end (Fig. 21c) can be derived directly from 
Eq. 1 and written as 



By applying the cross-product rule on Eq. 1, the deflection along the x- and 
y-axes can be expressed as 




/(I — COS0) 

6 


(30) 

(31) 


Equation 29 can be used to find the angular compliance of a 1-DOF cross-spring 
pivot shown in Fig. 21a by considering it as a pair of cantilever beams subjected to 
an external moment loading at free end. 

For analyzing a cantilever beam subjected to a perpendicular point loading at 
free end (Fig. 22a), the bending moment is expressed as 
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Fig. 21 (a) A 1-DOF cross-spring pivot that delivers (b) angular rotation via (c) large deflection 
of cantilever beams 




b Characteristic Characteristic 
\ pivot * radius 

— \ r~ yi -I 




Fig. 22 (a) A cantilever beam subjected to a perpendicular point loading at free end and (b) its 
corresponding representation based on PRB modeling 




= P(l — x — 8 X ) 


By integrating Eq. 32 by s yields 



dO IP 


ds 


El 


sin# 


o 


sin 6) 


1 

2 


Solving Eq. 33 with a 


pi 


and an assumption that Jo ds = l yields 


EI 






Elliptic integrals or numerical integration can be used to solve Eq. 34. As there 
are many examples and sources (Howell 2001; Frisch 1962; Byrd and Fredman 
1954), this section will not go further to obtain the closed-form solution. 
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Pseudo-Rigid-Body Model 


For large deflection analysis, the elliptic integrals or numerical integration can offer 
closed-form accurate solutions. However, it is observed that these methods are 
cumbersome during the design stage of compliant mechanism. To make analysis of 
large nonlinear deflection easy to predict yet accurate, Howell (2001) introduced an 
approximation method known as the Pseudo-Rigid-Body (PRB) modeling. 

Analysis of any flexure joint is based on a perception that deflection is generated 
with respect to a pivot point. In PRB modeling, this pivot point is predefined based 
on the types of loading and the nature of the flexure joint. Another uniqueness of 
this method is a torsional spring, K , which is attached to each pivot. This spring 
governs the torsional stiffness of the flexure joint. Revisit the problem of a canti- 
lever beam subjected to a perpendicular point loading at free end. Figure 22b shows 
the equivalent PRB model representation with a characteristic pivot and a charac- 
teristic radius that defines the deflection path. Based on PRB modeling (Howell 
2001), the deflection along the x- and y-axes can be approximated by 


if/ = l[l — y(l — cos0)] (35) 

S y = yl sin0 (36) 

where y = 0.85 for this specific case. For accurate prediction using these equations, 
the deflection angle must maintain below 64.3°. With the torsional spring governing 
the angular stiffness, the force and angle are related by an applied torque about the 
characteristic pivot. Hence, the relationship between the load and angle is given as 



KQ 

yl sin (<p — 0) 



where K is the spring constant and <p represents the angle of the load (e.g., vertical 
load gives <fi = |). The spring constant is expressed as 


El 

K = Pj (38) 

where = 2.25 for this specific case. For other cases of configurations or loading 
conditions, the PRB method offers specific representation, modeling approach, and 
parameters for each case. Some examples are shown in Fig. 23 where each individual 
configuration or loading condition has a specific PRB modeling representation. 

In the case of a small length flexure pivot configuration shown in Fig. 23a, the 
flexure joint is coupled with a rigid link to amplify the deflection. Based on the PRB 
modeling (Howell 2001), the deflection along the x- and y-axis can be approximated by 

¥ = ^ + ( L + 0 cos 0 


(39) 
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Fig. 23 PRB modeling representations for (a) a small length flexure pivot, (b) a fixed-guided 
cantilever beam, and (c) a cantilever lever beam subjected to a moment loading at free end 



where / represents the flexure joint length and L represents the rigid-link length. The 
relationship between the load and angle is given as 





where K is expressed in Eq. 38 with /? = 1. For accurate analysis, L / must be 
satisfied, e.g., L must be at least 10 times greater /. 

For a cantilever beam with pure translation motion, this case can be modeled as a 
fixed-guided cantilever beam shown in Fig. 23b. Based on the PRB modeling 
(Howell 2001), the deflection along the x- and y-axis can be approximated by 


if/ = l[l — y(l — cos0)] 

S y = yl sin 0 


(42) 

(43) 


where y = 0.8517 for this specific case with a constant vertical load and reaction 
moment. The spring constant for each torsional springs is given as 


K = 2 yK e — (44) 

where the characteristic stiffness, K 0 , is 2.67617 for constant vertical loading. 

In the case of a cantilever lever beam subjected to a moment loading at free end 
shown in Fig. 23c, the deflection along the x- and y-axis can be approximated using 
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Fig. 24 Different flexure configurations for different types of flexure joints found within one 
compliant manipulator 


Eqs. 35 and 36, respectively, with y = 0.7346. In this case, the spring constant is 
expressed in Eq. 38 with = 1.5164. There are other cases that have been presented 
in past literature (Howell 2001; Howell et al. 1996; Howell and Midha 1994). This 
section only provides a few cases and the remaining cases can be found from these 
sources. One important note is that the PRB modeling method offers simplistic and 
accurate solutions for analyzing various flexure configurations and loading condi- 
tions. Yet, the most significant contribution is that the PRB modeling method has 
linked the classical linkage mechanism and the compliant mechanism together. 
Using the unique concept of adding a torsional spring to each pivot point to describe 
the angular stiffness, the knowledge of rigid-body mechanism can be used to design 
a compliant mechanism. 


Semi-Analytic Model 

The PRB modeling method has specific locations to place the pivot points, specific 
values for the torsional spring constant, and specific representations for each flexure 
configuration and loading conditions. As a result, any misjudgment and inappro- 
priate selection of these PRB models often lead to inaccurate results, especially for 
large deflection analyses. Figure 24 shows different types of flexure joints found 
within a compliant manipulator. Analyzing each flexure joint with specific flexure 
configuration requires a person who is well verse in PRB modeling method. In 
addition, pairing a PRB model with a flexure configuration during the design stage 
is extremely restrictive and could lead to inaccurate analysis. This is because 
designing a flexure joint often goes through an iterative process of changing the 
geometries of the rigid links or the flexure joints to achieve desired stiffness and 
off-axis stiffness within a given size constrain. Considering the flexure joints with 
flexure configuration of L < l shown in Fig. 24, the configuration could have change 
to L < l or L = / during the design stage and different PRB models are required for 
different configurations. Hence, the main limitation of the PRB method is its 
inability to provide a simple and generic solution for all forms of flexure 
configurations. 
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Fig. 25 (a) A generic flexure configuration subjected to a loading at the rigid-link end and (b) 
a close-up on the large deflected flexure joint with parasitic shift in the pivot point 


A semi-analytic modeling method offers a generic, simple, and quick solution 
for analyzing the nonlinear characteristic of large deflection motion produce from 
any flexure configuration (Teo et al. 2010b). The term flexure configuration repre- 
sents a flexure joint coupled with a rigid link shown in Fig. 25a. The force, F, 
applied at the rigid-link end becomes a moment load at the end of the flexure joint 
shown in Fig. 25b. Based on the derivations presented in past literature (Teo 
et al. 2010b), the deflection of a generic flexure configuration along the driving 
direction can be approximated as 



where a represents the deflection angle and w is a Sine function, i.e., w = sin 610. 
Referring to Fig. 25b, this Sine function accounts for the parasitic shifting of the 
pivot point, PP r . Subsequently, the in-plane parasitic deflection perpendicular to the 
driving direction can be approximated as 


A »- ( l+ £) ~ ( L+ i) cos ^ (46) 

The angular stiffness is derived based on the hypothesis that the relationship 
between the applied torque and the deflection angle of the torsional spring is 
governed by a moment arm. This moment arm is formed by the rigid link and a 
portion of the flexure joint as shown in Fig. 26a. This portion of the flexure joint 
represents the distance from the center of the torsional spring to the coupling point 
between the rigid link and the flexure joint. This portion of the flexure joint is 
termed as the changing arm, S , and can be expressed as 



( 47 ) 
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3 Moment arm = L + S b 



Fig. 26 (a) Torsional spring with a moment arm that changes with respect to (b) different flexure 
configurations 


where p is introduced as 

ivn+L 

p ~ TTl 



From Fig. 26b, the changing arm varies according to different flexure configu- 
rations even with similar flexure joint lengths. To address this issue, p is an 
empirical factor that is used to determine the changing arm with respect to any 
flexure configuration. Subsequently, the changing torque, T 0 , is recognized as a 
tangential force, F t , applied to the moment arm and is expressed as 



The changing angular stiffness of a torsional spring is given as 




By substituting Eqs. 49 and 29 into Eq. 50, the relationship between the force 
and angle is expressed as 



In semi-analytical modeling, the changing angular stiffness of a torsional spring 

is expressed in Eq. 51. With TV = F sin (| — 0), the vertical force, F, applied on the 

moment arm can be determined directly based on a known deflection angle shown 
in Fig. 26a. Last but not least, the maximum bending stress, <7 max , is given as 


a 


max 


F 

1 



cos a 




The uniqueness of the semi-analytic modeling is that it is generic for all flexure 
configurations. In cases where there is no rigid link, L will be zero. For any other 
case, the presence of p accounts for the change in length of the moment arm. 
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Fig. 27 (a) A 5-DOF spatial compliant joint module formed by (b) a pair of parallel limbs 
connected orthogonally in series, (c) Representation of each flexure joint in a translational motion 

Together with Sine function, w , which governs the parasitic shift of pivot point, this 
method is a generic, simple, and quick tool for analyzing any flexure configuration. 


Stiffness Modeling 


Flexure joints are considered as spring members within a compliant mechanism. 
Hence, the moving platform of a compliant mechanism can be connected to a fixed 
base by a series of springs and parallel springs. When the springs are connected in 
series, the overall stiffness is expressed as 






or 

n 

Ctotal = ^2 Ci (54) 

i = 1 

where the compliance C = K~ l . When the springs are connected in parallel, the 
overall stiffness is expressed as 



Example. Apply this concept to model the translation stiffness of a 5-DOF spatial 
compliant joint module shown in Fig. 27a. This module is formed by two identical 
segments where each segment consists of two parallel limbs as shown in Fig. 27b. 
As each limb can provide 3 -DOF motions, i.e., translation, bending, and torsion, 
combining two segments in series produces a 5-DOF spatial motions, i.e., three 
rotational motions, W xy , z , and two translational motions, A Xyy . 
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Fig. 28 (a) A double compound linear spring module and (b) the schematic representation of this 
compliant joint module and (c) each parallel limb 


For each segment to deliver a pure translation motion, each limb deflects in an 
“S”-shaped form and can be represented as two identical flexure joints 
with individual length being // 2 and the deflection of each flexure joint is A/2 as 
shown in Fig. 27c. Hence, the translation stiffness of a flexure joint, K h is 
expressed as 


K, = 2 F - (56) 

A 

Based on semi-analytic modeling, the driving force, F , can be obtained from 
Eq. 5 1 with the deflection angle, 0 , deriving from Eq. 45 based on A = A/2, / — > //2, 
and L = 0. With each limb being formed by two identical flexure joints connected in 
series, the translation compliance of each limb, Ci imb , is 



As each segment comprises of two parallel limbs, the linear translation stiffness 
of the spatial joint module is 



Example. Considering the prismatic compliant joint module shown in Fig. 28a, it 
is formed by a compound linear spring module, which comprises of two parallel 
springs connected in series as shown in Fig. 28b. Each parallel spring is articulated 
by two parallel limbs and each limb comprises of two flexure joints connected 
together via a rigid link. Hence, each limb can be represented by two identical 
flexure configurations where each flexure joint is represented by a re volute joint 
with a torsional spring attached to it as shown in Fig. 28c. 
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The deflection of each flexure configuration is half of the desired deflection, i.e., 
AJ2. Therefore, the deflection stiffness for one flexure configuration along the 
z-axis, Kf c , is expressed as 

Based on semi-analytic modeling, the driving force, F z , can be obtained 
from Eq. 51 with the deflection angle, 0, deriving from Eq. 45 based on A = AJ. 2 
and L = LI 2. With two identical flexure configurations connected in series to form 
each limb, the deflection compliance of each parallel spring along the z-axis of each 
limb, C z , is 

’ limb ’ 

2 1 

C z = V — (60) 

limb / -J 

i= 1 K fc 

As each parallel spring is articulated by two parallel limbs, the linear translation 
stiffness along the z-axis, K z ps , is 

Ks = E (6D 

i=\ '-'limb 


With two parallel springs connected in series to form a compound linear spring, 
the translation stiffness of the compound linear spring along the z-axis, is 


1 


K 


AzFz 

total 



i= 1 


1 

K z ps 


(62) 


This spring-based modeling concept can be extended to model the stiffness of 
the entire compliant manipulator. Considering that a compliant manipulator com- 
prises of a moving platform supported by j number of parallel and symmetrical 
limbs, each limb can be formed by a group of compliant joint modules connected in 
series by the links as shown in Fig. 29a. For a link in Cartesian space, the moment 
vector, m, is a cross-product of the link vector, r, and force vector, f, expressed as 


m x 


r x 


fx 


"0 

~r z 

y y 


fx 


m y 


r y 

X 

fy 


r z 

0 

~r x 

• 

fy 

(63) 

m z 


r z 


fz 


_~fy 

r x 

0 


fz 



In addition, the linear displacement vector, dS , is a cross-product of the angular 
displacement vector, d6 , and the link vector expressed as 


dd x 


d0 x 


r x 


"0 

r z 

~ r y 


d0 x 


ddy 


d0 y 

X 

r y 


~r z 

0 

r x 

• 

d0 y 

(64) 

dd z 


d0 z 


r z 


Jy 

~r x 

0 


d0 z 



Within each limb, only the compliant joint module is assumed to be a spring 
member with a compliant matrix, C z , established at the local coordinate frame 
attached to it. To establish the compliant matrix of each limb, Ci imb , a Jacobian 
matrix, J z , that maps the local coordinate frame of each compliant joint module to 
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Fig. 29 (a) A limb, which is constructed from a serially connected compliant joint module, is 
used to (b) form the symmetrical limbs of a compliant manipulator 


the local coordinate frame of the tip of each limb is required. Based on Eq. 64, the 
displacement vector of each limb, X, can be expressed as 
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1 

1 
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(65) 


Based on Eq. 63, the force and moment applied to the end-effector, F, can be 
expressed as 
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As each limb is formed by the compliant joint modules connected in series via the 
links shown in Fig. 29a, the displacement of the limb is X limb = J i X j + J 2 x 2 + J3X3 + . . . 
based on Eq. 65, reexpressed as C^mbF = Jifi + J 2 f 2 + ^3 + . . . based on CF = X. As 
Eq. 66 can be reexpressed as f = J T F, the compliance matrix of each limb is given as 



From Fig. 29b, the end effector is supported by a group of parallel limbs. 
Based on Eq. 66 , the total force and moment applied to the end effector is 
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Fig. 30 (a) A compliant manipulator formed by three symmetrical parallel limbs where (b) each 
limb consists of a spatial and prismatic compliant joint module to support (c) a moving platform at 
three symmetrical corners which are 120° apart 


F = JF T fi + J 7 T f2 + J3" T f3+...., reexpressed as KX = + J2 T K 2 x 2 + J3" T K 3 x 3 

+ . . . based on F = KX. By reexpressing Eq. 65 into x = J“ 'X, the stiffness matrix of a 
compliant mechanism is given as 

n 

K = ^J- T K, mlb , ; J- 1 (68) 

j= 1 


Example. Consider the stiffness modeling of a 3 -DOF compliant manipulator 
shown in Fig. 30a as an example. This compliant manipulator is formed by three 
symmetrical parallel limbs and each limb consists of spatial and prismatic compli- 
ant joint modules, as shown in Fig. 30b. All three parallel limbs are placed 120° 
apart to support a moving platform with its end effector located at the center. 
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In this example, the compliance matrix of the prismatic compliant joint module, 
C A , and the spatial compliant joint module, C B , is given as 


3 

8 6.65e — 4_ 

(69) 


SYM 


0.338 

0 0.031 _ 

(70) 

Next, individual local coordinate frame is attached to the center of the prismatic 
compliant module and the tip of the flexure joint within the spatial compliant 
joint module as shown in Fig. 30b. These local coordinate frames have the same 
orientation but different offsets with respect to the local coordinate frame 
attached to the tip of the limb. Hence, the Jacobian matrix for each module, J m , is 
expressed as 


C 


A 


1.56e — 6 

1.41e — 7 3.33e — 7 
1.95e — 8 0.23e — 9 


0.15e - 9 
0.60e - 9 


0.25e - 9 


3.03e — 5 3.03e — 6 


3.41e — 4 


5.00e — 9 5.58e — 5 


1.30e — 9 


1.28e - 3 


1.70e — 9 3.95e — 8 


SYM 


4.61i 


0.01e-9 1.33. 


C B 


1.71e — 6 
1.03e — 6 
— 7.06e - 8 
— 3.19e — 5 
— 5.24e - 5 
0 


1.71e — 6 

— 7.06e - 8 4.80e - 9 
— 5.24e — 5 2.17e — 6 0.338 
— 3.19e — 5 2.17e — 6 9.81e-4 


0 


0 


0 



0 0 0 h m 0 

1 0 — h m 0 0 

10 0 0 

1 0 0 

zeros 1 0 

1 



where m represents A or B for respective modules. In this example, h A = 87 mm and 
h B = 19.5 mm. The compliance matrices C A and C B are assembled to form 


C total — diag( C A ,C B ) (72) 

In addition, the Jacobian matrices J A and J B are assembled into 



From Eq. 67, the compliance matrix of each limb is derived as 
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5 

3.10e — 

6 

— 1.30e 

— 

9 

— O.Ole — 9 

1.33e 


- 8 0.032 

(74) 

Prior to the derivation of stiffness matrix, the Jacobian matrix, J p that maps the 
local coordinate frame of each limb to this reference coordinate frame must first be 
established. Figure 30c illustrates that the local coordinate frame attached to the tip of 
each limb has different orientations and a linear displacement w.r.t. the reference 
coordinate frame attached to the center of the moving platform. Here, the reference 
coordinate frame attached to the center of the moving platform falls on the same plane 
as the local coordinate frame at the tip of each limb. Hence, there is no variation in the 
z-axis between both coordinate frames. The orientation of the local coordinate frame at 
the tip of each limb w.r.t. the reference coordinate frame in matrix form is expressed as 
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Y 



cos y/j 
sin y/j 


cos 
sin /?. 
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i 

i 

1 

£7 

i 


(75) 


where Pj = y/j + n ! 2 and j represents either 1, 2, or 3. Based on Eqs. 75 and 65, the 
Jacobian matrix that maps the local coordinate frame of each limb to the reference 
coordinate frame is 
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where y/\ 3tt/2, y/ 2 zr/6, i// 3 5zr/6, 6\ n! 2, 0 2 ln/6, 0 3 1 1 nj 6 and, in this 

example, r 64.83 mm. Based on Eqs. 68, 74, and 76, the stiffness matrix of the 

compliant manipulator is given as 
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( 77 ) 
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The obtained stiffness matrix was compared against the experimental values and 
results have demonstrated that the variation between the theoretical and experi- 
mental values is less than 15 %. This shows that this stiffness modeling approach is 
accurate and able to provide reasonable analysis on the stiffness characteristic of a 
compliant mechanism. 


Fundamental Design Concepts 

The most important design criteria of a compliant manipulator is to maximize the 
stiffness ratio, i.e., between the off-axis stiffness and natural stiffness, of the 
compliant mechanism. Consider that the natural stiffness quantifies the compliance 
in the desired driving directions while off-axis stiffness quantifies how stiff the 
mechanism is in non-driving directions. Hence, a compliant mechanism must have 
high stiffness ratio since it directly affects the robustness of the compliant manip- 
ulator. This section highlights the several existing and new design concepts that can 
be used to synthesize the compliant mechanisms. 


Exact Constraint Design 

Exact constraint design approach excludes the principles of kinematics to synthe- 
size a compliant mechanism. Its basic objective is to achieve desired degrees of 
motions or no motion through applying minimum number of constraints to a rigid 
body. Based on the principles of kinematics, an unconstraint rigid body has six 
degrees of motion, i.e., three translational and three rotational motions. Based on 
the principles of exact constraint design, a nonrigid body may have one or more 
degrees of “flexibility” which serves as additional degrees of motions. Considering 
a box with lid as an example, it has six degrees of motions when the lid is 
on. Without the lid, the open box will have seven or even eight degrees of motions 
due to the additional torsional motions. A total of twelve fundamental ideas are 
associated with this design approach and can be found in the past literatures 
(Blanding 1992; Hale 1999; Slocum 1992). Most of these fundamental ideas are 
summarized in the matrix table shown in Fig. 3 1 illustrating how constrains can be 
added to achieve desired degrees of motions. 

From Fig. 31, most of those fundamental ideas involve with orthogonal con- 
straints. However, there are some that use non- orthogonal constraint. One of these 
fundamental ideas states that (Blanding 1992). 

A constraint applied to a body removes that rotational degree of freedom about which it 

exerts a moment. 

When a set of non-orthogonal constraints are used to constrain a translation 
motion as shown in Fig. 32a, it creates an instantaneous center (I.C.) of rotation at 
the center of the circle. By adding another constraint that reacts with a moment 
about the center, the rotation about the center of the circle is constrained. However, 
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+- Translation axis Constrain Rotation axis 

Fig. 31 A matrix table of exact constraint design approach 


a 



b 


A 


C 


A 



\ 


\ 

\ 


Fig. 32 Examples of constraints been applied at the instantaneous center of a set of constraints to 
prevent rotation motion 


the circle may not be perfectly constrained because the length of the additional 
constraint arm is not equal to the other non-orthogonal constraints. A perfect 
example that reflects this fundamental idea is shown in Fig. 32b where all constraint 
arms are equal in length and symmetrical. In some conditions, nonsymmetrical 
arrangement is acceptable, but all constraint arms must remain equal as shown in 
Fig. 32c. This fundamental idea is useful for synthesizing compliant mechanism 
with constrained in-plane motion. 
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Fig. 33 (a) A leaf-spring flexure joint represented by two vertical constraints and one diagonal 
constraint, (b) This set of constraints is used to constraint a rigid body that eventually leads to (c) 
a parallel spring mechanism 


Another two fundamental ideas are related to flexible members (Blanding 1992). 

An Ideal Sheet Flexure imposes absolutely rigid constraint in its own plane (8 X , S z , and 6 y ), 
but it allows three degrees of freedom: Y, 6 X , and 6 Z . 

An Ideal Wire Flexure imposes absolutely rigid constraint along its axial ( 5 X ), but it 
allows five degrees of freedom: 8 y , 8 Z , 0 X , 6 y , 6 Z . 

The fundamental idea of an Ideal Sheet Flexure that offers 3 -DOF out-of-plane 
motions is shown in Fig. 33a. Using a leaf spring as an example, the in-plane motion 
is constrained and can be represented by two vertical constraints and one diagonal 
constraint. When one set of constraints is used to support a rigid body, it permits 
3-DOF motions, i.e., <5^, 0 X , and 6 Z , as shown in Fig. 33b. When another similar set 
of constraints are added at the opposite end of the rigid body, the only degree of 
motion left is along the y-axis, S y , as shown in Fig. 33c. As each set of constraints 
represents a leaf spring, having two in parallel forms a parallel linear spring 
mechanism. 

This fundamental idea was also applied in synthesizing the spatial compliant 
joint module shown in Fig. 27a. The design concept of this spatial compliant joint 
module is also governed by the following fundamental idea of exact constraint 
design (Blanding 1992). 

When parts are connected in series (cascaded), add the degrees of freedom. When the 
connections occur in parallel, add constraints. 

Referring to Fig. 27b, two parallel beam-based flexure joints, which forms one 
segment, increase the deflection stiffness of the spatial compliant joint module. Yet, 
connecting two segments in series and orthogonal arrangement brings additional 
degrees of motions. Alternatively, the fundamental idea of an Ideal Wire Flexure 
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that offers 5 -DOF motions is also useful for designing a spatial compliant joint. In 
fact, classical spatial compliant joints were mainly realized through wire flexures. 
The last but most important fundamental idea of the exact constraint design states 
that (Blanding 1992) 

A constraint £ properly applied to a body (i.e., without overconstraint) has the effect of 
removing one of the body’s rotational degrees of freedom (93 ’s). The 93 removed is the one 
about which the constraint exerts a moment. A body constrained by n constraints will have 
6 — n rotational degrees of freedom, each positioned such that no constraint exerts a 
moment about it. In other words, each 93 will intersect all £’s. 

This fundamental idea provides the definition of a mechanism being 
overconstrained or under- con strained. An extension of this idea was to generalize 
it to nonrigid bodies (Hale 1999). By adding number of DOF with due to the DOF 
of flexure joints, the mobility equation is rewritten as dof =6 + / — £. Hence, £ 
must be sufficient to achieve the desired DOF. In addition, it is also important that 
there must be no redundant £. If removal of a £ does not affect the DOF, the 
remaining £ stays in the mechanism. Lastly, the mechanism is exactly constrained 
if the removal of any single £ increases the DOF by one. With this final but most 
crucial fundamental idea, this section wraps up the review of exact constraint 
design approach. 


Parallel-Kinematic Architecture 

Parallel-kinematic architecture plays an important role in the success of the com- 
pliant manipulator due to its advantages of a lower inertia, programmable centers of 
rotations, superior dynamic behavior, higher stiffness, and less sensitive to external 
disturbances as compared to its serial counterpart. In addition, the limited deflection 
of the flexure joints suits the limited motion range of the parallel-kinematic 
architecture. Therefore, the parallel -kinematic architectures are commonly used 
to synthesize the mechanisms within the compliant manipulators. In general, a 
parallel mechanism is made up of an end effector (located at the center of the 
moving platform) with n degrees of motions and of a fixed base that are both linked 
together by at least two independent parallel -kinematic chains (Merlet 2000). Each 
kinematic chain is articulated by a set of elementary kinematic joints that are 
connected in series. Such kinematic joints include the 1-DOF revolute (R) joint, 
the 1-DOF prismatic (P) joint, the 2-DOF universal (U) joint, and the 3-DOF 
spherical (S) joint. Parallel mechanism can be classified into two categories: planar 
and spatial motion. A planar motion parallel mechanism can provide up to 3-DOF 
of in-plane motions, while a planar motion parallel mechanism can deliver out-of- 
plane motions. 

Figure 34a illustrates a 3-DOF planar motion parallel mechanism that can only 
deliver two de-coupled X and Y translational motions and a coupled 6 Z rotational 
motion. This parallel mechanism consists of three kinematic chains. Starting from 
the fixed base to the moving platform, each kinematic chain is formed by one 
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Fig. 34 Different parallel-kinematic architectures 

P joint, another P joint orthogonal to the first, and one R joint. This is also termed as 
the 3-legged prismatic -prismatic -re volute (3PPR) parallel-kinematic architecture. 
To form a parallel manipulator, one of the P joints must be actively controlled, 
while the remaining P and R joints become the passive joints. To prevent high 
inertial and moving mass, it is always ideal to fix the active P joints on the base. 
Figure 34b illustrates a spatial motion parallel mechanism that produces 3 -DOF 
out-of-plane motions, i.e., Z, 6 X , 6 y . This parallel mechanism is formed by a 3RPS 
parallel -kinematic architecture and has an active P joint in each kinematic chain. A 
6-DOF spatial parallel mechanism is illustrated in 34c. This parallel mechanism is 
formed by a 6UPS parallel -kinematic architecture with an active P joint in each 
kinematic chain. This parallel-kinematic architecture, which offers 6-DOF motions, 
is also known as the Steward platform. A compliant mechanism that uses this 
architecture was shown in Fig. 9a of section “Brief History.” In fact, section 
“Brief History” has introduced many compliant manipulators that are synthesized 
from different types of parallel-kinematic architectures. 

With many variations of parallel-kinematic architectures, a systematic design 
methodology, which could identify the right architectures and conversion of kine- 
matic chain to compliant limbs, is essential. This methodology is shown in Fig. 35 
where one can easily follow to convert any linkage-joint parallel mechanism into a 
parallel compliant mechanism. Termed as a task-oriented design approach, the first 
step (SI) is to identify the task or application, desired DOF, and workspace. These 
data become the design criterions, which are used in the second step (S2), to 
synthesize the right-type parallel-kinematic architecture, which is suitable to 
deliver the targeted task specifications. Upon selection of suitable architecture, 
the third step (S3) is to perform kinematic analysis, i.e., forward and reverse 
kinematic analyses. The forward kinematic analysis works out the position and 
orientation of the end effector (task space) based on given actuator displacements 
(joint space). Hence, it is used to determine the required displacements from the 
actuators and the compliant joint modules to achieve the targeted workspace. The 
inverse kinematic analysis provides the actuator displacements based on given 
position and orientation of the end-effector. This analysis is a crucial analytical 
model when task-space control implementation is required. 
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SI: Task7Dcsign specifications 
(c.g. DOF, worksapce, etc) 


S2: Mechanism synthesis based on 
conventional rigid links and joints 


S3: Kinematic analysis of the selected 


mechanism 


S4: Convert conventional joints to flexure 
joints: Design through analytical modeling 


S5: Perform design optimization 
(e.g. FE analysis, stiffness modeling etc) 




L i n ka ge-j o i n t pa ra llcl-kinc mat i c 

arch itectu rc 




Parallel compliant mechanism 


Fig. 35 Task-oriented design approach; a systematic methodology to convert any linkage-joint 
parallel mechanism into a compliant-based parallel mechanism 


The forth step (S4) is to convert all the rigid-body kinematic joints into the 
flexure joints or joint modules. Here, a compliant joint module represents a specific 
conventional kinematic joint, e.g., the revolute joint, the prismatic joint, the spher- 
ical joint etc., which can be formed by a single flexure joint, a series or a group of 
flexure joints. Next, the parametric analyses will be conducted based on the forward 
or inverse kinematic solution to provide the estimated displacements required from 
each compliant joint module. Subsequently, the design of each module is conducted 
through analytical modeling of the required stiffness within those estimated dis- 
placements and material’s yield strength. The last step (S5) is to perform design 
optimization of the complete compliant mechanism through numerical simulation 
using finite-element modeling (FEM) platform, i.e., ANSYS, and analytical stiff- 
ness modeling. S5 evaluates the achievable workspace, the stress concentration of 
the flexure joints, and the natural frequency of the FPM. An iterative process 
between S4 and S5 is necessary should any of those parameters fall out of the 
desired specifications. Using this design approach, a compliant mechanism is using 
the parallel -kinematic architecture systematically. 

Example. Consider the design of a 3 -DOF 0 X -0 V -Z spatial motion compliant 
mechanism as an example to apply the task-oriented design approach. This mech- 
anism with active actuation and control formed a compliant manipulator targeted to 
automate the imprinting and out-of-plane alignment tasks within an Ultraviolet 
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Fig. 36 (a) A 3PPS parallel-kinematic architecture synthesized from S2. (b) Projection of a 
rotation vector to represent the 6 x -6 y -Z motion for kinematic and workspace analyses in S3 

Nanoimprint Lithography (UV-NIL) process. The targeted workspace was 5° x 5° 
x 5 mm, while the targeted imprinting force was ~200 N. 

Type synthesis, which identifies an ideal parallel-kinematic architecture based 
on the task requirements, was conducted in S2. From past literatures (Merlet 2000; 
Tsai 1999), four possible architectures were identified, i.e., 3RRS, 3PRS, 3RPS, and 
3PPS. To design a high-precision mechanism, P-joint, which provides stiffer and 
higher precision guide ways, is always preferred over the rotation counterpart. 
Considering the requirements of having high imprinting force and the contact 
task, 3PPS parallel-kinematic architecture was selected as shown in Fig. 36a. To 
reduce the moving mass and inertia, the active P-joint was placed nearest to the base 
platform (note: the underlined P represents active P-joint). To effectively translate 
the output force of the linear actuators into the desired imprinting force, the active 
P-joint in each leg was placed vertically, while the passive P-joint was placed 
horizontality with its axis of motion always pointing towards the center of the 
equilateral triangular base. 
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The forward and inverse kinematic models were derived in S3. Forward kine- 
matic modeling is used to determine the moving platform pose based on the known 
active P-joint displacements and the solutions are given as (Yang et al. 2011) 




~-\/3(2zi - z 2 - £ 3 ) 

3 a 



Zi + z 2 + z 3 

3 


(78) 

(79) 

(80) 


where a represents the edge length of the equilateral triangular moving platform 
while z 1? z 2 , and z 3 represent the displacement of the active P-joints. Referring to 
Fig 36b, these forward kinematic solutions were derived based on the notation that 
the 3-DOF motions can be represented by a rotation vector, co, with Z = (0,0,1) and 
Z' = (e x , e y , e z ) being the unit directional vectors of the original and final Z axis of 
the moving platform frame, respectively. Hence, the projections of Z! onto the x- 
and y-axes, i.e., e x and e y9 are employed to uniquely define Z! instead of using two 
rotation angles 0 and <p. These kinematic solutions were used to determine the 
design parameters of the mechanism based on the desired workspace. These 
parameters include the dimensions of the moving platform, the desired translation, 
and rotation displacement from each joint. To obtain a vertical displacement of 
5 mm, a long-stroke flexure-based electromagnetic linear actuator (FELA) was 
employed (Teo et al. 2008) as the active P-joints. Using the kinematic solutions and 
size constraint of the FELA, a was chosen to be 1 12.29 mm. Based on the selected 
design parameters, results obtained from the workspace analysis suggested that the 
maximum orientation and translation displacements of the moving platform were 
5.1° and 5 mm, respectively (Fig. 36). 

Subsequently, an initial conceptual design of a 3PPS parallel compliant mech- 
anism was proposed in S4. It was articulated by three symmetrical compliant limbs 
where each limb consisted of a series of compliant joint modules that mirrored the 
PPS kinematic chain as illustrated in Fig. 37. The stiffness analysis of each 
compliant joint module was conducted via the semi-analytic modeling approach 
(see section “Semi-Analytic Model”) to determine the geometries of the flexure 
joints. This stage of design was conducted using the analytical modeling 
approaches because it is a tedious and iterative process. Hence, using finite-element 
(FE) simulation is time consuming and computational intensive. 

The FE simulation became useful in S5 as it was used to conduct final validation 
on the workspace of the proposed parallel compliant mechanism and the maximum 
stresses within the flexure joints. Figure 38a shows that results from the FE analysis 
suggested that the passive P-joint compliant module and a segment of the S -joint 
compliant module were redundant because there was no stress within the flexure 
joints. Returning to S4, a simple 5-DOF spatial compliant joint module was 
proposed to replace the passive S- and P-joint compliant modules. Reduction of 
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Fig. 38 (a) FE analysis conducted on the initial proposed mechanism, (b) Final prototype with 
simplified compliant limbs 


the compliant modules also simplified the entire mechanism design and increased 
the off-axis stiffness. Finally, the improved version was evaluated through the FE 
simulation before the actual prototype was developed as shown in Fig. 38b. 

This example provides an overview on how the task-oriented design approach 
can be used to design a parallel compliant mechanism. For S2 and S3, most of the 
parallel -kinematic architectures can be found in many sources and past literatures 
(Merlet 2000; Tsai 1999). Hence, finding a suitable architecture will not be difficult. 
The flexure and stiffness modeling methods, which are presented in section “The- 
oretical Modeling Approaches,” are sufficient for executing S4 and S5 (except the 
FE analysis). Therefore, information presented in this section will be useful for 
designing a parallel compliant mechanism systematically. 


Topological Optimization 

A compliant mechanism can also be treated as a continuum structure with either 
distributed or lumped compliance to deliver specific DOF motion. For example, a 
compliant gripper shown in Fig. 39a was designed based on a four-bar linkage 
architecture with a slider. On the other hand, topology optimization can also 
synthesize a continuum structure that not only delivers the same function but 
with better stiffness characteristic as shown in Fig. 39b. 

In general, a topological optimization approach is a mathematical approach of 
finding the optimal way of distributing material within a predefined design domain 
based on a set of loads, fixed supports, and boundary conditions such as the 
performance specifications and task requirements. To conduct a topological opti- 
mization, a design domain formed by either discrete number of finite elements or 
trusses must be defined. Using finite-element design domain is considered as 
the homogenization approach (Bendsoe and Kikuchi 1988), while those with 
truss structures are termed as the ground structure approach (truss sizing) 
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Fig. 39 (a) A compliant gripper obtained from the kinematic-based design approach versus (b) 
a new gripper concept generated via the topology optimization approach 


(Rozvany 1976; Bendsoe et al. 1994). Figure 39b shows a finite-element design 
domain that was used to synthesize an optimized continuum structure. With this 
design domain, loads, fixed supports, and output point (motion) were allocated 
around it. Based on an objective function and some boundary conditions, topolog- 
ical optimization was conducted via an optimization algorithm to determine the 
state of each element, i.e., either solid or void. From Fig. 39b, those elements in 
black are solid, while those in white are void. The solid elements form the 
optimized structure topology that meets the objective function. A shape optimiza- 
tion process further removed the unwanted materials and smoothen the edges to 
form a complete continuum structure. Lastly, a symmetrical pair of optimized 
continuum structure will function like a compliant gripper. 

Topological optimization approach uses optimization algorithms such as the 
Generic Algorithm (GA) (Chapman and Jakiela 1996), Solid Isotropic Material 
with Penalization (SIMP) (Bendsoe 1989; Bendsoe and Sigmund 1999), Evolution- 
ary Structural Optimization (ESO) (Xie and Steven 1993; Chu et al. 1997), and 
Optimality Criteria (OC) (Rozvany 1995). Lately, new topological optimization 
approaches such as the level-set method (Wang et al. 2003b), the morphological 
method (Tai and Akhtar 2005), and the mechanism-based seeding approach (Teo 
et al. 2013) were also introduced. Over the past 30 years, these research efforts and 
findings have demonstrated that the topological optimization approach is another 
effective concept of designing the compliant mechanisms. 

Using the advantages of modem topology optimization approach, an integrated 
design approach for synthesizing an optimized parallel compliant mechanism is 
introduced in this section. This design approach is a systematic design methodology 
that integrates both classical mechanism theory and modem topology optimization 
approach. Referring to Fig. 40, the first step is to understand the design specifica- 
tions, e.g., the desired DOF, workspace, and size constraints. Next, appropriate 
parallel -kinematic architecture will be selected and general kinematic analyses will 
be conducted. At sub-chain level, topology optimization will be used to determine 
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Fig. 40 A systematic 
integrated design approach 
for synthesizing optimized 
parallel compliant 
mechanisms 
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the optimized topology of the flexure joint or limb. With these optimized topologies 
being generated, the compliant matrix of each flexure joint or limb will be used to 
determine the overall stiffness of the compliant mechanism via the classical mech- 
anism theory. At configuration level, the overall stiffness of compliant mechanism 
will be optimized based on the desired workspace and size constraints. Subse- 
quently, an optimized parallel compliant mechanism that meets all desired speci- 
fications will be generated. 

Example. Consider the design of a 3 -DOF X-Y-O z planar motion compliant mech- 
anism as an example to apply the integrated design approach. This compliant 
mechanism targeted a workspace of 4 mm x 2° x 2° and a footprint of 300 mm . 

To achieve an X-Y-6 Z motion, 3RRR, 3PRR, and 3PPR (Yang et al. 2008) are 
possible parallel-kinematic architectures. In this example, 3PPR was chosen 
because the compliant P-joints are more deterministic than the compliant R joints. 
The schematic of 3PPR is shown in Fig. 41 where the moving platform is connected 
to the fixed base by three identical parallel-kinematic chains. Each kinematic chain 
consists of an active P-joint and a passive RP-joint that are connected in series. By 
treating each joint as a spring, the stiffness modeling approach from classical 
mechanism theory (see section “Stiffness Modeling”) can be used to determine 
the overall stiffness of the end effector based on the compliant joints modules. 

After the mechanism synthesis and kinematic analysis, a new topological opti- 
mization approach was used to synthesize individual compliant joint module. 
Termed as the mechanism-based approach (Lum et al. 2013), elementary linkage 
mechanisms were used as basic genes for the joint optimization. For example, a 
generic 4-bar linkage mechanism was used to synthesize the 1-DOF active P-joint 
and a generic 5 -bar linkage mechanism was used to synthesize the 2-DOF passive 
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Fig. 41 Schematic representation of 3PPR parallel-kinematic architecture and the stiffness 
modeling from classical mechanism theory 
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Fig. 42 (a) Adding cubic and harmonic curves to create material for each link, (b) Design 
variables of the mechanism and (c) for each link 


PR-joint. Next, the cubic and harmonic curves were added to each link of the 
mechanism to create material as shown in Fig. 42a. Elements that fell within 
the boundary of the original and the reflected curves became solid. Subsequently, 
the optimization varied the distribution of the material by changing the design 
variables of mechanism (Fig. 42b) and individual link (Fig. 42c) until the objective 
functions were met. 

For an optimized P-joint, Cl 1, which represents the compliance along the x-axis 
due to a translation force along the same axis, needs to be as high as possible, while 
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Fig. 43 Concurrent evolution of topology and structure for both P- and PR-joints 


the remaining components of the compliance matrix need to be as low as 
possible. Hence, the objective function for the P-joint optimization was 
formulated as 


min 



n t 2 n; 


=1 


C 


C 


20 

11 



For an optimized PR-joint, both Cl 1 and C66, which represents the compliance 
about the z-axis due to a moment about the same axis, need to be as high as possible, 
while the remaining components of the compliance matrix need to be as low as 
possible. Thus, the objective function PR-joint optimization was 


( ny 2 n' cfA 

mm F(x) = (82) 

\ C 11 C 66 / 

In this example, GA was used as the optimization algorithm. The evolutions 
from the basic genes (linkage mechanisms) to optimal joint designs in both topol- 
ogy and structural forms are shown in Fig. 43. Subsequently, the stiffness matrix of 
the optimized topology of each joint was used to determine the stiffness matrix of 
the end effector based the stiffness modeling approach (see section “Stiffness 

Modeling”). 

At configuration level, stiffness optimization was conducted to optimize the end 
effector based on the workspace and size constraints. As mentioned in the begin- 
ning of this section, the most important design criteria of a compliant mechanism is 
to maximize the stiffness ratio, i.e., between the off-axis stiffness and 
natural stiffness. Hence, the objective function of the stiffness optimization was 
formulated as 


max 


/ ^ 33 ^ 44^55 

Ul 1^22^66 



where K 2 2 represents the stiffness along the y-axis due to a translation force along 
the same axis while K 33 , K 44 , and K 55 are the off-axis components. 
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Fig. 44 3PPR parallel compliant mechanisms articulated by compliant joints with (a) optimized 
topologies versus (b) conventional topologies 


Another 3PPS parallel compliant mechanism shown in Fig. 44a, which was 
articulated by flexure joints with traditional topologies, was used to evaluate the 
stiffness characteristic of the optimized design. Termed as the conventional design 
(Fig. 44b), its PR flexure joint was formed by a cantilever beam with both ends 
being fixed to the translation portion of the P-joint, which was formed by a 
conventional parallel linear spring configuration. The compliance matrix of the 
optimized design, C°e\ and the conventional design, Cee X \ was obtained as 
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Subsequently, the ratio between Eqs. 84 and 85 is represented in a diagonal 
matrix form, which is 


£Opt 

R c =— ^ = diag[1.91 1.91 0.57 0.75 0.75 



From Eq. 86, results show that the compliance along the x- and y-axis of the 
optimized design is almost twice as compared to the conventional design. In 
addition, the off-axis stiffness is higher for the optimized design. Hence, this 
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Fig. 45 Prototype of the 
3PPR parallel compliant 
manipulator that is articulated 
by flexure joints with 
optimized topologies and 
driven by three voice-coil 
actuators 
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section shows that the integrated design approach is an effective design methodol- 
ogy for using topological optimization approach to synthesize an optimized parallel 
compliant mechanism with high stiffness ratio characteristic. Finally, a prototype 
was developed from this approach. Each of the active P-joint is driven by a voice- 
coil actuator and its position is measured via a linear optical encoder. Based on a 
PID controller, it achieved a high positioning resolution of 50 nm over a workspace 

of 5 mm 2 x 5° (Fig. 45). 


Actuation and Sensing 

Compliant manipulators are high-precision mechatronic systems that consist of the 
compliant mechanisms, high-resolution positioning actuators and sensors, and good 
control schemes. With the main part of this chapter focusing on the strength, 
limitations, modeling, and design methodologies of the compliant mechanisms, 
this section reviews the state-of-the-art actuators and sensors that can be used to 
achieve high-precision manipulation where the advantages, limitations, and poten- 
tial performance trade-offs for each kind of actuator or sensor will be discussed. 


High-Resolution Positioning Actuators 

An actuator with high positioning resolution is an essential subsystem of the com- 
pliant manipulator that could ultimately decide the traveling range, output force, 
stiffness, and even size or footprint of the manipulator. These actuators can be 
classified into two categories, i.e., the solid state and the field based. The solid-state 



2282 


T.J. Teo et al. 



Fig. 46 (a) Stacked PZT actuators from PI (Physik Instrumente 2014). (b) A 1-DOF PZT-driven 
compliant stage with an amplification level and preloaded springs and its schematic representation 
(Ho et al. 2004) 

actuators are transducers that convert the electrical energy into the mechanical energy 
via strain in the materials. Piezoelectric (PZT) actuator, shape-memory alloy (SMA), 
and the thermal actuator are the solid-state actuators, which are commonly used by 
the compliant manipulators. On the other hand, the field-based actuators are trans- 
ducers that convert the electrical energy into the mechanical energy via the presence 
of fields. Field-based actuators that are commonly found in the compliant manipula- 
tors include the electrostatic and the electromagnetic actuators. 

Piezoelectric Actuators 

Most conventional compliant manipulators are driven by the PZT actuators due 
to the nanometric resolution and large actuating force characteristics (Mamin 
et al. 1985; Fite and Goldfarb 1999; Dong et al. 2000; Kimball et al. 2000; Sun 
et al. 2002; Zhang and Zhu 1997; Tan et al. 2001). Such an actuator is made up 
of ceramics that convert an applied voltage or charge into a mechanical dis- 
placement directly through the physical elongation of the material. This small- 

i o 

dimensional change leads to extremely small displacement of up to 1 x 10 
m. Such positioning resolution can be adjusted since the dimensional changes 
are proportional to the applied voltage (Ouyang et al. 2008). PZT actuators can 
operate at extremely high bandwidth of up to few hundreds kilo-Hertz (kHz). 
They can also operate up to millions of cycle without deterioration. Due to the 
nature of producing mechanical displacements through dimensional changes, 
PZT actuators can produce large force of up to few kilo-Newton (kN) and no 
wear-and-tear issue and can operate in vacuum and clear room environment. 
Figure 46a shows some commercial stacked PZT actuators developed by Physik 
Instrumente (PI). 

The major disadvantages of PZT actuators include the small and nonlinear 
displacement characteristics. The nonlinearity of PZT actuator is contributed by 
the hysteresis and creep behavior of the ceramic materials during the dimensional 
change. As a result, the forward and backward paths of a cyclic PZT actuator are 
different and require mechanical means or advanced control schemes to minimize 
such nonlinearities. One approach is to preload the moving stage with springs in the 
direction that opposes the PZT actuation as shown in Fig. 46b. However, preloading 
with springs can only linearize a portion of the stroke. Hence, the maximum 
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displacement of the stage is limited to 250 pm (Ho et al. 2004). Another approach is 
to model the hysteresis and creep of the PZT actuation and linearize the displace- 
ment through advanced control schemes (Liu et al. 2013). The biggest limitation is 
the limited displacement of a PZT actuator. By stacking the PZT ceramic disks, the 
maximum achievable stroke is a few hundred micrometers. In addition, this 
stacking approach generates accumulative errors at the end of the stack and 
increases the stress within each PZT ceramic disks. Most importantly, this approach 
increases the internal resistance which in turn increases the applied voltage require- 
ments, e.g., >200 VDC. 

Shape-Memory Alloy 

SMA material has a unique memory (known as the “shape memory effect”) of its 
pre -deformed shape. At low temperature, the SMA returns back to its original shape 
when the temperature increases. This behavior makes SMA as a form of solid-state 
actuation. At high temperature, an applied force can cause a large deformation, 
which can also easily recover by releasing the applied force. This effect is termed as 
the “superelasticity.” The use of SMA on the compliant manipulators is unique too. 
Instead of using conventional materials to develop the flexure joints, SMA is used 
as the flexure material creating interesting flexible members that double up as both 
the limbs and actuators of the compliant manipulators (Reynaerts et al. 1995; 
Hesselbach et al. 1997; Bellouard and Clavel 2004). This is because the SMA 
flexure joint exhibits higher flexibility as compared to the conventional flexure 
joints. Using SMA flexure joints reduced the amount of flexure joints, hence 
lowering the stiffness of the compliant manipulator in the driving direction. In 
addition, SMA can also be considered as a high damping metal, which is a favorable 
characteristic for spring-dominated systems, i.e., compliant mechanisms. 

The main limitation of SMA is its slow rate of cooling, which is usually limited 
to a few hertz. As a result, the difference between heating and cooling transition 
creates a hysteresis effect during the forward and reverse motions as shown in 
Fig. 47. The slow cooling rate also limits the bandwidth of any SMA-based 
compliant manipulator. Hence, these manipulators are only suitable for tasks or 
applications that require slow yet precise motion delivery. 

Thermal Actuators 

Thermal actuation is one of the popular schemes in driving the MEMS -based 
compliant manipulators (Chen and Culpepper 2006; Comtois and Bright 1996, 
1997; Qiu et al. 2003; Zhu et al. 2006). This is because the thermal actuators can 
be easily fabricated and integrated within such microscaled functional systems. The 
basic working principle of the thermal actuation is to generate a small amount of 
thermal expansion in a material through joule heating. Simultaneously, this small 
thermal expansion will be amplified to produce a deflection motion. Such amplifi- 
cations are realized through the bimorph (asymmetric) or chevron (symmetric) 
actuator design as shown in Fig. 48. 

The bimorph design is a cantilever beam formed by two equal length parallel 
segments been joined together. During operation, one segment will be heated while 
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Fig. 47 A monolithic SMA compliant linear spring and its force-displacement characteristic 
(Bellouard and Clavel 2004) 



Fig. 48 Illustrations of the (a) bimorph and (b) chevron thermal actuator designs 


the other remains cool. The temperature difference between two segments causes 
both segments to expand differently. As a result, a bending deflection occurs since 
both segments are fixed at one end. In addition, two or more thermal actuators can 
be connected together in parallel to enhance the force output or generate a linear 
motion (Comtois and Bright 1997). The chevron design can be articulated by one or 
more pairs of “V”-shaped cantilever beams. For each pair of beams, the applied 
current passes through from one beam to the other, thus causing them to expand, 
buckle, and create a linear motion. In general, the bimorph thermal actuator pro- 
vides a bending motion, while the chevron thermal actuator provides a translation 
motion. 

Figure 49a shows a 6-DOF compliant manipulator, //HexFlex, that is driven by 
three pairs of 2-axes bimorph thermal actuators to achieve a workspace of 8.4 x 
12.8 x 8.8 pm 3 and 1.1° x 1.0° x 1.9° (Chen and Culpepper 2006). Another 
MEMS -based compliant stage shown in Fig. 49b is driven by a single-axis chevron 
thermal actuator. When the actuator was tested without a specimen, a maximum 
displacement of approximately 800 nm was generated at a current of 15 mA (Zhu 
et al. 2006). The main advantage of the thermal actuation is having higher force 
generation as compared to the electrostatic actuation. However, it has lower 
bandwidth and requires good heat dissipation for continuous operations. 
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Fig. 49 (a) A 6-DOF microscale nanopositioner, ^HexFlex, driven by six 2-axes bimorph thermal 
actuators (Chen and Culpepper 2006). (b) A MEMS -based compliant stage driven by a chevron 
thermal actuator (Zhu et al. 2006) 


Electrostatic Actuators 

Electrostatic actuation is perhaps the most popular choice for driving the MEMS 
devices (Toshiyoshi and Fujita 1996; Rosa et al. 1998; Hung and Senturia 1999; 
Tsou et al. 2005; Chiou and Lin 2005; Borovic et al. 2006). Unlike the thermal 
actuation, it consumes very small amount of power and can operate in high 
bandwidth. Electrostatic actuator can also be easily fabricated and integrated as 
part of the MEMS devices. Governed by Coulomb’s law, the electrostatic actuation 
uses the attraction force between two point charges to generate displacement or 
exert force. Assuming that two surfaces are extremely close, the actuation force, F e , 
can be expressed as 


F e =\ V 2 £air (A) (87) 

where V represents the voltage and s air represents the permeability of air, while E = 
Aid where A is the area, d is the gap between two areas, and E is the electric field. 

Equation 87 suggests that the force generation is proportional to the surface area; 
the “comb” configuration is commonly used to enhance the force generation. 
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Fig. 50 (a) Schematic of an electrostatic force actuation, (b) An X-Y -0 Z MEMS-based 3PRR 
parallel compliant manipulator driven by electrostatic comb-drives (Deepkishore et. al 2008) 


A comb architecture consists of two sets of capacitor banks where each bank 
comprises of a parallel array of capacitors. Each capacitor has a pair of parallel 
surface. However, there can be many variations in the geometry of these comb- 
drives such as the typical parallel configuration shown in Fig. 50b, the spiral (Tang 
et al. 1989) and the T-bar configurations (Brennen et al. 1990). Figure 50a shows an 
example of an electrostatic -driven MEMS-based X-Y -6 Z micro-/nanopositioning 
stage articulated by a parallel-kinematic architecture. Each electrostatic achieved a 
travel of 27 pm at 85 V, while the overall workspace of the end effector is 18 pnT x 
1.72° when all three actuators are energized. 

Electromagnetic Actuators 

Electromagnetic (EM) actuation is a driving scheme that offers noncontact, fric- 
tionless, and long travel characteristics. Over the past two decades, compliant 
manipulators driven by EM actuation have achieved millimeters of displacement 
and subnanometer positioning resolutions with high accelerations and speed 
responses. To realize an EM-driving actuation, two types of techniques can be 
employed, i.e., the solenoid actuation and the Lorentz-force actuation. 


Solenoid Actuation 

Solenoid actuation is an EM technique based on the attraction of a ferromagnetic 
moving part. Based on Fig. 51a, the fixed stator generates an attraction force to 
propel the ferromagnetic moving part towards it. This force is expressed as 



n o 

where fi is the magnetic permeability of the air, An x 10 (N/A ), N represents the 
number of turns, I represents the applied current, A represents the area, and 
g represents the air gap between the stator and the moving part. The main advantage 
of this technique is that it generates higher force as compared to the Lorentz-force 
actuation. However, it exhibits nonlinear characteristic due to the square term in g. 
Hence, the force increases significantly as the gap reduces. 
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Fig. 51 (a) A schematic representation of an EM solenoid actuation and (b) an EM-driven 3 -DOF 
planar motion compliant manipulator and its schematic plot of all three motion (Chen et al. 2002) 


Figure 51b shows an example of an X-Y-#z compliant manipulator driven by the 
solenoid actuation. Three pairs of solenoid actuators are used to produce 3 -DOF 
decoupled planar motion with a minimum positioning resolution of 50 nm over a 
workspace of ±80 pnr x ±3.52 mrad. Each solenoid actuator generates a driving 
force of 50 N with an input current of 0.5 Amp at an air gap of 250 pm. This example 
also demonstrated that the solenoid actuation is not suitable for large traveling range 
and has a nonlinear force-displacement and current-force relationship. 

Lorentz-Force Actuation 

Lorentz-force actuation has a contrast characteristic as compared to the solenoid 
actuation. This EM technique offers a linear current-force relationship and can be 
configured to deliver large traveling range with linear force-displacement relation- 
ship. Governed by Lorentz’s law, the force generation is expressed as 

F = BILN (89) 

where B represents the magnetic flux density from the permanent magnet (PM), 
I represents the applied current, N represents the number of coil turns within the 
effective air, and L represents the coil length per turn. 
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Fig. 52 (a) Conventional VC actuators and other variations of VC actuators from (b) H2W 
Technologies (H2W Technologies 2014) and (c) BEI Technologies (BEI Kimco Magnetics 2014) 


Commercially available Lorentz-force actuator shown in Fig. 52a is also known 
as the voice-coil (VC) actuator. With the VC actuator, the effective air gap 
represents the region with the presence of magnetic flux density emanates from a 
permanent magnetic source, i.e., PM. In this air gap, the magnetic flux density 
should be well distributed. Due to the low permeability of air, the magnitude of the 
magnetic flux density is usually lower than the remanence magnetic flux density of 
a PM. The best approach to overcome this limitation is to increase the size of a PM 
while maintaining a small effective air gap. Examples can be found in the VC 
actuators from H2W Technologies (Stroman 2006) show in Fig. 52b and BEI 
Technologies (Speich and Goldfarb 2000) shown in Fig. 52c. These magnetic 
circuits increase the magnetic flux density through larger PMs. Nevertheless, they 
still require very small effective air gaps because the magnitude and uniformity of 
the magnetic flux density decreases with respect to the increment in distance from 
the magnet-polarized surface. 

Due to its linearity and large displacement characteristics, there is a growing 
trend of using Lorentz-force actuation to drive the compliant manipulators (Teo 
et al. 2010a; Yang et al. 2008; Teo et al. 2013; Fukada and Nishimura 2007; Bacher 
2003; Helmer 2006; Culpepper and Anderson 2004). These multi-DOF compliant 
manipulators have demonstrated nanometric positioning resolution capability over 
a few millimeters and degree workspace. Figure 53a shows an example of a 3 -DOF 
parallel compliant manipulator driven by three VC actuators from BEI Technolo- 
gies. Using the VC actuators, it achieved a positioning resolution of 50 nm over a 
workspace of 5 mnr x 5°. Customized Lorentz-force actuators for large force 
generation were also well demonstrated by the 3 -DOF out-of-plane motion com- 
pliant manipulator shown in Fig. 53b. Using customized Lorentz-force actuators, it 
achieved 20 nm positioning resolution and 0.05 arcsec angular resolution over a 
workspace of 5 mm x 5° x 5°. With the customized actuators, the manipulator 
produces 150 N/Amp of output force. 


Performance Trade-Offs 

Electrostatic and thermal actuators are commonly used in the MEMS -based com- 
pliant manipulators. The trade-off of using the electrostatic actuators is poor force 
generation. Switching to the thermal actuator may provide larger force generation 
but at the expense of lower-frequency responses. For micro- to macroscale 
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Fig. 53 (a) A 3-DOF planar motion parallel compliant manipulator driven by commercial VC 
actuators and (b) a 3-DOF out-of-plane motion parallel compliant manipulator driven by custom- 
ized Lorentz-force actuators 


Table 8 Performances of various actuators that are suitable for driving the compliant manipulator 


Actuator 

Max. range 
(pm) 

Stiffness 

Linearity 

Accuracy 

(nm) 

Bandwidth 

Cost 

Piezoelectric 

-500 

High 

Poor 

<10 

High 

Medium 

Electrostatic 

-100 

^0 

Square 

law 

<1 

High 

Low 

Thermal 

-20 

High 

Poor 

<10 

Low 

Low 

SMA 

-10 

Medium 

Poor 

<100 

Low 

Medium 

Solenoid 

-300 

^0 

Poor 

<1 

Low 

Low 

Lorentz- 

force 3 

-50 

^0 

Good 

<1 

Medium 

Medium 


a Single-phase non-commutation 


compliant manipulators, the PZT actuators and SMA materials are the potential 
solutions for driving them. Both offer limited travel range, but PZT actuators can 
deliver very high driving force and bandwidth. On the other hand, SMA materials 
can be made into a monolithic compliant manipulator. It is an attractive solution for 
minimizing moving masses and stiffness in actuating direction. From macroscale 
onwards, EM actuation can be an alternate solution to the PZT actuators. Solenoid 
actuators may have the advantage of large force-to-size ratio, but the Lorentz-force 
actuator has linear characteristic and large traveling range. Although PZT actuators 
have limited stroke, high force generation and bandwidth are desirable character- 
istic for driving the compliant manipulators. Being a solid-state actuator, PZT can 
still provide additional non-actuation stiffness (even when power-off) to the com- 
pliant manipulators. This benefit can never be provided by the EM actuation. To 
summarize the review of high-resolution positioning actuators, the performance 
and trade-offs of each actuator are listed in Table 8. 
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High-Resolution Sensors 


A sensor is a transducer that converts one form of energy to another form, for example, 
a device that responds to or detects a physical quantity and transmits the resulting signal 
to a controller (Slocum 1992). Hence, sensors are also considered as the “eyes” or 
“ears” of a control system that controls a manipulator. Hence, it defines the achievable 
positioning resolution of a compliant manipulator even if the actuators have infinite 
positioning resolution. Between different sensing technologies, there are certain 
performance specifications that are crucial and are stated as follows (Slocum 1992): 


Resolution 

Accuracy 

Noise 

Linearity 

Frequency response 


The smallest detectable change to the physical quantity. 
An error in output causes by external disturbances such as the 
variations of temperature, humidity, and atmospheric pressure. 
The magnitude of the output which is not part of the 
change in the physical quantity. 

The percentage of variations in the constant of proportion- 
ality between the output signal and the measured physical 
quantity. 

The rate of change in output signal due to a change in 
physical quantity. 


In general, a sensor can be classified into two categories: the nonoptical and 
optical sensors. Optical sensors provide analog or digital signals that are 
corresponding to the physical quantity change by optical means, while the 
nonoptical sensors use other means of measurement. Such kind of sensors include 
capacitive sensor, hall effect sensor, inductive displacement sensor, variable dif- 
ferential sensor, inclinometers, magnetic scales, magnetostrictive sensor, and 
PZT-based sensor. Among these nonoptical sensors, the capacitive sensor is com- 
monly used in the compliant manipulators due to its high-resolution nature. For the 
same reason, the optical sensors, which include the optical encoders and interfer- 
ometric sensors, are the popular choices for the compliant manipulators. 


Capacitive Sensors 

A capacitive sensor determines the gap between a probe and a target by measuring 
the amount of capacitance formed between the two parallel surfaces, i.e., the face of 
the probe and the target, as shown in Fig. 54a. By applying voltage to one surface, 
an electric field will exist between the two surfaces. Electric field is the result of the 
difference between the electric charges that are stored on the surfaces. Hence, the 
capacitance is the “capacity,” which is formed between the two surfaces, to hold the 
charges. The effect of the gap, g, and the surface area, A , on the capacitance, C, 
between two parallel surfaces is expressed as 

C = — (90) 

8 

where e is the dielectric constant of the material in the gap. 


62 Compliant Manipulators 


2291 




Fig. 54 (a) Illustration of capacitance probe measurement, (b) A commercial capacitance probe 
from Lion Precision (Lion Precision 2014) and (c) its configuration 


Capacitive sensor offers a noncontact measurement with extremely high 

° Q 

resolution of ~25 A (2.5 x 10 ) and a typical accuracy of ~0. 10-0.20 % over 
the full-scale range. However, it can only provide a measuring range of up to 
±0.13 mm or subnanometer resolution and the frequency response is up to 20-40 
Hz. The measuring range can be increased but at the expense of losing the 
nanometric resolution. A unique characteristic of the capacitive sensor is the 
ability to detect a wide range of materials, e.g., metals, dielectric, and semiconduc- 
tors. The sensor output is only affected by different types of material surfaces 
but will not be affected with different contents with the same material. For example, 
a capacitive sensor calibrated over a stainless steel target can also be used to 
measure brass or aluminum target. Apart from the optical means, the capacitive 
field sensing is the only nonoptical means of providing subnanometer resolution 
measurement capability. Hence, it is widely used in high-precision compliant 
manipulators. 

There are a couple of stringent requirements when using the capacitive sensors. 
First, there will be stray capacitance that affects the accuracy of the measurement. 
However, the stray capacitance can be easily minimize by adding a guard around 
the sensing electrode and collimate the electric field lines between the sensor and 
the target as shown in Fig. 54c. The trade-off is that the size of the probe will 
increase due to the presence of the guard. Second, it is important to keep the 
surfaces of the sensor and the target parallel. Misalignment in parallelism of two 
surfaces will affect the accuracy of the measurement since the capacitance is 
proportional to the sensing area and gap between the probe and the target. 

For the same reason, the third requirement is to ensure the high ratio of the 
sensing area to the gap. Having high ratio, i.e., huge sensing area with very small 
gap, means greater accuracy and resolution. Other benefits include minimizing the 
effect of electromagnetic waves on the accuracy and providing an averaging effect 
to the output. The forth requirement is to minimize environmental disturbances 
because the dielectric constant can be affected by the temperature, barometric 
pressure, humidity, and media type. Other than maintaining the environment, a 
second probe can be used to measure a fixed object to record a reference signal that 
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Fig. 55 (a) Illustration of an optical measurement principle, (b) A commercial optical linear 
encoder from HEIDENHAIN Corporation (Heidenhain 2014) 


captures the noise or drift cause by the change in barometric pressure, humidity, and 
temperature. Subtracting the measurement signal with this reference signal concur- 
rently will produce the actual displacement signal. 

Optical Encoders 

Optical encoder operates based on the principle of counting the slits or windows 
within a scale through a light source and a photodiode as shown in Fig. 55a. Simply 
imagine a LED that shines light on a scale, which has an array of equally spaced 
windows. Behind the scale and directly opposite the LED is a photodiode. If the 
LED light transmits through these windows, the photodiode will receive the 
transmitted light and generate signals or pulses. On the other hand, no signal will 
be generated by the photodiode if the light is block by the material in-between two 
windows. This measuring scheme can be configured to linear or angular optical 
encoder that is immune to electric noise. Most optical encoders mainly produce 
digital output due to the working principle. 

The resolution of optical encoder is dependent to the types of scale gratings. For 
example, a scale that can pack more equally spaced slits (gratings) will have higher 
resolution. Based on the commercial available optical encoders from 
HEIDENHAIN Corporation, a scale with the absolute grating can deliver approx- 
imately 10- to 12-bits positioning resolution, a scale with the incremental grating 
can deliver 10- to 16-bits positioning resolution, and a diffraction grating can 
deliver up to 21 -bits positioning resolution. Like a ruler with equally spaced 
numbered markings, the absolute grating has a fixed and low resolution. For the 
incremental grating, the quadrature and interpolation methods are used to enhance 
the resolution of the encoder. For example, an incremental grating with 20 //m pitch 
can deliver 5 nm per encoder count resolution based on 4,000 steps interpolation. 
With such interpolation approach, the diffraction grating provides even higher 
positioning resolution due to its smaller pitch size. The scale also comes in different 
materials, i.e., stainless steel tape, glass scale, and zerodur scale. The glass scale has 

z 

a thermal expansion coefficient of 5.9 x 10 /K, while the zerodur material has a 
thermal expansion coefficient of 0 =b 0.007 x 10 _6 /K from 0 °C to 50 °C. The 
selection of the material depends on the applications and desired specifications. For 
high-precision positioning tasks, the zerodur scale may be the best option, but it is 
the most expensive among the rest. 
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Fig. 56 Illustration of (a) a 1-DOF measurement setup and (b) a 3 -DOF planar motion measure- 
ment setup using the laser interferometer sensor 


Laser Interferometer Sensors 

Laser interferometer sensor is the most accurate measurement system that can be 
applied to today’s mechatronics and robotic systems. Hence, it can be used as 
working standards for machine calibrations, measurement, and feedback control. 
Based on the typical size of a photon, the resolution of a laser interferometer sensor 
is up to 0. 15 nm. It also has a measuring range that is beyond 10 m and a measuring 
speed of up to 4.2 m/s. Most laser interferometer sensors use Helium-Neon as the 
laser source (wavelength, 2, = 633 nm). The accuracy of laser interferometer sensor 
is dependent on very stringent metrology conditions: 

1. Temperature stability of ±1 °C 

2. Humidity variation of less than 10 % 

3. Pressure variation of less than 0.25 mmHg 

4. Target mirror flatness of 2/10 PV 

Temperature and humidity can be actively controlled in an enclosed environ- 
ment, e.g., a clean room, while the mirror flatness can be achieved through high- 
precision polishing technology. As barometric (environmental) pressure is more 
difficult to control, the usual approach is to setup a wavelength tracker or a 
metrology station to monitor the variations in the laser. 

A laser interferometer sensor is a noncontact and relative displacement sensor. 
Hence, it will not give an absolute displacement value but rather a value relative 
to the previous value. In addition, it can only measure relative displacement 
change beyond a dead path, which represents the minimum length requirement 
between laser interferometer sensor and the target. Setting up the laser interfero- 
meter sensor requires optics components such as the beam splitters (BS) and 
retroreflectors to orientate the laser beam from the laser source to the moving 
stage. Figure 56a illustrates a setup to measure a 1-DOF moving stage via the 
laser interferometer sensor. A BS is used to split the beam into two paths where 
one path accounts for the fixed reference position via the fixed retroreflector, while 
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the other path accounts for the position of the moving stage. The displacement is 
given by the position of the moving stage relative to the fixed reference position. 
Figure 56b illustrates another setup that measures a 3 -DOF moving stage. After the 
first BS from the laser source, 50 % of the beam goes to another BS that further 
splits it into two beams (25 % each) where one goes to a wavelength tracker and the 
other measures the position along the x-axis. The other 50 % of the beam is 
also split into two beams (25 % each) that measure the position along the y-axis. 
All three 25 % beams also measure the angular change about the z-axis. The 
wavelength tracker is used to monitor the accuracy of the beam, which could easily 
be affected by changing metrology conditions such as the temperature, humidity, 
and pressure. 

Fiber optics can be used to deliver the laser beam directly to the remote units with 
built-in optics and detector. This technology minimizes the needs of using optic 
components and the setup time to orientate the beam from the laser source and 
can channel the beam from one laser source to multiple remote units. These units, 
termed the interferometric encoders, can be placed near the end effector for 
direct measurements. Figure 57a shows a pair of single-axis interferometric encoders 
from Renishaw that were used to measure the displacement of the end effector of 
a compliant manipulator. Using the fiber optics technology, the laser beam from a 
laser source was channeled into two separated interferometric encoders without 
any loss in the resolution, i.e., 10 nm per count. The transmitter, optics, and detector 
are all built inside each encoder to simplify the setup, while the end effector carried 
the metrology mirrors to reflect the transmitted laser beams back to individual 
encoder. Another example is the triple-beam interferometric encoder from SIOS 
shown in Fig. 57b. Fiber optics cable channels the beam from a laser source to a 
sensor head that provides three individual measuring beams. These beams are 
equally spaced in a predefined arrangement that is capable of measuring 3 -axes 
out-of-plane motion and provides a resolution of 1 nm per count from each beam. 
This encoder system also provides a metrology station that has a similar function as 
a wavelength tracker. 

Performance Trade-Offs 

Laser interferometer sensor offers the most accurate measurement in modern 
world and has the ability to provide a positioning resolution of up to 0.15 
nm. It is a noncontact and relative displacement sensor that is commonly used 
to feedback to position of the end effector rather than the joints of a manipulator. 
The main limitation of laser interferometer sensor is that its accuracy can 
be easily affected by changing metrology conditions. On the other hand, these 
changing metrology conditions have less effects on an optical encoder since the 
scale and encoder head are placed in close proximity. However, this arrangement 
limits the use of the optical encoders to single-axis or two-axes planar motion 
measurement of the joints rather than the end effector of a manipulator (except for 
1-DOF and 2-DOF translational motion stages). The measuring range is limited 
by the length of the scale, which typically range from 10 cm to 3 m. Although 
interpolation offers high resolution, the accuracy is always limited to 1 % of the 
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Fig. 57 Measurement setups using (a) two single-axis interferometric encoders from Renishaw 
and (b) a three-axes interferometric sensor from SIOS (SIOS 2014) 


grating pitch size. One method to overcome this limitation is to directly relate 
the change in analog signal with the change in displacement. Such calibrations 
are also commonly done in motion systems that use the analog capacitance 
sensors. Just like the laser interferometer sensor, a capacitance sensor also 
has very high positioning resolution. However, high resolution comes with the 
expense of small measuring range. Likewise, a capacitance sensor can be 
calibrated for larger measuring range but sacrifices on the resolution due to 
the effects of stray capacitance and the instability of the dielectric constant 
within a large air gap. Nevertheless, all these technologies are effective measur- 
ing means if properly setup via controlled environment and methodology 
conditions. 
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Future Advancement 

The first generation of compliant manipulators offered single-axis translational 
motion, while the second generation delivered multi-DOF planar or spatial motion 
through the aid of parallel -kinematic architectures. In both generations, the 
workspace of the compliant manipulators is limited by the notch-hinge flexure 
joints and PZT actuators. By exploring the EM actuation and more flexible beam- 
based flexure joints, the third generation of compliant manipulators has successfully 
broken through the millimeter-range barrier encountered by the previous genera- 
tions but limited to low payload positioning applications. More recent research 
work has introduced the latest generation of compliant manipulator, which offers 
high payload with large displacement capability, suitable for direct contact appli- 
cations such as the UV-NIL process (Teo et al. 2010a). The next generation of 
compliant manipulators is projected to be an integrated system fabricated through 
3D printing technology. Termed as the Flextronics, these single monolithic com- 
pliant manipulators will have actuators and sensors all printed and embedded into 
the compliant limbs or flexure joints. Synthesizing the Flextronics requires the 
advancement in the topological and dynamics optimization of compliant manipu- 
lators. Using the 3D printing technology, the structure arrangement of the compli- 
ant limbs or flexure joints will be isotopic rather than homogenous. Hence, new 
theoretical models and modeling approach will be needed to predict the stiffness 
characteristic of the Flextronics. The advancement in material science will be the 
key to the realization of Flextronics since this advanced technology requires new 
material that can be actuated and sense while providing necessary strength and 
stiffness to the overall structural integrity. Most importantly, the new engineered 
material must have high endurance limit and predictable fatigue life. Potential 
applications of Flextronics can be advanced surgical tools and miniature robotic 
systems or even providing macro- to nanoscale positioning tasks in vacuum and 
harsh environment. 


Summary 

Development of a compliant manipulator, which is articulated by the flexure joints, 
involves good understandings on several fundamental topics associated with this 
advanced robotic system. Material science plays an important role in developing a 
reliable compliant manipulator. Accurate identification of the fatigue life of those 
flexure joints is crucial as it varies according to the types of materials, surface 
finishing, geometries, and loadings. Material properties also affect the stress level 
that each flexure joint can sustain under loading, bending, and torsional bending. 
Although lower Young’s module properties will lessen bending stiffness and thus 
the stresses, the compliant manipulators may lose desirable stiffness in the 
non-actuating directions. Hence, the major challenge in designing a compliant 
manipulator is to fulfill its desired functions and deflection workspace while 
maintaining the stresses well below the yield strength of the material. 
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Solid mechanics provide important analytical tools for modeling the deflection 
stiffness of the flexure joints. Classical bending-moment-curvature equation 
brought several key formulations in predicting the stiffness and stresses of func- 
tional compliant mechanisms, while the large deflection theorem covers the 
nonlinear characteristics of the flexure joints. Advancement in the theoretical 
studies on large deflection characteristic gave birth to the pseudo-rigid-body 
approximation model, which also became the bridge between the classical rigid- 
body linkage mechanism and the compliant mechanism. This model was further 
enhanced by the semi-analytic approximation model, which provides a simple, 
quick, and generic solution for any form of flexure joint configuration. With these 
approximation models, the knowledge of rigid-body linkage mechanism, particu- 
larly the parallel-kinematic architectures, can be applied in synthesizing the com- 
pliant mechanism. Synthesizing of the compliant mechanism can also be conducted 
through the classical exact-constraint design approach and the state-of-the-art 
topological optimization techniques. The synthesized compliant mechanism has 
to be driven by actuators to form a compliant manipulator. The actuators will 
ultimately decide the traveling range, output force, stiffness, and even size or 
footprint of the manipulator. Yet, the high-resolution sensors will define the step 
resolution of the manipulator. With the relevant knowledge of each fundamental 
topic being covered in this chapter, it serves as a guide and reference for designing, 
analyzing, and developing a compliant manipulator. 
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Abstract 

This chapter gives an overview to the state-of-art technology of autonomous 
mobile robots and focuses more specifically on autonomous indoor vehicles 
(AIVs) for the purpose of being more relevant to the manufacturing and indus- 
trial automation applications. Among the various locomotion designs, this chap- 
ter only introduces wheeled AIVs as wheeled platforms are predominant in the 
current commercially available AIVs. Four key research areas of wheeled AIVs, 
(1) design and modeling, (2) motion control, (3) sensing, (4) navigation, are 
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reviewed in detail. The major AIV suppliers along with their key AIV products 
are then surveyed. The chapter ends with concluding remarks and a prediction of 
the trends of future AIV development. 


Introduction 


A mobile robot is an automatic machine that is capable of motion within a given 
environment. They are designed to move in this environment for different applica- 
tions like material handling, parts transferring, or security surveillance. This is a 
broad definition because there are a few ways in which they can be classified. They 
can be classified based on the environment that they travel in, the mode of motion 
they utilize, and the level of autonomy given to the mobile robot. Robotic research 
in mobile robots is quite prevalent, and almost every major university in the world 
has one or more labs working in this area. 

Here, mobile robots can be classified by the environment they travel in: 


• Land robots or sometimes referred to as unmanned ground vehicles (UGVs). 
They are usually either wheeled or tracked but also include legged robots that 
might resemble humanoids, insect, or animals. 

• Aerial robots are usually referred to as unmanned aerial vehicles (UAVs). 

• Underwater robots are usually called autonomous underwater vehicles (AUVs). 

If we focus on land robots, we can see that they can be further classified into their 
various modes of motion. This can be: 


• Legged robot 

• Wheeled robot 

• Track-based robot 

Among the types of land mobile robots, wheeled mobile robots are the most 
common due to its simple design, high reliability, and efficient manner to move 
from point to point. This will be the focus of this chapter. 

Wheeled mobile robots which are designed and deployed in indoor environ- 
ments with a high level of autonomy are called autonomous indoor vehicles (AIV). 
AIVs can be found in industry, military, and security applications. In addition, it 
can also be made as domestic robots for consumers. This would include entertain- 
ment robots and those that perform certain household tasks such as vacuuming or 
gardening. 

The autonomy level of a mobile robot may range from a simple remote control to 
fully autonomous with artificial intelligence built in. The following shows one way 
of categorizing them: 

• Manual tele-operated robots 

• Guarded tele-operated robots 
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• Automated guided vehicles(AGVs) 

• Autonomous randomized robots 

• Autonomous guided robots 


A manual tele-operated robot is under the control and guidance of a driver with 
some form of controller. This can be a joystick or some other control device. A tele- 
operated robot is meant to keep the driver out of harm’s way. One example that has 
been widely used is the iRobot’s PackBot. 

A guarded tele-operated robot, as the name suggests, allows tele-operation 
but with some safety mechanisms in place. This can be either alerting the 
operator of an impending collision with a nearby obstacle or the ability to 
avoid that obstacle. Otherwise, it will behave like a robot under manual tele- 


operation. One can think of a car equipped with a reversing sensor as a guarded 
tele-operated robot. It beeps when the car is approaching closer to an obstacle 
while backing up. 

AGVs can be guided by mechanical means or by some form of sensors. This 
guidance by mechanical means can be when the AGV is placed on a track and is 
constrained by that track. In other applications, a line may be placed on the ground 
which may be used to guide this vehicle. This line can be a visual line painted or 
part of the floor/ceiling or a wire in the floor. These robots would then operate on 
the strategy of trying to center the line that it is following. Since they are 
constrained to be on its path, if there is an obstacle, it would just stop until the 
obstacle has moved away. Many examples of such vehicles are still being marketed 
by Transbotics, FMC, Egemin, and HK Systems. 

Autonomous robots with random motion can detect obstacles in the moving 
direction. If obstacles are detected, robots will turn and move to another direction. 


They cannot follow a predefined path precisely to avoid an obstacle due to its 
limited autonomous capability. However, such robots are simple and cheap, suit- 
able for some applications like autonomous vacuum cleaner. 

An autonomously guided robot knows at least some information about where it 
is and how to reach its goals and/or to follow particular path along the way. It can 
localize himself using various sensors like motor encoders, camera vision, gyro- 
scope, laser range finders, and/or global positioning systems. With the knowledge 
of its own location, and/or with predefined way points, the robot can decide 
and plan its path toward its target. Often, such robots can communicate with each 
other or with other devices/systems in a building through wireless communication 
for more sophisticated tasks. For example, Swisslog robot could communicate 
with elevators so that the robot can take elevators and reach different floors of a 


building. 

Recently, due to advances in sensing and computing technology, the capability 
of autonomous motion is improved. The cost of achieving such level of autonomy 
has dropped down significantly. Therefore, autonomous indoor vehicles can carry 
out more tasks and are widely used in many applications. The growing interests on 
autonomous indoor vehicles have attracted many researchers, companies, and 
individuals to develop or deploy AIV. 
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The rest of this chapter is organized in sections as follows: section “Wheeled 
Mobile Robot Technologies” introduces and discusses knowledge and techniques 
to develop an autonomous indoor vehicle (AIV); section “AIVs at Work” intro- 
duces recent development work of AIV in various applications followed by the 
future development of AIV in section “Summary and Future Trends.” 


Wheeled Mobile Robot (WMR) Technologies 

To achieve autonomous motion, the indoor vehicle must be capable of solving 
several problems in order to move on the ground without colliding obstacles and 
fulfill desired tasks. In simple words, AIV must answer several questions to move 
from one point to another autonomously: 

• How to drive myself: vehicles kinematics and control 

• How is the environment: perception and map construction 

• Where am I: localization 

• How to reach the target: motion planning 

To answer these questions, the AIV needs a variety of sensors besides its motors 
and wheels. Some sensors are used to measure the status of AIV, and some sensors 
can provide perception to the surrounding environment. Only with proper selection 
and implementation of these sensors, autonomous capability can be achieved. 

In addition, classically, there are two types of basic representations: metric and 
topological. Topological map consists of nodes and links. A robot is either on one 
node or on one link between two nodes. In the metric framework, the environment 
is usually represented in two- or three-dimensional Cartesian space. The robot pose, 
its position plus orientation, is represented by a vector respect to a static Cartesian 
frame. Due to the precise representation of the metric framework, this chapter 
focuses on the metric representation only, especially Cartesian framework. 


Kinematic Design of WMRs 
Types of Wheel 

The first decision to make for designing a new WMR is often the determination of 
types of wheel to be used for the design. In the next section the major five types of 
wheel are introduced. 

Fixed Wheel 

Fixed wheel (Fig. 1) only allows the center of the wheel to rotate about a point on a 
fixed axis. By itself, such wheel design does not allow steering, but with 2 such 
wheels along the same axis, it is possible to direct the drive direction by varying the 
angular velocities of each wheel independently. However, such wheels are incapable 
of omnidirectional motion. One such example is the real wheel of a typical bicycle. 
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Fig. 1 Illustration of a fixed 
wheel 



Fig. 2 Illustration of a 
centered steering wheel 



Centered Steering Wheel 

For a centered steering wheel as illustrated in Fig. 2, similar to the fixed wheel 
concept, instead of revolving around a fixed axis which is attached to a body, an 
additional axis perpendicular while intersecting the wheel’s axis of rotation is 
added between the body and the wheel’s axis. This additional axis allows the 
drive wheel to be steered accordingly. However, such wheel designs do not have 
omnidirectional capabilities. One such example is the front wheel of a typical 
bicycle. 

Castor Wheel (Off-centered Steering Wheel) 

Castor wheel (Fig. 3) is different from the centered steering wheel by the fact that 
the steering axis that is parallel to the steering axis going through the center of the 
wheel with an offset distance along the wheel plane. Due to the offset between the 
two steering axes, castor wheel is also called off-centered steering wheel. 

Figure 4 shows a standard castor wheel which is commonly seen in a trolley and 
a dual castor wheel which is used in most of the office chairs. 
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Fig. 3 Illustration of a castor 
wheel 




Fig. 4 Left , standard castor wheel; right , dual castor wheel 


Swedish Wheel 

The Swedish wheel can be viewed as a fixed wheel with smaller passive rollers at 
the circumferences demonstrated in Fig. 5. This angled smaller roller translates part 
of the force from the main wheel to that of the direction along the small roller 
rotating axis to provide traction. The main wheel is actively driven and cannot be 
steered. When three or more of such wheels are used, omnidirectional motion is 
achievable. The discontinuous contact when the main wheel rotates is less 
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Fig. 5 Illustration of a 
Swedish wheel 




Fig. 6 Left , Swedish 45° wheel (also called Mecanum wheel); right, Swedish 90° wheel (also 
called omni-wheel or universal wheel) (Courtesy of VEX Robotics Inc) 


significant as compared to the previous design as there is at least one small roller 
always in contact on a flat ground. The smaller roller wheels can be mounted at the 
sides which can result in the wheel support colliding with obstacles or centrally 
which resolves this issue. 

Figure 6 shows two most common designs of the Swedish wheel according to the 
different arrangement of passive rollers. The Swedish 45° wheel is also known as 
Mecanum wheel where the angle between the passive roller axis and the main 
wheel axis is 45°. Similarly, the Swedish 90° wheel, which is sometimes called 
omni-wheel or universal wheel, has the angle between the passive roller axis 
and the main wheel axis of 90°. In order to reduce the contact discontinuity between 
the wheel and the ground, Swedish 90° wheel often adopts two layers of passive 
rollers. 
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Fig. 7 Illustration of a 
spherical wheel (Asada and 
Wada 1998) 


Stationary 
Rollers B 



Ring A 


Fig. 8 Spherical wheel on 
the Superchair robot in 
(Asada and Wada 1998) 



Spherical Wheel (Ball Wheel) 

The spherical wheel or ball wheel has a ball connected through rollers to the chassis 
passively or to the inside of a ring with rollers which is actively driven via friction. 
The illustration of a spherical wheel is shown in Fig. 7. This implementation gives 
good omnidirectional mobility. A real implementation of spherical wheel for 
WMRs is shown in Fig. 8. 


Classification of WMRs by Wheel Configuration 

As discussed in the previous section, different types of wheel present different 
kinematic constraints that affect the motion characteristics of the WMR. Depending 
on the adoption of the wheel type and the organization of the wheel configuration, 
WMRs can be classified into various types. This section introduces the most 
common types of WMRs with different wheel configurations. Examples of physical 
robots that belong to each type of WMRs are also given for reference. 
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Differential Drive 

Differential drive (Fig. 9) is one of the most common wheel configurations for 
wheeled mechanisms. There are two fixed wheels with a concurrent rotation axis on 
a differential drive WMR. The differential drive WMR has two motors are installed, 
and each motor drives one of the fixed wheels. Besides the pair of fixed wheels, 
there are usually one or two passive castor wheels on a differential drive robot for 
the purpose of balancing. The advantage of differential drive is that it is a very 
simple design. However, one major disadvantage of differential drive is that it is 
difficult to control the robot to move exactly in a straight line as this requires precise 
control of the two fixed wheel with the same velocity. Even if the velocities of the 
two fixed wheels are the same, the robot may not move straightly because the 
alignment of the axes of the two fixed wheels can never be perfect. An example of a 
differential drive robot is the Pioneer 3-DX shown in Fig. 10. 


Fig. 9 Illustration of a 
differential drive WMR 



Fig. 10 The Pioneer 3-DX 
differential drive robot 
(Courtesy of Adept Mobile 
Robots LLC) 
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Car-Like Drive (Ackermann Steering) 

Car-like drive (Fig. 11) is also known as Ackermann steering. In this type of 
vehicles, only the front two centered steering wheels are powered for both rolling 
and steering. The rear two wheels are fixed wheels and passive. The rotation axes of 
the rear two fixed wheels are concurrent. Although car-like drive is the predominate 
design for conventional 4- wheeled automotives, it is not common for WMRs to 
adopt this wheel configuration. Figure 12 shows an example of a car-like drive 
WMR which is the Summit robot from Robotnik Automation SLL. As compared to 
differential drive, it is easy for car-like MWR to move in a straight line with high 
precision. 


Fig. 11 Illustration of a 
car-like drive WMR 




Fig. 1 2 The summit robot 
with Ackermann steering 
(Courtesy of Robotnik 
Automation SLL) 



63 Autonomous In-door Vehicles 


2311 


Synchronous Drive 

Synchronous drive vehicles (Fig. 13) consists of three or more centered steering 
wheels. Due to a special design of the transmission mechanism that connects all the 
wheels, only two motors are installed in synchronous drive vehicles. The steering 
motion of all the wheels is synchronized and powered by one motor, while the 
translation of all the wheels is also synchronized and powered by the other motor. 

The design implies that all the centered steering wheels in a synchronous drive 
vehicle are always pointing to the same direction. In order to move from one point 
to another, synchronous drive vehicles always first steer all the wheels to face the 
goal and then move in a straight line toward the goal. 

Although the design concept is simple, practical design of the synchronization 
mechanism is not easy to achieve as it requires precise synchronization of all the 
wheels. An example of a synchronous drive WMR is the B14 robot from Real 
World Interface Inc. shown in Fig. 14. 

Tricycle 

A tricycle design is illustrated in Fig. 15. The three wheels of a tricycle are two 
fixed wheels and one centered steering wheel. Again, the two fixed wheels have the 
concurrent rolling axis. In most cases, the two fixed wheels are passive, and only the 
centered steering wheel is powered by two motors with one for its rolling and 
the other for the steering. Although the tricycle is also a common design for 
automotive, it is less common for mobile robots. An example of a tricycle WMR 
is shown in Fig. 16. 

Remark 1: Instantaneous Center of Rotation (ICR) There is an interesting 
geometric interpretation about the instantaneous center of rotation (ICR) of 
WMRs that have fixed wheel or centered steering wheel. In the examples of 
differential drive, car-like drive, and tricycle vehicles, the ICR for each of these 


Fig. 13 Illustration of a 
synchronous drive WMR 
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Fig. 14 The B14 series from 
WMR is a commercial 
example of a synchronous 
drive system (Courtesy of 
Real World Interface Inc) 



Fig. 15 Illustration of a 
tricycle 



vehicles is the intersection between the rolling axis of the fixed wheels and that of 
the centered steering wheels (Fig. 17). 

Omnidirectional Drive 

Omnidirectional WMRs are those that have simultaneous three degrees of freedom 
motion in a plane. The three degrees of freedom motion are two translational 
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Fig. 16 Example of a 
tricycle mobile robot 
(Courtesy of Olin College of 
Engineering) 




Fig. 17 Illustration of the ICR of three types of WMRs - left , differential drive; middle, car-like; 
right, tricycle 


velocities and one angular velocity. Such robots are useful in constrained environ- 
ment where space is a premium or when it needs to work in areas with human 
traffic. 

Except for the fixed wheel and centered steering wheel, the three other types of 
wheel introduced in section “Types of Wheel” are capable of constructing 
omnidirectional WMRs. 

Figure 18 shows the omnidirectional mobile robot developed in Singapore 
Institute of Manufacturing Technology. This robot consists of four castor wheels. 
Each castor wheel is powered by two motors with one for the rolling and the other 
for the steering. Figure 19 shows the CMU’s Uranus omnidirectional robot which 
consists of four Mecanum wheels. The Robotino robot from Festo shown in Fig. 20 
is currently commercially available. This robot adopts the omni- wheel to 
generate the omnidirectional motion. The last example of omnidirectional WMRs 
shown in Fig. 21 is the Superchair robot with four spherical wheels (Asada and 
Wada 1998). 
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Fig. 18 The omnidirectional 
mobile robot with castor 
wheels (Courtesy of 
Singapore Institute of 
Manufacturing Technology) 




Fig. 19 The uranus omnidirectional robot with four Mecanum wheels (Courtesy of CMU) 


Classification of WMRs by Degree of Maneuverability 

Following (Campion et al. 1996), WMRs are classified into five generic types based 
on their degree of maneuverability , S M , which is the sum of the degree of mobility, 
8 m , and the degree of steerability, 8 S , of the WMR. 
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Fig. 20 The robotino robot 
with omni-wheel (Courtesy of 
Festo) 



Fig. 21 The superchair robot 
with spherical wheels (Asada 
and Wada 1998) 



The five generic types of WMRs denoted by the combination of ( 5 m , <5 5 ) and their 
correspondence to the classification based on wheel configuration presented in this 
section are shown in Table 1. 
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Table 1 Five generic types of WMRs and their correspondence to the classification based on 
wheel configuration 


Type 

Wheel configuration 

Example 

(3,0) 

There is no fixed wheel and centered steering wheel. That is, only 
those with Swedish wheel (and its variations), castor wheel, and 
spherical wheel are involved 

Omnidirectional 

drive 

(2,0) 

There is no centered steering wheel and there is at least one fixed 
wheel. If there is more than one fixed wheel, the horizontal rotation 
axes of all those fixed wheels are concurrent 

Differential 

drive 

(2,1) 

There is no fixed wheel and there is at least one centered steering 
wheel. If there is more than one centered steering wheel, the 
motion of the centered steering wheels needs to be coordinated so 
that 8 S 1 

Synchronous 

drive 

(U) 

There is at least one fixed wheel and at least one centered steering 
wheel. If there is more than one fixed wheel, the horizontal rotation 
axes of all those fixed wheels are concurrent. If there is more than 
one centered steering wheel, the motion of the centered steering 
wheels needs to be coordinated so that 8 S 1 

• Car-like 
Tricycle 

(1,2) 

There is no fixed wheel and there are at least two centered steering 
wheels. If there is more than two centered steering wheels, the 
motion of the centered steering wheels needs to be coordinated so 
that 8 S 2 

This design is 
rare 


Motion Control of Nonholonomic WMRs 

It is worth noting that kinematic model-based motion control of nonholonomic 
WMRs has attracted much more research efforts than that of dynamic model based. 
This is due to the theoretical challenge in the kinematic motion control problem of 
systems with nonholonomic constraints. In the following, an example of the 
nonholonomic kinematic constraint is presented for the case of unicycle. Unicycle 
is the simplified example representing a general WMR but yet captures 
the nonholonomy property. 

The well-known Brockett theorem is then presented which sets the basis for the 
subsequent discussion on various control approaches for wheeled mobile robots. 
Literature reviews of the two basic motion control problems, namely, the trajectory 
tracking and posture stabilization, are also presented. 

Remark 2: Holonomy The nonholonomy concept is defined in (Campion 
et al. 1996) based on the so-called configuration state space model, and the 
conclusion is that all types of WMRs are nonholonomic. According to this defini- 
tion, there is no truly holonomic WMRs. However, it is not unusual to find literature 
that uses the term of holonomic WMRs. For example, the concept holonomic WMR 
has been formally defined in (Qingxiao Yu 2012) as a robot is holonomic if and only 
if the differential degree of freedom is equal to the degree of freedom of the system. 
So it is worth to clarify the confusion about the holonomy concept in WMRs 
community. 
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By examining the two literatures, it is not difficult to know that in 
(Siegwart and Nourbakhsh 2004) the definition of the degree of freedom 
(DOF) is equivalent to that of the degree of maneuverability, while the 
definition of the differential degree of freedom (DDOF) is equivalent to that 
of the degree of mobility in (Andrea-Novel et al. 1995). According to the 
definition of nonholonomy defined in (Andrea-Novel et al. 1995), if we apply 
the calculation of the degree of nonholonomy based on the posture kinematic 
model, it is not difficult to find out that all the three types of omnidirectional 
WMRs presented in 2. 1.2.5 are holonomic. With the above clarification, 
nonholonomic WMRs are referring to all types of restricted mobility WMRs, 
while holonomic WMRs are equivalent to omnidirectional WMRs (also called 
full-mobility WMRs) according to the classifications defined in (Campion 
et al. 1996). 


Nonholonomic Kinematic Constraint 

Wheeled mobile robot is the classical example of nonholonomic systems. The 
nonholonomic kinematic constraints that exist in WMRs describe the pure rolling 
without slipping characteristic of wheel motion. Intuitively the constraint equations 
describing the pure rolling without slipping characteristic involve the differential 
configuration variables of the system. For the equations to be nonholonomic, it 
requires that they are not integrable with respect to the differential configuration 
variables. Mechanisms with nonholonomic kinematic constraints are typically 
characterized by under-actuation, i.e., the system has fewer controls than the 
number of configuration variable. 

For example, in the case of unicycle (Fig. 22), the number of configuration 
variable is 3 and can be represented by the configuration vector 






Fig. 22 Illustration of a 
unicycle 


x 
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However, there are only two independent control inputs which are the linear 
velocity of the wheel v and the angular velocity of the wheel about the vertical axis 
w. Similarly represent the control vector is represented as 



The unicycle is the best example to demonstrate the nonholonomic kinematic 
constraint that appears in all types of wheel, and the equation of such constraint is 

x sin 0 — y cos 9 = 0 (1) 

In the next the explanation of why the above equation is not integrable will be 
given. 

To systematically prove that (1) is not integrable requires a few tools from 
differential geometry including Lie bracket, regular distribution, and involutive 
closure. With these tools, the method to examine the integrability of a constraint is 
given by the Frobenius theorem which states that a regular distribution is integra- 
ble if and only if it is involutive. A detailed examination of the non-integrability for 
(1) is given in various literatures such as (Mason et al. 2013). 

The Brockett Theorem 

The presence of nonholonomic kinematic constraint in WMRs introduces a theo- 
retical challenge to the motion control problem of WMRs. The theoretical chal- 
lenge is best described by the Brockett theorem (Brockett 1983) which concludes 
that there is no smooth time -invariant control law able to asymptotically stabilize 
controllable nonholonomic systems. 

It is the above theoretical challenge that motivated many researchers in the past 
two decades to work on the motion planning and control of nonholonomic WMRs. 
In the next the major research outcomes dedicated for the motion control of 
nonholonomic WMRs will be reviewed. The review is covered for the two basic 
motion control problems, namely, the trajectory tracking and posture stabilization. 

Trajectory Tracking 

It is an interesting fact that trajectory tracking is easier than the posture stabilization 
for nonholonomic WMRs (De Luca et al. 1998, 2001). It is proved that 
nonholonomic WMRs are possible to be locally stabilized by linear feedback 
about trajectories, and it is possible to use linear design techniques in order to 
obtain local stabilization for arbitrary feasible trajectories (Canudas de Wit 
et al. 1993). The trajectory tracking problem of nonholonomic WMRs has also 
been solved by many researchers using nonlinear control design such as 
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time-varying feedback stabilization (Samson 1993) and dynamic feedback 
linearization (Andrea-Novel et al. 1995). 

Posture Stabilization 

Although posture stabilization is not achievable with static state feedback lineari- 
zation in the original Cartesian coordinate frame, exact feedback linearization can 
be realized if the problem is transformed to the polar coordinate frame (Park 
et al. 2000). However, transformation to polar coordinates is not attractive for 
WMRs with complex kinematics. Samson has pioneered in the time-varying tech- 
niques (Samson 1993, 1995). Comparative results for both smooth and discontin- 
uous time-varying feedback controllers are presented in (De Luca et al. 1998, 
2001 ). 

It is noted that the problem of path following for WMRs can be formulated as a 
subproblem of posture stabilization (De Luca et al. 1998), so most results obtained 
for posture stabilization are applicable to the path following problem. 


Sensors 

The aim of this section is to survey the various types of sensors that are associated 
with the operation of an autonomous indoor vehicle (AIV). This operation would 
encompass obstacle avoidance, localization, and mapping. 

For AIVs, perception of its environment is essential for its ability to perform and 
locate itself accurately and precisely within that environment. There are many 
sensors on the market that provide some form of information in regard to the AIV 
in relation to its surroundings. These can be broadly classified into two categories 
which obtain absolute and relative measurements. Absolute measurements are 
readings based on some (external) reference signal/network, which can be consid- 
ered as external sensing. Relative measurements are measurements based on the 
AIV (internal) motion, which can be considered as internal sensing. 

The first two sections will touch on these two categories of sensors, and the third 
section will look at a case study. 

Exteroceptive Sensors 
Digital Compass 

A digital compass, or a magnetometer, has the ability to give absolute heading in 
reference to the Earth’s magnetic poles. This is useful for an AIV to determine its 
orientation that would help in its localization. This sensor is able to provide 
readings in the form of bearings (0-359°) and sometimes requires calibration. 
This is commonly used in ships for navigation to give its heading, and it is useful 
for AIVs to rely on an absolute form of measurement. 

Some downsides are that it is susceptible to electromagnetic interference (EMI). 
This may arise from equipment that emits EMI or has ferrous -magnetic metals. As 
such, gyroscopes which measure angular rate show advantages over these 
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Fig. 23 (a) Devantech CMPS03 tilt-compensated digital compass, (b) OceanServer OS5500-S 


shortcomings. The accuracy of digital compass would also depend on the geograph- 
ical location and the degree of tilt of the compass. These two can have significant 
effect on the heading accuracy. To overcome such effects, there are tilt- 
compensated compasses. Nowadays, digital compasses are even coupled with 
3 -axis digital accelerometer that would be able to perform dead reckoning in 
situations where GPS signal is poor. This can be seen in mobile devices used for 
navigation. Figure 23 shows some commercially available tilt-compensated digital 
compasses. 

Infrared and Sonar 

In this section, we cover two types of low-cost sensors that are able to provide 
distance measurements. Although they provide the same distance measurement, the 
methodology behind them is different. 

For infrared, it provides a “point” type of measurement where it emits a single 
point of laser and measures the angle at which the receiver obtains the reflected 
signal. This type of measurement is also known as triangulation. The angle at which 
the receiver obtains is determined by the charge-coupled device (CCD) array. The 
position at which it falls allows us to determine the angle at which it is coming into 
the receiver. With the prior knowledge of the distance between the emitter and 
receiver and the angle that the emitter emits, the distance can be determined through 
triangulation. This sensor provides analogue measurement, and an A/D converter 
with some filtering is required for good results. 

A very common infrared sensor would be SHARP sensors (Fig. 24). They have a 
few models with different distance ranging, and price range is about USDS 15-25. 
Some disadvantages of this type of sensor are that it might miss the legs of a chair 
for an obstacle and under strong lighting condition or not so favorable surface, the 
ranging measurements might get affected. 

Ultrasonic sensors send out several pulses in a “conic” shape and waits for the 
signals to return. During this time, it measures the time and waits for it to return. 
This measurement for the time of flight allows the distance measurement to be 
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Fig. 24 (a) SHARP GP2D12, (b) triangulation to obtain the distance measurement 



Fig. 25 (a) The “conic” beam shape pattern emitted by an ultrasonic sensor, (b) Devantech’s 

SRF08 


calculated with the speed of sound known. Figure 3 shows some models of 
ultrasonic sensors. 

Some provide digital or I2C output. Using I2C communication protocol is quite 
useful as only using two pins one is able to connect up to 16 devices to the same 
pins. Devantech’s SRF08 (Fig. 25) is one such example. 

This is a very affordable “contactless” bump sensor that many AIVs can utilize. 
It is already seen in several car models that assist when reversing. However, it might 
not be able to detect a doorway opening although it can detect the legs of a chair. 
Surfaces that do not fully reflect the sound waves would also not be so favorable. In 
addition, “ghost” echoes might be detected when multiple ultrasonic sensors are 
used. Some affordable ultrasonic sensors would be from Parallax, Devantech, and 
MatBotix. 


Vision Camera 

Having a camera mounted on an AIV has many advantages. It can first be used to 
identify if there are any obstacles in the near vicinity of the AIV. It can also act as 
eyes for any user that would like to have some form of surveillance in their AIV 
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Fig. 26 (a) Successive images captured help to generate the vector which helps estimate AIV 
motion; (b) Mars exploration rover that successfully applied visual odometry in a high-slippage 
environment 


(Konolige and Agrawal 2010). Not only that, image processing can be performed so 
that they are able to assist AIV in estimating their motion. 

Visual odometry is a technique that estimates motion based on stereo imagery. 
This is a low-cost method which can prove to be quite useful for AIV navigation 
(Konolige and Agrawal 2010). By tracking the distinctive features within a scene, 
one is able to obtain the estimated motion of the AIV (Helmick and Cheng 2004). 
Figure 26 illustrates how the optical flow vector is produced when successive 
images are compared. 

Figure 26 may be simplistic in nature but helps conveys the concept behind 
visual odometry. It performs selection of certain features and tracks these features 
between images. 

Visual odometry has proven to be beneficial in high slip environments. More- 
over, it provides a “look ahead” safe approach that allows the AIV to terminate or 
modify its path that could be due to an obstacle (Cheng and Maimone 2006). 
Besides providing sensory input in its autonomy, it provides visual feedback for 
any human that would like to override the autonomous navigation and perform it 
manually. 

One of the shortcomings of visual odometry is that it might not always provide a 
valid solution. This can happen when images are blurred or there is too large a 
motion that leads to insufficient overlap in images (Cheng and Maimone 2006). In 
feature -poor terrain, additional information must augment visual odometry. This 
can be dealt with by infusing with another type of sensor be it the AIV odometry 
(Fojtu and Havlena 2012). Some work, which fuses visual odometry and IMU, has 
shown to have localization errors of less than 0.1 % (Konolige and Agrawal 2010). 

3D Camera: RGB-D and TOF 

3D cameras, or ranging cameras, in general not only have images, but they are also 
able to get depth information of its surroundings. There are actually a few tech- 
niques in which it is able to do so, and this is beneficial in obtaining a 3D map of the 
factory floor. Below we discuss just some of the notable techniques used in 
commercially available products. 
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RGB-D 

Within the recent years, there has been much interest in the Microsoft Kinect and 
Asus Xtion (Fig. 27) and how they are able to enhance the gaming experience with 
gesture and motion recognition and being able to obtain a 3D model of the 
environment. There has been even greater interest from the research community 
when Microsoft released APIs for this sensor. With this release, many researchers 
and hobbyists were able to fully utilize and experiment with this sensor. 

This type of camera operates on the concept of optical ranging using speckle 
patterns. It has an IR emitter that projects a speckle pattern onto the environment 
and an IR sensor that captures these speckles. It first captures reference speckle 
images at different distances then captures test image of the speckle on the object. 
Afterward, it obtains the range by matching the test image to its reference image. 
This offsets or deformation in the pattern when compared to the referenced speckle 
allows it to construct a 3D map (Javier Garcia and Zeev Zalevsky 2008). 

TOF 

Another type of ranging camera would be a TOF camera. A TOF camera is able to 
generate a 3D map of its surroundings by comparing the phase shift in the 
modulated emitted light and received light. This phase shift angle is directly 
proportional to the distance that light travels from the emitter to the object and 
back. This allows the depth information of the object or person to be obtained. A 
TOF camera would generally include (not limited to) a modulated illumination 
source, a detector to receive the modulated light reflected from the object, and a 
program to compare and obtain a phase shift from these two signals. This TOF 
technique can be seen in the cameras in Fig. 28. It has been mentioned that a TOF 
camera will be featured in the upcoming second generation Kinect. 

Just recently announced was MIT’s effort in creating a TOF camera that is not 
only affordable but also not fooled by translucent or near transparent objects. This 
would prove to have significant advantage over state of the art if this product is 
commercialized. 



Fig. 27 (a) Microsoft 
Kinect, (b) ASUS XTion 
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Fig. 28 (a) Mesa swiss ranger 4000 (b) PMDTecCamCube 3.0 


Laser Range Finder 

Laser scanners are similar to infrared in the sense that it shoots out an infrared 
beam and senses it through tri angulation (Fig. 29). The only difference is that 
it is now equipped with a revolving mirror. With the revolving mirror, it is 
able to scan a plane. 2D laser scanners, together with infrareds, have a 
minimum and maximum distance that it can measure. It also has a limited 
rotational view. With that planar distance, one is able to detect obstacles and 
localize and generate a map. 

Although a 2D scanner can only scan a plane, there has been work on using it to 
generate a 3D map. Just by attaching this 2D scanner to a pitch actuator, one is able 
to get a 3D scan. With an additional yaw actuator, one would be able to generate an 
omnidirectional 3D scan registration surrounding the AIV. 

Others 

Besides the abovementioned, there are those that involve some form of an indoor 
global positioning system (iGPS). This will be covered under the section Locali- 
zation and Navigation. 

Proprioceptive Sensors 
Odometry 

Odometry is the bread and butter for AIV localization and navigation. Odometry is 
the measure of displacement of the AIV’s wheels and computes the displacement of 
the AIV through kinematics. These measurements are through the means of an 
encoder. The encoder measures directly the angular displacement of the motor 
shaft. With a known wheel diameter, the linear displacement of the traversed path 
can be known (Fig. 30). 

However, encoder measurements suffer from problems of accurate wheel diam- 
eter estimation, uncertainty in the determination of the contact point with the floor, 
dirt accumulation, wheel slippage, and wear and tear (Palacin and Valgaon 2006). 

It is a good form of information in regard to the AIV’s displacement, but the 
above drawbacks will be significant when accumulated over time. It is often used 
but always in conjunction with other sensors that would help compensate for the 
accumulated errors. 
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Fig. 29 (a) SICK, (b) Hokuyo URG-04LX-UG01, (c) Active scan of a 2D laser scanner 
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Fig. 30 Obtaining the linear displacement by encoder measurement of angular displacement 

Inertial Measurement Unit 

An IMU is able to measure acceleration in 1, 2, or 3 axis. Through this measurement 
of acceleration when integrated with time provides velocity. That, in turn, when 
integrated once more provides displacement. Some IMUs are coupled with 
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Fig. 31 Crossbow’s (a) IMU770 fiber-optic gyro and (b) IMU440 MEMS-based gyro 


gyroscope which allows measurement of the change in orientation about the three 
axes. Listed below are two examples of IMU/gyro that uses different technology to 
achieve the same result. One is MEMS based, while the other is fiber optic (Fig. 31). 

This is commonly used in aerospace, aeronautical, and defense as this is able to 
provide a vehicle’s position and orientation. Some military-grade IMU can be quite 
expensive, but it is able to provide pinpoint accuracy. 

For AIVs, a 3 DOF IMU/gyroscope would be sufficient to determine its position 
on a 2D plane and its orientation with respect to the normal to the ground. The 
tricky part in dealing with IMUs is that they are prone to accumulated errors. This is 
known as “drift.” Because it is always accumulating even the smallest of errors, 
over time, this will be significant. It is important to address this either by using 
certain software filters such as Kalman filter or by incorporating other sensors 
(Evennou and Marx 2006; Bezick and Pue 2010). 

Gyroscope 

A gyroscope measures the rotational motion of a body. This measurement output 
can be in the form of angular rate (degree/s) or angular displacement (degree) at 
certain intervals. This is commonly complemented with an AIV odometry. This is 
commonly used in guided missiles, planes, and ships for navigation to give its 
heading. 

There are generally two kinds of gyroscope on the market: microelectrome- 
chanical systems (MEMS) and fiber-optic gyros (FOG). There are many types of 
MEMS gyroscopes out there, but they generally measure the displacement of a 
mass as it is rotated caused by the Coriolis force read from a capacitive sensing 
structure. For FOG, there are two laser beams fired into a fiber-optic cable but in 
opposite directions in a coil. Due to the Sagnac effect, the beam traveling against 
the rotation experiences a slightly shorter path delay than the other beam. This 
difference is measured and translated into angular velocity (Merlo and Norgia 2000). 
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Fig. 32 (a) KVH DSP-4000 fiber-optic gyroscope, (b) Xsens’ MTi 10-Series (MEMS Attitude 
and Heading Reference Systems) 


Fig. 33 Adept Lynx based 
on differential drive can 
handle payloads up to 60 kg, 
travel at maximum speed of 
1.8 m/s, and have run time of 
19 h a day 



Figure 32 shows some commercial products which range from low-cost to military- 
grade products. 

The FOG gyros on the market still currently outperform MEMS gyros but only 
marginally. The cost, however, for FOG is almost 10 x, and time will tell if the 
performance of MEMS gyro will soon catch up with FOG. 

Case Study: Adept Lynx 

Here, a case study of an AIV that has been announced in the early part of 2013 is 
presented (Fig. 33). The Adept Lynx is the highest profile product coming from the 
California-based company. It is an AIV that is designed for a dynamically changing 
environment which includes confined passageways as well as crowded locations. It 
does not require any infrastructure deployment from the environment, and it uses 
proprietary software and control to circumvent obstacles. 
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Sensors 

This AIV has a gyroscope which helps measures the robots rotation. This will be 
used in conjunction with the encoder on each wheel. Each encoder can provide how 
fast the wheels are turning, the distance covered and the direction it is turning. 

In front, it has four sonars that help detect low-lying obstacles and additional 
four sonars on the back when it is reversing. The range is up to 5 m, though 
the typical accurate range is only up to 2 m. It has been noted that the only 
times when the platform will back up is when docking on the charging station or 
when the bumper has hit an obstacle in which the AIV will reverse so that it can 
rotate freely. 

A safety 2D laser scanner helps localize and navigate in accordance with its 
mapping software. This scanner provides 500 measurements within a 250° field of 
view, with a typical maximum range of 15 m. One bumper, with two pairs of 
sensors, is mounted at the front of the platform, should the obstacle avoidance 
systems fail to detect an obstacle. 

Software 

For initial operation setup, the Adept Lynx would need to generate a map by first 
tele-operating it around the environment so that it has an idea how the environment 
is. Adept’s MobilePlanner software builds floor plans and maps for AIV applica- 
tions. Once the necessary features are captured, virtual routes and forbidden lines 
can be made known to the AIV for safe operation. Its other software, MobileEyes, 
allows you to see the situation on the ground and observe the actual navigation 
process on the virtual map. It also provides a visualization for path planning and 
obstacle avoidance. 


Navigation 

Localization 

To achieve autonomous navigation, a mobile robot has to know its own position and 
orientation, or say, the robot pose, relative to the environment. Such need of 
localization comes from two basic requirements. Firstly, the robot needs to know 
whether it has reached its target or goal. Secondly, to construct the environmental 
map or to follow a planned path, the robot has to know its own position and 
orientations. 

The most straightforward approach of localization is using AIV odometry-like 
data (Borenstein 1995). Based on wheel encoders, gyroscopes, optical flow, and/or 
IMU devices, it is possible to calculate the robot pose relative to its starting pose. 
The drawback of such approach is that it has unbounded localization error so that 
the localization result may not be consistent in a long run. In short run, such 
approach is usually accurate enough. However, after traveling some certain dis- 
tances, the localization error will be too large to be acceptable. Therefore, AIV 
needs to implement other localization approaches with consistent localization 
results. 
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There are several ways to achieve bounded localization error. Firstly, localiza- 
tion can be done using pre-setup solutions, like indoor positioning system (iGPS). If 
an environmental map is available, AIV can also localize himself using sensors like 
laser range finders or vision cameras. If no environmental map is available, the 
robot can still localize itself through simultaneous localization and mapping 

(SLAM). 

Indoor Positioning System (iGPS) 

GPS in the past was only used for military and aeronautical purposes. Today, it has 
been extended to consumer applications, and it can be seen in car navigation, 
mobile devices, and several others. Satellite GPS uses trilateration which utilizes 
three or four satellites to pinpoint a location on the Earth’s surface. Figure 34 
illustrates the concept of trilateration. To pinpoint a location on a unique plane, at 
least three signals are required. The satellites are located in the center of each of the 
circles. Based on the signal strength, it is possible to determine the distance of 
the object from the satellite. With three satellites, the location of the object can be 
known from the three intersecting circles. 

Unfortunately, GPS requires line of sight which is lost indoors. Hence, there has 
been work to develop an indoor version of GPS, and below shows some of that 
effort. 

This type of iGPS obtains the absolute coordinates of the AIV within the RFID 
network space by utilizing RFID tags and reader (Shenoy 2005). The location 
sensing system makes use of the received signal strength indication (RSSI) to 
estimate the distance between the RFID tags in context and thus localize the AIV 
with the help of inter-tag distance (Hightower 2001). Some special RFID systems 
are able to obtain the positioning accuracy of about +/— 2 cm (Qingxiao Yu 2012). 
Figure 35 below shows one type of RFID reader used to identify the vehicle’s 
location status in the warehouse management systems as it moves through the 
tag-equipped zones. 

Some advantages of this type of localization are that it does not require a line of 
sight and that it provides high data rate and security. Having said that, the environ- 
ment’s infrastructure still needs to be modified by the placement of these tags, but 
its compactness provides minimal changes required (Yamano 2004). 


Fig. 34 Basic concept of 
trilateration 
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Fig. 35 Intermec’s IV7 RFID vehicle mount reader 


Fig. 36 Ultrasonic network 
with RFID. Each ultrasonic 
transmitter has unique RFID 
identifier 
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This iGPS makes us of many ultrasonic transmitter and receivers to determine 
the position of the AIV by means of trilateration. The transmitters are at known 
locations and send out pulses. Measuring the reflected time of flight and the speed 
of the ultrasonic signal, one is able to obtain the distance measurement. When this 
coincides with a few transmitters, an exact location can be defined. This ultrasonic 
network is in tandem with RFID tags which are used to identify which ultrasonic 
transmitter corresponds to which node (Sung-Bu Kim 2007). Figure 36 below 
shows an illustration of this ultrasonic network. 

This iGPS relies on affordable sensors which makes it a cost-effective solution. 
However, it requires line of sight with the vehicle. If it is in a crowded human 
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Fig. 37 (a) Artificial landmark; (b) artificial landmark with triangles 1, 3, and 6 coated with 
infrared light reflecting material 


environment or where the ultrasonic transmitters are not able to reach the receiver, 
the localization accuracy might be compromised. 

Instead of ultrasonic sensors and RFID used in the previous section, this iGPS 
uses IrDA markers and a camera. Landmark markers are simple and useful means 
for indoor localization. Utilizing special markers, such as the one shown in Fig. 37, 
one is able to estimate the orientation and position of an AIV. The blue triangles are 
used to estimate the orientation, while the other triangles, depending on whether 
they are coated with infrared light reflecting material or transparent film, are for 
unique identification (Lee 2009). Successive images captured by an onboard cam- 
era and infrared LEDs are compared. The translational and rotational changes are 
computed by image processing and when feature landmark extraction is performed. 

This type of system can also make use of the odometry information to estimate 
position when these markers are not within the field of view (Lee 2009; Tenmoku 
2003). This system is an alternative method to the previous section and presents 
positioning and orientation accuracy of close to +/— 0.73 cm and 5.25°. 

The system might face some difficulties when ceilings are high or unstructured. 
In addition, for big factories or warehouses where a significant number of identifiers 
have to be used, the number of artificial landmarks and/or its size might have to 
increase. Either that or the camera capturing of the landmark would have to be of 
higher quality which would in turn increase the computational image processing 
time. This might cause some concern for scalability. 

In comparison to the previous sections of iGPS, the infrastructure of the envi- 
ronment that the AIV is roaming in would need to be modified to accommodate the 
various sensors, be it ultrasonic or IrDA. However, in most work/factory environ- 
ment, there might already be a network that can be tapped on by the AIV to localize 
itself. 

In homes, office and work environments, there are already existing wireless 
networks that allow seamless communication within the establishment and provides 
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Fig. 38 Wireless router provides means of non-line-of-sight localization with little or no deploy- 
ment effort 


this medium in which this can be realized (Fig. 38). This is an attractive notion 
because this disposes of the need to deploy a network to localize an AIV and does 
not require a line of sight. It would depend on the received signal strength (RSS) 
(Evennou and Marx 2006). 

The RSS would be indicative of the distance that AIV is from that router. When 
combined with multiple points, the location can be determined accurately, through 
trilateration, with low computational requirements. Some work has shown accuracy 
of up to 0.7 m in small buildings and 4 m in large buildings (Moustafa Youssef 
2005). 

Although the accuracy is not as close to those mentioned in the previous 
sections, it is quite an achievement in obtaining accuracy mentioned. In practical 
applications, this can be used as a form of coarse localization. It can be used 
together with other sensors to obtain a finer localization. 


Map-Based Localization 

If an environmental map is available prior to the autonomous task, localization can 
be done using map-based localization approaches. The environmental map could be 
a feature map, recording position of a set of features which can be observed and 
extracted by robot sensors. The most common type of feature used is point features, 
often referred as landmarks. Another type of environmental map could be the 
occupancy grid map. It consists of grid cells, and each cell indicates the occupancy 
of obstacles. 

To calculate the robot pose based on a given map, sensor measurement data, and 
robot odometry data, the most popular approach of map-based localization is 
particle filters (PF) or Monte Carlo localization (Dellaert et al. 1999). Here, particle 
filters, also referred to as sequential Monte Carlo method, are a recursive Bayesian 
filter with particle representation. A particle set is a set of point clusters which 
represents the Bayesian posterior. Unlike parametric filters such as Kalman filters, 
PF is a nonparametric method and can represent nonlinear, non-Gaussian model/ 
noise. This advantage is significant since real-world problems are often 
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complicated. PF can be applied in map-based/map-aided localization, often called 
Monte Carlo localization (MCL) (Dellaert et al. 1999). 

In MCL, usually, the robot motion model is used to estimate the robot pose 
represented by a set of particles, i.e., the proposal distribution. Then the weight of 
each particle is calculated based on latest sensor measurement and the available 
environment map through measurement model. Finally, resampling process will 
update the particle set. These processes will repeat over time to provide real-time 
localization information of the robot. 

Simultaneous Localization and Mapping 

If environmental map is not available, it has to be constructed when it is localizing 
itself simultaneously. Such task is much more complicated and called simultaneous 
localization and mapping (SLAM) (Aulinas et al. 2008) or concurrent localization 
and mapping (Rencken 1993). 

The practical objective of SLAM is to concurrently update the robot pose 
and environmental map consistently and efficiently. Similar to the map-based 
localization, two types of environmental map can be constructed by SLAM. One 
is feature map and the other one is nonparametric map like occupancy grid map. 
Both robot sensors and environment contain noises. Therefore, stochastic 
approaches are often considered in SLAM to handle such uncertainties to guarantee 
consistent results. 

The first milestone of SLAM is the development of extended Kalman filter 
SLAM (Julier and Uhlmann 1997) in 1997. After that, Montemerlo proposed an 
RBPF-based SLAM approach known as FastSLAM (Hahnel et al. 2003; 
Montemerlo et al. 2003). In FastSLAM, Rao-Blackwellized particles (Doucet 
et al. 2000) are used to represent the posterior over robot paths. The features are 
extracted from sensor measurement and estimated using extended Kalman filters 
(EKF). This well-known algorithm has been proven to be a very efficient means for 
solving the SLAM problem. In recent years, SLAM has progressed from 2D SLAM 
to 3D SLAM. In (Jonathan Courbon et al. 2010), Courbon demonstrates 6 DOF 
SLAM with color camera mounted on small quadratic helicopters using 
FastSLAM-based algorithms. Engelhard et al. present successful SLAM known 
as RGBDSLAM (Engelhard et al. 2012). In this approach, 3D sensor is used to 
perceive 3D environment, and HOG-Man (Grisetti et al. 2010) framework is 
applied to solve SLAM problem in 6° of freedom (DOF). Besides (Engelhard 
et al. 2012) Pomerleau et al. proposed another 6 DOF SLAM approach called 
Fast ICP. It uses iterative closest point (ICP) method to solve SLAM problem. 

Map Construction 

For mobile robot navigation, the representation of the environmental map (Thrun 
2002) is quite important for different tasks. It is the foundation of environmental 
mapping approaches. 

The most common approach for parametric map representation is often known as 
the feature-based approach. Features are extracted from the environment, and the 
geometric information of the features is represented by a few parameters to represent 
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Fig. 39 Example of 
occupancy grid map. White, 
free space; black, obstacles; 
gray, unknown space 



the position of the feature. Features are extracted and mapped, and they are also 
known as landmarks. Most feature extractors take out a small number of features 
from high-dimensional sensor measurements. This approach greatly reduces the data 
size for processing and thus reduces the computational complexity, making the 
following processing more efficient. These feature -based map representations are 
efficient and easy to implement. With this information about the mapped features, the 
robot pose can be simultaneously updated (Dellaert et al. 1999). 

The most common approach for a nonparametric map representation is the 
occupancy grid map (Luo et al. 2004). The basic idea of occupancy grids is to 
represent the map as a field of random variables, arranged in an evenly spaced grid 
(Fig. 39). Each random variable is binary and corresponds to the occupancy of the 
location it covers. Occupancy grid mapping algorithms implement approximate 
posterior estimation for those random variables. 

Occupancy grid maps can represent very complex, highly unstructured environ- 
ments. However, computational and memory efficiencies are the drawbacks. New 
approaches have been proposed for 3D mapping like (Henry et al. 2010). 

Path Planning 

Path planning (motion planning) is traditionally divided into two categories: global 
path planning and local path planning. In global path planning, prior knowledge of 
the environment, i.e., environmental map, is available. From an environmental 
map, a path can be planned for the robot to reach the final target. Local path 
planning methods could be called reactive strategies and are completely based on 
sensory information. Such approaches focus on obstacle avoidance and often 
require real-time processing speed so that the robot reacts fast to avoid obstacles. 
For example, vector field histogram (VFH) (Borenstein and Koren 1991) (Ulrich 
and Borenstein 1998) approaches and vector polar field histogram (Gong 
et al. 2007) approaches are of this kind. 
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For global path planning, in general, a plan imposes a specific strategy or 
behavior on a decision maker to take actions (Lavalle 2006). Here are several 
basic concepts for path planning: 

• Configuration space C: a configuration describes the pose of the robot, and the 
configuration space C is the set of all possible configurations. The configuration 
space which is not colliding with any obstacles is often denoted as Cfree. 

• The robot state: the pose of a robot at a time instance. 

• Initial and goal states: a planning problem usually involves starting in some 
initial state and trying to arrive at a specified goal state or any state in a set of 
goal states. 

• The actions: motion taken by the robot. 

There are generally two different kinds of planning concerns based on the type of 
criterion: 

• Feasibility: find a plan that causes arrival at a goal state, regardless of its efficiency. 

• Optimality: find a feasible plan that optimizes performance in some carefully 
specified manner, in addition to arriving in a goal state. 

For AIV, feasibility means to find a path from robot pose to the desired 
destination without hitting any obstacles and satisfies all constraints. There are 
several constraints that can be taken into considerations. For example, the obstacles, 
robot kinematics, task requirement, etc. Optimality can be evaluated by shortest 
distance, smooth path curvatures, etc. 

A common method to solve global path planning is grid-based approaches. It 
represents the environment (configuration space) by a grid, and assumes each 
configuration is identified with a grid cell. The state of each grid could be 
“obstacle,” “empty” (Cfree) or “unknown.” Furthermore, it is assumed that if two 
grids of Cfree are next to each other, the robot can move freely between them. 
Based on these assumptions and setups, path planning problem can be solved by 
“Search algorithms” searching a path from starting cell to the goal cell through all 
“empty” cells. For example, algorithms like Dijkstra’s algorithm (1959) or 
A*algorithm (Hart et al. 1968) can be used to find a path from the start to the goal. 

Sampling-based algorithms represent the configuration space with a roadmap 
(e.g., Visibility or Voronoi Diagrams). The starting point and goal point can be 
linked to the roadmap and the complete path between them could be found out from 
the roadmap. Examples of such approaches are Rapidly Exploring Dense Trees 
(RRT) or Probabilistic roadmaps (PRMs). A roadmap is then constructed that 
connects two milestones P and Q if the line segment PQ is completely in Cfree. 
Again, collision detection is used to test inclusion in Cfree. To find a path that 
connects starting point and goal, they are added to the roadmap. If a path in the 
roadmap links starting point and goal, the planner succeeds and returns that path. If 
not, the reason is not definitive: either there is no path in Cfree or the planner did not 
sample enough milestones. 
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AIVs at Work 

In this section, we focus on mobile robots designed and implemented in indoor 
environment. More specifically, we focus on autonomous indoor vehicles which are 
wheeled indoor mobile robots with higher level of autonomy. 

Autonomous indoor vehicles have been implemented in many areas and appli- 
cations. There are over five million domestic robots, the autonomous vacuum 
cleaners working all over the world. There are a number of autonomous vehicles 
working in factories and laboratories. Scientists have developed various applica- 
tions and functions to autonomous indoor vehicles, some equipped with robotic 
arms. The design, size, and shape of these AIVs vary very much. Based on the 
environment it is working in, here, we think it can be separated into two categories. 
One type of AIV is designed for office-like environment, including home, hospital, 
and similar indoor environment. Another type is AIV designed for factory envi- 
ronment to carry heavy loads. 

Currently, there are quite many AIVs available in the market. Domestic AIV is 
getting more and more popular. They have been sold in the number of millions. 
Simple design with bumper and infrared sensors make them extremely cheap so that 
it can be afforded by every family. However, the application of such system is 
limited to cleaning or similar tasks due to limited autonomy. It can only achieve 
random autonomous motion. 

AIVs implemented in industrial and lab/office environment is quite reliable to 
perform point-to-point transportation. Such AIVs are often equipped with laser 
range finders for self-localization and a number of sonar sensors or bumpers for 
obstacle avoidance. These AIVs can move on flat ground following predefined path 
without hitting obstacles or moving passengers. 

AIV developed for R&D applications can be equipped with expensive sensors 
with 3D vision, sometimes a robotic arm also. It can observe the environment and 
perform complicated tasks like serving elderly people or searching for particular 
targets. The capabilities are achieved through sophisticated robotic software 
approaches, and researchers keep developing new applications using such R&D 
AIV as a testing platform. Although the reliability of some applications cannot 
achieve commercial standard, there is clearly a trend of what can be achieved in the 
future. 

There are numerous AIVs selling in the market and it is impossible to list all of 
them in this writing. Therefore, in this section, we select and introduce some typical 
AIVs available from the market for your references. 


AIVs in the Society 
Roomba of iRobot 

The first well-known successful domestic autonomous indoor vehicle might be 
Roomba robot (Fig. 40), a simple unmanned vacuum cleaner, developed by iRobot. 
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Fig. 40 Roomba vacuum 
cleaner (Courtesy of iRobot) 



iRobot was founded in 1990 when Massachusetts Institute of Technology 
roboticists Colin Angle and Helen Greiner teamed up with their professor 
Dr. Rodney Brooks with the vision of making practical robots a reality. 

In 2010, iRobot generated more than $400 million in revenue and employed 
more than 600 of the robot industry’s top professionals, including mechanical, 
electrical, and software engineers and related support staff. iRobot stock trades on 
the NASDAQ stock market under the ticker symbol IRBT. 

iRobot’ s home robots are revolutionizing the way people clean - inside and out. 
More than six million home robots have been sold worldwide. The award-winning 
iRobot® Roomba® floor vacuuming robot is leading the charge. Roomba made 
practical robots a reality for the first time and showed the world that robots are here 
to stay. iRobot ’s acclaimed line of home robots also includes the iRobot Scooba®, 
floor washing robot; the iRobot Verro®, pool cleaning robot; and the iRobot Looj®, 
gutter cleaning robot. 

After the success story of iRobot, other companies also launched their domestic 
autonomous robots. These indoor vehicles are usually simple, small, and cheap. 
Some may argue that it is too small to be called “vehicle,” and their autonomy level 
is usually limited to random motion with some low-cost sensors. However, such 
products have changed the lifestyle of many families and keep evolving. 

RoboCourier 

Swisslog is a company designs, develops and delivers automation solutions for 
forward-thinking hospitals, warehouses and distribution centers. Their 
RoboCourier AMR (Figure 41) from CCS Robotics is designed to deliver speci- 
mens, pharmacy supplies, surgical equipment and other items using an open 
container within hospitals, making it a great tool for hospitals interested in adopting 
a lean workflow solution. It is equipped with laser range finders, sonar sensors and 
bumper sensors to avoid obstacles and follow its planned path. 
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Fig. 41 RoboCourier 
(Courtesy of Swisslog) 



Pioneer 3-DX of Adept 

The Pioneer is the world’s most popular research mobile robot. The Pioneer’s 
versatility, reliability, and durability have made it the reference platform for 
robotics research. Unlike hobby and kit robots, Pioneer is fully programmable 
and will last through years of tough classroom and laboratory use. 

The base Pioneer 3-DX platform (Fig. 10) arrives fully assembled with embed- 
ded controller; motors with 500-tick encoders; 19 cm wheels; tough aluminum 
body; 8 forward-facing ultrasonic (sonar) sensors; 8 optional rear-facing sonar; 1, 2, 
or 3 hot-swappable batteries; and our complete software development kit. Add an 
optional internal computer or your own laptop and the robot is ready to go. 

Care-O-Bot of Fraunhofer IPA 

In addition to developing product-focused, specialized low-cost robots for domestic 
use, Fraunhofer IPA developed the robot assistant called Care-O-Bot for many 
years, pursuing the vision of a multifunctional domestic helper capable of 
performing a wide range of different domestic tasks. The robot serves both to 
develop various software components for the control of mobile service robots and 
also to implement and test complex application scenarios in realistic service 
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Fig. 42 Care-O-Bot 
(Courtesy of Fraunhofer IP A) 



environments. Application scenarios that prove successful in practice can be trans- 
ferred in the short to medium term to specialized, close-to-product prototypes. 

Their latest achievement, Care-O-Bot 3 (Fig. 42), is a research and development 
platform for the testing and presentation of new applications and technologies. It is 
omnidirectional with a robotic arm attached on its back. 

PR2 of Willow Garage 

PR2 (Fig. 43) is an autonomous mobile manipulator developed by Willow 
Garage. It combines the mobility to navigate human environments and the dexterity 
to grasp and manipulate objects in those environments. It has two robotic arms and 
one omnidirectional mobile base. Its personal robot applications enable people to 
make themselves more productive at home and at work. PR2 allows you to 
concentrate on your specific field, while taking advantage of the work of specialists 
in other areas. 

The PR2 hardware platform and 1000+ software libraries enable you, as a 
member of the community, to focus on new capabilities. The common platform 
also lets you easily share your research results in a way that is reproducible. 
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Fig. 43 PR2 (Courtesy of 
Willow Garage) 



AIVs in the Factory 


Major suppliers that provide AIVs for industrial applications are listed below with 
their AIV products shown in Figs. 44-50 

Some of the major suppliers that provide AIVs for industrial applications are 
introduced in this section with examples of their AIVs product shown in 
Figures 46-53. 

AIV can be broadly classified by its payload and sizes. Small AIVs are devel- 
oped for material transfer in factory or office-like environment. They are usually 
designed with flat top surface to hold items, or come with a special designed 
container with doors/covers to transport particular items. Most of such AIVs are 
developed with differential wheels, laser range finders and sonar sensors. It has the 
capabilities of localization and mapping, waypoint following, path planning and 
obstacle avoidance. The footprints of such AIVs are small enough to manoeuvre in 
tight space. It can be implemented with minimum modification of the environment 
and they are safe enough to operate with humans in its way. To operate such AIVs, 
it is often provided with a software package including a graphical user interface. 


Summary and Future Trends 

This chapter reviewed the key research area of autonomous mobile robots and 
surveyed the major suppliers of AIVs. It can be concluded that the research on 
navigation, which is the current key research scheme for AIVs, has progressing 
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Fig. 44 Adept courier. 

Adept has developed a series 
of AIVs to transport parts, 
mostly for semi-conductor 
industries. The adept lynx is 
an AIV that comes with a flat 
top to hold items to be 
transferred. Adept SPH-2200 
is an AIV with a “robotic 
arm” attached on its top so 
that it can fetch and place 
items from other machines. 
Adept SPC-4200 and Adept 
Courier, as shown in Fig. 44, 
on the other hand, has a 
container attached on the top. 
The parts to be transferred 
will be placed inside the 
container (Courtesy of Adept) 




Fig. 45 Kiva AGV. Kiva systems developed an AIV solution (Figure 45) which attached a goods 
shelf onto their AGVs. As a result, it works as a total solution of store automation. It is possible to 
move store shelf from one location to another location automatically. With other software support, 
it can enhance the efficiency of the storeroom management. Another common tasks in factory is to 
move/transport large items from one place to another. To complete such tasks autonomously, high 
payload AIVs are needed. Such AIVs are designed with powerful motors and high capacity 
batteries. As a result, such AIVs are too large to move in office-like environment and it needs 
wider transporting paths (Courtesy of Kiva Systems) 
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Fig. 46 omniMove. KUKA 
also developed their mobile 
robot products like KUKA 
Omnimove. It has Maccunum 
wheels and the payload can 
reach up to 90tons with good 
positioning accuracy. 
However, due to its large size, 
it requires large floor space to 
move or it is only operational 
in outdoor environment 
(Courtesy of Kuka) 



Fig. 47 Vision-guided 
robotic tow tracker GT45. 
Seegrid developed a series of 
AGVs for logistic 
applications. Their products 
are mostly designed to carry 
heavy payload (several tons) 
with forklift attached. Such 
designed makes it very easy 
to carry loads in store rooms 
(Courtesy of Seegrid) 



significantly in the last two decades. However, capability of truly robust autono- 
mous navigation in unstructured and dynamic environments is still lacking. Nev- 
ertheless it is optimistic that autonomous mobile robots technology will achieve the 
next level of autonomy in the next one or two decades. 
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Fig. 48 Load transfer 
vehicle AGV. Egemin 
automation developed a series 
AGVs for logistic warehouse 
or distribution centers. Their 
AGVs are designed with 
forklift, clampings or 
non-driven carts for different 
applications (Courtesy of 
Egemin) 




Fig. 49 CB 08 LGV. Elettric80 developed a series of autonomous forklift vehicles of more than 
10 models. Some of them also use clamps or container to hold parts (Courtesy of Elettric80) 


Technology Trends 

Multimodal sensory, networked, and cooperative are some key features of AIVs to 
be seen in the near future. “The convergence of technologies involving computing, 
communication, and intelligent interfaces with autonomous robotics suggests that 
networks of intelligent, autonomous robots may become the next disruptive 
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Fig. 50 Transport robot 
MT-400. Neobotix also 
developed a series of AIVs 
for material transfer tasks. 
The payload of their products 
varies from 50 to 500 kg. 
They also developed omni- 
directional AIVs which are 
much more manoeuvrable 
than conventional differential 
wheel AIVs. Their omni- 
directional model MPO-500 
uses maccunum wheel and 
MPO-700 uses power caster 
wheel design(Courtesy of 
Neobotix) 



technology” (Moustafa Youssef 2005). The paradigm of Multi-agent Systems 
(MAS) can prove to be highly useful in the research of AIVs considering the 
important role of cooperative robots in many applications. 

A new trend has been observed that AIVs will combine its flexible mobility with 
manipulators that are capable of dexterous manipulation to provide a highly flexible 
and universal equipment to handle far wider applications than the current AIVs. 
Currently state-of-the-art perception and navigation functions will be encapsulated in 
the next level of cognitive behavior controllers for future autonomous mobile robots 
and mobile manipulation systems. Moreover, the above trends will most likely appear 
not only in the ground robots but also in the air (UAVs) and water (AUVs). 

Application Trends 

Due to the advancement of technology in ubiquitous computing, cloud computing, 
and virtual manufacturing, AI V s are promising to lead to tmly lean manufacturing in 
the industry. Some important applications will be proven to generate more impact to 
both the industrial sector and domestic service sector. Key applications will be the 
flexible and automated material handling system, house care, and healthcare. 
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Abstract 

At the present time, industrial robots for assembly tasks only constitute a small 
portion of the annual robot sales. One of the main reasons is that it is difficult for 
conventional industrial robots to adapt to the complicity and flexibility of 
assembly manufacturing processes. Therefore, intelligent industrial robotic sys- 
tems are attracting more and more attention. This chapter discusses robotic 
assembly techniques that perform assembly tasks with part geometric variations, 
part location variations and/or fixture errors. Different assembly tasks were 
implemented to demonstrate different techniques. For complex assembly 
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processes, assembly parameters are very critical for assembly cycle time and 
First Time Through (FTT) rate. Hence the exploration of optimal parameters to 
minimize the cycle time and maximize the FTT rate has to be discussed. The 
Design-of-Experiment (DOE) method is adopted to identify the optimal param- 
eters and experimental results demonstrate the effectiveness of the proposed 
DOE method. Since the proposed techniques were tested using real industrial 
assembly processes, they are ready for industrial implementation. 


Introduction 


Assembly tasks using industrial robots have increased in both number and com- 
plexity over the years because of the increasing requirements of product quality and 
quantity. However, assembly robots are still a small portion of total robot sales 
each year. One of the main reasons is that it is difficult for conventional industrial 
robots to adjust to any sort of change. Therefore, more intelligent industrial 
robotic systems are rapidly expanding the realms of possibility in assembly appli- 
cations because they can perform assembly tasks with high autonomy and adapt- 
ability to the environments. In this chapter, several robotic assembly techniques are 
discussed for different industrial applications, such as assembly without high 
precision fixtures, assembly on a moving production line, and high precision 
assembly. 

Robotic force control strategy is developed for high precision assembly while 
controlling the contact force/torque. Since the force/torque control is sensitive to 
contact with the environment, the tool position/orientation can be accurately con- 
trolled. This makes the force control assembly a better solution for robotic assem- 
bly. There are many successful stories using force control assembly in industrial 
applications. 

However, the force control method requires extra devices, such as the force/ 
torque sensors and control software package, etc., which makes the robot control 
system more complicated and more expensive. Therefore, a method to perform 
assembly process without force control is also discussed in this chapter. Because the 
force control hardware and software are eliminated, this implementation of such 
technique is much simpler. 

The motion range of force control has certain limitations because it is difficult to 
keep the robotic system stable and sensitive within a large range of force-controlled 
motion. Typically vision system is installed to compensate for large part location 
error. Force control is then applied for fine assembly position searching. In this 
case, the requirements for both force control and vision system are released. 

Because many production lines are moving, especially in automotive 
manufacturing, assembly on a moving production line is worth discussing. The 
natural frequency of industrial robots is typically low, while the moving production 
lines contain both high-frequency and low-frequency terms. Therefore, visual 
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servoing and force control are integrated to control the motion of a robot to track the 
motion of a moving production line while assembly tasks are performed. 

Robot force control introduced complexity and uncertainty to the robot pro- 
gramming, control parameter setting up, and manufacturing process quality. The 
force-controlled robot behaves differently for different contact force conditions 
resulting from the manufacturing variations of the assembled parts, fixture, and 
environment disturbances on the manufacturing floor. One of the most recognizable 
behavior differences from position-controlled robot is that the production cycle 
time is no longer a predetermined value for force-controlled robot. Assembly 
parameters have large impact on the cycle time and FTT rate. Therefore, they 
have to be optimized in order to achieve optimal assembly performance. 

As aforementioned, these topics will be discussed in the following sections. 
Experimental samples are used to demonstrate the techniques. 

Robotic Assembly Techniques 
Force Control Assembly 

Introduction 

Traditionally, industrial robots are used for painting, material handling, welding, 
and other noncontact manufacturing processes. Recent advancement in robot con- 
trol systems permits an entire new class of robot behaviors and applications. The 
new robot behaviors are possible due to the incorporation of force feedback into the 
robot control system. Industrial robot applications can now be expanded into 
processes with contact forces such as assembly and machining because the force 
of contact with the environment in these applications can now be controlled. 

Adding force feedback to the robot force control system allows dynamic alter- 
ations to the robot trajectory to control the contact force. Sicilinano et al. (1999) 
give basic robot force control concept, theory, and implantation. The force control 
technology enables robotic automation applications that mate parts together such as 
gear meshing, spline insertion, clutch hub assembly, and surface grinding following 
complex curved geometry with simple programming constructs. Zhang et al. (2004) 
reveal a typical robot force control assembly system for articulated robots. Around 
the same time, attempt has been done to develop force-controlled parallel robot 
structure to deal with the assembly requirement for higher sensitivity. Morris 
et al. (2001) describe an example of such system. Based on the force control feature 
developed, many assembly applications have been developed for automotive power 
train assemblies such as torque converter assembly (Morris et al. 2001) and engine 
piston installation (Chen et al. 2007). While the principles are the same, there are 
different techniques and characteristics for each application. This section will 
briefly describe the robotic force control algorithm and implementation and the 
assembly application examples, and finally a summary will be provided to address 
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the result, major lessons learned, and further work or the trend of the robotic 
assembly technology and application. 

Force Control Theory and Development 

Force control is an enabler for industrial robotics to be used in sophisticated 
assembly applications. While industrial robots are traditionally used for painting, 
material handling, welding, and other noncontact manufacturing processes, force- 
controlled robots can handle contact operations such as assembly and machining. 
Normally, robotic force control employs a 6D transducer or called force sensor, 
being mounted on the robot faceplate. In some other cases, torque sensors are 
placed on each joint of the parallel or articulated robot arms to measure the joint 
torque values. 

Most existing industrial force control is built on previous position control 
systems. An analog or digital channel is added on the robot controller to take the 
force sensor feedback. The feedback then integrated into the robot control loop to 
realize the force control. Figure 1 is an example hardware setup for industrial robot 
force control. 

Figure 2 shows a typical force control loop diagram. Reference force and 
reference velocity are input into the control system as robot force control behavior 
desired by the user. Properly designed force/velocity input can generate 
corresponding searching or other robot reaction patterns. 

Because the robotic force control assembly requires a robot to perform assem- 
blies in which the assembly tolerance is small, a search pattern has to be 
implemented in order to compensate for the part positioning errors. A spiral search 
pattern is used in the X and Y searching directions. 



Robot Arm 


IRC5 Controller 


Fig. 1 An example of hardware setup for industrial robot force control 
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Fig. 3 A spiral search pattern to find the exact position for force control assembly. The units 
are fim 


x = Rsinasin/3 
y = R sin a cos fi 

The search pattern is shown in Fig. 3. 

When using force control in assembling, the searching pattern or searching 
strategy needs to be defined first. Most time the strategy is coming out from the 
experience of manual assembling work and the part design analysis. Different force 
control developers may use different terminology, but it involves search pattern 
such as linear search, circular search and spiral search illustrated in Fig. 3, insertion 
force, and search orientation arrangement. In most automotive part assembly, the 
orientation is fixed or with a little flexibility, which is controlled by dumpling factor 
D as illustrated in Fig. 2. In some of the search patterns, there are other related 
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parameters such as search radius to be defined and optimized to get out the optimal 
assembly system result for specific applications. While parameter optimization 
methodology and practice will be introduced in the later sections, the search 
strategy will be further described and explained in the next section (Experiment 
and Examples). 

Experiment and Examples 

Torque Converter Assembly 

Automotive torque converter assembly is a typical robotic assembly example. 
Torque converter can be weighed up to 75 kg. Between torque converter and 
transmission house, normally there are three distinct subassembly stages required 
to be matched during the assembly process. Figure 4 illustrates the gear matching 
for torque converter assembly. Assembling torque converter manually causes 
working injury and other ergonomic problems. Automating torque converter 
assembly is demanded by the automotive industry. Robotic force control provides 
a better solution than hard automation equipment with more flexibility and cost 




//// 

//// 



Fig. 4 Gear matching between torque converter and transmission housing 
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advantages. From Fig. 4, it can be seen that the torque converter assembly is an 
insertion process with several layers of gear matching. A typical torque converter 
assembly has three layers of matching: first layer is the first (largest) gear of the 
torque converter to the third gear (from top down) of the transmission shaft; the 
second matching is the second gear of torque converter to the second gear of 
the transmission shaft; and the third matching is the grove at the tip of the torque 
converter shaft to a bar on the bottom of the transmission shaft. For the first layer 
matching, spiral search pattern is used; for the second layer, circular or spiral search 
pattern is used; and for the third layer, only rotation pattern is used because the 
assembling and assembled parts should be already lined up when reaching the third 
layer. In terms of rotation angle ranges, they are different for the three layers. First 
two layers use relative small angles such as 45° since the matching gears have fine 
teeth (as can be seen in Fig. 4). The third layer uses larger searching rotation angle, 
at least more than 90°, since it has only two-gear teeth. Assembling experience tells 
that adding hopping motion along the axial direction will help improving the 
assembly cycle time because the constant axial insertion force will press the two 
assembling surfaces together, which prevents gear matching during the assembly. 

To summarize the above observation and analysis, for the robotic force control 
torque converter assembly process, the following force control assembly strategy is 
used: 

• Use straight insertion and divide the assembly process into three or more layers. 

• Use spiral search pattern for the first layer, circular or spiral for the second layer, 
and no radial direction search for the third layer. 

• In the first two layers, relative small rotational search angles, about 45° or less, 
are used; but in the third layer, a large rotational search angle, more than 90°, 
is used. 

• Hopping motion (in terms of insertion force vibration) is added along the axial 
direction to improve the cycle time through overcoming gear surface “sticking” 
problem. 

Figure 5 shows a robotic torque converter assembly along the horizontal direc- 
tion. In this case, the gravity effect will not help in finalizing the assembly process 
(gravity will not hold the assembled part in place along horizontal assembling as in 
vertical assembling). Therefore, a final position locking mechanism is needed to 
secure the torque converter in place when the assembly is finished and robot 
releases the part. Other strategies for the vertical assembly process are applicable 
to the horizontal assembly process. 

Engine Piston Installation 

Another example of robotic force control assembly is engine piston installation. 
Because the part tolerance and chamfer are getting smaller on the piston head for 
higher energy efficiency, manually stuffing the piston is a challenge especially for 
diesel engines where the piston is heavier, making manual operation even more 
difficult. Robotic force control piston installation can combine the piston stuffing 
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Fig. 5 Robotic torque 
converter assembly 
horizontally 



with cam bolt placement and tightening in one robotic automated station, which 
increases the productivity and reduces the manufacturing floor space requirement. 
Figure 6 shows an example of engine piston stuffing in lab experiment. Multiple 
robots can be used in the engine piston assembly. No matter how many robots are 
used, only one robot needs to have the force control feature to perform the piston 
stuffing operation. 

The gas engine piston stuffing/installation is one of the most challenging assem- 
bly processes. Because of the tight tolerance (up to 0.05 mm between the piston 
skirt and cylinder bore), the complexity of the ring quizzing, the necessary hairpin 
guidance, as well as the precision stuffing pushing, manual work is the most 
common practice, and hard automation is sometimes used. 

In manual engine piston installation, two persons normally cooperate with each 
other in the assembly process. One person inserts guiding pins through a cylinder 
bore in the engine block to locate and align the holes on the piston connecting rod, 
and another person holds the piston with a piston ring compressing device and 
approaches the cylinder bore to receive the guiding pins with the piston connecting 
rod. A pneumatically driven pusher is often used for stuffing the piston into the 
cylinder bore. The connecting rod cap and screws are placed by hand, and then the 
cap is fastened manually using hand tools. Manual piston installation work is labor 
intensive, tedious, and prone to worker injury due to the force required and 
repetitive nature of the tasks. The assembly quality is entirely dependent upon the 
skill and attention of the workers. 

Automation of engine piston insertion has been performed by using specially 
built machines, often called “hard automation.” These dedicated machines are 
huge, costly, and inflexible. Switching between engine models or types to be 
assembled is difficult, time consuming, and costly. Piston installation with a 
dedicated automation machine requires a level of precision and tolerance that limits 
its application. Furthermore, in “hard automation” stuffing techniques described 
above, there is no active searching action in finding the cylinder bore. Though a 
passive floating table is sometimes used to align the piston skirt with the cylinder 
bore, the success of stuffing the piston essentially depends on the accuracy of the 
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Fig. 6 Engine piston 
stuffing/installation lab 
experiment 



machine, the actual gap between the cylinder bore and the piston skirt, and the 
leading chamfers on both the piston skirt and the cylinder bore. With the increasing 
demand of reducing the gap between the cylinder bore and the piston skirt and 
minimizing or eliminating the leading chamfers for the purpose of emission control 
and engine efficiency improvement, the challenge and difficulty are increasing for 
both manual and automated piston stuffing processes. 

To deal with this complicated engine piston installation process, an automated 
piston installation technology using industrial robots is developed. A three-robot 
and a two-robot workcell configurations are employed to balance the requirements 
of cycle time, floor occupation, and system complexity. The piston installation 
process has been simulated; the special robot gripper for the piston stuffing is 
designed and manufactured; robotic force control searching strategy is defined; 
and parameter setting and other programming-specific issues are investigated. After 
the robotic piston stuffing shown in Fig. 6 has been successfully performed in the 
laboratory, a customer demo unit is built as shown in Fig. 7. A three-robot piston 
installation system is described in this example. Other cell configuration with fewer 
robots and more specific hardware can be also used to reduce the system complexity 
and cost. 

The three-robot configuration is a total automation solution. As illustrated in 
Fig. 8, robot I with its fixture picks up the engine block from a pallet and moves the 
engine block to a position close to the second robot and orients the engine block so 
that robots II and III can perform the piston installation. The fixture with engine 
block is shown in Fig. 9. An external axis (motor) controlled by the robot controller 
rotates the crankshaft of the engine block to the proper orientation for the piston to 
be stuffed in an associated one of the three cylinder bores. As shown in Fig. 10, the 
piston installation operation consists of stuffing the piston with ring skirt, 
connecting rod assembly into one of the bores, and attaching the associated 
connecting rod cap to that assembly. Robot II which has force control moves its 
uniquely designed gripper. The gripper has jaws to grab the piston assembly. Before 
the jaws are closed to grab the piston assembly, the gripper sucks up the piston 
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Fig. 7 Robotic engine piston 
installation customer demo 




Fig. 8 Multiple robot engine piston installation system 

assembly using a suction cup on a pusher against the upper surface of the piston 
head. 

At the same time as the operations described above for robot II, robot III using 
the guiding pins and cap placing gripper picks up the connecting rod cap, moves it 
under the engine block, and protrudes the guiding pins through the crankshaft and 
the cylinder bore of the engine block to receive the piston connecting rod. Robot II 
moves the gripper with the piston assembly inserted therein above the cylinder 
bore to be stuffed and then moves the piston assembly into the bore to engage the 
connecting rod, leading the tips of the guiding pins into the screw holes on the upper 
half bearing house of the connecting rod. A stabilizing finger is employed to keep 
the connecting rod in place during transportation of the piston assembly to the 
cylinder bore from the pallet. 

Robot II and robot III move cooperatively until the lower surface of the 
piston skirt is close to the upper surface of the cylinder bore into which the piston 
subassembly is to be inserted. Then robot II enables its active searching function to 
move the subassembly so that the piston skirt finds that cylinder bore and the piston 
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Fig. 9 Gripper, fixture and 
other tool design for piston 
installation 
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Fig. 10 The unique robotic 
piston stuffing gripper 
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assembly are inserted into that bore until the lower surface of the gripper jaws 
touches the upper surface of the cylinder bore. Next, the piston assembly is pushed 
further into the cylinder bore by a pusher on the gripper. The third robot with its 
connecting rod cap placing and rundown device places the connecting rod cap and 
screws on the inserted piston assembly and fastens the cap onto the connecting rod. 
The same process is repeated for subsequent cylinder bores in the first row of three 
cylinder bores for this V-6 engine block. 

After the stuffing is finished for the first row of cylinder bores, robot I reorients 
the engine block so that the upper surface of the other row of three cylinders bores 
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Fig. 11 The process flow diagram for the three-robot workcell configuration 


for the V-6 engine can be stuffed. The piston stuffing procedure described above for 
the first row is repeated to stuff a piston assembly into each of the cylinder bores in 
the second row. Figure 1 1 is the process flow diagram for the piston stuffing process 
with three robots. 

In terms of robotic force control searching strategy, it is simpler in piston 
stuffing than in torque converter assembly. There is only one layer here, and spiral 
search pattern is used. To make sure the searching does not go too far from the 
programmed starting position to eventually find the piston bore (since the chamfer 
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is small on the piston bore, there is no physical limitation for the piston to stay near 
to the bore during searching), “spring force” is introduced to “pull” the piston and 
its bore together during the force-controlled piston stuffing. 


Summary 

In summary, robotic force control has been developed and implemented in assembly 
applications. While started from automotive industry, it has been implemented in 
general industry. Robotic force control is an enabler for industrial robots to be used 
in sophisticated assembly processes. Its application areas are expanding rapidly. 
However, since a new hardware device, force/torque sensor, and force control software 
are introduced, the robotic assembly system with force control is often more compli- 
cated and requires the system builder and user to have force control-related knowledge. 
Furthermore, more robotic assembly process parameters are introduced by force 
control assembly processes. Assembly parameters such as search pattern, search radius, 
and insertion force are critical to the success of the robotic assembly and need to be 
optimized which will be discussed in section “Assembly Parameter Optimization.” 


Assembly Without Force Control 


Introduction 

Several methods have been developed to perform precision assemblies to reduce 
the design efforts and cost of fixtures that are typically required by high accuracy 
robotic assembly applications. The passive compliance device or Remote Center 
Compliance (RCC) is an example that allows an assembly robot to compensate for 
positioning errors due to machine inaccuracy, vibration, or tolerance, thereby 
lowering contact forces and avoiding part and tool damage. There are many 
research works (Gravel and Newman 2001; Turges and Laowattana 1994; Xu and 
Paul 1990) done using the passive compliance device for assembly or other tasks. 
However, specific passive compliance devices have to be designed and 
manufactured for parts with different geometries, which makes robotic assembly 
with passive compliance devices more difficult. Also, these devices are expensive 
to design and manufacture. To overcome the shortcomings of the passive compli- 
ance devices, robotic assembly using force control (Gottschlich and Kak 1989; 
Newman et al. 2001) was then developed as described in section “Force Control 
Assembly.” A force/torque sensor is used to measure the contact force/torque, and 
the force/torque signals are used to control the motion of the assembly tool. Since 
the force/torque control is sensitive to contact with the environment, the tool 
position/orientation can be accurately controlled. This makes the force-controlled 
assembly a good solution for robotic assembly. There are many successful stories 
about robotic force-controlled assembly besides what is described in section “Force 
Control Assembly,” such as the forward clutch and valve body assemblies, in 
industrial implementations (Zhang et al. 2004; Gravel and Newman 2001; Robotics 
Application Manual - Force Control for Assembly; Chen et al. 2007). The position 
and orientation errors due to the part fixture or location errors can be easily 
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compensated using the force control strategy. Therefore, the force control method 
can greatly reduce the requirements of the part fixtures. Moreover, it can be used for 
high precision or high accuracy assemblies. However, the force control method 
requires extra devices, such as the force/torque sensors and control software 
package, etc., which can make the robot control system more complicated and 
more expensive. Therefore, this section discusses a simpler compliant robot control 
methodology to perform high precision industrial assembly that does not require 
extra equipment or cost yet can be successfully utilized in many cases. 

Position control of industrial robots is very accurate when done with high 
controller gains. However, contact forces increase rapidly when the robot tooling 
makes a contact with the environment, making industrial robots difficult to use 
when limited contact force is needed in high precision assemblies. By reducing the 
robot control loop gains, the servos can make the robot compliant to the environ- 
ment, creating a so-called “soft servo” capability. This “soft servo” method can be 
implemented to perform some of the high precision assembly tasks where part 
location errors typically require the use of force control or RCC methods. The valve 
body assembly was performed using both soft servo and force control to demon- 
strate the performance of both methods. 

High Precision Assembly Process 

A high precision robotic assembly requires a robot to perform assemblies in which 
the assembly tolerance is better than or close to the robots repeatability. Figure 12 
shows a typical method used for high precision assemblies. There are three steps in 
the assembly strategy. Firstly, the robot will search locally for the right position of 
the hole on the valve body to compensate for small errors due to the vision system 
accuracy and robot repeatability. Secondly, constant force along the Z direction 
pushes the accumulator down towards the hole, while other directions are also kept 
compliant in order to deal with the orientation errors. Finally, the robot will settle 
the accumulator down into the bottom of the valve body, and a retrieval force is 
then applied to retrieve the tool. 



Fig. 12 The assembly 
process for high precision 
assemblies 
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A searching method is used to find the exact location of the workpiece. After the 
part is engaged with the workpiece, an insertion force is applied to insert the part 
into the workpiece. During insertion, the tool orientation is changed according to 
the orientation of the workpiece to avoid jam. Therefore, any developed assembly 
method needs to be able to locate the workpiece accurately and adjust the tool 
orientation accordingly. 


Soft Servo Control Method 

For a rigid manipulator of n links, the dynamic equation of motion in the joint 
space is 


M(q)q + C(q, q)q + B(q, q) + G(q) = r (2) 

where 

r G R n vector of applied joint torques 
q G R n vector of joint positions 

M G R' 1 symmetric positive definite (SPD) manipulator inertia matrix 
C G R n vector of Coriolis and centrifugal torques 

B G R n vector of torques due to friction acting on the manipulator joints 
G G R n vector of gravitational torques 

When there is an external force applied to the robot end effector, the dynamic 
Eq. 2 becomes 


M(q)q + C(q, q)q + B(q, q) + G(q) +T e = t (3) 


Where 

r e G R n is the vector of forces/torques exerted on the environment by the 
manipulator end effector expressed in the joint space. 

For model-based control, the model parameters are estimated. Suppose the 

A A A 

estimated corresponding parameters are M (q), C(q,q), B(q,q) and G(q), respec- 
tively, a typical feed forward PD (proportional plus derivative) controller can be 
expressed as 

t = M{q)q d + C(q, q)q d + B(q, q) + G{q) + K p e q + K v e q (4) 


Where 

e p ,e p eR n are vectors of position and velocity errors in the joint space 
respectively. 

Comparing Eqs. 3 and 4, we have 

T e — At (5) 


where 

At G R n is the vector of forces/torques errors generated by the model estimation 
error. 
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Fig. 13 The controller of a robotics system with soft servo 


Assuming At is small, the position and velocity errors will balance the external 
forces/torques exerted on the robot end effector. Therefore, by decreasing the 
position control loop gains, the robot position errors could be increased to make 
the robot compliant to the environment. Because the control gains of each joint can 
be tuned individually, a joint soft servo method is formulated. One of the advan- 
tages of joint soft servo is that the orientation of the tool can be adjusted due to the 
compliance of each joint. Therefore, the tool position and orientation can be 
continuously changed based on the contact with the environment. The overall 
control system flow with soft servo is shown in Fig. 13. 

There is a switch to select the regular high gain position control and soft servo 
control. In the normal position control loop, the regular high gain position control is 
used. For soft servo control, the search pattern and oscillation patterns are 
implemented in order to perform high precision assemblies. 

For the X and Y directions, the spiral search method as described in Eq. 2 is used 
to search the hole position. For the Z direction, which is the insertion direction, a 
certain contact force has to be maintained in the insertion assembly process. To 
achieve this, a constant error along the Z axis is maintained: 

F = K pz Az (6) 

where K pz is the proportional gain along the Z axis and Az is the set error along 
the Z axis. These two values should be tuned such that the contact force 
can be maintained in a certain range. Because the inserted part may be jammed 
during assembly, an oscillation is added to the Z direction to keep the part from 
j amming : 


Az s =d z sinw s t (7) 

where Az s is the positioning oscillation along the Z axis and d z and w s are the 
oscillation amplitude and frequency, respectively. 
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Experimental Results 

To demonstrate high precision assembly using soft servo, several experiments were 
performed. The assembly process using force control (Details in section “Force 
Control Assembly”) was also implemented to compare the assembly performance 
between soft servo and force control. 

The valve body assembly was implemented using an ABB IRB140 robot, which 
is mounted horizontally on a stand. The software for soft servo and force control 
was developed on ABB IRC5 controller. An ATI Delta force/torque sensor for force 
control was mounted on the robot end effector, and a suction tool used to pick up the 
accumulator was mounted under the force/torque sensor. The experimental system 
is shown in Fig. 14. 

The valve body was placed in a vise, and, to demonstrate a generic assembly 
process, the assembly was performed along the horizontal direction. 

For the search process using the spiral pattern, the search radius R was set to 
1.5 mm and the number of turns to 4 (/? E [0, 8zr]). For soft servo, the oscillation 
amplitude was set to 1 mm and the frequency to 3 Hz. The settle down point was set 
to be 20 mm since there is no feedback available. For force control, the search force 
was set to be 20 N. For the insertion process, the spring force constant K pz was set to 
50 N/mm and the settle down force set to 50 N. 

A reference configuration was taught by inserting the accumulator into the valve 
body manually while the force control is active. The position and orientation are 
recorded as the reference position and orientation in the robot base frame. There- 
fore, there are no position and orientation errors if the reference position and 
orientation are used. However, because there are always fixture errors in a produc- 
tion line, errors were intentionally added to the X and Y axes as disturbance during 
the experiments. To compare the performance between the soft servo and force 
control without bias, the errors are added along both positive and negative 


Fig. 14 A high precision 
robotic assembly system. The 
suction tool picks up the 
accumulator and inserts it into 
the hole in the valve body. 
The robot controller is not 
shown in the figure 
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Table 1 The position offset and insertion time for both soft servo (SS) and force control (FC) 


Position offset X 

Position offset Y 

Insertion time (SS) 

Insertion time (FC) 

1 

0 

3.72 

3.17 

1 

0 

3.75 

3.17 

-1 

0 

3.51 

3.15 

-1 

0 

3.53 

3.17 

0 

1 

3.80 

3.21 

0 

1 

3.82 

3.11 

0 

-1 

3.70 

3.21 

0 

-1 

3.64 

3.20 

1 

1 

4.1 

3.20 

1 

-1 

4.72 

4.20 

-1 

-1 

5.17 

4.67 

-1 

1 

6.30 

4.53 


Fig. 15 The valve body 
assembly process. The 
accumulator is inserted in to 
the valve body 



directions along the X and Y axes as shown in Table 1. For the assembly with 
different errors, the insertion time for both soft servo and force control is recorded 
and shown in Table 1. 

The data in Table 1 shows that the insertion time using force control is quite 
close to that using soft servo. Therefore, soft servo can be used to perform high 
precision assembly with cycle time comparable to that using force control. For 
larger errors (both the X and Y axes have offsets), the insertion time is longer. This 
is because the searching time with big errors is longer. For soft servo, the cycle time 
is even longer. Thus, the experimental results illustrate that soft servo control is not 
as sensitive as the force control. 

Figure 15 shows the valve body assembly process. The accumulator is inserted 
into the valve body. Figure 16 shows there are offsets along both the X and Y 
directions in the tool frame before the searching method starts. 

The contact force signals for both the soft servo and force control were recorded 
and are shown in Figs. 17 and 18, respectively. 
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Fig. 16 The valve body 
assembly process. There are 
offsets along both the X and 
Y axes 



Force Control 



Time (s) 


Fig. 17 The recorded force signal for force-controlled assembly 

The recorded force signals in Figs. 17 and 18 illustrate that the contact force for 
both methods are reasonably close. For force control, the reference force is set to 
20 N, and the real force signals are quite close the reference force. Because there is 
no direct force control using soft servo, the contact force is indirectly controlled by 
the position offset. The recorded maximum force is about 40 N, which is reasonable 
for the valve body insertion assembly. Thus, soft servo techniques can be used for 
high precision assembly with reasonable performance compared with the force 
control method. However, the contact force signals are much more smooth using 
force control than that using soft servo. Also the retract force using force control is 
almost 0 while there is large retract force when using soft servo. This is because soft 
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Fig. 18 The recorded force signal for soft servo-based assembly 


servo is still based on a position control loop and lower control gains still generate 
big contact force. The searching time using soft servo is also much longer than that 
using force control, and searching using soft servo is not as smooth as that using 
force control. 

Although soft servo can be used for high precision assemblies, there are some 
limitations. The parameters, such as controller gain and maintained contact force 
during assembly, have to be carefully tuned; otherwise, the parts or the robotic 
system could be damaged if there was a jam during the insertion process. The main 
reason is that soft servo is not sensitive to the contact force since it is based on the 
position errors and not force measurement. Also, the assembly process using soft 
servo is not as smooth as that using force control, and there can be large contact 
forces generated during assembly. The soft servo method is also not as stable as the 
force control method. While it can be used to perform assemblies with small errors, 
if there are large offsets, it will likely fail and generate large contact force. Because 
of these limitations, careful consideration of the intended system is needed before 
implementing soft servo in an industrial application even though the laboratory 
implementation looks promising. 

Although there are some significant shortcomings for soft servo assembly 
methods with compliance in all directions, its performance can be quite close to 
that using force control, especially for small initial positioning errors and when 
using small robots. For large disturbances, a vision system could be used to 
compensate the workpiece location errors, but the added cost and complexity 
might be similar to just using force control. 
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Summary 

In this section, an assembly method using soft servo is introduced to perform 
assembly tasks where part location errors typically require the use of force control 
or RCC methods, and the assembly process requires compliance in all directions. 
The proportional and derivative controller gains are reduced to make the robot 
compliant when performing the assembly tasks. A search algorithm is developed to 
find the location of a feature, such as a hole in a part. The tool is kept in contact with 
the part while search motion is performed. An oscillation motion is added along the 
contact direction whenever the contact friction is severe to prevent binding. Once 
the tool is engaged with the part, a measured relative mating position (e.g., insertion 
distance) is used to determine the completion of the assembly. A valve body 
assembly was used to validate the developed method. Experiments were consis- 
tently successful when the relative part location errors were within 1 mm, showing 
that the developed soft servo strategy can perform assembly tasks with small part 
location errors. Experiments with force control were also implemented to compare 
the performance between soft servo and force control. It was found that force 
control methods are much more sensitive to environmental contact, and the contact 
forces can be controlled directly. Conversely, the contact force cannot be directly 
controlled when soft servo is used because it is passive to the contact. Therefore, 
soft servo requires careful programming and tuning in order to reduce the contact 
forces; otherwise, damage could be caused to the products as well as to the robotic 
system. Also, for bigger part location errors, soft servo either fails to assemble the 
parts or generates bigger contact forces than force-controlled assembly. Thus, 
applications of soft servo are more limited, while force control can be successfully 
used in most all applications. Further investigation is needed to determine the 
practical industrial use of soft servo for particular types of precision assembly. 


Vision and Force Control-Integrated Assembly 
Introduction 

Machine vision systems have been widely used to enable a robot to locate the part 
or subassembly on which it is working. In most applications, machine vision 
systems provide real-time data and live feedback to guide robots as they go through 
programmed sequences of operations. However, a vision system alone cannot 
perform high accuracy assembly because the accuracy of the vision system is 
inadequate and the tool may be jammed during assembly. Force/torque control 
based on the measured contact force/torque can be added to correct the object 
position/orientation in response to the force acting from the environment (Newman 
et al. 2001). In such a system, an admittance control method (Glosser and Newman 
1994) is used to guide the robot tool towards a desired destination. Such a combi- 
nation of vision and force control will enable industrial robots to perform high 
accuracy assembly in a semi- structured environment. 

Some researchers (Morrow et al. 1995) have applied image-based visual 
servoing techniques to guide the robot to the desired position. However, robots 
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with tooling will block the camera during assembly in some cases. This prevents the 
visual servoing techniques from obtaining some real-time images, requiring 
position-based approaches to be implemented in these cases. Position-based 
methods (Jorg et al. 2000) have been studied for assembly too. The exact position 
of the features on the assembly parts must be accurately identified. It is well known 
that accurate calibration of vision system is very challenging, and disturbances will 
cause the accurately calibrated systems to have positioning errors. This makes these 
methods difficult to use in production environments for high accuracy assembly. 
Yorck et al. (1999) presented a neuro-fuzzy method using visual and force control. 
Two camera systems are used for orientation identification. The system is too 
complicated and difficult to use in industry. 

In this section, an assembly strategy based on combining vision guidance with force 
control is presented and demonstrated using a high accuracy assembly process. The 
vision system does not need to be carefully calibrated because it is only used for rough 
positioning. The vision system is used to locate the fixture first. The position of the 
fixture is sent to the robot controller to guide the robot close to the desired position. 
Because of calibration errors of the vision system and robotic positioning errors, it is 
difficult for the overall positioning errors to be less than the small tolerance necessary 
for high accuracy assembly. Therefore, a local searching method based on force/torque 
control is developed to deal with the errors. Furthermore, the force/torque control 
algorithm is also used to insert the accumulator into the valve body to deal with the 
position and orientation errors during assembly. The methodology developed for high 
accuracy assembly in a semi-structured environment based on vision, force, and 
position sensor fusion has been implemented successfully for the valve body assembly. 
The simple vision system calibration also makes the system easy to use in production. 


Automated Assembly Methodology 

For high accuracy assembly with fixture errors, the vision system and force/torque 
control have to be integrated to perform the assembly task. A vision and force/ 
torque control architecture with multilevel accuracy is developed for high accuracy 
assembly applications with fixture errors. Figure 19 shows the control architecture. 

The vision system captures the image of the fixture and computes the X and Y 
position errors and the orientation errors about Z axis. The errors are sent to the 


Fig. 19 Control architecture 
for high accuracy assembly 
applications with fixture 
errors 
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robot controller to enable the robot to approach the desired position and orientation. 
Because the vision system errors and robotic positioning error are greater than the 
assembly part clearance, it is difficult for the robot to achieve the desired position to 
perform high accuracy assembly. A search algorithm (details in section “Force 
Control Assembly”) based on the force/torque control is then applied to control the 
robot to insert the accumulator into the valve body. 

Coarse Control Methodology Based on Vision 

The coarse control methodology is based on the ABB True View™ vision system. 
Figure 20 shows the TrueView™ interface. 

The TrueView™ vision system captures the images of a fixture from the camera 
via a frame grabber. It then computes the position and orientation of the fixture by 
identifying the predefined features in the real image. Figure 21 shows the 
predefined features on a reference image. These features are defined by the user. 
One feature is enough to define the position and orientation of the part; however, the 
identification accuracy can be increased by defining more features. A coordinate 
system in the image frame is also defined as shown in Fig. 21. 

To control the motion of the robot, the coordinates in the image frame have to be 
transferred into the robot frame. This is done by the ABB TrueView™ automated 
calibration feature. This tool uses a pattern, either in dot format or square format, to 
calibrate the system. Figure 22 shows the dot calibration patterns. 



a jj 

rjfTfKi 

j, UVU ft ■> iK* 
Conrad 

3 'dji* 

/Omtusi 

- 4** 

-I R*ji$!cp 
* 

HdftZ 

Wile OnJ 



r r* 


' 


r pt* 

& . „ 

x r - 

Y pT" 


V- I Lmt» t I | * M 


r 


P C jt-J 

P 


! 

Mu* 

Y. 


pmm 

ft Y 



pTS 


[401 DO 

\xzm 

1*5! DD 




Yw Wi +.<1.3-. 6 


flrt 





Fig. 20 The TrueView™ vision system interface 
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Fig. 21 The defined features and the coordinate system 


Fig. 22 The dot calibration 
patterns used to calibrate the 
relationship between the 
image frame and the robot 
world frame 
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Fig. 23 The calibration interface of the ABB True View™ system 

The image of the calibration pattern is captured and analyzed automatically. 
Figure 23 shows the calibration interface of the ABB TrueView™ system. In the 
image frame, a 7 x 7 dot pattern is chosen. In the robot world frame, the positions of 
the three dots of the origin P 0 , the X axis P x , and the Y axis P Y (as shown in Figs. 22 
and 23) are measured by teaching the robot. 

The coordinates of the origin, P x and P Y of the defined coordinate system, are 
input into the interface. The calibration is then performed automatically, and the 
calibration matrix is identified. 

During assembly, the image of the fixture is captured. It is then analyzed and the 
predefined features are identified in the image. From the position and orientation of 
the predefined features, the XY position and orientation about Z axis of the valve 
body are calculated. Using the calibration matrix, the position and orientation of the 
valve body in the robot world frame are computed and used to control the robot to 
approach the desired assembly configuration. 

Fine Control Strategy Based on Force/Torque Control 

Because of the camera resolution, distortion, and calibration errors, a robot guided 
by the vision system will have some small positioning errors. For high accuracy 
assembly, these small errors will cause the assembly to fail because traditional 
industrial robots are very stiff. Therefore, a fine control strategy based on force/ 
torque control is developed for the assembly as described in section “High Precision 
Assembly Process.” 
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The robot holding the accumulator starts to search for the right position of the 
hole from the center of the spiral pattern while the force along the Z axis is kept at a 
constant value. When the accumulator along the Z axis changes by a set threshold, it 
is inserted into the hole on the valve body, and the search is terminated. If the robot 
reaches the boundary without finding the desired position, it will search back to the 
center of the spiral pattern. This search process is repeated several times until the 
accumulator is inserted into the hole. The initial position error of the tool should not 
be too big. If the position error is too big, the search process will not find the desired 
position, and the search will be terminated automatically after a few trials. The 
number of trials and turns in the spiral pattern can be defined by the users. 

For the insertion process, the approximate insertion distance is known. There- 
fore, based on the insertion distance, the reference force along Z axis can be 
calculated and used to control the motion of the robot. One of the advantages of 
this control strategy is that there is no large initial impact contact force and the cycle 
is minimal. The tool orientation is adjusted based on the measured torque such that 
the accumulator can be inserted without failure. 

Once the tool approaches the settle down point, a constant force is set to make 
sure the accumulator is properly inserted. After the accumulator reaches the bottom, 
the contact force along the Z axis will increase. When the contact force reaches the 
set value, the insertion is complete, and the tool is retrieved. The assembly is then 
complete. 

Implementation of High Accuracy Assembly with Fixture Errors 

The high accuracy assembly is implemented using an ABB IRB140 robot. An ATI 
Delta force/torque sensor is mounted on the robot end effector. A suction tool is 
used to pick up the piston and is installed under the force/torque sensor. A Pulnix 
TM-200 camera is mounted on a frame that is stationary to the robot base frame. 
The experimental system is shown in Fig. 24. 

The valve body is placed on a table. The distance from the camera to the table 
surface is about 550 mm. The field of view is about 180 x 135 mm. The resolution 
of the camera is 640 x 480. Therefore, the accuracy of the camera system is about 


Fig. 24 The high accuracy 
assembly system. The suction 
tool will pick up the piston 
and insert it into the hole on 
the piston body 
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0.33 mm. This accuracy is inadequate for high accuracy assembly. The ABB 
True View™ system is installed on a PC with a Meteor II frame grabber. The PC 
and IRC5 robot controllers communicate via Ethernet. The fixture position errors 
are computed by the ABB TrueView™ system and sent to the robot controller. For 
the search process using the spiral pattern, the search radius is set to be 1.2 mm, and 
the number of turns is 4. The search force is set to be 20 N. For the insertion process, 
the spring force constant is set to be 50 N/mm. The settle down force is set to 
be 50 N. 

In order to improve the accuracy of the assembly system, the position errors and 
orientation errors relative to a reference robot configuration are used to calculate the 
robot position and orientation. A reference configuration is taught by inserting the 
piston into the valve body manually while the force control is active. The position and 
orientation are recorded as the reference position and orientation in the robot base 
frame. After that, the robot moves to the “Home” position without blocking the image 
of the fixture. The TrueView™ system will capture the image, analyze the image, and 
output the part coordinates in the robot frame using the calibration matrix. These XY 
coordinates obtained from the image system may be different from the taught 
reference position and orientation in the base frame. The two reference position 
and orientation differences are calculated. When the valve body moves to different 
places, the differences are used to compute the robot position and orientation in the 
robot base frame. This method can decrease the calibration error. 

Two sets of experiments are performed to verify the new vision and force control 
strategy. In the first set of experiments, the valve body is fixed on the table using a 
clamp. Figure 25 shows the assembly when the fixture is set at different places, and 
the assemblies are successful. A series of experiments were performed without 
failure. In the second set of experiments, the valve body is free to move on the table 
to simulate the assembly without fixture. Figure 26 shows the assembly at different 
places without fixture, and the assemblies are successful. A series of experiments 
were performed without failure. Therefore, the developed strategy can be used for 
the high accuracy assembly in manufacturing. 

Without a combined vision and force control strategy, the high accuracy assem- 
bly is hard to be implemented. Figure 27 shows the assembly without vision 
guidance. The position errors are too large, and the robot cannot insert the accu- 
mulator into the valve body. 

Figure 28 shows the assembly without force/torque control. The accumulator is 
jammed at the top of the valve body and cannot be inserted into the valve body. 

Summary 

In this section, a vision and force control-integrated robotic assembly technique is 
discussed. A high accuracy assembly system is successfully implemented based on 
a combined vision and force/torque control strategy in a semi- structured environ- 
ment. The vision system guides the robot tool near the desired position. A force/ 
torque control strategy is then applied to search for the desired position and perform 
the assembly. The system can also be applied to other similar assembly processes 
such as manual clutch assembly, torque converter assembly, etc. 
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Fig. 25 The high accuracy assembly with fixture 


Assembly on a Moving Production Line 
Introduction 

Robotized moving line assembly system (assembly is performed while the part is 
moving) is an intelligent robotic system based on vision, force, and position sensor 
fusion while the object is moving randomly on a platform. The robot can track the 
moving object while performing the assembly process. Compared to the stop station 
assembly (assembly is performed when the part is stationary), the moving line 
assembly can save huge amount of time and resource. However, integration of 
visual servoing and force control in industrial robotic systems is still a very 
challenging research area, especially when the system is installed for trim-and- 
final assembly processes in automotive manufacturing because a small damage to 
the parts could cause a big loss. 

In patent 6,886,23 1,B2, a stop-assembly-station system is described. The vision 
system is used to identify the position/orientation of the part. The assembly is 
carried out by using the vision system alone. A vision and force control -based stop 
station assembly system is described in patent 6,141,863. The force control is used 
to execute the insertion task, and the vision system is used to identify the position/ 
orientation of a part. In patent 5,040,056, a part picking up system is presented. The 
part position/orientation is identified by the vision system, and the data are 


64 Robotic Assembly 




Fig. 26 The high accuracy assembly without fixture 


Fig. 27 The high accuracy 
assembly failed without 
vision guidance 



transferred to the robot to pick up the part. Although the conveyor is moving, the 
vision system is used only for position/orientation identification. Therefore, no 
visual servoing involved. There are many research papers about moving line 
tracking, such as Cho (2005) and Shin (2006). These systems use only visual 
servoing to track the motion of the parts to perform the assembly tasks. In the 
actual applications, especially in the trim-and-fmal assembly, the malfunction of 
the visual servoing could damage the final products, which is very costly. Beaten 
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Fig. 28 The high accuracy 
assembly failed without 
force/torque control 



et al. (Beaten and Schutter 2002; Beaten et al. 2002) developed a hybrid vision and 
force control strategy to follow the contour of planar surface. Xiao et al. (2000), 
Chang et al. (2002), and Olsson et al. (2002) presented methods to follow a curve on 
a surface while maintaining a certain contact force. Although these methods can 
deal with the feature following based on hybrid control strategy, the applications 
in the moving object tracking while performing certain tasks are not discussed. 
Nelson et al. (1995) discussed different methods based on vision and force control, 
such as hybrid control, traded control, and shared control methods. Interesting 
experimental results are presented regarding the performance of these methods. 
However, the force control is only used to maintain a certain contact force. 
Although combination of vision and force control strategy has been widely studied, 
no practical implementation to perform complicated assembly processes has been 
done to perform assembly using the intelligent robotics system based on the 
synergistic combination of vision, force, and position as the feedback information. 

In actual applications, the part on the moving assembly line may move randomly 
along the X, Y, and Z axes. 3D vision (stereo vision) system can be used to track the 
motion of the part (Yoon et al. 2006). The processing of stereo information enables 
a robot to determine the position and orientation of an object in the robot coordinate 
frame. However, an accurate calibration between the camera coordinate system and 
robot coordinate system has to be realized. Because the stereo vision system 
requires high-quality camera, accurate calibrations, and high computation power, 
the existing systems are costly, error prone, and not robust enough for daily use at 
the workshop. Also the complicated computation for determining the position/ 
orientation of an object makes the system difficult to be implemented in real-time 
applications. Furthermore, the visual servoing alone could cause damages to the 
final products if the vision system malfunctions. Therefore, the robotic system must 
be compliant to the environment during the assembly process. Hence, force control 
combined with a simple vision system is a better choice for industrial applications. 
To perform an assembly using a single camera system, the X and Y axes errors of 
the moving part can be compensated by visual servoing; however, the Z axis errors 
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are difficult to compensate using only one camera. Thus, force control is applied to 
control the motion of a robotic system to perform an assembly task along the Z axis. 

This section discusses an assembly technology while the object is moving using 
visual servoing and force control. The wheel loading process, which assemblies the 
wheel onto the wheel hub of a vehicle on a moving assembly line, is used as an 
example to demonstrate the developed technology. A working system has been set 
up, and experiments of loading wheels onto the wheel hub were implemented and 
performed successfully. The experimental results demonstrate that the developed 
technology can be used for assembly while the part on the assembly line is moving 
randomly . 

Moving Line Assembly Method 

System Structure 

The developed moving line assembly system can track the moving object while 
performing the assembly process based on the vision, force, and position sensor 
fusion. Figure 29 shows a moving line assembly system. The object (part) is moving 
while the conveyor is moving. The motion of the conveyor is typically random with 
the velocity change along both the X and Y directions as shown in Fig. 29. Hence, 
the robot has to track the motion of the moving part along both the X and Y 
directions in order to perform the assembly processes. This is different from the 
typical conveyor tracking process which only the motion along the conveyor 
moving direction is controlled. 

Since the system involves the vision system, force control system, and the robot 
control system, the system structure is discussed first. 
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Fig. 29 A moving line assembly system tracks the motion of the object while some assembly 
processes are performed 
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Fig. 30 The control system structure. IRC5 is an ABB robot controller 

The control structure is shown in Fig. 30. The vision system processes the 
captured images and sends the position signals to the robot controller (IRC5, an 
ABB controller) to control the motion of the robot to follow the motion of the 
object. The force sensor measures the contact force to adjust the motion of the 
robot. The force control also makes the robot compliant to the environment to avoid 
the damage to the contact object. 

In the vision system, a predefined feature on the moving part is identified by a 
camera. The position error of the feature compared with the desired feature in the 
camera frame is computed to control the motion of the robot as shown in Fig. 31. 
Since only one camera is used, the robot can only track the motion of the moving 
part along the X and Y directions. To perform an assembly, the robot has to be 
controlled to approach the moving part. Therefore, force control along the Z axis is 
applied to control the motion of the robot to perform an assembly process. Since the 
visual servoing alone could cause damages to the final products if the vision system 
malfunctions, the force control along both the X and Y axes is also enabled to make 
the robotic system compliant to the contact environment. 

Several coordinate frames are involved in the system: the conveyor frame (work 
object frame), the feature frame, the camera frame, the tool frame (gripper), the 
force sensor frame, the robot base frame, and the world frame. Most of the frames 
are common except the feature frame. The feature frame is defined on the center of 
the feature and is parallel to the work object frame. Later on, these frames are used 
to describe the algorithms. 

Control System 

The motion of the robot is controlled by the robot controller based on the sensor 
inputs. The control system structure is shown in Fig. 32. 
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Fig. 31 The actual identified 
feature and the desired feature 
in the camera frame 
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Fig. 32 The control system structure of the moving line assembly system 


After the feature error is identified, it is input to the visual servoing controller to 
correct the feature error. The force sensor measures the contact force and torque. 
The desired force and torque are controlled to be zero except the Z axis that is 
controlled to move the robot towards the moving part. The force and torque errors 
are sent to the force controller to control the contact force. This is a trade-off 
between the visual servoing and force control that is achieved by the controller 
gains. These computed velocity errors are input to the robot controller to regulate 
the robot velocity to achieve the desired performance. 

The reference velocity v re f is generated using the actual gripper (tool) velocity v, 
the velocity error Av 7 computed from the vision system in the tool frame, and the 
velocity error A Vf from the force control system in the tool frame, as described in 
the following equation: 


v re f = v + A V/ + A Vf 
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where 


A v 7 = Kj (X d - X) 

A v f =K f (F d -F) 

F d and F are the desired force and actual measured contact force, respectively, in 
the tool frame. K I is the visual servoing coefficient and Kj the force control 
coefficient. X d and X are the desired feature position and actual feature position, 
respectively, in the camera frame. Because the velocity error from the vision system 
is computed in the tool frame, the transformation matrix from the camera frame (the 
image frame) to the tool frame has to be identified. Since the camera is moving 
towards the moving object, the scale of the feature is keeping changing related to 
the distance from the feature to the camera. Therefore, the desired feature position 
is updated on the real time. 

Experimental Implementation and Results 

To validate the developed method, experiments were implemented to perform an 
assembly process while the object (part) is moving on a conveyor. An ABB 
IRB6400 robot with force control was used as shown in Fig. 33. The wheel loading, 
a representative assembly process, is used to validate the developed methodology. 

Figure 33 shows the wheel loading system. The robot is controlled to automat- 
ically assemble the wheel into a wheel hub based on the synergistic combination of 
vision, force, and position as the feedback information to control the robot motion. 
The steps are as follows: 



Fig. 33 The wheel loading system. The wheel is assembled on the wheel hub while the wheel hub 
is moving on a conveyor 
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Fig. 34 The robot picks up a 
randomly located wheel, (a) 
The wheel is randomly 
located on a feeding table, (b) 
The vision system 
automatically identifies the 
location of the wheel to 
control the robot to pick up 
the wheel 



1 . The 2D vision system identifies the position/orientation of a wheel and picks it 
up as shown in Fig. 34. 

2. The robot moves to the first position (taught manually). The camera will take an 
image when the moving assembly line (conveyor) triggers the first trigger. After 
image processing, the orientation pattern of the bolts on the hub is identified. The 
robot additional axis will rotate to match the holes on the wheel to the bolts on 
the hub. 

3. While the additional axis is rotating, the robot moves to the second position. 
When the second trigger is triggered, the vision system will identify the speed of 
the moving object and track it. At the same time, force control is enabled along 
all directions. The robot will move towards the hub because of force control 
along the Z axis. 

4. Once the contact force along the Z axis reaches a certain value, the assembly 
completes and the force control along the negative Z axis will retract the robot. 

5. The force control along all directions will keep the system safe under abnormal 
conditions. For example, when the vision system sends wrong signals, the force 
control will balance the wrong signals and will not cause damage to the system 
and the product. 



2382 


H. Chen et al. 



Fig. 35 The robot tracks the motion of the wheel hub while the force control enables the tool 
approaches the wheel hub 


There are several calibrations involved in the wheel loading process. First, the 
holes on the wheel and the studs on the hub have to be matched. This is done by 
teaching. The tool location is then recorded, and the transformation between the 
tool frame, the feature frame, and the camera frame is then computed. The wheel is 
then moved and dropped on the wheel feeding table. The robot is moved to a 
position to capture the image of the wheel, and the reference position and orienta- 
tion are computed. The camera is then moved to several positions to calibrate the 
transformation matrix between the wheel object frame and the tool frame. These 
data are used to compute the wheel location when the wheel is loaded on the table 
randomly . 

Figures 35 and 36 show the assembly process. The robot tool grips the wheel and 
tracks the motion of the wheel hub. The wheel is then installed on the wheel hub 
using force control. The force control is enabled along all direction to make the 
system flexible to the environment. 

Figure 37 shows the velocity along the X and Y directions recorded from the 
robot controller. The velocity is changing randomly along both the X and Y 
directions. The vision system is tracking the feature and input the velocity errors 
to the robot controller to control the motion of the tool. 

Summary 

This section presents an intelligent industrial robotic system that can perform 
assembly tasks with high autonomy and adaptability to the environments. The 
assembly is performed while the object is moving based on the visual servoing 
and force control strategy. The wheel loading process, which assemblies the wheel 
into the wheel hub of a vehicle on a moving assembly line, is used as an example to 


64 Robotic Assembly 


2383 



Fig. 36 The wheel is successfully installed on the wheel hub. The assembly process is completed 


demonstrate the developed technology. Experiments were performed successfully, 
and the results demonstrated that the developed technology can be used for assem- 
bly while the assembly line is moving randomly. Since huge amount of time and 
resource can be saved using the developed intelligent robotic system, this innova- 
tive technology will have great impact in the automotive industry, especially when 
the labor cost is becoming higher and higher. 


Assembly Parameter Optimization 
Off-Line Parameter Optimization 
Introduction 

With the high demand of flexible manufacturing automation, industrial robot 
applications have been expanded into processes with contact force such as assem- 
bly and machining. In conventional robotic assembly applications, a passive com- 
pliant tool is commonly used to compensate the robot position errors. It works for 
assembly processes with relatively simple and loose insertions. For processes with 
high precision and tight tolerance requirements, robot force control has been 
developed and implemented. This recent advance in industrial robot control gives 
a “touch” sensing of the industrial robots and permits an entire new class of robot 
behaviors and applications. The new robot behaviors are possible due to the 
incorporation of force sensor into the robot control system. The force control 
robot technology enables robotic automation applications that mate parts together 
such as gear meshing, spline insertion, clutch hub assembly, and surface grinding 
following complex curved geometry. Gravel et. al. (2001) give an overview on the 
research and development of assembly applications using robots with force control. 
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Fig. 37 The recorded tool velocity. The object is moving along both the X and Y axes 


Robotic assembly application projects that involved universities and robotic man- 
ufacturers were described. Robotic assembly with a complicated and heavy (up to 
75 kg) transmission component, torque converter, was successfully performed 
(Zhang et al. 2004). The system has been installed and run in multiple production 
lines for several years. Besides torque converter assembly, transmission valve body 
assembly and engine piston installation have also been investigated. The papers 
(Wang et al. 2008; Zhang et al. 2007) dealt with force control technology and 
applications. More details in force control assembly theoretical development and 
application are described in section “Force Control Assembly.” 
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On the other hand, robot force control introduced complexity and uncertainty to 
the robot programming, control parameter setting up, and manufacturing process 
quality. The force-controlled robot behaves differently for different contact force 
conditions resulting from the manufacturing variations of the assembled parts, 
fixture, and environment disturbances on the manufacturing floor. One of the 
most recognizable behavior differences from position-controlled robot is that the 
robot motion cycle time is no longer a predetermined value in force control. 
Normally, it will be distributed in a statistics manner for robotic assembly pro- 
cesses. And the mean and standard deviation of the assembly cycle time and FTT 
rate become the measurement and the optimization objectives of a force-controlled 
robot assembly system. In order to optimize the performance of robotic assembly, 
the process needs to be first parameterized. 

Since the introduction of force control, in which actual robot path depends on not 
only the programmed position but also the interaction force between the assembled 
parts/components, the optimization has become more difficult. Although the 
search-based assembly strategy offers simple and robust solution that is favored 
in the industrial environment, it is still difficult to find the optimal parameter 
settings for the search motion. An optimization procedure is therefore much needed 
for the search-based strategy. On the other hand, the search-based strategy is very 
well parameterized by its search motion parameters and therefore is very suitable 
for optimization. The optimal set of robotic (force control) parameters is often 
obtained either by trial and error or off-line analysis tools. It is tedious and time 
consuming. Because of the statistical nature of the assembly task and increasing 
popularity of design of experiment (DOE) (Montgomery 2005) in manufacturing 
quality control, an off-line DOE has been used in the robot assembly parameter 
optimization to assist the system setup (Robotics Application Manual - Force 
Control for Assembly). The paper (Gravel 2007) introduced a systematic method 
to obtain a set of optimal assembly parameters by DOE. After the assembly 
parameters to be optimized are selected, the experiment is designed and coded 
into robot program. The program is executed on a robot controller, and the result 
data is stored in a data file. The data is processed by a separate computer with a 
statistic analysis software package such as Minitab. This whole parameter optimi- 
zation process often needs to involve a DOE expert, a robot programmer, and a 
robot operator so it is difficult to be used in manufacturing floor in massive robotic 
assembly production environment. 

Theoretical Development 

Parameterizing the Robotic Assembly Process 

In order to optimize the robotic assembly parameters, the process needs to be first 
parameterized. Based on the nature of common assembly types, assembly processes 
are categorized into cylindrical, radial, and multistage insertion/assembly in this 
investigation. Cylindrical insertion includes assembly applications with a cylinder 
and a smooth bore, such as transmission torque converter assembly shown in Fig. 4 
and piston stuffing in Fig. 6. 
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Spiral search is used as primary pattern. Radial insertion includes assembly 
applications with a “toothed,” “splined,” or “gear” mesh, such as a forward clutch 
assembly and spline gear assembly. Rotational search is used as primary pattern. 
And multistage insertion which requires the combination of cylindrical and radial at 
different stages includes assembly applications such as a torque converter assem- 
bly, half shaft-differential assembly, and the like. The combination of rotational and 
circular motion is often used as search pattern. Besides, hopping, oscillating along 
the insertion direction, can help a lot to overcome the contact friction force and the 
sliding between the loosing parts. The cylindrical, radial, and multistage insertion 
types cover most of the real-world assembly applications. For each insertion/ 
assembly type, certain search pattern is applied to perform the assembly process. 
Correspondingly, there are certain robot force control parameters that are related to 
each search pattern. As an example, the robotic assembly parameters for multistage 
type used in torque converter assembly are described in this section. A RAPID 
system module, AsmWareBase, has been written to carry out the assembly process 
parameterization. The AsmWareBase module can be called in same manner as 
standard RAPID instructions. There are two sets of assembly parameters in the 
specification. One set of the parameters is called Set Parameter. They are related to 
the search pattern transition, process action, and insertion termination and normally 
setup at the beginning of the robotic assembly engineering process, as listed below: 


Insertion Distance - distance between the insertion starting position and the end 
position 

Engage Distance - distance between the insertion starting position to the posi- 
tion where the parts are engaged 

Time Limit - time limitation before a search process is terminated or 
re-performed; 

Max Num Try - maximum number of trials before an unsuccessful insertion is 
claimed 

Use Timeout Act - a flag that signals whether or not to use a customer-specified 
timeout handling method 

TO Action Num - timeout handling routine number 

Use Force Cond - a flag that signals whether or not to use force control insertion 
termination 

Cond Force Value - condition force value used in force control insertion 
termination 

Use 10 Action - a flag that signals whether or not 10 action is used at end of the 
insertion 

10 Action Num - the 10 number used in the action above 

Use Force Retreat - a flag that signals whether or not the force control retreat is 

used 

• Retreat Force - the force value used in the force control retreat 


Another set of the parameters, which are related to the performance tuning of the 
robotic assembly process, is called Tune Parameters. Tune parameters are insertion/ 
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Fig. 38 The radial-type insertion reference path 


assembly type specific. For the torque converter assembly, the tune parameters 
may be: 

• Search Force - max value of the searching/insertion force used in the tool Z 
direction 

• Rotation Speed - the rotation speed used in the insertion around the tool Z axis 

• Rotation Angle - the maximum angle value used in the above rotation motion 

• Circular Speed - the circular searching speed in the XY plane of the tool frame 

• Circular Radius - the circular radius used in the insertion corresponding to the 
circular speed above 

• Force Amp - the amplitude of the hopping force in the Z direction of tool frame 

• Force Period - the period of the hopping (oscillation) force 

The combination of the given assembly parameter and contact force defines the 
actual assembly path. An example of the radial-type insertion reference path is 
shown in Fig. 38. 

Design of Experiment Methodology 

DOE is a powerful technique used for exploring new processes, gaining knowledge 
of the existing processes, and optimizing these processes for achieving optimal 
performance. DOE’s design and analysis theory is used in this investigation. The 
book (Montgomery 2005) gives the design and formulation detail. 
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Fig. 39 DOE process 
analysis diagram 
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Fig. 40 DOE analysis of 
variance diagram 



As illustrated in Fig. 39 for a particular process, the controllable variables such as 
production batch are defined, the uncontrollable variables are ignored, the inputs are 
varied in a designed manner, and the output characteristics are measured. 

A fractional factorial test is used to find the most influential parameters first, and 
then full factorial tests are used to result in a set of optimized parameters. In this 
robotic assembly parameter optimization, for example, the input parameters are 
Search Force, Rotation Speed, Rotation Angle, Circular Speed, Circular Angle, 
Force Amp, and Force Period. The output variables are Assembly Time and FTT rate. 

The DOE Analysis of Variance (ANOVA) method, as shown in the diagram in 
Fig. 40 is used in the experimental data analysis. 

Suppose there are a levels of a single factor that are to be compared, the 
observed response from each of the a levels is a random variable. The data would 
be yjj, representing the yth observation taken under factor level i. There will be, in 
general, n observations. Then 



— fl + T/ + £jj 



1,2, . . . , a 
1,2, . . . , n 



where 

fi is a parameter common to all levels called the overall mean. 
Tj is a parameter unique to the zth level called zth-level effect. 
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By is a random error component that incorporates all other source of variability in 
the experiment. Equation (3.1.1) is called the single-factor analysis of variance. If 
the experiment design is a completely randomized design, the objective is to test 
appreciate hypotheses about the treatment means and to estimate them. For hypoth- 
eses testing, the model errors are assumed to be normally and independently 
distributed random variables with mean and zero and variance a . The variance 
a is assumed to be constant for all levels of the factor. This implies the 
observations: 


y tj ~ N (fi + T t , a 1 ) (9) 

and that the observations are mutually independent. DOE is selected as the optimi- 
zation method due to its popular acceptance by the industry, although other 
nonlinear optimization methods such as genetic algorithm and simulated annealing 
can be also applied. Due to the randomness of the assembly strategy and the part 
positional uncertainties, it is necessary to run at least 10 replicates of each exper- 
imental trial in order to get a reliable performance measure. The performance 
measure here is chosen as a statistic of the cycle time: 

P=/A-\-W a *(7 + W p *//*/? (10) 


where 

p is the mean cycle time. 
o is the standard deviation of the cycle time. 
p is the failure rate after the repeated trials. 

w 0 and w p are the weight of the standard deviation and the failure rate. 

The motivation behind this performance measure is that a production process 
should not only have low mean cycle time and failure rate but also have low 
variability in the process. In addition, for safety reasons all the parameter values 
should be properly bounded. 

The DOE process can be performed at each substage, as illustrated in Fig. 41. 
The optimization of each substage starts with the manual selection of the 
optimizable parameters and their bounds. The purpose of the initial screening is 
to reduce the number of parameters so that the final optimization step can be 
performed more efficiently. The typical DOE-based parameter optimization steps 
normally are as follows: 

1. Screening: using fractional factorial experimental design matrix to generate the 
experimental runs, which consists of a carefully chose fraction (subset) of the 
experimental runs of a full factorial design based on the sparsity-of-effects 
principle. Then the collected data are subjected to statistical analysis to identify 
the most influential parameters. Figure 42 shows a typical Pareto plot from 
screening process. The higher the value of the parameter in the plot, the more 
influential the parameter contributes to the output performance. 
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Fig. 41 DOE optimization 
procedure 
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2. Optimization: based on the screening result, using full factorial experimental 
design matrix to generate the experimental runs, which consists of 2-level or 
3 -level discrete possible values of most influential parameters. The results of 
experimental runs are analyzed to find the optimal parameter set. 

3. Verifying: verifying the optimized parameter set by running a number of exper- 
iments and checking on the distribution of the objective metrics. The number of 
parameters to be optimized and the level of values for each parameter can be 
varied widely based on the sensitivity of the optimization goals to the parameter 
change, the available number of tested parts, and the cost of the experiment. 

Experiment 

With the development of the on-pendant assembly parameter optimization tool, 
tests have been done on both transmission valve body and torque converter assem- 
bly applications. This section presents the testing result using a torque converter 
assembly. As shown in Fig. 43, a Ford 6R60/80 transmission torque converter is 
used in the test. 

After basic robot force control setup, the initial assembly parameter values are 
chosen by analyzing the torque converter drawings, measuring part dimensions 
directly, and/or observing manual assembly process. For this particular case, the 
assembly process is divided into two stages. The first stage is from the start position 
all way to matching spline gears until the pump gear. In this stage rotational search 
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D F G B C E A 

Fig. 42 The Pareto plot from screening process 


Fig. 43 6R60/80 

transmission torque converter 
assembly process 



range would be relatively small (about 45°), and circular search with certain radius 
is necessary to deal with the starting XY position errors. Default offset (10 %) is 
added and subtracted from the nominal values to get the lower and upper bound- 
aries. Figure 44 shows the initial parameter values and lower and upper boundaries 
for the first stage. 

When initial parameter values are set and tested, screening run can be 
performed. PB-8 or PB-12 (Plackett-Burman method) could be used for the 
seven-parameter screening. The same experiment can be repeated ten times for 
better accuracy in statistics sense. After the screening run is completed, experimen- 
tal data are processed by pressing the “Update” button, and the screening result is 
displayed in the order of the most to the least influential parameters to the assembly 
cycle time. After the screening is done, the first three most influential parameters 
will be automatically selected for the optimization process. Figure 45 shows the 
setup for parameter optimization. 
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Fig. 44 The initial parameter values and lower and upper boundaries for the first stage 
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Fig. 45 The setup for parameter optimization 
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If a parameter other than the default one is to be optimized, the box on the far 
right side can be checked. Only can three parameters be optimized at a time. 
Figure 46 shows a typical optimization result. 

The first three most optimal parameter sets are displayed along with their success 
rate if multiple tests have been performed for the same experiment. Main effect plot 
can be seen by clicking the corresponding icon. Figure 47 shows a typical main 
effect plot. From the main effect plot, the trend of the optimal parameter value of a 
particular parameter can be figured out. 

To verify the mean and standard deviation of the assembly cycle time and 
success rate in a relatively large member of samples, a verification experiment is 
conducted. Up to 100 tests can be run, and resulting data can be processed by 
pressing Calculate button on the verification result page shown in Fig. 48. 
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Fig. 47 Main effect plot 
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Fig. 48 Verification result 
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Fig. 49 Assembly cycle distribution plot 


Figure 49 shows a typical assembly cycle distribution plot for a torque converter 
assembly. The assembly cycle time mean and the standard deviation are 7.56 and 
2.65 s, respectively, in this example. 

If the verification result is satisfactory, the parameter set can be used in produc- 
tion. Otherwise the parameters can be used as new initial parameters for the next 
round of parameter optimization until the desired parameter values are obtained. 


Summary 

The assembly parameter optimization tool has been used in finding a starting 
parameter set for a real-world robotic assembly cell. It has been proven that 
using the optimization tool, the robotic assembly cell programming time including 
obtaining the optimal parameter set can be reduced from weeks to days. Realizing 
that the optimal parameter set is not only related to the production batches but also 
related to the assembly conveyor and pallet accuracy, the robot gripper design, and 
manufacturing tolerance as well as the assembly starting position, the optimization 
result is robot cell and installation specific. In the applications which use same type 
of parts, same robot models, and same assembly programming strategy, the optimal 
parameter values could be close but often not exactly the same. So the optimization 
ought to be done on the production unit and on the manufacturing lines. To expand 
the parameter optimization concept, the next step is to optimize the assembly 
parameters automatically online (in production) which will be described in the 
next section. 


Online Parameter Optimization 
Introduction 

DOE is faster than exhaustive parameter searches and is driven towards a systemic 
optimal solution (Siciliano and Villani 1999) but usually requires several profes- 
sionals to construct and implement. At the early stage of the optimization work, in 
order to find the optimized parameter for setting up the system to meet the 


64 Robotic Assembly 


2395 


production, the experiments were designed off-line using Minitab on a PC and 
programmed into robot motion program. The robot program runs on the actual or 
close to actual production environment and the result data are collected. The data 
file is then imported into Mini tab for analysis. This process is often needed to be 
done for several iterations. There are several professionals needed to be collabo- 
rated in performing this optimization task, including a manufacturing quality 
control expert to analyze the assembly process and design proper DOE experi- 
ments, a robot programmer to code the design into robot program, an operator to 
execute the program, and then the quality control person to utilize the data to design 
a new experiment. To assist the optimization process, improve the efficiency, and 
make the optimization process down to the manufacturing floor, a robotic assembly 
optimization tool, running in an auxiliary processor in the teach pendant of the 
robot, is developed (Xu and Paul 1990). However, in the production of robotic 
assembly, for example, torque converter and transmission assembly, even with the 
optimized parameters, the performance of the robotic assembly system is deterio- 
rated when manufacturing environment changes such as the variation of geometri- 
cal dimension of a part and tool (the location of feature on part, the size of the 
feature, the dimension of the tool, etc.); the changes of position and orientation of a 
part, fixture, or robot; and the changes of properties of a part (weight, spring 
constant, etc.). The proposed method applies DOE-based parameter optimization 
in production to automatically adapt to the manufacturing environmental changes 
in robotic force control assembly and optimize the productivity of the assembly. 
The challenge in parameter optimization in production is to minimize the distur- 
bance of experimental trials and balance the cost of the number of experiments and 
optimization goals. Because the robotic torque converter assembly cell operates in a 
continuously running production line, a certain throughput is designed and 
maintained in order to have smooth production. There are only limited buffers in 
production line to accommodate the small variation of the assembly cycle time. On 
the other hand, in order to find the optimized parameters and adapt to the 
manufacturing environment changes, the values of the current production parame- 
ters have to be varied to perform the DOE experimental trials. Correspondingly the 
performance of the robotic assembly is varied. So the keys to the success of 
automatic in production parameter optimization are the following: 

1. Limit the interference of the optimization process to production. The proper 
algorithm should be built into the optimization process to control how to vary the 
parameters, when to switch into optimization, and when to switch back to 
production. 

2. Also the access to the result of optimization experimental trials, the analysis of 
the result, and the update of the production parameters should not disrupt the 
production. 

3. Moreover, the cost of the experiment and optimization goals needs to be 
balanced. The number of the experimental trials and the number of the optimi- 
zation iterations need to be limited. Although in theory the more trials the less 
sensitive the result is to the random noise in the process and the more iterations 
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of optimization the close the result is to the optimal solution, the variation of the 
assembly cycle time during the optimization process will make the overall 
performance of the production worse. More important the goal for the online 
parameter optimization is NOT to find the best set of parameters from the 
thousands of trials happened long time ago; the goal is to quickly catch the changes 
in the parts and fixtures and optimize the system to adapt to these changes. 
The manufacturing environment probably changes again before the optimization 
is complete. 

Theoretical Development 

Figure 50 illustrates the flow chart of the online DOE-based parameter optimization 
process, which contains three major functional tasks: (1) process monitoring, 
(2) parameter optimization, and (3) parameter verification. 

The online DOE-based parameter optimization removes the screening process to 
simplify the standard DOE process. The assumption is that the screening has been 
done in system setup process, and the rank of the influence of the parameters does 
not change greatly. In the normal production, which is a stable process, this is a 
valid assumption since the manufacturing environment will only change gradually. 
If the manufacturing environment changes dramatically, the production becomes 
unstable. The DOE-based parameter optimization should not be used to recover the 
production performance. The intervention from the professionals is expected to 
investigate the root cause of the changes. 

An algorithm is built into the optimization process to compare the verification 
result with the baseline result. The baseline result is the performance of production 
parameters and the performance of the optimized parameters in the previous 
optimization iteration. If the improvement is not significant, as defined by the 
user in terms of % improvement in the cycle time metrics, the optimization process 
is complete. 

Experiment 

Based on a real-world transmission torque converter assembly production process, 
investigation and analysis are performed in production. An on-pendant robotic 
assembly parameter optimization tool is developed to implement the online 
DOE-based parameter optimization technique. The optimization tool, running in 
an auxiliary processor on the teach pendant of the robot, applies full factorial 
experiments on the most influential parameters. Then the results are subjected to 
statistical analysis to find the optimal parameter set. The optimized parameter set is 
verified through running a number of experiments and checking on performance of 
the force control assembly to adapt to the changes. The software structure is 
illustrated in Fig. 51. 

In this software package, the robotic assembly process (using torque converter 
assembly as a development platform, but the tool can be used for various assembly 
processes of other types) has been firstly parameterized into operator- 
understandable terms and parameters such as starting point, assembly stage, inser- 
tion distance, timeout limit, max number of trials, searching force, rotation speed, 
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Fig. 50 The online 
(in production) DOE-based 
parameter optimization 
process 


Paraitwlftr Optimisation 

Setup 



rotation angle, force amplitude, force (sin wave) period, and so on. A robot program 
(ABB robot programming language, RAPID) is written to perform a particular 
assembly production. Another RAPID program module is to convert the process- 
related parameters into robot force control parameters, generate the DOE experi- 
ment design matrix to vary the value of the optimizing parameters, and collect the 
experimental trial results. Then, a DOE design and analysis function has been 
simplified specifically for the robotic assembly applications and coded into a C# 
library to realize the DOE design and analysis on the touch-screen robot teach 
pendant (so-called ABB FlexPendant) . 
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Fig. 51 The software 
structure diagram of the in 
production DOE-based 
parameter optimization tool 


Robot Controller 


Robotic Force Control Assembly 
Production Rapid Program 



DOE Experiment Rapid Program 



RAPID Program and Teach 
Pendant Interface Program 



Teach Pendant 



DOE Data Process C# Program 


Both RAPID production program module and DOE module are running in the 
ABB IRC5 robot controller. The DOE RAPID module seamlessly interacts with the 
force control assembly production program to switch the value of parameters from 
production to optimization, collect the result, and switch back to production value. 
The DOE data process, which involves the statistical analysis, is running on the 
teach pendant. And they exchanged the experimental data and parameter optimi- 
zation result via the “RAPID Program and Teach Pendant Interface Program” 
which is developed based on ABB “Robot Application Builder (RAB).” RAB 
allows the FlexPendant to communicate with IRC5 robot controller and exchange 
the data between the RAPID program and C# program. This software structure 
ensures that the access to the result of optimization experimental trials, the analysis 
of the result, and the update of the production parameters do not disrupt the 
production program. Moreover, the optimization tool can perform multiple 






64 Robotic Assembly 


2399 


Table 2 The in-production DOE-based parameter optimization experimental result 



Mean (s) 

StdDev (s) 

FTT (%) 

Baseline (initial) 

5.005 

2.014 

26.6 

Baseline (third iteration) 

4.616 

2.103 

53.3 

Verification 

4.639 

2.245 

50 


parameters and multiple level DOE optimization on multiple assembly stages 
simultaneously since the DOE experimental data collection and DOE experiment 
data processing are separated. 

The factory acceptance test on robotic force control torque converter assembly 
system was performed at the Ford Powertrain Assembly plant. The 3 -level 5 -factor 
243 (3 5 ) trials DOE optimization experiment was conducted on a three-stage 
assembly (the pump impeller feature that drives the transmission pump as the 
engine turns). Each trial has 10 repetitions to reduce the random noise in the test 
result. Totally 2,430 tests data are collected and analyzed from one optimization 
iteration. Table 2 shows the experimental results. 

In the table, Mean is mean of the assembly cycle time; StdDev is the standard 
deviation of the assembly cycle time; and FTT is the first time through rate (success 
rate). The experimental results show that the average assembly cycle time is 
improved from 5 s (initial) to 4.6 s (third optimization iteration). And success 
rate is improved from 26.6 % to 50 %. Also since in the third optimization iteration, 
the verification result does not have significant improvement compared with the 
third baseline result; therefore, the optimization is complete. The optimization tool 
updated the production force control assembly parameters and switched the system 
from optimization mode to production mode. This experimental result from the 
factory test proofs that the effectiveness of the proposed online DOE-based param- 
eter optimization and the productivity of the robotic force control assembly unit is 
improved. 

Summary 

This section introduces the force control robotic assembly and gives an example of 
torque converter assembly. DOE-based assembly parameter optimization is 
described. The online (in production) DOE-based parameter optimization is pro- 
posed to adapt to the manufacturing environment changes. The challenges and 
solutions are discussed. Finally the method is implemented and tested in force 
control torque converter assembly in factory. 


Summary 

This chapter describes the robotic assembly technology, including force control 
assembly, assembly without force control, vision and force control-integrated 
assembly, and assembly on a moving production line. Real industrial applications 
are used to demonstrate how to implement these techniques in manufacturing 
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automation. Because there are different process parameters that are involved in 
robotic force control assembly, parameter optimization methods are also presented. 
The steps about parameter optimization are illustrated by using a torque converter 
assembly process. Because the techniques are demonstrated using industrial exam- 
ples, they are somewhat ready to be implemented in production. 
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Abstract 

Robotic welding is continued to be one of the most popular applications of robots. 
Traditionally, spot welding process has been the biggest use of robots especially in 
the automotive industry. As the robot becomes cheaper and easier to program, 
there are increasing implementations of robotic welding systems in many other 
industries such as shipbuilding, offshore, constmction, and job shops. This chapter 
gives an overview of the types of robotic welding systems and their key compo- 
nents. It focuses on the arc welding processes which have been dominantly used in 
the various industries. The critical features of the robotic welding system for 
maximizing the welding quality, operational flexibility and productivity are 
discussed. These features are in the robot configurations, sensing, programming, 
workpiece handling, as well as welding process control. 


Introduction 

Robotic welding has been one of the most common and successful applications of 
robots in several industries. The earliest extensive application of welding robots 
started in the automotive industry in 1980s mainly for the resistance spot welding 
process. With the advent of the sensors and process control capabilities as well as 
advanced robot programming technology, welding robots are not just implemented 
for the high-volume production but have also been introduced for low-volume 
tasks. Such tasks are common in the industries of shipbuilding, offshore, and 
structural assemblies. Figure 1 shows several examples of robotic welding. 

Many industrial welding applications benefit from the introduction of the 
welding robots since most of the drawbacks attributed to the human factors are 
eliminated as a result. The key benefits of robotic welding include increased 
welding quality and productivity and lower production cost, which lead to cheaper 
and better-quality products. 

A typical robotic arc welding system includes an articulated six-degree-of- 
freedom (DoF) robot manipulator, welding power source, welding torch, wire 
feeder, positioner, welding torch cleaning and calibration station, fume extraction, 
and safety fence. More advanced robotic system equips with optical seam tracking 
system for complex welding processes which involve large number of passes. The 
basic functionalities of a robot welding system include welding, welding path 
programming, through- the -arc or optical seam tracking, as well as welding process 
monitoring and control. 

Operation of the welding robot in general involves two stages. In the setup stage 
the operator inputs the welding parameters and programs robot path. It is followed 
by the welding stage where the robot performs the welding process. In the setup 
stage, the user determines the welding process and procedure. This could be the 
selection of the preprogrammed job orders where the welding parameters and the 
robot paths are defined. The setup stage can also be one for programming the robot 
to weld a new workpiece. In such a setup stage, in addition to the welding process 
setting, the operator will also input the operational parameters such as number of 
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Fig. 1 Examples of robotic welding: (a) welding a workpiece on a turntable (www.forster- 
welding-systems.com), (b) welding a car frame (www.lincolnelectric.com), (c) robot panel 
welding for shipyard automation (www.kranendonk.com), (d) arc welding station with two 
welding robots and one workpiece handling robot (www.forster-welding-systems.com) 


parts, motion of the workpiece positioner, teaching the robot welding path, and 
selection of the welding parameters. 

In the welding stage, seam tracking may be used to guide the robot along the seam. 
In general, through-the-arc sensing is a common method for seam tracking. During the 
welding stage, the workpiece may be moved by the positioner to allow full accessi- 
bility for the welding torch held by the robot as well as for maximizing productivity. 


Welding Processes 

Welding is a process of permanent joining two materials (usually metals) through 
localized coalescence resulting from a suitable combination of heat, pressure, and 
metallurgical conditions. In order to obtain satisfactory welds, it is desirable to 
have: 

• A source of energy to create union by fusion or pressure 

• A method for removing surface contaminants 
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• A method for protecting metal from atmospheric contaminants 

• Control of weld metallurgy 


The solid and unified connection produced is called a weld joint. There are five 
basic types of weld joints, i.e., the butt, comer, tee, lap, and edge joints. The joints 
are made with many different types of welds. The most common types of welds are 
the bead, surfacing, plug, slot, fillet, and groove. Welding can be performed at flat, 
horizontal, vertical, and overhead positions (Jenney and O’Brien 2001). 

Welding is the most economical and efficient way to join metals permanently 
in terms of material usage and fabrication costs. It is a common method to join all 
of the commercial ferrous and nonferrous metals of different types and strengths. It is 
indispensable for fabricating and repairing metal products with wide applications in 
automotive, construction, marine and offshore, and aerospace industries. 

There are many different types of welding processes (Jenney and O’Brien 2001) 
which in general can be categorized as fusion and solid-state welding. For fusion 
welding the base and/or filler materials are melted, while solid-state welding pro- 
cesses do not involve the melting of the materials being joined. Fusion and solid-state 
welding processes can be classified as follows based on the source of energy: 


• Fusion welding 

• Gas welding 

• Arc welding 

• Resistance welding 

• Energy beam welding 

• Solid-state welding 

• Friction welding 

• Ultrasonic welding 

• Explosive welding 

• Forge and diffusion welding 


The most suitable welding processes for robotic applications include resistance 
spot welding (RSW); gas metal arc welding (GMAW); gas tungsten arc welding 
(GTAW); laser beam welding (LBW); hybrid laser arc welding (HLAW); and 
friction stir welding (FSW) (Pires et al. 2005). 


Resistance Spot Welding (RSW) 

Resistance spot welding (RSW) is one form of resistance welding processes in which 
heat is generated from the workpiece’s resistance to the flow of welding current and 
with the application of pressure. In resistance spot welding, the welding current and 
pressure is applied through copper alloy electrodes that concentrate the high electric- 
ity current and high contact pressure in the area of the weld to establish a permanent 
joint. The most critical parameters for resistance spot welding are time, current, and 
force. Figure 2 illustrates the resistance spot welding process. 
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Fig. 2 Schematic illustration 
of resistance spot welding 
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Resistance spot welding is well established for sheet metal joining where a 
continuous seam is not required. Materials suitable for resistance spot welding 
include steel, aluminum, and galvanized steel. The entire process only takes a 
fraction of a second, is easy to handle, requires no additional materials, such as 
gas or wire, and is easy to automate. This makes it the ideal application for mass 
production of vehicle bodies or the processing of large format panels outside the 
automotive industry in the fields of apparatus engineering, electronic systems, and 
the manufacture of domestic appliances. Mainstream robot manufacturers supply 
six-DoF resistance spot welding robots integrated with a welding gun and welding 
system which generates the current and pressure needed. 

Conventionally, a resistance spot welding gun is driven pneumatically to gen- 
erate the pressure. In recent years, it has become increasingly popular to use a spot 
welding gun driven by an electric servomotor (Niu et al. 2008). The welding gun 
becomes the seventh robot axis and synchronized gun and robot motions can be 
programmed and executed. Moreover, the servomotor’s torque can be more pre- 
cisely controlled. These make resistance spot welding faster, more precise, reliable, 
and efficient. Major automotive manufacturers such as Renault, Mazda, Toyota, 
and Ford choose servo welding equipment for their car body assembly lines instead 
of conventional pneumatic equipment. 

An innovative resistance spot welding process, DeltaSpot, was introduced by 
welding power source manufacturer Fronius in 2006 to join difficult materials 
such as high-strength steel and aluminum sheets. The defining feature of DeltaSpot 
is the robot welding gun with running process tape that runs between the 
electrodes and the sheets being joined as shown in Fig. 3 (Robotics On-line, 
Fronius 2012). 

The process tape performs several very different functions (Robotics On-line, 
Fronius 2012) as follows: 
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Fig. 3 Resistance spot 
welding gun with a running 
process tape 




Fig. 4 DeltaSpot for joining a 1.5 mm thick stiffening plate onto a 2 mm thick die-cast aluminum 
door frame 


• Prevent direct contact between the electrode and the workpiece to protect the 
electrodes from soiling or other influences emanating from the surface of the 
workpiece. 

• Improve the contact situation: for every single spot, the process tape creates a 
new electrode contact surface to the workpiece. 

• Influence the heat balance in the workpiece directly and selectively. 

The benefits of using running process tape include completely reproducible spot 
welds, excellent process reliability, less surface spatter, increased service life for 
electrodes, and the capability to produce top-quality joints between sheets made of 
different materials, even when they are of different thicknesses. One example 
shown in Fig. 4 is a 1.5 mm thick stiffening plate being spot welded onto a 2 mm 
thick aluminum die-cast door frame (Fabricating 2012). 
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Fig. 5 Schematic illustration of (a) GMAW and (b) GTAW 


Gas Metal and Gas Tungsten Arc Welding (GMAW and GTAW) 

Gas metal arc welding (GMAW) and gas tungsten arc welding (GTAW) are both 
arc welding processes. In arc welding processes, an AC or DC welding power 
source is used to ignite and maintain an electric arc between an electrode and the 
base material to melt the metals. Figure 5 illustrates the GMAW and GTAW arc 
welding processes. 

Gas metal arc welding (GMAW) is also referred as metal inert gas (MIG) or 
metal active gas (MAG) welding. Consumable electrode wire, having chemical 
composition similar to that of the base material, is continuously fed from a spool to 
the arc zone. An electric arc is created between the tip of the wire and the weld pool. 
The wire, which is usually solid wires, is progressively melted and added into the 
weld pool through various metal transfer modes and forms part of the weld pool. In 
general, there are four modes of metal transfer, e.g., short-circuit, globular, axial 
and pulsed spray transfer. Metal transfer mode depends upon the arc current, 
voltage, base and filler materials, electrode diameter, polarity and type of 
shielding gas. 

Both the arc and the weld pool are protected from atmospheric contamination by 
a shielding gas, which is delivered through a nozzle that is concentric with the 
welding wire guide tube inside the welding torch. The shielding gas or gas mixture 
may consist of inert gas, such as argon, or active gas like C0 2 , or a mixture of inert 
and active gases. The most critical parameters for GMAW include the following: 
voltage, wire feed rate (current), travel speed, arc (stick-out) length, shielding gas, 
wire diameter, and polarity (DC+ electrode positive (DCEP) or DC- electrode 
negative (DCEN)) as DC power source is usually used for GMAW). 

There are two variants of the GMAW process, which are flux-cored arc welding 
(FCAW) and metal-cored arc welding (MCAW). There are many common features 
between the three processes that they are all arc welding with consumable wire 
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Table 1 Differences among GMAW, FCAW, and MCAW 



GMAW 

FCAW 

MCAW 

Wire type 

Solid wire 

Tubular construction flux- 
cored wire with an outer metal 
sheath being filled with 
fluxing agents plus metal 
powder. The flux fill is used to 
provide alloying, arc stability, 
slag forming, deoxidation, and 
gas shielding with some wires 

Tubular construction metal- 
cored wire with an outer metal 
sheath being filled entirely 
with metal powder, except for 
a small amount of nonmetallic 
compounds. They are added to 
provide some arc stability and 
deoxidation 

Wire 

feeder 

Normal feed 
rolls 

Knurled feed rolls 

Knurled feed rolls 

Shielding 

gas 

Shielding gas 

Shielding gas and self- 
shielding (production of 
shielding gas by 
decomposition of fluxing 
agents within the wire) 

Shielding gas 

Slag 

cover 

No slag 

Slag cover 

No slag 

Transfer 

mode 

Short-circuit, 
globular, 
axial or 
pulsed spray 

Short-circuit or globular 

Short-circuit, axial or pulsed 
spray 


electrodes. However, there are also several fundamental differences, which are 
summarized in Table 1. 

GMAW process can be used to weld almost all metallic materials, in a wide 
range of thickness from 0.5 mm up to 30 mm or more, and is effective in all 
positions. GMAW is a very economical process because it has higher travel speed 
and deposition rate and does not require frequent stops to change electrodes. In 
addition, minimal post weld cleaning is needed because slag is almost absent, 
especially in solid-wire GMAW. These advantages make the process very well 
adapted to be automated and particularly to robotic welding applications. About 
80 % of robotic arc welding is GMAW welding. 

To further increase the travel speed and deposition rate, dual- wire or twin- wire 
GMAW process was developed (The Lincoln Electric Company 2003). In dual- 
wire GMAW process, two wires are continuously fed through a special welding 
torch and are consumed to form a single molten pool. Both wire electrodes’ arcs 
contribute to a common weld pool, but each is independently controlled by its own 
power source. The first electrode, the “lead,” controls deposition rate and penetra- 
tion. The second electrode, the “trail,” controls weld bead appearance. Systems 
normally comprise two separate wire feed units and two power sources, so the wires 
can be operated independently, i.e., with different wire diameters, current levels, or 
operating modes (continuous or pulsed). In pulsed mode, the current in each wire is 
pulsed alternately to avoid magnetic interactions between the two arcs. Welding 
commonly takes place with the two wires in line along the joint line, although the 
torch can be rotated across the joint to give a wider weld bead. Figure 6 illustrates 
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Fig. 6 Schematic illustrations of (a) dual-wire GMAW process and (b) its robotic system 
configuration 


dual-wire GMAW process and its robotic system configuration. This technology 
can only be used for mechanized or robotic welding because of the precision 
required in positioning the bulky torch. Joint accessibility is also restricted because 
of the torch size. 

Another development of GMAW is to reduce the heat input. The technology is 
called cold metal transfer (CMT) by incorporating the wire motions directly into the 
process control (Fronius 2014). During the arc period, the filler metal is moved 
towards the weld pool. When the filler metal dips into the weld pool, the arc is 
extinguished and the welding current is lowered. The rearward movement of the 
wire assists droplet detachment during the short circuit. The wire motion is then 
reversed and the process begins all over again. This makes the process alternating 
between hot and cold periods. Still maintaining the high speed and high deposition 
rate of a GMAW process, CMT technology makes the arc more stable and the 
welding process spatter free. It also enables GMAW process applicable for very 
thin sheet welding. Figure 7 illustrates the alternating hot and cold periods in CMT 
welding. 

Gas tungsten arc welding (GTAW) is also commonly referred to as tungsten 
inert gas (TIG) welding. GTAW uses the heat generated by an electric arc 
struck between a nonconsumable tungsten electrode and the workpiece to fuse 
metal in the joint area and produce a molten weld pool. The arc is shrouded in an 
inert gas shield to protect the weld pool and the nonconsumable electrode. The 
process may be operated autogenously without filler or filler may be added by 
feeding a consumable wire or rod into the established weld pool. GTWA can be run 
on either DC or AC current. The most critical parameters for GTAW include 
welding current, arc length (voltage), travel speed, shielding gas, and polarity if 
DC power source is used. 

DCEN is the most common mode of operation and is widely used for 
welding carbon, alloy, and stainless steels, as well as nickel and titanium alloys. 
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Fig. 7 Illustration of the hot and cold periods during CMT 


Copper alloys, with the exception of those containing aluminum in significant 
amounts, can also be welded with this polarity. DCEP is used for aluminum 
alloys when welding with pure helium as the shielding gas, while AC is used 
most commonly when welding aluminum and its alloys with pure argon or 
argon-helium mixtures. There are several variations of the GTWA process 
designed to improve productivity and they are orbital, hot wire, and narrow- 
gap GTAW. 

Different from GMAW, heat input in GTAW does not depend on the filler 
material feed rate. Consequently, the process allows a precise control of heat 
generation and the production of superior quality welds, with low distortion and 
free of spatter and little particulate fume. The process is very versatile and may be 
used to weld any metal or alloy system and over a wide range of thickness but is 
usually restricted to 10 mm and under for economic reasons. It is particularly suited 
to welding of sheet materials and for putting in the root pass of pipe welds. 
However, due to its relatively slow deposition rate, it is not regarded as a high- 
productivity process. 

Mainstream robot manufacturers usually work together with welding power 
source suppliers to provide total solutions for various GMAW and GTAW welding 
applications. The system is comprised of robots, fixtures and positioners, welding 
power sources and torches, wire feeders (for GMAW process), controllers, and 
weld sensing and monitoring module. 


Laser Beam and Hybrid Laser Arc Welding (LBW and HLAW) 

Laser Beam Welding (LBW) 

The term laser is the abbreviation for “light amplification by stimulated emission of 
radiation.” As a stimulated emission, laser beam is monochromatic and coherent in 
phase and has very small divergence. With the advancement of laser technology, 
lasers have been introduced into metal welding as heat source since the 1970s. Gas 
(C0 2 ) laser at 10.6 pm wavelength and solid-state (Nd:YAG) laser at 1.06 pm 
wavelength are the two main types of laser for welding applications. The powers 
for C0 2 lasers are approximately from 5 to 20 kW and can reach up to 60 kW. 
For Nd:YAG lasers, average output powers of up to several kW are obtainable 
(ISF Aachen 2013a). 
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In recent years, several other types of solid-state lasers such as Nd:YAG disk 
laser (1,030 nm wavelength), diode/semiconductor laser (920-1,050 nm wave- 
length), and Ytterbium fiber (1,070 nm wavelength) laser have been developed to 
the stage where power levels suitable for metal welding are available. With 
advantages of compact and modular design, high energy efficiency, good beam 
quality, high metal absorption, low maintenance, and operating cost, these solid- 
state lasers are also finding applications in metal welding. 

Laser beams need to be focused to a small spot to generate high power density for 
welding. For C0 2 laser, beam focusing is normally carried out with mirror optics such 
as transmissive lens or reflective mirrors. Nd: YAG and the other solid-state lasers can 
be transmitted through fibers and then focused by focusing optics. Fiber-coupled 
solid-state and fiber lasers are more flexible for robotic applications. Laser beam can 
even be distributed into several welding stations through beam deflecting or splitting. 

Conventionally, after focusing, laser beam positioning is through moving 
either the laser head or the workpiece. An alternative beam positioning method is 
the 3D scanner technology for solid-state and fiber lasers whose excellent 
beam quality enables long focal lengths. For example, moveable mirrors and 
optical elements, such as Z-adjustable collimator unit, galvanometric scanner, and 
focusing device, are integrated in the 3D scanner. Figure 8 gives two examples of the 
design of the scanners from ScanLab (ScanLab 2013; Schulze and Lingner 2013). 

The scanner allows the use of laser as a highly dynamic and flexible tool and 
enables fast, precise, and fine positioning of the laser spot in three dimensions. The 
scanner can also be guided over a workpiece in conjunction with a robot. The robot 
and the scanner synchronize their movements in real time to realize remote 3D 
welding and “welding on the fly.” 

Having high power and small laser spot, laser power intensity can reach as high 
as 10 " W/cm . Thus, laser beam welding is one form of high power density 
welding processes. There are two basic modes by which laser welding can occur 
in metals: heat conduction welding and keyhole (deep penetration) welding, which 
are illustrated in Fig. 9. 

Heat conduction welding usually occurs when laser power intensity is less than 

c o 

10 W/cm . In heat conduction welding, the materials to be joined are melted by 
absorption of the laser beam at material surface. Welding penetration depths in this 
mode are typically below 2 mm. 

Above the threshold intensity of 10 W/cm , local heating reaches evaporation 
temperature and laser-induced plasma develops. The plasma absorbs an increased 
quantity of laser radiation. A vapor cavity (keyhole) forms and allows the laser 
beam to penetrate deep into the material through multiple reflections inside the 
keyhole. The keyhole, which is moved through the joining zone and is prevented 
from closure due to the vapor pressure, is surrounded by the largest part of the 
molten metal. The residual material vaporizes and condenses either on the keyhole 
side walls or floes off in an ionized form. Given sufficient laser power, weld depth 
of up to 25 mm in steel can be achieved. 

For laser beam welding, the most critical parameters include laser type and 
mode, laser power and/or pulse width and frequency for pulse laser, laser beam 
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Fig. 8 Schematic illustrations of 3D laser scanners 
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Fig. 9 Schematic illustrations of heat conduction and deep penetration modes in laser welding 
(ISF Aachen 2013a) 
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Table 2 Advantages and disadvantages of laser beam welding 



Advantages 

Disadvantages 

Process 

High power density 

High reflection at metallic surfaces 

Small beam diameter 

Restricted penetration depth (<25 mm) 

High welding speed 

Noncontact process 

Workpiece 

Minimum thermal stress 

Expensive edge preparation 

Little distortion 

Exact positioning required 

Welding at positions 
difficult to access 

Danger of increased hardness and cracks due to 
high cooling rate 

Different materials 
weldable 

Aluminum and copper difficult to weld due to 
high surface reflectivity 

Installation 

Short cycle time 

Expensive beam transmission and forming 

Operation at several 
stations possible 

Power losses at optical devices 

Well suitable to 
automation 

Laser radiation protection required 

High investment cost 

Low energy efficiency especially for C0 2 and Nd: 
YAG laser 


quality, focusing condition, welding speed, shielding gas, and joint type. Compared 
to conventional welding methods, the most important advantages and disadvan- 
tages of laser beam welding are summarized in Table 2 (ISF Aachen 2013a). 

Hybrid Laser Arc Welding (HLAW) 

One challenge of laser beam welding is that it requires precise workpiece fit-up and 
accurate alignment of the beam with the joint edge. To address this challenge, hybrid 
laser arc welding (HLAW) is developed coupling the gap bridging capability of the 
traditional arc welding process. In hybrid laser arc welding process, the laser beam as a 
high power density heat source commonly serves as the primary heat source enabling 
deep penetration mode welding while the arc as the secondary heat source controls the 
weld bead formation. As a secondary heat source, either GMAW or GTAW can be 
chosen depending on if any addition of filler material is required. The weld penetration 
increases with an increase in laser energy, whereas the weld width increases with the 
arc energy. For different applications, there exists an optimum energy ratio between 
the two heat sources. Figure 10 illustrates hybrid laser arc welding process with the 
laser leading the GMAW arc (The Lincoln Electric Company 2011). 

Hybrid laser arc welding process combines the advantages of both laser beam 
and arc welding. They are as follows: 

• High speed and deep penetration of laser beam welding 

• Wide molten pool of arc welding reducing the demands on edge preparation and 
beam-edge alignment 

• Improved weld metallurgy due to slower cooling rate lowering the occurrence of 
cracks and too high hardness welds 
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Hybrid Laser GMAW Welding Process 

(Laser Leading Arc) 



Fig. 10 Schematic illustration of hybrid laser arc welding process (The Lincoln Electric Com- 
pany 2011) 

The materials that may be welded with laser or hybrid laser and arc range from 
unalloyed and low-alloy steels up to high-quality titanium and nickel-based alloys. 
High carbon content steels may have a problem of too high hardness weld due to the 
high cooling rate of laser beam welding. Aluminum and copper alloys are relatively 
difficult to be welded by lasers due to its high reflectivity to laser and the loss of 
low-evaporation temperature alloy elements such as zinc and magnesium. Both 
laser beam and laser arc hybrid welding find applications in automotive, aerospace, 
electronic, steel, and medical industries. 

Mainstream robot suppliers such as ABB, KUKA, Kawasaki, Yaska wa, and Fanuc 
and some welding power source suppliers, such as Fronius and Lincoln, work 
together with laser system manufacturers including Rofin-Sinar, Trumpf, and IPG 
to provide robotic laser and hybrid laser arc welding systems. The system is com- 
prised of robots, fixtures and positioners, laser beam sources, controllers, laser 
transmission, focusing, and positioning devices, and sensing and monitoring systems. 


Friction Stir Welding (FSW) 

Friction stir welding (FSW) is a solid-state welding process. Friction stir welding is 
performed by rotating a nonconsumable tool which is plunged into the material to 
be welded until the shoulder of the tool contacts the top surface of the plate. The 
tool is then traversed along the joint line to create a solid weld. The material around 
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Downward force 
maintains pressure and 
creates friction heat 



Rotating shoulder 
creates friction heat and 
welding pressure 


Direction 



Fig. 1 1 Schematic illustration of friction stir welding process and tool (ESAB 2013) 


the tool becomes softened and highly plasticized from the frictional heat generated 
during the process and is carried around the tool so that there is a complete mixing 
of materials from the two plates. In full penetration welding the tool probe extends 
almost through the thickness of the plates to be welded, but in partial penetration the 
probe can be much shorter. Because there is no melting or resolidification of 
material, there is usually lower distortion introduced by this welding process 
compared with other welding processes. The process can be used to make butt 
and lap joints in any orientation. Due to the high forces developed during the 
process, the plates must be rigidly clamped during welding. A tool tilt angle is 
usually used to aid material compaction behind the FSW tool. Figure 1 1 illustrates 
the friction stir welding process and tool (ESAB 2013). 

Initially, friction stir welding is mainly used for joining low melting point metals 
and alloys, such as aluminum, magnesium, and copper alloys. And the tool material 
normally used is high carbon steel. With the development of tools which can 
withstand high temperature and stress, such as tools made of W-Rhenium, poly- 
crystalline cubic boron nitride (PCBN), and ceramic, friction stir welding can now 
be used for joining of higher melting temperature materials including steels, 
titanium, and nickel-based alloys. 

Friction stir welding process parameters include tool geometry, tool rotational 
and traverse speed, tool tilt angle and pin plunge depth. Compared to conventional 
arc welding processes, friction stir welding has the following advantages: 

• Low thermal distortion 

• Good dimensional stability 
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• No loss in alloying elements 

• Excellent metallurgical properties 

• Fine micro structure 

• No cracking, porosity, and other welding defects 

• Environmental friendly /green welding technology 

• No shielding gas 

• No use of chemical for cleaning 

• No slag and fumes 

• Highly energy efficient 

Friction stir welding is easy to automate to weld in any position and follow 
complex 3D weld paths. Robotic friction stir welding is being developed by robot 
companies like Kawasaki, Fanuc, and ABB; welding companies and research 
institutes such as ESAB, TWI, GKSS, and CRIQ; and system integrators like 
Friction Stir Link, Inc. 


Robotic Welding Systems 

Depending on the welding processes, robotic welding system can be mainly 
classified in arc welding robots, spot welding robots, and laser processing robots. 
Each of them comprises the typical components such as robot manipulator, welding 
equipment, and positioner. In terms of the configuration, the robot welding system 
can be in the form of cell, where typically the welding robot is stationary, and the 
gantry robotic system where the robot arm is transferred to cover the large and 
usually stationary workpiece. 


Types of Robotic Welding System Configurations 

Key feature of welding robot is a robot manipulator with its end effector as a 
welding torch. The robot manipulator typically is mounted at a fix location 
surrounded by the auxiliary equipment and the workpiece holders. Such robotic 
configuration is called a robotic cell. There are other types of configurations for 
welding operation depending on the workpiece size and welding environments. 

Stationary Robot and Workpiece 

The most common configuration of a robotic welding system is one that all the 
components occupy within a confined area. Both the robot and the workpiece 
supported, respectively, by the base and positioner do not move relative to each 
other. 

Often it is a standard product supplied by the robot manufacturers for job shops 
of metal parts and assemblies fabrications. There are portable welding cells where 
all the components are mounted on a platform and enclosed within a cabinet, as 
shown in Fig. 12. A robot cell comprises of a robot arm, a workpiece positioner, the 
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Fig. 12 Robotic welding cell (Kawasaki Robotics) 


welding equipment, and welding safety curtains. Depending on the productivity 
requirements and the size of the workpiece, more than one robot can be integrated 
in the cell. Coordinated motions are preprogrammed for robot(s) and the workpiece 
positioners to reduce the cycle time and to facilitate the weld seam access as well as 
allowing better weld quality. 

Moving Robot and/or Workpiece 

When the welding points or seams are beyond the reach of the robot, the robot or the 
workpiece can be mounted on a gantry, a track, or a column. Such a configuration 
has the advantages of enlarged work space and increased flexibility and productiv- 
ity. For example, it is possible that multiple workpieces can be welded by a single 
robot or multiple robots, thereby idle time in workpiece transferring can be mini- 
mized. An advanced feature of such configurations is that all the motions, including 
the robot, the positioner, and the transfer units, are coordinated or synchronized to 
maximize the arc/welding time. 

The decision if the robot which carries the welding equipment or the workpiece 
should be made stationary depends on the weight and size of the workpiece. 
Generally the lighter of the two will be made moving. Example of such arrange- 
ment is the automotive assembly line where the robots are stationary and the car 
assemblies moved by the conveyor. Such a configuration is often found in high- 
volume production of large workpieces. 
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Fig. 13 A track-mounted welding robot 

Track-Mounted Robot 

A robot welding system with a track-mounted robot offers large working range 
along the axis of the track. Such configuration, besides the ability to deal with large 
workpiece, also provides flexibility to cater for a range of workpiece sizes. The 
most common applications of such configuration are welding of automotive panels, 
tractor frames, building frames, structural beams, and container panels. Figure 13 is 
a top view of a welding system where a robot together with wire drum and cleaning 
station are mounted on a track for welding a long work price. 

Gantry-Mounted Robot 

Gantry suspends the robot above the workpieces and reaches the welding seams at 
a different angle from floor-mounted robots. Such configuration allows the robot to 
move across the workpiece so that wider and taller workpieces can be welded. 
Long workpieces can be welded if the gantry is mounted on the tracks to form 
a robotic welding line. Such welding gantry system can be found in shipbuilding 
industry for panels and ship hull fabrications. One or more robots (Fig. 14) 
can be mounted on the gantry to achieve high throughput with simultaneously 
welding. 

Column-Mounted Robot 

A column can move the robot in a vertical direction so that tall workpieces 
can be welded. A column may occupy less space on the production floor than the 
floor-mounted robot with the welding cables being placed in a duct inside 
the column. The configurations include stationary, travelling, rotary, and rotary 
cum travelling columns. An example of column-mounted robot is illustrated in 

Fig. 15. 
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Fig. 14 Two welding robots suspended on a gantry (www.kranendonk.com) 



Fig. 15 Column-mounted welding robot (www.igm-group.com) 
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Fig. 16 A frame structure being clamped on positioner 


Workpiece Handling System 

Workpieces can be either moved or stationary during or between welding opera- 
tions. Large workpieces or assemblies such as structural steels and beams are 
placed on the fixture system for a moving robot to access their joints. Smaller 
workpiece clamped on the fixtures can be orientated by a positioner to provide the 
best angle for the welding torch. 

Fixtures 

Fixture is a device that holds the workpiece steadily in place during the welding 
operation. It is usually fixed on a positioner which may orientate or move the 
workpiece according to the welding operation needs. The fixture is designed 
according to the size and configurations of the workpiece, the location of the joints, 
the type of welding processes, as well as for easy of loading and unloading of the 
workpiece to maximize the productivity requirements. Further design consider- 
ations include protection from spatter, fixture materials selection, welding circuit 
optimization, simplicity for easy maintenance, and low lost. 

A fixture clamps the workpieces at strategic locations to ensure the joints are 
accurately and steadily presented to the welding robot. A more complex type of 
fixtures, often called assembly fixtures, holds all loose components of the workpiece 
together to form the joints. When workpiece is too large or is produced in low 
volumes, it is preferred the components are pre-tacked before being loaded to the 
fixture. Figure 16 shows a workpiece clamped on a positioner for welding by the robot. 
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Fig. 1 7 (a) Toggle clamps, (b) locating pins, (c) clamp straps, (d) air-powered clamps 

Clamping and Locating Devices 

The fixtures come with many types of clamping and locating options for different 
operation needs. Simple manual clamping such as swing and plunger clamps is typically 
applied for manual loading/unloading operations for low-volume production. Pneu- 
matic- or electric-driven clamps allow the workpiece to automatically hold the fixtures, 
which increase productivity with rapid loading and unloading for high-volume produc- 
tion. There are sensors in the fixture to monitor the workpiece presence. More sophis- 
ticated sensors such as range sensors are capable to detect the accuracy of the workpiece 
position. Figure 17 illustrates some commonly used fixture for holding the workpiece. 

Workpiece Positioners 

Workpiece positioner is one of the most common equipment in the robotic welding 
system. It moves or orientates the workpiece for the following reasons: 

(i) To increase the productivity by simultaneously doing loading/unloading and 
welding of the workpieces. This can be achieved using a turntable. 

(ii) To orientate the workpiece to maximize the deposition rate by coordinating 
the motion of the robot. 
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(iii) To turn or roll the workpiece using the turning rolls and idlers to allow the full 
circumferential welding. 

(iv) To allow the robot reach the hard-to-access positions by present the welding 
points within the work space of the robots using the manipulators. 

Depending on the applications, the positioner can be in the form of a turntable, a 
turn-tilt positioner, and even a six-DoF robot manipulator. The key design consid- 
erations of the positioners include the mechanical configuration, degrees of free- 
dom, control types, load carrying capacity, and working environment. Some 
applications where the workpiece is large and too heavy for any fixture or positioner 
to handle require the welding robot to provide all the necessary seam accessibility 
and feed rate accuracy to achieve the quality weld and productivity. 

(a) Turntables 

One of the most common turntables is a horizontal turntable that allows 
robotic welding at one side while the loading/unloading of the workpiece is on 
the opposing side. The loading and unloading can be carried out by operator 
operating outside the work envelope of the robot. A screen divider will safe 
guide operator from arc flash. 

There are several variations of the turntables in terms of rotational axis 
orientation, types of axis (rotation or linear), and number of axes. Figure 18 
illustrates some of the standard configurations of the multi-axis turntable. 
Customized configuration can be provided to cater for the accessibility of the 
welding seams in the products. One design feature of all the multi-axis turnta- 
bles is that all the axes are intersected or nearly intersected at one common 
point. With that, the workpiece, when moving around the axes, will approxi- 
mately remain within the work envelope of the welding robot. 

Typical specifications of the turntable are as follows: Tilt axis rotation range is 
about 45--135-; turntable axis allows full revolutions; and the load capacity of the 
turntables ranges from a few kilograms up to hundreds tons. 

(b) Handling Robot 

Robot can be used as a workpiece manipulator by working in coordination 
with one or more welding robots (Fig. 19). This method, often called jig-less 
robot welding, offers the best possible welding path flexibility by giving access 
to hard-to-reach seams. It is most cost-effective when a high variety of work- 
pieces are involved since extra jig setups are reduced. The approach also 
potentially allows the fastest welding cycle time. In selecting the workpiece 
handling robots and grippers, the key features shown in Table 3 should be 
considered. 


Welding Robots 

A robot is programmed to move the welding torch to the weld points or along the 
weld path in a given orientation, motion pattern, and speed. The robot is typically of 
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Fig. 18 Workpiece positioners (a) turntable positioner, (b) Ferris wheel positioner, (c) H-frame 
positioner, (d) turn-tilt positioner (www.daihen-usa.com) 


the articulated configuration with six DoF to orientate the welding torch according 
to the requirements of the welding process. Most of the robots currently use AC 
servomotors with servo-controlled system giving the advantages of high accuracy 
and maintenance free. Some of the robot arm comes with a hollow duct allowing the 
welding wire, coolant cable, and power wire to pass to maximize the maneuver- 
ability and welding work space. 

Robot Arm 

The application requirements in terms of the workpiece size, welding process, and 
operational requirements such as space and cycle time should be considered when 
selecting the robot. A robot with the proper specifications such as work envelope, 
degree of freedom, repeatability, payload, weight and mounting limitation will be 
able to meet the application requirements. Figure 21 shows three typical commer- 
cially available robot arms. 

Table 4 summarizes typical robot specifications and their impacts to the welding 
requirements. 
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Fig. 19 Robot (right side ) is holding the workpiece which is being welded by the two robots on 
the left side (www.weldingrobotics.com.uk) 

Robot Controller 

The robot controller, besides playing the key role in achieving the motion specifi- 
cations, should be able to cater the needs of the welding process requirements. 
Advanced robot controllers are able to control additional operational tasks such as 
workpiece positioning, production sequencing, and worker safety. 

One of the main components of the robot controller is the human-machine 
interface. It is mainly in the form of graphical user interface (GUI). The GUI can 
appear in the displays of stationary monitor or on a handheld teach pendant. The 
former allows the off-line programming of the welding robot and the latter lets the 
operator teach the welding sequence on the workpiece point by point. 

Using the GUI interfaces, the operator can teach the robot workpiece position, 
welding path, and sequence, as well as the welding parameters. Often the work- 
pieces and the welding process information can be obtained from database which 
has been established prior to repeated operations. Table 5 lists the main function- 
alities and specifications of robot controllers and their implications to the welding 
operations. 


Welding Equipment 

The main component of the welding system is the power source. Compared with 
manual welding, automatic welding equipment involves additional hardware com- 
ponents such as wire drum, wire feeder, cooling unit, torch cleaner, and welding 
torch. In addition, the components must have the necessary features and controls to 
interface with the main control system which is often the robot controller. Figure 22 
shows the system configurations of a typical robotic welding system. 
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Table 3 Features of workpiece handling robots and gripper 

Workpiece handling robot 


Robot types 

A 6-DoF articulated robot offers the maximum flexibility in 
manipulating and handling a high variety of different workpieces 

Payload 

Robots with payload up to 1 tone is available for handling large 
range of workpieces in different sizes 

Controller 

One robot controller can be shared by the handling and the welding 
robot(s) for easy robot programming 


Robot gripper 


Gripper actuators 

Pneumatic actuated gripper is the most widely used in majority of 
industrial applications. Such gripper can be activated for opening 
and closing for gripping a range of object sizes 

Electric actuated gripper is powered by electricity. It has the advantage 
of controlling the opening size of the gripper and thus providing the 
highest flexibility in holding a wide range of part sizes and shapes 

Multi-finger grippers 

Two-finger gripper in general is able to pick parts of simple 
geometry such as pipes and blocks which has opposing surfaces 
where the two fingers make contact. A robot gripper with hand- 
liked capabilities will have three or more fingers for picking 
complex-shaped objects (Fig. 20). A single such gripper is able to 
handle different types of parts without using tool changing system 

Gripper payload and 
fingertip force 

High payload (up to 10 kg is commercially available) and high 
fingertip force (up to 40 N) are two important specifications to be 
considered for the application needs 


Overall system design 


Two robots for parts 
handling 

The setup involves two robots with one holding each of the two 
parts to be welded together. It eliminates the use of any fixed jig 
thus providing the maximum flexibility 

Parts loading and 
unloading 

Parts such as brackets or pipes to be joined to a larger part are arranged 
manually in predetermined locations in a manner for easy pick-up by 
the part handling robots. Such setup is generally cost-effective 
compared with a setup where the smaller parts are randomly placed in 
a bin. A vision-guided handling robot will pick up the parts for 
welding on to the larger part 


Fig. 20 Robot griping a 

BiW - 


Fig. 20 Robot griping a 
stamped bracket being 
welded (Source: Robotiq) 








i 
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Fig. 21 Welding robot arms, (a) robot with a welding power source, (b) arm with hollow wrist for 
cables, (c) ceiling mounted robot 


Table 4 Typical robot specifications and their impacts to the welding requirements 


Items 

Typical 

specifications 

Design considerations 

Reach 

1-2 m 

Workpiece size 

Space availability 

Cost 

Number of axes 
(degree of 
freedom) 

Typically six; seven 
axes robots available 

Complexity of workpiece in shapes and grooves 

Range of workpiece types 

Cost 

Payload 

Around 6-10 kg; up 
to about 400 kg 

Lighter welding torch for arc welding 

Heavy welding tool for spot welding 

Repeatability 

0.03-0.1 mm 

Process welding requirement (multi-passes) 

Velocity of tool 
center point 

(TCP) 

Typically 2 m/s; up 
to 5 m/s 

Welding velocity in general is limited by the 
welding process rather than the velocity of the 
robot arm 

Robot weight 

Around 30-100 kg; 
up to 300 kg 

Moving robot carried by floor track, gantry, or 
column systems 

Mounting 

position 

Upright, inverted, 
wall or angle 
mounted 

Workpiece size, space for robot 

Hollow wrist for 
cables and hoses 

Available 
specifically for 
welding robots 

Longer-lasting cables and hoses 

Closer to full robot motion performance 

Power and signal 
cables 

Life span; flexibility 

Physically protected against heat and spatters 
and interference 


Power Sources 

The power sources supply the electric power necessary for arc welding processes. 
Their performance directly determines the weld quality as they are responsible for 
igniting the arc, allowing stable transfer of the melted electrode material, and 
generating the amount of spatter. 

There are power sources designed for a specific welding method such as GMAW 
or GTAW as well as those cater for more than one welding methods for wide range 
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Table 5 Robot controller features and their implications to welding process and operation 


Robot controller 
features 

Effect on standard 
functions 

Control for welding 
process 

Control for welding 
operation 

PC based 
GUI with jog dial 

Allow smooth path 
motion - to provide the 
required precision in 
position, orientation, 
and velocity of the 
torch for high-quality 
welding 

Fully integrated with 
the power source to 
allow optimized 
welding process 
execution 

Coordinated motion 
of the workpiece 
positioner or the 
transfer system of 
the robot 

64-bit CPU 

Control of dozens 
of axes 

Allow coordinated and 
synchronized control 
of multiple robots and 
positioners 

Welding error 
handling, production 
management, and 
sensor interfacing for 
seam tracking 

Multiple robots 
synchronization 

User I/O more than 
20; serial and 
parallel 
connections 

Ethemet/IP for 
interfacing with 
external devices such 
as welding equipment 
and seam tracking 
sensors 

Control through-the- 
arc sensing 

Production 
management and 
quality control for 
line production 

Common 
interfaces: 
Ethemet/IP; 
Fieldbus such as 
DeviceNet, 
Profuse, CC-Link 

High-bandwidth 
communication 
capabilities - fully 
integrated with the 
power source 

Monitoring of 
welding parameters 
including voltage, 
current, wire feed 
speed, and gas flow 

Control loading and 
unloading of 
workpiece 

Control the fume 
extraction hoods 

Monitoring of 
production results 

Control safety 
screens and doors 



Fig. 22 Welding system configuration: ( 1 ) teach pendant, (2) robot controller, ( 3 ) welding power 
source, ( 4 ) shielding gas cylinder, (5) air compressor, (6) wire feeder, (7) robot, ( 8 ) clutch, (9) 
welding torch, ( 10 ) wire spool, ( 11 ) wire drum, ( 12 ) workpiece positioner, ( 13 ) torch cleaning 
station, ( 14 ) TCP correction 
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Rectifier inverter Transformer Rectifier 



Control 


Fig. 23 Schematic of the primary-switched inverter of the power sources 


of welding applications. For robotic welding, the power sources used are in general 
capable to offer most of the welding processes. Such power sources, called 
multiprocess power sources, allow the user to select among welding processes in 
GMAW, FCAW, MCAW, and GTAW by changing the accessories and control 
software. In general, power sources used in automatic welding has a higher duty 
cycle allowing higher productivity than the manual welding processes. 

There are different types of power sources based on how the main supply is 
converted to the required current and voltage characteristics for the specific welding 
process. More recent power sources use the rectifier- inverter circuit with power 
semiconductors making them small in size, lightweight, and low cost. As shown in 
Fig. 23, in this type of power sources, the incoming AV current is first rectified; then 
the high-voltage DC output of the rectifier is fed to an inverter which converts the 
DC to high-frequency AC. The high-voltage and high-frequency AC is reduced by a 
transformer to a level suitable for welding and finally rectified to produce a DC 
output. Control is realized by pulse-width modulation. 

Typical features of the welding equipment for the different arc welding pro- 
cesses are listed in Table 6. 

Welding Torches 

The main functions of the welding torch are to direct welding electrode into the arc, 
conduct current to the electrode, and provide shielding gas to the weld pools. 

In the welding torch (Fig. 24a) for GMAW or MIG welding, a contact tube 
transits the current to the consumable electrode, which is in the form of wire being 
fed into the welding pool. The torch can also supply the shield gas according to the 
needs of the welding method. Gas-cooled torch is employed by the process with low 
current and light duty cycle of up to 60 % while water-cooled is used for heavy duty 
cycle of up to 100 % and high current. GMAW robotic welding torch may have 
twin-wire type for higher deposition rate and welding productivity. Both wires are 
fed to near a single point to give a single weld pool. 

The torch (Fig. 24b) used for GTAW welding has a nonconsumable tungsten 
alloy electrode surrounded by the flow of shielding gas. Torches with welding 
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Table 6 Features of power sources for different welding processes 



GMAW 

FCAW 

MCAW 

GTAW 

Power sources 

Single process or multiprocess 

Operation modes 

Constant voltage 

Constant current 

DC+ (DCEP) 

DC- (DCEN) 

DC- (DCEN) 

DC+ (DCEP) 


AC 


Synergic 

Pulsed 

Wire feeder and spool 

Electrode (wire) is pull and push 
with direct current motor 

Filler may or may not 
be used depends on the 
applications 

Electrode 

Bare 

metal 

wire 

Flux- 

cored 

wire 

Metal- 
cored wire 

Tungsten 

Shielding gas supply system 

Ar, C0 2 , or 
mixtures of inert 
and active gases 

Argon- 
C0 2 , or 
Ar-C0 2 - 

o 2 

Inner gas (Ar or He) or 
a mixture of gases 

Cooling units 

Gas or water cooled depends on the duty cycle and power 
output 

Welding torch option: 
escapement to prevent damage 
in case of collision 

Wire comes out from the contact 
tip 

Include the tungsten 
electrode 

Deliver shield gas 

Typically water cooled 

Option: twin wire for higher 
deposition rate and welding 
speeds 



Fig. 24 Welding torch (a) GMAW/MIG torch and (b) GT AW/TIG torch 

current up to 200A are generally gas cooled, while those with above 200A are water 
cooled with circulating cooling water. 

Torches for robotic welding usually come with a bend barrel to provide good 
access to the workpiece seams. In addition, a collision clutch is employed to prevent 
damage to the robot arm and the welding torch in the event of a collision. 
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Fig. 25 Wire feeder 
mounted on robot arm (www. 
lincolnelectric.com) 



Wire Feeders 

Wire feeders add filler metal during robotic welding. They are able to provide the 
required feed rate for the different welding processes. The wire feeder usually is 
mounted on the robot arm near the welding torch to have better response time as 
shown in Fig. 25. It should match to the type of the power source as well as be 
controllable by the robot controller. 

For constant current power source, a voltage-sensing wire feed system is used in 
which feed rate may be changing continuously, while the constant-voltage scheme 
requires a constant feed rate during the welding operation. For GTAW welding, the 
filler wire may not come through the nozzle of the torch. Instead, the filler wire is 
fed at an offset position into the arc area by a so-called cold wire feeder. 

Torch Cleaner and Tip Cutter 

The nozzle of the welding torch requires to be cleaned periodically for proper and 
reliable operation. This is done automatically especially for high duty cycle 
welding. During cleaning, the torch is moved to the cleaning station where a rotary 
reamer is inserted into the nozzle. Anti-spatter agent may be sprayed into the nozzle 
and the stick-out part of the wire is cut to the right length. The cleaning operation is 
activated at the required interval by the robot controller. 

Torch Calibration Unit 

It is necessary to regularly confirm the position of the tip of the nozzle which may be 
shifted due to improper programming of the robot, a collision on the torch, or worn contact 
tip. The torch calibration is done with the TCP (tool center position) calibration station 
using the calibration function in the welding programming. The TCP calibration unit 
(Fig. 26) is normally installed on the torch cleaning station to form a torch service center. 


Robot Welding Safety 

By replacing the human welder with robots, potential welding hazards are mini- 
mized. The potential welding safety hazards include bum, arc radiation, air 
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Fig. 26 TCP calibration unit 



Table 7 Some relevant safety standards for robotic welding 


Standard number 

Description 

AWS D16.1 

Robotic arc welding safety 

ANSI Z49.1 

Safety in welding, cutting, and allied processes 

CAN/CSA-W 1 17.2- 
06 (R2011) 

Safety in welding, cutting, and allied processes 

ISO 1021801-1: 2011 

Robots and robotic devices - safety requirements for industrial robots 
- part 1 : robots 

ISO 1021801-2: 2011 

Robots and robotic devices - safety requirements for industrial robots 
- part 2: robot systems and integration 

DIN EN 775 

Manipulating industrial robots - safety 


contamination, electric shock, fire and explosion, and other hazards. However, 
robot itself can cause fatal bodily injury. It is advisable that the relevant safety 
standards are followed during the installation, commissioning, testing, program- 
ming, operation, and maintenance stages. Table 7 lists some of the safety standards 
relevant to robotic welding. 

The introduction of robots requires proper safety features in order to protect 
the operator as well as others working nearby. Each robot installation must be 
carefully planned from safety viewpoint to eliminate hazards. Barriers may be 
designed to completely surround the robot welding area to prevent people from 
entry into the area. All entries must be protected by interlocked doors. Welding 
curtain may be put up to protect the people from exposure to harmful ultraviolet 
light and hot spatter. Proper fume extractor should be installed to remove 
welding fume. 
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Sensors in Robotic Welding 

Welding process is exposed to both geometry and process disturbances. Geometry 
disturbances are weld joint and path deviations caused by part variation, inaccurate 
joint preparation, fit-up and alignment error, and thermal distortion. Process distur- 
bances are process parameter shifting due to machine and device tolerances. In 
manual welding, the welder notices these disturbances and corrects them manually 
according to strategies learned and gained by experience. 

For fully automated robotic welding, these disturbances can be detrimental to 
both accurate weld path following and welding quality assurance. In order to 
improve productivity, ensure quality and lower cost for robotic welding operations, 
sensors are required to provide both geometry and process data for seam tracking 
and process monitoring. Ultimately, with the sensor feedback, process and motion 
parameters can be adjusted online to cope with the disturbances. 

Advances in sensor and electronics technologies have enabled sensors perfor- 
mance and reliability viable and cost competitive in harsh welding environment. 
There are a wide range of sensors for geometry detection and process monitoring, 
even for molten weld pool (Luo and Devanathan 2002), arc radiation (ISF Aachen 
2013b), and laser-generated plasma (during laser beam welding) (Luo and Zeng 
2002) analysis. The most mature and widely used sensors for industrial applications 
are those to obtain geometry data and process parameters for weld seam finding and 
tracking and weld and arc monitoring. 


Weld Seam Finding 

Weld seam finding is to detect the edge or start of a weld seam. The data can be used 
for purposes like confirming a product has been loaded correctly, defining a position 
in space to execute a weld, offsetting a weld seam that varies from batch to batch, 
and et al. The primary sensors for weld seam finding are tactile (touch) sensing and 
proximity sensor. 

Tactile Sensing 

Tactile sensing is a well-established method for weld seam finding. In principle it is a 
voltage applied to the welding gas nozzle, welding wire or independent probe. The 
voltage shorts out as it makes contact on the component when touching down in X, Y, or 
Z plane. The short circuit is recognized by either the robot controller or welding power 
source, and then comer for the start of a weld or weld edge height can be deduced. 

Tactile sensing requires a robot to touch the welding wire or the sensing probe to 
the workpiece in several places to determine workpiece location and orientation. It 
is limited to grooves with large dimensions and relatively straight seam path. 

Proximity Sensor 

Proximity sensor is a noncontact sensor and it operates using an analogue inductive 
field. The simplest type is a ring coil. If alternating current flows through the coil, a 
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magnetic field is generated. The magnetic field weakens when the coil approaches 
the workpiece which is metallic and electrically conductive. With multi-coil 
arrangement in one sensor, the position of the weld groove, as well as the angular 
and rotational data, can be obtained. 

The prominent welding power source manufactures and most mainstream 
robotic welding system suppliers are all able to provide weld seam finding func- 
tionality (Robotics 2012). 


Weld Seam Tracking 

Weld seam tracking involves weld groove and seam geometry measurement. The 
measured data allow for not only robot trajectory shifts but also adaptive control 
through real-time process parameter adjustments to cope with joint geometry 
disturbances. Weld seam data can be further used for multi-pass welding applica- 
tions. In multi-pass welding, weld seam tracking is used during root pass. The 
positional offset data and the parameter adjustment values from the tracked root 
pass are stored. The stored data can be played back on consecutive passes, which 
are welded according to a preprogrammed offset from the stored root pass. 

There are two main methodologies that can facilitate weld seam tracking. They 
are laser scanning and through-the-arc sensing, though stereo vision is an alterna- 
tive option. 

Laser Scanning 

Laser scanner for seam tracking is based on the principle of laser triangulation. A 
laser diode projects a visible laser stripe by either laser line projection or laser spot 
oscillation. The reflected light is received by CMOS or CCD sensors. By detecting 
changes in the position and shape of the reflection, the position of various points 
along the weld groove is measured. Thus, Laser scanning provides 2D information 
of weld width and depth for seam tracking. Figure 27 illustrates the measurement 
principle of a laser scanner (Juneghani and Noruk 2009). 

Technical limitation of laser scanner is its reliability when scanning on highly 
reflective surfaces. Other considerations need to be reviewed include relatively high 
cost, maintenance issues due to harsh welding environment, and physical con- 
straints which may limit torch accessibility to the weld joint. 

Laser scanning-based seam tracking system has been widely used for both 
robotic arc and laser welding. One can either work with sensor manufacturers 
such as SICK, Keyence, Micro-Epsilon, and MEL Mikroelektronik to build weld 
seam tracker or purchase integrated seam tracking system and software from 
suppliers like Meta Vision, Servo-Robot, Scan Sonic, OTC Daihen, and Moto 
Eye etc. 

Through-the-Arc Sensing 

Through-the-arc sensing requires that the robot weaves across the weld joint during 
welding. Welding currents are measured during arc weaving to calculate both the 
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Fig. 27 Schematic 
illustration of the working 
principle of a laser scanner 
(Juneghani and Noruk 2009) 



Part A Part B 


Fig. 28 Arc weaving for 
through-the-arc sensing 
(Robotics 2012) 






actual seam position and the arc height. These data are then used to adjust the 
robot’s path through lateral and vertical compensation. Through-the-arc sensing is 
often used in conjunction with tactile sensing. The tactile sensor finds the starting 
point of the weld. Then, through-the-arc tracking takes over to keep the welding 
torch in the joint after the arc is struck. Figure 28 illustrates the arc weaving for 
through-the-arc sensing (Robotics 2012). 

Through-the-arc sensing requires no external probes, and all information is 
gathered through the welding arc itself. It is a very well-proven technology and is 
very reliable and cost-effective. 

The only limitation is the requirement that the robot must superimpose a weave on 
the weld in order to get the information it needs. Therefore, it is most useful for simple 
fillet welds or deep groove welds, while its usage for butt and lap welds is limited. 
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Table 8 Sensors and 
transducers for welding 
parameter measurement 


Welding parameters 

Sensors/transducers 

Arc current 

Hall effect sensor or current shunt 

Arc voltage 

Voltage probe 

Wire feed speed 

Optical tachometer/encoder 

Travel speed 

Optical tachometer/encoder 

Gas flow rate 

Calorimeter or gas flow meter 

Temperature 

Thermal-couple or infrared sensor 


Most mainstream robotic welding system suppliers are all able to provide 
through-the-arc sensing functionality such as ABB WeldGuide, Fanuc ArcTool 
TAST, and Yaskawa ComArc. 


Weld/Arc Data Monitoring 

Weld/arc data monitoring refers to the real-time measurement of various welding 
process parameters. These parameters include arc current, arc voltage, wire feed 
speed, travel speed, gas flow rate, temperature, and weld time. When welding in 
pulse mode, pulse parameters like peak and background current and voltage and 
pulse frequency can also be measured. Various sensors and transducers are used for 
welding parameter monitoring and some examples are listed in Table 8. 

Apart from welding process data, production data like overall equipment effec- 
tiveness, downtime, arc-on time, as well as performance factors such as deposition 
rates, wire use, gas use, and weld faults can also be provided. 

With the advancement of database technology, the data acquired can be used for real- 
time monitoring, arc characteristics evaluation, and production status and trend analysis. 

There are both fully integrated and third-party solutions available. Systems fully 
integrated into the power source are provided by welding power source suppliers 
like Miller, Lincoln, and Fronius. These systems offer seamless integration and 
minimal start-up time. Third-party systems are advantageous for major established 
welding power sources without the option of a built-in solution. Third-party 
systems can work as a stand-alone unit or can be linked to a PC through Ethernet 
or USB. They include ARC Agent™ from Impact Engineering Inc., ADM IV™ 
from Computer Weld Technology, Inc., and ALX II from Partek Laboratories Inc. 


Robotic Welding Programming and Software 

Robotic Welding Programming Through Application Software Tools 

Robots are basically programmed through two methods: teach pendent based online 
programming and CAD model-based off-line programming. For robotic welding 
applications, apart from defining weld locations to create robot path, arc welding 
functions such as setting of torch angle and welding parameters as well as weld 
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Table 9 Examples of welding application software packages 



ABB 

KUKA 

Fanuc 

Arc welding tool 

ArcWelding PowerPac, Virtual Arc, 
Robot Ware Arc 

Arc Tech 

ArcTool 

Spot welding tool 

RobotWare Spot 

ServoGun 

SpotTool 

Laser welding tool 


LaserTech 


Seam finding and 
tracking tool 

WaveGuide 

SenseTech and 
TouchTech 

TAST 


seam finding and tracking are also needed. The welding parameters may cover 
welding, weaving, run-in, burn-back and crater fill schedules, and multi-pass 
offsets. 

Mainstream robot manufacturers all provide welding application software with 
instructions to realize the abovementioned functions. The welding application 
software can be used through both online (Ang et al. 1999) and off-line program- 
ming environment. Some application software packages examples from ABB, 
KUKA, and Fanuc are given in Table 9. There are also third-party software 
packages, such as RobotMaster, CSR from AC&E, and MOSES from AutoCam. 

A three-pass robotic welding program created by Fanuc ArcTool is given as an 
example in Fig. 29. The program includes a main program, a welding program with 
instructions for arc start and end, setting welding speed, weaving, through-the-arc 
seam tracking (TAST), and offsetting (root pass memorization (RPM) and multi- 
pass offset (MP)). 


Automatic Robotic Welding Programming 

Both online and off-line programming are time-consuming and require robotics 
knowledge and programming skills. When welding continuously changing designs 
for flexible operations, these methods are not sufficient. Some efforts have been 
made to automatically generate welding programs. 

TAT 

One example is Rians Weld from Kranedonk Productions Systems 

(Kranedonk Productions and Automatic Welding 2013; Pan et al. 2011) to 
completely automate the programming process. The automated programming sys- 
tem has two key low-level modules on which the higher-level algorithms are built 
upon. They are the robot kinematic model which is used to calculate the various 
joint angles for a given robot position and orientation and the collision model which 
determines if the robot components are in collision with the other work cell 
components. Calculation time is an important consideration for an automated 
programming system, particularly for the collision model. To check a weld path, 
the collision model can be run up to 10,000 times for a simple structure or more than 
one million times, for a complex structure. To accommodate this, the robots in the 
collision model are represented by a collection of spheres in order to minimize the 
complexity of the collision calculation. 
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Main Teach Pendant Program 


MAIN 


i: 

2 : 

3 : 


CALL VVELDQ1 1 1 ,0) 
CALL W£LD0l{2,0) 
CALL WEL001<3,0) 



Sub Program Header Information 


| End] 




Progranri'.detail 

Multi-pass, daia 

W 

Program name: WELDQ1 

Last pass: 0 Status: 

*• *■*■*#*■** 

1 Number ot passes: 

3 

2 Weld schedules: 

^-DETAIL'* 

3 weave schedules-: 

< + DETAIL*> 

4 Multi-pass oil sets: 

DETAIL-* 

5 Run-in schedules: 

<-D£TAlL-> 

€ Burn back schedules: 

DETAIL** 

7 Crater fill schedules: 

<*DETAIL*> 


WELDGf Program 


1: 

Rhl-ARhl 

2: 

R[2]*AR[2j 

3; 

IF H[2}=i. JMP LBL[3] 

5: 

C* 

IF Rf 1 1 . JMP LBL[2J 

y. 

7: 

1 ROOT PASS 

8. 

Track TAST13I RPM|3] 

9:l 

FflfS] 20mm /sec fine 

■ 

■ 

Arc Start[99] 

10: 

Weave Slne(90] 

1 1:L 

Pj4] WELD SPEED FINE 

■ 

■ 

Arc End [391 

12: 

Weave End 

13: 

Track End 

n 4 :j 

P[5] 10% FINE 

15: 

END 

1 D . 

17: 

LBLI2:FORWARD PASSES 

13: 

mp Onset pR[99JRpm[31 

19:J 

PRf3] 10% PINE 

a 

■ 

Arc Siarl|99] 

20: 

weave SinelSOl 

21 JL 

P[4j WELD_SPEED FINE 

■ 

Arc End[99l 

22: 

Weave End 

23: 

MP Oilset End 



The root pass (1 ) tracking 
information is memonzed and 
replayed with a defined offset 
for the next two passes (2. 3). 


Multi-Pass Offset Detail Screen 



Program'-detaii 
Multi-pass offsets 
Pass X{mm) V(mm) 

Z{mm) 

wrk(deg) 

Trv(deg) 

1 0.0 0.0 

0.0 

O.o 

0,0 

2 -10.0 15.0 

12.0 

■ 20.0 

5.0 

3 -10.0 -15.0 

12-0 

20.0 

5.0 


Fig. 29 Robotic welding program using Fanuc ArcTool (Fanuc 2005) 


Weld seams are identified by searching for plate edges that align with other 
plate surfaces. The start and end locations of the intersection edges along with 
the normal direction to the surface are used for tag placement and orientation. 
Once the weld tags are generated, calibration points are added along the 
weld direction and, in the case of a comer, on other required geometries. 
Linear paths between the weld and calibration points are then checked and 
optimal robot configuration and external axes position selected. Points are also 
added using a probabilistic road map (PRM) planner for robot paths that are 
difficult to access. 

T’a/t 

Rians Weld also links weld geometry with the required weld parameter spec- 

ification through a database or library of weld parameter settings and path details 
for many different types of welds and materials. 

In summary, the software determines automatically where the welds need to be, 
how the robots can reach these places, and what parameters should be used for each 
weld. Currently, RiansWeld is mainly used for the welding of complex hull 
structures and beams. The GUI of RiansWeld™ is shown in Fig. 30. 
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CfeAf interface Automatic recognition of weld $eams Automatic selection of required 



Fig. 30 Rians WeldTM software for automatic robot programming (Kranedonk Productions and 
Automatic Welding 2013) 


Fig. 3 1 Automatic robot 
programming flow through 
APG and DTPS (Valk 2011) 



Another example of automatic robot programming is Automatic Path Generation 
(APG) on top of the off-line programming software: “Desk Top Programming & 
Simulation System” (DTPS) for Panasonic robots. It is developed by Valk Welding 
B. V. (Valk 2011). APG uses data out of CAD systems, ERP, and Excel sheets to 
automatically generate complete programs for the welding robot, which in addition 
to the position of the welding torch also contain the torch angle, and the right 
welding parameters, such as the current, weaving parameters, crater filling param- 
eters, and etc. The path generated by APG is then put into the DTPS for simulation, 
reach, and collision check. The work flow of APG-based automatic programming is 
given in Fig. 31. 

With APG, it is possible to develop custom-made robot software (CMRS) to 
program different products automatically within a product family. Successful 
applications include grid and beam welding for single-piece production and con- 
siderable programming time reduction has been achieved. 
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Table 10 Some of the welding robot suppliers (Source: www.Robotiq.com) 


Robot 

company 

Robot models 

Suppliers of power sources 

ABB 

IRB140; IRB1410, IRB1520ID, 
IRB1600ID, IRB2400-10, IRB2400-16, 
IRB2600ID- 15/1.85; IRB2600ID-8/ 
2.00 

Compatible with Miller, Fronius, and 

SKS 

Kuka 

KR 5 ARC//KR 5 ARC HA; KR 5-2 
ARC HW; KR 16 ARC HW; KR 
16 L8-3 ARC HW 

Compatible with Lincoln, Miller, 
Fronius, and SKS 

Fanuc 

Arc Mate 50iC/5 L, Arc Mate OiA, Arc 
Mate lOOiC, Arc Mate 100iC/6 L, Arc 
Mate 120iC, Arc Mate 120iC/10 L, 
M-710iC/20 L 

Packaged with Lincoln welding 
power source compatible with 
Miller, Fronius, and SKS 

Y askawa/ 
Motorman 

VA14500, MAI 400, MAI 800, 
MAI 900, MA3100 

Compatible with Miller, Lincoln, 
Fronius, Thermal Arc, and SKS 

Kawasaki 

RS05N/L, RS06L, RS10N/L, RS15X, 
RS20N 

Compatible with Lincoln, Miller, 
Fronius, Kobelco, OTC Daihen, 
Panasonic, SKS 


Table 1 1 Examples of system integrators for robotic welding 


System integrators 

Specializations 

CLOOS Robotic 
Welding Inc. 

Develop and supply personalized turnkey-automated welding 
solutions 

911 Albion Avenue 

Schaumburg, Illinois 

60193 

USA 

Tel: +1 (847) 923-9988 

Fax: +1 (847) 923-9989 

http : //cloo srobot . com/ 

Kranendonk Production 
Systems BV 

Supply custom-build robotic solutions for wide range of industries 
including shipbuilding, offshore, transport, structural steels, and 
process and power 

(head office) 

Sir Rowland Hillstraat 10 

P.O. Box 6147, NL-4000 
HC 

Tiel, The Netherlands 

Tel: +31 344 623 944 

Fax +31 344 623 388 

inf o@ kranendonk.com 

http : //www .kranendonk. 
com 


0 continued ) 
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Table 11 (continued) 

System integrators 

Specializations 

OTC DAIHEN, Inc. 

Manufacture and supply welding equipment (primarily digital 
inverter welding supplies), cutting equipment, torches, robots, 
positioning equipment, standard and custom arc welding cells, and 
accessories 

1400 Blauser Dr. 

Tipp City, OH 45371 

USA 

Tel: +1 (937) 667-0800 

Fax: +1 (937) 667-0885 

www.daihen-usa.com 

Valk Welding B.V. 

Develop and deliver robotic welding systems, which are intended 
for flexible production of small to medium-large series 

Staalindustrieweg 15 

NL-2952 AT 

Alblasserdam 

The Netherlands 

Tel: +31 78 691 70 11 

Fax +31 78 691 95 15 

http : //w w w . v alkwelding . 
com 

Welding Robotics UK 

Manufacture and supply welding equipment (primarily digital 
inverter welding supplies), cutting equipment, torches, robots, 
positioning equipment, standard and custom arc welding cells, and 
accessories 

Henlow, Bedfordshire 
SGI 6 6EJ 

United Kingdom 

Tel: + 44 1462 759025 

http://www. 
weldingrobotic s .co.uk/ 


Robotic Welding Suppliers and System Integrators 

Most of the major industrial robotic companies manufacture robots for welding 
applications. They are able to supply the complete robotic welding system includ- 
ing the power sources from the welding power source suppliers. Table 10 lists the 
robots and their compatible power sources. 

There are companies and system integrators who provide the complete welding 
automation solutions for specific applications. Some of these companies are listed 
in Table 11. 


Summary 

This chapter starts with an overview of the welding processes categorizing in terms 
of fusion welding and solid-state welding. Arc welding, which is one of the fusion 
welding processes is popularly used in wide range of industries including metal 
fabrication job shops, construction equipment, construction beams, shipbuilding, 
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and aerospace. Majority of today’s robotic systems are catered for the arc welding 
processes including gas metal arc welding (GMAW) and gas tungsten arc welding 
(GTAW) processes. Due to the fundamental difference in the welding processes, 
the welding torches for the two process types cannot be shared but a single welding 
power source specifically the multiprocess type can be used for both of the 
processes. Apart from the robot arm and the welding power source, the main 
peripherals in the robotic welding system are the torch cleaning station, wire feeder, 
wire pools, and wire drum. 

The design considerations for the robotic welding system are the targeted 
workpiece in terms of material, size, configuration, joint geometry and positions, 
as well as the production requirements. There are various types of positioners and 
transfer units for moving the workpiece as well as the robot arm to meet the needs of 
the workpiece features and operation requirements. For high-mix and low-volume 
manufacturing featured frequent changing workpiece designs and flexible produc- 
tion, there is automatic robotic welding programming software for setting up the 
complete robotic welding process and operation. Through-the-arc sensing and 
advanced add-on optical sensors integrated with optimized welding control in the 
power source are available to achieve high-quality welds. 
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Abstract 

Robotized deburring and finishing are gaining more popularity in the industry, 
due to the cost-effectiveness and better quality assurance compared to manual 
operations. This chapter provides an overview of the robotic finishing process, 
from selection of the robot, spindle, media, and other hardware to programming, 
process execution, and verification. Finally, some challenges and future research 
direction in this area are also highlighted. 
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Introduction 


Deburring is the process of removing a raised edge or small pieces of material 
remaining attached to workpiece after a modification process, also known as the 
“burr,” while surface finishing is the act of smoothening the surface for aesthetic 
and practical reasons. Burrs need to be eliminated for various reasons: from 
preventing finger cuts to reducing metal contamination in aerospace assemblies, 
jammed mechanism, wear on moving parts, clogged filters, etc. On the other hand, 
surfaces need to be treated to reduce wear and friction. For instance, the surfaces of 
aerospace fan blades or marine propeller need to be smooth to reduce drag and 
improve the aerodynamics or hydrodynamics. 

A considerable amount of manufacturing time is spent on deburring or surface 
finishing after the primary machining. Currently, most of these works are done 
manually, but it is increasingly difficult to find workers who are willing to take up 
these dull, daunting, and dirty jobs, and the cost of human labor has also become 
higher. Furthermore, the quality is inconsistent, and there are health issues due to 
inhaling of metal dust as well as strong vibration of the handheld spindles. There- 
fore, there are significant incentives to automate the deburring and surface finishing 
tasks. Because of robot’s dexterity and flexibility, robotic deburring or surface 
finishing has been a popular subject of research in the past three decades. 

In Bausch et al. (1986) and Kazerooni et al. (1986), it was pointed out that a 
limitation in using robots for precision deburring is their poor end point positional 
accuracy and that low-frequency oscillations of robot position could be observed. 
To compensate for the robot inaccuracies in deburring applications, the simplest 
approach is to use a passive compliant end effector to hold the deburring tool 
(Bausch et al. 1986). With a suitable end effector compliance, i.e., with the proper 
choice of mass, spring, and damping coefficients, positional variations produce only 
slight changes in the deburring contact force. Another solution which is more 
elegant is to use active compliant end effector which operates under a separate 
closed loop control. For example, in Kazerooni et al. (1986), the workpiece is 
mounted on a positioning table which moves according to the robot so as to 
maintain a constant relative distance between the workpiece and the robot. The 
methods described in Bausch et al. (1986) and Kazerooni et al. (1986), along with 
those introduced in Dornfeld and Masaki (1987), Hollowell (1987), Paul and 
FitzPatrik (1987) and Stepien et al. (1987), are effective for parts with good 
knowledge of their geometry, since the end effector’s motion range cannot exceed 
the distance between the workpiece and the robot’s maximum positional deflection. 
In Kazerooni and Her (1988), an approach for deburring of 2-dimensional parts 
with unknown geometry was proposed. A roller bearing mounted on a force sensor 
at the robot end point serves as a tracking mechanism. The tracking controller then 
utilizes the force measured by this force sensor to find the normal vector to the part 
surface, from which the part contour information can be obtained, enabling the 
robot to travel along the edge of the part. On the other hand, the measured tangential 
force provides a clue on whether a burr is encountered, and if the answer is 
definitive, the robot will slow down to maintain a constant material removal rate. 
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Another tracking mechanism or jig was designed in Schimmels (1994), which uses 
compliance and constraints to ensure a correct positioning of end effector relative to 
the workpiece, thus allowing greater accuracy in edge-based material removal 
processes such as deburring. Looking at the emerging needs in robotic finishing 
tasks, robot manufacturers have incorporated force control functionality into their 
industrial robot. ABB has integrated force control technology into their IRC5 
controller that enables end user to explore new dimension of robot intelligence 
for grinding, polishing, deburring, deflashing, and assembly tasks. KUKA has 
incorporated force torque control into their KRC4 that gives the robot a sense of 
touch which could be used for grinding, deburring, and polishing operation. Fanuc 
incorporated Fanuc force sensor into their material removal robots for contouring 
motion with specified pushing force which targeting for deburring and polishing 
operation. 

Apart from the process of deburring or polishing itself, path planning for the task 
is also crucial for the success of the tasks. In Wenzel and McFalls (1989), a vision 
system has been developed to analyze the aircraft canopy and to determine which 
area needs to be polished. The robot then polishes the flawed areas according to 
some circular patterns. The material removal pattern is intelligent and adaptive 
based on the flaws, e.g., if a flaw cluster can fit within the effective cutting radius of 
the tool, then it will use the centroid of the flaw as the grind site’s center. 
Alternatively, there may be multiple flaws which can be swept through by the 
cutting radii at one go. Then the grind site’s center will not be placed at the center of 
a single flaw. Apart from the above highly adaptive process, most deburring or 
polishing tasks in the industry are slightly more deterministic, and the tool path can 
usually be generated through off-line programming using the CAD data (Nurminen 
1994; Van Brussel and Persoons 1996; Tam et al. 1999; Marquez et al. 2005; 
Nagata et al. 2007). For workpieces which have very complicated shapes, the 
programming of the path points can be too tedious due to the need to program 
positions and orientations, and for such cases, Zhang et al. (2006) proposes a 
method which combines vision and force sensor to obtain all the needed informa- 
tion of a path. The vision system provides the position data, and then the robot 
moves in a zigzag fashion along the path. At the same time, the force sensor 
measures the force across the zigzag path and determines the normal vector, 
which provides the orientation of the path points. 

A terminology closely related to deburring and surface finishing is the “material 
removal rate (MRR),” which determines how fast the material is being removed 
during the deburring or surface finishing process. In Yeh and Lan (2002), a study of 
controlling the MRR with fixed tool life to gain the maximal profit from the 
machining operation has been undertaken. The relationship between the 
deburring/polishing force, the feed rate, the tool speed, the tools , and their grain 
size has also been extensively analyzed in Guvenc and Srinivasan (1997), Huang 
et al. (2002), Zhang and Pan (2008), Jin et al. (2008), Tsai et al. (2009), Rajasekaran 
et al. (2010), Yano et al. (2011) and Corral et al. (2013), either analytically or 
empirically. Using the knowledge of MRR, the robot can be commanded to move 
slower or to increase the machining force if extra material such as burr and rough 
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surface is detected, so as to achieve a constant MRR, which in turn removes the 
extra material completely. The MRR also influences the surface roughness 
(Marquez et al. 2005; Rajasekaran et al. 2010; Corral et al. 2013), and thus it is 
necessary to choose an optimal set of parameters or tools to obtain a good deburring 
or polishing quality. 

Years of research in the academia have finally made robotic polishing and 
deburring a viable option for the industry. Robots have found its application in 
polishing of aircraft canopies (Wenzel and McFalls 1989), milling of molds and 
polishing of precast concrete (Nurminen 1994), deburring of castings (Van Brussel 
and Persoons 1996), sanding of furnitures (Nagata et al. 2007), deburring of cast 
aluminum wheels (Zhang et al. 2006), polishing of die and molds (Guvenc and 
Srinivasan 1997; Jin et al. 2008; Tsai et al. 2009), polishing of turbine (Huang 
et al. 2002; Hazel et al. 2012), machining of carbon-fiber-reinforced polymer 
(Rajasekaran et al. 2010), as well as grinding and deburring of stubs (Robertsson 
et al. 2006). The achieved quality is tremendous, e.g., in Hazel et al. (2012), the 
surface roughness has been lowered from Ra = 15 pm to Ra = 0.1 pm by using 
robotic polishing in place of manual polishing. In Jin et al. (2008), a surface 
roughness of Ra = 5 nm is also achieved by using a gasbag type of tool. 

Figure 1 shows the overview of the flow of using robot to perform deburring and 
finishing, which is also roughly the organization of this chapter. In section “Hard- 
ware,” the hardware needed for robotic finishing is described, followed by some 
details of programming and path planning in section “Programming and Path 
Planning.” Section “Process Execution and Verification” describes the program 
execution and tuning of parameters to obtain the required results. The chapter ends 
with a list of future research directions in section “Challenges and Future Work.” 


Hardware 

Robot 

The choice of robot is essential for a successful deburring or surface finishing task. 
During finishing operation, tool and media need to move within a predefined 
motion and maintain consistent contact with the workpiece. In order to realize the 
finishing operation, robot can be in either position control or force control mode 
depending on workpiece’s feature and media to be used. Figure 2 illustrates the 
possible approaches for the robotic finishing operation. 

For robotic position control methods, where the robot is commanded to carry the 
tool or workpiece to move along a predefined path, repeatability and accuracy are 
two important notions. A high degree of repeatability ensures that the path taught 
through online teaching can be played back well enough to perform the actual task. 
On the other hand, a good accuracy is important for the case of off-line program- 
ming, where the robotic paths are programmed based on CAD drawings and the 
actual robot has to move to the “exact” position in space. Of course, the level of 
repeatability and accuracy needed depends on the actual requirement of the 
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Hardware setup 


Path planning 
and programming 


Programme 
execution and 
verification 



Fig. 1 Overview of the flow of using robot to perform deburring and finishing tasks 


workpiece. If the tolerance is less stringent, then a less accurate robot can still be 
used in conjunction with some compliant tools which cater for the position error to a 
certain extent. The compliant nature of the tool or media will realize the consistent 
contact between the media and the workpiece during the finishing operation. 
Understanding the behavior (e.g., aggressiveness, tool wear rate, etc.) of the 
finishing tool or media is the key factor, as the proper depth of interference needs 
to be set with the range tolerable by the tool or media for optimal finishing 
performance. Besides, robot path needs to be compensated accordingly to cater 
for the tool and media wear. 

Robots with force control are also becoming more readily available, and such 
force control package can be a good option when deciding for a robot. They are 
integrated with force sensor, and the tool and media are attached to the robot 
manipulator through the force sensor. These robots are able to “sense” and “feel” 
as the tool moves around the workpiece while at the same time maintain a desired 
force or pressure on the workpiece, thus achieving a good end quality. Based on the 
measured force signal, the robot can also dynamically change its motion to 
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Robot in position control 
with compliant tool 

• Tool or media usually have certain level of compliance 

• Tool and media are attached directly to robot manipulator 

• Robot manipulator is in position control mode 

• Force sensor is usually not needed 


Robot with force control 

• Force sensor is usually integrated within the robot manipulator 

• Tool and media are attached to robot manipulator through force sensor 

• Force control is realized through the overall robot manipulator 


Robot in position control 
with add on end-effector 
in force control 


• Tool and media are attached to the add-on end-effector module 

• Force control is realized through add-on end-effector module and robot 
manipulator is in position control mode 

• Force sensor is usually integrated within the addon end-effector module 


Fig. 2 Approaches to robotic finishing operation 


accomplish a certain goal. For instance, if the force signal becomes higher than 
usual, it could mean that there are thick burrs or more material in the way. Either the 
robot will then slow down so as to remove the additional burr or material, before 
continuing with the rest of the path (Fig. 3), or the robot will automatically “climb 
over” the extra material, if the requirement is just to smoothen out the edges or 
surface without needing to achieve a constant level (Fig. 4). The force feedback also 
helps reducing the error due to positional inaccuracy, because the robot will 
eventually come in contact with the workpiece with the desired force, even if the 
workpiece was way off the planned path. 

Another alternative for the robotic finishing operation is through the add-on end 
effector module that is capable of performing force control capability. It is also 
termed as force control around the robot arm. In this approach, robot manipulator 
will be the macro positioner, while the end effector module acts as micro positioner 
with force control capability. Tool and media are attached to the end effector 
module for the finishing operation. The details and choice of force control end 
effector module will be given in the next section. 

Table 1 summarizes the advantages and disadvantages of the robotic finishing 
approaches. 

The payload of the robot is also an important consideration. In conventional 
robotic applications such as workpiece handling, the payload is mostly the weight 
of the workpiece itself. However, for robotic deburring and finishing, the additional 
machining force needs to be counted into the payload as well. For instance, if the 
machining force is 100 N, then the robot must be able to handle the weight of the 
spindle/workpiece (for robot holding tool/workpiece, respectively) as well as the 
additional approximately 10 kg of contact force. The user may also choose between 
a more dexterous serial link robot and a stiffer parallel manipulator, depending on 
the required machining force. 
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Fig. 3 Robot slows down when extra material is detected by force sensor 
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Fig. 4 Robot climbs over the extra material when this is detected by force sensor 
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Table 1 Comparison among the robotic finishing approaches 


Approaches 

Advantages 

Disadvantages 

Robot + compliant 
tool or media 

Simple and low cost 

Need to manually compensate the 
robot path for tool wear 

Robot with force 
control 

Capable of defining and 
changing desired force value 

Require force sensor 

Automatically compensate the 
robot path for tool wear 
through force control 

Performance for light force value (less 
than 1 N) still needs improvement 

Robot + add-on 
end effector with 
force control 

Capable of defining and 
changing desired force value 

Require add-on end effector 

Automatically compensate the 
robot path for tool wear 
through force control 

Depending on the weight of the add-on 
end effector, robot with higher payload 
may be needed 



Fig. 5 Points to consider when choosing robot for deburring or finishing tasks 


Some other characteristics which need to be taken into account when determin- 
ing for a robot include its reach, inertia, maximum speed, and acceleration. The 
availability of an off-line programming software by the robotic manufacturer is also 
worth mentioning, because off-line programming offers a wide range of advan- 
tages, including safety for the programmer and no need for shutting down of 
production line for path teaching. Figure 5 provides an overview of all the charac- 
teristics of the robot to look out for when choosing a robot. 

One more point to consider when installing the robot is the mounting position. A 
robot fixed on the ground and standing upright is the most natural position from the 
viewpoint of the human programmer, but the effective workspace may be reduced - 
for instance, the workpiece cannot be placed too close to the robot; otherwise, the 
joint angles may be close to the limits. Also, the robot could obstruct the human 
path. Other possibilities include mounting the robot on the wall or upside down on 
the ceiling, but this depends whether the robot is able to be positioned in these ways. 
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Force control 
end effector 


Fig. 6 


Force control end effector in action to maintain contact force with workpiece 


Force Control End Effector 

As mentioned earlier, an alternative to a robot with force control capability is to use 
a force control end effector together with a more conventional robot which runs in 
position control mode. The robot carries the end effector and positions it along a 
path, while the end effector performs the force control task - sensing the force and 
controlling its expansion to maintain a correct force between the tool and the 
workpiece (Fig. 6). Things to consider when choosing a force control end effector 
include its force capacity, the force resolution, the maximal overturning moment, 
the degrees of freedom (i.e., the number of force directions it is able to control), the 
stroke in each force axis, as well as its control bandwidth which determines the 
reaction speed. Some end effectors also come with acceleration and gravity com- 
pensation, which are beneficial for achieving the correct contact force by removing 
the acceleration and gravity forces from the sensed signal. Last but not least, it is 
also crucial that the end effector can communicate with the robot controller, so that 
the motion of the robot as well as the force control action of the end effector is 
synchronized. 


Turntable/Rail 

Apart from the robot itself, a turntable or a rail can be a useful extension as well. For 
instance, certain edges of a workpiece may be out of reach to the robot due to the 
need to twist the robot joints in an awkward manner to achieve the required 
orientation, but if the workpiece can be rotated using the turntable such that the 
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Fig. 7 Top - robot in 
awkward configuration; 
bottom - turntable brings the 
workpiece to an appropriate 
position and orientation for 
the robotic operation 



Turn table 



Turn table 


edges become nearer to the robot, then the robot may be able to deburr these edges 
without difficulty (Fig. 7). Another example is that of a huge workpiece. Part of it 
may be beyond the working envelope of the robot, but if the robot now sits on a rail 
and moved to a few different positions, then the complete workpiece can be 
covered. The turntable can provide other conveniences as well - for cylindrical or 
circular features on a workpiece, if the workpiece is stationary, then the robot needs 
to move in a circular motion, sometimes requiring a huge rotation of all the joints. 
On the other hand, if the workpiece is mounted on the turntable along the rotary 
axis, it can rotate while the robot stands still and performs the deburring or finishing 

(Fig. 8). 


Spindle 

For robotic finishing task, the spindles attached to the end of the robotic arm must 
be flexible (small in dimension for operational space flexibility), durable (high 
degrees of contamination protection), and lightweight (meet the payload limitations 
of the robot to minimize payload stress). Figure 9 shows the categorization of 
spindle systems suitable for robotic finishing. 

In general, conventional spindles for robotic finishing operation can be selected 
from either pneumatic spindles or electric spindles. 

Pneumatic spindle utilizing the air drive system is usually small in dimension, 
has high power-to-weight ratio, and is cost-efficient. This offers the opportunity for 




66 Robotic Finishing 


2455 



Fig. 8 Turntable assisting the deburring of circular feature 


Fig. 9 Spindle systems for 
robotic finishing 



the pneumatic spindle to be used in narrow areas, allowing a smaller and lighter tool 
to accomplish the task. Typical output power ranges from 0.1 to 1.2 HP with speed 
up to 60,000 rpm, within a fixed spinning direction. However, the actual output 
power of the pneumatic spindle is subjected to pressure supply variations as well as 
process-related disturbances. 

On the other hand, electric spindle may not be as elegant as pneumatic spindle 
when it comes to size and weight, but it could vary its speed and provide consistent 
speed under different motion feed rates and process-related disturbances. This 
offers the opportunity for the electric spindle to fulfill the optimized speed of 
specific tool and media. Typical output power ranges from 0.2 to 30 HP with 
speed up to 60,000 rpm, usually with both clockwise and counterclockwise spin- 
ning directions. 

Note: Some of the very high-speed spindles that required rigid vertical mounting 
are meant for CNC machine and may not be suitable for robot manipulator 
mounting. 
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Robotic Finishing 
Process 



Remove light burrs 



Remove primary 
burrs 



Break edge or 
Form edge profile 



Dress edge profile 



Fig. 10 Major steps in robotic finishing process 


For robotic finishing operation that involves multiple finishing media in different 
finishing processes or stages, spindle with tool-change functionality will further 
improve the cycle time of the finishing operation. Besides, spindle with angle head 
will increase the accessibility level of media to feature of interest, especially those 
features located within confined space. Although it enhances the accessibility level, 
it comes with the price of additional weight. 

There are also spindles that are capable of providing compliance characteristic. 
The common type of compliance nature includes linear compliance (deflection 
along a linear axis or straight line), radial compliance (deflection along a rotation 
axis or radius), or a combination of linear and radial compliance. For compliant 
tool, the ultimate “contact force” must be directed along the direction of compli- 
ance. Spindle with linear compliance (Odham 2007) is recommended for cone- 
shaped media or cup brushes, due to its great stiffness perpendicular to the control 
force. Spindle with radial compliance allows movement from a center position 
through 360° along a radius of compliance with a constant contact force. When the 
contact force is removed, the rotating shaft of the tool will return to the center of the 
compliant field. Spindle with radial compliance is recommended for radial brushes 
and helps control the side loads on the radial brush. 


Tool and Media 

Machining operation will generate various edge conditions on the workpieces, 
subject to machining process parameters, cutter condition, etc. The subsequent 
finishing operation will prepare the machined workpieces toward the final finishing 
requirements. General robotic finishing operation could be realized through four 
major steps as illustrated in Fig. 10, which are also detailed in Table 2. 

In general practice, the selection of finishing media depends very much on the 
material of workpieces as well as its input conditions (e.g., burr height, burr 
thickness, etc.). Table 3 lists some of the potential media based on the four types 
of common workpiece materials. 

In general, machined edges or surfaces on the workpiece can be categorized into 
the following major categories illustrated in Fig. 11. The recommended tools and 
media for robotic finishing are listed in Table 4. 
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Table 2 Major steps in robotic finishing process and the potential media 


Major steps 

Purpose 

Potential media 

Remove 
light burrs 

To remove fine dangling burrs 

End brush; wheel brush 

Remove 

primary 

burrs 

To remove primary burrs from 
machined edge. It depends on class of 
burrs and type of machined edges 

Bristle disc; cotton fiber mounted 
points; rubber-bonded media; flap 
disc 

Break edge 
or form 
edge profile 

To break the sharp edge and form the 
required edge profile 

Unitized wheel; bristle disc; cotton 
fiber mounted points; rubber-bonded 
media; flap disc 

Dress edge 
profile 

To achieve desired surface roughness 

Nonwoven abrasive pad; radial 
bristle disc 


Table 3 Common workpiece material and potential media 


Workpiece material 

Potential media 

Carbon steel 

End brush; wheel brush; cotton fiber mounted points; unitized 
wheel 

Titanium (nickel alloys) 

End brush; wheel brush; cotton fiber mounted points; rubber- 
bonded media 

Aluminum, copper (soft 
metals) 

End brush; wheel brush; cotton fiber mounted points; rubber- 
bonded media; bristle disc; unitized wheel 

Other - composites, 
plastics, glass, etc. 

Flap disc; pyramid-shaped abrasive disc or belt 


Hole 

inner 

surface 


Straight 

edge 


Curve 

edge 


Convex 

surface 


Type of 
machined 
edges or 
surfaces 


Circle 


Concave 

surface 


Hole 


Flat 

surface 


Groove 


Fig. 1 1 Type of machined 
edges or surfaces 
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Table 4 Type of machined edges or surfaces and the recommended media 


Type of 
machined 
edges or 

surfaces Recommended media 

Straight edge Flap disc; rubber-bonded 

disc; cotton fiber disc; 
unitized wheel; flap wheel 


Curve edge End brush; wheel brush; 

rubber-bonded disc; cotton 
fiber disc 


Image illustration for reference 




Circle 


Hole 


Groove 


Unitized wheel; honing 
brush; non woven abrasive 
pad; cotton fiber disc 


Tool with retractable cutter; 
honing brush; nonwoven 
abrasive pad 


Ball-shaped media; cone- 
shaped media 



0 continued ) 
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Table 4 (continued) 


Type of 
machined 
edges or 

surfaces Recommended media Image illustration for reference 


Flat surface Nonwoven abrasive pad; 

cotton fiber disc; bristle disc 


Concave 

surface 


Convex 

surface 


Hole inner 
surface 


Nonwoven abrasive pad; 
radial bristle disc 


Nonwoven abrasive pad; 
cotton fiber disc; bristle disc 


Honing brush; ceramic ball 
tool; nonwoven abrasive pad 





I 


** a 
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Programming and Path Planning 
Programming 

Having set up the robotic workcell, the next step in realizing the robotic deburring 
or finishing is to program the robot for the task. In the academia, many intelligent 
and adaptive programming approaches have been proposed, as can be seen in the 
introduction. Nevertheless, for real industrial applications, the following two 
methods are by far the most popular ones: online teaching using teaching pendant 
and off-line programming in virtual environment. For the online teaching method, 
the operator stands near the robot and uses the joystick or directional buttons to 
command the robot to move to certain positions, e.g., points along an edge to be 
deburred. The points are recorded, and at the end the robot will play back the path 
while executing the actual process. This method is convenient for simple work- 
pieces but will need a significant amount of effort and time for complicated parts. 
The accuracy of the taught path is also questionable as the target points are 
programmed based on the operator’s vision. Also, safety can be an issue as the 
operator needs to stand near the robot. Last but not least, teaching the robot causes 
downtime to the production, which may be undesirable for parts whose production 
volume is not significantly high. 

Contrary to online teaching, off-line programming does not require the program- 
mer to be in close proximity with the robot, as the programming is done using 
computer software. A virtual workcell is set up in the software, which includes 
CAD drawings of the robot, spindle, tool, as well as the workpiece. Since the CAD 
drawings are provided, it is straightforward to extract the edges or surfaces, even if 
the parts are complicated (Fig. 12). After the intended paths are obtained from the 
drawing, and after the tool center point is defined, the robotic program will be 
generated. Finally, the program is downloaded to the actual robot, and it will 
perform the task as commanded. The advantages of off-line programming are easily 
seen: it is safe as the programmer does not need to be near the robot; the path points 



Fig. 12 Points, lines, or 
surfaces extraction from CAD 
drawings 
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Fig. 13 Chamfering using conical or cylindrical tool 






are easily extracted from the CAD drawing, even if the parts have complex 
geometries; and it does not cause downtime to the production line. 

Nevertheless, off-line programming is not without its flaws. Firstly, robot is not 
very accurate due to modeling error, and the actual robotic path may deviate from 
that generated in computer software based on the robot model. Secondly, even if the 
robot were perfect, the workpiece itself may also be different from its CAD model. 
For example, cast parts may vary significantly from the CAD drawings. Therefore, 
off-line programming should be enhanced through proper calibration, compliance, 
force sensing, and vision. 


Path Planning 

Apart from the skill or knowledge to do robotic programming for most robotic 
tasks, deburring and surface finishing necessitates some other task-specific path 
planning, in order to achieve the best result. First of all, different tools require 
different orientations of the tool with respect to the workpiece. For example, a 45° 
conical tool will create a chamfer on the edge, even if the tool shaft is perpendicular 
to the workpiece surface, whereas a cylindrical tool will require the robot to be 
slanted at 45° to achieve the same result (Fig. 13). A stack of bristle disc can move 
either across or along the edge when deburring (Fig. 14). Secondly, to reduce the 
tool wear, the tool can be programmed to be tilted at a certain angle so as to spread 
out the portion in contact with the workpiece (Fig. 15). Next, for surface polishing 
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Fig. 15 Left - high wear rate if tool is perpendicular to edge; right - tool wear is less if tool is 
tilted to spread out the contact area 



Fig. 16 Different polishing marks for different motion paths 


of a large area, the robot can be programmed such that it moves in a continuous 
fashion with a zigzag path, or there can be a plurality of straight paths parallel to 
each other. However, the two possibilities will provide different marks on the 
surface after polishing (Fig. 16). 

Other possibilities to consider during path planning are whether the robot is 
holding the tool or the workpiece. The former is independent of the part weight, 
but the robot must also carry the tools which have power umbilical cords. Also, an 
additional external axis may be useful for the design of path. As already highlighted 
in the hardware section, for cylindrical or circular features on a workpiece, if the 
workpiece is mounted on the center of a turntable, the robotic path will be a single 
stationary position for the tool to come in contact with the workpiece while letting the 
turntable perform the rotary motion to complete deburring or polishing the circular 
feature (Fig. 8). Without the turntable, the robot itself will have to run through the 
circular path, sometimes requiring a huge and awkward rotation of the arms. 

For force-controlled deburring or surface finishing tasks, two more items 
need to be thought about during path planning. Firstly, when should the spindle 
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Example of tool: 
End-brush 




• P4 



Position control mode 

1. Move to PI. 

2. Turn on spindle. 

3. Move to P2. 

4. Move to P3. 

5. Move to P4. 

6. Turn off spindle. 


Force control mode 

1. Move to PI. 

2. Remove force sensor bias. 

3. Turn on force control and search for workpiece, 
in the direction of P2. 

4. When contact force is detected and the desired 
force value is reached, turn on spindle. 

5. Move to P3, while maintaining contact force. 

6. Turn off force control and move to P4. 

7. Turn off spindle. 


Fig. 17 Pseudocode for robot program 


be turned on? And secondly, when should the bias on the force sensor be removed? 
For the first question, it is noted that in most feedback control systems, an overshoot 
is inevitable during a step response. For a tool which moves within free space to 
come into contact with the workpiece, the force overshoot can even be more serious 
and this could create a big dent on the workpiece, if the spindle and tool were 
already rotating beforehand. Thus, it could be better to turn off the spindle prior to 
contact and turn it on only after a contact has been established and the force has 
stabilized around the desired value. For the second question, it is well known that 
force sensor signal usually contains some bias as well as a lot of noise, and the latter 
can be aggravated if the spindle, carried by the force sensor, is rotating. To obtain 
the correct contact force, the offset of the force signal should thus be removed, and 
this should be done when the spindle is not revolving. 

Pseudocodes for the robot program in both position control mode and force 
control mode are given in Fig. 17. The actual robotic code would depend on the 
language particular to the robot brand/model. 


Material Removal Rate 

The material removal rate is a terminology closely related to the force control approach 
for deburring or surface polishing. In simple words, it means how much volume of 
material is being removed when the tool traverses through a certain path. In the 
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Fig. 18 Robot programmed to revisit the portion of workpiece with more materials to remove 


literature, it is well known that the material removal rate is influenced by the contact 
force, the rotational speed of the tool, the feed rate of the tool along the path, the 
material of the workpiece, the type of tool, as well as the grit size of the abrasive tools. 

How does the material removal rate come into play for the path planning or 
robotic program preparation? Suppose there is a workpiece whose surface rough- 
ness needs to be removed. The programmer can first program the polishing process 
with aggressive parameters such as high machining force and high tool rotation 
speed while using a tool with coarse grit size. In this case, the material removal rate 
is high, and with just one sweep across the workpiece, the surface waviness is 
completely removed. However, it is intuitive that the surface quality could be rather 
poor. To improve the surface quality, the programmer then appends the robotic 
program with a few more repeated paths but at much milder parameters and with a 
finer tool. The complete program then consists of the first roughing pass as well as 
the next few finishing passes. 

Another example is when there is clearer information about the condition of a 
part. For instance, a workpiece has gone through a 3D scan using some laser 
scanner or other sensors, and the data clearly shows that there is a portion of the 
workpiece which needs more polishing than other areas. The path planning can then 
be either having a single pass, whereby the machining parameters become more 
aggressive (e.g., higher force) at the particular portion, or using only one set of 
parameters but programming the robot to revisit the special area a few times until 
the material is completely removed (Fig. 18). 


Process Execution and Verification 

Robot programming and path planning provides the robot motion and tool trajec- 
tory needed to follow through the edges or surfaces to be finished, but it is only the 
first half of the story. What is equally important is the process study, which is the 
subject of this section. 
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Based on the material type, features of interest, and robot accessibility to the 
features on the workpieces, potential tools and media will be consolidated for 
preliminary finishing trials. Finishing trials can be carried out on actual workpieces 
or test coupons. The latter, designed with major representative features, are the 
preferred choice for preliminary finishing trials mainly due to cost factor, and there 
is no effect to the quantity of workpiece produced. After the preliminary finishing 
trials, media suitability and finishing approach will be evaluated based on the 
measurement outcome. If the media performance is far from the finishing accep- 
tance level, another round of media survey may be needed. 

For feasible media, subsequent process study will be carried out to determine the 
process parameters. In general, the process parameters include spindle speed (RPM), 
feed rate of the tool along the path, tool approach angle, grain size, depth of interfer- 
ence, force, etc. The optimal operation RPM and feed rate recommended by the media 
provider serve as a good initial reference during the process study. For instance, based 
on the recommended operational RPM and feed rate, end user may consider increasing 
the media grain size if the finishing rate is too slow. On the other hand, if the 
recommended operational RPM is generating disturbances to robot systems (e.g., 
due to resonance), end user may consider reducing the RPM and feed rate. 

The choice of robot operation mode (i.e., positional control, force control) 
depends very much on the nature of the finishing media as well as features of 
interest on the workpieces. For instance, robotic finishing operation for hole, hole 
inner surface, or groove required precision contact between the media and the 
features. Thus, robot in position control mode is a better choice for this operation. 

On the other hand, while performing finishing tasks, force sensing is one of the 
key capabilities to achieve successful operation. Thus, robots with force control 
will have another degree of flexibility for finishing operation. Instead of depth of 
interference, contact force becomes one of the process parameters. Thus, for media 
with particular grain size, the combination of contact force, spindle speed, and feed 
rate will somehow determine the material removal rate. Through the process study, 
end user will also gain a better understanding on the behavior of the finishing media 
and its performance on the actual workpieces. Design-of-experiment approach 
could be used to identify the key process parameters for process parameter optimi- 
zation purposes. During the process study stage, measurement results served as an 
important feedback and guidance toward achieving the finishing requirements. 

Figure 19 illustrates the general steps in process development for identifying 
suitable process parameters for robotic finishing operation, while Fig. 20 shows an 
example of edge before and after being deburred using industrial robot. 


Challenges and Future Work 

Despite the rising popularity of robotic finishing processes and the various success 
stories from the industry, there are still some challenges to overcome to increase the 
adoption. Following is a list of these issues which necessitate further research for 
improvements : 
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f \ 

Features of interest 

& 

Finishing requirements 

V J 


• Identify features of interest on the workpiece 

• Understand finishing requirements (e.g edge profile, surface 
roughness) 


r A 

Survey of tools and media 

• Survey of potential tools and medias 

• Evaluation of robot accessibility and reachability 

• Consolidation of tools and media for finishing trials 

V J 



Preliminary finishing 

trials 

V J 

• Evaluation of media suitability for each features through preliminary 
trials 

• Evaluation of finishing approach through preliminary trials 


s a 

Review of tool and media 

V ^ 

A 

• Review of tools and media based on preliminary trials result 

• Down-select of tool for each feature and finishing approach 

J 


/* A 

Process study 

V J 

• Determination of feasible key process parameters through process 
studies 

• Verification of finishing outcomes through measurement result 


f A 

Process parameters 
V J 

• Robotic finishing approach: Position or Force control mode 

• Process parameters: Spindle speed, feed rate, force, depth of 
cut/interference 



Fig. 19 Identifying process parameters 



Fig. 20 Sharp edge and burr before deburring (left), chamfered edge (right) 


• Robotic programming is still a tedious process, even with CAD/CAM software 
for off-line programming. Not only do the path points need to be selected 
carefully, but the approach angle of the tools also needs to be properly deter- 
mined, and the process parameters need to be fine-tuned for different types of 
material and media. Since the programming time could be even longer than the 
actual process time, robotic finishing is still not well accepted by companies 
which produce high-mix-low- volume parts. It is thus necessary to improve the 
programming method. 
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• For robotic force control applications, there is usually high impact force as the 
tool approaches from the free space and touches the workpiece. This can lead to 
some dent or damages to the workpiece at the point of contact. Most robots try to 
reduce this impact by approaching the workpiece very slowly, and this increases 
the overall cycle time. Therefore, it is of interest to develop a control method or 
hardware mechanism to limit the impact force even if the approach speed is high. 

• For the process verification and fine-tuning of process parameters, the current 
practice is to do this step by step and manually - finishing on workpiece or trial 
coupon is performed, then the edges or surfaces are measured, before tuning and 
trying out of a new set of parameters, and the cycle is repeated. To reduce this 
effort, it could be useful to develop an in situ measurement system and process 
parameter adaptation algorithm to search for the optimal parameters automati- 
cally. For example, the surface waviness may be detected by vision sensors or line 
scanners, or even inferred from the force sensor, which would then command 
the robotic system to try out other parameters until the requirements are met. 
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Abstract 

This chapter will explain in detail the use of Stratasys, Inc., fused deposition 
modeling (FDM) systems in rapid manufacturing or direct digital manufacturing 
(DDM) to achieve targets of weight reduction, product customization, time to 
market, and tooling cost reduction, among others. 

Most people associate manufacturing with large, expensive, and, often times, 
polluting, factories. Utilizing FDM to produce components can be carried out in 
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a compact, clean, and cost-effective room and, depending on the machine, 
typically an office environment. 

While mass production of plastic and metal components by injection molding 
or machining has been the industry standard since around 1868, modern CAD 
and FEA programs have led to the creation of geometries that are not possible to 
injection mold or machine. These geometries are appearing more in low-volume, 
specialized products where tooling costs become prohibitive even with ordinary 
geometries. This often makes DDM the best choice to meet the desired cost and 
performance targets of the component. 

The FDM technology is also very scalable. If you already have a factory that 
has a given number of machines and suddenly you need to make twice as many 
parts per month, you can quickly and even temporarily add machines in order to 
increase production capacity or do it globally, over a distributed network. 


Introduction 

The current competitive trend of global economy drives manufacturers to deliver 
new customized products to consumers much faster than before. Business failure 
can be resulted if product development could not meet customer demands and 
expectation of quick introduction of new products. The third industrial revolution, 
known as additive manufacturing, enables manufacturers to bypass many tradi- 
tional manufacturing procedures by shortening the design and production cycle. It 
also boasts the ability for fast fabrication of direct usable parts or products. Many 
additive manufacturing processes are now introduced to the market with a wide 
range of material availability closer to standard engineering materials to fabricate 
the parts. As a result, the quality of the parts in terms of durability and geometry is 
being improved. The additive manufactured parts are not only for prototyping for 
design verification as in the olden days but also for direct usage as form, fit, and 
functional end products. The application of additive manufacturing based on 
layered technology to fabricate parts used for direct application is termed rapid 
manufacturing or direct digital manufacturing (DDM). Although the rapid 
manufacturing typically could not have mass production as traditional production 
techniques, it can eliminate tooling, setup, and assembly processes. It can be used 
for high mix low-volume manufacturing or on-demand manufacturing of parts with 
superior complexity. It is also an ideal application for making customized parts 
based on a customer’s special requirements. 


Rapid Manufacturing/Direct Digital Manufacturing Using FDM 

Rapid manufacturing or direct digital manufacturing (DDM) is one of the terms 
associated with additive manufacturing, and it can be considered as an extension of 
rapid prototyping. Instead of producing rapid prototypes for visualization, design 
verification, testing, and analysis, basically it produces the functional component or 
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end product directly from the CAD data still making use of layer-based manufactur- 
ing techniques (Sharma 2010). 

Fused deposition modeling (FDM) is one of the typical rapid manufacturing 
processes that provide functional prototypes of engineering plastic. It could produce 
functional thermoplastic models such as ABS directly from CAD data. It is based 
on additive manufacturing technique by depositing molten plastic filament and 
laying down material layer-by-layer to form the desired shape. Many 3D printers, 
nowadays, have adopted this technology. 

FDM parts are tough and durable. Materials available are ABS, PC, PC-ABS, 
PPSF, ULTEM, etc. They are sufficiently resistant to heat, chemicals, and moisture. 
Parts can be used for limited functional testing. 


Benefits of the Process (FDM) 

There are many industries where FDM parts can take the place of traditionally 
manufactured components. Mostly any component that requires high strength-to- 
weight ratios, high degrees of complexity and customization, running design 
changes, and low-volume production can benefit from DDM. Figure 1 shows an 
example of the part converted from traditional CNC machining to DDM. 




Fig. 1 Converting from CNC machining to DDM in ABS 
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DDM also shifts the logic applied to business decisions in the sales, marketing, 
and finance departments as well as those in the executive suite. Because DDM 
imparts some fundamental changes, many people have a real difficulty grasping the 
key concepts and benefits. 

Medical equipment manufacturers, for example, which produce items such as 
portable MRI machines, have embraced DDM for getting their products to market 
faster and for less total cost of goods sold. This adoption of DDM comes after 
countless hours of design considerations for the process. This adoption also comes 
after countless hours of meetings poring over test results proving out the benefits of 
DDM in a real-world usage scenario. 

To appreciate what DDM can do for a manufacturing company, an understand- 
ing of the fundamental changes that it imparts is necessary. 

Time to market no longer needs to be constrained by the lead time for 
tooling. Since DDM eliminates tooling time and cost, a product may be 
manufactured the same day that a design is completed. This allows companies 
to respond faster and to do so with less initial cash outlay for tooling and 
capital equipment. 

Manufacturers no longer face the difficult decision of postponing a desperately 
needed modification because of lost time and high expense. 

Product design is no longer constrained by the rules for design for manufactur- 
ability or the rules for design for assembly. Since components can be manufactured 
using additive fabrication technologies, such as FDM, there is no limit to design 
complexity. Complex parts are made without additional expense or production 
delays. 

Product revisions are no longer constrained by inventory levels, expense 
for new tooling machining operations, or the time for tooling rework. Because 
there is no tooling, short-run manufacturing principals may be used. This 
on-demand strategy reduces inventory levels and minimizes losses when 
product modifications are implemented, utilizing true just-in-time production 
strategies. 

When companies implement DDM into their product line, production sched- 
ules are no longer bound to inaccurate sales forecasts or rigid economic order 
quantities. The on-demand, just-in-time manufacturing practice promoted by 
DDM diminishes the need to build a production schedule from a sales forecast 
that is in the far distant future to be reliable. Additionally, unlike traditional 
manufacturing methods, there is no time-consuming setup or change over. 
Therefore, there is no penalty for frequent product changes in the manufacturing 
schedule. 

The opportunities seem boundless. In fact, commonly the only limits that arise 
seem to be those imposed by the engineers themselves during the design process. 
The implementation and advantages are limited only by our creativity, imagination, 
and innovation. DDM is a radical departure from tradition. It is a revolutionary 
process, so open minds are needed. DDM is in its earliest stages of industry 
acceptance, and there are many advancements waiting for us in the future in 
materials, machines, software, and cost reductions for each. 
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Designing for the Process 

For the DDM process to be best applied, the design of the component must 
exploit the benefits of the FDM process. Engineers must acknowledge that the 
product design constraints of the injection molding process simply do not 
apply to the DDM process. The geometries that are typically difficult or 
impossible to produce in an injection molded part become desirable attributes 
for increased strength or reduced material required when producing them using 
DDM. Those properties give FDM an advantage over many of the other 
additive manufacturing systems. Figure 2 shows a camera mount produced 
by DDM which allows the completion of complex features, elimination of 
tooling, etc. 

Producing finished goods for the cabin of an aircraft with FDM is increasing 
largely due to the ability of FDM machines to build parts in ULTEM 9085. It could be 
air grates, panel covers, and various other interior parts behind all those skins and in 
the belly of an airplane, heating and cooling ducts, power distribution panels and all 
kinds of clips and brackets in special configurations for a given manufacturer. 

With these complex systems, rapid design iteration becomes important. Additive 
manufacturing has the CAD/CAM capability to do that. Y ou can produce complex 
shapes that are difficult or impossible with conventional technologies. 


DDM in Aerospace 

The manufacture of unmanned aerial vehicles (UAVs) as shown in Fig. 3 is becoming 
a more commonly used application currently exploiting the benefits of DDM. 



Fig. 2 DDM camera mount 
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Fig. 3 DDM of UAV flight - ready hardware (Courtesy of Sheppard Air Force Base) 


This applies beyond UAVs: During the build you can design voids into the part 
and then drop in components - a strain gauge or a pressure sensor - or nuts and bolts 
or bushings embedded into the build. And then, in the future, integrate with other 
technologies. 

Packaging for every millimeter of space is permitted with DDM because you 
produce those designs cost-effectively. 


DDM in Automotive 


Additive manufacturing has been crucial in the automotive development and motor 
sports industries for years. In the race for speed and performance, it has been critical 
for design and testing. Now, race teams are revealing that AM parts will go far 
beyond review and evaluation. They have found that AM is ready to hit the track 
and endure the rigors of high-speed racing. As an example, the MINI John Cooper 
Works WRC shown in Fig. 4 is made up of up to 15 FDM parts. 

Motor sports are using AM for direct digital manufacturing (DDM) of produc- 
tion parts. In doing so, teams have demonstrated that DDM-produced parts have the 
quality and durability to meet the demands of racecars of all types. This trend has 
been developing for years, but until recently, teams have held DDM as a closely 
guarded secret that gives them a competitive edge. Figure 5 shows an example of 
hood vents for sports car fabricated using DDM. 

Although the racing world has revealed DDM’s role in reaching peak perfor- 
mance, the general public will only see the tip of the iceberg. Many AM production 
applications involve innovative designs that teams will not disclose. Further exam- 
ples of automotive components fabricated using DDM are shown in Figs. 6 and 7. 
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Fig. 4 MINI John Cooper 
Works WRC (Photo courtesy 
of Prodrive Ltd.) 



Fig. 5 DDM hood vents 
(Courtesy of Prodrive) 




Fig. 6 Automotive factory fixtures 
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Fig. 7 Automotive duct 
fitting 



DDM Case Study: Design and Fabrication of a Calibration 
Fixture for Airplane Control 

In this project, design and fabrication of a calibration fixture, which is used to 
calibrate the analog control of an airplane, control stick (Fig. 8) of microlight 
aircraft was carried out by DDM using fused deposition modeling (FDM). 

Control sticks, used to be called joysticks, are used on most 3-axis microlights. 
The aircraft is controlled by deflection of flight control surfaces. These are hinged 
or movable surfaces in which the pilot adjusts the aircraft’s attitude during takeoff, 
flight maneuvering, and landing. The flight control surfaces are operated by the 
pilot through connecting linkage to the rudder pedals and a control yoke. The pilot’s 
link to the control surfaces is by the use of the control stick and rudder pedals 
(Iskandar and Zuhairi 2012). It is important for the control sticks to be calibrated 
routinely for the pilot’s control effectiveness of the plane. 


Design for DDM 

The design priority is based on accuracy and compatibility followed by efficiency 
and human ergonomics. The reason of choosing these two as the main priority is 
that the current practice of using floor-mounted handmade fixture is not only 
inaccurate but also bulky and taking up the working space inside the aircraft, 
constraining space to move about during the measuring process. DDM allows the 
design of this product to be accurate and compact. The design was focused on 
efficiency; as such, it will be fast, easy, and no other additional parts needed to 
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Fig. 8 Dual control stick system 


assemble the product. Lastly, for human ergonomics, it was focused on easy 
handling and also easy installation without any force needed to apply from the 
technician, additional tools needed to install, or any uncomfortable position in 
installing the product. With this set of focuses, the product becomes fast to 
fabricate, efficient, and easy to use. 


Fused Deposition Modeling (FDM) 

The machine used to manufacture the product was Fortus 900mc (Fig. 9) which is 
the largest FDM machine in Nanyang Polytechnic. The FDM machine’s product 
manufacturing area is 914 x 610 x 914 mm. 

For the FDM process, the thermoplastic material ABS-M30 (acrylonitrile buta- 
diene styrene) was selected to manufacture the product. ABS-M30 is stronger than 
standard ABS and has greater tensile, impact, and flexural strength. It is an ideal 
material for conceptual prototyping, design verification, and direct digital 
manufacturing. Layer bonding is significantly stronger than that of standard ABS 
for a more durable part. This results in more realistic functional tests and higher 
quality parts for end use. When combined with FDM process, ABS-M30 gives real 
parts that are stronger, smoother, and with better feature detail (Copy right 2013). 

The FDM technique has particular tool paths to fill one layer. The layer step or 
slice thickness is controlled by the nozzle tip size. The smaller the nozzle tip size, 
the finer will the plastic be extruded out from the nozzle tip. The FDM machine 
material extrusion nozzle tip used for the process was T16 (0.254 mm) tip. 
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Fig. 9 Fortus 900mc 



FDM machine uses STL file (Standard Triangulation Language) to read the part and 
convert it into machining action. The part needs to be orientated to the best position 
considering critical geometries, strength, surface finish, and support requirement. 


Performance of DDM Calibration Fixture Produced by FDM 

The fabricated fixture is light in weight for better handling, small in size to save 
working space as it is to be mounted onto the pilot seat in the limited space in the 
cockpit, and incorporated with digital gauge mounting features for accurate cali- 
bration (Fig. 10). The screw holes were tapped on the FDM part after the fabrica- 
tion. The fixture has a secured mounting in one setup which also saves time not only 
in setting up but also during inspection. It also gives a stable measuring position. 
This method gives a precised measuring process where accurate results are 
achieved, and this was tested and proven by performing actual inspection on the 
aircraft. 

The calibration fixture was paired with aileron and elevator holders as shown in 
Fig. 1 1 to restrict any unwanted movement during measuring process. This combi- 
nation enables accurate results to be obtained, providing increased productivity in 
airplane inspection and maintenance job. 


Benefits of DDM Calibration Fixture 

Comparing between existing fixture and DDM fixture (Figs. 12 and 13), it is 
surprising to see that there are many advantages added onto the new fixture: 
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Fig. 10 Calibration fixture 
produced by DDM 




Fig. 1 1 Aileron and elevator holders 



Fig. 1 2 Existing and improved fixtures 
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Fig. 13 Existing and improved fixtures on-site 

(a) The DDM fixture is much lighter, smaller, and compact. 

(b) The structure is much stronger because it can rest and be tightened on the seat frame. 

(c) Lesser material used resulting in weight reduction. 

(d) Manufacturing time has been reduced. 

(e) Human ergonomics are incorporated. 

(f) Better locking and handling capability. 

(g) It is more appealing to see due to better aesthetic design. 

As the DDM fixture is smaller, stronger, and compact in size, it saves working 
space as it is mounted onto the pilot seat. This product has the ability to provide 
secured mounting in one setup, which results in better stability with stable measuring 
position. This method gives a precise measuring process where accurate results are 
achieved when tested for the actual inspection on the aircraft. The maintenance team’s 
feedback is that the new DDM calibration fixture enables them to measure and adjust 
micron tolerances of the control stick system. It has eliminated measurement problems 
and increased their productivity in the inspection and maintenance job. 


Summary 

This chapter exposed several areas of opportunity for direct digital manufacturing; 
the size of the opportunity is up to the individual designer to apply to their real- 
world situation. The future of DDM holds many more advancements in materials, 
machines, and process improvements. 
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Abstract 

Additive manufacturing is dependent on three-dimensional (3D) data to produce 
and inspect parts fabricated by the additive manufacturing process. Reverse 
engineering provides methods to generate the required 3D data. 3D data is 
generated by making a series of measurements and reducing that information 
into a 3D model of some nature. In the simplest form, a scale and a computer- 
aided design (CAD) software program can be used to measure features and then 
reproduce them as a 3D CAD model. However, many objects are very complex 
and would require a significant number of measurements and interpretations to 
produce a well-defined 3D model. Manual measurement of complex items can 
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result in inconsistent results from part to part or operator to operator. So, a more 
robust and repeatable method to measure parts is desirable. 

Metrology is the science of measurement, defined by the International Bureau 
of Measures as “the science of measurement, embracing both experimental and 
theoretical determinations at any level of uncertainty in any field of science and 
technology” (What is metrology? BIPM. Retrieved 01 Dec 2011. (2004)). The 
manufacturing industry uses the term “metrology” to describe the generic group 
of devices used to make measurements. There is a wide array of metrology 
devices and associated software to support both reverse engineering and inspec- 
tion of parts. Metrology devices are selected based on the application require- 
ments, like desired data accuracy, data density, contact measurements verses 
noncontact measurements, part size, part surface properties, and part access. 
Likewise, reverse engineering software is selected based on the application 
needs, like operator skills and the level of automation desired. This chapter 
will provide insight into fundamental considerations for metrology and software 
to support both inspection and reverse engineering of additive manufactured 
parts. 


Introduction 


“Additive manufacturing” is a process of sequentially depositing layers of material 
to additively construct a desired part. The 3D model of a desired part is broken 
down into a layer by layer definition of the part by specialized software dependent 
on the additive manufacturing technology. Therefore, high-quality 3D models are 
important to utilize additive manufacturing effectively. 

“Reverse engineering” utilizes metrology to capture 3D geometry data and 
specialized software to manipulate and interpret that data into usable information. 
This information can be used to create 3D models where they did not originally 
exist. Or alternately, reverse engineering can be used for inspection of additive 
manufactured parts. Both applications have important implications to the “digital 
thread” that can be established and maintained for a product. 

A digital thread is the concept where a part or product has a digital definition that 
can be leverage throughout the life cycle of the part or product. This enables users 
the opportunity to utilize digitally driven processes for manufacturing, inspection, 
and marketing. Later in a product’s life, modifications and upgrades can leverage 
this digital data stream to create new products or modifications more efficiently 
than past processes. Companies are seeing reduced time to market and realizing the 
ability to expand into high value mass customization markets base on these 
capabilities. 

The reverse engineering process can be broken into two fundamental steps, 
measurement and then processing. Measurements are taken with metrology sys- 
tems. Data accuracy and density along with part features and surface properties are 


factored into determining the right systems for a given task. Metrology systems can 


collect a wide array of data via three primary methods, touch probe, surface 
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scanning, or volume scanning. Users must decide if feature-based data or dense 
point cloud data is applicable for the desired application. These decisions will 
assure the best metrology system is selected for measurement. Based on the type 
of data that is collected and the task being performed, users must also select 
processing software that can accomplish the appropriate tasks with the collected 
data. Pulling a dense point cloud (containing millions of points) into a standard 3D 
CAD package is not recommended. However, utilizing specialized reverse engi- 
neering software to extract surfaces or interpret features from a dense point cloud 
provides a very robust and efficient solution. 

This chapter will provide insight into the interactions between reverse engineer- 
ing metrology, software, and additive manufacturing methods. Knowing these 
interactions will enable users to better utilize reverse engineering for additive 
manufacturing -related applications. As a result, users will improve outcomes and 
yield more consistent results. 


Reverse Engineering Overview 

This article will focus on how reverse engineering processes can be utilized in 
conjunction with additive manufacturing. Reverse engineering is a process that is 
comprised of two major elements, data collection (measurement) and processing 
(interpreting the measurements into meaningful information). Metrology is used to 
collect the measurements and must be carefully selected based on the subject being 
measured and the desired information about that subject. A specialized software is 
often required to handle the data sets gathered by the metrology systems. 

In addition, there are two primary reasons reverse engineering is utilized in 
conjunction with additive manufacturing. The first reason is for the creation of 3D 
CAD models that can be utilized by the additive manufacturing systems. Some 
common examples are scanning existing parts to produce additively manufactured 
tooling (holding fixture for complex part), creating new models from the original 
part to enable modification (repairs to damaged parts), creating new parts that work 
in conjunction with the original part (cell phone covers). The second reason reverse 
engineering is utilized is to inspect additively manufactured parts. The intended use 
of the reverse engineering data will influence the processes used as well as the 
selection of metrology and software. 

Reverse engineering begins with a series of measurements. These measurements 
can be structured (feature based) or more random in nature (point cloud scan data). 
The structured measurements are well understood and work well with simple 
feature-based parts, like a block, that can be fully defined by three length measure- 
ments. However, more complex parts require significantly different measurements 
to fully define them. For example, an organically shaped item, like an apple, 
would be very difficult to define with length measurements. In this case, the 
measurement of many points on the surface of the apple can fully define the surface 
of the apple and provide an accurate representation that can be used for modeling or 
inspection. 



2488 


B. Macy 


Data Types 

Measurement data can be collected in a number of different forms depending on the 
metrology used and the application requirements. These can be grouped into three 
fundamental categories: 

1. Single Point - points are measured discretely based on individual operator 
commands. They are used when the exact location of a measurement needs to 
be controlled. 

2. Point Clouds - points are measured by the metrology system based on require- 
ments commanded by the operator. Point clouds can range from a series of 
points collected by a probe swept across a part to very dense scans of a part 
surface. The files can be comprised of millions of measured points. These are 
used when the part complexity increases and single point measurements are not 
sufficient to characterize the part surface. 

3. Features - Many of the metrology systems have the capability to take single 
point or point cloud information and extract a feature-based measurement. For 
example, three or more points measured around a hole can define the hole’s 
diameter. Alternatively, the same three or more points could be fit to a plane. 
Depending on what the operator is intending to measure, the system can provide 
valuable feedback about the measurement. Say, the operator measures ten points 
of what was supposed to be a plane. The system can provide immediate feedback 
on the position and orientation of the plane as well as how well all ten points 
correlated to the plane. 

Some additive manufacturing systems can produce parts with a variety of colors. 
For this reason, it is prudent to note that some metrology systems can also capture 
part colors. Matching the captured colors to the additive manufacturing system’s 
color pallet can be limited, but it is helpful in some applications. 


Data Collection 

The types of metrology to collect measurements are not limited and new metrol- 
ogies are arriving every year. When selecting a metrology system, the first consid- 
eration is whether internal features are required. If internal features are critical, then 
an appropriate metrology should be selected based on part compatibility with the 
technologies, accuracy requirements, and whether destructive measurements tech- 
niques are an acceptable option. 

1 . Nondestructive Internal Metrologies 

(a) Computed Tomography (CT) Scan - It is also known as a CAT (computer 
axial tomography) scan. It is a medical imaging method that employs 
tomography. Tomography is the process of generating a two-dimensional 
image of a slice or section through a three-dimensional object (a tomogram). 
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The CT scanner uses digital geometry processing to generate a three- 
dimensional (3D) image of the inside of an object. The 3D image is made 
after many two-dimensional (2D) X-ray images are taken around a single 
axis of rotation - in other words, many pictures of the same area are taken 
from many angles and then placed together to produce a 3D image. 

(b) Magnetic Resonance Imaging (MRI) Scan - A strong magnetic field is 
created by passing an electric current through wire loops. While this is 
happening, other coils in the magnet send and receive radio waves. This 
triggers protons in the part to align themselves. Once aligned, radio waves 
are absorbed by the protons, which stimulate spinning. Energy is released 
after “exciting” the molecules, which in turn emits energy signals that are 
picked up by the coil. This information is then sent to a computer which 
processes all the signals and generates it into an image. The final product is a 
3D image representation of the area being examined. 

2. Destructive Internal Metrology 

(a) Cross-sectional Geometry Inspection (CGI) - Parts are encapsulated with a 
resin to act as a holding fixture and stabilize part features. Then the part is 
carefully machined one layer at a time to expose the cross sections of the part 


at each layer. Each cross section is digitized by a vision system and recorded 


based on the layer it represents. 


When internal features are not critical, there is a wide variety of metrology 
systems available to capture external geometry. These systems can be fundamen- 
tally broken down into two primary categories, contact and noncontact: 


1. Contact 

(a) Coordinate Measurement Machines (CMM) - These systems come in two 
main form factors “bridge” and “portable arms” that provide a wide range of 
accuracies, measurement volumes, portability, and access to hard to reach 
features. However, they all function based on similar principles. Degrees of 
freedom are provided by a series of mechanisms that have precision mea- 
surements incorporated to monitor positional movements. These individual 
mechanism measurements are reduced into 3D coordinates of the device’s 
probe that has been positioned where a measurement should be conducted. 
Bridge CMM systems are often automated so that program can be written to 
make repetitive measurements on multiple parts. Portable arm CMMs are 
manually articulated and provide a wide range of measurement access and 
the ability to bring the “metrology to the part” as opposed to the “part to the 
metrology” in the case of bridge CMMs. 

(b) Laser Tracker - A precision laser head is placed in the measurement 
envelop, typically on a stable tripod, with an unobstructed view of the part 
to be measured. An optical target, referred to as a “retroreflective” target, is 
used to lock on to the laser. As the optical target is moved, the laser head 
follows the target within the measurement volume. The optical target is 
placed against a part and a measurement is commanded by the operator. 
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Laser trackers can provide very accurate measurements over very large 
measurement envelops. 

(c) Optical Trackers - These systems use two cameras placed at a fixed distance 
from each other to triangulate the location of known target within the 
measurement volume. Varieties of target styles exist and each have their 
own benefits and weaknesses. Clusters of optical targets can be mounted to a 
probing device. Once the probe is positioned at a measurement location, the 
system is commanded to record the location of the probe. One of the primary 
benefits of the optical system is that it is not limited to maintaining line of 
sight with the cameras. If the targets become obscured at any time during a 
measurement, the cameras will reestablish connection with the targets once 
they are visible again. 

2. Noncontact 

(a) Photogrammetry - The process of making precise measurements based on a 
number of photographs taken from various camera angles. Known scale 
references are placed in the scene along with standardized targets that the 
software can readily detect for processing. This process is useful for large 
parts where accuracy over a large measurement volume is important. 
Although many points can be captured with this process, it is typically 
used to establish large volume coordinate systems and structured point 
clouds to be used in conjunction with structured light systems. This system 
requires everything in the measurement volume of interest to remain stable 
during the entire photogrammetry session. 

(b) Structured Light - This process uses a pitch and catch method triangulate 
points in 3D space. Projectors located a known distance from a camera, 
project known light patterns on the part. Software uses triangulation algo- 
rithms to calculate 3D measurement data based on the changes in the known 
patterns. The type of projected pattern determines the speed, accuracy, 
density, resolution, and surface sensitivity of the acquired data. 


(i) Speckle patterns can be used to acquire data very quickly, but the data density 
and accuracy is limited. For shop environments where vibration is likely to 
cause movement, fast data acquisition can be the difference between success 
and failure. 

(ii) Interference methods use a series of patterns that are projected onto the parts 
and the resulting fringe patterns are reduced to 3D measurements. Interference 
methods can produce high accuracy with very dense point clouds. The price to 
be paid for high accuracy and dense data is acquisition time. These systems are 
getting faster all the time, but typical capture rates are measured in minutes. 
This limits these systems to parts that can be well stabilized. Otherwise part 
movement can exceed the accuracy of the system and measurements will not 
resolve. 

(iii) Laser line scanners use a laser to project lines on the part. These systems can 
be handheld or automated. In either case, the scanner needs to be in 
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conjunction with something that can provide the global position of the scanner 
head in relation to the part while scanning. For this reason, laser scanners are 
often combined coordinate measurement machines. 

Combined Technologies - Metrology styles are often combined to provide the 
benefits of each technology is one more capable system. A common example of this 
is a portable arm CMM with both a probe and laser scanner end effectors. This 
enables an operator to use the probe to measure specific locations and then the laser 
scanner to collect high density surface data. This combination of technologies has 
similar measurement envelop size as compared to the additive manufacturing build 
envelop size. It also provides measurement capabilities that are compatible with 
most additive manufacturing systems and materials. For these reasons, portable arm 
CMM’s with probe and scanner options are often utilized in conjunction with 
additive manufacturing. 


Metrology Considerations for Part Measurement 

Regardless of whether the desire is to digitize an existing part for 3D CAD 
modeling or inspection of an additive manufactured, both must consider the char- 
acteristics of the subject part and how they will impact the selection and usage of 
the appropriate metrology. Part size, mobility, color, and surface characteristics all 
must be considered to properly match the right metrology to the task. In addition, 
data accuracy, resolution, and types must be assessed to properly select metrology 
and avoid collection of too much or worse too little data. For these reasons, a data 
collection plan is critical to properly prepare for a successful outcome. 

Ideally the selected metrology would be able to accommodate digitizing the 
entire part in a single setup. However, this is often not the case because of size, line 
of sight, or access to the areas on which the part is being held for data collection. For 
this reason, it is important to have a data collection strategy for all of the desired 
part features and how those results will be secondarily combined. If the size of a 
part exceeds the data collection envelop of a specific metrology, then a practice 
called “Leap Frog” can be used to expand the data collection volume. This practice 
utilizes common points, or features, within overlapping sections, of two separate 
data collection envelops, to piece the individual data collections together. Devel- 
oping a measurement plan and preselecting, or even adding these features, will 
improve the quality and success of this process. Combining metrologies can 
improve accuracy of very large parts. Laser trackers and photogrammetry can be 
used to accurately value features or targets over a large part. Then local scanning or 
probing metrologies can use these accurately valued features or targets to create a 
local reference system for the individual measurements that is accurately combined 
with the global data collection plan. 

A part’s surface characteristic can affect the results of data collection depending 
on the metrology being used. Probes must be selected based on the desired feature 
resolution. For example, additive manufactured parts have layer lines that are a 
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Fig. 1 Probe style impacts 
on additive manufactured part 
measurements 



Point probe on crown 
Point probe in valley 

Ball probe bridging 


byproduct of the manufacturing process. Figure 1 shows how two probe styles 
could result in different measurements on an additively manufactured part. In most 
cases, the crown of the additively manufactured part is the most accurate feature of 
the part. For this reason, it is desirable to use larger radius probes to span the crowns 
of additively manufactured part layers. This practice results in the most accurate 
representation of the parts build. 

When light-/laser-based metrologies are utilized, care must be taken to make 
sure that the color, translucency, and reflectivity of the part do not adversely affect 
data collection and accuracy. Significant progress has been made to reduce the 
sensitivity of light-based metrologies, but care must be taken to assure these issues 
do not manifest themselves in resulting data. A part with multiple colors that can 
result in artificial steps is the data. This is caused by the change in absorption of 
light resulting in a change in the reflected intensity at the sensor. As a result, the 
sensor perceives a relative difference, and the results will reflect a step or drop of 
data depending on the system and part characteristics. Similarly, part translucency 
allows the light to pass through the top surface of the material and does not return a 
strong reflection. In the case of clear epoxy composite parts, it is common that light- 
based systems will pass through the epoxy and image the fabric matrix inside the 
laminate. A quick check for translucency is to shine a laser pointer on the part. If the 
spot on the surface of the part is similar to the spot when shined on a piece of plain 
paper, then translucency is low and a good result can be expected. If the laser spot is 
very large and haloed, the translucency may be a factor during data collection. 
Highly reflective surfaces are problematic as well. The use of a spray talc power, 
referred to a developer in the dye penetrant inspection industry, is often used to 
lightly coat the surface of the part and produces a more consistent matte finish for 
data collection. Once the data collection is complete, the talc can be blown or 
washed away. Alternatively, scuff sanding, priming, or painting can also improve 
the surface of problematic parts if necessary and accommodating to downstream 
use of the part. 

The desired resolution and accuracy must also be considered to pick the appro- 
priate metrology. With respect to reverse engineering, resolution can be thought of 
as the density of points that are used to describe a part or model. In Fig. 2, a circle is 
shown in three different resolutions. As more points are added, the circle becomes 
more clearly defined. In Fig. 3, the same circle is represented by points with 
different levels of accuracy. As the points get more accurate, the group of points 
looks more like a circle. Obviously, high-resolution and high-accuracy data is the 
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Fig. 2 Various resolutions of a circle 


Fig. 3 Impacts of point 
accuracy 



Low Accuracy High Accuracy 

Actual part shape 

Measured data 


most desirable, but that comes at a cost. As a general rule, increasing resolution 
increases metrology cost, processing time, and the need for specialized software to 
handle the large volumes of data. With increasing accuracy requirements, metrol- 
ogy cost increase and sensitivities to data collection setup and execution become 
more critical. So, it is important to prudently assess resolution and accuracy needs. 


CAD Model Creation to Support Additive Manufactured Parts 

In this scenario, a 3D CAD model does not exist for the desired part to be additively 
manufactured. It is assumed that a physical part exists and that it is of sufficient 
quality to be considered the master definition of the desired part. The part should be 
evaluated for external and internal features. If internal features are required, then an 
appropriate metrology should be selected to obtain these features. Note that if a 
destructive data collection method must be used, it is advised that an external data 
collection process is conducted in advance. This will provide reference data that can 
prove useful to validate the destructive data collection results and fill in any external 
data voids if needed. 
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Fig. 4 Resolution impacts on 
feature definition 
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The next step is to assess the necessary resolution and accuracy of the data 
collection. As a rule of thumb, resolution should be one order of magnitude better 
than the smallest feature of interest. For example, if the smallest feature is a radius 
of 0.0 10", then the ideal density of data on this radius would be points spaced at 
0.00 1". This provides enough resolution to clearly identify whether the feature is a 
radius verses a chamfer or a corner; see Fig. 4. Accuracy is also a factor in this 
equation. In the last example, if the accuracy was +/— 0.010", then the measurement 
error could exceed the size of the feature and the noise in the collected data could 
completely mask that the feature exists. Like resolution, the rule of thumb is that 
accuracy is one order of magnitude better than the smallest feature of interest. 
Additionally, data collection accuracy should also be evaluated against the desired 
accuracy of the project. In other words, if the smallest feature was a 0.50" radius, 
then the desired accuracy for feature capture would be <0.050". However, to fit up 
with surrounding parts, the overall accuracy of the part needs to be 0.010". In this 
case, it would be desirable to collect data at 0.001" accuracy. Remember that 
further processing is required to achieve a final model, so it is preferred to provide 
some margin for any error in final processing. 

Having assessed desired part features, resolution needs, and accuracy require- 
ments, a proper metrology can be selected to collect the desired data. Care should 
be taken to make sure that the metrology also satisfies any surface issues with 
respect to color, translucency, and reflectivity for the desired part. A data collection 
plan is suggested. This plan should map out how many individual measurements 
that will be required to collect all of the desired features. If multiple metrologies are 
required to properly capture the part, a plan for how the resulting data will be 
registered to each other in the post processing software is best defined in advance. 
Many times, it is desirable to add special features, like tooling balls, into the data 
collection to enable alignment of the individual dataset after collection. In addition, 
large volume metrologies like laser trackers can be used to establish a global 
coordinate system. Individual metrologies can use the global coordinate system to 
locate each individual data collection into the larger global coordinate system 
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during data collection. Either method has its benefits and limitations. The key is to 
have a plan established so that individual data sets can be assembled accurately. 


Post Processing 

Once the data has been collected, post processing begins. Post processing consists 
of reducing the collected data sets into usable data that can be consumed by the 
additive manufacturing system. At this point, two options exist. The first option is to 
combine the point data into one file and convert into a water-tight polygonal mesh. 
The second option is to use the collected data to extract desired features like holes, 
planes, and surfaces to support the creation of a solid model. Both options have 
benefits and drawbacks dependent on part type and desired downstream needs. 
Figure 5 provides an example of a typical workflow. 

Polygonal Mesh 

A polygon mesh is a collection of vertices, edges and faces that defines the shape of 
a polyhedral object in 3D computer graphics and solid modeling. The faces usually 
consist of triangles, quadrilaterals or other simple convex polygons, see Fig. 6 
(Polygonal Mesh Definition). Converting the measured data into a polygonal mesh 
retains the most accuracy representation of the original part. However, this also 
means that any inherent imperfections that exist in the physical part will be 
translated to the model. When multiple parts are being reverse engineered, these 
imperfections can result in conflicts when the parts are used in conjunction with 
each other. In addition, a polygonal mesh needs to be water tight. This means that 
all holes in the data set must be filled in before the model is usable. When parts have 
features that cannot be fully captured by the data collection process, this can result 
in significant post processing time to fill holes in the data set. A typical case is 
shown in Fig. 7 (Yongtae Jun 2005). 

Care should be taken to assure that when collected data is converted into a 
polygonal mesh, software settings are selected to maintain desired levels of accu- 
racy and resolution. In addition, it is beneficial to reduced unnecessary data to 
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Fig. 5 Typical reverse engineering workflow 
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Fig. 6 Polygonal mesh 
example 




Fig. 7 Example of filling holes in a polygonal mesh 


minimize the size of the resulting models. Some additive manufacturing systems 
have difficulty processing very large files (files in excess 100 megabytes). A typical 
situation is when a part with fine features is captured with very high density data to 
resolve the fine features. However, the part also has large areas of flat surfaces. 
These areas can contain millions of points and result in very large files. Most post 
processing softwares can filter the collected data based on accuracy and rate of 
curvature and result in significantly reduced file sizes. Mesh reduction filters will 
maintain data for fine features and eliminate unnecessary points in the flat areas; see 
Fig. 8 (Mesh Reduction, Polyworks V10 Beginners Guide). This is a good practice 
and will result in files that are more compatible with additive manufacturing 
process software. 

Solid Model 

Solid models are the preferred output when capturing the design intent of the existing 
part is the desired outcome. In this case, it may require collected data from multiple 
parts to understand the acceptable tolerance limits and nominal feature definition. 
In some cases, it is helpful to collect data from mating parts as well. 
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before compression 


after compression 


Fig. 8 Example of polygonal mesh reduction base on feature complexity 




Fig. 9 Example of cross sections from collected data to support tradition solid modeling software 

To produce a solid model, collected data is interrogated for insight into feature 
size, location, and relations to other features. In this case, reverse engineering 
software is used to process the collected data into features like lines, circles, splines, 
planes, and surfaces that can be used by CAD software to create a solid model; see 
Fig. 9 (Cross-Section Extraction, Polyworks V10 Beginners Guide). These solid 
models can then be readily modified to meet a number of downstream needs. The 
drawback, depending on the complexity of the subject parts, is that this process can be 
labor intensive and can result in significant deltas from the original part that are 
operator judgment calls. Following the modeling exercise with an inspection of the 
resulting solid model back to the collected data is a good practice. Knowing where 
the deviations are and how extreme they are helps to make informed decisions on the 
final solution. 

Model Validation 

Regardless of which modeling process is used, polygonal or solid, inspection of the 
model back to the collected data is a good practice. Utilizing reverse engineering 
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software packages, the user can inspect the error between the collected data and the 
resulting solid model. This is a good practice and often identifies mistakes that were 
made during model creation. Having this report as a record to define the pedigree of 
the model is good practice to follow. 

Model Translation 

The final step is to export a file in a format that is compatible with additive 
manufacturing systems. STereoLithography (STL) format is the universally utilized 
format by additive manufacturing systems. Again, care must be taken to make sure 
that the processing software has the proper setting for export. Typically, accuracy 
and angle deviation are settings that can be adjusted to increase or decrease the 
resulting STL file fidelity. These settings should be checked prior to export and 
proper values assigned to obtain the desired output. 

ASTM International, formerly known as the American Society for Testing and 
Materials (ASTM), is a globally recognized leader in the development and delivery of 
international voluntary consensus standards (http www astm org ABOUT 
aboutASTM html). In 2009 ASTM International formed the F42 Committee on 
additive manufacturing. This committee is working on a new standard to replace the 
STL standard format. The new format standards are defined by F2915-12 Standard 
Specification for Additive Manufacturing File Format (AMF). This new format is 
being adopted by most additive manufacturing systems, Reverse Engineering Soft- 
ware, and CAD modeling software. AMF file format will ultimately replace STL. 


Part Inspection of Additive Manufactured Parts 

For part inspection, the collected data can be used to inspect a part to either an 
existing model or to collected data for a second part. Comparing collected data to a 
model provides information about how accurately the model representation has 
been reproduced by the additive manufacturing system. Comparing collected data 
of multiple parts provides insight into the variation of the additive manufacturing 
system for a particular part type. Figure 10 provides a typical inspection work flow. 
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Fig. 10 Typical inspection workflow 
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Data Collection 

In a classic inspection process, the controlling drawing or model defines the data, 
features, and tolerances of interest. Identifying which of these items must be 
measured and the required fidelity will establish the selection criteria for the 
metrology. In addition, the additive manufactured part’s unique surface character- 
istics discussed in the Metrology Considerations section above must be evaluated 
and matched to the appropriate technology. 

Establishing a measurement plan assures that the data collection process 
acquires the appropriate data to complete the desired inspection. The measurement 
plan should map out the sequence of data collection steps. When multiple data sets 
are required to capture the required data, a strategy to correlate individual data 
collection sets must also be defined. With the appropriate metrology selection and 
measurement plan defined, data collection can be conducted. 


Reference Data Import 

Reference data is selected based on the type of inspection being conducted. When 
inspecting the accuracy of an additively manufactured part, the originating solid 
model should be used as the reference standard. If the inspection is looking at part to 
part deviations, then multiple data collection sets are imported and compared. Most 
reverse engineering and inspection software packages can import CAD software 
formats, STEP, IGES, Parasolid, STL, and point data. Care should be taken to 
assure that reference data is of sufficient quality to achieve the desired inspection 
results. 


Alignment 

Once the part data has been collected and the suitable reference data has been 
imported, the two sets of data must be oriented relative to each other. The two basic 
methods are best fit and feature/datum alignment. There are many subtle variations 
to these two methods, but understanding how these are fundamentally utilized is 
important. 

Best Fit 

This is often the first step in an alignment process because simplified interfaces 
have been developed to solve for an initial look at how the two data sets compare. 
Care must be taken to select unique features that are distinct to help orientate the 
data sets in a meaningful manner. Figure 1 1 shows a typical example of two data 
sets being best fit to each other (Geomagic Manual). 

The “best fit” alignment attempts to evenly distribute the errors between two 
compared data sets. However, selected reference points and gross deviations in the 
two data sets can result in undesired consequences. Care must be taken to evaluate 
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Fig. 1 1 “Best fit” alignment example 


the results of a “best fit” alignment to make sure that the results were clearly 
interpreted by the software. For example, if a slightly undersized block is aligned 
by using a “best fit” method, the error between the undersized block and the 
reference block would be evenly distributed across all the surfaces. Therefore, 
the reported error on any one surface of the block would be half the total error 
over the full length of the block. 

The “best fit” alignment method is ideal for parts that have organic shapes. 
In additive manufacturing applications, part warp and shrinkage are issues that need 
to be characterized. Inspection reports based on “best fit” alignments can provide 
helpful insight into these behaviors. 

Feature/Datum Fit 

When additive manufacturing is used to produce prismatic parts that must fit into 
secondary assemblies or be utilized in conjunction with other parts, feature-/datum- 
based alignments should be utilized. When primary, secondary, and tertiary data 
have been established for a part, alignment should be conducted based on these 
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features. A majority of part designs utilize geometric dimensioning and tolerance 
(GD&T) standards; these dimensions and tolerances are based on the designated 
part data. By aligning the collected data sets accordingly, the reverse engineering 
software can report GD&T compliant results for features that have been extracted 
from the collected data sets during processing. 


Reports 


Once the collected and the reference data have been aligned, the report can be 
generated. Reports come in many variations based on the needs of a particular part 
and the inspection requirements. But, they can be grouped into two main types, 
tables and deviation plots. 

Tables are used to report discrete details about measured features. These include 
feature details like nominal, measured, and deviation from nominal values. 
Figure 12 provides a typical example of a table report (Cogniten’s Table Report 
Example). Colors can be used to readily distinguish when deviations exceed 
predefined part tolerances. In addition, views of the part can be incorporated into 
the report to help downstream users clearly understand which dimensions correlate 
to which features. 

Color deviation plots are used to display the differences between dense point 
cloud data and correlating reference models. Figure 13 provides a typical example 
of a color deviation plot (Geomagic Qualify Color Deviation Plot Example). These 
reports are helpful to identify global behaviors of a part. It can be difficult to 
identify or communicate relationships in a table of measured dimensions. Deviation 
plots readily display how the surface of the inspected part is behaving. For additive 
manufactured parts, these reports are very helpful to quantify and visualize part 
shrinkage and warp problems. Having this information enables operators to take 
appropriate actions to eliminate, or at a least minimize, these issues. 


Summary 

Reverser engineering is a critical tool used to assure the successful execution of 
additive manufactured parts. Up front, reverse engineering can be used to create 
geometry that is consumed by the additive manufacturing systems to produce parts. 
Secondarily, reverse engineering can be used to inspect the parts fabricated by 
additive manufacturing. Care must be taken to appropriately match data collection 
metrology to the unique characteristics of an additively manufactured part and the 
required accuracies for a specific task. Behaviors common to additive manufactured 
parts, like shrink and wrap, can be quantified and readily studied with reverse 
engineering tools. Quantified understandings of these behaviors are helping 
advance additive manufactured parts into higher requirement applications. 
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Fig. 12 Example of a tabular report for measured features 
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Abstract 

Rapid prototyping (RP) is any process utilized to quickly fabricate a physical 
prototype or scale model of a design directly from three-dimensional computer- 
aided design (CAD) data. The additive RP process allows designers to create a 
three-dimensional geometry directly from CAD data by slicing that geometry 
into finite layers and fabricating the prototype one layer at a time. The processes 
are also referred to as additive manufacturing because parts are created by 
adding each fabricated layer to the whole part, as opposed to subtractive 
manufacturing wherein material is removed in some fashion from a solid 
medium. There are many forms of rapid prototyping: additive manufacturing 
(AM), also known as three-dimensional printing (3DP or 3D printing); rapid 
molding or quick injection molding; rapid machining or rapid CNC; and several 
other technology classes. The increasing prevalence of additive technologies on 
the market has caused international standards organizations to begin creating 
common terminology and standards for these technologies. The ASTM Interna- 
tional Committee F42 on Additive Manufacturing Technologies has categorized 
the various forms of additive manufacturing into seven groups: material extru- 
sion, material jetting, binder jetting, sheet lamination, vat photopolymerization, 
powder bed fusion, and directed energy deposition. Rapid prototyping applica- 
tions of additive manufacturing technology provide value to businesses in four 
primary ways: reducing time to market by compressing product design cycles, 
lowering cost of product development by exposing design problems earlier in the 
design cycle, improving creativity and innovation in product designs by increas- 
ing the ability to rapidly evaluate and iterate design concepts, and increasing 
competitive advantage by keeping design plans and data internal to the organi- 
zation until later in the product development process. The most efficient product 
development processes can utilize multiple prototyping technologies so that the 
right solution is available for each step in the product design process. 


Introduction 

As companies have become global competitors, research and development teams 
have been driven to develop products with less money, less time, and more 
innovation. Achieving this goal is not easy, especially considering the hurdles in 
the way of successful product launches. According to the study “3,000 Raw Ideas - 1 
Commercial Success” by Greg Stevens and James Burley, in addition to 3,000 raw 
ideas, a single successful innovation also requires 125 small projects, four major 
developments, and 1.7 product launches (Burley and Stevens 1997). 

In order to achieve these lofty goals, product designers and engineers need a new 
way to conceptualize and communicate new product designs. As global business 
continues to adopt three-dimensional computer-aided design (CAD) software, 
designers have been able to accelerate development. Moving from a 
two-dimensional drawing to a three-dimensional CAD part allows engineers and 
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designers to check the fit of their designs in assembly, test the function of their 
parts, and simulate loading scenarios to validate materials and designs prior to 
creating the actual prototypes. 

The introduction of three-dimensional CAD software compressed the design 
cycle, but the use of software alone remains limited. Communicating a three- 
dimensional design on a two-dimensional computer screen or printout is challeng- 
ing. Many people experience difficulty in visualizing how the physical part will 
look, estimating the scale and fit, and confirming the actual performance of the 
design when CAD software is the only tool available. CAD software reduces the 
need for prototypes to visualize a design, but it does not eliminate the need. 

In the past, designers and engineers would take their two-dimensional drawings 
of their design to a fabrication shop in order to create a prototype. The fabrication 
shop would have a variety of machining and milling tools as well as material stock 
on hand. The drawings would be reviewed and interpreted, and physical parts 
would be created according to the print, usually over a span of days or weeks. 
How well the prototype represented the actual design would depend on the skills 
and capabilities of the fabrication shop, the materials available, and the complexity 
of the prototype design. 

This process for creating prototypes was expensive, labor-intensive, and typi- 
cally required long lead times due to the complexity of creating the prototypes and 
the volume of work sent to the fabrication shop. The high cost and long lead times 
forced the designers and engineers to reduce the number of prototypes they 
requested and to slow down the development process in order to avoid mistakes. 
This process was not ideal for the high-paced and high-efficiency demands of 
designers today. 

The poor fit of traditional prototyping and the needs of designers and engineers 
created a market for a new process: rapid prototyping (RP). In general terms, RP is 
the process of quickly fabricating a prototype or scale model of a design directly 
from three-dimensional CAD data. An example of RP prototype from CAD data as 
shown in Fig. 1 is for the purpose of visualizing the designed embedded cooling 
channel before the actual fabrication of the insert. There are many approaches to RP 



Fig. i CAD to product 
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Fig. 2 Sliced file of a 
prototype 



on the market today, including rapid machining, rapid molding, and additive 
manufacturing (AM), also known as 3D printing. This text will take rapid 
prototyping to refer to the additive manufacturing process and will focus on that 
technology. 

The additive RP process allows designers to create a three-dimensional geom- 
etry directly from CAD data by slicing that geometry into finite layers and fabri- 
cating the prototype one layer at a time (Fig. 2). The processes is referred to as 
additive manufacturing because parts are created by adding each fabricated layer to 
the whole part, as opposed to subtractive manufacturing wherein material is 
removed in some fashion from a solid medium. In other words, subtractive 
manufacturing begins with a solid volume and removes material from that volume 
in order to produce the desired shape; additive manufacturing begins with an 
empty stage and fabricates many finite layers, the sum of which produces the 
desired shape. 

By embracing RP, designers and engineers can dramatically reduce the cost and 
lead time required to generate a prototype of their designs, meaning that more 
prototypes can be created in the same time and cost constraints. This allows 
businesses to produce better products more quickly, due to compression of the 
design process and increased iteration of design, yielding greater innovation and 
faster time to market. 


What Is Rapid Prototyping? 

Rapid prototyping (RP) is any process utilized to quickly fabricate a physical 
prototype or scale model of a design directly from three-dimensional computer- 
aided design (CAD) data. There are many forms of rapid prototyping: additive 
manufacturing (AM), also known as three-dimensional printing (3DP or 3D Print- 
ing); rapid molding or quick injection molding; rapid machining or rapid CNC; and 
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several other technology classes. This text will focus on the AM/3DP process for 
rapid prototyping, and further reference to RP will refer to this process. 

Additive manufacturing is a term that applies to variety of processes. The 
common trait among these processes is the fabrication of parts or models by slicing 
a three-dimensional CAD geometry into finite layers and creating the physical part 
by building each layer successively. This process is unique because it does not 
require tooling to produce a part and there is no need to stock materials of different 
volumes for machining. Whereas traditional subtractive manufacturing processes 
such as machining or milling start with a volume larger than the desired part shape 
and create the physical part by removing material from that volume, additive 
manufacturing starts with an empty stage and adds layers of material to fabricate 
the desired part shape. 

Additive manufacturing is also commonly referred to as three-dimensional 
printing, or 3D Printing. The label 3D Printing is more commonly utilized for 
systems and technologies intended primarily for rapid prototyping. As the name 
implies, additive manufacturing is expanding into other elements of the business 
process, including applications such as the manufacture of tooling and fixtures to 
the manufacture of parts for consumer products. 

Rapid prototyping first entered commercial availability in 1986, when Chuck 
Hull patented the first stereolithography (SLA) rapid prototyping system. With this 
patent in hand, Mr. Hull founded the company 3D Systems and began the rapid 
prototyping industry (3D Systems). 

Since the introduction of SLA (Fig. 3), many different AM technologies have 
been invented. Today, rapid prototyping is available in a diverse set of technologies 



Fig. 3 SLA system in Nanyang Polytechnic 
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and prices. Rapid prototyping systems can be found starting in the $ 100’s 
USD ranging to over $1,000,000 USD for a system. Each technology and 
machine offering is tailored to suit a specific set of needs, meaning that 
rapid prototype customers can find a specific solution for many of their rapid 
prototyping needs. 

Rapid Prototype Technologies 

With many specific technologies available and offered by many different compa- 
nies (Wohlers Report 2012 lists 33 companies as AM providers), the numerous 
trade names and acronyms can be confusing and difficult to follow. Fortunately, as 
the industry becomes more and more prevalent, some efforts are being undertaken 
to classify technologies and define terms. 

The ASTM International Committee F42 on Additive Manufacturing Technol- 
ogies has approved a list of AM process category names and definitions. The 
following bulleted list describes the ASTM-approved process terms and definitions, 
as worded in the Wohlers Report 2012 (Wohlers and Caffrey 2012): 

• Material extrusion - an additive manufacturing process in which material is 
selectively dispensed through a nozzle or orifice 

• Material jetting - an additive manufacturing process in which droplets of build 
material are selectively deposited 

• Binder jetting - an additive manufacturing process in which a liquid bonding 
agent is selectively deposited to join powder materials 

• Sheet lamination - an additive manufacturing process in which sheets of mate- 
rial are bonded to form an object 

• Vat photopolymerization - an additive manufacturing process in which liquid 
photopolymer in a vat is selectively cured by light-activated photopolymerization 

• Powder bed fusion - an additive manufacturing process in which thermal energy 
selectively fuses regions of a powder bed 

• Directed energy deposition - an additive manufacturing process in which 
focused thermal energy is used to fuse materials by melting as the material is 
being deposited 


Material Extrusion 

Material extrusion involves the physical dispensation of a material through a nozzle 
or liquefier in a selective manner. This technology requires physical motion of the 
extrusion system; most commonly, the extrusion system is mounted to a gantry of 
some form and is capable of two-dimensional motion, similar to a plotter pen. In 
order to print three dimensionally, either the substrate or the extrusion gantry 
typically can move along the third axis. 
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Fig. 4 Stratasys FDM Fortus 
900mc in Nanyang 
Polytechnic 



This technology builds parts by extruding each layer of material in vector-based 
motions, meaning the extruder physically travels along the defined layer vectors, 
extruding material where needed. The material deposited is typically a thermoplas- 
tic filament that is heated in the extruder and deposited to create the part. 

The material extrusion process was invented by S. Scott Crump, who patented 
the technology and founded the company Stratasys, Inc. Mr. Crump termed the 
technology as fused deposition modeling (FDM) and offered the first commercially 
available FDM system in 1991. Figure 4 shows a Stratasys FDM machine, Fortus 
900mc, which is also the largest available model. 

Material extmsion technology is commonly valued for its ability to produce tough, 
thermoplastic parts that are dimensionally accurate and stable, as well as the capability 
to print parts using high-performance, engineering-grade thermoplastics. Today, mul- 
tiple companies offer material extmsion technology-based systems, including several 
open-source projects that can be purchased as kit machines to be built by the customer. 


Material Jetting 

Material jetting technology takes advantage of inkjet technology to jet or deposit 
droplets of photopolymer or wax material. The inkjet print heads selectively deposit 
the material droplets as the print head passes over a build platform in a similar 
manner as two-dimensional inkjet printers deposit ink on paper to print an image. If 
the build material is a photopolymer, the print head typically includes a light source 
that cures the deposited resin as the part is being printed. 

The material jetting technology utilized in Objet system, branded as Connex, is the 
only additive technology capable of printing two different materials simultaneously. 
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Due to a variety of material offerings, this allows the Objet printers to print parts 
with multiple material properties and to mix materials while printing to create digital 
materials. 

Parts built with material jetting technology are known for their fine part resolu- 
tion and smooth surface finish, their ability to produce rubberlike materials, and 
their ability to produce clear materials. Objet is one of the primary producers of this 
technology, along with 3D Systems. Solidscape is a leading provider of material 
jetting technology that deposits wax-based materials. 


Binder Jetting 

Binder jetting is a process wherein a powder material is selectively bonded by a 
liquid bonding material. The binder material is deposited through inkjet technology, 
similar to the technology in material jetting systems. The material used for both 
binder and powder varies widely by manufacturer. 

Parts created by the binder jetting process are typically only loosely bonded by 
the binder. A post-process step of sintering or infiltration is usually recommended in 
order to reinforce the printed part. Machines produced by Z Corp. are among the 
only rapid prototype systems today capable of printing in full color by utilizing the 
inkjet heads to deliver color to the powder in addition to the binder agent. 

Part materials range from plaster-based powders offered by Z Corp. to acrylic 
polymer powders offered by Voxeljet to sand and metal powders available on 
ExOne printers. Binder jetting parts are valued for different reasons depending on 
the process, including full-color models, fine part resolution, and the ability to print 
large sand or metal parts. 


Sheet Lamination 

Machines that utilize sheet lamination technology produce parts by fusing or 
bonding sheets of material in layers to produce a three-dimensional object. The 
sheet material is typically either a paper-based material or thin metal strips. As each 
new layer is added, it is bonded or fused to the layers below and cut to produce the 
profile of the layer, making this technology unique in its combination of additive 
and subtractive manufacturing elements. 

Paper-based sheet lamination technology is primarily offered by the companies 
Helisys and Mcor Technologies. Paper-based lamination is one of the few technol- 
ogies able to compete with binder jetting technologies on the basis of color offering 
by printing the colors of each layer onto the paper sheet stock. 

Metal sheet lamination typically involves the bonding of two thin sheets of metal 
by selectively applying an ultrasonic welding process. Once the layers are bonded 
together, the profile is cut to produce the three-dimensional part. Fabrisonic 
achieves the cutting process by providing CNC-machining capabilities in the 
printer. 
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Vat Photopolymerization 

Vat photopolymerization, which includes the stereolithography (SLA) process 
invented by Chuck Hull, involves light- activated curing of material in a vat of 
liquid photopolymer. There are two primary processes for directing the light for 
curing the photopolymer: the use of an ultraviolet laser with scanning mirrors to 
trace the pattern of the layer, or the use of Digital Light Processing (DLP) wherein 
micro-mirrors are utilized to project and cure the entire layer at once. 

Most vat photopolymerization processes utilize a build substrate that lowers into 
the vat of liquid photopolymer with each layer and the light source cures the 
material on the top surface of the vat. Recent technologies have come to the market 
where that process is inverted, with the build substrate starting at the surface of the 
vat and moving away from the surface, so that the part is drawn from the vat rather 
than being submerged. 

Vat photopolymerization technology was the first rapid prototyping technology 
on the market and was manufactured by several companies, although 3D Systems 
was the first and remains the largest manufacturer of SLA systems. EnvisionTEC 
GmbH is one of the best-known companies producing vat photopolymerization 
systems that utilize the DLP curing technology. 

Vat photopolymerization systems are known for surface finish and resolution of 
the part output, as well as the speed of the process. As the first technology on the 
market, many rapid prototyping customers know the SLA technology and its 
benefits. Figure 5 shows an example of parts produced by SLA process. 



Fig. 5 SLA sample parts 
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Powder Bed Fusion 


Powder bed fusion is known by many names, depending on the system 
manufacturer. The process involves selectively heating powdered material in 
thin layers to fuse or melt the powder where desired. After heating the pattern 
of a layer, a new layer of powder is spread over the powder bed and the next 
layer is heated. Most powder bed fusion technologies fabricate the layers by 
directing an energy source with scanning mirrors, which traces the pattern 
across the layer. 

Powder bed fusion processes are capable of building parts out of both polymer or 
metal powders, although typically the machine is designed for either polymers or 
metals and is not capable of creating parts from both materials. This process is 
generally valued for the mechanical properties of the finished parts, the diverse 
material capabilities, and the fine resolution of the parts available. The surface 
finish, due to the nature of the powder, is typically not as fine as that of vat 
photopolymerization parts or material jetting parts. 

Several prominent companies in the powder bed fusion technology arena each 
have described their technologies differently. EOS GmbH offers polymer-based 
systems as laser sintering technology and metal-based systems as direct metal laser 
sintering (DMLS). 3D Systems offers both polymer and metal-based systems under 
the descriptor selective laser sintering (SLS). SLM Solutions GmbH provides 
metal-based powders and has announced plans to provide ceramic-based powders 
for their selective laser melting (SLM) systems. Another metal-based powder bed 
fusion technology, called LaserCUSING, is offered by Concept Laser GmbH. 
Linally, Arcam AB offers metal-based powder systems that utilize electron-beam 
melting (EBM) rather than a laser as an energy source. Sample parts as shown in 
Ligs. 6 and 7 are produced by EOS GmbH DMLS machine and EOS GmbH SLS 
machine, respectively. 


Directed Energy Deposition 

The directed energy deposition process is currently the only AM process that has 
been offered with five-axis motion capability to date. This process is a modified 
version of the powder bed fusion process, wherein a laser or an electron beam is 
used to melt a metal powder; however, the powder is not spread over a bed but is 
selectively deposited. This unique process enables the possibility of gradient parts 
or deposition onto an existing part with five-axis capability. 

These systems are relatively expensive due to the complexity and process 
controls necessary to support the unique features of the system. The high cost of 
acquisition and operation has limited adoption of these systems in the AM market. 
Major system providers include Optomec, Honeywell Aerospace, Irepa Laser, 
POM, and Sciaky, Inc. (Wohlers and Caffrey 2012). 
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Fig. 6 DMLS parts 



Fig. 7 SLS parts 
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Benefits of Rapid Prototyping 

Rapid prototyping has proven a benefit to businesses by making the research and 
development process more effective. The wide variety of rapid prototyping tech- 
nologies, build sizes, material costs, and system prices provide multiple solutions 
for meeting the needs of designers and engineers. 

Most businesses are interested in reducing time, costs, or both. Rapid 
prototyping typically delivers four benefits to product development cycles: reduc- 
tion of development cycle time, reduced cost of development, increased creativity 
and innovation, and data security. 


Reduced Cycle Times 

The speed and ease of creating a prototype part or scale model using additive 
manufacturing systems reduce cycle times in two primary ways: increasing the 
amount of prototyping and testing done early in the design cycle and reducing the 
time required to create a prototype. 

Reducing the time to create a prototype has an obvious effect on the length of the 
design cycle. While each technology is different in terms of the time it takes to go from 
a CAD part to a physical model, most technologies are capable of producing an 
average prototype in a matter of hours. Because the data is derived directly from the 
three-dimensional CAD data, there is no need to read a print in order to create a part 
and no chance of misreading a drawing to produce the wrong part. Where a fabrication 
shop may require days or weeks to deliver a prototype solution, rapid prototyping can 
often have a model ready the next day, even working through the night, unattended. 

The second method of cycle time reduction, increasing the amount of 
prototyping done in the design cycle, seems as though it would increase, rather 
than decrease, the length of the design cycle; however, more prototypes early in the 
design cycle typically help designers and engineers arrive at the final product design 
more quickly. The ability to visualize, test, and analyze iterative designs quickly 
compresses the overall design cycle and improves time to market. Some case 
studies, available at leading RP manufacturers’ websites, describe the development 
time reduction in terms of months or even years. 


Reduced Cost 

The obvious cost reduction in utilizing rapid prototyping to create prototype models 
is in the actual cost per prototype. RP systems are not particularly low cost, with 
professional-grade 3D Printers starting at around $10,000 USD. Cost of materials for 
the prototyping systems are also priced high, particularly relative to comparable bulk 
materials. Although both system and material costs are not low, the per-part cost of 
prototyping typically is low relative to traditional fabrication methods. Because RP 
systems require very little labor, the combination of material and labor costs is 
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typically lower for a model produced by RP than fabricated in a machine shop, which 
requires skilled operators, programming, and setup costs for that equipment. 

In addition to the direct cost to produce a prototype, rapid prototyping reduces 
development costs indirectly, usually in much more significant ways than the direct 
cost of a prototype. One such indirect cost is maximizing the efficient use of existing 
equipment. CNC mills, lathes, and other equipment commonly used to fabricate a 
prototype are often brought in-house in order to produce valuable parts used in the 
business. Taking the time to set up and fabricate a prototype consumes machine 
availability and labor resources that could be utilized for production needs. By taking 
the prototyping load off of fabrication shops and expensive CNC equipment, RP 
systems free up resources for production and help to lower production costs. 

Another common indirect cost reduction manifested by RP equipment is the 
reduction of error costs. As a product moves through the design cycle, it passes 
through several phases of development. Example phases would be ideation and 
concept generation, initial concept evaluation, design analysis and feedback, pilot 
production, and product launch. A general rule of thumb is that with each phase of 
product development, the cost of an error is increased by an order of magnitude. 
Finding design problems or mistakes late in the design cycle can cause the need to 
modify or replace expensive production tooling, for example. 

Rapid prototyping systems help reduce the cost of mistakes in the design process 
by allowing design flaws to be detected earlier in the design cycle. The possibility 
for more prototypes early in the design phase allows for better conceptual presen- 
tation for feedback, improved evaluations of design fit and function to reduce the 
need for redesign, and realistic product testing through focus groups or beta pro- 
grams earlier in the design cycle. Rapid prototyping increases the probability that 
errors are detected early, when they are relatively inexpensive to correct, rather than 
later in the design phase where mistakes are very costly. 


Creativity 

Introduction of rapid prototyping capabilities has greater impact on design processes 
than time and cost alone. One such additional impact is increased creativity and 
innovation within the design team. By increasing the creative output of the design 
process, the quality and competitiveness of the company’s products will also increase. 

Rapid prototyping systems increase creativity by making it easier for designers 
and engineers to experiment and test multiple designs and configurations. If the cost, 
lead time, and complexity of producing a single prototype are too high, as is often the 
case in traditional prototyping methods, the design team simply cannot afford to test 
multiple concepts. In such a situation, designers and engineers are limited to 
attempting relatively conservative designs in order to improve the chance of success 
and approval and limit the number of prototypes necessary to get to production. 
While this seems like a more efficient manner of operating a design process, it 
increases the likelihood of expensive mistakes later in the process, as described 
above, and also establishes creative limits that the design team must work within. 
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With rapid prototyping capabilities, designers and engineers do not have the 
same requirement to decrease the number of prototypes and concepts analyzed; in 
fact, they are often encouraged to increase design iterations. By allowing more 
conceptual analysis, design teams are free to explore bolder concepts and innova- 
tive designs, knowing that the prototyping process will help to quickly filter the 
promising concepts and even provide better information about the functionality and 
attractiveness of the design. In freeing the design team to explore and innovate, 
rapid prototyping allows the design process to ultimately improve the quality of the 
product being developed and lends a competitive edge to the business. 


Data Security 


The final primary benefit of utilizing rapid prototyping in the manufacturing design 
cycle applies only to bringing RP capabilities in-house, rather than utilizing an 
external RP service provider. The reason for this limitation has to do with the 
benefit being delivered, namely, the security and confidentiality of design data 
generated during the process. 

In the fast-paced business world of today, time to market, product differentia- 
tion, and innovation are highly important to the success of the product. Whenever a 
company has to rely on external resources during the design cycle, the company 
runs the risk of confidential information being leaked to the market or to compet- 
itors. Sending prototype designs to an external firm for fabrication increases the 
possibility of competitors gaining knowledge of the company’s development plans 
or even the actual product designs, depending on the severity of the breach. While 
there are certainly many reputable and reliable firms with which to contract for 
fabrication, every time that design data is transferred outside of the company 
increases the risk of leaked information, even if the risk is small. 

Rapid prototyping allows a business to cover more of its development needs 
internally, which limits the chance for confidential data to fall into the wrong hands. 
Even though rapid prototyping may not be suitable to prevent all design work from 
leaving the firm, RP can frequently reduce the volume required to be handled 
externally and, more importantly, postpone the need for external fabrication to 
later in the design cycle. The loss of confidentiality is much more damaging early in 
the design cycle, allowing competitors to field a response more quickly and 
reducing the competitive advantage of the company, than it is later in the design 
cycle when the company is already approaching product launch. 


Applications of Rapid Prototyping 

Rapid prototyping primarily consists of three applications within the product 
development phase of manufacturing: conceptual models, physical part fit verifi- 
cation, and prototype functional assessment. Each of these applications has differ- 
ent requirements necessary to provide value to an organization. 
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Conceptual Modeling 

Early in product development, designers and engineers have a need for devel- 
oping conceptual models of their designs. This application fits within the 
ideation and concept analysis phases of product design. At this stage, multiple 
concepts for the product design are often evaluated. Designers gain a compet- 
itive advantage by quickly evaluating these concepts, selecting elements of 
each that hold promise and deliver value to the design, and iterating the 
concepts to improve, integrate, and narrow the suite of concepts under 
evaluation. 

Rapid prototyping is uniquely capable of delivering value through conceptual 
modeling. In this application, reduced part cost and time required to fabricate a 
model are the primary drivers of value. While some level of concept analysis can be 
conducted at the 3D CAD level, the ability to produce a physical model allows for 
greater comprehension and evaluation of the design. As a general rule, by increas- 
ing the number of concepts evaluated and the iterations of concept analysis that 
designers can complete, the final design selected for development will be more 
robust and attractive. 

With the fast pace of product development necessary to be competitive in 
today’s business world, this phase of design is often tightly constrained by both 
time and cost limitations. These limitations affect the ideal choice of prototyping 
technology for delivering conceptual models. In order to increase the quality of the 
design output, designers and engineers need to evaluate more concepts and itera- 
tions. Therefore, a prototyping technology ideal for this application is one that can 
produce realistic physical models of the design while minimizing the cost per 
model and time required to progress from CAD design to usable model. It is 
important to consider the full process from CAD to usable part when evaluating 
time and cost, rather than limiting evaluation to only the build process of the 
technology. The full process often includes file processing, system preparation, 
build process, and any post-build-processing steps required to render the physical 
model usable for concept analysis. 

Some concept models produced by rapid prototyping are as shown in Fig. 8 
below: 


Physical-Fit Models 

Once a concept has been selected for development, the next phase of the part design 
process is typically ensuring that the individual components of the design are 
physically fit when assembled. This application of rapid prototyping has slightly 
different requirements than that of conceptual models. 

When a design is selected, the early phases of product development usually 
continue to iterate relatively quickly. As components are designed or identified for 
purchase, the design of the assembly often must change to accommodate that 
component. In order to minimize the cost of iteration, it is important for designers 
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Fig. 8 Concept models 


and engineers to have confidence that the assembled design will fit together 
properly prior to investing in component purchases or tooling. Rapid prototyping 
is able to provide value in this design phase by printing inexpensive, dimensionally 
accurate models of expensive components. Having replicas of all of the components 
in the assembly allows engineers to test the assembly of the design to ensure the 
parts fit together as expected and the assembly process is as simple and efficient as 
possible. 

While it is possible for 3D CAD models to simulate the fit of components in an 
assembly, it is often difficult to translate the digital model into physical reality. 
These digital models are capable of maintaining extremely tight tolerances that 
rarely translate into the physical tolerances feasible for cost-effective manufactur- 
ing. Additionally, while 3D CAD models may verify the assembled fit of the design, 
it is extremely difficult to digitally simulate the experience of physically assembling 
the product. An excellent final assembly design is entirely worthless if the actual 
assembly process is cumbersome or infeasible. 

Whereas the conceptual modeling application of rapid prototyping emphasizes 
the ability to quickly create many models for minimal cost, the physical-fit 
application of rapid prototyping expands the requirements to provide value by 
requiring accuracy and durability from the physical model. Cost and time remain 
important to allow for rapid iteration of the design, but it is more important that 
the components prototyped are representative of the feasible tolerances required 
for final part production and that the prototype components are strong enough to 
survive the assembly process. The ability of the technology to meet these 
demands often requires a higher-cost system than that required for conceptual 
models, so the ideal choice of technology may not be the same for these two 
applications. 

Figure 9 shows an example of parts which requires fitting produced by metal 
laser sintering (metal part) and selective laser sintering (polymer), respectively. 
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Fig. 9 Fitting component 


Functional Models 

The final phase of product design usually involves creating functional prototypes 
for full concept evaluation and market testing. Because this is the last phase of 
product design prior to production, both the value of ensuring design accuracy and 
the requirements of the prototype model are higher than those of conceptual models 
or physical-fit models. 

By the time the product design has entered the phase where functional 
prototyping is necessary, the frequency of design iteration has often dropped 
considerably. Most of the major design changes happen at the conceptual phase 
and during component selection and assembly verification. Although the frequency 
of iteration has dropped, the value of quickly creating a prototype of the design 
remains high, because design changes become much higher once initial component 
purchases and tooling investments for production have been made. In this phase, it 
is critical that designers and engineers be certain that the product design functions 
as desired and that the product is desirable to the target market. 

Prototype requirements in the functional model application of rapid prototyping 
are the most demanding in the prototype segment of additive manufacturing. While 
functional models may not have such high requirements as those in the areas of jigs 
and fixtures, manufacturing tools, or end-use parts, functional models are often 
required to survive more demanding environmental conditions and to more closely 
mimic the qualities of the eventual product than models created during earlier 
phases of design. 

Functional models frequently require prototyping only the unique components in 
the product design: those components that will require tooling to manufacture such 
as the external skins of the product. These unique components are then assembled 
with functional electronics and purchased mechanical components to verify that the 
product design will function as desired. Because the functional model is required to 
function as the eventual manufactured product, the prototype components must be 
able to survive the thermal, chemical, and physical loading environments of 
common use of the product. 
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Fig. 10 Functional injection molding insert 

The environmental demands placed on the prototype components demand 
mechanical properties of the prototype material that meet those of the production 
parts. Additionally, the accuracy of the prototype output is also important to ensure 
that the assembly fits and behaves as the production components will. For this 
reason, the cost and time to produce the prototype are less critical in the selection of 
a prototype technology, and the materials available and the accuracy of the proto- 
type technology are the most important factors. 

The final consideration for selecting a prototype technology in a functional 
model application is the ability of the technology to provide a representative 
experience in the prototype product. While this consideration is often placed behind 
those of material properties and part accuracy, the ability of the prototype technol- 
ogy to mimic the tactile function and feel of the production design provides added 
value to the functional model. Prototype technologies that can provide representa- 
tive colors, rigidity, and surface quality of the desired production design are better 
able to simulate the actual experience of using the product in its functional 
environment and thus have added value for market acceptance evaluation. 

Two examples of functional parts, inserts with embedded cooling channel for 
injection molding process, are shown in Fig. 10. 


Summary 

As businesses trend toward global competition, product designers and engineers are 
increasingly pressed to provide better product designs in shorter time frames with 
lower budgets. To achieve these lofty goals, businesses must adopt more efficient 
design processes that can evaluate and iterate product designs more quickly and 
accurately so that design cycles can be compressed and costly modifications to 
component purchases and tooling investments can be avoided. Rapid prototyping 
technologies help to make these requirements achievable. 

Many companies offer varying forms of rapid prototyping technologies today. 
The increasing prevalence of additive technologies on the market has caused 
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international standards organizations to begin creating common terminology and 
standards for these technologies. The ASTM International Committee F42 on 
Additive Manufacturing Technologies has categorized the various forms of additive 
manufacturing into seven groups: material extrusion, material jetting, binder jet- 
ting, sheet lamination, vat photopolymerization, powder bed fusion, and directed 
energy deposition. 

Each of the additive technologies is capable of producing a prototype model, 
although some technologies are more suited than others. Rapid prototyping appli- 
cations of additive manufacturing technology provide value to businesses in four 
primary ways: reducing time to market by compressing product design cycles, 
lowering cost of product development by exposing design problems earlier in the 
design cycle, improving creativity and innovation in product designs by increasing 
the ability to rapidly evaluate and iterate design concepts, and increasing compet- 
itive advantage by keeping design plans and data internal to the organization until 
later in the product development process. 

The application of rapid prototyping within the product design cycle falls within 
three categories, depending on the phase of product design. Conceptual models are 
used to quickly evaluate competing design concepts and are best served by technol- 
ogies that produce models quickly and at low cost. After a design concept is selected 
for development, rapid prototyping technologies that produce durable and accurate 
prototypes are used to produce physical-fit models for evaluation of component fit 
and the product assembly process. The final stage of prototyping demands accurate 
parts in materials that closely resemble the properties of the intended production 
materials so that the functionality of the design can be evaluated in the thermal, 
chemical, and physical loading environments expected of the manufactured product. 
Additional value in functional models is created in technologies that can also match 
the color, rigidity, and tactile feel of the production components. 

With the rapid prototyping technologies available in the current market, no 
single technology emerges as the best choice for all three prototyping applications. 
The most efficient product development processes can utilize multiple prototyping 
technologies so that the right solution is available for each step in the product 
design process. While the selection of the right rapid prototyping technology is 
subject to debate, there is no question that rapid prototyping capabilities or services 
benefit the product design process and provide businesses with competitive advan- 
tage in the manufacturing environment today. 
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Abstract 

Rapid tooling (RT) refers to the rapid production of parts that function as a tool 
(primarily mold tools such as mold inserts) as opposed to being a prototype or a 
functional part. These tools are produced by different additive manufacturing 
(AM), also previously known as rapid prototyping (RP) processes such as 
stereolithography (SL), fused deposition modeling (FDM), selective laser 
sintering/melting (SLS/SLM), 3D printing (3DP), and electron beam melting 
(EBM). These AM tools are then directly used as molds or used to produce 
molds for conventional manufacturing, such as vacuum and investment casting. 

RT is generally categorized as soft or hard and direct or indirect tooling. The 
wide range of materials involved in tooling includes wax, wood, photopolymers, 
thermal polymers, metals (such as tool steels), ceramics (such as alumina and 
silica), and composites. In soft tooling, the molds produced directly or indirectly 
are destroyed after a single cast or are used for a small batch production. Single 
cast typically refers to investment casting where parts produced have properties 
identical to parts produced from conventional investment casting. Soft tooling for 
small batch production is typically used more for manufacturing of functional 
prototypes that meet the minimum properties required for application testing. 

In hard tooling, molds produced are usually made of metals, ceramics, or 
composites that can be used for high volume production. For example, metal 
molds and silica sand molds can be produced directly with the SLM and SLS 
technique respectively. Parts manufactured from these molds exhibit high qual- 
ity, fine finishing, and superior if not comparable to properties of parts 
manufactured from conventionally produced molds. Molds with high complex- 
ity are also possible. Hence, RP displays excellent tooling and manufacturing 
capabilities with the development of RT. 

There are several benefits that are realized by RT with the most evident being 
cost savings. RT greatly reduces the time needed for mold-forming process and 
therefore increases the speed of production. This in turn reduces the time to 
market allowing companies to increase profits. RT also allows the ease of 
product customization due to its flexibility in tool design, ability to adapt to 
customers’ specifications, and most importantly, does not require high volume to 
breakeven. Conceptual designs can be further improved without incurring high 
costs compared to conventional manufacturing processes. These factors in RT 
attribute to high performance manufacturing and high quality products. 


Introduction 

Additive Manufacturing 

Additive manufacturing (AM) refers to a fabrication technique that builds a 3D part 
by adding material in a layer by layer fashion (ASTM International (2009)). In 
AM, a 3D model is first designed and created on a computer-aided design or 
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Fig. 1 Development of AM technology 


computer-aided manufacturing (CAD/CAM) system (Sachs et al. 1991; Shea 1993) 
such as Rhino3D or Solidworks. The CAD model is then converted into an STL file 
for processing by different AM processes (Sachs et al. 1992) to produce the 3D 
physical part independently without additional manufacturing steps (Brown 1991). 
Stereolithography (SL), fused deposition modeling (FDM), selective laser 
sintering/melting (SLS/SLM), 3D printing, and electron beam melting (EBM) are 
examples of popular AM processes (Chua et al. 2010). 

The idea of AM was not new as evidenced from layer manufacturing in struc- 
tures such as the Great Pyramid of Giza in Egypt dated back as far as 3,800 years 
ago. It was until the 1980s, the advancement in technology has led the development 
of both hardware (improved computing capabilities such as processing speed, 
memory capacity) and software (CAD/CAM systems) to process files at the high 
resolution (thousands of layers in a small part) and data that was considered large at 
that time. AM has since gone through more than 30 years of development starting 
with rapid prototyping (RP), followed by rapid tooling (RT) in the 1990s, and rapid 
manufacturing (RM) since 2001 (Wohlers 2003). As shown in Fig. 1, RM overlaps 
RT because rapid manufacturing of functional tools is considered RM although in 
actual application, it really functions as a tool for manufacturing. This will be 
explained in RM following the introduction of RT. Nevertheless, RP and RT remain 
largely popular in the area of AM in the industry today, while the development in 
RM continues to advance and mature (Wohlers 2005). 


Rapid Prototyping 

Initially, AM was known as rapid prototyping (RP) where parts were made from a 
wide range of nonmetals, such as wax, wood, photopolymers, thermal polymers and 
ceramics, and metals such as steels. These parts function as prototypes and were 
mainly used for product development because they were lacking of properties such 
as yield strength (due to RP process and material limitation) and hardly meet actual 
application requirements. The roles these prototypes play in the product develop- 
ment process were experimentation, learning, testing, proofing, communication, 
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integration, and planning (Chua et al. 2010; Chua 1994) whereby prototypes help 
design engineers to better engage, communicate, and interact in the thinking, 
planning, experimenting, and learning process as they are able to relate to the 
product with a physical prototype. Prototypes also help to reveal unknown and 
unanticipated issues. For example, in human engineering, an accurate exact mobile 
phone prototype can be produced within a few hours to test the design for palm 
comfort based on human ergonomics (Kamrani and Nasr 2010). 

Prototypes can also be used for synthesis and product integration to test and 
proof certain concepts and ideas relating to the product development. In this 
instance, several RP parts can be produced separately and integrated to test for 
functional properties. In the mobile phone example, a prototype battery cover can 
be produced as a separate part from the main mobile phone case to test if the inner 
self-locking mechanism works as intended. In addition, these prototypes may 
undergo several design iterations to arrive at a superior design. Hence, RP plays a 
vital role in reducing the overall time spent on product development thereby 
improving the design and/or features of a product and reduce product development 
cycle time over the years. 

Despite being an important role in product development, prototypes are after all 
nonfunctional, and they cannot be used as a final part/component in an actual 
product. They remain largely as a prototype or a component used for experimen- 
tal/application tests and not used for industrial applications where final parts are 
produced by conventional manufacturing. Therefore, rapid tooling (RT) was devel- 
oped and evolved from RP to bridge this manufacturing gap of final parts for 
intended applications. In RT, RP produced parts function as a tool and is primarily 
used as a mold directly or as a pattern to indirectly produce a mold for conventional 
manufacturing such as vacuum and investment casting. Molds with high complex- 
ity are also possible. Hence, RP displays excellent tooling and manufacturing 
capabilities with the development of RT. 


Rapid Tooling 

RT is generally categorized as soft or hard and direct or indirect tooling as 
illustrated in Fig. 2. The wide range of materials involved in tooling includes 
wax, wood, photopolymers, thermal polymers, metals (such as tool steels), 
ceramics (such as alumina and silica), and composites. Generally, most work in 
RT has involved soft tooling, but the shift to hard tooling (Pham and Dimov 2001) 
is gradually increasing over the years because as the AM technology advances, 
reliable functional molding hard tools such as molds with integrated conformal 
cooling channels become a reality. 

In soft tooling, the mold produced directly or indirectly is destroyed after a 
single cast or is used for a small batch production (Chua et al. 2010), such as 
producing functional prototypes. This is because in many application tests, a 
prototype must exhibit certain functional properties required for a realistic evalu- 
ation. In contrast, molds produced in hard tooling are usually made of tool steels, 
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Fig. 2 Classification of rapid tooling 

silica sand or composites that are used for high volume mass production and actual 
fabrication of parts. For example, tool steel molds and silica sand molds can be 
produced directly with the SLM (Dimitrov et al. 2010; Campanelli et al. 2010) and 
SLS (Tang et al. 2003; Wang et al. 2003, 2002) technique respectively. 

In indirect tooling, a master part is first produced in almost all the cases in RT in 
order to obtain the final molding tool and generally requires at least one interme- 
diate step. On the other hand, direct tooling involves no intermediate steps during 
the production of the tool. Parts manufactured from these molding tools, either soft 
or hard, directly or indirectly produced, exhibit high quality, fine finishing and 
properties superior if not comparable to parts manufactured from conventionally 
produced molds. 

There are several benefits brought by RT with the most evident being cost and 
time savings. To the manufacturing engineer, RT minimizes design, manufacturing, 
and verification of tooling. Fixed costs are greatly reduced with minimal assembly, 
purchase, and inventory expenses. Consequently, labor and inspection costs are 
reduced accordingly. Costs are further reduced along with flexibility in manufactur- 
ing in RT because fewer inventories are scrapped upon design changes along with 
lower investment necessary for tool production. This is especially true in the 
dynamics of a competitive market where new products can be launched quickly 
to meet changing demands. 

Besides cost savings, RT remarkably increases the profit of an organization. 
Firstly, the speed of production is increased and this in turn greatly reduces 
production time, increases run cycles, and reduces the time to market. New prod- 
ucts can be test-marketed economically to find out what customers want. This puts 
the company in the forefront in introducing new products with an advantageous 
lead in capturing market share. 

Secondly, RT promotes the production of quality products. Due to low fixed 
costs and the ease of producing the tool, parts can be manufactured and tested in 
actual applications to reveal unanticipated design flaws. They can also undergo 
several design iterations conveniently without incurring high costs compared to 
conventional manufacturing procedure. This is done to further improve design, 
minimize material, and optimize strength-to-weight ratio in structural components 
before it is marketed. 

Thirdly, RT brings about customization of products due to the flexibility in tool 
design and the ability to adapt to customer specifications. Most importantly, 
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production of parts does not require high volume to breakeven. In this aspect, 
functional or aesthetic features can be accommodated readily and easily to meet 
customer requirements. Moreover, there are fewer constraints to design and include 
certain functional or aesthetic features with RT during manufacturing. Previously in 
conventional manufacturing, combining several different features in a part may not 
be possible due to poor tool accessibility or low manufacturability. However, such 
features can now be integrated and manufactured as a single part with RT. 

These factors in RT attribute to high performance manufacturing whereby costs 
are reduced (RT may save time and cost in the region of 10-50 % and up to 90 % 
depending on production size) (Chua et al. 2010), quality of parts are improved, and 
product customization can be realized. These help companies to potentially 
increase market opportunities and market share. 


Rapid Manufacturing 

As the AM technology advances, improved laser power and efficiency of systems 
like laser cladding, SLS, and SLM have allowed the direct manufacturing of 
functional parts from a wide range of different materials, specifically metals and 
ceramics, hence the term, RM. The list of metallic materials includes popular 
industrial alloys such as titanium (TiAl 6 V 4 , TiAl 6 Nb 7 , and cp-titanium), aluminum 
(AlSi 10 Mg, AlSi 12 , and 6/7,000 series), steels (316 L, tool steels), Inconel (718 and 
625), and cobalt chrome (Phenix Systems 2011; Technologies and Rapid 
Manufacturing Technologies 2011; CONCEPT Laser GmbH 2011; 3D Systems 
Cooperation 2011). Along with new developments in materials, ongoing research 
efforts within the AM research community on different ceramics or composites 
such as alumina, zirconia, and stainless steel with hydroxyapatite (HA) (Bertrand 
et al. 2007; Shishkovsky et al. 2007; Mumtaz and Hopkinson 2007) and metallic 
alloys such as gold (Au) (Khan and Dickens 2010), Cu (Becker et al. 2011), Mg 
(Ng et al. 2010), and tungsten (W) (Li et al. 2010; Zhang et al. 2010, 2011) have 
been reported within the research community. 

Very recently, RM may also be referred to the direct manufacturing of functional 
molds (specifically metal molds) such as a functional tool steel or copper mold with 
integrated conformal cooling channels that conventional manufacturing processes 
cannot produce (Garcia et al. 2012; Au et al. 2011; Dang and Park 2011). Confor- 
mal cooling is defined as the ability to create cooling/heating configurations within 
a tool that essentially follows the contour of the tool surface or deviates from that 
contour as thin/thick sections of the part may dictate for optimal thermal manage- 
ment. The objective typically is to cool or heat the part uniformly. Conformal 
cooling provides a tremendous advantage in mold tooling through significant 
reductions in cycle times. Other than the obvious piece-cost savings, other 
tangible benefits include tool, equipment, and floor space savings (CONCEPT 
Laser GmbH 2011). 

Such molds with conformal cooling channels are special and mentioned in RM 
although it really is a rapid tool because the mold itself exhibits functional 


70 Rapid Tooling in Manufacturing 


2531 


properties. It has a secondary function that improves cooling efficiency and thus 
production rate. Nevertheless, these rapid manufactured functional molds still serve 
as a tool in manufacturing and will be discussed in detail at the later part of this 
chapter. 


Applications of Rapid Tooling in Manufacturing 

RT in manufacturing has proven to be very popular with many industries such as 
aerospace, automotive, medical, jewelry, military, dentistry, and sport. For exam- 
ple, RT of investment casting has been used in engine components for fanjet engine 
in AlliedSignal Aerospace with production time reduced by 8-10 weeks and a 
savings of USD 50,000 for tooling. In another example in the automotive industry, 
RT of investment casting has been used to produce tool inserts for injection 
molding by Ford Motor resulting in a cost reduction of 45 % and time savings of 
more than 40 %. 

As it can be seen, RT plays an important role in manufacturing for it brings about 
benefits in product development (Chhabra and Singh 2011). Products can now 
incorporate improved design features (that are previously avoided due to cost and 
manufacturability concerns) because of the flexibility and reduced manufacturing 
time and cost in production (Wohlers 2008). Figure 3 shows an overall view of the 
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integration of RT in manufacturing where savings on time and cost could range 
from 50 % to 90 % depending on size of production (Chua et al. 2010). 

In the following sections, different RT techniques and their tooling applications 
in manufacturing will be discussed in detail. Examples and case studies will also be 
presented to demonstrate the advantage and value of RT in manufacturing. In 
general, RT removes the constraints design engineers and manufacturers face 
between part design and manufacturability and small volume production as 
shown in Fig. 4a and b respectively. 


Soft Tooling in Manufacturing 


Tools such as molds or master patterns that generally allow for single casts or for 
small batch production runs before breaking down are classified as soft tooling. 
These tools are typically made of silicon rubber, epoxy resins, low melting point 
alloys, and foundry sands via different AM techniques such as SL, SLS, and FDM. 
They exhibit high accuracy and surface finish with the flexibility to accommodate 
design changes and be produced again within a few hours to a day (depending on 
part size and design) without incurring high tool development costs. There is also 
little constrain to a part’s shape and complexity. However, it must be noted that in 
general, the higher the design complexity, the more challenging the molding 
process gets during casting and/or injection molding. 


Direct Soft Tooling 

As the name suggests, direct soft tooling is where the molding tool is directly 
produced by different AM techniques without intermediate steps. It aims to directly 
produce a tool in an optimal and efficient way while still maintaining its intended 
function. For example, sand casting molds can be produced directly with the SFS 
process. During the SFS process, the sand particles that are either coated with a 
polymeric binder or mixed in an appropriate amount of polymer powder are bonded 


70 Rapid Tooling in Manufacturing 


together into the shape of the mold when laser energy is applied to melt the polymer 
and binds the sand particles together, layer by layer (Kruth et al. 2005). For soft 
tooling, these silica sand molds do not undergo post-processing for binder removal 
as sand casting molds produced this way is meant to be destroyed after a single cast 
and are hence considered direct soft tooling. Unlike in direct hard tooling, polymer 
binders are removed, and molds undergo further sintering in the furnace to increase 
density to improve mold properties for castings, such as investment casting, meant 
for producing final production parts (more details in direct hard tooling). Accuracy 
and surface finish of metal castings produced from such molds are similar to those 
produced by conventional sand casting methods. 

Resin mold directly produced by SL is another example of direct soft tooling. 
Such molds are usually used in the injection molding of plastic components, and 
these plastic components exhibit high levels of accuracy. However, mechanical 
properties of resin molds are limited, and chances of tool damage can occur during 
ejection of the plastic part, especially parts with high geometrical complexity. A 
direct composite mold can also be produced by SL. In this case, thin shells of resin 
with the required surface geometry are backed with aluminum powder-filled epoxy 
resin to form the rest of the mold tooling. Composite molds have several advantages 
such as higher mold strengths and higher thermal conductivity compared to a full 
resin mold. Higher mold strengths allow more injection shots, and higher thermal 
conductivity promotes faster build times for the mold tooling. 

Unlike the directly produced sand casting molds by SLS which is destroyed after 
a single cast, resin and composite molds can withstand between 100 and 1,000 shots 
before they break down. Therefore, all three examples are considered direct soft 
tooling methods. 

In a study, ceramic molds for investment casting were directly produced with a 
photopolymerization process that basically works on the same principle as the SL 
technique (Halloran et al. 2011). For this purpose, the term SL will only be 
mentioned in this example. In this example, ceramic particles were suspended in 
a photocurable resin medium and were cured upon exposure to UV light 
(as opposed to laser energy for polymerization process to occur in SL). Hence, 
the shaping of a complex ceramic mold with integral cores was possible. The 
produced mold was made up of 60 vol% ceramic and 40 vol% photopolymer 
resin which actually functions as the binder. Similar to the SLS of silica sand 
mold, the produced mold underwent thermal processing step to remove the binder 
and formed a green ceramic mold. This green mold was then sintered in a furnace 
conventionally to obtain high part density and achieve a foundry-ready state, 
allowing for actual manufacturing and production of parts. Similar to conventional 
investment casting, the molds were removed or (most of the time) lost after casting, 
a characteristic of soft tooling. 

Ceramic molds produced this way have been used to produce equiaxed and 
single-crystal super alloys, Inconel 718, and SC 180 airfoil cast parts for aerospace 
applications (Direct Digital Manufacturing Laboratory - Georgia Institute of Tech- 
nology 2010). This shows that the development of SL technologies in RT can be 
considered as a disruptive manufacturing technology where single-crystal airfoils 
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can be produced quickly without incurring high start-up costs compared to 
conventional investment casting process. 


Indirect Soft Tooling 

In indirect soft tooling, a master pattern is first produced with different AM 
processes such as SL, SLS, laminated object manufacturing (LOM) FDM, and 
3DP. Among these processes, SL is the most popular and widely used due to its 
high level of accuracy and surface finish due to the high-resolution SL laser scans. 
With a master pattern, the mold tooling can then be built out of a wide range of 
materials such as silicon rubber, epoxy resin, low melting point alloys and 
ceramics. Similarly, these molds have limited mechanical properties and can only 
be applied to single cast or small batch production. Nevertheless, these molds 
remain advantageous for low volume production and show good geometrical 
accuracy, surface finish, and ability to cast complex parts. 

For example, a soft injection molding tool can be indirectly created quickly to 
mold a limited number of parts by using a metal arc spray system on a master 
pattern. In this process, a metal spray gun is used to spray metal droplets on the 
master pattern to produce the metal shell. These metal droplets are produced when 
an electric arc is introduced between two wires which then melt them into tiny 
droplets. A study on using arc spray metal for producing a tool used in injection 
molding of plastic parts showed that this method provided a time and cost saving of 
minimum 50 % (Chua et al. 1999a). Figure 5 shows in detail how a soft injection 
molding tool (half mold) made of resin filled with 85 % aluminum filler can be 
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Fig. 5 Rapid tooling using a metal arc spray system 
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Fig. 6 (a) Creating the silicon mold; (b) removal of master pattern 


created indirectly with a master pattern produced by processes such as SLS or 
FDM. The process in Fig. 5 can be repeated to produce the other mold half to 
complete the mold. 

Silicon rubber molds can also be produced indirectly with a master pattern and 
are the most flexible and popular rapid tooling process for vacuum casting, owing to 
the ease of copying every aspect of the master pattern into the silicon cavity mold 
with exceptional high resolution and detail. Moreover, parts produced can also be 
easily removed from the mold cavity (Chua et al. 1998). Hence, silicon molds 
remain popular in RT as they can be used in the manufacturing of plastic, metal, and 
ceramic parts. Figure 6a shows how the silicon mold is produced with silicon slurry 
and a master pattern suspended in a container. This process is carried out in a 
vacuum chamber to minimize air trapped pores during the pouring of the silicon 
slurry. The entire container filled with silicon slurry is then based in the oven at 
70 °C for 3 h to solidify. As shown in Fig. 6b, after the silicon mold is solidified, the 
mold cavity is produced upon removal of the master pattern. 

Investment casting process is another casting process that is probably the most 
important molding process for the casting of metal. Similar to other indirect soft 
tooling in RT, a master pattern is first produced from materials such as paper, wax, 
foam, or other materials that can be easily melted or vaporized or burned out. Also, 
parts with complex geometry can be produced this way. However, for the purpose 
of illustration, a simple master pattern used for investment casting is shown in 
Fig. 7. This master pattern is dipped into a ceramic slurry and dried to form a 
coating or ceramic shell that conforms to the shape of the master pattern (Fig. 7a). 
This process is repeated until the shell builds up an appropriate thickness and 
sufficient strength (Fig. 7b). The master pattern is then removed (melted, burned, 
or vaporized) and a ceramic cavity is then realized (Fig. 7c). Molten metal can then 
be poured into the mold to form the object (Fig. 7d). Upon solidification, identical 
to conventional investment casting, the shell is cracked (hence single cast) to obtain 
the cast object in the mold. 
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Silicon slurry 



Master pattern is 
burnt away 



Fig. 7 Rapid tooling in investment casting: (a) master pattern dipped in ceramic slurry and dried, 
(b) process is repeated until a shell with desired thickness is achieved, (c) master pattern is 
removed and a ceramic mold cavity is realized, (d) molten metal is poured into the mold cavity 


A variety of metals such as titanium, steel, aluminum, magnesium, and zinc can 
be cast using this method of indirect soft tooling. Master pattern produced by 
different AM processes can be edited and produced conveniently for improved 
mold designs. Master patterns produced are also highly accurate and repeatable. 
This is especially true for SL process when surface finish and geometrical accuracy 
can be as low as 5 pm. 


Hard Tooling 

Hard tooling refers to finished molding tools that have mass manufacturing capa- 
bility and are able to produce thousands of injection-molded/cast parts before 
breaking down. These molding tools include mold inserts for series injection 
molds, die casting molds, and prototype tools for injection moldings and die 
casting. These molds are usually produced from ceramics such as silica and alumina 
and alloys such as tool steels for high durability and for high volume production. 
They possess high mechanical properties such as high compression strength and 
hardness resulting in excellent durability that is required for mass manufacturing of 
parts. The advantages are fast turnaround times to create highly complex-shaped 
mold tooling for high volume production in manufacturing. Similar to soft tooling, 
the fast response to modifications in designs is almost immediate. Therefore, hard 
tooling in manufacturing of final parts or components for actual application is 
highly popular. 
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Direct Hard Tooling 

In direct hard tooling, molding tools made of ceramics or tool steels are directly 
produced from the AM process. Generally, AM processes with the capability of 
processing such materials are usually equipped with high energy laser or electron 
beam as the source such as SLM and EBM respectively. 

Metal and ceramic hard molding tools can both be directly produced by the SLS 
and 3DP processes. However, they require additional post processing to achieve 
high density and improve properties required for high durability. In the SLS and 
3DP process, the ceramic or metallic particles are bonded together by a polymeric 
binder. In SLS, the laser heats the polymeric binder in particle form mixed 
homogeneously with the metal or ceramic particles, or each metal or ceramic 
particle is already coated with a layer of polymeric binder. Upon heating, the binder 
reaches its glass transition temperature, softens, and bonds the metal or ceramic 
particles together upon solidification. On the other hand, in the 3DP process, the 
print head ejects liquid binders onto the metal or ceramic powders to selectively 
bond them together. 

In order to achieve characteristics required as a hard tool, the molds produced by 
both SLS and 3DP undergo binder removal which result in a green part that is 
porous and brittle. This green part undergoes further infiltration and sintering 
process with copper in the furnace (of about 40 h of infiltration and sintering 
process) to increase density and make it solid. While such molds produced this 
way are post processed, they are still considered a form of direct tooling as the final 
shape of the mold was first produced without undergoing any intermediate steps. 

Similarly, molds with complicated designs can be produced this way and shrink- 
ages have been reported to be less than 2 % which can be easily compensated for in 
the software. Typical time frames to produce these molds take 2 weeks to complete 
(independent of mold design complexity but dependent on size) compared to 6-12 
weeks (depending on complexity of mold design) using conventional techniques. 
They can also be easily machined for design modification. The finished mold 
produced by this technology for use in injection molding also have reported with- 
standing pressures up to 30,000 psi (200 MPa) and 100,000 shots of glass-filled nylon 
producing tens of thousands of accepted injection-molded parts (Wholers 2000). 

On the other hand, metal or ceramic molds produced by SLM or EBM do not 
require post-processing steps that are involved in the SLS process. Due to the high- 
power laser and high-energy electron beam source, the metallic or ceramic powder 
particles will melt fully or partially to form a solid part with reported densities of up 
to 99.9 %. In addition, functional molds with internal conformal cooling channels 
can be produced this way. Unlike in molds with cooling channels produced 
conventionally, the cooling channels created by the AM technique are highly 
continuous, and the turns in channel direction are connected with smooth gradual 
gradient change. Moreover, cooling channels (as shown in Lig. 8) can be built close 
to the surface as well as built to conform to the geometry of the mold which not only 
controls and ensures an even cooling of the entire tool but also increases the cooling 
efficiencies by a great measure. 
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Fig. 8 Cross section of a tool insert showing internal conformal cooling channels 

The conformal cooling channels can be configured in such a way that they bring 
down the temperature of the injection-molded parts quickly without affecting their 
properties. In fact, studies have shown that molds with conformal cooling channels 
can reduce cycle rates by up to 50 %. In addition, quality of injection-molded 
plastic parts improves significantly as “hot spots” are eliminated with tool inserts 
with internal conformal cooling channels as parts are subjected to less thermal 
stress and hence distortion. The characteristics displayed by such cooling channel 
cannot be produced by any traditional method, and hence, these molds are claimed 
to be of high value to the injection molding industry as they produce better quality 
parts and reduce the cost of tooling in manufacturing. 

Another method of directly producing hard tools that is not as popular compared 
to the rest is laminated metal tooling which works on the same principle as LOM. 
Sheets of material, usually steels with the required cross section, are first produced 
either by laser cut, water jet, or milling according to the 3D CAD data that provides 
the sliced 2D information. Based on the LOM principle, the thousands of sheets are 
then bonded together with bolts or even with glue. The main disadvantage of this 
process of directly producing the mold is the wastage of unwanted cut material that 
cannot be reused as conveniently as powder-based processes. Mechanical proper- 
ties are also inferior to molds produced by SLS or SLM and hence not as popular in 
direct hard tooling. 


Indirection Hard Tooling 

There are many indirect hard tooling methods in RT that fall under this category 
and the processes largely remain similar. In general, indirect methods for producing 
hard tools for plastic injection molding make use of casting liquid metals or steel 
powders in a binder system and require intermediate steps to produce the final shape 
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of the molding tool. Again, binder is removed; part is sintered in a furnace and 
undergoes further infiltration with a secondary material (such as copper). 

For instance, an SLA model is produced and post processed to a high quality part 
by sanding and polishing. The model is placed in a container where silicon rubber is 
poured around it to make a soft silicon rubber mold that replicates the female cavity 
of the SLA model. This is then placed in a box, and a mixture of metal particles such 
as tool steel and a binder material is poured around it, cured, and separated from the 
silicon rubber mold. This is then fired to eliminate the binder and sinter the green 
metal particles together. The sintered part which is about 70 % steel and 30 % void 
is then infiltrated with copper to give a solid hard mold which can be used in 
injection molding. 

Another indirect hard tooling example is to produce a master pattern first and a 
parting line block is used. A metal powder binder mixture in semiliquid form is then 
poured over the pattern and parting block and is left to cure for an hour at room 
temperature. Similar to SLS of ceramic or metal molds, the pattern is removed and 
the mold is baked in a furnace or oven to further improve mechanical properties and 
durability required for a hard tool. 


Examples of Tooling in Manufacturing 

Indirect Soft Tooling of a Silicon Mold for Vacuum Casting 

In this study (Cheah et al. 2002a), mold fabrication was carried out via an indirect 
rapid soft tooling approach. Firstly, a master pattern was produced with the SL 
technique and was carefully post processed (polished and sand blasted) to achieve a 
good surface finish. This was because the surface quality of molded parts is 
dependent on the master pattern’s surface quality which in turn affects the surface 
quality of the silicon mold cavity (see Fig. 9a). Secondly, the tooling resin com- 
prised of 75 % aluminum powder and 25 % epoxy was used to obtain the completed 
RT core and cavity mold halves (see Fig. 9b). Finally, mobile phone front housings 
were produced with polycarbonate/acrylonitrile butadiene styrene (PC/ABS) resin 
(Fig. 9c) and pure PC (Fig. 9d). 

Table 1 shows the comparison summary between RT and conventional tooling 
in this example. RT in manufacturing has much shorter turnaround times and 
lower production costs as compared to conventional tooling. Cost and lead time 
were reduced by up to 25 % and 50 % respectively in this example. The benefits 
of RT in manufacturing are thus highly desirable in the product development 
stage, especially in the initial phase where a significant number of design 
changes will be carried out after evaluation tests. Errors in design can be detected 
earlier resulting in a superior design. Overall, RT in manufacturing reduces the 
development cycle time of a new product. It should be noted that for mass 
production, the conventional approach using tool steel for mold fabrication is 
still preferred mainly because of its high strength and durability during the 
molding process. 
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Fig. 9 Rapid tooling of epoxy resin in manufacturing of a mobile phone front housing: (a) 
SL-produced master pattern, (b) epoxy tooling resin, (c) molded PC/ABS housing, (d) molded 
PC housing 


Table 1 Summary of rapid tooling versus conventional tooling from the presented example 



Rapid tooling: epoxy resin tool 

Conventional tool steel 

Cost of tool 

USD 5,000 

USD 20,000 

Lead time 

2 weeks 

4 weeks 

Part accuracy 

Acceptable 

Good 

Surface quality 

Dependent on master pattern 

Good 

Tool life 

500 (approximate) 

1,000,000 


Rapid Tooling in Sheet Metal Forming 
Indirect Rapid Tooling 

In this example, the tooling was produced directly and indirectly and used in sheet 
metal forming. In the indirect method, master patterns were produced with the SLS 
and SL process (Cheah et al. 2002b; Du et al. 2002) as shown in Fig. 10a. These 
master patterns were vacuum cast such that a silicon rubber mold with a cavity (the 
same of the master pattern) can be produced. An aluminum epoxy mixture is then 
poured into the silicon rubber mold cavity to create the aluminum epoxy tooling 
that follows the shape of the cavity (Fig. 10b). These aluminum epoxy parts 
undergo further curing to improve mechanical properties. These parts exhibit 
similar properties to that of aluminum. 

In order to achieve the desirable dimensions and finishing, the toolings were 
machined and polished. Upon the completion of the tooling preparation, the 
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Fig. 10 (a) Master patterns produced by the SLS and SL process, (b) aluminum epoxy tooling 
produced from the respective SLS and SL produced master pattern 


toolings were assembled and with die holders (Fig. 1 la). A 0.4-mm-thick aluminum 
sheet metal was used to produce the parts as shown in Fig. lib. 

As a result from this study, the total time spent on producing these tools by the 
SLS and SL process was 27 h and 29 h respectively compared to conventional hard 
tooling that takes roughly 100 h. On the other hand, the cost of producing these 
tools by the AM processes was around $150 compared to $3,000 in conventional 
hard tooling. 

Direct Rapid Tooling 

Following the same example, the same molds can also be produced directly for 
metal sheeting forming. As explained in the direct hard tooling section, the punch 
and die insert parts that are both produced by SLS (Fig. 12a) in this example have to 
undergo post-processing steps to improve density and mechanical properties by 
binder removal followed by infiltration (Fig. 12b). 

The finished punch and die inserts were then used to produce embossed samples 
made of aluminum sheet metal 1.5 and 2 mm in thickness shown in Fig. 13a and b 
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Fig. 13 


Embossed samples of (a) 1.5 mm and (b) 2-mm-thick aluminum metal sheet 


respectively. It was observed that there was no significant tool wear after 
15 stamping operations. 

The total time spent in directly producing this tool for metal sheet forming was 
approximately 1 week, and the total cost spent was $1,040 compared to 2.5 weeks 
and $3,000 in conventional hard tooling. 


Rapid Tooling for Investment Casting 

Investment casting in RT is unique; this is because in investment casting, the wax 
pattern or sacrificial patter is destroyed upon a single cast. In direct RT, the wax 
pattern produced only lasts once in investment casting, and several wax patterns 
have to be produced for high volume production. This is not feasible as each wax 
pattern produced directly by AM may take a long time. To overcome this problem, 
a more sustainable way to produce wax patterns lies in the indirect approach for 
producing a mold to cast the wax pattern to be used for investment casting. 

In this example, the indirect approach of RT in investment casting is presented 
(Lee et al. 2004; Chua et al. 2005). In the indirect approach, a master pattern is first 
produced with the FDM process. With reference to Fig. 6, a silicon rubber mold is 
produced with a cavity that follows the geometry of the master pattern. With the 
silicon rubber mold cavity, liquid wax is injected into the cavity to form the shape 
of the master pattern produced earlier with the FDM process. This wax part formed 
this way is the sacrificial part for investment casting (with reference to Fig. 7). The 
silicon rubber mold is said to be able to produce 40-50 wax patterns for investment 
casting (Lee et al. 2004; Chua et al. 2005) (see Fig. 14). 

Similar to other examples, the time (2 weeks spent) and cost ($1,000) savings 
were more than 50 % compared to conventional pattern fabrication with a metal 
(usually steel) mold. 
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Fig. 14 (a) FDM-produced master pattern, (b) silicon rubber mold, (c) sacrificial wax pattern, 
(d) resulting aluminum casting 


Direct Hard Tooling of a Silica Mold 


In this example, a hard silica molding tool is directly produced by SLS using silica 
sand that is commonly used in the foundry industry to build shell or core for metal 
casting. Unlike in SLS of silica sand mold with polymer particles as binding agent 
in direct soft tooling, the silica sand molds directly produced this way have high 
mechanical properties and durability. This is because the silica sand has other 
inclusions such as A1 2 0 3 , and therefore, the sand particles were bound together 
through the liquid surfaces of A1 2 0 3 particles and then solidified after the SLS 
process. While the A1 2 0 3 particles were responsible and function as the binding 
agent, it need not be removed and can be involved in the metal casting process 
without compromising the durability and properties of the silica sand mold (Tang 
et al. 2003; Wang et al. 2003). This method is hence considered as direct hard 
tooling. 
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Direct Hard Tooling of a Functional Copper Tool Insert 

In this example, functional hard tool inserts with conformal cooling channels are 
produced by the SLM process. These functional hard tool inserts are usually made 
with steel material; until recently (since 2011), copper alloys are possible (Becker 
2011) due to advances in the SLM technology. Copper alloy has a much higher 
conductivity and low absorptivity compared to steels, and there were challenges in 
processing this material as processibility of copper powder was previously limited 
by the laser power. 

The high thermal conductivity of copper and its alloys makes them highly 
suitable for mold tooling applications. It can be used as cooling inserts, mold 
inserts, and mold cores with advantages of overcoming thermal problems in plastic 
injection molds. A copper tool insert with internal cooling channels made of 
Hovadur (R) K220 produced by the SLM process equipped with a high power 
laser of 1 kW with an even bean profile was produced from Institute of Laser 
Technology, Fraunhofer. These molds were used in steel injection molding tools for 
the manufacture of plastic parts to ensure rapid heat removal at critical points. 
Cooling channels can also be designed to carry coolant such as water to reduce 
cycle times and part warpage by fast and even cooling of the entire tool which can 
also be produced by AM processes. 


Functional Hotwork Steel Tool Insert Produced by Metal Laser 
Melting and CNC Milling 

In this example, part of the tool insert for a vacuum cleaner cover is directly 
produced by the metal laser melting (also known as SLM) process, and remaining 
tool insert is produced conventionally by CNC milling. The integration of both 
conventional and AM technique is named as hybrid manufacturing. As seen in 
Fig. 15b, the bottom tool insert involves simple cooling channel design for the 
purpose of creating coolant inlet and outlet. This can be produced easily by 
conventional CNC milling. However, the top half involves carefully designed 
cooling channels to evenly and quickly cool down the tool to overcome thermal 
cooling limitations present in conventionally produced tool insert and can only be 
manufactured with AM techniques which in this case, it is manufactured by the 
SLM process. Nevertheless, the main focus is on the capability of the metal laser 
melting process in fabricating a tool insert with conformal cooling channels that 
eventually connects with the CNC-milled half as shown in Fig. 15 (with permission 
from Concept Laser GmbH). 

Further examples of tool inserts produced directly by additive manufacturing 
with metals are shown in Fig. 16 (with permission from Concept Laser GmbH) 
where tool inserts with internal conformal cooling channels result in near perfect 
temperature level on the contour surface with decreased warpage. 
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Illustration and simulation of 
conformal cooling channels 



Top half produced by SLM 


Bottom half produced by CNC milling 


Fig. 15 Functional tool insert produced by CNC milling and metal laser melting 



Fig. 16 Tool inserts with near perfect temperature level on the contour surface with decreased warpage 
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Summary 

Generally, soft tooling (direct or indirect) is widely used for single cast and small 
batch product. For single cast purposes in investment casting, parts with mechanical 
and physical properties identical to parts produced by conventional investment 
casting can be manufactured. Hence, parts produced by RT can be used and applied 
in actual intended application. For small batch production, soft tools are mostly 
used to manufacture prototypes with functional properties required for testing 
before the finalization of the design. 

Molding tools in hard tooling are usually produced directly (although they may 
be produced indirectly) due to high process capabilities of AM techniques as a 
result of technological advances (Cheah et al. 2005). They possess high mechanical 
properties and durability required as a hard tool with long tool life. In addition, they 
exhibit functional properties as they can be built with internal conformal cooling 
channels. Hard tooling in RT is generally meant for large volume production in the 
manufacturing of actual parts or components that can be used or tested for in the 
intended application. 

There are several benefits brought by RT with the most evident being cost and 
time savings. To the manufacturing engineer, RT minimizes design, manufacturing, 
and verification of tooling. Fixed costs are greatly reduced with minimal assembly, 
purchase, and inventory expenses. Consequently, labor and inspection costs are 
reduced accordingly. Costs are further reduced along with flexibility in manufactur- 
ing in RT because fewer inventories are scrapped upon design changes along with 
lower investment necessary for tool production. This is especially true in the 
dynamics of a competitive market where new products can be launched quickly 
to meet changing demands (Chua et al. 1999b). 

Additional advantages of RT in mold making and plastic processing industry 
include increased dimensional accuracy whereby defects can be avoided and parts 
are improved in aesthetic wise. Production cycle time can also be reduced by up to 
50 % in injection molding due to faster cooling. Core and cavity inserts can be 
easily modified or repaired which contributes to short return on investment. In the 
die casting industry, tool inserts with conformal cooling channels help to minimize 
or eliminate thermal-related mold and mold surface defects. Hot spots can be 
eliminated or reduced, increasing casting quality in terms of both dimensional 
and geometrical features coupled with high material structure integrity. Similarly, 
cycle time is reduced and tool life is increased. Core and cavity inserts can be easily 
modified or repaired which contributes to short return on investment. 
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Abstract 

The term Additive Manufacturing (AM) refers to a range of technologies that 
permits automated fabrication of computer-generated 3D models. AM generally 
uses an approach of segmenting the digital model data into thin, precise layers that 
are then bonded together to create the final solid object. Already established for 
larger-scale, lower-resolution applications, AM can also be used to fabricate com- 
plex geometry, micron-resolution models. Some of the basic AM technologies are 
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explored, and how they can be used to make small, precise mechanical structures is 
identified. How these techniques have been adapted to specifically improve this area 
is examined, and how they can develop in the future is discussed in the end. 


Introduction 

Additive Manufacturing (AM) is rapidly becoming a manufacturing process of choice 
for many new products. While there are numerous limitations for the related technol- 
ogies, these are counterbalanced by the time, cost, and geometric freedom advantages 
that they provide. Micron- scale parts are generally difficult to develop using these 
technologies due to the layer-additive approach used. Most layer thicknesses are in the 
order of a few tens to a few hundreds of microns, which characterizes resulting parts 
with a “stair-step” texture, particularly on highly curved geometry. However, some 
machines are better than others at providing fine detail, and there are even a number of 
machines that have been specifically developed to provide such detail. 

This chapter will introduce the basic concepts of Additive Manufacturing. This 
will discuss in particular the reasons why fine detail is difficult to achieve. The 
machines that can provide the highest resolutions will be discussed in more detail. 
As mentioned, there are a small number of dedicated micro AM technologies that 
aim to provide finer-detailed parts and these will be discussed in a separate section. 
Other approaches that have yet to be commercialized will then be discussed, 
followed by a brief discussion on some of the concerns that are specifically related 
to such small-scale technologies. 


Additive Manufacturing 

Additive Manufacturing is a terminology applied to a specific range of technologies 
that add material, normally layer by layer, to form complex 3D structures 
(paraphrased from the ASTM definition (ASTM)). Formerly referred to as rapid 
prototyping and often popularly referred to as 3D printing, the key attributes to all 
these technologies are as follows: 

1 . An original model is designed using 3D CAD (computer-aided design) technology. 

2. The CAD model is converted into a generic file format that is readable by the 
AM machine. 

3. The file is segmented in (normally) layers. 

4. These layers are added as material inside the AM machine. 

5 . Such material is added in a sequential fashion until the part has been completed. 

6. The part is removed from the machine and some post-processing is carried out 
before use. 

A typical sequence related to this description can be seen in Fig. 1. All com- 
mercial AM technologies apply this sequence in one way or another. There are 
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1 CAD 

2 STL convert 

3 File transfer to machine 

4 Machine setup 

5 Build 

6 Remove 

7 Post-process 

8 Application 





Fig. 1 A generalized process chain for Additive Manufacturing (Gibson et al. 2009) 


other approaches, but since the implications for micron-scale fabrication are being 
discussed, the focus can be on three basic material delivery systems: 

7. Liquid photopolymer resin systems 

8. Powder-based systems 

9. Extrusion-based systems 

For the liquid photopolymer systems, this category can be further subdivided 
into vat systems and droplet deposition systems. The vat systems were among the 
first AM technologies to be commercialized, requiring a platform to be lowered into 
a container of liquid photopolymer resin. The first layer of build will occur when the 
platform is just below the resin surface where it will be exposed to light source 
(usually ultraviolet) that will harden, or cure, the resin. The light source will 
somehow describe the cross section of the model to be created. Earlier machines, 
like the stereolithography (3D Systems) process from 3D Systems, used a laser to 
scan over the resin surface. Later technologies used 2D projection techniques to 
expose the resin. The energy content and the exposure time of the light source 
determine how far into the resin the curing takes place. The first layer therefore 
penetrates into the resin a sufficient enough depth to reach and adhere to the 
platform. Between each exposure, the platform indexes down to allow recoating 
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of the resin and subsequent layers adhere to the previous ones. There are alternative 
approaches to depositing and exposing the photopolymer resins, which are 
described in more detail later. 

At this point it is worth mentioning that certain part geometries may not directly 
connect to the platform. In these circumstances it is necessary to build what is 
known as a support structure below these features. Eventually such features will 
combine with the main body of the part, but these supports will keep them in place 
until they do. Such supports must be removed once the build is complete as part of 
the post-processing. The supports can be designed in such a way that they connect 
only slightly with the part so they can be easily removed. It is not necessary for the 
supports to connect with the entire surface of the part since the layers above are very 
thin and light and will not easily sag or sink. 

Powder-based technologies use fine powder particles to form the part. Such 
powders are spread over a build platform layer by layer. These particles are 
somehow combined together to form the solid part. Two methods are commonly 
used to create the solid cross sections. The first method is commonly known as 3D 
printing, as devised by researchers at MIT (Sachs et al. 1992). In this approach the 
powder particles are bound or glued together using printing technology to selec- 
tively deposit the binder. The second approach uses a laser to melt the powder 
particles so that they flow together and rapidly cool into a solid (as illustrated by the 
Selective Laser Sintering process (3D Systems)). In both cases the scanning only 
takes place where the part is and the remaining powder remains in its original state. 
Like all other processes, a platform increments the part for subsequent layers to be 
added. In the powder systems, there is a constraining container that keeps the 
powder from leeching out. This powder therefore acts as a surrounding support 
that removes the need for building supports in the manner described above. 

Extrusion-based systems require material to be extruded through a nozzle. The 
extruded material must be in a liquid state as it passes through the nozzle and 
hardens at a later point to form the solid part. Some simple approaches can apply the 
material in a gel or paste form that essentially dries out to form the solid. The 
resulting solid material is understandably very weak when using this approach and a 
more common method heats up the material until it is molten and therefore cools 
down rapidly to form the solid, like with the Lused Deposition Modeling process 
(Stratasys). The material must therefore be thermoplastic in nature to achieve this. 
Before the material cools or dries, it must bond with the subsequent material to form 
a complete solid. The same problem arises with this process as with 
stereolithography with respect to support structures. With these processes however 
it is common to have a second nozzle so that a second material can be extruded as a 
support. This second material can be chemically different so that the support 
removal can be assisted. Lor example, the part material could be insoluble in 
water so that a water-soluble material can be used for the support. One need only 
place the whole piece in water to facilitate the removal process. 

While it is not essential that material be added in a planar layer-wise fashion, 
currently every commercial system operates in this way. Thus, complex 3D objects 
are simplified as a sequence of 2D planar layers. A layer is generally interpreted as a 
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Fig. 2 A smooth CAD model (above) can be sliced with different thicknesses to produce more or 
less stair-stepping effects in the result (Gibson et al. 2009) 

normal extrusion of a 2D profile intersection with the model data. One limiting 
factor of such technology is therefore the thickness of these layers (as seen in 
Fig. 2). Another limiting factor lies in the ability of the systems to create the layers 
themselves. Lasers have a beam width, extrusion heads have a nozzle diameter, 
printers have a droplet size, and projectors have a pixel resolution. The positioning 
of a laser, for example, can be achieved to a very fine precision, even submicron. 
The overall accuracy of a part may therefore be very good even though the 
minimum feature dimension, like the wall thickness, for example, may be compar- 
atively large and based on the diameter of the laser beam or extrusion head, etc. 
Other factors like material shrinkage or vibration effects may also affect the 
accuracy of a part within a layer. Usually, the coarsest resolution results from the 
minimum layer thickness however and manufacturers have spent most efforts on 
trying to overcome this. It should be noted however that reduction of layer thickness 
will almost certainly increase the build time, and some systems provide a means of 
building with thicker layers when a part is required in as short a time as possible. 


Conventional Commercial Systems with the Finest Resolution 

There are numerous commercial technologies available. The vendors of these 
technologies are under pressure from competition to make their machines cheaper, 
faster, more accurate, and more capable of producing parts with improved mechan- 
ical properties. It is clear therefore that this is a matter of compromise, and so 
reducing the resolution of the parts may require the vendor to ignore other 
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attributes. The fundamental aspects of some technologies allow parts to be made 
with higher resolution than others. It is therefore worthwhile considering each basic 
class in turn, even though photopolymer systems are currently the best in terms of 
accuracy. One may, for example, want the most accurate part that can withstand a 
certain temperature or require a certain minimum Young’s modulus that is outside 
the capacity of photopolymer technology. Other less accurate systems may there- 
fore be chosen accordingly. 

The following sections describe the basic technologies that are applied for 
general Additive Manufacturing. There are some specialized systems that make 
use of similar technologies but specifically for creation of very fine-detailed parts. 


Photopolymer Systems 

Photopolymer systems are the most accurate currently available on the market. 
Liquid resins can be created with quite low viscosity and so can be delivered and 
controlled quite easily in low volumes when compared to molten polymers and 
powders. There are a variety of different ways in which photopolymers can be used 
in AM processes. 

Stereolithography 

As described earlier, stereolithography (SL) uses a laser to expose the photopoly- 
mer material by scanning across the surface. The part is built on a platform that is 
lowered into the vat of resin, thus growing the part from the bottom upwards. In 
order to ensure that surface tension does not disturb the surface of the resin and 
cause it to be uneven, a scraper (sometimes referred to as a doctor blade) redistrib- 
utes the resin. This ensures that the part has a thin, precise layer of resin over the top 
of it. This spreading process can be used in conjunction with a “deep dip” that drags 
the part deep below the surface of the resin to ensure that resin does indeed cover 
the previous layer. 

In order to maintain precision, precise mechanisms must be used. Leadscrew 
mechanisms with servomotor drive units maintain the positioning accuracy. Lasers 
scan using galvanometric mirrors and special optics are used to ensure the spot size 
is maintained over the entire build surface. Stability is maintained by having an 
enclosed build chamber where temperature and humidity are monitored and con- 
trolled so that the part does not swell during the process. Having said all this, there 
is still the possibility of part distortion due to uneven shrinkage of the part. This is 
very difficult to control since it is a function of the original part designed geometry. 
Parts taken off the SL machine will not be fully cured when taken off the machine 
and must be post-cured in a UV oven. If this post-curing is performed with the part 
still on the platform, distortion can be minimized. However, this may cause 
problems when cleaning up the part later, particularly when trying to remove the 
support structures. 

SL machines are mostly supplied by one company called (3D Systems), who 
pioneered much of this technology. Although some machines claimed to have 
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Fig. 3 Parts made using a 
Viper SL machine in high- 
resolution mode (Courtesy of 
FineLine Prototyping) 



greater accuracy in the past, the most accurate machine today has a minimum layer 
thickness of 50 pm using their Zephyr recoating system. Apart from possible 
reliability issues in maintaining such a layer thickness, the reason for not going 
smaller than this value is probably due to the minimum laser diameter. The 
smaxllest laser spot size that can be achieved is 130 pm, although positioning of 
the laser is much more precise. This means however that the level of detail is 
limited by this dimension. While feature definition can be enhanced by correct 
orientation of the part within the machine, there is not much point in using thinner 
layers with this technique. 

Another factor to consider is the build volume of the machine. SL machines can 
be 250 mm x 250 mm x 250 mm or larger in terms of build volume. If building 
large parts, scanning and resin distribution can take more than a minute for each 
layer. If using 50 pm layers, parts can take more than 4 days to complete. Most SL 
machines therefore have larger layer thicknesses and can even vary the laser spot 
size in order to reduce the overall build time. High precision is therefore only used 
commonly to make small parts using SL, and so the large vat size is therefore 
unimportant. Figure 3 shows some micron-scale parts that were made using the 
most precise SL equipment produced by 3D Systems with a resin specifically 
designed for this type of application. 

Polyjet 

The Stratasys Polyjet and Connex processes (originally developed by the company 
Objet, who subsequently merged with (Stratasys)) also use photopolymer resins. 
With this technology the resin is printed using drop on demand deposition heads, 
much like conventional 2D inkjet printing. The droplets of resin are printed using an 
array of print jets that make this a highly parallel process. Furthermore, there are 
different printheads for part material and support material. Once the droplets are 
deposited onto the platform or onto previous layers, a UV light follows the 
printhead to cure the resin. Part material will cure to a hardened solid polymer 
while the support material cures to a jelly like state that facilitates removal once the 
build has completed. The Connex process further elaborates this by allowing two 
separate part materials to be used and even permits the mixing of these part 
materials together in the build. Two materials of different colors can thus be 
blended to show gradient colors. It is even possible for two materials with different 
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mechanical properties to be blended together using this process, and so hard and 
soft materials can be used to form a variety of different tactile effects. 

The use of well-proven technology developed for the printing industry provides 
an excellent platform for quality and reliability. However, the photopolymer resins 
are still significantly higher in viscosity when compared to the inks that are 
normally dispensed by these devices. This means that care must be taken to prevent 
these nozzles from clogging. Further to this, only the lower viscosity resins can be 
used here and so the range of materials and properties is not as high as 
stereolithography. Since the droplets are extremely small, the resolution is equally 
small. The smallest layer thickness is approximately 15 pm. In-plane resolution is 
not as high since droplets will spread out and flatten on impact. However this is still 
very good and in the order of 50 pm. However build speed is still reasonable when 
compared with the laser-based technology due to the use of multiple print nozzles 
printing in parallel. 


EnvisionTEC 

This company initially introduced the Perfactory system (EnvisionTEC), which 
used 2D LCD arrays (similar to those used in flat screen computer monitors) as the 
mechanism to expose the photopolymer resin. This approach makes it possible to 
realize a very high resolution in all three Cartesian directions, especially since LCD 
screen resolutions have correspondingly increased in recent years. Since this 
enables exposure of a full layer, build speed is also quite high, albeit tempered by 
the lower light intensity that comes from LCD technology. This light is obviously in 
the optical range, and thus there is a need to have different resins compared with the 
other photopolymer systems, tuned to cure at a different frequency. Since the earlier 
systems, a range of new machines has been released by this company, aimed at 
specific application areas. There is a particular niche of small-scale medical appli- 
cations, like in dentistry and the fabrication of in-the-ear hearing aids. Machines are 
also widely used in the jewelry industry as well due to the fact the materials work 
well with existing investment casting technology. Layer resolution is around 15 pm 
for some of their machines, dependent on which material is being used. In-plane 
resolution is about 45 pm, which is somewhat similar to the Objet machine. 


Powder Systems 

This approach makes use of powder to form the structure of the part. There are 
currently two technologies that make use of powders, powder binding and powder 
sintering. Powder binding is essentially the use of inkjet printing technology to glue 
powder particles together. Powder sintering makes use of polymer powders and 
uses laser energy to melt them, thus fusing them together to form a solid. Both of 
these technologies are limited by the powder particle morphology and dimensions. 
Unlike the resin-based systems, the particles are discrete and so cause problems 
relating to keeping them in place during the solidification process. The smaller the 
particle size, the more difficult this would be since the surface to volume ratio 
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increases. With relatively low density polymeric materials, static charges are likely 
to attract the powder particles to surfaces, causing disturbance in position. Since 
there is no evidence of the powder binding systems being developed into a micron 
scale, it will not be described here. 

While the term is somewhat misleading, the earliest powder-based technol- 
ogy was called Selective Laser Sintering. This process has been commercialized 
for polymers by the US company DTM, which was merged with 3D Systems a 
few years ago, and by the German company (EOS). Powder material is some- 
how deposited as a thin layer onto a build platform. There, laser energy is 
directed at the powder to cause localized melting (or at least sintering if the 
whole particles do not melt, hence the name), which rapidly solidifies to form 
the solid structure. Further powder is deposited onto the platform once it has 
been lowered by the thickness of one layer and the process is repeated. 
Constraining walls around the platform keep the part and un-melted powder in 
place. Since the surrounding powder is loose, there is no need for additional 
support structures. 

One major benefit of powder-based systems is the range and quality of materials. 
These materials have some of the most superior mechanical properties available in 
Additive Manufacturing. Furthermore, this extends beyond polymer technology, 
with metal powders available, albeit requiring higher power laser technology as 
well as more precise control over the process parameters. This technology is also 
therefore currently among the most expensive. 

The most limiting factor behind this process in terms of accuracy is the powder 
particle size. If particles are too small, it is difficult to prevent them from moving 
around the powder bed and adhering to other surfaces. It is possible to have smaller 
metal powder particles compared to polymer due to their higher density and also 
because polymers store electrostatic charge, which can cause them to move more. 
Particle sizes for polymer materials average around 70 pm, while metal powders 
can be lower. Unused powder from a previous build can be recycled into a later one. 
The elevated temperatures during the build can cause small particles to bind with 
others that will ultimately affect the final part accuracy. 


Extrusion Systems 

Extrusion-based technology is by far the most popular in recent times. This is partly 
because the material delivery process is the simplest of all. Material is extruded 
through a nozzle, which is mounted onto a three-axis positioning system. Material 
is therefore deposited using a single-channel approach with a phase change from 
molten (or gel state) to solid. The transition will cause the material to adhere to 
surrounding material, and the rapid cooling or solidification process will ensure the 
geometry remains intact. It is somewhat difficult to extrude material in a precise 
manner at a high feed rate, and so this process is considerably slower than 
competing processes. The competitive advantage therefore lies in the process 
simplicity that leads to low-cost machines. Recent expiration of key patents related 
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to this technology and the publication of open-source designs have resulted in a 
large number of commercial systems exploiting this extrusion-based approach. 

The most well-known and original technology is referred to as Fused Deposition 
Modeling and was commercialized through the company called (Stratasys). These 
machines are somewhat more sophisticated than the recent commercial systems 
because some patents remain in place that prevent the others from using environ- 
mental chambers to prevent temperature-based distortion in the parts. The Stratasys 
machines are therefore capable of making the highest quality and widest range of 
parts. Their extrusion heads are capable of melting high- temperature materials that 
can be used for functional application in demanding industries like automotive and 
aerospace parts. Furthermore, the more precise control of temperature permits the 
smallest nozzles to be used. However, even these machines are somewhat limited in 
accuracy compared with photopolymer and even powder-based systems. It is 
however worth mentioning extrusion-based technologies since it does have the 
capacity for creating micron- scale parts. 


Commercial System Summary 

The following Table 1 covers the current commercial technologies, indicating the 
most precise of the various systems available. These were taken from the published 
vendor information and were verified at the time of writing this chapter. Much of 
this technology can be considered relatively mature, and it is not expected that these 
machines are going to become much higher in resolution. Some specialist technol- 
ogies have been developed specifically to push these barriers and that is the subject 
of the following section. 


Table 1 List of Additive Manufacturing technologies with typical build dimensions from some of 
the referenced manufacturers 



Model 

Build volume 
(mm) 

Layer 

thickness 

(pm) 

Minimum 

resolution 

(pm) 

Stereolithography SL 
3D Systems 

iPro 8,000 

650 x 750 x 550 

50 

130 

Selective Laser 
Sintering SLS 3D 
Systems 

sPro 60 

381 x 330 x 457 

80 

100 

Fused Deposition 
Modeling FDM 
Stratasys 

Dimension 

Elite 

203 x 203 x 305 

254 

254 

Polyjet Stratasys 

Objet30 Pro 

294 x 192 x 148 

16 

100 

Micro SL D-MEC 

ACCULAS 

150 x 150 x 50 

5 

2 

Micro SLS EOS 

EOSINT p60 

57dia x 30 

5 

30 


a Note the figures are based on the manufacturer’s claims in published data. Feature size is an 
estimate based on dimension of laser spot, nozzle diameter, head positioning accuracy, etc. The 
smaller model machines have been cited since these are likely to provide slightly higher accuracy 
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One is probably approaching the limit of resolution that these machines can 
reach. This is as much as anything to do with the speed at which these machines can 
produce parts and the constraints of building precision technology. If one is to build 
with much thinner layers than the current ones, the operating environment would 
probably have to include clean-room technology, parts would have to be fabricated 
to very high precision, and additional sensor/control systems would have to be 
implemented to monitor and maintain the system. All of this would make the 
machine cost-prohibitive to many users. It is clear that AM technology is moving 
towards a wider range of users, even for domestic use. If this is so, then it is not a 
good idea to add complexity and cost. It is therefore considered that micron-scale 
AM will take a different route to. 


Dedicated Micro AM technologies 

Two of the technologies mentioned above have been commercialized into micron- 
scale systems aimed at producing small, highly detailed free-form products. Until 
recently stereolithography was the only core technology that was available for this 
purpose. However, recent introduction of powder laser sintering technology at a 
micron scale has become available. 


Micro-Stereolithography 

As mentioned previously, stereolithography has proven to be the most popular and 
effective way of creating micron- scale AM parts. This is probably because liquid 
photopolymers are easier to handle compared with molten or powder materials. 
However, the polymerization process is still problematic. There appear to be two 
popular methods of causing polymerization: 2-photon and dynamic mask 
projection. 

With 2-photon systems, instead of using UV light to cure the resin, high- 
intensity near infrared (IR) light is used. With UV, polymerization will take place 
at much lower intensity and roughly wherever the light hits the resin. As such, the 
resins used must be kept sealed and protected even from the UV radiation in 
ambient conditions. 2-photon polymerization is a more nonlinear process and really 
takes place when a very high intensity is present. As such, a near IR laser beam can 
be focused inside the resin, rather than on the surface and polymerization will only 
take place where that beam is focused. The focused region can be very small, in the 
order of 100 nm, which can result in very fine detail. Pulse duration can be in the 
femtosecond range and extremely high power (hundreds of gigawatts). Scanning 
time can be very fast, and since there is no requirement to raise or lower platforms 
or manipulate the resin during the build, small parts can be built very quickly. A full 
3D geometry of 100 pm cube can be fabricated in a few minutes. Furthermore, the 
need for support structures is significantly reduced, although not completely 
eliminated. 
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Fig. 4 Micro- 
stereolithography part 
(Courtesy of Nanoscribe 
GmbH) 



The major disadvantage of 2-photon systems is the high cost. High-intensity fs 
IR lasers are expensive and somewhat rare. The optics required to focus these lasers 
and the positioning systems are all extremely difficult to build. Resins, although 
requiring careful formulation, are not so difficult to source since many have already 
been developed for the semiconductor industry as part of the wafer fabrication 
processes. However, it should be noted that fabricating and handling complex 3D 
parts that may be difficult to even see with the naked eye demand precise technol- 
ogies regardless of which process is used. A 2-photon system is commercially 
available from the German company (Nanoscribe). A sample part built with 
features smaller than 1 pm using this process can be seen in Fig. 4. 

The dynamic mask projection approach uses the more conventional UV curing 
approach. As such, there is a need for a build platform that raises and lowers so that 
layers can be created using mechanical positioning. Layers of around 5 pm can be 
achieved, but surface tension effects require careful handling of the resins, includ- 
ing allowing time for the resin to settle to a flat surface. The lower the viscosity of 
the resin, the quicker this settling will be. Raising the temperature of the resin can 
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help to reduce this viscosity, but even so there are a limited number of resins that 
would work using this technology. 

With UV curing, lasers are not a good option since polymerization does not 
require much energy. A single laser would take time to scan across the surface and 
resolution would be determined by the diameter of the laser. A more effective 
approach would be to use an approach that can expose a full layer at the same time, 
similar to the EnvisionTEC Perfactory process. A popular approach to achieve this 
is to use a digital micromirror device (DMD). This technology was developed for 
projector devices and consists of an array of tiny mirrors that can change position 
due to an attached actuator. By shining light onto or away from the surface of the 
resin, depending on the position of the mirror, it is possible to create a 2D mask. 
DMDs are high resolution (over 1,000 x 1,000 pixels) and can be focused to the 
correct dimensions using suitable optics. High-speed switching of the actuator can 
result in gray-scale exposure, but it is not clear whether this would work or has been 
used to improve the layer precision. Even so, with a conventional DMD and a 5 mm 
vat size, it is possible to achieve 5 pm resolution. 

Support structures are necessary using this approach. However, the small scale 
of the system and the corresponding part dimensions means that only a few supports 
would be needed for any part. A technique that has been used is to place the 
supports in such a way that the part is not physically connected to the part. A 
small gap of one layer thick can be placed between them, relying on the surface 
energy of the liquid resin to keep the geometry in place. Such small parts would 
benefit from any approach that simplifies or reduces the amount of handling of parts 
coming from these machines. The ACCULAS system from Japanese company 
D-MEC uses a DMD approach (D-MEC), achieving a spatial in-layer resolution 
of 1 pm and a layer thickness of 5 pm. 

Since the DMD-based technology is based on more readily obtained compo- 
nents, it is inevitably going to be cheaper. However, the resolution is not as good as 
2-photon laser-based machines. 


Micro Selective Laser Sintering of Metals 

Already well-known for conventional SLS technology, the German company EOS 
recently introduced a micro SLS machine (EOS). At the time of writing, this 
machine is in beta testing but has already been used to produce many micron- 
scale parts (Fig. 5). The main advantage of this process is the materials used. EOS 
uses metal powders in a similar way to conventional SLS machines, the main 
difference being the scale of the machine. With metals being high density, it is 
possible to reduce the powder particle size to much lower than polymer powder 
systems without having problems with the material feed requirements. The quoted 
layer thickness for the EOS machine is a minimum of 1 pm, which must be the 
approximate size of the powders used. 

The energy requirements for such a system to melt powders, even in thin layers, 
are quite high. Beam technologies, like lasers, are currently the only option for 
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Fig. 5 A metal part 
fabricated using an EOS 
micro SLS machine 
(Courtesy of EOS) 



metals in such machines. The use of a laser means much the same as mentioned 
previously that the processing (scanning) time is quite long and that there is a 
limited resolution correlating to the beam diameter. The laser beam used in the EOS 
machine is 30 pm, which corresponds to a minimum feature dimension of 50 pm 
due to thermal transfer effects in the melt region. 

As a contrast, metal parts can also be produced using Microfabrica’s MICA 
technology (Microfabrica). This technology uses lithographic processes developed 
for semiconductor device manufacturing. Materials like palladium, rhodium, and 
nickel-cobalt can be fabricated this way. Parts are generally quite flat, with 5 pm 
layers, features of around 1 pm, and gaps around 10 pm. Each layer has an 
extremely good surface finish. 


Experimental Systems 

There are numerous research projects investigating micron-scale Additive 
Manufacturing. Many are based on the micro- stereolithography systems described 
above. Work may involve new resin formulations, more precise positioning 
devices, higher resolution exposure systems, or better environmental control. 
Improvements are primarily incremental and most of the work can be easily 
searched. 

Research into other technologies may not be so easy to find however. For 
example, attempts to fabricate micron-scale Fused Deposition Modeling technol- 
ogy may be overlooked. A requirement for micro FDM would be to reduce the 
extrusion filament dimensions, which would make layer thicknesses smaller as well 
as allowing finer-detailed parts to be defined in-layer. As the extrusion nozzle 


71 Micro Prototyping and Fabrication in Manufacturing 


2565 


becomes narrower, so the outer surface to volume ratio decreases. This means that 
the shear forces observed at the walls of the nozzle start to dominate the material 
outflow. Pressure required to push material out of a nozzle becomes exponentially 
more difficult as the diameter decreases. If the shear forces vary in different parts of 
the barrel due to uneven polishing or scratching of the inside, the flow direction can 
also be affected and the material may not come out of the barrel straight. Since the 
distance between the nozzle and the platform or part on the platform is very small, 
this may not have a large impact, but all inconsistencies in the process can lead to 
inaccuracies. With most FDM systems, the material is heated in a chamber and 
cools during travel down the nozzle. One technique that could be adopted is to 
maintain the temperature during this travel to provide consistent flow. A proposed 
new design for a 50 pm FDM system with heated nozzle including a good study on 
how to model the molten material flow throughout the system can be found in 
(Monzon et al. 2013). 

As mentioned previously there have been a number of developments in 
stereolithography technology that are moving away from the conventional 
approach used in larger systems. The 2-photon approach makes it possible for the 
resin to cure within a very sharp cutoff region, thus making it suitable for micro- 
and even nanoscale structures. The photopolymer molecules absorb two photons 
instead of just one in conventional SL and so are excited to a higher state. An 
experimental 2-photon system was developed by Lim et al. (Lim et al. 2005) that 
effectively demonstrates the feasibility of this approach, but it is clear that there is 
still much work to be done to make it a truly viable technology, including the 
development of appropriate polymer systems (Schafer et al. 2004). 

Another approach that can be applied to micron-scale photopolymer technology 
is the use of two energy sources, namely, ultraviolet combined with infrared 
radiation in the form of stereo-thermal lithography (Bartolo and Mitchell 2003). 
The photopolymer requires to be formulated with both photo and thermal initiators. 
This makes the curing more localized and efficient, which turn will make the parts 
more accurate. The curing process can be very complex and involve a number of 
stages. While this is somewhat involved, an advantage is that multiple material 
parts can be possible. 

Summary 

It should be noted that micron-scale Additive Manufacturing is not the same as 
lithographic technology that was developed from the semiconductor industry and 
operates on the nanometer scale. AM is not just about being able to make parts 
using a layer-based approach but also being able to achieve this quickly and easily 
directly from CAD-generated 3D model data. Lithographic processes, like LIGA, 
can make some kinds of 3D shapes, but they are by no means as quick and versatile 
as what is expected from AM technology, particularly in terms of producing the 
geometric freedom required. Parts made from micro AM are therefore slightly less 
precise but considerably higher in geometric complexity. 
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When you reduce the scale of AM technology, some of the factors that were 
originally assumed to be negligible start to take effect. This can include surface 
tension of resins causing the parts not to be flat or the shear forces in nozzles 
causing material not to flow smoothly. For larger parts requiring lower resolution 
using conventional AM, these factors can be ignored. Furthermore, handling of 
such small parts can cause problems for the user, not only in terms of ensuring the 
part is not damaged, but also in how it may affect the system. Intensive calibration 
must be done every build to ensure the settings are correct. 

Micro Additive Manufacturing is definitely a niche application field within the 
wider AM market. It makes use of similar approaches and devices that can be found 
in its bigger brother. However, in order to get the best out of it, new machines are 
likely to evolve away from this into more specialist technologies. They will still use 
a layer-based approach with direct driving from the CAD sheer software, but they 
will also have special features that assist in the material delivery at such a high 
resolution. 
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Abstract 

Rapid prototyping or referred as layered manufacturing has been widely used in 
fabricating 3D components from CAD data. It has evolved from liquid, plastic, 
powder to metal-based systems in the past two decades. Recently, 3D printing 
using polymeric material has caught much attention in many industry applica- 
tions including in micro- and bio-fabrication due to its advantage of layer 
manufacturing. In this chapter, drop-on-demand (DoD) printing will be specially 
discussed. Among those, the novel electrohydrodynamic jet printing (E-jetting) 
and piezo-actuated micro-dispensing methods will be described in details. Their 
applications to biomedical engineering - e.g., PCL scaffolding, HA bioactive 
multilayered coating, and cell printing - will be presented to demonstrate its 
potentials. 


J.Y.H. Fuh (ISI) • J. Sun • E.Q. Fi • J. Fi • F. Chang • G.S. Hong • Y.S. Wong • E.S. Thian 
Department of Mechanical Engineering, National University of Singapore, Singapore 
e-mail: mpefuhyh@nus.edu. sg 


© Springer-Verlag Fondon 2015 

A.Y.C. Nee (ed.), Handbook of Manufacturing Engineering and Technology, 

DOI 10. 1007/978-1-447 1-4670-4_79 


2567 


2568 


J.Y.H. Fuh et al. 


Introduction 

Rapid prototyping (RP) is a solid free-form fabrication technique which creates 
products using additive manufacturing technology. This technique is different from 
traditional manufacturing methods of subtractive manufacturing using CNC 
machine tools. Based on the concept of material addition, physical objects are 
fabricated by adding materials layer by layer. Computer-aided design (CAD) is 
usually used in the RP system to create a 3D model of the object in the first place. 
The software of the RP system then converts the 3D model generated from the CAD 
drawing into a format compatible with the system. An example would be the STL 
format that is also adopted in this project. The 3D model is then converted into 2D 
data usually by slicing and printed out layer by layer into a solid physical object. In 
this manner, RP technology is able to build complicated shape or geometric features 
without the use of tools or molds. This flexible method allows more effective 
communication between design and manufacturing and greatly reduces the time 
required for product development. 

RP is the name generally given to the various additive processes. Besides the 
advantage of environmental benignity, additive manufacturing process is also a 
low-cost production method for reducing the material wastage, especially for the 
specialty polymers and precious metals. RP sometimes also referred as additive 
manufacturing (AM) technologies have attracted considerable attention because 
they have the potential to greatly reduce ecological footprints as well as the energy 
consumed in manufacturing. Inkjet printing is one of the most successful additive 
manufacturing technologies. It develops at a rapid pace and has been expanded 
from conventional graphic printing to various new applications, such as organ 
printing, displays, integrated circuits (ICs), optical devices, MEMS, and drug 
delivery. Accordingly, the dispensed liquids have been expanded from the conven- 
tional pigmented ink (or standard dye-based ink) to polymers, gels, cell ink, or other 
materials which often have higher viscosities or even contain large particles or 
cells. 

Drop-on-Demand (DoD) Inkjet Printing 

The traditional inkjet printer designed for graphic printing is unable to fulfill 
the new challenges, one of which is to dispense fluids of very high viscosities. 
For most of the commercial inkjet printheads, only liquids with viscosities lower 
than 20 cps can be consistently dispensed. Fluids with even higher viscosities have 
to be diluted before printing or warmed up during the printing, which will adversely 
affect the properties of the liquids. Another challenge is raised by nozzle clogging. 
Fluids containing particles, or cells, can easily block the nozzle orifice, resulting in 
time-consuming nozzle cleaning or even damage of the entire conventional 
printhead. 

Drop-on-demand 3D inkjet printing is an additive or RP manufacturing process, 
a data-driven process that patterns directly onto the substrate with ejected droplets. 
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It is capable of precise deposition of picoliter volumes (down to 2 pL, 15 pm in 
diameter) of liquids into 3D layer-by-layer structures at high speed (up to 60 kHz 
(Kyocera 2009)) and accuracy (<5 pm) on a target surface, even onto nonplanar 
surface. Due to its advantages in high resolution, automation, low cost, noncontact, 
flexible, environmental benignity, and ease of material handling, the application of 
DoD inkjet printing technology has been expanded from conventional graphic 
printing to new areas, such as organ printing, displays, integrated circuits (ICs), 
solar cells, memory devices, optical devices, MEMS, and drug delivery. 


Introduction to Inkjet Printing 

Inkjet printing is a contact-free dot-matrix printing technique in which an image is 
created by directly jetting ink droplets onto specific locations on a substrate. The 
concept of inkjet printing can trace its history to the nineteenth century, and the 
inkjet printing technology was first developed in the early 1950s. Inkjet printers that 
are capable of reproducing digital images generated by computers were developed 
in the late 1970s, mainly by Hewlett-Packard, Epson, and Canon. The booming of 
the personal computer industry in 1980s has led to a substantial growth of the 
printer market, and nowadays personal printer is present in almost every office and 
home. Inkjet printing technology is developing at a rapid pace. It has been expanded 
from conventional graphic printing to various applications, such as organ printing, 
displays, integrated circuits (ICs), optical devices, MEMS, and drug delivery. 

Classification of Inkjet Printing Techniques 

Inkjet printing technology has been developed in a wide variety of ways. In Fig. 1, 
the inkjet tree structure shows a layout for most of the better-known inkjet printing 
techniques and some of the corresponding adopters. As can be seen, there are two 
categories of inkjet printing technology: continuous inkjet printing and drop-on- 
demand inkjet printing. 

Continuous Inkjet Printing 

The earliest inkjet devices operated in a continuous mode. The idea was first 
patented by Lord Kelvin in 1867, and the first commercial model was introduced 
by Siemens in 195 1 . In this technique, a continuous jet of the liquid ink is formed by 
applying pressure to the ink chamber with a small orifice at one end. A fluid jet is 
inherently unstable and will break up into droplets, which is entirely a consequence 
of the surface tension effects. This phenomenon was firstly noted by Savart in 1833 
and described mathematically by Lord Rayleigh (1878). If surface tension force is 
the only force acting on the free surface of the jet, it will break up into droplets of 
varying size and velocity; when a periodic perturbation of an appropriate frequency 
is applied to the liquid, typically using a piezoelectric transducer, the jet will break 
up into droplets of uniform size and velocity. 

The droplets separate from the jet in the presence of a properly controlled 
electrostatic field which generated by an electrode that surrounds the region 
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Fig. 1 Layout of the different inkjet printing technologies 


where break-off occurs. As a result, an electric charge can be induced on the 
drops selectively. Subsequently, when the droplets pass through another electric 
field, the charged droplets are directed to their desired location on the substrate to 
form an image; those uncharged droplets will drift into a catcher for recirculation. 
Continuous inkjet can be classified into binary deflection or multilevel deflection 
according to the drop deflection methodology, as can be seen in Fig. 1. 

The major advantage of continuous inkjet printing is that it can generate ink 
droplets with very high velocity, which can reach to 50 m/s. This feature allows 
for the usage of a relatively long distance between printhead and substrate. 
It also allows for rapid droplet formation rate, also known as high-speed printing. 
Another advantage of continuous inkjet is no waste of ink, due to droplet recycling. 
Furthermore, since the jet is always in use, nozzle clogging can be avoided in 
continuous inkjet. Therefore, volatile solvents such as alcohol and ketone can be 
employed to promote drying of droplets onto the substrate. 

The major disadvantage of continuous inkjet is that the ink to be used must be 
electrically conducting to ensure that ink droplets can be charged and directed to the 
desired location. Furthermore, due to ink recycling process, ink can be contaminated. 
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Pattern Data 


Fig. 2 Schematic of the DoD inkjet printing process 


Drop-on-Demand Inkjet Printing 

Drop-on-demand inkjet systems were developed in the 1970s, when different 
actuation principles were utilized (Heinzl and Hertz 1985). In this technique, ink 
droplets are produced only when they are required. According to the mechanism 
used during the droplet formation process, DoD inkjet can be categorized into four 
major types: thermal mode, piezoelectric mode, electrostatic mode, and acoustic 
mode. Most of the DoD systems in the market are using the thermal or the 
piezoelectric modes. Nevertheless, no matter which mode is used, the basic prin- 
ciples of all these different inkjet methods are similar: a transducer, normally a 
piezoelectric element or a thermal heater, generates a pressure pulse into the ink and 
forces a droplet out of the orifice, as schematically shown in Fig. 2. The only 
difference lies in that the way how this pressure pulse is generated. 

Electrohydrodynamic Jet Printing (E-Jetting) 

Built based on the layered manufacturing principle, electrohydrodynamic technol- 
ogy, which is mainly consisted of electrospinning (E-spinning) (Formhals 1934) 
and electro spraying (E-spraying) (Jaworek and Sobczyk 2008), is a novel method to 
produce fibers or particles in micro -/nanoscale, utilizing the principle of the 
dynamics of electrically charged fluids. E-spinning process has attracted rising 
interest as an effective method to fabricate micro -/nanofibers. During E-spinning, 
the charged jet of polymer solution is deposited onto the collector under the 
influence of a high-voltage electric field. The feasibility of generating nanofibers 
with diameters under 100 nm has been demonstrated using E-spinning method. 
Representative applications include optical sensor fabrication (Wang et al. 2002), 
drug delivery (Liao et al. 2006), and tissue engineering (Pham et al. 2006). To 
further expand its applications, it is important to achieve the high controllability of 
fibers. 

However, the resultant fibers from conventional E-spinning process are disor- 
dered, attributing to the chaotic whipping of liquid jet over the long distance 
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Fig. 3 Schematic view of the E-jetting system 


between nozzle and collector. Thus E-spinning process is limited for the applica- 
tions to the requirements of fiber patterning. To align fibers is challenging to 
achieve for the E-spinning process. Researchers have made an effort to modify 
the process using a mandrel as the fiber collector to achieve aligned fibers, but fully 
controlled fiber orientation is still yet achieved. Sun et al. (2006) have also 
demonstrated the ability of the near-field electrospinning process to produce 
aligned fibers with diameter ranging from 100 to 500 nm. However, the throughout 
is limited, and fibers are discontinuous since the needle tip has to dip again to get 
solutions after dispensing each time. Furthermore, a few works on improved near- 
field electro spinning have been done to evaluate the substrate effect dispensing 
patterns (Chang et al. 2008a) and process parameter optimization (Thirumalpathy 
and Leroy 2011). 

The E-jetting system is an alternative electrohydrodynamic jet printing system 
which applies high voltage on the fluid to achieve higher resolution for printing. 
Figure 3 presents the schematic of the E-jetting system. As shown, the E-jetting 
system mainly consists of a pneumatic system, a high-voltage amplifier, an 
in-house made printhead, and an XYZ stage with a controller. During the 
E-jetting process, sufficient solution is added into the reservoir which is fixed 
with a stainless steel blunt needle. The supply of solution is controlled by the 
precise pneumatic system with a constant pressure, and a high DC voltage is applied 
between the needle and the substrate to generate fibers continuously. At the 
beginning, there is a conical meniscus of the solution formed at the tip of the 
nozzle. After charging with high voltage, the conical meniscus is drawn and Taylor 
cone is formed. Following that, the liquid is stretched out by the electrostatic forces. 
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Various phenomenons of droplets, jet, and multi -jet will form during electrohy- 
drodynamic printing process, depending on the different processing parameters 
applied. To achieve the fiber printing and orientation, only the process with single 
jet is studied in this work. Before jet bending by the whipping instability as the 
electrospinning process (Doshi and Reneker 1995), the fiber is deposited continu- 
ously on the substrate, and a complex fiber pattern can be precisely realized with the 
user-defined path created by the XYZ stage. 


Advantages and Disadvantages of Inkjet Printing 

As a rapid prototyping technique, inkjet is an additive manufacturing process. It 
ejects droplets only when required and hence reduces the material wastage. This 
implies a lower cost for the applications that requires expensive materials. In the 
meantime, environmental impact of the technique is reduced due to reduction of 
material wastage and less usage of solvents, which is sponsored for conservation 
and the realization of a sustainable society. As compared to the traditional 
photolithography-based patterning process, which consists of many subprocesses 
and leads to long processing time and high cost, inkjet printing is much more 
compact. It avoids all those complicated subprocesses. Furthermore, it also saves 
the cost for lithography masks as well as the huge work for storing the hundreds of 
masks. 

Inkjet printing is a data-driven direct-write process that can directly transfer 
computer-aided design (CAD) into device patterns, which can greatly save the 
process time and accommodate customization. Inkjet printing offers the advantage 
of noncontact between the nozzle and the substrate; thus there is no mechanical 
wear on the printed sample. The possibility of cross-contamination is also reduced 
to a minimum. 

To conclude, inkjet printing offers advantages in low cost, compact, automation, 
noncontact, environmental benignity, and ease of material handling. It is a highly 
flexible technology that is able to accurately deposit small volumes of materials in 
almost any pattern. There are two main disadvantages in inkjet printing. Firstly, the 
anisotropic nature of the inkjet process, due to the intrinsic pinhole nature of the 
deposited ink, results in the uneven surface roughness of the printed features 
(Chang et al. 1999). Furthermore, this film nonuniformity can also be produced 
by the inevitable “coffee stain” effect, which arises from interaction of multiple 
effects of the solvent drying process (Singh et al. 2010; Deegan et al. 1997; Ikegawa 
and Azuma 2004). This disadvantage in film uniformity does not exist in planar 
processing currently used in industry. Secondly, the resolution of inkjet method is 
limited by the dispensed droplet size. The minimum diameter of an inkjet droplet 
for a state-of-the-art inkjet printhead is around 10 pm. The resulting printing 
resolution is enough for document printing use, but for nanotechnology use and 
many other industrial applications, such resolution is not good enough, and a more 
precise inkjet device needs to be developed. Furthermore, the attainable feature size 
of a component fabricated by inkjet printing is also affected by how the droplet 
interacts with the substrate. Thus the impact behavior of the droplets onto a 
substrate also needs to be carefully studied. 
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Applications of DoD Printing Technique to Bioengineering 


The DoD printing is proven to be a promising micro-fabrication process for micro- 
and bio-RP applications, which owns high resolution and particularly good 
controllability of fiber orientation. It is relatively easy and cost-effective to set 
up. The technique shows the feasibility of fabricating complicated 3D microstruc- 
tures with ultrafme fibers, showing its potential applications in the manufacturing of 
optical devices, sensors, drug delivery, tissue scaffolding, etc. 


Fabrication of Scaffolds Using E-Jetted Oriented Fibers 
Experimental Setup 

An experimental setup of the E-jetting system is demonstrated in Fig. 4. Positive 
pressure ranging from 0 to 6 bar is provided by the PC-controlled 5 -channel 
pneumatic system, maintaining the sufficient solution supplement. As Food and 
Drug Administration approved biocompatible material, polycaprolactone (PCF) is 
used as demonstration for fiber printing. The high-voltage amplifier, which is the 
critical component of the whole system, is applied between a nozzle and substrate, 
to generate the high DC voltage (—3.25 kV ~ + 3.25 kV) for solution charging. The 
in-house made printhead is adaptable to different nozzles with an inner diameter 
varies from 80 to 510 pm. Silicon substrate is placed under the nozzle with a very 
close gap (less than 2 mm). To orient the fibers during E-jetting, a precise XYZ 



Fig. 4 Experimental setup of the E-jetting system 
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Fig. 5 E-jetted 3D scaffold structure with various pore sizes: (a) pore size = 500 pm, (b) pore size 
= 400 pm, (c) pore size = 300 pm 


stage is applied. It is controlled by the software and controller via TTL signals to 
move with the predefined path. 

Ejected Scaffolds for Tissue Regeneration 

With good controllability of fiber orientation, E-jetting technique is applied to 
create three-dimensional (3D) scaffold. For conventional E-spinning process, the 
resultant fibers are randomly deposited without capability to define the pore size. 
Different from that, it is the significant advantage of E-jetting technique to construct 
3D structures with controlled pore size. It is required for scaffolds to own pores 
ranging from 100 to 500 pm for tissue growth and prosthesis fixation enhancement. 
However, the smaller pores less than 300 usually present low porosity, limiting the 
cell ingrowth and nutrient penetration. In this study, the scaffolds with various 
given pore sizes of 500, 400, and 300 pm are fabricated (Fig. 5), consisting of layer- 
by-layer-deposited uniform fibers. 

As illustrated from the zoomed picture of the scaffold with pore size of 500 pm, 
the diameter of fibers is less than 20 pm, while the gap between the adjacent fibers is 
500 pm, meaning that slenderness ratio is 25: 1. It is challenging for such fine fibers to 
support themselves without bending in the long-span structure. While for the scaffold 
with decreased pores of 400 pm, uniform fibers with diameter of 20 pm are also 
achieved. With further reduction of pore size down to 300 pm, it is illustrated from 
the zoomed pictures that the E-jetted fibers are uniform, providing good intercon- 
nectivity of the 3D structures. With oriented ultrafine fibers, the scaffolds own desired 
pore size and connectivity, exhibiting superiority for tissue engineering application. 


DoD Printing of Multilayer Bioactive Coating 

Depending on the fluid properties of dispensing materials and requirements for 
specific applications, various micro-dispensers are selected by different 
researchers. Walter et al. (2005) utilized both piezo-based and solenoid valve- 
based liquid dispensers to print miniature biological assays. To construct a 3D 
tissue, Chang et al. (2008b) used four types of nozzles, i.e., solenoid-actuated 
nozzle, piezoelectric glass capillary nozzle, pneumatic syringe nozzle, and spray 
nozzle to deposit specified hydrogels with different viscosities. 
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Fig. 6 Schematic diagram of the solenoid-actuated micro-valve dispenser 


Experimental Setup 

In a proposed DoD micro-dispensing system (Sun et al. 2012), a solenoid- 
actuating micro-valve is used for the biocoating fabrication, which functions 
based on the control commands sent from a PC. The solenoid-actuated micro- 
valve dispensing system consists of a micro-valve and its driver and a pneumatic 
controller. 

Figure 6a shows a schematic diagram of the solenoid micro-valve. This micro- 
valve operates through a magnetic-piston system to open or close the valve. The 
opening duration is guaranteed by first 24 V spike voltage to activate the valve and 
then 3 V to hold the open status (shown in Fig. 6b). The total time period including 
spike and hold status is defined as operational on time (OOT), which is the key 
parameter to control the droplet dispensing volume. Figure 6c shows the assembled 
micro-dispenser with a reservoir, a filter screen, a Teflon tubing, the micro- valve, 
and a nozzle. 

Figure 7 illustrates the proposed dispensing system from both front view and the 
isometric side view including an XYZ precision stage from Aerotech (http://www. 
aerotech.com/), a stage controller, two dispensing units and their drivers, a pneu- 
matic controller, and a curing controller. The two dispensing units are attached on 
the Z-axis stage with a mounting structure. The substrate plate is mounted on the X 
stage which rides on the Y stage. When the stage reaches the given position, the stage 
controller will output TTL trigger signals to activate the corresponding dispensing 
unit driver, and then the micro -dispenser will complete droplet ejections. The curing 
controller is to adjust heating temperature, thus accelerating the solidification of 
deposited droplets on the substrate, with limits within 60 °C. The pneumatic control- 
ler provides independent positive outlets for different dispensers. When the valve is 
open, this pressure pushes the liquid along the chamber towards the nozzle. With the 
proper pressure and OOT, the liquid can form a droplet at the nozzle tip. 
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Fig. 7 Experimental setup of the micro-dispensing system 
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Fig. 8 SEM photos for HA and 4-layer HA/Col coating 


Surface Morphology 

After fabrication process, the surface morphology of the coating and cross-sectional 
area is observed using scanning electron microscope (SEM). The accelerating 
voltage and beam current are set at 15 kV and 12 pA, respectively. Elemental 
analysis is obtained by energy-dispersive X-ray spectroscopy (EDS) coupled with 
the built-in EDS detector of SEM. The results demonstrate the elemental distribu- 
tion of a specific area in weight percentage. 

Figure 8a and b shows the surface morphology observation of pure hydroxyap- 
atite (HA) coating and 4-layer multilayer H A/collagen (Col) coating structure, 
respectively. This structure is mostly consisted of materials which possess the 
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Fig. 9 Elemental distribution of 4-layer HA/Col coating 


chemical affinity to the natural body tissue, such as hydroxyapatite (HA) and 
collagen. This composite coating is expected to induce a continuous transition 
from tissue to implant surface, hence promoting bone regeneration. As observed 
in Fig. 8, there is a significant difference on the surface morphology between the 
two types of coating. In Fig. 8a, the surface of pure HA coating looks relatively 
“rough.” For the HA/Col coating, the collagen is being physically absorbed by the 
HA layer, which dampens and smoothens out the topmost coating surface as 
Fig. 8b. This characteristic may tend to improve osteointegration with the bone as 
compared to a smoother surface (Nakashima et al. 1997). The 4-layer HA/Col 
structure was fabricated based on the concept of the multilayer coating structure. As 
shown in Fig. 9, the structure presents a uniform coverage by the dispensing 
materials on the titanium surface, and through the elemental distribution analysis 
by EDS, C and Ca/P appear at the desired positions according to the designed 
layered coating structures. 


DoD Cell Printing 

To date, different types of cells have been printed successfully, and their viability 
has been verified (Chen et al. 2006, 2008c; Xu et al. 2004, 2005, 2006; Barron 
et al. 2005; Nakamura et al. 2005; Saunders et al. 2008; Takahashi and Tomikawa 
2003). One of the comprehensive studies was carried out by Saunders et al. (2008), 
who used a commercial desktop printer to dispense human fibroblast cells, for the 
investigation of the relationship between cell survivability and the inkjet printing 
parameters. Their study supported previous claims (Xu et al. 2004, 2005, 2006; 
Nakamura et al. 2005) that cell survivability was not significantly affected by the 
printing process since cell survival rates only fell from 98 % to 94 % in their case, 
when the excitation pulse was increased from 40 to 80 V. However, in their study, 
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Fig. 10 Schematic showing the DoD setup for cell printing experiment 

the printing process was all carried out using a modified commercial printer, thus 
limiting their experiments to a fixed nozzle diameter (60 pm) and a small range of 
the drop velocities (lower than 1.0 m/s). These limitations may be of importance, 
because the shear stresses, which are expected to be the main factor in the killing of 
cells during the printing process, are proportional to the velocity gradients within 
the nozzle. 

Experimental Setup 

An experimental DoD cell printing system developed (Li 2010) will be used as an 
example to illustrate its feasibility. Printing experiments were carried out by using 
the self-developed squeeze mode piezoelectric inkjet printing system. The setup is 
comprised of a compressor, a pressure regulator, a reservoir, a piezo-actuated 
printhead, a piezo driver, an Arrisun-5 lamp, and a Photron FASTCAM SA1 
camera (high-speed video camera), as shown in Fig. 10. The present printhead 
design is a great improvement over conventional printheads, as it allows for the use 
of interchangeable nozzles, for the same piezoelectric transducer. The interchange- 
able nozzle design allows one to easily clean or change the clogged or damaged 
nozzle. 

The inkjet process is highly periodic. Fig. 11a shows the droplet formation 
process in time sequence. Drop velocity can be calculated by dividing the spacing 
between two droplets by their time difference. Fig. lib shows images of a few cells 
inside the nozzle. Liquid used was 1.0 % (w/v) aqueous solution of sodium alginate. 
Drop velocity is 0.74 m/s. 
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Fig. 1 1 Images taken by using a high-speed video camera: (a) formation of a 160 pm droplet from 
a 1 19 pm nozzle, taken at a frame rate of 8,000 fps, time between frames of 125 ps; (b) cell motion 
in the nozzle with diameter of 119 pm 


For the study of cell survival rates, L929 rat fibroblast cell suspensions were 
printed through orifices of three different diameters (119, 81, and 36 pm) onto well 
plates (Costars) which contained the live-dead assay solution. Preparation of the 
live-dead assay solution will be introduced later. The electric pulses which used to 
drive the piezoelectric transducer were in the range of 52-140 V. Each sample was 
printed for approximately 20 s with a driving frequency of 1.5 kHz for the 
printhead. Prior to the printing process, a 15 pi cell suspension was deposited into 
a well plate in the same environment as the printing system, to act as a control. 

Survivability Tests 

A LIVE/DEAD Viability/Cytotoxicity Kit (L3224, Molecular Probes, Invitrogen) 
was used to assess the survivability of the cells after the printing. The frozen vials 
containing the assay were thawed and centrifuged briefly before use. 20 pi of the 
supplied 2 mM EthD- 1 solution and 5 pi of the supplied 4 mM calcein AM solution 
were added into 10 ml of 1 x DMEM solution and mixed thoroughly, which gave an 
approximately 4 pM EthD-1 and 2 pM calcein AM working solution. Cells were 
directly dispensed into well plates which each contained 100 pi of the assay 
mixture, then incubated for 30 min. For each printing condition, cells were dis- 
pensed into five separate Petri dishes, to study the variation of survival rates. 
Controls were taken directly from the cell ink before printing and put on a set of 
separate Petri dishes, undergoing the same environment and procedure. The stained 
samples were then partly transferred onto microscope slides and observed under a 
fluorescence microscope. Six images were captured from each Petri dish for cell 
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Fig. 12 Mean cell survival rate with respect to excitation pulse amplitude for the samples printed 
through the 36 pm orifice. Error bars show the standard error from five replicates 
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Fig. 13 Graph showing the mean cell survival rate against excitation pulse amplitude. Each cell 
survival rate data was the average value from five replicates 

counting. Cells that remained alive after the printing were stained green, and the 
damaged cells were stained red. The numbers of alive and dead cells for each 
sample were tallied with respect to that of the control which was taken prior to the 
printing . 

Figure 12 shows the effects of excitation pulse amplitude on the mean cell 
survival rate, for the 36 pm nozzle with excitation pulse amplitude from 60 to 
130 V, at a frequency of 1.5 kHz and with rising and falling times of 3 ps. It is 
shown that the survival rate falls from 95 % to 78 % as the excitation pulse is 
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increased from 60 to 130 V, and the lowest survival rate of 76 % is observed when 
the highest voltage is approached. The excitation pulse amplitude represents the 
power for the piezoelectric actuator to dispense the droplets, and this power directly 
affects the droplet velocity and thereby the shear stress in the liquid. In Fig. 13, the 
mean cell survival rates against excitation pulse amplitude for all of the three 
different orifices are drawn together to compare the effects of different orifice 
sizes on cell survivability. Samples printed through orifices with the diameter of 
36, 81, and 119 pm, with excitation pulse amplitude from 52 to 140 V, at frequency 
of 1 .5 kHz, with rising and falling times of 3 ps. Comparing the three different trend 
lines in Fig. 13, we conclude that it is not the strength of the electric field which 
directly affects cell survival rate; rather it is the fluid shear stress. 


Summary 

Compared with the conventional electro spinning process, the E-jetting technique 
provides good controllability of fiber orientation in a precise manner, demonstrat- 
ing high feasibility of building complicated 3D structure with fine fibers for various 
applications in manufacturing of sensors, optical devices, and tissue engineering. 
The cell printing study has demonstrated that the piezoelectric DoD printing is able 
to successfully deliver L929 rat fibroblast cells through nozzles as small as 36 pm. 
There was no significant cell death when dispensing the cells through the 8 1 pm and 
the 119 pm nozzle, with the mean survival rates only reducing from 98 % to 85 %. 
This is in good agreement with the existing study reported (Saunders et al. 2008), in 
which a commercial printer was used to print human fibroblast cells. 
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Abstract 

This chapter of the book presents a forecast development of materials surface 
engineering over the nearest 20 years. The proposed methodological approach 
and the relevant selected results of the research carried out using neural networks 
and contextual matrices are presented. Contextual matrices were used for the 
graphical presentation of a strategic position of the critical materials surface 
engineering technologies. Critical technologies are such having best develop- 
ment prospects and/or key significance in industry. For the purpose of preparing 
forecasts and analyses, expert studies were carried out with the e-Delphix method 
using information technology. The probabilistic multivariant scenarios of future 
events concerning materials surface engineering were created based on the data 
acquired from experts according to the results of computer simulations made using 
artificial neural networks. The original data from the experts also served to 
perform further investigations into the importance of the new technologies, 
according to the new approach, and the correctness of such approach was verified 
by comparing the results of heuristic research with the results of classical materials 
science research for 35 diverse technology groups. The strategic positions of 
140 critical materials surface engineering technologies were determined by acting 
in consistency with the new approach and using own software and custom 
conceptual matrices. Hidden expert knowledge was thus converted, using the 
analytical tools and quantitative methods dedicated to this task, into a publicly 
available open knowledge. The relevant technologies were described and charac- 
terized by harmonized criteria using roadmaps and technology information sheets. 
The new approach described in this chapter, supported with extended information 
technology, is suitable for direct applications in other areas of knowledge while 
maintaining economically reasonable costs. 


Introduction 

A belief is underlying an interest in the field of the computer-aided prediction of 
development of materials surface engineering, substantiated with numerous practical 
examples that the fundamental, current civilizational objectives can be achieved by 
developing material engineering. The producers of goods satisfying human needs 
have at their disposal practically an unlimited number of state-of-the-art engineering 
materials and the related material process technologies. To satisfy customer demands, 
engineering materials have to be designed and applied that, when undergoing appro- 
priate engineering processes for formulating their geometric form and especially 
structure formation - ensuring the material’s appropriate physiochemical properties 
- will guarantee the expected functional uses of the products manufactured using 
them. It should be noted that the producer-client relations have substantially changed 
in the recent years as signified by a demand to deliver materials with the desired 
structure and physiochemical properties meeting functional requirements determined 
by a client’s expectations and usable functions of products (i.e., materials on 
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demand). The rules of materials producers’ market have been clearly replaced by the 
customer market despite the fact that just a few years ago market products had been 
only manufactured of engineering materials with their chemical composition, struc- 
ture, and properties, and even dimensions, absolutely imposed by manufacturers’ 
production timetable and by plans for a limited - by the nature of things - number of 
engineering materials. It is very common these days that product design, and, in 
consequence, the manufacture of functional products, is not connected with require- 
ments set for chemical composition, structure, and properties of material cores, or 
actually a product or its element, but for their surface and, in fact, their surface layer. 
It is not reasonably substantiated in engineering calculations and real requirements to 
ensure the expected properties uniformly across the whole section of a product. The 
most general aim of such measures, more and more often employed in many 
industries, is to achieve a structure in the zone around the surface similar to a 
composite, and this, as a result of scientific and technological experiments lasting 
for many decades, has led to the development of multiple technologies of surface 
layer structure formation, deposition of coatings, including those consisting of many 
or even several hundred layers, and also to the production of surface graded materials. 
The tailoring of properties of different items to operational requirements is therefore 
accomplished through selecting appropriately the core material and the technologies 
ensuring its properties (e.g., thermal or thermochemical treatment) and by choosing at 
the same time the surface layer treatment technology. Scientific institutions around 
the globe are expressing their ongoing and intensifying interest in this field as well. 

Foresight stands for overall actions aimed at selecting the most beneficial visions 
of future and at indicating the ways to implement the future, using the appropriate 
methods originating from organization and management science. Technology fore- 
sight consists of looking regularly, over a long-term prospective, into the future of 
science and technology, economy, and societies, in conjunction with an ability to 
select strategic technologies aimed at bringing substantial economic and social 
benefits, the specificity of which is illustrated by the technology foresight triangle 
(Fig. 1). Technology foresight is focussed on the priority innovative technologies, 
the implementation of which brings the highest added value, and, therefore, will 
contribute over a long term to a high statistical level of technologies implemented 
in industry. Multivariant strategies established under the technology foresight 
serving to describe alternative development paths and visions of future are to 
contribute in long term to sustainable development considering the fate and well- 
being of the future generations. The activities performed under the foresight 
research are concentrated around the externalization of knowledge based on the 
transformation of an implicit knowledge available to only experts, specializing in a 
given field, and into an explicit knowledge available to the widely understood 
public, which over a long period should lead to the strengthening of a knowledge- 
and innovation-based economy. 

E-foresight (electronic foresight) is a process of foresight investigations pursued 
to identify the priority, innovative technologies and directions of strategic devel- 
opment with reference to a particular thematic area using the Internet. The concept 
of e-foresight (Dobrzanska-Danikiewicz 2011a) associated with the well known 


2590 


LA. Dobrzanski and A.D. Dobrzanska-Danikiewicz 



/ Acquisition tacit 
knowledge from a wide 
expert group and its dissemination 
through public debate 


EXTERN ALISATION 


Knowledge- 
and innovation- 
based economy 


Sustainable 

development 


Fig. 1 Technology foresight triangle (Dobrzanski and Dobrzanska-Danikiewicz 2011) 


and commonly used notions of e-management, e-business, e-commerce, e-banking, 
e-logistics, e-services, e-administration, and e-education (Dobrzanska-Danikiewicz 
et al. 2010a) was developed during the practical performance of foresight investi- 
gations concerning materials surface engineering, as a result of scientific quests 
serving to harmonize, streamline, and modernize the process of the foresight studies 
pursued. The main driving force for implementing improvements was a broad scale 
of the studies planned for implementation. 

A full overview of the contemporary treatment technologies decisive for the 
formation of the structure and properties of engineering materials surface layers 
(Dobrzanska-Danikiewicz and Lukaszkowicz 2010) reveals that over 500 specific 
technologies of surface treatment and their numerous technological variants have 
been conceived to date. Usually costly manufacturing equipment and the necessary 
industrial infrastructure with the average depreciation time of approx twenty years 
have to be used each time when a relevant technology is selected. A manager’s 
decisions are becoming vital in the context of selecting correctly a technology, 
including relevant decisions on the investments to be made, and such decisions, in 
the longer run, are crucial for the success or defeat of the enterprise managed by 
such a manager. For this reason, it is crucial for long-term economic development 
to identify the priority, innovative technologies, and their desired development with 
pinpointing a product for which they should be applied and also to determine the 
development trends of such engineering material properties and structure formation 
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technologies and also the directions of scientific research in this area within the 
long-term timeframe of at least twenty years, and this is decisive for an economy’s 
competitiveness. Trial and error must not be used for making such important 
decisions, and this makes it necessary to apply in this area, optionally, a method 
of credible scientific research into the development prospects of science and 
technology. A new approach described in this chapter of the book is aided with a 
computer-aided methodological concept serving to handle this task. 


Presentation of a New Approach 

A complex methodological apparatus serving to diagnose the key scientific, techno- 
logical, economic, and ecological issues in the area of materials surface engineering 
and to identify the directions of its strategic development and decision-making pertains 
essentially to the three overlapping fields of knowledge: materials surface engineering 
forming part of material engineering, technology foresight being part of a widely 
understood field of organization, and management and information technology orig- 
inating from computer science (Fig. 2). The chapter of the book presents the forecast 
development of materials surface engineering over the nearest 20 years. The proposed 
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Fig. 2 Interdisciplinary methodology of the computer-integrated prediction of the development 
of materials surface engineering in relation to the fields of knowledge and research methodology 
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Fig. 3 General scope of materials science research 

methodological approach and the relevant selected results of the research carried out 
using neural networks and contextual matrices are presented. Contextual matrices 
were used for the graphical presentation of a strategic position of the critical materials 
surface engineering technologies. As part of the interdisciplinary studies pursued, a 
pool of methods was employed including originally matched methods already known 
in the literature concerning this domain, as well as completely new methods devel- 
oped, verified experimentally for their correctness, and implemented in order to solve 
specific scientific problems. The selected results of the research conducted are 
presented further in the chapter in subchapters due to a limited size of this chapter. 

The original reference data constituting a basis for the works performed at the 
further stages of the research comprises the outcomes of classical materials science 
experiments as well as the results of comprehensive expert studies. The overall scope 
of the materials science experiments performed is presented in Fig. 3, and details 
pertaining to this aspect are described comprehensively in the book (Dobrzanska- 
Danikiewicz et al. 2010b). In particular, the results of our own materials science and 
heuristic research are carried out for eight groups of specific technologies (SI to S8) 
(Dobrzanska-Danikiewicz et al. 2011a, b, c, d, e, 2012a, b; Dobrzanska-Danikiewicz 
and Drygala 2011; Dobrzanska-Danikiewicz 2010a, 2011b, c; 2012a; Fig. 4), and 
36 specific technologies (Table 1) were used for reviewing the correctness of the 
newly developed methodology of computer-integrated prediction of materials surface 
engineering development. After achieving the satisfactory results of reviewing the 
correctness of the original methodological concept, it was applied based on the results 
of expert studies to identify the strategic position of 140 groups of critical materials 
surface engineering technologies regarded as the priority technologies with the best 
development prospects and/or of key significance in industry over the assumed time 
horizon of 20 years. The results of the expert studies provided also original data used 
for creating the alternative scenarios of future events dependent upon the evolvement 
of the individual thematic areas and on the impact of key mezofactors, and artificial 
neural networks implemented in custom software were applied for this purpose. 

A limited group of methods featuring diverse potential applications has to be 
selected originally from an extensive range in order to put technology foresight into 
life. A diagram of such methods together with their interrelations and monitoring 
zones and data sources is shown in Fig. 5. Originally selected methods of 
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si 


Laser treatment of hot-work alloy tool steels 


S2 Laser treatment of casting magnesium alloys 


S3 Physical Vapour Deposition of coatin gs onto the brass substrate 


S4 Selected methods of steel thermochemical treatment 


S5 Physical /Chemical Vapour Deposition of coatings onto the sintered 
tool materials 


S6 Texturisation of polycrystalline silicon for photovoJtaccs 


cy Manufacturing of graded sintered materials by the classical powder 
metallurgy way 


S8 Selected methods of polymer surface layers modification 



Fig. 4 Specific technologies analyzed in order to review the correctness of the methodology of 
computer-integrated prediction of materials surface engineering development 

organization, work, and management had contributed to generating a set of critical 
technologies that were next subjected to expert studies pursued in line with the 
concept of e-foresight. The pool of critical technologies of materials surface 
engineering generated in the course of the works is listed in Table 3 later in the 
chapter. 

Electronic surveys embraced a group of nearly 400 experts from academic, 
industrial, and public administration circles who completed an overall of several 
hundred multi-question surveys (Hasan and Harris 2009) in three subsequent 
iterations of the studies. The scope of works pursued also included the creation of 
surveys, each time, in more than ten versions pertaining separately to each of the 
analyzed thematic areas together with an electronic online editing system. 

The survey studies were carried out with the e-Delphix method (Dobrzanska- 
Danikiewicz 2010a, b; Dobrzanska-Danikiewicz et al. 201 If) deriving the main 
idea of the survey of experts, consisting of several steps, from the classical 
Delphi method (Costanzo and Mackay 2009; Loveridge 2009; Miller et al. 2009; 
Georghiou et al. 2008), but differing largely from the method in terms of 
methodology and in terms of the accompanying expanded information technol- 
ogy encompassing a virtual organization, web platform, and artificial neural 
networks. A virtual organization represents a system comprised of multitask 
elements created on a voluntary basis, functioning dynamically and structured 
flexibly, oriented at particular goals, and coordinated by means of information 
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technology, allowing to collect, harmonize, select, disseminate, and manage the 
explicit and implicit knowledge in cyberspace. A virtual organization can be 
managed in cyberspace by creating, from the scratch, a web platform based on a 
custom concept of a computer system with a hierarchical structure. A separately 
developed individual algorithm had to be applied for each of the several dozens 
constituent modules of the platform, making up a complex multilevel structure 
of the platform. The last element of information technology applied in the works 
followed is represented by artificial neural networks developed using commer- 
cial software. An indispensable starting point for putting into life the IT goals of 
the work was to design a network, and the network was next enhanced with 
original software SCENNET21 and SCENNET48 for preparing alternative sce- 
narios of future events for materials surface engineering. The computer pro- 
grams created enabled to search a suboptimal solution randomly using the 
Monte Carlo method and to interpret and present the outcomes of the research 
followed in a graphical manner in diagrams. The approach presented, employing 
artificial neural networks for creating the alternative scenarios of events, is 
innovative and experimental and has not been described to date in the literature 
of the field. 

The original data assembled through the electronic surveys of experts was used 
in further research basing on the original methodological concept for: 

• Preparing alternative scenarios concerning the future of materials surface 
engineering 

• Identifying the strategic position of the relevant critical technologies using 
contextual matrices 

• Preparing technology roadmaps and technology information sheets 

The further works performed relate to analyzing a set of various factors classi- 
fied as: 

• Critical macrofactors of general nature existing individually and influencing 
other factors strongly 

• Mezofactors occurring in limited numbers and influencing other factors 
moderately 

• Specific microfactors occurring in great numbers, characterized by sensitivity to 
the influence of other factors. 

Three alternative scenarios of future events are considered at the macro-level: an 
optimistic, neutral, and pessimistic scenario created on the basis of the results of 
surveys made using an original computer system among several hundred experts. 
The results of the expert studies were implemented as input data into neural 
networks as a training, validation, and test set. Nine models of neural networks 
were created, of which 7 ones best meeting the set criteria were chosen to generate 
the final results of the research by implementing them as function into the computer 
system, enable to search, randomly, the solutions according to the Monte Carlo 
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method, and to generate the final result as graphical diagrams. The final result is a 
set of probability values that the relevant variants of events, which depend on the 
occurrence of specific conditions or special factors, occur. 

The mezo-level is grouping 16 key factors of the general nature influencing most 
greatly, in the surveyed experts’ opinion, the predicted development of materials 
surface engineering, presented in Fig. 6, and the 14 thematic areas analyzed were 
grouped into two research fields (Fig. 7). The research field M (Manufacturing) 
reflects a manufacturer’s point of view and encompasses production processes 
determined by the state of the art and a machine park’s manufacturing capacity, 
whereas the research field of P (Product) is conditioned by the expected functional 
and usable properties stemming from customer needs and is focused on the product 
and on the material it is made of. 

The microlevel is represented by 140 groups of critical technologies comprising 
10 groups of technologies selected for each of 14 thematic fields. The groups of 
critical technologies were selected from approx. 500 groups of specific technologies 
considered in the initial phase of the research according to the outcomes of studies 
including a state-of-the-art review, technological review, and a strategic analysis 
with integrated methods (STEEP, SWOT). Specific technologies can additionally 
be differentiated for the individual 140 groups of critical technologies, often 
differing in details only which can, however, substantially condition the develop- 
ment prospects of a particular technology and its applicability in the industrial 
practice. Some of the technologies, chosen in an arbitrary manner, were subjected 
to in-depth materials science and heuristic investigations for reviewing the correct- 
ness of the developed methodology, and the overall results were presented in the 
Dobrzanska-Danikiewicz et al. (2010b) book publication. A set of contextual 
matrices was created in order to determine the strategic position of the relevant 
groups of critical and specific technologies encompassing the dendrological matri- 
ces of technology value, the metrological matrices of environment influence, and a 
matrix of strategies for technologies. The matrices represent a tool of a graphical 
comparative analysis of the individual technologies or their groups, allowing for 
objectivized assessment and for determining the recommended action strategies 
with regard to the relevant technologies or their groups, and also to define the tracks 
of strategic development. The final outcome of the investigations carried out also 
appears in the Book of Critical Technologies, grouping a set of several hundred 
roadmaps and technology information sheets, representing a convenient tool for 
their comparative analysis according to the selected materials science, technolog- 
ical, or economic criterion. 

A single-pole, ten-degree positive scale without zero called a universal scale of 
relative states (Fig. 8) was used in the surveys made in the course of the research 
undertaken in order to assess the factors and phenomena, where 1 is a minimum rate 
or a compliance level with a given characteristic/phenomenon/factor/statement, 
while 10 is an extraordinarily high rate or a compliance level with a given 
characteristic/phenomenon/factor/statement. In the course of the research, the 
experts were also evaluating the lifecycle phases of the technologies analyzed or 
their groups, and a ten-degree scale compatible with the universal scale of relative 
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engineering 


states had to be established for maintaining consistency of considerations, and the 
ten-degree scale was used for an objectivized evaluation of the lifecycle of a given 
technology and group of technologies, where 1 signifies a declining technology and 
10 is a technology in its incipient phase. A process of developing a new technology 
is accompanied by expenditures for materials, for the construction of new devices, 
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Fig. 9 Technology lifecycle phases 


and for remuneration for the personnel performing research assignments, and the 
expenditures gradually grow, reaching their maximum at the stage of constructing 
and testing the prototype installations. In the case where newly developed solutions 
meet a manufacturer’s expectations, and one should be aware that many technol- 
ogies do not go beyond a prototype testing phase, then a phase of gradual imple- 
mentation into production takes place which allows the new technology to generate 
first profits, thus partially offsetting the costs incurred until a breakeven point is 
attained, i.e., a point where profits equal the expenditures made. A newly developed 
technology is next transiting to the growth phase, becoming more and more 
important for an enterprise’s general processes: first, serious profits are generated, 
but the costs incurred for its improvement and for ongoing modernization of the 
machine park usually involving its automation and robotization, for product cus- 
tomization, and for promotion continue to absorb large amounts. The proportions 
change over time as a technology entering the maturity phase generates higher and 
higher profits and costs decline, and this is the time long awaited by the manufac- 
turer called - according to the terminology used in management sciences - milking 
the cow or harvesting the crops (Thompson and Strickland 1987; Little 1981; Hofer 
1977; Glueck 1980; Henderson 1970). After a period of prosperity, profits from 
production using the technology begin to dwindle which usually mobilizes the 
company’s management to undertake rehabilitation, modernization, and improve- 
ment measures, accompanied by a promotional campaign in the media. The mea- 
sures are usually moderately effective over time, and after a temporary 
improvement, gradual degradation of the technology - being already in its base 
phase - begins, and then it becomes an obsolescent technology and, as a declining 
technology, finally exits the market. As regards the presented typical and most 
classical technology lifecycle shown in Fig. 9, deviations may occur in practice 
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usually as it relates to the duration of individual phases; untypical, sudden exclusion 
of the technology by other, more modem solutions; or, on contrary, its quite new 
applications may be found and partial duplication of its individual lifecycle phases 
may take place. 

Development Scenarios of Materials Surface Engineering 

The references (Lindgren and Bandhold 2009; Bradfield et al. 2005; van Notten 
et al. 2003; Heugens and van Oesterhout 2001; Martelli 2001) indicate that there is 
no one correct and generally accepted method of creating the scenarios of future 
events or a management algorithm recommended for implementation in the sce- 
nario creation process. In practice, the algorithm is created each time from scratch 
by those undertaking a specific investigation. The same refers to building the 
scenarios presenting the forecast future of materials surface engineering where a 
methodological challenge exists in combining skillfully the presentation and 
description of factors with their varied degree of generality and capturing the 
cause-and-effect relationships existing between them. In order to solve the so 
formulated research task, all the factors analyzed were split into the three groups: 
macro-, mezo-, and microfactors. 

Three alternative scenarios of materials surface engineering development were 
presented at the highest level of generality assuming the optimistic, neutral, and 
pessimistic progress of future events at a macro- and mezoscale. It was shown in 
particular how the development of surface engineering (macroscale) would influ- 
ence the development of the key mezofactors (Fig. 6) and thematic areas (Fig. 7). 


Macroscenarios 

The optimistic scenario race won assumes that the global downturn has been 
averted and economic growth is seen based on peaceful cooperation and interna- 
tional integration. The competitive position of the European Union is strengthening 
in the global economy. Numerous reforms are being successfully implemented in 
Poland having social approval, the purpose of which is the real transformation of 
economy supporting the sustainable development of a knowledge-based economy. 
Poland is skillfully combining endogenic growth factors with foreign investments 
and the effective use of EU funds. The consequence of the widespread actions 
planned is the gradual improvement of the society’s education, the wide-scale 
application of innovative and environmental friendly technologies in many thriving 
small- and medium-sized companies (SMEs) and large corporations operating more 
and more often in high-tech industries, effective use of Poland’s agricultural 
resources, and also the development of modern transportation and ICT infrastruc- 
ture. The available potential is used adequately to put into life the strategic 
development goals: people are, statistically, better off; social attitudes are optimis- 
tic; and prospects for the coming years bright. 
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In line with a neutral scenario called progress achieved, the world economic crisis 
has been prevented, and the world is slowly returning to the growth path in the 
paradigm of sustainable growth based on cooperation and international integration, 
although the fear of terrorism and local wars is still looming which, in unfavorable 
circumstances, may spread to many countries. The European Union needs to fight 
hard for its position among global economies, especially with regard to China and 
India emerging as world powers. There are efforts made in Poland, with different 
outcomes, to tackle reforms aimed at economy transformation, and the reforms are 
often opposed by the society and the people’s reluctance toward change. Poland is 
attempting to make use of EU funds, but not all the money is managed effectively. 
The introduction of a knowledge-based economy and sustainable development brings 
such results as the growing education level of the society and its environmental 
awareness. The SME sector is developing at a constant but slow rate, and the level of 
implementing the innovative and environmental technologies leaves still much to 
wish for. Large corporations operate mainly in medium-low and medium-high 
technologies. The country is constantly facing problems in public finance, agricul- 
ture, and healthcare, and a modern transportation and ICT infrastructure is developing 
steadily, but relatively slowly. The available potential is only partly used to achieve 
the strategic development goals, statistically people are slightly better off but social 
attitudes are mixed. Theoretically, quite good development prospects for the coming 
years depend primarily upon the circumstances in the European and world economy, 
wise management of public funds in long term, and on how quickly the relevant 
reforms supported with the society’s involvement are introduced. 

The pessimistic scenario inclined plane provides that the economic crisis has been 
slowed down to some degree only. The world is facing terrorism, growing oil prices, 
aftermaths of disasters, and local wars spreading to more and more countries. The 
European Union stays behind other global economies, especially China and India 
emerging as global powers. Usually unsuccessful attempts are made in Poland to 
tackle reforms serving to transform economy, and these are facing social disapproval 
and strong reluctance toward changes. The EU funds allocated to Poland dwindle 
year by year, and most of the money is used to save the current economy, but the level 
of investments is plummeting. The implementation of the knowledge-based economy 
and sustainable development concepts, initially boding well, is now weakening. The 
SME sector is developing sluggishly, and innovative environmental technologies 
cannot usually be applied due to the lack of investments and the low availability of 
credits. Large corporations operate in medium-low and medium-high technologies, 
and many of them go bankrupt and move their head offices to Asia. The country is 
constantly facing problems in public finance, agriculture, healthcare, education, and 
transportation infrastructure. The available potential to achieve the strategic devel- 
opment goals - which most apparently had been wrongly formulated - is 
underutilized: statistically people are worse off and this is accompanied by social 
unrests. Development prospects for the coming years are weak, and Poland will be 
heading for a disaster if a sudden breakthrough is not seen. 

Deductive reasoning was undertaken under the research conducted consisting in 
seeking the combinations of micro- and mezofactors that would contribute, with 
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Fig. 10 Implementation phases of neural networks in e-foresight research 


specific probability, to the occurrence of each of the three possible macroscenarios 
in the future. Using contextual matrices, strategic positions of specific technologies 
and groups of critical technologies were presented, corresponding to the microlevel, 
together with their predicted development - presented as statistics and/or technol- 
ogy strategic development tracks prepared based on the results of the expert studies. 
A methodology of IT studies based on artificial neural networks and original 
software SCENNET21 and SCENNET48 with artificial neural networks 
implemented as functions designed using commercial software Statistica 4. OF 
was applied for analyzing mutual interactions between events at a macro- and 
microscale. The results of the expert studies in the form of probability values of 
occurrence of the relevant analyzed phenomena, acquired in the electronic survey 
of experts with the e-Delphix method, were used as input variables divided into the 
following subsets: a training, validation, and test subset that were used for training 
neural networks. Output variables, on the other hand, are generated as different 
diagrams presenting relationships between the probability values of different 
macroscenarios subject to relevant trends of other factors of the analysis. This 
concept is illustrated with a diagram in Fig. 10. 


Macroscenarios and Development of Thematic Areas 

The simulation experiments performed with the SCENNET21 program, with arti- 
ficial neural networks implemented as functions, were to identify how the devel- 
opment of 14 individual thematic areas forming part of the research field M 
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reflecting the manufacturer’s point of view and of the field P corresponding to the 
approach of a consumer expecting a product with the required functional properties 
influences the occurrence - with specific probability - of each of the three alterna- 
tive scenarios. 

By applying artificial intelligence tools to solve a research issue proposed in 
such a way, a solution can be searched immediately in a group containing thousands 
of solutions, similar to an optimum solution satisfying the user. A neural network 
trained using the implemented reference data acquired through expert surveys 
generates the multivariant forecasts of future events. 

There are 3 relevant development trends of the thematic areas analyzed, i.e., a 
growth trend, a trend stabilized at the current level, and a falling trend, taken into 
consideration. It is possible to identify the occurrence probability of the relevant 
development trends of the thematic areas analyzed with the one defined at the 
beginning by each user probability value wherein each of the three alternative 
macroscenarios of future events occurs. The user, while performing a computer 
simulation, may set a particular numerical value of a chosen macroscenario’s 
probability and also search a solution for its extreme values, i.e., a maximum or 
minimum value. 

The results are shown in Fig. 11, of examples of three simulation experiments 
carried out with reference to the research field M as charts generated with the 
SCENNET21 program, concerning a pessimistic scenario with the probability 
value set, respectively, as 10 % (Fig. 11a), 20 % (Fig. lib), and 30 % (Fig. lie). 
Percentages of probability that a growth trend, a trend stabilized at the current level, 
and a falling trend occurs are provided on the axis of abscissa. 

The relevant, analyzed thematic areas are provided on the axis of coordinates, 
i.e., laser technologies in surface engineering (Ml), PVD technologies (M2), CVD 
technologies (M3), thermochemical technologies (M4), technologies of polymeric 
surface layers (M5), technologies of nano structural surface layers (M6), and other 
surface engineering technologies (M7), respectively. 

The results of computer simulations, performed using neural networks, indi- 
cate a predicted leading role of the development of the nano structural surface 
layer technologies (M6) and laser technologies (Ml) against the entire research 
field M. For the areas of M6 and Ml, the probability of a growth trend in all the 
analyzed cases maintains on the highest level of the analyzed cases and is slightly 
decreasing as the probability of a pessimistic variant of future events grows. Note 
also that a probability of a falling trend for such thematic areas is zero, meaning 
that such areas cannot be degraded. In accordance with the results of the simula- 
tions, the degradation is not possible, in either of the PVD technology (M2) or of 
polymer technologies of surface layers (M5) for which the probability value of a 
growth trend in all the three analyzed cases is similar and is maximum approx. 
70-73 % for both thematic areas each time. A relationship between the probability 
value of a growth trend for other surface engineering technologies (M7) and the 
probability value of a pessimistic scenario at a macroscale is directly propor- 
tional, meaning that the worse is the general situation, the faster is the progress of 
this technology group against the research field, i.e., it is recommended that 


Thematic area O Thematic area QT Thematic area 
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Fig. 1 1 The results of simulations presenting probability values of individual trends of thematic 
areas of the research held M if a pessimistic scenario occurs with the probability of (a) 10 %, 
(b) 20 %, and (c) 30 % 
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instead of it, more promising technologies develop more intensively, i.e., M6, 
Ml, M2, and M5. The probability value of a growth trend for the CVD technology 
(M3) maintains at the maximum level of, respectively, 64 %, 60 %, and 57 % for a 
pessimistic macroscenario occurring with, respectively, 10 %, 20 %, and 30 % 
probability. This inversely proportional dependency coupled with the values of a 
stabilized and falling trend indicates that the existing dynamics of slight growth is 
maintained in the area’s importance. The development of classical thermochem- 
ical treatment technologies is most predictable. As the probability of a pessimistic 
scenario rises (from 52 % to 69 %), the probability of the M4 trend stabilized at 
the existing level is growing substantially. Both these values, probability values 
of a rising and falling trend of such technologies, reflect the actual situation and 
future strategic position of these technologies with respect to the research field 
M. Thermochemical treatment, although not an avant-garde or extremely devel- 
opmental treatment nor an extremely developmental technology, considering the 
economic calculus and widespread applications, has an important role in the 
current economy, and forecasts show this status quo will maintain for the next 
20 years. 

As regards the research field P, chosen for presentation have been the examples 
of results of simulation experiments concerning a dependency between develop- 
ment trends of the relevant thematic areas and an optimistic macroscenario of future 
events having the occurrence probability of 10 % (Fig. 12a), 20 % (Fig. 12b), and 
30 % (Fig. 12c). The occurrence probability values of the relevant development 
trends (a growth trend, a stabilized trend, and a falling trend) of the individual 
thematic areas were applied in per cents on the axis of abscissa, similar as for the 
research field M. The following thematic areas subjected to simulation investiga- 
tions are provided on the axis of coordinates, respectively: biomaterials surface 
engineering (PI); metallic structural materials surface engineering (P2); nonmetal- 
lic structural materials surface engineering (P3); tool materials surface engineering 
(P4); steel surface engineering for the automotive industry (P5); surface engineer- 
ing of glass, micro-, and optoelectronic elements and photovoltaic elements (P6); 
and polymer materials surface engineering (P7). 

The results of the simulation experiments performed reveal that certain and 
rapid development of biomaterials surface engineering (PI), functional materials 
(P6), and tool materials (P4) as signified takes place with 30 % probability. The 
probability of a falling trend of the areas PI, P6, and P4 was determined as zero, 
similar as for nonmetallic structural materials surface engineering (P3) and poly- 
meric materials (P7). As regards nonmetallic structural materials surface engi- 
neering (P3), an inversely proportional dependency is observed between the 
probability value of a growth trend of such technologies and a probability value 
of an optimistic macroscenario. This means that in most optimistic variant of 
events, the development of the most promising areas, i.e., PI, P6, and P4, is more 
important than the development of the area P3. As it is not possible that the 
importance of the thematic area P7 will decrease and as there is no clear regularity 
in changes of the growth and stabilized trend, depending on the probability value 
of an optimistic macroscenario, this signifies that the predicted dynamics of 
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Fig. 12 The results of simulations presenting the probability values of individual trends of 
thematic areas of the research held P if an optimistic scenario occurs with the probability of 
(a) 10 %, (b) 20 %, and (c) 30 % 
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changes in such area is stabilized at the existing, good level. A directly propor- 
tional dependency between the probability value of a growth trend and the 
probability value of an optimistic macroscenario of future events can be observed 
also with reference to steel surface engineering for the automotive industry (P5). 
This shows that this area will develop in the future in connection with predictable 
further development of the car industry. Despite the fact that traditional steel is 
more and more often replaced by other materials, e.g., light metal alloys (Mg, Al) 
or polymeric materials, the position of steel in the automotive industry is basically 
unthreatened in particular owing to the rapid development of new grades of steel. 
High-manganese austenitic steels of the TRIP type (transformation-induced plas- 
ticity) should be mentioned when discussing such steels, characterized by a 
unique combination of strength and plastic properties where a martensite TWIP 
(twinning-induced plasticity) transformation is induced in a cold plastic defor- 
mation, characterized by intensive mechanical twinning during plastic deforma- 
tion, and of the TRIPLEX type characterized by a three-phase structure: an 
austenitic and ferrous structure with dispersive carbides. There is no clear regu- 
larity observed of changes in the growth and stabilized trend with regard to 
metallic structural materials surface engineering (P2) and approximate probabil- 
ity values of a falling trend for individual probability values of an optimistic 
macroscenario. This signifies the predicted, stabilized dynamics of changes in the 
area at the existing level. The manufacturing technologies of metallic structural 
materials (P2), which are not avant-garde or extremely developmental, have their 
certain place ensured among the important materials surface engineering tech- 
nologies as they are so widespread in industry and as they often cannot be 
replaced by other solutions at economically reasonable costs. 

The selected results presented concerning a pessimistic scenario for the research 
field M and a pessimistic scenario for the research field P, being only an example of 
much broader simulation investigations, allow to answer the question of how 
growth, stabilization, or decline in the importance of the analyzed thematic areas 
influences the occurrence, with particular probability, of each of the alternative 
scenarios in the nearest 20 years, i.e., an optimistic, neutral, and pessimistic 
scenario. It should, therefore, be underlined that adequately trained neural networks 
are a useful tool allowing to generate quickly the alternative forecasts variants of 
future events. 

Macroscenarios and the Importance of Mezofactors 

In order to identify the impact of the key mezofactors selected earlier in the work 
conditioning the development of materials surface engineering on the probability 
value of the occurrence of alternative macroscenarios of future events, the original 
SCENNET48 software was used with an artificial neural network attached as a 
function (trained earlier using reference data implemented being an outcome of 
expert studies). A full list of 16 mezofactors analyzed marked with the alphanu- 
meric symbols of C1-C16 is shown in Fig. 6. 
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Three groups of experiments were made during simulation investigations, and 
their selected results were chosen for presentation in this paper by using the 
following assumptions: 

• The classifier calculates for each output vector a probability value of the 
occurrence of an optimistic, neutral, or pessimistic scenario. 

• Each independent variable may assume one the of three values from the set 
{1,0, — 1 } with the value 1 corresponding to a growth trend, 0 to a trend stabilized 
at the existing level (neutral level), and — 1 to a falling trend of a given mezofactor. 

• During each of the experiments, the individual independent variables may 

assume only two chosen values from the defined set of permitted values 

{1, 0, —1}; therefore, the number of possible cases is limited from 3 16 , i.e., 

43,046,721, which corresponds to the number of all combinations of 

16 mezofactors considered at the same time that can assume each of the 

1 6 

3 possible trends, to 2 , i.e., 65,536, which takes place when only 2 trends are 
taken into account. 

The aim of the first group of experiments was to select the mezofactors influenc- 
ing the change of a macroscenario from an optimistic one to a neutral one together 
with determining a probability value of such an event. Independent variables may 
assume a value from the set { 1 , — 1 } , which corresponds to a change in the trend of 
the examined mezofactors from the growing to the falling trend. Cases were 
searched, in line with the assumption used, where the number of mezofactors 
exhibiting a change in the trend from the growth trend to the falling trend is as 
small as possible. Fifteen different combinations were found as a result of the 
experiments conducted where 2 mezofactors changed their value at the same time 
from 1 to —1 and 194 such combinations with 3 mezofactors. According to the 
results of the experiments carried out, three 2-element combinations in the change 
of trends from growing to falling trends will contribute to, with a hundred percent 
probability, the change of a macroscenario from an optimistic one to a neutral one. 
Such an effect is caused by simultaneous changes into worse of the factor C9 - 
striving for constant improvement by ensuring the higher quality of technology and 
multiple implementations, especially in small- and medium- sized enterprises in 
combination with the following mezofactors: Cl, the effectiveness of the state’s 
actions aimed at broad access to information concerning the key technologies and 
results of technology foresights; C2, transparency and friendliness of legal regula- 
tions, or C3, strategic priorities of the united Europe identified with the 
community’s level of cooperation and the amount of funds granted. 

The aim of the second group of experiments was to identify the mezofactors 
influencing the change of a macroscenario from a neutral one to an optimistic one 
together with determining a probability value of such an event. Independent vari- 
ables may assume a value from the set {0, 1 } , which corresponds to a change in the 
trend of the examined mezofactors from a neutral to a growth trend. An assumption 
was taken again that a minimum number of the changing trends is sought for the 
relevant mezofactors. Sixteen different combinations were found as a result of the 
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experiments conducted where 4 mezofactors changed their value at the same time 
from 0 to 1 and 152 such combinations with 5 mezofactors. An analysis of the 
results of the simulation experiments shows the highest probability (94.9 %) that a 
macroscenario changes from a neutral to an optimistic one in the case where a 
change in trends occurs at the same time from a neutral to a growth trend for the 
following mezofactors: C2, transparency and friendliness of legal regulations; C3, 
strategic priorities of the united Europe identified with the community’s level of 
cooperation and the amount of funds granted; C9, striving for constant improve- 
ment by ensuring the higher quality of technology and multiple implementations, 
especially in small- and medium-sized enterprises; and Cll, the importance of 
environment-friendly technologies for improving functionality, extending life and 
recyclability. 

The purpose of the last group of experiments was to identify the mezofactors 
influencing the change of a macro scenario from a neutral one to a pessimistic one 
together with determining a probability value of the occurrence of such an event. 
Independent variables for such experiments may assume a value from the set {0, 
— 1 }, which corresponds to a change in the trend of the examined mezofactors from a 
neutral to a falling trend. An assumption was taken again as for other experiments that 
a minimum number of changing trends is sought for the relevant mezofactors. Five 
different combinations were determined as a result of the experiments conducted 
where 2 mezofactors changed their value at the same time from 0 to — 1 (Table 2) and 
63 such combinations with 3 mezofactors. According to the results of the simulation 
experiments, it is most probable (94. 1 %) that a macroscenario changes from a neutral 
to a pessimistic one while trends change at the same time from a neutral to a 
pessimistic one for the following mezofactors: Cll, the importance of 
environment-friendly technologies for improving functionality, extending life and 
recyclability, and C13, the importance of technologies enhancing the mechanical, 
tribological, and anticorrosive properties of surface layers. 

The presented results of the simulation experiments providing the representative 
examples of much broader research reveal that it is reasonable to employ neural 
networks for analyzing interactions between events when creating multivariant 
probabilistic scenarios of future events. By using this artificial intelligence tool, a 
dependency can be determined between the occurrence, with a specific probability, 
of each of the considered alternative macroscenarios and the variants of changing 
the individual thematic areas or mezofactors while taking into account that, within 
the defined time horizon, their importance may increase, remain stable, or decrease. 
All the difficulties encountered by the pioneers stemmed from the experimental and 
innovative idea of implementing neural networks for creating future event scenar- 
ios. The chief challenge was the specificity of input data in form of expert opinions 
expressed quantitatively (in percents). The phenomena were assessed differently as 
such an assessment tends to be subjective, which is typical for expert investigations. 
In particular, the experts, most likely unintentionally, on one hand were making an 
attempt to represent the interests of their own circles and on the other hand were 
frequently viewing those phenomena having a high level of generality (macro- and 
mezoscale) through the prospective of their own, much narrower, specialization. 
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Table 2 Minimum combination of mezofactors with the strongest impact on changing a 
macroscenario from a neutral to a pessimistic one 



Strategic Position of the Critical Materials Surface Engineering 
Technologies 

The determination of the strategic position of critical materials surface engineering 
technologies was preceded by the elaboration of a custom computer-integrated 
methodology for converting the hidden implicit knowledge, which by the nature 
of things is difficult to measure, into explicit knowledge available to the public 
(Fig. 13), expressed quantitatively using engineering analytical tools. The input 
date was acquired, in consistency with e-foresight, by the electronic survey of 
experts with the e-Delphix method using IT technologies. 


Contextual Matrices 

The positions of relevant technologies vis-a-vis other technologies were presented 
graphically using contextual matrices. A dendrological matrix of technology 
value was applied in order to determine the objectivized values of the individual, 
separate technologies, or their groups, and a meteorological matrix of 
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Fig. 13 Knowledge 
availability matrix 
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environment influence was used to determine the intensity of positive and nega- 
tive influence of the micro- and macroenvironment on the specific technologies. A 
methodological structure of both matrices refers to portfolio methods commonly 
known in management sciences (Thompson and Strickland 1987; Little 1981; 
Hofer 1977; Glueck 1980) serving to characterize the portfolio of products 
offered to a client by an enterprise, allowing for a graphical presentation of results 
of a comparative analysis according to two criteria/factors placed, respectively, 
on the horizontal and vertical axis of the matrix. The most renowned of such 
matrix, Boston Consulting Group (BCG) (Henderson 1970), enjoys its unique 
popularity due to references to simple associations and intuitive reasoning, which 
became an inspiration when elaborating methodological assumptions for the 
dendrological and meteorological matrix. To evaluate individual technologies 
for their value and environment influence intensity, a single-pole positive scale 
without zero was used, called a universal scale of relative states, where 1 is a 
minimum rate and 10 is an extraordinarily high rate, while lifecycle phases were 
determined in line with a compatible 10-point evaluation scale of technology 
lifecycle phase. 

A four-field dendrological matrix of technology value (Fig. 14) contains expert 
assessments for the relevant technologies according to the potential being the 
realistic objective value of the specific technology group and according to attrac- 
tiveness reflecting the subjective perception of the relevant technology group by its 
potential users. Depending on the potential value and attractiveness level deter- 
mined in an expert assessment, each of the analyzed technologies is placed into one 
of the matrix quarters. The wide -stretching oak is the most promising quarter 
guaranteeing the future success, and the technologies placed in this quarter are 
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Fig. 14 Dendrological matrix of technology value, presentation of approach (Dobrzanska- 
Danikiewicz 2010b, 2012b) 


characterized by a high potential and high attractiveness. The soaring cypress 
characterizes new, very attractive technologies with a limited potential, and the 
rooted dwarf mountain pine symbolizes stable, proven technologies with a high 
potential and limited attractiveness likely to ensure a strong position for a technol- 
ogy if an adequate strategy is employed. The least promising technologies are 
placed in a quarter called quaking aspen with their future success having small 
probability or being impossible. 

A four-field matrix of environment influence (Fig. 15) presents, in a graphical 
manner, the results of how the external positive (opportunities) and negative 
(difficulties) factors impact the technologies analyzed. Each of the technologies 
evaluated by the experts is placed into one of the matrix quarters. Sunny spring 
illustrates the most favorable external situation ensuring the future success. Rainy 
autumn, providing a chance for steady progress, corresponds to a neutral environ- 
ment and hot summer to a stormy environment where the success of a technology is 
risky but feasible. Frosty winter communicates that technology development is 
difficult or unachievable. 
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Fig. 15 Meteorological matrix of technology value, presentation of approach (Dobrzanska- 
Danikiewicz 2010b, 2012b) 


The results of the expert studies presented graphically with the dendrological 
matrix of technologies value and the meteorological matrix of environment influ- 
ence were entered into the matrix of strategies for technologies (Dobrzanska- 
Danikiewicz 2010b). The matrix consists of 16 fields corresponding to relevant 
variants, resulting from the set of combinations of the four types of technologies 
with four types of surrounding. The matrix of strategies for technologies (Fig. 16) 
presents graphically the position of a technology according to its value and to 
intensity of environment influence expressed with a universal scale of relative states 
consisting of ten degrees and indicates an action strategy that should be adopted for 
a given technology. To be able to transfer relevant numerical values from the 4-field 
dendrological and meteorological matrices to the 16-field matrix of strategies for 
technologies, mathematical dependencies were formulated, allowing to rescale and 
objectivize the research results, and a computer program was developed on their 
basis to enable the fast calculation of the values sought and to generate graphically 
the matrix of strategies for technologies. 
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Fig. 16 General form of the matrix of strategies for technologies (Dobrzanska-Danikiewicz 
et al. 201 le) 


The Aggregate Matrix of Strategies for Critical Technologies 
of Materials Surface Engineering and Interpretation of Research 
Results 

The critical technologies of materials surface engineering are the priority technol- 
ogies with the best development outlooks and/or of key significance for industry 
within the analyzed time horizon of 20 years. All the critical technologies were 
classified according to two research fields: M (Manufacturing) and P (Product). 
Seven thematic fields were distinguished between each of the research fields, 
M1-M7 and P1-P7, respectively, each containing 10 groups of technologies, giving 
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the pool of 140 groups of critical materials surface engineering technologies 
(Table 3). The pool was generated based on the results of a state-of-the-art analysis 
including its assessment according to the review of references, technological 
review, and a strategic analysis with integrated methods (STEEP, SWOT). It 
needs to be explained that there are cases where a given group of technologies 
occurs more than once in the pool of critical technologies, which is not an omission 
or an error but a deliberate action. It is possible that a given group of technologies is 
significant for several thematic areas at the same time, while this significance 
against other technologies of different areas may be either similar, the same, or 
quite different. 

In order to determine the strategic position of the relevant groups of critical 
technologies and to formulate action strategies recommended for use for such 
technologies, a newly developed methodology of the computer-integrated predic- 
tion of materials surface engineering development was employed with its correct- 
ness verified positively earlier by using the results of classical material science 
research as a reference point (Dobrzanska-Danikiewicz et al. 2010b; Hasan and 
Harris 2009). 

The results of the electronic survey of experts, the specialists representing 
particular thematic fields, are carried out according to the idea of technology 
e-foresight using the e-Delphix method and the accompanying information tech- 
nology, enabling to conduct research in virtual reality, represented the original 
reference data, expressed quantitatively using the universal scale of relative states 
consisting of ten degrees (1, minimum; 10, maximum) used for preparing aggregate 
contextual matrices concerning all the analyzed groups of critical materials surface 
engineering technologies. The values of individual technologies were evaluated in 
particular according to their potential and attractiveness, and the results of such 
evaluations were entered into the summary dendrological matrix of technology 
value (Hasan and Harris 2009). The results of the positive and negative evaluation 
of the influence of external factors were entered into the summary metrological 
matrix of environment influence by Hasan and Harris (2009). A summary matrix of 
strategies for technologies was next generated using a dedicated computer program. 
As it is necessary to ensure that a figure of the matrix of strategies for technologies 
is adequately clear, for the purpose of matrix presentation it was divided into 
quarters with their layout and numbering shown in Fig. 17. The individual quarters 
of the matrix together with the provided strategic position of individual critical 
technologies of materials surface engineering are illustrated in Figs. 18, 19, 20, and 
21. The strategic development prospects of relevant technologies expressed quan- 
titatively with the universal scale of relative states were marked with circles entered 
into the relevant quarters of the matrix, and this allows to make a quantitative 
comparative analysis of the individual groups of critical technologies subjected to 
heuristic studies. 

Considering a limited size of this publication, the data provided in the matrix of 
strategies for critical technologies of materials surface engineering divided into 
quarters (Figs. 18, 19, 20, and 21) is interpreted with an abbreviated description 
pertaining to the most promising of the analyzed critical technologies. 
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Table 3 Groups of critical materials surface engineering technologies subjected to heuristic 
studies in line with a newly developed methodology of the computer-integrated prediction of 
development 


Research area manufacturing (M) 

Research area product (P) 

Ml 

PI 

Am\ 

Laser heat treatment 

Api 

Immobilization 

Bmi 

Laser remelting 

Bp\ 

Self-organization monolayers 
deposition 

Cmi 

Laser alloying/cladding 

Cp\ 

Patterning 

B>mi 

Laser cladding 

Dpi 

The sol-gel method 

Bmi 

Laser additive manufacturing 

Bp\ 

Infiltration 

Bmi 

Laser chemical vapor deposition 
(LCVD) 

B p\ 

Electrophoretic and sediment 
deposition 

Gmi 

Laser-assisted physical vapor 
deposition (LAPVD) 

G Pl 

Consolidation 

Bmi 

Laser treatment of functional 
materials 

H P i 

Physical vapor deposition/chemical 
vapor deposition (PVD/CVD) 

Imi 

Pulsed laser deposition (PLD) 

~Bp\ 

Pulsed laser deposition (PLD) 

J Ml 

Laser treatment of biomaterials 

Jpi 

Deposition of diamond layers and 
diamond-like coatings (DLC) 

M2 

P2 

Am2 

Cathodic arc deposition (CAD) 

Ap2 

Painting 

Bm2 

Reactive magnetron sputtering (RMS) 

Bp2 

Galvanic technologies 

Cm2 

Pulse plasma method (PPM) 

Gp2 

Heat and thermochemical technologies 

B>M2 

Ion beam-assisted deposition (IB AD) 

B>P2 

Thermal spraying 

Bm2 

Hot hollow cathode deposition 
(HHCD) 

Bp2 

Detonation spraying by laser/electron 
beam/explosion 

B M2 

Electron beam physical vapor 
deposition (EB-PVD) 

Bp2 

Laser melting/alloying 

G m2 

Bias-activated reactive evaporation 
(BARE) 

Gp2 

Electron beam melting/alloying 

BI M2 

Ionized cluster beam (ICB) 

H p2 

Ion implantation 

I M2 

Thermionic arc evaporation (TAE) 

Ip2 

Ceramics/cermetals deposition 

J M2 

Pulsed laser deposition (PLD) 

Jp2 

Physical vapor deposition/chemical 
vapor deposition (PVD/CVD) 

M3 

P3 

A-M3 

Hot filament chemical vapor 
deposition (HFCVD) 

Ap3 

Physical vapor deposition (PVD) 

Bm3 

Atmospheric pressure chemical vapor 
deposition (APCVD) 

Bp3 

Chemical vapor deposition (CVD) 

G m3 

Low-pressure chemical vapor 
deposition (LPCVD) 

Cp3 

Galvanic technologies 

B>M3 

Plasma-assisted chemical vapor 
deposition/plasma-enhanced chemical 
vapor deposition (PACVD/PECVD) 

Dp3 

Vacuum metallization 

Bm3 

Laser chemical vapor deposition 
(LCVD) 

Bp3 

Ion implantation 


0 continued ) 
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Table 3 (continued) 


Research area manufacturing (M) 

Research area product (P) 

F M3 

Photochemical vapor deposition 
(photo CVD) 

F P3 

The sol-gel method 

Gm3 

Metal organic chemical vapor 
deposition (MOCVD) 

Gp3 

Thermal spraying 

Hm3 

Fluidized-bed chemical vapor 
deposition (fluidized-bed CVD) 

Hp3 

Painting 

l M3 

Chemical vapor infiltration (CVI) 

Ip3 

Electrophoretic deposition (EPD) 

J M3 

Atomic layer deposition (ALD) 

Jp3 

Pulse laser deposition (PLD) or 
deposition using laser plasma extreme 
ultraviolet (EUV) 

M4 

P4 

Ama 

Plasma nitriding 

A P a 

Physical vapor deposition (PVD) 

Bma 

Low-pressure nitriding 

BpA 

Chemical vapor deposition (CVD) 

Cma 

Gas nitriding 

CpA 

Spraying 

Dma 

Complex treatment with nitriding 

Dpa 

Powder metallurgy (chemical and/or 
phase composition changes in surface 
layers) 

Em4 

Gas carburizing and high-temperature 
carbonitriding 

Epa 

Nitriding and complex treatment with 
nitriding 

F ma 

Plasma and low-pressure carburizing 

F PA 

Laser alloying/cladding 

Gma 

Aluminizing 

Gp A 

Cladding 

FIma 

Boriding 

Hpa 

Graded coating deposition 

I M4 

Passivation 

IpA 

Hybrid technologies 

Jma 

Hybrid technologies 

J PA 

Pulsed laser deposition (PLD) 

MS 

PS 

^M5 

Traditional painting techniques and 
immersion deposition 

Ap5 

Hot dip zincing (in pure Zn and in 
Zn-Al alloys) 

Bm5 

Hydrodynamic spray 

Bps 

Hot dip zincing and annealing (Zn-Fe 
coating) 

Gm5 

Powder painting 

Gps 

Hot dip alumination (in pure A1 and in 
Al-Si alloys) 

F>M5 

Electrophoretic deposition (EPD) 

Dps 

Galvanic technologies 

Fm5 

Electrostatic fluidized-bed deposition 

Fps 

Spray metallization 

F M5 

Graded coating deposition 

Fps 

Thermal spraying 

Gm5 

Deposition of coatings with 
nanofillers 

Gps 

Ground polymer coating deposition 

FIm5 

Shape-memory coating deposition 

Flps 

Painting and lacquering using liquid 
polymer materials 

I M5 

Deposition of coating self-stratifying 
on polymer surface layers 

Ip5 

Powder polymer coating deposition 

J M5 

Biocompatible coating deposition 

J PS 

Deposition of coatings from polymer 
foils 

M6 

P6 

Am6 

Reactive ion etching (RIE) 

Ap6 

Chemical vapor deposition (CVD) 

Bm6 

Electron beam lithography (EBL) 

Bp6 

Physical vapor deposition (PVD) 


(< continued ) 
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Table 3 (continued) 


Research area manufacturing (M) 

Research area product (P) 

L/V/6 

Chemical vapor deposition (CVD) of 
nanometric surface layers 

Gp6 

Pyrolysis and its variants 

Dm6 

Ion beam-assisted deposition (IB AD) 

Dp6 

The sol-gel method 

Em6 

Electron beam physical vapor 
deposition (EB-PVD) of nanometric 
surface layers 

Ep6 

Organic-inorganic hybrid coatings 
obtaining 

Fm6 

Atomic layer deposition (ALD) 

Fp6 

Evaporation 

Gm6 

Electrodeposition of nanometric 
surface layers 

Gp 6 

Chemical methods/alkalis leaching 
from surface layers and remaining 
Si0 2 concentration 

Hm6 

The sol-gel method of nanometric 
surface layers obtaining 

H p6 

Reactive ion etching (RIE) 

I M6 

Deposition of coatings with 
nanomaterials on surface layers 

Ip6 

Mechanical texturization using 
diamond edge 

J M6 

Surface treatment of nanomaterials 

Jp6 

Laser texturization 

Ml 

PI 

Ami 

Galvanic coating deposition 

A P7 

Metallization 

Bm7 

Thermal sprayed coating deposition 

Bpi 

Corona treatment 

C mi 

Deposition of coatings formed in low 
pressure from powders and sintering 

Cpi 

Plasma treatment of polymer surface 
layers 

E>mi 

Immersion metallized coating 
deposition 

Dpi 

Laser treatment of polymer surface 
layers 

Emi 

Ceramics/cermetals deposition 

Epi 

Ultraviolet (UV) radiation treatment of 
polymer surface layers 

Emi 

Casting and infiltration surface layers 
manufacturing 

F pi 

Gamma radiation treatment of polymer 
surface layers 

Gmi 

Coating cladding 

Gp 7 

Electron beam treatment of polymer 
surface layers 

Emi 

Burnishing, ball burnishing 

Hpi 

Obtaining of graded coatings on 
polymer surface layers 

l Ml 

Plating 

Ipi 

Obtaining of self-organized coatings 
on polymer surface layers 

J Ml 

Detonation spraying 

J pi 

In situ polymerization 


o o 

Laser alloying/cladding C M1 (7.0, 8.4) and pulsed laser deposition I M1 (4.6, 8.4) 
have the best development prospects from among the critical technologies belong- 
ing to the area of the thematic field Ml: Laser technologies in surface engineering, 
and they were evaluated highly (8 points). The technology group C M1 was placed in 
the dwarf mountain pine in spring field signifying its high potential and limited 
attractiveness; hence, the actions recommended for the group include the following: 
to render the technology more attractive and more modern, to automate it, to 
computerize it, and to promote it using strong market conditions. Pulsed laser 
deposition I M1 was placed in the cypress in spring field, meaning it is highly 
attractive with a limited potential that needs to be strengthened through further 



73 Foresight of the Surface Technology in Manufacturing 


2623 


Fig. 17 Layout and 
numbering of quarters of 
strategies for technologies 
prepared for critical 
technologies of materials 
surface engineering 



research efforts, improvement, and additional investments in strong market 
conditions. 

The results of heuristic studies point out that reactive magnetron sputtering 
(RMS) Bm 2 (8.6, 9.0) and cathodic arc deposition (CAD) Am 2 (8.5, 8.7) have the 
best prospects of strategic development for physical vapor deposition (PVD) 
technologies, referred to as very high (9 points), and these should be developed, 
enhanced, and implemented in industrial practice for achieving a spectacular 

o 

success. An attractive group of technologies with limited potential J M2 (4.6, 8.4), 
i.e., pulsed laser deposition (PLD), should be investigated, improved, and invested 
more using the robust economic market circumstances. In relation to an attractive, 
stable technology set in a foreseeable environment, consisting of electron beam 

c 

physical vapor deposition (EB-PVD) F M2 (8.2, 4.1), a future success is foreseen 
while recommending at the same time that new. 

In the group of technologies consisting of chemical vapor deposition (CVD), the 
best strategic positions, evaluated each time to have 9 points in the scale of 
10 points, have the following methods: metal organic chemical vapor deposition 
(MOCVD) Gm 3 (8.6, 8.5), laser chemical vapor deposition (LCVD) Em 3 (8.7, 8.3), 
and plasma-assisted chemical vapor deposition/plasma-enhanced chemical vapor 
deposition (PACVD/PECVD) Dm 3 (8.5, 8.2) found in the best 16 matrices of 
strategies for technologies for which the oak in spring strategy is recommended; 
thus, their future success is certain. A prototype technology of atomic layer 

c 

deposition (ALD) J M3 (4.4, 8.8) is also very promising (8 points), in relation to 
which it is recommended to use the cypress in spring strategy consisting of further 
scientific and research works aimed at its improvement and enhancement. It is also 
recommended to make additional investment in this attractive technology 
connected with exploiting numerous opportunities arising from the micro- and 
macroenvironment. An analysis of the results of heuristic studies visualized using 
contextual matrices indicates that among thermochemical technologies, the best 

o 

strategic position has hybrid technologies J M4 (9.0, 9.1) combining at least two 
surface treatment methods, i.e., nitriding and physical vapor deposition (PVD) or 
carburizing with chemical vapor deposition (CVD). The technologies currently 
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Fig. 18 The first quarter of the matrix of strategies for technologies concerning the thematic areas 
of M1-M7 and P1-P7 (Dobrzanska-Danikiewicz 2012b) 


experiencing a growing phase were also evaluated very high (9 points): plasma and 

o o 

low-pressure carburizing F M4 (8.7, 8.9) and low-pressure nitriding B M4 (8.3, 8.6). 

o 

The experts also evaluated plasma nitriding A M4 (8.6, 8.6) being in its 
early-mature phase, with 9 points. All the technologies J M4 , B M4 , F M4 , and A M4 
were found in the most advantageous group of 16 matrices, and the oak in spring 
strategy is recommended for them consisting in developing, strengthening, and 
implementing an attractive technology with a large potential in the industrial 
practice, and their future success is guaranteed. 

It was pointed out by analyzing the results of heuristic studies conducted through 
an electronic survey of experts that the following polymer technologies of surface 
layers have the best strategic position evaluated with 9 points in the ten-degree scale 

o 

of universal states: deposition of coatings with nanofillers G M5 (8.3, 8.6) and 

c 

graded coatings F M5 (8.6, 8.1) with respect to which it is recommended to apply 
the oak in spring strategy where efforts are made to achieve a success by developing 
and strengthening the technologies that are boding very well set in a friendly 

c 

environment bringing numerous opportunities. Electrophoretic deposition D M5 

c 

(6.6, 8.7) and electrostatic fluidized-bed deposition E M5 (6.6, 8.3) also rank high 
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Fig. 19 The second quarter of the matrix of strategies for technologies concerning the thematic 
areas of M1-M7 and P1-P7 (Dobrzanska-Danikiewicz 2012b) 


in the ranking (8 points), and they are placed in the field of dwarf mountain pine in 
spring, and a strategy recommended for them assumes that such mature technolo- 
gies must be rendered more attractive, more modern, computerized and automated, 
and also intensively promoted in a supportive environment. 

A heuristic analysis using reference data acquired via electronic surveys of 
experts with the e-Delphix method was carried out also with reference to the critical 
technologies of materials surface engineering classified as technologies of 
nano structural surface layers among which a strategic position of physical vapor 
deposition of nanometric surface layers using ion beam-assisted deposition (IB AD) 

c c 

E m 6 (9.4, 9.1) and with electron beam physical vapor deposition (EB-PVD) D M6 
(9.2, 8.6) was evaluated extraordinarily high (10 points). The oak in spring strategy 
should be employed with regard to those exceptionally promising technologies that 
were granted the maximum possible rate as it encourages their development, 
strengthening, and implementation on a wide industrial scale, and their future 

o 

success is certain. The surface treatment of nanomaterials J M6 (4.4, 9.0) 
(Dobrzanska-Danikiewicz and Lukowiec 2013) and also atomic layer deposition 
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Fig. 20 The third quarter of the matrix of strategies for technologies concerning the thematic 
areas of M1-M7 and P1-P7 (Dobrzanska-Danikiewicz 2012b) 


o 

(ALD) F M6 (4.9, 8.8) were evaluated very high (9 points). A strategic position of 

o 

the deposition of coatings with nanomaterials on surface layers I M6 (4.7, 8.4) and 

o 

of electron beam lithography (EBL) B M6 (4.3, 8.0) was evaluated very high 
(8 points). The groups of technologies J M6 , F M6 , and I M6 being in their prototype 
phase of development as well as the early-mature group B M6 require that the 
cypress in spring strategy is applied, showing that further scientific and research 
efforts need to be pursued to improve and strengthen the potential of such prom- 
ising young technologies; numerous opportunities should also be exploited emerg- 
ing in the closer and farther environment. 

The results of heuristic studies indicate that thermal sprayed coating deposition 

o 

B M7 (9.1, 8.7) have the best prospects of strategic development for the technolo- 
gies classified as other surface engineering technologies determined as extraordi- 
narily high (10 points). The technologies were placed in the best 16 matrices of 
strategies for technologies, their further development and numerous applications in 
industrial practice should therefore be expected, and such development should be 
aided with development and strengthening of such proven solutions in supportive 
environmental conditions. The cypress in spring strategy, whereby the potential of 
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Aspen Cypress 



Fig. 21 The fourth quarter of the matrix of strategies for technologies concerning the thematic 
areas of M1-M7 and P1-P7 (Dobrzanska-Danikiewicz 2012b) 


an attractive technology is enhanced through further scientific and research works 
aimed at technology improvement, while using the opportunities from the environ- 

o 

ment, should be applied for the methods of ceramics/cermetals deposition E M7 
(5.0, 8.9), whose strategic position was rated very high (9 points), and casting and 

o 

infiltration surface layers manufacturing methods F M7 (4.4, 8.5) rated very high 
(8 points). A good strategic position (8 points) was observed for galvanic coating 
deposition A S M7 (7.0, 8.3) found in the field of dwarf mountain pine in spring, 
meaning that measures need to be undertaken to make this mature technology 
attractive, modem, automated, and computerized, with such processes being 
supported by a friendly environment, as well as for the deposition of coatings 

c 

formed in low pressure from powders and sintering C M7 (8.9, 5.0) for which the 
oak in autumn strategy is recommended. This strategy, appropriate for attractive 
technologies with a high potential being in a neutral environment, belongs to those 
with strong prospects and measures need to be taken to broaden the markets, 
customer groups, and areas of applications. 

It was revealed in an analysis of strategic prospects of biomaterials surface 
engineering carried out using a newly established methodology of computer- 
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integrated prediction of development that in this thematic area, regarded by experts 
as one of the most promising for those under examination, there are as many as six 
groups of technologies, H P1 , D P1 , A P1 , J P1 , B P1 , and C P1 , characterized by a very 
high strategic position (9 points). The strategies recommended for them are, 
however, diversified. As regards the physical and chemical vapor deposition 
methods (PVD/CVD) H S P1 (9.2, 8.4) and the sol-gel method D S P1 (8.9, 8.2), it is 
reasonable to apply the oak in spring strategy consisting of developing, strength- 
ening, and wide application in industrial conditions of such early-mature technol- 
ogies set in a friendly environment, which ensures their commercial success. The 
dwarf mountain pine in spring strategy is recommended for immobilization 
methods (6.8, 9.1) and for deposition of diamond layers and diamond-like coatings 

o 

(DLC) J P1 (7.2, 8.7), and the planned strategic measures should be aimed at 
making the technological solutions more attractive and modem, at automation 
and computerization of a machine park and at promotion, which should strengthen 
such mature technologies in a supportive environment. As regards the growing 
technology of self-organization monolayers, deposition^ (4.5, 9.4) and pattering 
C P1 (5.0, 8.7) being in its early-mature phase of lifecycle - with such pattering 
potentially occurring in the processes of local laser irradiation, surface injection, or 
bombing with an ion beam - it is rather recommended to apply the cypress in spring 
strategy consisting of exploiting the opportunities from the environment while 
strengthening the technology’s potential. This strategy should also be applied for 

o 

highly rated (8 points) infiltration methods E P1 (4.2, 8.0), enabling the surface 
production of graded materials. 

It was pointed out by analyzing the results of the heuristic studies conducted 
through an electronic survey of experts that the best strategic position for the critical 
technologies of stmctural metallic materials surface engineering, rated maximally 
with 10 points, has the physical and chemical vapor deposition methods (PVD/CVD) 

o 

J P2 (9.3, 9.5), with respect to which it is recommended to apply the oak in spring 
strategy consisting of attempts to achieve the guaranteed success by developing and 
strengthening those technologies boding well in a friendly environment. Thermal and 

o 

thermochemical technologies C P2 (7.3, 8.7) widely used in different versions and 
variants for the surface treatment of metal materials enjoy a very high strategic 
position (9 points). The group of technologies C P2 requires the dwarf mountain 
pine in spring strategy to make late-mature technologies more attractive and to 
modernize them, to computerize them, and to automate a machine park, and also to 
undertake promotional actions reinforcing their competitive market position at the 
market at which opportunities prevail. Similarly, the strategic position of the follow- 

o 

ing technologies was also evaluated very high (9 points): ion implantation H P2 (4.6, 
9.2) being in its prototype phase of life and the growing ceramic s/cermetals deposi- 

o 

tion technology I P2 (3.9, 9.2). With reference to such groups of technologies, as well 
as to a highly evaluated (8 points) early-mature technology of laser melting/alloying 

o 

F P2 (4.7, 8.3), it is recommended to apply the cypress in spring strategy, according to 
which opportunities should be exploited coming from the environment, while making 
the technology more attractive and strengthening its potential, the purpose of which 
are for research and development works. 
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Contextual matrices were created, as part of a heuristic analysis carried out using 
a newly established methodology, visualizing the strategic positions of the individ- 
ual groups of critical technologies classified to the studies under the considered 
thematic area of structural nonmetallic materials surface engineering. The highest 

o o 

possible rate (10 points) was given to chemical B P3 (9.2, 9.3) and physical A P3 
(9.1, 9.1) vapor deposition methods with respect to which it is recommended to 
apply the oak in spring strategy leading to a market success in the supportive 
conditions of the environment. A prototype group of technologies of pulse laser 

o 

deposition (PLD) or deposition using laser plasma extreme ultraviolet EUV J P3 
(4.9, 9.1) received very high rates (9 points), and the cypress in spring strategy is 
recommended for it assumes that the supportive environmental conditions are used 
and investigations into such technologies continued. The innovative - on a global 
scale - research into micro- and nanotreatment of organic polymers with EUV 
irradiation produced with laser and plasma sources is especially noteworthy. The 
technologies with a high strategic position (8 points) include the sol-gel method 

o o 

F P3 (6.7, 8.6) and ion implantation E P3 (6.5, 8.0) placed in 16 matrices of 
strategies for technologies corresponding to the dwarf mountain pine in spring, 
meaning that their attractiveness should be reinforced while making use of numer- 
ous opportunities coming from the environment, as well as the production of 

o 

coatings in galvanization processes C P3 (8.6, 5.1) with reference to which the 
oak in autumn strategy is recommended, where current benefits are derived while 
seeking at the same time new applications for technologies in the neutral conditions 
of the environment. 

An analysis carried out according to the results of heuristic studies presented 
graphically using contextual matrices allowed to determine the strategic position of 
critical technologies of tool materials surface engineering. The highest possible 

o 

rate (10 points) was assigned to the physical vapor deposition method (PVD) A P4 
(9.5, 8.4) found in the very promising field of the matrix of strategies for 

o 

technologies - oak in spring, just like chemical vapor deposition (CVD) B P4 
(9.1, 8.1), is assigned 9 points; therefore, wide applications in industrial practice 
using the opportunities coming from the environment are recommended for both 
groups of technologies. The strategic positions of very promising prototype treat- 

o 

ment methods, i.e., pulsed laser deposition (PLD) J P4 (4.4, 9.0), hybrid technolo- 

o o 

gies I P4 (4.8, 9.0), and graded coating deposition technology H P4 (4.7, 8.7), were 
evaluated very high (9 points) with their further development requiring continued 
R&D works and the use of opportunities from the environment, i.e., the cypress in 
spring strategy. 

An analysis of strategic positions was made, using the original reference data 
collected during the e-foresight research conducted, of the relevant groups of tech- 
nologies classified to heuristic studies in the thematic field of surface engineering of 
steels used in automotive industry. The following technologies have the highest 
strategic position (9 points) for such technologies: hot dip zincing and annealing 

o 

(Zn-Fe coating) B P5 (8.1, 9.1), whose future success is guaranteed as it was found in 
the most promising quarter of the matrix of strategies for technologies - oak in spring 

o 

- as well as powder polymer coating deposition I P5 (4.6, 9.0) placed in the field of 
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cypress in spring; the potential of this early-mature technology in supportive envi- 
ronmental conditions needs, therefore, to be reinforced. The cypress in spring 
strategy is also recommended for implementation for the deposition of coatings 

o 

from polymer foils J P5 (3.9, 8.2), being currently in the growing phase, as well as 

o 

for thermal spraying F P5 (4.6, 7.9) that has entered its early-mature lifecycle phase. 
The strategic position of both technologies J P5 and F P5 is high (8 points), just like of 

o 

painting and lacquering using liquid polymer materials H P5 (6.8, 8.5), which - as a 
mature technology - corresponds to dwarf mountain pine, and supportive environ- 
mental conditions make it reasonable to apply the dwarf mountain pine in spring 
strategy here, in line with which it is necessary to take actions to improve its 
attractiveness through modernization, automation, and promotion. 

A heuristic analysis using the custom methodology of computer-integrated pre- 
diction of development allowed to define the strategic positions of individual groups 
of technology which, as critical technologies, were analyzed in the thematic area of 
surface engineering of glass, micro- and optoelectronic elements, and photovoltaic 

c 

elements. Physical vapor deposition (PVD) B P6 (9.5, 8.9) that received the maxi- 
mum note (10 points) and, therefore, was classified to the most promising 16 matrices 
of strategies for technologies -oak in spring - has the best strategic position for the 
technologies analyzed in this area; it is therefore recommended to develop and 
strengthen this group of technologies which in the future should be used more and 
more at a wide industrial scale, especially that the environment is very supportive. 

o 

The methods of chemical vapor deposition (CVD) A P6 (9.1, 8.4) and sol-gel 

o 

methods D P6 (8.7, 8.6) evaluated very highly (9 points) were also found in the oak 

o 

in spring field. Incipient technologies, including laser texturization E P6 (4.8, 9.1) and 

o 

production of organic-inorganic hybrid coatings J P6 (4.6, 9.3), influenced inten- 
sively by positive factors coming from the environment, require a cypress in spring 
strategy, consisting of strengthening the technology potential using numerous posi- 
tive external events influencing positively their progress. 

A heuristic analysis carried out using a newly established methodology permit- 
ted to identify the strategic positions of the relevant groups of critical technologies 
of polymer materials surface engineering. The results of the research reveal that five 
groups of technologies, i.e., B P7 , H P7 , J P7 , I P7 , and D P7 , were evaluated very high 
(9 points), signifying good prospects of the whole thematic area. Young, promising 
technologies, i.e., those obtained from graded coatings on polymer surface layers 

o o o 

H P7 (5.0, 9.5) and self- stratifying I (4.7, 8.9), in situ polymerization J P7 (4.6, 

q 

9.2), and laser treatment of polymer surface layers D P7 (4.8, 8.6), were placed in 
the cypress in spring field, meaning that scientific works should be continued to 
develop them and numerous opportunities coming from the environment should be 

o 

used. A relatively simple and efficient method of corona discharges B P7 (7.4, 8.3) 
widespread in industry due to low expenses, both at the investment and operation 
stage, was placed in the dwarf mountain pine in spring field, similar to plasma 

o 

treatment of polymer surface layers C P7 (7.0, 8.4) evaluated highly (8 points); it is 
thus recommended for those technologies to exploit the opportunities arising from 
the environment while caring at the same time for their improved attractiveness to 
maintain competitive advantage. 
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An analysis made with contextual matrices, i.e., a dendrological matrix of 
technology values, meteorological matrix of environment influence, and a matrix 
of strategies of technologies, being the result of the two first matrices and statistical 
lists, generated on the basis of results of the e-Delphix method, allowed to deter- 
mine the strategic positions of the individual groups of critical materials surface 
engineering technologies against the examined thematic area to which they were 
classified and to define their strategic development path for the assumed time 
horizon of the nearest 20 years. 


The Determinants of Industrial Application Conditions 
for Materials Surface Engineering Technologies 

The determinants of industrial application conditions for materials surface engi- 
neering technologies were listed using the reference data provided in the technol- 
ogy roadmaps and technology information sheets prepared for each of the 
140 technologies analyzed and were presented as the selected examples only. A 
framework of the technology roadmap (Fig. 22) corresponds to the first quarter of 
the Cartesian system of coordinates. Three time intervals for 2010-2011, 2020, and 
2030, respectively, are given on the axis of abscissa, and the time horizon for the 
overall results of the research provided on the map is 20 years. Seven main layers 
are provided on the axis of coordinates of the technology roadmap responding, 
respectively, to the following more and more detailed questions: When? Why? 
What? How? Where? Who? How much? Overview of the relevant layers is 
presented in Table 4. 

The main layers of the technology roadmap are ordered according to 
their hierarchy starting from the uppermost, most general ones, determining all 
social and economic premises, reasons, and causes of the actions followed, 
through the middle layers characterizing a product and its manufacturing tech- 
nologies, ending with the bottom layers detailing the organizational and technical 
issues concerning the place, contractors, and costs. The middle layers of the 
technology roadmap are subjected to two types of influence - pull from the 
uppermost layers and push from the bottom layers. The relationships between 
the individual layers and sub-layers of the technology roadmap are presented with 
the different types of arrows representing, respectively, cause-and-effect relation- 
ships, capital ties, time correlations, and two-directional data and/or resources 
flows. The technology roadmaps are a very convenient tool for a comparative 
analysis enabling to select the best technology according to the criterion chosen. 
Besides, their flexibility is their undisputed advantage, and, if needed, additional 
sub-layers can be added to or expanded for the maps according to the circum- 
stances of the industry, size of an enterprise, scale of the company’s business, or 
an entrepreneur’s individual expectations. An example of a technology roadmap 
shown in Fig. 23 was made for reactive magnetron sputtering (RMS). Technology 
information sheets, containing technical information very helpful in 
implementing a specific technology in the industrial practice, especially in 
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Fig. 22 Technology roadmap framework 


Table 4 Overview of the main layers of technology roadmap 



No. 

Layer 

Scope 

Question 

Description 

1 . 

Time layer 

Sequence 

When? 

Defines the set time intervals and time 
horizons of research conducted 

2. 

Conceptual 

layer 

Relevance 

Why? 

Formulates in detail general social and 
economic prospects of actions conducted 
and a strategy appropriate for a given 
technology 

3. 

Product layer 

Subject 

What? 

Characterizes a product produced in a 
given technological process according to its 
structure and properties 

4. 

Technological 

layer 

Method 

How? 

Describes a particular technology 
according to the following detailed criteria: 
lifecycle, type, and form of production; 
machine park, automation, and 
robotization; quality; and environment 

5. 

Spatial layer 

Place 

Where? 

Determines type of organization and 
represented industries 

6. 

Personnel 

layer 

Contractor 

Who? 

Identifies personnel structure and expected 
personnel skills 

7. 

Quantitative 

layer 

Cost 

How 

much? 

Provides capital requirements and 
estimated production volume 


SMEs lacking the capital allowing to conduct own research, are detailing and 
supplementing the technology roadmaps and are provided as an example. A 
selected, representative technology information sheet made for RMS is given in 
Fig. 24a, b. 
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Fig. 23 A demonstrating technology roadmap prepared for the reactive magnetron sputtering 
(RMS) 

Technology roadmaps and information sheets prepared for 140 critical technol- 
ogies of materials surface engineering make up the Book of Critical Technologies 
of Surface Structure and Properties Formation of Engineering Materials 
(Dobrzanska-Danikiewicz 2013). It is estimated that the Book of Critical Technol- 
ogies is to be available on the Web as an e-book and information contained therein 
will be disseminated via the Internet without any limitations and for free, using a 
publicly available (open access) web platform in line with a newly established 
concept of e-transfer of technology (Dobrzanska-Danikiewicz and Lukaszkowicz 
2010; Dobrzanska-Danikiewicz et al. 2010b, 201 If). The technology e-transfer 
Internet platform planned to be created is to be made up of three compatible 
functional modules embracing e-consulting, e-training, and e-information, coupled 
with a technology database. In e-consulting, technology roadmaps and information 
sheets will be available being a compendium of the knowledge considered on 
priority innovative technologies, characterized in a uniform manner, enabling to 
compare them easily according to the selected materials science, technological, or 
organization criterion. E-training allows the self-education of beneficiaries by 
access to specialist training materials and using a self-control system in form of 
tests that can be completed multiple times until a result satisfactory for the user is 
reached. Supplementary, highly specialized training materials, available mainly in 
English, are represented by articles and monographs collected in a publicly avail- 
able database of scientific works, a so-called open repository. Updates concerning 
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web platform resources, conferences, workshops, and other initiatives related to 
current activity will be covered by e-information. 

The concept of setting up an e-technology transfer center is the continuation of the 
idea and the extension of the aims of e-foresight to incorporate the domain of 
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application and implementation of knowledge on the selected engineering materials 
surface properties and structure formation technologies and, in general, material 
processes technologies and engineering materials processing, primarily in the machine 
and electrotechnical industry. The synergic influence of both concepts of e-foresight 
and technology e-transfer creates a full and integrated system of prediction of the 
development of surface properties and stmcture formation technologies and of imple- 
mentation of the results of such research in an extensive environment of managers and 
engineers employed in industrial entities. In line with the adopted conceptual assump- 
tions, interested individual can be provided, anytime and without any restrictions, with 
all the information, while the monitoring of current issues - being merely an indirect 
way of interaction with enterprises - should allow to focus research works on 
satisfying the real needs of a knowledge- and innovation-based economy. 


Summary 

A new approach presented in this book chapter enables the neural network-aided 
creation of alternative scenarios of future events, strategic placement of technolo- 
gies with graphical analytical, i.e., contextual matrices and a comparative analysis 
of technologies using roadmaps and technology information sheets. The methodo- 
logical approach presented serves to lessen the risk in predicting the prospective 
development directions of engineering materials surface layer properties and struc- 
ture formation technologies. The analyses carried out indicate that heuristic studies, 
based on expert knowledge, allow to produce credible results of predicted technol- 
ogy development, without having to support them each time with the results of 
materials science research. This was confirmed in the experimental verification 
process of the new approach undertaken for 36 groups of technologies. The results 
of detailed experimental investigations into the structure and properties of engi- 
neering materials surface layers were used as a reference point. The newly 
established methodology, describing concisely the actions and activities targeted 
at creating multivariant probabilistic scenarios of future events, at selecting and 
characterizing the critical technologies in a clear and harmonized manner, and at 
paving the strategic development directions, organizes, streamlines, and modern- 
izes the prediction process. The use of information technology encompassing a 
virtual organization, web platform, and artificial neural networks plays an important 
role here. It is reasonable to use artificial neural networks to create multivariant 
probabilistic scenarios of future events because they allow to generate very quickly 
alternative forecasts as probability values that the alternative macroscenarios of 
future events occur dependent upon the emergence of the special conditions or 
factors considered at a mezo-level. It was recognized that it is possible, needed, and 
necessary to implement the outcomes of the research performed in the economic 
reality at the macro-, mezo-, and microlevel and that the new methodology, 
supported with extended information technology, is suitable for direct applications 
in other areas of knowledge while maintaining economically reasonable costs. It is 
foreseen that if the effects of the e-foresight research performed are broadly 
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disseminated via the Internet, this will represent - in line with the technology 
e-transfer concept - one of the crucial factors contributing to accelerated sustain- 
able development, to a stronger knowledge- and innovation-based economy, and to 
statistical growth in the quality of technologies used in industry. 


Comments 

All the results, figures, and detailed diagrams presented in this chapter of the book have 
been developed in the framework of an author’s multiannual foresight project entitled 
FORSURF founded by EFFR and Polish Ministry of Science and Education completed 
in September 2012 aimed at determining the future development trends of materials 
surface engineering and at identifying the priority innovative technologies in this area. 
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Abstract 

Surface engineering aims at tailoring the micro structure and/or composition of 
the near surface region of a component for improving surface dependent engi- 
neering properties. Conventionally, surface engineering may broadly be classi- 
fied into two categories: surface modification (where the treated layer is part of 
the substrate) and coating (adding another layer onto the surface). Laser as a 
clean source of heat may be used for modification of micro structure and/or 
composition of the near surface region of the component by heating/melting or 
by deposition and alloying/cladding. Especially, because of its exponentially 
decaying energy distribution profile, laser enjoys a prominent position for its 
application in surface engineering. Laser surface engineering may be classified 
as surface transformation hardening, surface melting, laser surface alloying, and 
laser surface cladding. In this chapter, the application of laser for surface 
modification like laser transformation hardening, melting and homogenization 
of surface micro structure, changing composition by laser surface alloying for 
improving surface properties for structural application and laser surface cladding 
techniques will be discussed in detail. With a brief introduction to the individual 
technique, the principle of its operation will be discussed. Finally, the examples 
of application of laser surface engineering will be discussed in detail. 


Introduction 

Surface micro structure and composition control the serviceability and functionality 
of the component to a significant extent. Hence, an optimum designing of the 
microstructure and composition of the near surface region is necessary for tailoring 
the surface dependent structural and functional properties of any component (ASM 
1982). Furthermore, in real components, the properties needed on the surface are 
often different from those in the bulk. Surface engineering aims at modifying/ 
changing the micro structure and/or composition of the near surface region of a 
component to improve surface dependent engineering properties (ASM 1982). The 
advantages of surface engineering over bulk manufacturing techniques include 
conservation of strategic and expensive alloying elements, improved functionality, 
and improved service life without altering the properties of the bulk, and hence, 
economic in energy and cost. There are a large number of techniques available for 
altering the surface chemistry and micro structures of the component. However, the 
choice of the technique depends on the required microstructure, composition, 
thickness, environmental impact, and economic viability. Laser surface engineering 
is an emerging surface engineering technique which involves tailoring of the 
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micro structure and/or composition of the near surface region of the component 
using laser as a source of heat (Dutta and Manna 2011). 

When a laser beam interacts with matter, there is absorption of photons from the 
incident laser beam which excites the electrons from the valence/conduction bands 
to states of higher energy. The excited electrons may get back to their original state 
by emitting photons which are equivalent to the difference in energies between 
excited and de-excited states. These emitted photons interact with phonon to 
generate heat. The time duration for the electron excitation to electron phonon 
interaction is as short as 10 -9 s. As a result, there is instantaneous transfer of heat 
during the laser matter interaction process. Figure 1 illustrates the energy distribu- 
tion profile with depth during laser irradiation of solid surface. For laser irradiation, 
the beam intensity (/) at any depth (z) for the normally incident beam of initial 

o 

intensity I Q (in W/m ) is given by (Picraux and Follstaedt 1983): 

I(z,t) = I 0 (t)(l - R)exp(-az), (1) 

where I Q is the incident intensity, t is time, R and a are the reflectivity and 

/r i 

absorption coefficients, respectively. Since a is very high (~10 cm ) for metals, 
light is totally absorbed within a depth of 10-20 nm. The efficiency of optical 
coupling is determined by the reflectivity ( R ). R for metals is relatively low at short 
wavelengths, rises abruptly at a critical wavelength, then remains very high at long 
wavelength (Picraux and Follstaedt 1983). Due to the exponential energy distribu- 
tion profile of the laser beam it is able to uniquely heat the near surface region of the 
component for modification of its micro structure without changing composition 
(hardening, remelting, shocking, texturing, and annealing), or both microstructural 
as well as compositional modification of the near-surface region (alloying, clad- 
ding, etc.) of the component (Dutta and Manna 2011). The advantages of applica- 
tion of laser in surface engineering over conventional techniques include a rapid 

a i i /: o 

rate of heating/cooling (10 -10 K/s), a very high thermal gradient (10 -10 K/m) 

and ultra-rapid solidification velocity (1-30 m/s) achievable during laser assisted 
surface processing (Molian 1989). These extreme processing conditions could 
develop metastable microstructures and compositions in the near surface region 
with large extension of solid solubility. 

Figure 2 shows the general classification of laser surface engineering. The 
processes are divided into three major classes, namely the process involving only 
heating (without melting), melting (without vaporizing), and vaporizing. The 
regime for individual laser surface engineering technique as a function of applied 
power density and interaction time is illustrated in Fig. 3 (Dutta and Manna 2011). 
It may be noted that transformation hardening needs only surface heating and 
requires low power density. Surface melting, cladding, glazing, and reclamation 
involve melting and require a moderately high power density. Cleaning and shock 
hardening demand removal of materials as vapor, hence, delivery of a very high 
power density is needed. Therefore, a careful control and optimization of several 
important process parameters are needed for achieving a desired micro structure and 
composition of the surface. The main process parameters active in controlling the 


2642 


J.D. Majumdar and I. Manna 


Fig. 1 Energy distribution 
profile with depth during laser 
irradiation of solid surface 
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Fig. 2 General classification of laser surface engineering 


Fig. 3 Processing regime of 
different laser surface 
engineering (Dutta and 
Manna 2011) 
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micro structure and composition of the surface include the type of laser used 
(wavelength of laser), power density, beam size, geometry, working distance, 
powder feed rate, and deposition angle. 

In this chapter, with a brief description of laser surface engineering techniques 
like laser transformation hardening, melting and homogenization of surface micro- 
structure, laser surface alloying, and laser surface cladding, the role of process 
parameters in controlling the micro structure and composition of the surface 
engineered samples will be discussed in detail. Finally, the examples of application 
of laser surface engineering techniques will be stated. 


Laser Transformation Hardening 

Laser transformation hardening is a process where a high power laser beam (either 
in continuous wave mode or in pulsed mode) is used as a heat source to austenitize 
the surface of Fe-C alloys (steel, cast iron, etc. with a carbon content higher than 
0.4 %) and is subsequently, quenched to a temperature below martensitic start 
temperature (Ms) to induce martensitic transformation by self-quenching or by 
spraying external quenchant. Figure 4 presents the laser surface transformation 
hardening technique schematically, with the temperature distribution in different 
regions marked and labeled. The steps in transformation hardening include 

(a) austenitization, which takes place at a temperature of 50-100° above AC3 
temperature. The transformation to austenite from the initial micro structure during 
the heating cycle is generally assumed to occur in two steps: in the first step pearlite 
transforms to austenite followed by transformation of ferrite into austenite. Ferrite 
transforms at higher temperature as it has low solubility of carbon 

(b) homogenization, it is important that sufficient time should be available for the 
diffusion and distribution of carbon all throughout the micro structure, and 

(c) quenching at temperature below martensitic start (Ms) temperature for transfor- 
mation from austenite to martensite. The kinetics of transformation hardening is 
dictated by the kinetics of dissolution of carbon into matrix. Under equilibrium 
conditions, as soon as the temperature during heating reaches above eutectoid 
temperature, pearlite starts to transform into austenite. The diffusion distance for 
carbon in pearlite is small; due to this the pearlite transformation to austenite is very 
rapid and sometimes assumed to be instantaneous. The reaction then proceeds to the 
remaining ferrite after all the pearlite is consumed. This step occurs within a 
temperature range which is limited by the eutectoid line and the transition temper- 
ature. During laser transformation hardening, the surface temperature should be as 
high as possible to shorten the time to complete the transformation to austenite 
while a high temperature gradient is required to heat a sufficient thick surface layer 
in a short time. A short time and high temperature gradients are also required to 
prevent heating of the bulk material. 

The parameters for laser transformation hardening include type of laser used, 
beam shape, applied laser power density, relative scan speed between the laser and 
substrate or pulse time, mode of its operation (pulsed or continuous mode), surface 
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Fig. 4 Schematic of laser 
transformation hardening 
technique 
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composition, topography, and absorptivity. The effect of process parameters 
responsible in controlling the micro structure and composition of the laser processed 
zone will be discussed briefly in the later part of the process description. 

Depth of the transformation hardening plays an important role in determining the 
service life of any component. The higher the depth of hardening is, the greater the 
durability of the component under wear will be. The depth of transformation 
hardening depends on the choice of applied power density and interaction time 
along with the chemical composition of the material used for hardening. The 
maximum depth of hardening achieved by laser surface hardening can be as high 
as 0.8 mm. However, for some industrial applications, a hardened depths of up to 
2.0 mm is needed which necessitates a proper designing of the process and 
application of external quenchant. Typical hardened case depths of up to 1.6 mm 
have been reported by Schuocker for type 50NiCrl3 material (1998). Due to the 
high reflectivity of metal surfaces to the C0 2 laser (wavelength of 10.6 pm), an anti- 
reflective coating is often applied onto the metal surface to increase absorptivity. 

A detailed review of laser surface hardening and its comparison with conven- 
tional surface hardening technique is described by Kennedy et al. (2004). The 
characteristics of laser surface hardening over conventional hardening include 
economic in time and energy, scope of full automation, minimum macro distortion, 
and hence, reduced need for additional machining, a precision thickness of the 
hardened layer, no need for external quenchant and environmental cleanliness. The 
application of high power lasers in surface hardening has already been proven to be 
feasible and is applied on real components for automobile and aerospace industries 
for hardening the surface layers of turbine blades, crankshafts, and tractor engine 
components. Benedek et al. (1980) carried out case hardening of carbon steel, 
armco iron, alloyed steel, and high-speed steel using a C0 2 laser beam. Processing 
at optimal conditions yielded a hardening depth of up to 0.7 mm and a surface 
hardness of up to 720 HV. Prior microstructure of the substrate influences the final 
micro structure of the laser surface hardened steel to a significant extent. Grum and 
Slabe (2005) observed the formation of nanometric precipitates with a diverse 
micro structure up to varying depth following laser surface hardening of maraging 



74 Laser Surface Engineering 


2645 


steel. Heidkamp et al. (2004) studied the effect of laser beam hardening on the 
properties of TiN-coated AISI 4140 (EN 42CrMo4), AISI A2 (EN X100CrMoV5-l), 
and AISI D2 (EN X153CrMoV 12) substrates by CVD. A composite material with flat 
strength and hardness gradients from the interface to the core, together with a high 
surface hardness, increased wear resistance, and an increased material resistance 
against high mechanical load was achieved. 

Surface conditions (roughness, residual stress, phase aggregate) play a signifi- 
cant role in determining the characteristics of hardened layer. The effect of different 
types of absorbing coating on laser surface hardening of C45E steel was determined 
by Grum and Kek (2004). Heitkemper et al. (2003) showed that the fatigue and 
fracture behavior of laser heat treated high nitrogen martensitic tool steel 
(X30CrMoN15) with a 3 kW Nd:YAG laser was significantly improved as com- 
pared to that of the untreated one. The residual stresses generated during laser 
treatment had a significant influence on crack initiation, while those generated 
during the transformation of retained austenite had a minor influence on crack 
propagation. Iordanova et al. (2002) carried out laser surface melting of 0.46-mm- 
thick cold-rolled low carbon steel by pulsed Nd:glass laser up to a thickness of 
80 pm. Prior cold rolling developed residual tensile stress both along rolling and in 
transverse directions. It was observed that laser melting does not affect the residual 
stress along the rolling direction, but significantly increases it along the transverse 
direction with a weaker and scattered texture. 

Amulevicius et al. (2000) studied the influence of simultaneous laser irradiation 
and ultrasound on the phase evolution in the Fe-Si-C system (gray cast iron) by 
means of Mossbauer spectroscopy. The application of ultrasound reduced the 
amount of retained austenite in the samples by two to three times due to active 
dissolution and re-distribution of carbon between austenite and iron-carbon clus- 
ters. Chen and Shen (Chen and Shen 1999) optimized the process parameters for 
laser transformation hardening of SNCM 439 steel using Taguchi methodology and 
fuzzy evaluation method. It was concluded that there was improvement in hardness 
from HRC 52.5-63.9; with an increase in hardening width 0.43-0.89 mm, with a 
reduction in erosion loss from 69.55 to 40.94 mg when a long pulse Nd-YAG laser 
was used for surface hardening. It was concluded that both the fuzzy evaluation 
method and Taguchi methodology were effective and applicable for evaluating the 
usefulness of long pulsed Nd-YAG laser transformation hardening processes quan- 
titatively. Laser surface treatment of pearlitic gray and ductile cast irons using a 
5 kW continuous wave C0 2 laser improved the erosion resistance of gray iron by 
5 times and the same for ductile iron by 26 times (Tobar et al. 2006). Molian (1981) 
studied the effect of laser surface hardening on the micro structure of AIS 14340 
steel and compared it with the conventional hardened one. The micro structure 
exhibited a mixture of dislocated and twinned martensites with an enhanced volume 
fraction of retained austenite as compared to the conventional furnace heat treated 
(870 °C treatment) specimens, though, prior austenite grain size and martensitic 
substructure remained essentially the same. Roy and Manna (2001) have demon- 
strated that laser surface hardening (LSH), instead of laser surface alloying (LSA) 
or laser surface melting (LSM) is more effective in enhancing the wear resistance of 
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austempered ductile iron (ADI), attributed to the development of martensitic 
surface with residual compressive stress. Adel et al. (2009) studied the effect of 
neodymium yttrium (Nd-Yag) pulsed wave laser assisted heat treatment on the 
coefficient of friction and wear resistance of acicular bainitic ductile iron and 
observed a significant refinement of micro structure with an improved wear resis- 
tance and reduced coefficient of friction. 

In spite of ample advantages of laser surface hardening as compared to conven- 
tional hardening, a large thermal stress is generated in the laser irradiated surface 
resulting from the high cooling rate and thermal gradient associated with the 
process (Mura 1982). The magnitude of residual stress and its vector depends on 
laser parameters. The residual stress arrested due to laser transformation hardening 
may be due to combined influence of (a) quench stress, which is tensile in nature 
and (b) transformation stress, which is compressive in nature. Compressive residual 
stress is beneficial in improving fatigue property, however, tensile residual stress is 
detrimental as it deteriorates wear, corrosion, and fatigue properties of material. 
Hence, process parameters should be carefully chosen to minimize the residual 
stress arrested due to laser transformation hardening. Prediction of the residual 
stress on the surface necessitates a quantitative knowledge of the thermal history of 
the alloyed zone. Furthermore, the microstructure of the hardened zone developed 
by LSH is closely related to the thermal history of the process. The parameters 
which play a role in controlling the thermal history and residual stress are laser 
power density and scan speed. Time-resolved measurement of temperature during 
this process of transient heating and cooling within the short laser interaction time 
(/i), is extremely difficult and confined only to the top surface. As an alternative, 
mathematical modeling of the heat transfer process accompanying laser surface 
hardening is a convenient tool to predict the thermal history of the alloyed zone 
with reasonable accuracy. In the past, several studies have been attempted to predict 
the temperature distribution during laser surface hardening (Ashby and Easterling 
1984; Mazumder et al. 1996). A one dimensional unsteady state heat transfer model 
based on explicit finite difference method of solution was developed to simulate the 
thermal profile in laser surface hardening and subsequently, the residual stress 
developed on the surface has been calculated considering thermal stress and 
transformation stress. 

Figure 5 presents the distribution of temperature as a function of time at different 
applied power (on the surface and at a depth of 40 pm from the surface) for laser 
surface hardened 0.6 % C steel at the surface (solid line) and at a depth of 40 pm 
from the surface (dashed line). From Fig. 5, it is apparent that the top surface 
experiences the maximum or peak temperature (T peak ), and T peak decreases as depth 
(z) below the surface increases. It is interesting to note that the time (f pea k) to reach 
r p eak (except at the top surface, i.e., at z = 0) gradually increases as depth increases. 
In fact, /peak > h at all z rather than z = 0. From Fig. 5 it is relevant that temperature 
developed at the surface and at a certain depth is dependent on applied power. The 
higher the applied power, the more the temperature develops in each layer because 
of a higher energy input (and hence, absorbed energy). Hence, laser power density 
should be carefully chosen to optimize the depth of austenitization. Too high or too 
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Fig. 5 Variation of 
temperature with time during 
laser heating and 
subsequently, cooling of laser 
surface hardened 0.6 % C 
steel at a scan speed of 50 ms 



Fig. 6 Variation of the 
temperature in the hardened 
layer as a function of the 
depth from the surface lased 
at different power and an 
interaction time of 10 ms 



low a temperature is not desirable as they lead to melting and improper homoge- 
nization of austenite, respectively. It may be pointed out that f peak at a given z is 
always larger than the time needed to reach T rt from T peak on cooling. In other 
words, it suggests that the cooling rate is higher than the corresponding heating rate 
in each layer. The apparent difference within the same volume of material may be 
attributed to the fact that, the effect of thermal gradient during heating cycle is 
smaller than the same during cooling cycle. The present model predicts heating and 
quenching to develop the hardened zone within a very short period of time. 

Figure 6 shows the variation of temperature with depth from the surface during 
cooling of laser surface hardened 0.6 % C steel. From Fig. 6 it is evident that 
temperature is maximum at the surface and decreases as the depth from the surface 
increases. The higher the interaction time, the higher the temperature is in each 
layer, though the temperature gradient does not seem to vary significantly with the 
interaction time. Figure 7 reveals the variation of thermal gradient (dT/dz) with 
depth (z) during cooling cycle of laser surface hardening of 0.6 % C steel (a) lased 
with a power of 0.95 kW for different interaction times (6) and (b) with different 
power but at an interaction time of 40 ms. From Fig. 7 it may be noted that the 
thermal gradient is maximum at the top surface and decreases as the depth from the 
surface increases. This is anticipated because the structure below 500 pm provides 
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Fig. 7 Variation of thermal gradient (dT/dz) with depth (z) during cooling cycle of laser surface 
hardening of 0.6 % C steel (a) lased with a power of 0.95 kW for different interaction times (t\) and 
(b) with different power but at an interaction time of 40 ms 


an infinite heat sink which allows easy and unidirectional conduction of heat 
vertically downward. Furthermore, it is apparent that the higher the 0, and P, the 
greater the thermal gradient at a given depth (z). However, the thermal gradient 
decreases with depth and follows a uniform and almost identical rate for the range 
of t[ studied in this investigation. Similarly, thermal gradient at a particular depth 
increases with an increase in applied power. 

Figure 8 shows the variation of the depth of hardening with (a) applied laser 
power and (b) interaction time for laser surface hardened 0.6 % C steel. Depth of 
hardened layer was found to increase with an increase in applied power and an 
increase in interaction time. Increased hardening depth with increase in applied 
power and interaction time are attributed to an increased quantity of absorbed 
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Fig. 8 Variation of the depth of hardening with (a) applied laser power and (b) interaction time 
for laser surface hardened 0.6 % C steel 

energy. However, too high of an applied power or too low interaction time might 
lead to melting of the surface. On the other hand, application of inadequate power 
and interaction time might cause inhomogeneous dissolution of carbon within the 
matrix. Hence, laser power and interaction time should be chosen so as to obtain a 
homogenous hardened layer with the desired depth of hardening. 

Figure 9 shows the distribution of residual stress along the depth from the 
surface of laser surface hardened 0.6 % C steel. From Fig. 9 it is evident that 
there is introduction of residual compressive stress on the surface, the magnitude of 
which decreases as the depth from the surface increases and gradually changes to 
tensile residual stresses at a depth of 10 pm from the surface. Furthermore, the 
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Fig. 9 Variation of residual 
stress along the depth from 
the surface of laser surface 
hardened 0.6 % C steel 



magnitude of residual compressive stress increases with increase in interaction time 
which is attributed to an increased volume fraction of martensite formed on the 
surface at a higher interaction time. On the other hand, increasing the power 
increases the surface compressive stress and the maximum tensile stress at the 
sub-surface region. Increasing the power causes rapid cooling from a higher 
temperature, thus increasing the volume fraction of martensite in the microstruc- 
ture. In addition to that, increasing the power causes heating to a larger depth, and 
hence, plastic flow will increase a bit in the lower layer which increases the 
maximum tensile stress value. As the compressive stresses are very much desirable 
on the surface (they increase the fatigue strength of the material) their variations 
with power and interaction time have been studied in detail. It is easily seen that for 
a constant power increasing the interaction time increases the surface compressive 
stresses. On increasing the power, there is an increased cooling rate causing an 
increased area fraction of martensitic transformation, and hence, more of the 
transformational stress domination leading to even higher values of compressive 
stresses at the surface. Figure 10 summarizes the variation of residual compressive 
stress developed on the surface of laser surface hardened 0.6 % C steel with 
(a) applied power and (b) scan speed. From Fig. 10 it is evident that the magnitude 
of the residual compressive stress on the surface increases with increase in applied 
power. Furthermore, the rate of increase in compressive stress with applied power is 
higher when laser surface hardened at a higher interaction time. Similarly, it is also 
observed that increasing the scan speed increases the magnitude of compressive 
residual stress on the surface, the rate of which is higher at a higher power. From the 
above mentioned discussion it appears that increasing both the laser power and the 
interaction time is a desirable feature as they increase the compressive residual 
stress on the surface. However, a detailed calculation of the influence of laser 
parameters on the maximum tensile residual stress at the sub-surface region 
shows that maximum residual stress increases with increase in applied power, 
though the interaction time does have a marginal influence on it. 
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Fig. 10 Variation of residual 
compressive stress developed 
on the surface of laser surface 
hardened 0.6 % C steel with 
(a) applied power and (b) 
scan speed 


a Power (KW) 



b Interaction time (ms) 
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From the above discussions, it may be concluded that even though surface 
hardening may be achieved by austenetizing followed by cooling with application 
of a wide range of laser power-scan speed combinations, residual stress distribution 
at the near surface and sub-surface regions varies with laser parameters. Hence, 
optimum choice of laser parameters is essential to ensure a high compressive stress 
at the surface and low tensile stress at the sub-surface regions. In this regard, it is 
relevant to note that application of inadequate power density or low interaction time 
will lead to incomplete austenitization and very low compressive stress at the 
surface, while application of very high powder density and high interaction time 
will cause a very large sub-surface tensile stress and overheating/melting of the 
surface. 


Laser Surface Melting 

Laser surface melting (LSM) also proved to be an important technique to modify 
the micro structure and composition of the near surface region of a component. In 
laser surface melting, a high power laser beam is used as a source of heat to melt a 
thin surface layer of substrate made of metals/alloys for micro structural refinement 
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and homogenization. Laser surface melting aims at increasing the hardness of the 
surface and improving corrosion resistance and fatigue property. Laser surface 
melting leads to the following changes on the surface of metals and alloys: 

(a) Polishing of surface 

(b) Grain refinement and homogenization 

In the present section, both laser polishing and grain refinement and homogeni- 
zation will be discussed in detail. 


Laser Polishing of Metals 

Polishing is an integral part of manufacturing which involves smoothening of the 
surface by materials removal. Manual polishing alone accounts for 37 % of the total 
production time and 30 % of production cost in mold and die making industries 
(Nusser et al. 2011). Moreover, manual polishing is time consuming and involves a 
high production cost. The polishing process can be automated to reduce the cycle 
time. However, polishing of materials with complex size and shape often restricts 
the application of automatic polishing unit. Laser beam irradiation may be applied 
for polishing of materials which involves the process of reducing surface roughness 
by (a) surface melting or (b) ablation of materials from the surface. The advantages 
of laser surface polishing over conventional polishing techniques include a reduc- 
tion in polishing time from 3.5 h to 10 min, the ability to polish objects of complex 
geometry, micro structural refinement, and associated improvement in properties of 
the object. Over the last decade, laser polishing (LP) process applicability was 
demonstrated on glass, lens, fibers, and diamond polishing and it has been extended 
toward polishing of metallic materials recently (Bol’shepaev and Katomin 1997). 
The parameters influencing the quality of the polishing include topography, ther- 
mal, and optical properties of the substrate, laser power, pulse duration, pulsing 
frequency, beam shape, speed, tool path trajectory, number of passes, percentage of 
overlap, etc. Among all these parameters energy density plays an important role to 
determine the quality to a maximum extent. 

Figure 11 shows the schematic of laser melting assisted surface polishing 
technique. In laser polishing, a laser beam with sufficient energy is applied on the 
surface leading to surface melting of a thin layer of the work-piece which subse- 
quently, flows and redistributes under the action of surface tension, which evens out 
the surface asperities and reduces the surface peak to valley heights after rapid 
solidification. 


Grain Refinement and Homogenization 

When a laser beam melts the surface it causes a significant refinement of micro- 
structure due to a very fast quenching rate associated with the process, due to the 
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Fig. 11 Schematic showing 
laser assisted surface 
polishing of metals 


Velocity of laser beam 


V 




Laser 


Initial surface 
topography 


Polished surface 


presence of the underlying substrate which acts as quenchant. The microstructural 
refinement leads to a significant improvement in hardness, wear, and corrosion 
resistance properties. The morphology of the micro structure is determined by the 
ratio of temperature gradient of the liquid at the solid-liquid interface (G) and 
solidification velocity (R); i.e., G/R. During laser surface melting, a high solidifi- 
cation velocity is attained and it at times exceeds the local diffusion rate and due to 
it, equilibrium at the interface can not be established and the solute atoms are frozen 
into a solid at the same composition as they arrive at the interface (solute trapping). 
However, at a lower solidification rate (slower than the diffusion rate), local 
equilibrium can be established and the interface composition can achieve the 
same composition as that at the melt. During laser surface melting, due to a rapid 
cooling there is microstructural homogenization, extension of solid solubility and 
dissolution/redistribution of precipitates or inclusions in the matrix. During laser 
surface melting, when the cooling rate exceeds the critical cooling rate for glass 
forming, there may be formation of glassy phase for the alloys with a high glass 
forming ability. Laser surface melting with a high intensity laser beam at a very 
rapid traverse rate may be applied to do surface glazing of ceramics and metals. 
Laser surface melting has been successfully employed on tool steel, titanium, and 
magnesium substrates to refine and homogenize the micro structure and subsequent 
improvement in its corrosion resistance property. In the past, laser surface melting 
was reported to improve the corrosion resistance (in NaCl solution) of 440C 
martensitic stainless steel by refinement of carbides and its re-distribution 
(Lo et al. 2003). De-sensitization of AISI 304 stainless by laser surface melting 
has been successfully reported on AISI 304 stainless steel which is achieved by 
complete dissolution of M 2 3 C 6 type of carbides and subsequently, homogenization 
of micro structure (Yanez et al. 2002). Dutta Majumdar et al. (2010) surface melted 
AISI 52100 steel using a high power (2 kW) continuous wave C0 2 laser (with a 
beam diameter of 3 mm) with a linear scan speed of 1-2 m/min, using different 
proportions of argon (Ar) as shrouding atmosphere and observed an improvement 
in wear and corrosion resistance. A detailed microstructural investigation shows a 
significant refinement of the micro structure (grain size) with the presence of lath 
martensite (Dutta Majumdar et al. 2010). The wear behavior of the surface treated 
component against hardened steel ball (of 3 mm diameter) at an applied load of 2 kg 
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and at 15 numbers of revolution showed a marginal improvement in wear resistance 
of laser surface melted surface as compared to as-received SAE 52100, which is 
attributed to improvement in hardness due to grain refinement, refinement of 
carbides, and presence of martensite in the micro structure. The corrosion behavior 
of laser surface melted SAE 52100 steels was evaluated by potentiodynamic 
polarization studies in a 3.56 wt% NaCl solution. From the calculated values of 
corrosion parameters, it was observed that corrosion voltage (E corr ) does not vary 
significantly with laser surface melting, however, there was a marginal reduction in 
corrosion rate (2.32 mm/year for laser surface melting in comparison to 2.60 
mm/year for as-received substrate) and an improvement in critical potential for 
pit formation (Epit) (—270 mV (SCE) for laser surface melting in comparison to 
—430 mV(SCE) for as-received substrate). Tsay et al. (2001) studied the effect of 
laser annealing of aged 17-4 PH (AISI 630) steel (aged at 482 °C) using a 5 kW 
Rofin-Sinar C0 2 laser on its properties. Laser annealing led to an improved fatigue 
property predominantly because of the presence of residual compressive stresses. 

Weld decay in austenitic stainless steels is caused by the precipitation of 
chromium carbides along grain boundaries when the material is subject to a thermal 
cycle as is the case in the welding process. For example, cooling rate involved in 
MIG welding is sufficiently low to form such carbides along the grain boundaries 
within the HAZs, leading to intergranular corrosion within the HAZs, called 
sensitization. Laser surface melting of the HAZs dissolves these carbides, homog- 
enizes the distribution of Cr through liquid state diffusion and convection. Then the 
rapid cooling is sufficiently high to avoid the re-precipitation of the carbides, thus 
desensitizing the material. 

Magnesium and its alloys are widely used in automotive and aerospace appli- 
cation because of its light weight. However, poor wear and corrosion properties are 
of serious concern for its application as structural components. Laser surface 
melting was found to be an effective tool to enhance wear and corrosion properties 
of magnesium based alloys (Dutta and Manna 2011). Laser surface melting of a 
magnesium alloy (MEZ) using a continuous wave C0 2 laser was reported to 
improve its mechanical and electrochemical properties (Dutta et al. 2003). Figure 12 
shows scanning electron micrograph of the cross section of the laser surface melted 
MEZ alloy, lased with a power of 2 kW and scan speed of 200 mm/min, which 
shows the presence of refined columnar grains growing epitaxially from the 
liquid-solid interface (Dutta et al. 2003). The precipitates observed along the 
grain boundaries of MEZ are Mg/Zr/Ce-rich compound (confirmed by energy 
dispersive spectroscopy). Furthermore, the melted zone -substrate interface is 
crack/defect-free and well compatible with practically no noticeable amount of 
heat-affected zone. Figure 13 presents the microhardness profiles of the laser 
surface melted MEZ as a function of vertical distance from the surface measured 
on the cross sectional plane and shows that microhardness of the melted zone has 
significantly increased by two to three times (85-100 VHN) that of the substrate 
(35 VHN) primarily due to grain refinement and solid solution hardening (Dutta 
et al. 2003). The average microhardness of the surface melted zone was found to 
vary with laser parameters. Figure 14 compares the kinetics of pit formation in 
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Fig. 12 Scanning electron 
micrograph of the cross 
section of the laser surface 
melted MEZ alloy, lased with 
a power of 2 kW and scan 
speed of 200 mm/min 



Fig. 13 Microhardness 
profiles of the laser surface 
melted MEZ as a function of 
vertical distance from the 
surface (Dutta et al. 2003) 



Depth (pm) 


Fig. 14 Kinetics of pit 
formation in terms of 
cumulative area fraction pits 
as a function of time in a 3.56 
wt% NaCl solution for 
as-received (plot 1 ) and laser 
surface melted MEZ (plot 2) 
lased with a power of 2 kW 
and scan speed of 
200 mm/min (Dutta 
Majumdar et al. 2010) 



Immersion time(h) 


terms of cumulative area fraction pits as a function of time in a 3.56 wt% NaCl 
solution for as-received (plot 1) and laser surface melted MEZ (plot 2) lased with a 
power of 2 kW and scan speed of 200 mm/min (Dutta et al. 2003). From Fig. 14 it 
may be noted that in as-received MEZ, the extent and rate of pitting are significantly 
reduced following laser surface melting, where, visible pits are observed in laser 
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surface melted MEZ only after 12 h of exposure. Furthermore, the pits in the laser 
surface melted samples are both smaller in dimension and remain isolated com- 
pared to larger and interconnected pits in as-received MEZ. However, the rate of 
pitting is initially faster and decreases after 30 h of immersion time in as-received 
MEZ. Schippman et al. (1999) reported the improvement of corrosion resistance of 
AZ91 substrate by grain refinement due to excimer laser surface treatment of AZ91 
substrate. 

Several attempts have been made to improve corrosion resistance of Al-based 
alloys by laser surface melting by grain refinement and micro structural homo- 
genization (de Mol van Otterloo and De Hosson 1994). Yue et al. (1999) surface 
melted 7075-T65 1 Al alloy using an excimer laser with a pulse energy of 8 J/cm to 
develop a melt layer free from any grain boundary precipitates, and hence, a 
significant improvement in inter- granular corrosion along with stress corrosion 
cracking. A similar response was also observed in excimer laser surface melted 
(using an excimer laser (KrF) with a wavelength of 248 nm) 7075-T651 aluminum 
alloys with a reduction in both the number and size of constituent particles and 
a refinement of the grain structure resulting in improvement in corrosion resistance. 
Li et al. (1996) surface melted 2024-T351 aluminum alloy using a continuous 
wave C0 2 laser and showed an improvement in both general corrosion and 
pitting corrosion resistance. Badekas et al. (1998) irradiated an Al-4.5%Cu 
alloy with an excimer laser. The residual stress of the treated surface changed 
from tensile to compressive with an improved microhardness. Gremaud 
et al. (1990) laser surface melted Al-Fe alloy (with Fe content varying from 0.25 
to 8 wt%) using a 1.5 kW continuous wave C0 2 laser and observed significant 
influence of process parameters on the morphology and degree of fineness of the 
precipitates/grains . 

Hari Prasad and B alasubramaniam (1997) laser irradiated an Al-Li-Cu alloy 
(of nominal composition 2.5 % Li, 1 % Cu, and 0.12 % Zr) with a Nd:YAG laser 
for the purpose of surface modification and studied its oxidation behavior at 450 °C in 
air by thermogravimetry and compared it with the untreated specimen. Parabolic 
oxidation kinetics was observed for both the laser-treated and untreated specimens. 
The laser-treated specimen exhibited improved oxidation resistance. X-ray diffraction 
analysis indicated that the major component of the scale was Li 2 C0 3 . The improved 
oxidation behavior of the laser-treated specimen was related to the compact oxide 
structure produced on its surface during oxidation. Wang et al. (2002) melted a series 
of Al-Si alloys with various Si contents using a 2 kW C0 2 continuous wave laser. 
It was found that laser treatment refined the grains and altered the morphology of 
eutectic silicon from lath-like to coralline-like with a higher Si content in Al matrix 
as compared to equilibrium Si content in Al. X-ray diffraction analysis illustrated that 
the lattice parameter of Al was reduced after laser melting. A series of Al-transition 
metal alloys (Al-Cu, Al-Si, Al-Zn, Al-Fe) were melted by Watkins et al. (1998) 
leading to an improved hardness, wear and pitting corrosion resistance under 
optimum process parameters. 
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Laser Surface Alloying 

Laser surface alloying involves melting of a pre-deposited coating layer or simul- 
taneously added alloying ingredients along with a part of the underlying substrate to 
form an alloyed zone for improvement of wear, corrosion, and oxidation resistance. 
Surface alloying may be achieved by the deposition of coating materials on the 
surface prior to laser melting which may be termed as pre-deposition or by feeding 
of alloy ingredient simultaneously during melting of surface, which may be termed 
as co-deposition. Figure 15a illustrates the schematic of laser surface alloying with 
a continuous wave laser, consisting of three major components, namely a laser 
source with a beam focusing and delivery system and a CNC controlled sweeping 
stage, where the specimen is mounted for lasing, the provision for introducing 
controlled atmosphere and a powder delivery system through which alloy powders 
are delivered (Dutta and Manna 2011). The process consists of three steps: melting 
of alloying elements (pre-deposited or deposited simultaneously during 
processing), melting of substrate, intermixing of the substrate and alloying elements 
in molten stage, and a fast quenching forming the alloyed zone on the top of the 
substrate (Fig. 15b). Usually, a 20-30 % overlap of the successive molten/alloyed 
tracks is intended to ensure microstructural/compositional homogeneity of the laser 
treated surface. The parameters influencing the quality of the alloyed zone in terms 
of depth, chemistry, micro structure, and surface properties include incident power/ 
energy, beam diameter/profile, interaction time/pulse width, pre or co-deposition 
thickness/composition, and concerned physical properties like reflectivity, absorp- 
tion coefficient, thermal conductivity, melting point and density. 

The main characteristic of this process is the possibility of producing a surface 
layer with extended solid solubility, a refined and homogeneous micro structure, and 
no restriction in the selection of alloying elements. However, attention needs to be 
paid to the relative melting temperature, vaporization temperature, and vapor 
pressure of the alloying elements and the substrate. Laser processing will be 
difficult if the added element vaporizes at a temperature lower than the substrate 
material. For example, it would be difficult to surface alloy copper with zinc due to 
a lower temperature of vaporization of Zn (900 °C) than the melting point of copper 
(1,100 °C) at 1 atm. 

Laser surface alloying was reported to improve the structural properties of 
material like wear, corrosion, and oxidation resistance on a large numbers of 
systems (Dutta Majumdar and Manna 2003). Laser surface alloying of Mo on 
AISI 304 stainless steel by plasma spray deposition of Mo and subsequently, 
laser surface melting significantly improved the pitting corrosion resistance of 
AISI 304 stainless steel substrate (Dutta and Manna 1999). However, optimization 
of process parameters was essential for the formation of a homogeneous micro- 
structure and composition for improvement in pitting corrosion resistance and 
mechanical property (Dutta and Manna 1999). Potentiodynamic anodic polariza- 
tion tests in 3.56 wt% NaCl solution (both in forward and reverse potential) showed 
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Fig. 15 Schematic illustration of (a) laser surface alloying with a continuous wave laser, (b) stage 
in alloy formation by intermixing of the substrate and alloying elements (Dutta and Manna 2011) 


that the critical potential for pit formation (E PP1 ) and growth (E PP2 ) were signifi- 
cantly (two to three times) improved to 550 mV(SCE) after laser surface alloying as 
compared to 75 mV(SCE) in the substrate (Fig. 16) (Dutta and Manna 1999). E PP2 
has also been found to be nobler in as-lased specimens than that in the stainless steel 
substrate. The improvement in pitting corrosion resistance of laser surface melting 
as compared to plasma spray deposited surface and the substrate is attributed to 
sealing of porosities (as observed after plasma spray deposition) and enrichment of 
surface with Mo, respectively. Erosion corrosion behavior of the laser surface 
alloyed AISI 304 stainless steel was compared to the as-received one by evaluating 
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Fig. 16 Potentiodynamic 
anodic polarization tests in a 
3.56 wt% NaCl solution (both 
in forward and reverse 
potential) (Dutta and Manna 

1999) 
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Fig. 17 Cumulative loss of 
material per unit area (Am) 
due to erosion as a function of 
time (0 for as-received and 
laser surface alloyed AISI 
304 stainless steel with 
molybdenum (with a 
pre-deposit thickness of 
250 pm and energy density of 
38.1 MJ/m 2 ) (Dutta and 
Manna 1999) 



t(h) 


the kinetics of material loss by circulating (at 750 rpm) the specimen in a medium 
containing 20 wt% sand in 3.56 wt% NaCl solution as a function of time. Figure 17 
compares the kinetics of material loss in terms of variation of mass loss (Am) with 
time (t) for AISI 304 stainless steel and laser surface alloyed stainless steel with 
Mo, SS(Mo) under erosive corrosion condition (Dutta and Manna 1999). Laser 
surface alloying reduces the magnitude of Am by over an order of magnitude and 
decreases the kinetics of erosive corrosion loss in SS(Mo) more than that in 304-SS, 
especially beyond 20 h. Therefore, it appears that the present LSA routine is 
capable of imparting an excellent superficial microhardness and resistance to 
corrosion and erosion-corrosion to 304-SS. 

Oxidation is another serious mode of surface degradation that gets aggravated at 
an elevated temperature. Unlike electrochemical corrosion, oxidation occurs 
through dry reaction and solid state ionic transport through the oxide scale. In the 
past, several attempts have been made to enhance resistance to oxidation by laser 
surface alloying, cladding, and similar LSE techniques. Laser surface alloying of Ti 
with Si, A1 and Si + A1 (with a ratio of 3: 1 and 1:3, respectively) was conducted to 
improve the wear and high temperature oxidation resistance of Ti (Dutta et al. 2002; 
Dutta-Majumdar et al. 1999). Figures 18a-c show the (a) scanning electron micro- 
graph of the top surface of laser surface alloyed titanium with silicon, (b) isothermal 
oxidation behavior showing the variation of weight gain due to oxidation per unit 
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area as a function of time for as-received (plot 1) and laser surface alloyed titanium 
with (b) silicon, (c) 3 Si + A1 and (d) Si + 3A1 and (c) wear behavior in terms of 
depth of scratching (z w ) with load (L) due to scratching of pure Ti and laser surface 
alloyed Ti with Si, A1 and Si + A1 with a hardened steel ball. Figure 18a reveals the 
hyper-eutectic micro structure on the top surface of the alloyed zone consisting of 
uniformly distributed faceted Ti 5 Si 3 phase in a two-phase eutectic aggregate of a-Ti 
and Ti 5 Si 3 (Dutta-Majumdar et al. 1999). The high volume fraction of the primary 
phase and degree of fineness of the eutectic products signify complete dissolution 
and uniform intermixing of Si in the alloyed zone, and a rapid quenching experi- 
enced by the latter, respectively. Subsequent oxidation studies conducted at 
873-1,023 K showed that LSA of Ti with Si and Si + Al significantly improved 
the isothermal oxidation resistance (Fig. 18b) (Dutta et al. 2002). The effect of laser 
surface alloying of Si and Al on the wear resistance of titanium was also studied. 
Figure 18c shows the variation of depth of scratching (z w ) with load (L) due to 
scratching of pure Ti and laser surface alloyed Ti with Si, Al and Si + Al with a 
hardened steel ball (Dutta et al. 2000). It may be noted that scratch depth varies 
linearly with load for all the cases. The effect of load on scratch depth is more 
prominent at higher number of scratching (>1,000) than that at a lower value of the 
same (= 25). Under comparable conditions of scratching, Ti undergoes the most 
rapid wear loss followed by that in laser surface alloyed specimens. Laser surface 
alloyed sample with Si undergoes the minimum wear loss, which is attributed to the 
formation of hard Ti 5 Si 3 precipitates in the alloyed zone (Dutta et al. 2000). 

Sicard et al. (2001) reported enhancement of mechanical and chemical proper- 
ties of aluminum alloys (AlSi7Mg0.3) by excimer laser assisted nitriding treatment. 
The special nitriding treatment requires an excimer laser to irradiate the sample 
placed inside the cell with 1-bar nitrogen pressure allowing expansion of the vapor 
plasma, dissociation/ionization of nitrogen by laser generated shock wave, and 
subsequent penetration (adsorption and diffusion) of nitrogen from plasma up to 
some depth. While adequate laser fluence is required to create the plasma, but this 
fluence must be limited to prevent laser-induced surface roughness. The resultant 
polycrystalline nitride layer is several micrometers thick and comprises pure AIN 
columns (200-500 nm thick) on top of equiaxed AIN grains in the diffusion layer. 
Heat conduction calculations show that a 308-nm laser is better suited for greater 
nitride thickness, as it corresponds to a weaker reflectance value for aluminum. 
Tang and Man (2006) observed that laser surface alloying of manganese-nicke- 
l-aluminum bronze with Al was more effective in enhancing corrosion and cavita- 
tion erosion resistance in a 3.56 wt% NaCl solution than that after laser surface 
melting. 

Manna and Dutta Majumdar (1999) attempted to enhance the wear and erosion 
resistance of Cu by laser surface alloying with Cr (electrodeposited with 10 and 
20 pm thickness, t z ). Total Cr content (X Cr ), Cr in the form of precipitates (/ Cr ), and 
Cr in solid solution with Cu (Cu c r ) in the alloyed zone was determined by energy 
dispersive spectrometry, optical microscope, and X-ray diffraction technique, 
respectively. Laser surface alloying extended the solid solubility of Cr in Cu to 
as high as 4.5 wt% as compared to less than 1 wt% equilibrium solid solubility. 
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X Cr ,W Cr , Cu Cr (%) 

Fig. 19 Variation of average hardness in the alloyed zone (H“ v ) as a function of Cu Cr ,fcr andX Cr 
for laser surface alloyed copper with chromium with a pre-deposit thickness of 20 pm (Majumdar 
and Manna 1999) 

The microhardness of the alloyed zone was found to improve significantly (as high 
as 225 VHN) following laser surface alloying as compared to 85 VHN of the base 
metal. Figure 19 shows the variation of average hardness in the alloyed zone (H® v ) 
as a function of Cw Cr ,/ Cr , and X Cr for laser surface alloyed copper with chromium 
with a pre-deposit thickness of 20 pm (Majumdar and Manna 1999). From Fig. 19 it 
may be noted that average microhardness of the alloyed zone increases with 
increase in total chromium content, and hence, with solid solution, and with 
dispersoids. However, the contribution of solid solution is higher than dispersoids. 


Laser Composite Surfacing 

Metal matrix composite possesses an enhanced wear resistance greater than that of 
the base substrate or the matrix. However, volume fraction of reinforcement is 
restricted as a substantially higher amount of reinforcement reduces the toughness. 
Development of composite layer on the surface by conventional means is extremely 
difficult. Laser melting of substrate and subsequent feeding of ceramic particles into 
the molten matrix is an effective means of development of composite layer on the 
surface through a process called laser composite surfacing. Several attempts have 
been made to develop a composite layer on metallic matrix by this technique (Dutta 
and Manna 2011). Attempts were made to develop sub-micron sized titanium 
boride dispersed surface on AISI 304 stainless steel substrate by melting the surface 
of sand blasted AISI 304 stainless steel substrate using a continuous wave C0 2 laser 
and simultaneous deposition of a mixture of K 2 TiF 6 (potassium titanium 
hexafluoride) and KBF 6 (potassium hexafloroborate) (in the weight ratio of 2:1) 
at a flow rate of 4 g/min using Ar as shrouding environment (Dutta and Manna 
2011). The microhardness of the surface was improved 250-350 VHN as compared 
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to 220 VHN of the AISI 304 stainless steel substrate and found to vary with laser 
parameters. The improved microhardness of the composite layer is attributed to 
both grain refinement and dispersion strengthening. However, laser parameters 
should be precisely controlled to achieve an improved microhardness of the com- 
posite layer. A detailed wear behavior of as-received and laser composite surfaced 
AISI 304 stainless steel against hardened steel ball shows that rate of wear is lower 
in laser composite surfaced AISI 304 stainless steel as compared to the as-received 
one. A detailed investigation of the surface following wear shows that the mecha- 
nism of wear was mainly a combination of adhesive and abrasive in as-received 
stainless steel, however, predominantly abrasive for laser composite surfaced 
stainless steel. Similar observation was also noticed in laser surface alloyed A1 
with titanium boride (Dutta and Manna 2011). In the past, several attempts were 
made to develop a ceramic dispersed metal matrix composite surface on stainless 
steel substrate by laser surface alloying (Dutta and Manna 2011). However, a poor 
wettability of ceramic particulates in metallic matrix necessitates the application of 
binder for improving the particle-matrix bond strength. Dispersion of carbide based 
ceramic particles (WC, Cr 2 C 3 , SiC, TiC) on austenitic stainless steel UNS S3 1603 
was reported to improve the cavitation erosion characteristics of the surface- 
modified specimens in 3.56 % NaCl solution considerably (Cheng et al. 2001). 
St-Georges (2007) studied the effect of laser surface cladding of Ni-Cr-WC on the 
wear resistance of steel substrate (AISI 1020, Fe + 0.2C, wt%). In the past, laser 
surface alloying of AISI 304 stainless steel with WC, Ni, and NiCr was reported to 
develop a defect-free and homogeneous micro structure under optimum processing 
conditions (Anandan et al. 2012). It was observed that laser parameters played a 
crucial role in determining the microstructures of the alloyed zone. The average 
microhardness of the alloyed zone was found to be improved to 700-1,350 VHN 
(with laser parameter) as compared to 220 VHN of as-received y-stainless steel. It 
was observed that application of low scan speed developed a graded micro structure. 
Figure 20 shows the scanning electron micrographs (FESEM) of the top surface of 
laser surface alloyed AISI 304 stainless steel subjected to laser surface alloying 
using a 5 kW fiber optics delivery continuous wave (CW) Nd:YAG laser (with a 
beam diameter of 3 mm) by simultaneous feeding of 70WC-15Ni-15NiCr in the 
molten pool (at a powder feed rate of 10 g/s) with a power of 1 .5 kW and scan speed 
of 0.008 m/s using Ar as shrouding environment (at a flow rate of 5 1/min). From 
Fig. 20 it is evident that, in the micro structure, partially dissolved WC in grain 
refined matrix with precipitates of secondary carbides like M 23 C 6 (M = Cr, Ni, Fe, 
and W) (which is confirmed by X-ray diffraction analysis) and the eutectic mixture 
of austenitic steel (Fe-Ni-Cr) and M 23 C 6 are present. The different carbides as 
detected by EDS analysis are labeled in Fig. 20. A detailed study of the effect of 
process parameters on the micro structures was reported elsewhere (Anandan 
et al. 2012). Figure 21 shows the wear behavior of (a) as-received and laser surface 
alloyed AISI 304 stainless steels lased with a (b) 1.50 kW, 0.016 m/s, (c) 1.50 kW, 
0.008 m/s, and (d) 2.00 kW, 0.008 m/s, respectively against WC ball at an applied 
load of 10 N and under fretting motion (at a frequency of 10 Hz) in terms of 
cumulative depth of wear with time. From Fig. 21 it is evident that there is a 
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Fig. 20 Scanning electron 
micrograph (FESEM) of the 
top surface of laser surface 
alloyed AISI 304 stainless 
steel with 50WC-30Ni- 
20NiCr lased with a power of 
1.5 kW and scan speed of 
0.008 m/s 



Fig. 21 Wear behavior in 
terms of cumulative depth of 
wear with time for (< a ) 
as-received and laser surface 
alloyed AISI 304 stainless 
steels lased with a ( b ) 1.50 
kW, 0.016 m/s, (c) 1.50 kW, 
0.008 m/s and (d) 2.00 kW, 
0.008 m/s, respectively 
against WC ball at an applied 
load of 1 0 N and under 
fretting motion (at a 
frequency of 10 Hz) 



significant improvement in wear resistance property of laser surface alloyed sam- 
ples (plots b-d) as compared to the as-received one (plot a). Furthermore, laser 
parameters also play an important role in determining the degree of wear. A careful 
observation of different curves in Fig. 21 also reveals that in the case of as-received 
AISI 304 stainless steel, the kinetics of initial wear rate is higher as compared to 
laser composite surfaced state which is attributed to hardening of surfaces due to 
laser surface alloying, and hence, less degree of abrasive and fretting wear. Laser 
parameters play an important role in determining the rate of wear. It is observed that 
the effect of scan speed is more significant in contributing to wear rate as compared 
to applied power. The wear resistance property is improved when the scan speed is 
reduced which is predominantly due to enhanced hardness of the alloyed zone 
achieved because of the increasing area fraction of WC particles in the microstruc- 
ture (plot b vis-a-vis plot c). Increase in applied power reduces the kinetics of wear 
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Fig. 22 Coefficient of 
friction with number of cycles 
for (a) as-received and laser 
surface alloyed AISI 
304 stainless steels with ( b ) 
1.50 kW, 0.016 m/s, (c) 1.50 
kW, 0.008 m/s, (d) 2.00 kW, 
0.008 m/s, respectively 
against WC ball at an applied 
load of 1 0 N and under 
fretting motion (at a 
frequency of 10 Hz) 



at the initial state, however, under steady state, increased power does not cause any 
significant change in steady state wear value. The decreased wear rate at high power 
level is possibly due to the dissociation of WC at high power level and its 
redistribution to homogenize the microstructure and formation of a larger volume 
fraction of secondary carbides (Anandan et al. 2012). Figure 22 shows the variation 
of coefficient of friction with number of cycles for (a) as-received and laser surface 
alloyed AISI 304 stainless steels with (b) 1.50 kW, 0.016 m/s, (c) 1.50 kW, 0.008 
m/s, (d) 2.00 kW, 0.008 m/s, respectively against WC ball at an applied load of 10 N 
and under fretting motion (at a frequency of 10 Hz) obtained by wear testing. From 
Fig. 22 it may be noted that the coefficient of friction is very high initially for both 
the as-received and laser surface alloyed samples which is attributed to a strong 
adhesive bond formed between mating surfaces. Following an initial high value, the 
coefficient of friction decreases possibly due to dislodgement of the materials from 
the worn surface and subsequently, covering the interfaces between meeting sur- 
faces and acting as lubricant. In laser surface alloyed sample, the coefficient of 
friction reaches the steady state value thereafter. However, in AISI 304 stainless 
there is further increase in coefficient of friction from 0.5 to 0.55, following which it 
reaches the steady state. The hike in coefficient of friction for the second time in 
as-received AISI 304 stainless steel is attributed to damage of the surface and its 
roughening due to fretting action. From Fig. 22 it is also observed that coefficient of 
friction under steady state is significantly lower in laser surface alloyed samples as 
compared to that of the as-received one, which is attributed to its high hardness 
(Anandan et al. 2012). In this regard, it is also relevant to note that increased scan 
speed reduces the coefficient of friction under steady state which is possibly due to 
an increased hardness at reduced scan speed. On the other hand, increase in power 
does not contribute significantly to the coefficient of friction at steady state. 

Table 1 summarizes the corrosion parameters of as-received and laser surface 
alloyed AISI 304 stainless steels derived from potentiodynamic polarization study 
in a 3.56 wt% NaCl solution. From Table 1 it is evident that laser composite 
surfacing shifts the corrosion potential toward the nobler direction which is 
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Table 1 Summary of corrosion parameters of as-received and laser surface alloyed AISI 
304 stainless steel derived from potentiodynamic polarization study in a 3.56 wt% NaCl solution 


s. 

Laser power 

Scan speed 

F V 

^corr y 

Epiti V 

Corrosion 

No 

(kW) 

(m/s) 

(SCE) 

(SCE) 

rate 

1 

Base metal 

-0.937 

-0.015 

7.8 x 10“ 3 

2 

2.00 

0.008 

-0.844 

-0.003 

5.62 x 10 3 

3 

2.00 

0.030 

-0.513 

0.032 

2.687 x 10 3 


attributed to the presence of Cr and Ni in solution with AISI 304 stainless steel, 
hence, increasing the corrosion resistance properties. Application of very low scan 
speed increases the area fraction of WC in the matrix, which may increase the 
probability of galvanic attack at the interface, and hence, makes the matrix prone to 
corrosion. On the other hand, there is significant reduction of corrosion rate when 
applying a higher scan speed. The comparison of potential for pit formation (E pitl ) 
shows that E pitl shifts toward nobler direction for laser surface alloyed samples and 
it is more significant when applying a higher scan speed. Hence, it may be 
concluded that the pitting corrosion resistance is improved due to laser surface 
alloying due to the presence of more Cr and Ni in solution and micro structural 
homogenization. The presence of carbides might have detrimental influence in 
corrosion resistance property as corrosion resistance property is increased by 
reducing carbide content in the alloyed zone. A systematic study of the effect of 
laser parameters on the corrosion behavior however shows that a maximum 
improvement in corrosion resistance is achieved when lased with a power of 
2 kW and scan speed of 0.030 m/s. 

In another effort, laser surface alloying of AISI 304 stainless steel was carried 
out using a 5 kW fiber optics delivery continuous wave (CW) Nd:YAG laser (with a 
beam diameter of 3 mm) by simultaneous feeding of WC-Co-NiCr (in the weight 
ratio of 20:40:40) in the molten pool (at a powder feed rate of 10 g/s) and using Ar 
as shrouding environment (at a flow rate of 5 1/min). Figure 23 shows the scanning 
electron micrograph of the top surface of laser surface alloyed AISI 304 stainless 
steel with WC-Co-NiCr lased with a power of 0.75 kW, scan speed of 0.012 
m/s. From Fig. 23 it may be noted that the micro structure of the alloyed zone 
consists of the presence of large grains (of size ranging from 5 to 7.5 pm) with very 
fine sub grains with average grain size .25-5 pm. Furthermore, there is the presence 
of very fine nano-size precipitates of carbides dispersed randomly within the grains. 
Few carbide precipitates are also observed along the grain boundary region. The area 
fraction of carbide precipitated inside the grains were also reduce by application of 
higher power. The morphology of microstmcture was changed from equiaxed to 
cellular, when applied power was higher. Detailed X-ray diffraction profile analysis 
of the alloyed zone shows that there are WC, W 2 C, M 23 C 6 phases in austenite matrix, 
the mass fraction of individual phase was found to vary with laser parameter. 

Figure 24 shows the micro hardness profile with depth from the surface of laser 
surface alloyed in AISI 304 with WC-Co-NiCr laser with a power of (a) 2.5 kW, 
scan speed .012 m/s; (b) 2.5 kW, scan speed .016 m/s, and (c) 2.0 kW, scan speed 
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Fig. 23 Scanning electron 
micrograph of the top surface 
of laser surface alloyed AISI 
304 stainless steel with WC- 
Co-NiCr lased with a power 
of 0.75 kW, scan speed of 
0.012 m/s 



Fig. 24 Microhardness 
profiles as a function of depth 
from the surface along the 
cross section of laser alloyed 
304 stainless steel lased with 
an applied power of 2.5 kW, 
scan speed of 12 mm/s (plot 
a); 2.5 kW, 16 mm/s (plot b ); 
and 2.0 kW, 12 mm/s (plot c ) 
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.012 m/s. From Fig. 24 it may be noted that there is an improvement of hardness 
from 250 VHN to 320 VHN due to laser surface alloying. Furthermore, a maximum 
hardness is achieved on the surface which decreases gradually with depth from the 
surface. Decrease in micro hardness is attributed to coarsening of micro structure 
with increase in depth and also due to the decrease in mass fraction of precipitated 
with depth. A comparison of micro hardness values with process parameter shows 
that laser parameters do have marginal influence on the hardness and its distribu- 
tion. With an increase in scan speed and a decrease in applied power, there is a 
marginal improvement in microhardness value. 


Laser Surface Cladding 

Laser surface cladding is a technique, where, a high power laser beam is used as a 
source of heat to melt a pre-deposited coating layer or simultaneously added 
cladding ingredients along with a partial melting of the underlying substrate to 
form a clad zone with a minimum dilution at the interface for improvement of wear, 
corrosion, and oxidation resistance (Dutta and Manna 2011). In laser cladding 
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Table 2 Summary of thermal spraying, weld overlaying, and laser cladding processes 


Coating 

process 

Thermal spraying 

Weld overlaying 

Laser cladding 

Heat source 

Combustible fuel, electric 
arc or plasma arc 

Combustible fuel, electric 
arc or plasma arc 

High intensity 
laser irradiation 

Bond strength 

Moderate, mechanical 
interlocking 

High, metallurgical bond 

High, 

metallurgical 

bond 

Coating 

structure 

Splat like, porous 

Dense, solidified 

Dense, refined 

Thermal load 
to substrate 

Minimum 

Very high 

Minimum 

Dilution 

Nil 

Moderate to high 

Low 

Heat affected 
zone 

Nil 

Wide 

Very shallow 

Coating 

thickness 

25 pm to few mm 

Several mm 

50 pm to several 
mm 

Productivity 

Low to high 

Low to very high 

Low to moderate 

Cost 

Low to high 

Low to moderate 

Moderate to high 


process, the coating materials in the form of powder or wire is melted using a high 
power laser and subsequently deposited on the surface of the substrate, leading to 
the development of deposited surface with a minimum dilution at the interface. The 
characteristic features of laser cladding process include (a) development of a 
refined micro structure, (b) negligible heat affected zone, (c) a strong bonding at 
the interface due to chemical interaction of the clad layer with substrate and 
flexibility of cladding on substrate of any shape and size. 

Table 2 compares the characteristics of laser cladding with conventional weld 
surfacing and coating by thermal spraying. From Table 2 it may be noted that being 
a high intensity, well focused, and controllable heat source, laser clad offers a 
defect free clad zone with a minimum dilution at the interface and is an emerging 
technique for the development of clad zone with a higher productivity. Laser 
cladding on pure A1 substrate was carried out using a continuous wave Nd:YAG 
laser and a coaxial powder injection system to develop composite coatings of 0-40 
wt% Si and 0-30 wt% TiC particles in A1 matrix (Dubourg et al. 2005). The 
micro structure of the coatings was homogeneous and crack/pore free with uniform 
distribution of carbides. The addition of Si and TiC reinforcement increased the 
bulk hardness and wear resistance of the coatings. Katipelli et al. (2000) have 
achieved considerable improvement in oxidation resistance of 6061 Al alloy by 
laser surface alloying with Al + TiC. 

Aihua et al. (1993) developed a graded coating consisting of a Ni-clad Al bond 
layer, a 50 wt% Ni-clad Al + 50 wt% (Al 2 0 3 -13 wt%Ti0 2 ) intermediate layer and an 
Al 2 0 3 -13 wt%Ti0 2 over-layer (or ceramic layer) on an Al-Si alloy substrate by 
plasma spraying followed by laser surface remelting. While the plasma- sprayed 
coatings seem prone to spallation at the different depths originating from macrocracks 
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in the ceramic layer during thermal cycling, the laser-remelted coatings, containing a 
network of microcracks in the ceramic layer reveals a significantly reduced tendency 
of spallation at the interface between the intermediate and bond layer. Thus, laser 
remelting improved the spalling resistance of plasma- sprayed coating to thermal 
shock, though cracking due to thermal shock remains a problem to be solved. An 
aluminum oxide layer of 100 pm thickness has been successfully coated on aluminum 
alloy 606 1 and pure aluminum using a powder mixture of silicon oxide and aluminum 
by laser cladding using a continuous wave C0 2 laser. A strong AI/A1 2 0 3 interface 
was formed possibly due to an exothermic chemical reaction between Si0 2 and A1 
(Zhou et al. 1993). Uenishi and Kobayashi (1999) obtained Al 3 Ti dispersed 
intermetallic-matrix composite (IMC) on Al by laser surface cladding. 


Process Parameters in Laser Surface Engineering 

The quality of the surface engineered product is defined by the following param- 
eters: (a) microstructural and compositional homogeneity, (b) surface topography, 

(c) residual stress distribution, and (d) thickness of the treated layer. All the above 
mentioned parameters are dependent on the following variables: 

(a) Type of laser used 

(b) Laser power 

(c) Beam size 

(d) Beam configuration from the laser cavity 

(e) Travel speed of the workpiece 

(f) Mode of alloy addition 

(g) Pre-treatment of the substrate 

Types of Laser Used 

Depending on the type of laser and wavelength desired, the laser medium could be 
solid, liquid or gaseous. Different laser types are commonly named according to the 
state or the physical properties of the active medium. Consequently, there are glass 
or semiconductor, solid state, liquid, and gas lasers. Gas based lasers can be further 
subdivided into neutral atom lasers, ion lasers, molecular lasers, and excimer lasers. 
Among the commercially available lasers, the lasers commonly applied for surface 
engineering include Nd:YAG laser, semiconductor laser (AlGaAs, GaAsSb, and 
GaAlSb), C0 2 laser and excimer laser (XeCl, KrF). The intensity of the laser beam 
together with the interaction time dictates the interaction phenomena’s effective- 
ness. Absorption of laser radiation is also determined by the wavelength (X) of the 
incident laser beam. Figure 25 summarizes the effect of wavelength of laser on 
the absorptivity of different materials graphically. The materials in Fig. 25 are 
summarized into three categories, i.e., metals 1 with full inner shells electron 
(Au, Ag, Cu, etc.), metals 2, i.e., transition metals (Fe, Ni, Cr, etc.), and insulator 
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Wavelength X (gm) 

Fig. 25 Graphical presentation of variation of absorptivity of different materials with wavelength 
of laser 


with the wavelength of laser (Fig. 2). From Fig. 25 it may be noted that for metallic 
materials with full inner shells electrons, absorptivity decreases from 90 % to 5-7 % 
at a wavelength of 0.35 pm. For metals 2 (i.e., transition metals), absorptivity 
decreases gradually with increase in wavelength of laser. In insulators, absorptivity 
decreases from 100 % to 4-5 % at a wavelength of 0.3 following which it remains 
the same up to a wavelength of 2.5 and again it starts rising sharply to 90 % at a 
wavelength larger than 5 pm. 


Applied Power 

Laser power necessary for surface melting of metallic materials is generally high 
due to reflectance of the laser beam and high thermal conductivity. Reflectivity of 
the metal surface is actually related to electrical conductivity. For instance, it is 
difficult to surface melt aluminum or copper with a C0 2 laser beam because these 
materials absorb little laser power. For metallic materials, laser power for surface 
alloying should exceed 0.5 kW. 


Beam Diameter 

The beam diameter determines the power density on the specimen surface (power 
density is defined as the power divided by the cross sectional area of the laser 

o 

beam). Typical values of power density during laser surface alloying are 10-10 
W/cm . The beam diameter is difficult to measure. A simple method is to measure 
the distance of the focal point from the workpiece surface and relate it to the beam 
size. Increasing the beam size by defocusing or other means decreases the heat input 
per unit volume, reduces melt penetration, and increases melt width. 
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Beam Shape 

The energy distribution and shape of the beam is described by the beam configu- 
ration or beam profile. Four beam profiles are made available by the lasers in the 
conjunction with beam delivery systems. These include Gaussian, multimode, 
square (or rectangular), top hat (Fig. 4). It is seldom possible to produce Gaussian 
beam configuration (TEMP 0 o) at high power although many high power laser 
manufacturers claim it can be done. A Gaussian beam is most suitable for cutting 
and welding applications rather than surface treatment because, being a “sharp 
tool,” it tends to vaporize and melt the substrate deeply. In contrast, multi-mode, top 
hat, and square profiles (“blunt tool”) are preferred for surface alloying. These beam 
profiles offer alloyed casings with wider coverage rates and uniform case depths. 

Square and rectangular beam profiles are generated by using an optical integrator 
or scanner. Oscillating a Gaussian beam at high frequency using a rotational mirror 
in the beam delivery system may produce a square or rectangular profile because of 
the power. 


Relative Speed Between the Laser Beam and Work-piece 

The travel speed of the workpiece during laser surface alloying is a very important 
parameter as it controls the diffusion time of the alloying elements, changes the 
melt depth, width, and profile, and determines the micro structures of surface alloys 
to a greater extent than other variables. 


Mode of Alloy Addition 

Alloying elements in the laser melt pool may be introduced by means of 
pre-deposition methods or codeposition methods. Predeposition methods include 
electroplating, thermal spraying, vacuum evaporation, etc., which are done prior to 
laser melting. Co-deposition involves injecting powder, wire, or rod forms of alloys 
into the laser generated melt pool of the substrate. The one step co-deposition 
method is more attractive than the two step deposition technique, which also offers 
the advantage of real time control on the supply of alloying element. Pre-deposition 
variables include thickness, particle size, composition, and method application. 


Shield Gas 

Shield gas serves two functions in laser surface treatments: (a) it shields the heated/ 
melt zone from oxidation and (b) it protects the focusing optics from the fumes. 
Argon and nitrogen shield gases are normally used and typical flow rates are around 
20 1/min. The flow rate will depend on the method of shielding and also diameter of 
nozzle that is being used to deliver the gas. 
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Applications of Laser Surface Engineering 

Because of the directional nature of laser light, laser surface engineering may be 
applied to specific areas of a work piece. However, the main disadvantages of 
laser surface engineering with respect to the other technologies are the high initial 
cost of the equipment and inability to apply on large areas. However, laser 
surface engineering is an environmentally friendly technique which is flexible 
and versatile in nature. In this regard, it is relevant to mention that in spite of 
the large number of advantages associated with laser surface engineering, the 
process is yet to be industrially popularized due to lack of available information 
on laser-matter interaction phenomena in different materials and the change in 
materials, microstructures, and properties developed in the hardened or alloyed/ 
clad zones. 

In this regard, it is relevant to mention that, among all the existing surface 
engineering techniques, laser surface hardening and laser surface cladding are two 
important techniques which are enjoying market status for the development of wear 
and corrosion resistance surfaces for automotive and aerospace application. Notable 
examples of applications of laser surface engineering include the following. 


Hardening of Steering Gear Assemblies 

The first commercial production application of laser heat treating is the hardening 
of steering gear assembly (Ready and Farson Dave 2001). Originally designed as a 
batch furnace heat treatment process, automobile power steering pumps became a 
subject for process improvement as the cost of energy made a 24-h furnace schedule 
expensive. Rather than heating 5 kg of ferritic malleable cast iron, engineers at the 
steering gear division of a U.S. automotive company decided to produce a localized 
wear-resistant surface on the inner bore of the pump housing, as protection against 
expected wear from the piston operating under heavier loads. Experiments with a 
defocused C0 2 laser beam indicated that a series of five hardened tracks, strategi- 
cally placed, would produce sufficient wear resistance without part distortion. In 
practice, the beam from a kilowatt-level C0 2 laser is directed through a rotating 
optic device to the inside bore of the pump housing. A disposable, piston-mounted, 
mirror is sequenced up and down so that the beam produces vertical stripes on 
the steering gear bore. A large-diameter, low-power density spot of laser light 
produces the heat required to transformation harden a 2.5-mm-wide track to a 
depth of 0.35 mm. 


Diesel Engine Cylinder Liners 

Fuel used in railroad diesel engines contains abrasive residues such as vanadium 
which causes excessive scuffing on the bore wall of malleable cast iron cylinder 
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liners. One of the most logical choices to obtain the desired wear resistance was 
chrome plating to provide a smooth, tough, and chemically resistant, surface. 
Unfortunately, the chrome plating process has severe environmental problems, 
specifically with regard to treating the effluent from the process. Laser transforma- 
tion hardening is an effective route to improve wear resistance of the surface with 
its capability in development of precision hardened layer with a minimum heat 
affected zone and minimum distortion of the substrate. 


Turbine Blades 

Hardening of blades for steam turbines, which can be eroded by water droplets during 
turbine operation, is another area of application of laser surface hardening. Other 
approaches to increase the hardness of the blades, like flame hardening or cladding 
with a hard metal, like Stellite , had various disadvantages, as indicated by Ready 
and Farson (2001). Laser surface hardening of turbine blades have been used in 
several countries for a number of years, as indicated by Ready and Farson (2001). 


Stub Axles 

The selective laser hardening of large stub axles used for industrial dump trucks 
proved to be an economically viable alternative compared to other bulk hardening 
processes. A type AISI 4140 medium carbon steel is used for the manufacturing of 
the ~45 kg components, with specified areas required to have high wear and fatigue 
resistance. 


Application of Laser Surface Alloying/Cladding 

Laser cladding may be applied for the manufacturing of spare parts and mainte- 
nance and repair of worn components and equipment. Laser surface cladding 
is used to produce surfaces which are resistant against abrasive, erosive and 
adhesive wear, wet corrosion, high temperature oxidation and corrosion, etc. 
Typical applications of laser coatings are in shafts, rods and seals, valve parts, 
sliding valves and discs, exhaust valves in engines, cylinders and rolls, pump 
components, turbine components, wear plates, sealing joints and joint surfaces, 
tools, blades, and molds. 

The future scope of research activities of laser surface engineering include 

(a) development of multifunctional graded surface for structural application, 

(b) nano -structuring of the surface to improve mechanical and electrochemical 
properties, (c) development of nano-composite surface, and (d) possibilities in 
research activities for the application of hybrid coating techniques for optimum 
micro structures and properties. 
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Summary 

In the present contribution, the principle, brief description and present status of 
individual laser surface engineering technique and scope of application have been 
presented. From the detailed review it may be concluded that laser surface engi- 
neering has an immense scope of application in the manufacturing sectors for 
precision surface processing. However, the residual stress developed on the surface 
and introduction of surface roughness are the major troubles associated with laser 
surface engineering. Optimization of process parameters for minimization of the 
above mentioned problems are the main challenges associated with process design- 
ing. Furthermore, due to the nonequilibrium processing state, prediction of micro- 
structure and hence, properties is the key issue which needs to be understood for the 
selection of process parameters for specific need. 
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Abstract 

Laser manufacturing techniques belong to the most promising and efficient ones, 
contributing to the technological development in many industry branches and espe- 
cially those in which material processing dominates. Laser treatment is characteristic 
of contactless operation, selectivity, and possibility of full process automation. 
Contactless nature of laser treatment guarantees cleanliness of the treatment location 
and makes also possible remote control of the laser beam through transparent protec- 
tion barriers, in vacuum, gas atmosphere, or under water. It is important that one can 
concentrate a laser radiation beam to a very small dimension, even as small as a portion 
of a micrometer. This makes it viable to achieve externally big power concentration 
values and selective acting with the beam on carefully selected material areas, e.g., in 
locations hard to access, subjected to mechanical loads, etc., with no fear of the effect 
of the delivered heat on the adjacent areas, neighboring elements and part deformation. 

Following an introduction of laser fundamentals in the first part of the 
chapter, theoretical aspects associated with laser radiation and the structure 
and operating principle of lasers were discussed in the second part of the present 
chapter concerning laser treatment of metallic and nonmetallic materials. The 
third part of the work presents knowledge devoted to the most popular tech- 
niques of laser treatment of engineering materials such as hot working; 
remelting; laser alloying/cladding; laser hardfacing; laser-assisted chemical 
and physical vapor deposition (LACVD and LAPVD); laser treatment of func- 
tional materials (e.g., silicone texturization); laser cleaning, cutting, drilling, and 
marking; and laser micromachining. The application examples of lasers in 
materials engineering described in part 4 of the article are supplementing the 
knowledge relating to the utilization of laser techniques. 


Introduction 

Laser surface treatment is one of the most advanced and technically avant-garde 
technologies used currently in the engineering of material surfaces. This chapter is 
primarily focusing on the characterization of coatings and surfaces fabricated with 
laser techniques, methods of laser surface treatment, tools and apparatus indispens- 
able in the process of laser surface treatment, and the most important dates 
involving the history of laser technology. Furthermore, this chapter is focusing on 
the mechanisms taking place during laser forming, applications of particular tech- 
niques, their advantages and disadvantages, chances, and threats as well as their 
potential and predicted development trends (Bonek 2013a; Dobrzanska- 
Danikiewicz 2012; Hecht 2005; Ion 2002, 2005; Jankowska et al. 2008; 
Kannatey-Asibu 2009; Klimpel et al. 2002; Paschotta 2008). 

The first laser presented in 1 960 had been defined as an invention which would have 
no application in practice. Yet, in time, along with the development of laser technologies 
and their application potential, the opinion voiced on lasers proved to have been no 
longer plausible. Presently, many fields of science and technology are taking advantage 
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of the potential offered by the application of laser, for example (Dobrzanska- 

Danikiewicz 2012; Kannatey-Asibu 2009; Paschotta 2008; National Academy of Engi- 
neering et al. 1987; O’Shea; Steen and Mazumder 2010; Tanski 2013): 

• Materials technologies (precision cutting, welding, melting, alloying, 
hollowing) 

• Precision measurements (length, distance, level of atmosphere pollution, flow 
rate) 

• Control over the operation of working machinery 

• Setting out water routes in ports or, e.g., headings in mines 

• Precision positioning of assembled constructions 

• Medicine and biology (micro surgical eye operations, bloodless surgical opera- 
tions, tooth decay prevention, angioma removal, cosmetic surgery) 

• Recording and playing back sounds or images 

• Military technologies (measurement of distance, control of bombs or missiles, 
illumination, special methods of reconnaissance or photography) 

• Holography 

• Chemical technologies (selective catalysis of chemical reactions) 

• Optical telecommunications (multichannel waveguide communication between 
large computer centers) 


Surface treatment of manufactured elements with the application of such means as 
laser is nowadays a very common and innovative technology due to its seemingly 
unlimited potential, e.g., large material savings, treatment accuracy, considerable 
improvement of the properties of the treated materials, applicability of full automa- 
tion processes, etc. Such a treatment can also contribute to better resistance to 
corrosion and wear. It can raise hardness and afford better resistance to abrasion 
between surfaces of relatively small size. The cost of laser treatment is the only 
limitation when considering the application of this method on large surfaces. Cover- 
ing a large surface for decorative reasons or to protect it against corrosion with the use 
of laser techniques is more expensive than traditional methods, e.g., painting or 
electrolytic coating. The application of laser surface treatment is grounded both 
from the economical viewpoint and due to the fact that laser treatment in many 
cases ensures the acquisition of higher mechanical properties of the treated surfaces, 
e.g., gear teeth or cutting bits, which would be impossible to achieve using conven- 
tional methods of surface treatment (Bonek 2013a; Dobrzanska-Danikiewicz 2012; 
Steen and Mazumder 2010; Tanski 2013; Crafer and Oakley 1992; Dobrzanski 
et al. 2008, 2010, 2011a, b, 2012; Dobrzanski and Drygala 2012). 


Lasers and Laser Radiation 

The word laser is an acronym of the English term “light amplification by stimulated 
emission of radiation,” which may be explained as the light amplification by a 
forced (stimulated) emission of radiation.” A laser is a device generating a coherent 
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Fig. 1 Absorption of photon 
by electron 
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beam of electromagnetic radiation within the wavelength from ultraviolet to infra- 
red in which, in the active medium, the phenomenon of radiation amplification is 
used through forced emission (Hecht 2005; Ion 2005). 

A laser is a source of electromagnetic radiation within the range of infrared 
radiation to ultraviolet radiation (including the range of visible light) produced as a 
result of activity of stimulated emission. As opposed to traditional sources of light, 
laser radiation has several characteristics resulting from the way of generating light 
radiation by a laser, i.e.: 

1 . Small disparity of a laser beam (so-called directivity) 

2. High monochromaticity 

3. Ability to concentrate a laser beam to obtain a high density of power in a spot, 
higher than the density of power of the source 

4. Consistency and order in time and in space 

5. Parallelism 

6. Ability to achieve very high power in a pulse, with very short duration (pulses of 
light lasting even from several to several dozens of femtoseconds) 

According to the research performed by scientists concerning radiation of a 
blackbody, as well as by an excellent physician, Max Planck, we now know that 
energy is not emitted and absorbed in a continuous way, but in suitable, indivisible 
portions of energy, so-called quanta (absorption effect, Fig. 1). The energy of 
quanta is proportional to the vibration frequency of an electromagnetic wave and 
can be determined with the formula (1) (Steen and Mazumder 2010) 

E = hv (1) 


where 
E - Energy 

h - Planck constant (6,626 069 57(29) • 10 -34 J • s) 
v - Length of electromagnetic wave 

This theory sets a basis for formulating a corpuscular-wave theory of light, for 
explaining a photoelectric phenomenon, and is also one of the bases of the theory of 
the structure of matter (Steen and Mazumder 2010). Niels Bohr published in 1913 
his theory of structure of atom (Ion 2005) in which he claimed that not only light is 
subjected to quantization but also energy levels across which an electron is moving 
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Fig. 2 An electron transiting 
between energy levels 
together with emission of 
photon 
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around an atom. This means that elementary particles cannot circulate within any 
given distance from the nuclear fission (to put it otherwise: they cannot possess any 
given energy), but they move along strictly specified orbitals, meaning that also the 
energy of electrons is quantizated. What is more is that it is possible that electrons 
“transit” from one energy level to another; this, however, is related to emission or 
absorption of energy of electromagnetic radiation (Steen and Mazumder 2010). The 
electrons situated on higher energy levels have a higher energy than electrons on 
lower levels, which enables a particle to jump from a lower to higher level, 
provided that energy is delivered to such particle, which is equal to or higher than 
the difference of energy between the levels. A photon with a specific wavelength 
may be the source of it (boson with integer spin being a carrier of a quantum of 
electromagnetic radiation). If an electron is situated on a higher energy orbit 
unnatural for such electron in stable conditions, it is then called an excited electron, 
and the lifetime of such particle in an excited state is approx. 0.01-0.0000001 
s. If, within such time, neither photon “forces” the electron to return to a lower 
energy level, this change will be made by itself, without the influence of an external 
factor. An atom will return to the basic state with energy E 0 , and an identical 
quantum of radiation will be emitted, equal to the quantum that had been earlier 
absorbed. This phenomenon is called spontaneous emission of radiation (Fig. 2). 

If the same photon, which was previously absorbed, acts on the excited atom, 
stimulated emission occurs, as a result of which the stimulating and stimulated 
photon is emitted, and both of them have the same properties (the same direction 
and phase) (Fig. 3). Such bosons can then take part in another stimulated emission 
or in absorption. With the right conditions, they are able to lead to generating a 
massive effect of light emission (Steen and Mazumder 2010). 

The above phenomenon of cascade stimulated emission forms a basis for the 
activity of lasers; however, a necessary condition for it to occur is to reverse 
Boltzmann distribution (so-called population inversion, that is, a state in which 
more particles are observed with high energy than particles with low energy), 
otherwise most of the photons will undergo absorption, without causing the 
strengthening of the electromagnetic wave. It is difficult to achieve population 
inversion considering a very short life cycle of electrons in an excited state. 
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Fig. 4 The schema of laser Energy supply 

and its operating system 0^ the system (resonator) 



medium Energy loss 


Essential for achieving it are materials or chemical compounds (e.g., solid bodies 
(strontium and calcium tungsten); barium, strontium, lanthanum, and calcium 
fluoride; calcium niobate, iron (aluminum garnet), calcium molybdate, doped 
with different elements; fluids (SeOCL 2 , POCL 3 ); gases (He + Ne, C0 2 + N 2 + 
He, He + Cd, He + Sn); and precious gases) which have so-called metastable levels, 
i.e., bands on which an electron may be on a higher energy level much longer than 
during a normal excited state. It is then much easier to reverse Boltzmann distri- 
bution and it is relatively long-lasting. They are characterized by much lower 
energy than excitation bands. The so-called pumping systems are used to ensure 
adequate energy for achieving population inversion. Optic pumps can be distin- 
guished - these are, in particular, strong sources of light (xenon lamps) and other 
(diode) lasers, and excitation can also occur as a result of electric discharge and 
chemical reaction or through the collision of atoms (Fig. 1 from the article). 

When spontaneous emission occurs, which initiates the reactions of stimulated 
emission, the photons produced are distributed in all directions. In order to create 
consistent electromagnetic radiation, an active material is closed in space (resona- 
tor) usually shaped as a cylinder (Fig. 1 from the article). There are many solutions 
for resonators’ design. The simplest systems incorporate two mirrors positioned 
perpendicular to the axis of the laser core, of which one is completely reflecting and 
the other is translucent. The so constructed resonator is called “stable.” The radii of 
mirrors’ curvatures are so selected that radiation cannot escape outside the resona- 
tor. Only this beam is amplified which, after being reflected two times, is in the 
same phase, i.e., in resonance with the initial wave. Radiation is achieved, as a 
result of laser action in a resonator, with the same direction of propagation, which 
then is escaping the system through the translucent mirror. A disadvantage of stable 
resonators is that a translucent mirror is heated, which finally leads to their limited 
application only for medium power lasers. Another solution is non-stable resona- 
tors, in which nontransparent mirrors are used. A ray in such solutions is generated 
just by the resonator axis and, by using parabolic mirrors, is gradually moved in a 
direction perpendicular to the resonator axis, and then evacuated beside the mirror, 
by applying other mirrors positioned at a set angle. Non-stable resonators are 
characterized by higher diffraction losses as compared to stable resonators (Steen 
and Mazumder 2010) (Fig. 4). 
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The following components of the laser design can be distinguished: 

1. Cooling system 

2. Electric supply 

3. Bearing structure with a control system 

Laser has many special properties that make it a unique light source. The 
operating principle of laser consists in obtaining or amplifying a medium and a 
set of mirrors that feeds the light back into the amplifier for continued growth of the 
developing beam. For the phenomenon to occur, relatively special conditions are 
required within the laser medium, but also the capacity to generate light and focus 
the light into a beam travelling in a single direction. It had been more than two 
decades since the first direct diode laser application for materials processing was 
reported in 1991 for soldering using a 15 W medical diode laser. Currently, the 
HPDL (high-power diode laser) is a state-of-the-art energy source. This type of 
laser is employed at an industrial scale in materials engineering since 1998. The 
laser power of the HPDL reaches up to 6 kW. The big advantage of such lasers is 
that they allow to produce rectangular, square, linear, or circular shapes of the laser 
beam focus. They offer the controlled distribution of energy in the focus area with 

c o 

power density of up to 10 W/cm . Due to a high coefficient of radiation absorption, 
a laser beam does not have to be guided through any complex optical systems 
causing an energy loss of 10-30 %, and high-energy efficiency of 50 % is achieved. 
The robotization of technological processes is easy; they are reliable and universal, 
which makes them a very attractive tool in materials engineering. 

Owing to excellent coherence and directionality, a laser beam has been used 
more and more intensively in the recent years for the surface modification of 
metals, particularly in the manufacturing of coatings. 

By applying lasers for surface alloying, cladding, glazing, and annealing of 
semiconductors, a possibility is offered of producing new materials with better 
quality. In the recent years, surface modification using advanced heat sources such 
as lasers has been forcing out conventional methods of producing surface layers 
with improved hardness, wear resistance, heat resistance, or other properties. 

A laser beam is characterized by the following features (Hecht 2005 ; Ion 2005 ; 
Kannatey-Asibu 2009; Bonek 2013a; Crafer and Oakley 1992): 

• Direction is defined by the axis of optical resonator 

• Small divergence angle 

• On the cross section, the highest radiation intensity occurs on the axis of 
resonator 

• Focusing ability by means of optical systems, the spot is from 10 to 300 pm in 
diameter 

• Continuous and pulsed modes of operation, ensuring higher power density 

• Radiation is cohesive and coherent in time and space; there occurs a constant 
phase relation for a beam in time and between any two points of its cross section 

• Radiation is highly monochromatic 
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Fig. 5 Working principle 


/ N 

Working principle 

V J 


Pumping system 


Supplies 




Energy 

The quantum amplification 
of photons occurs 




• The wavelength of radiation emitted by different lasers can be from about 10 nm 
to over 1 mm 

• Presence of electrical field intensity of electromagnetic wave 

Laser consists of four basic elements (Fig. 5; Dobrzanska-Danikiewicz 2012; 
Hecht 2005; Ion 2005; Kannatey-Asibu 2009; O’Shea; Crafer and Oakley 1992): 

• Active medium, in which laser action is taking place 

• Optical resonator (system of mirrors), in which the lasing medium is located 

• Cooling system of the active medium; in the case of high-power gas lasers, it can 
contain pumps enabling the flow of medium through a heat exchanger 

• Source of power (pumping), which pumps the laser 

Laser radiation is currently a state-of-the-art source of thermal energy used for 
the formation of the structure and properties of surface layers. Laser manufacturing 
techniques are the most promising and effective to ensure development in many 
branches of industry and in particular in those in which processing of materials is 
dominating. Thanks to a very precise delivery of energy, laser radiation permits a 
better and also faster way to realize treatment of material surface. It also facilitates 
the introduction of new technologies whereof the realization is not feasible with the 
application of conventional power densities (Kannatey-Asibu 2009; Steen and 
Mazumder 2010). 

The industry worldwide is already utilizing laser technologies to a great extent, 
and the most extensive industrial application in the processes of laser material 
treatment has been demonstrated by solid lasers with crystalline medium (Nd: 
YAG, Yb: YAG, Nd: YV0 4 , gas lasers, and C0 2 ), and there are also other lasers 
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applied as fiber (waveguide) and shield (disc) lasers having the laser beam power, 
impact, or continuous, reaching even 100 kW (Hecht 2005; Paschotta 2008; Steen 
and Mazumder 2010). 


Treatment in Materials Engineering 

Laser surface treatment includes presently the following processes (Dobrzanska- 
Danikiewicz 2012; Ion 2002; Jankowska et al. 2008; Steen and Mazumder 2010; 
Tanski 2013; Dobrzanski et al. 2008, 2010, 2011a, b, 2012; Sulaiman et al. 2007; 
Bonek 2013b; Havrilla and Anthony 1999; Lubman 1990; Pfeifer et al. 2010): 

• Heating surface layers for quench hardening or annealing 

• Treatment of porous ceramics, e.g., concrete or rocks 

• Melting the surface in order to change its structure (e.g., the acquisition of glassy 
layers) and for quench hardening purposes 

• Enrichment of surfaces, e.g., to improve their resistance to corrosion, wear, or 
for decorative reasons 

• Laser enhancement of roughness to raise the adhesion of adhesives/glues on 
surfaces to be joined 

• Laser coating, including the methods LACVD (laser chemical vapor deposition) 
and LAPVD (laser physical vapor deposition) 

• Laser removal of coatings and cleaning 

• Laser marking and engraving 

• Laser cutting 

• Micromachining (e.g., hollowing) 

• Texturing of photovoltaic cells 


Surface Hardening of Steels 

The laser thermal treatment process encompasses operations which are carried out using 
laser beam as a source of energy needed to heat the surface layer of a treated material in 
order to change its structure for the acquisition of appropriate mechanical, physical, or 
chemical properties (Dobrzariska-Danikiewicz 2012; Ion 2002; Klimpel et al. 2002; 
Paschotta 2008; Steen and Mazumder 2010; Tanski 2013; Landolt and Bomstein 2004). 

The rapid heating of a steel surface to the austenization temperature, then 
followed by self-quenching, allows to obtain a martensitic structure with high 
hardness. Laser treatment also allows to harden local molten pool areas through 
the local heating with high intensity and very high self-cooling rates. 

Two methods of surface layer formation have been mainly applied so far for 
laser surface heat treatment: 

• Formation through phase transformations in the solid state consisting of the 
heating of surface layers to the austenization temperature followed by their rapid 
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(hardening) or slow (annealing and quenching) cooling. The material treated can 
be preheated prior to the next surface treatment or machining (e.g., cutting, 
welding, plastic working, or laser treatment); thus, higher plasticity is achievable 

• Formation by surface layer remelting consisting of heating the material above 
the solidus temperature, its rapid solidification, so that phase transformations 
take place in the solid state (Crafer and Oakley 1992; Ion 2005; Klimpel 
et al. 2002; Landolt and Bornstein 2004) 

The two methods are compared with conventional heat treatment in the illustra- 
tions in Fig. 3. 

Using laser surface hardening, depending on cooling rate, surface layers of 
strongly fragmented structure or amorphous surfaces of unchanged chemical com- 
position with respect to the material core can be obtained, but of high chemical 
homogeneity. Other types of material forming by way of surface layer melting 
involve such methods as enrichment (alloying) of surface layer with selected 
alloying elements and plating, consisting in the deposition of a layer having a 
different chemical composition on the surface of the treated material, which ensures 
corrosive resistance, high-temperature creep resistance, or decorative properties, 
etc. (Bonek 2013a; Ion 2002; Klimpel et al. 2002; Paschotta 2008; Steen and 
Mazumder 2010; Crafer and Oakley 1992; Landolt and Bornstein 2004). 

The response of steel surface to laser beam depends principally on surface power 
density P and reaction time, also referred to as exposition time t. With the low level 

A O 

of power density, lower than 10 W/cm , non-melting quench hardening can be 
carried out. Then, the heating of steel surface to the temperature lower than solidus 
(A, < T treat < T s ) takes place. After the termination of the treatment, the absorbed 
heat is instantly transferred to the matrix by way of thermal conductivity, which 
results in quench hardening of surface zone. Non-melting quench hardening enables 
to carry out quench hardening of steel and alloys which are conventionally not 
subjected to quench hardening, e.g., steels of carbon concentration below 0.2 %. 
With the rise of carbon concentration, the hardness and thickness of the quench- 
hardened layer are increasing for the same parameters of laser treatment, which, in 
consequence, results in the rise of hardenability and in the decrease of austenitizing 
temperature. The surface layer after such a treatment consists of two or three zones 
(the third zone comes out in hypoeutectoid steels - the zone heated to the temper- 
ature range of Ax-A 3 (Fig. 4; Bonek 2013a; Ion 2002; Klimpel et al. 2002; 
Paschotta 2008; Steen and Mazumder 2010; Crafer and Oakley 1992; Dobrzanski 
et al. 2011a; Landolt and Bornstein 2004): 

• Zone heated to the temperature A3 - solidus and quench hardened from solid 
state 

• Steel zone heated below the temperature Al and tempered; the zone is present in 
steels which were quench hardened and tempered before laser treatment 

A S' r\ 

With the power density in the range 10-10 W/cnT, the melting of surface layer 
of the metal takes place. Due to this, the structure can be subjected to change both in 
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the solidifying process and in further cooling process by quench hardening with 
melting, glazing, and enrichment (alloying). After the termination of laser treatment 
with such a range of power density, the following zones can be singled out in the 
steel (Ion 2002; Kannatey-Asibu 2009; Klimpel et al. 2002; Steen and Mazumder 
2010 ): 

• Melted zone of the temperature above solidus, which then rapidly solidifies and 
undergoes quench hardening from the liquid state 

• Zone heated to the temperature A3 - solidus and quench hardened from solid 
state, heated to the temperature lower than Al, and tempered 

The increase of surface power density to over 10 W/cm leads then not only to 
rapid melting of steel but also to metal evaporation, which takes advantage of with 
laser cutting, hollowing, engraving, or the removal of material. The steel is heated 
then to the evaporation temperature or higher (T treat > T p ). And then the explosive 
ejection of metal particles is taking place, i.e., so called ablation. During the 
treatment with such power densities, plasma can be formed, which hinders and 
even stops the inflow of radiation to the steel by its absorption. During the treatment 
with such power densities, four zones are formed in the steel (Ion 2002, 2005; 
Klimpel et al. 2002; Steen and Mazumder 2010; Landolt and Bomstein 2004): 

• Zone with craters after steel evaporation 

• Zone melted and quench hardened from liquid state 

• Zone heated within the temperature range A3 - solidus and quench hardened 
from solid state 

• Zone heated to the temperature lower than A 1 and tempered 

O Q 9 

The rise of surface power density to the range of 10 -10 W/cnT leads to the 
formation of plasma which absorbs the radiation of laser and stops the inflow of 
energy to the material. Such high-power densities are used for marking or harden- 
ing by means of shock waves generated in the material in effect of the reaction of 
widening plasma. 

The main objective of melting hardening is the change of output metal structure 
in order to obtain a fine-grained structure, without changing the chemical compo- 
sition of the material. During the melting process, a part of the absorbed thermal 
energy penetrates the material, which results in the generation of high-temperature 
gradient between the liquid layer and the matrix. During the laser treatment with 
melting, the mixing of liquid metal is taking place due to convection movements. 
The movements are generated in effect of temperature difference between the 
melted surface and the bottom of the melted area and, furthermore, due to the 
blowing shielding gas and the “pressure” of laser beam. After the melting and 
mixing of liquid metal, fast solidifying is taking place due to the existing high- 
temperature gradient (Ion 2002, 2005; Steen and Mazumder 2010; Tanski 2013). 
Laser quench hardening with melting is taking place with the heating and 

A Z 

cooling rates of V g ( CO oi.) = 10 -10 °C/s, and in the case of conventional quench 
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hardening, the heating and cooling rate is within the range of 1CL-10 °C/s. Due to 
rapid laser heating and cooling, precipitation processes can be slowed down in the 
steel. During the melting quench hardening, partial or total dissolution of precipi- 
tation phases of, e.g., carbides or contaminants is taking place. Fast and uniform 
crystallization facilitates the acquisition of fine-grained structures, and the slowing 
down of precipitation processes brings about the situation where the precipitation 
phases do not precipitate again or they precipitate in a different form. The melting is 
usually accompanied by supersaturation of solutions, fragmentation of phases, and 
cleaning of grain boundaries, which is of particular importance in terms of corro- 
sive resistance and tribological properties. The research carried out on the laser- 
melted metals and alloys shows also that the solidified layer consists of fine 
columnar or dendritic crystals oriented in accordance with the directions of heat 
dissipation (Bonek 2013a; Paschotta 2008; Steen and Mazumder 2010; Crafer and 
Oakley 1992; Dobrzanski et al. 2011a; Landolt and Bomstein 2004). 

Laser quench hardening is widely applied in the treatment of cast iron - in 
particular gray one - and also in the treatment of structural steel: low and medium 
carbon steels, bearing, tool, and stainless steels. In all cases the structure of quench- 
hardened steels is more chemically and structurally homogeneous. The steels also 
demonstrate higher toughness, strength, and rise of hardness. Laser melting quench 
hardening combined with thermal treatment brings about a considerable rise of 
surface layer hardness. For example, martensitic and ferritic stainless steels, in 
effect of al. 5-4-fold rise of hardness after such operations, demonstrate several or 
even 10-fold rise of abrasive resistance. And the melting of quench-hardened HSS 
steels demonstrates a 1. 5-3-fold rise of abrasive wear resistance. This type of 
quench hardening worsens the smoothness of surface. When the porosity of R a < 
10-15 pm is required, laser treatment should be carried out before the grinding of 
the treated object (Ion 2002; Klimpel et al. 2002; Steen and Mazumder 2010; 
Landolt and Bomstein 2004). Laser surface melting (LSM) is taking place over 
very short time, in which only a small part of absorbed thermal energy penetrates 
the material, which brings about high temperature gradient between the liquid 
surface layer and its remaining mass. During the melting process, very intensive 
mixing of liquid metal is taking place due to convective movement effected by 
temperature difference between the melted surface and the bottom of melted area 
and due to intensive blowing of shielding gas. Fast solidification is taking place due 
to the existing temperature gradient. Cooling the layer at the speed, e.g., of the order 
of 108 [grades/s] leads to the formation of amorphous structure of the layer 
thickness of about 20 [pm]. Amorphous layers are characterized by a considerably 
high mechanical strength and high resistance to corrosion. Such a way of changing 
the properties of the surface layer of metal is often referred to as laser glazing. Laser 
melting of surface layers of materials leads to the acquisition of high resistance to 
abrasion, corrosion, and erosion, due to the formation of a chemically homogeneous 
fine-crystalline surface layer with the chemical composition of the material being 
maintained (Bonek 2013a; Dobrzanska-Danikiewicz 2012; Steen and Mazumder 
2010; Tanski 2013; Dobrzanski et al. 2008; Dobrzanski and Drygala 2012; 
Dobrzanski et al. 2010, 201 la, b, 2012; Bonek 2013b; Landolt and Bomstein 2004). 
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Fig. 6 Scheme of hardening using (a) conventional treatment and (b) laser treatment 


Laser Alloying and Feeding 

Laser surface alloying (LSA), also referred to as enrichment, is one of the most 
modern methods of thermochemical treatment which consists in the introduction of 
alloying elements to the alloyed material where the mixing of both materials is 
taking place, whereof at least one is in liquid state. Mutual intensive mixing of the 
materials is taking place in a pool due to convective, gravitational movement and 
due to the pressure of laser beam (Figs. 5, 6, 7, 8, and 9). 

Depending on how the alloying material is introduced to the pool, alloying and 
fusion (cladding) can be distinguished. 

Alloying is a single- or two-stage process consisting in direct delivery of 
alloying material by means of a feeder or in depositing the alloying material on 
substrate (in the form of solid powder particles, wire, or gases containing nitrogen 
and carbon) and then melting it with a laser beam (Bonek 2013a; Dobrzanska- 
Danikiewicz 2012; Steen and Mazumder 2010; Tanski 2013; Dobrzanski 
et al. 2008; Dobrzanski and Drygala 2012; Dobrzanski et al. 2010, 2011a, b, 
2012; Bonek 2013b; Landolt and Bornstein 2004). 

The alloying process starts from the layer of alloyed material and then through 
conductivity and thermal convection; the melting of substrate and mutual mixing 
are taking place until the surface layer has been solidified. The alloying material can 
be deposited in the compact form: foils, tapes, plates, electrolytic coatings, or layers 
in the form of powdered paste. Usually the coating thickness of alloying materials is 
similar to the thickness of the obtained surface layer and in the case of continuous 
heating is equal to 0. 3-1.0 mm, and with pulsed heating from 0.3 to 0.4 mm 
(Figs. 10, 11, and 12; Bonek 2013a; Dobrzanska-Danikiewicz 2012; Steen 
and Mazumder 2010; Tanski 2013; Dobrzanski et al. 2008; Dobrzanski and Drygala 
2012; Dobrzanski et al. 2010, 2011a, b, 2012; Bonek 2013b; Landolt and 
Bornstein 2004). 
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Fig. 7 Structural changes in the surface steel layer during the laser heating process and after its 
completion depending on surface density of power 


Fig. 8 Diagram of laser 
alloying 


Laser head 
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Fig. 9 Surface layer of steel 
after alloying with A1 2 0 3 
particle with scanning speed 
0.5 m/s, laser power 2.0 kW 



Fig. 10 Surface layer of steel 
after alloying with Si 3 N 4 
particle with scanning speed 
0.5 m/s, laser power 2.3 kW 



Fig. 1 1 Fine eutectics along 
the boundary of melted 
surface layer zone of steel 
after alloying with wolfram 
carbide WC, laser power 2.0, 
and magnification 1,000 x 



Laser feeding is a single- or two-stage process consisting in simultaneous 
melting of alloying material and substrate in the laser action zone or depositing 
the fused material on the substrate. The fused material can be introduced in the form 
of powders, wires, or gases containing free atoms of a chemical element with which 
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Fig. 12 Dendritic structure 
along the boundary of melting 
zone and heat impact in a 
steel surface layer after 
alloying with TiC powder, 
laser power 1.6 kW 



Fig. 13 Diagram of laser 
feeding 
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Fig. 14 Surface layer of 
magnesium alloy 
MCMg A19Zn 1 after the 
feeding of WC powder; laser 
power 2.0 kW, alloying speed 
0.75 m/min 



the fusion is to be done (Figs. 13, 14, 15, 16, 17, 18, and 19). In order to prevent 
material oxidation during the fusion, shielding inert gases are applied, e.g., argon 
and helium (Bonek 2013a; Dobrzariska-Danikiewicz 2012; Steen and Mazumder 
2010; Tariski 2013; Dobrzanski et al. 2008; Dobrzanski and Drygala 2012; 
Dobrzanski et al. 2010, 2011a, b, 2012; Bonek 2013b; Landolt and Bornstein 2004). 
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Fig. 15 Surface layer of 
magnesium alloy 
MCMgA112Znl after the 
feeding of WC powder; laser 
power 1.6 kW, alloying speed 
0.75 m/min 



Fig. 16 Melting edge of the 
surface layer of alloy 
MCMgA112Znl after the 
feeding of WC powder; laser 
power 2.0 kW, alloying speed 
0.75 m/min 



Fig. 17 Surface layer central 
zone of alloy MCMgA16Znl 
after the feeding of SiC 
powder; laser power 2.0 kW, 
alloying speed 0.75 m/min 
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Feeding is realized with the application of continuous work lasers, since the 
alloying material can be fed to the melting zone only during laser heating and not 
during breaks between heating impulses. The powder during feeding melting 
process is melted simultaneously with substrate material, yet the particles of 
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Fig. 18 Central zone 
between the melting and 
substrate of MCMgA13Znl 
alloy after the feeding of TiC 
powder; laser power 1 .2 kW, 
alloying speed 0.75 m/min 



Fig. 19 Melting edge of 
surface layer of 
MCMgA16Znl alloy after the 
feeding of TiC powder; laser 
power 1.6 kW, alloying speed 
0.75 m/min 



powder can be melted in laser beam or in the pool of melted substrate material 
(Bonek 2013a; Dobrzariska-Danikiewicz 2012; Steen and Mazumder 2010; Tariski 
2013; Dobrzanski et al. 2008; Dobrzanski and Drygala 2012; Dobrzanski 
et al. 2010, 2011a, b, 2012; Bonek 2013b; Landolt and Bomstein 2004). 


Laser Surfacing 

Laser surfacing, also referred to as plating or cladding (laser surface cladding - 
LSC), is a process consisting in melting a large amount of surfacing (plating) 
material and a thin layer of substrate material. The parameters used in surfacing 
are similar to those used in alloying, and the thickness ratio of the melted substrate 
material to the thickness of surfacing layer is usually 1/10. In this process the 
mixing of both materials is minimal. Along the boundary between coating and 
substrate, a transition zone is formed, whereof chemical composition 
depends on the percentage share of the material from the melted substrate. In this 
way the solidifying surfacing material is joined metallurgically with the substrate 
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Fig. 20 Scheme of the laser 
cladding (7) laser beam, (2) 
laser tray-cladded material, 

(5) cladded wire, ( 4 ) source of 
the direct resistance heating 
currant, and (5) substrate 
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direct resistance Ctadded wire 

heating currant 




Laser beam 


water 
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Laser tray- 
cladded 
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Fig. 21 HPDL laboratory stand and instrumentation allowing the use of cladding wire in the 
direction of the longer side of the laser beam focus 

(Figs. 20 and 21; Bonek 2013a; Dobrzanska-Danikiewicz 2012; Steen and 
Mazumder 2010; Tanski 2013; Dobrzanski et al. 2008; Dobrzanski and Drygala 
2012; Dobrzanski et al. 2010, 2011a, b, 2012; Bonek 2013b; Landolt and Bornstein 
2004). 

The formed transition layer between the substrate and the surfacing coating can 
have metallurgical character, which, in consequence, results in strong bonding of 
the latter with the substrate. As it was in the case of alloying, also with surfacing, 
the surfacing material can be coated on the substrate using the following processes 
(Figs. 22, 23, and 24; Bonek 2013a; Dobrzanska-Danikiewicz 2012; Steen and 
Mazumder 2010; Tanski 2013; Dobrzanski et al. 2008; Dobrzanski and Drygala 
2012; Dobrzanski et al. 2010, 2011a, b, 2012; Bonek 2013b; Landolt and Bornstein 
2004): 

• Two- stage process before laser treatment. The melting process is taking place 
from the top, which is unfavorable since it contributes to the formation of defects 
in the form of blowholes or insufficient melting in the vicinity of substrate 
material. The formation of these defects is also facilitated by the surfacing filler 
material (most frequently Zapon Lacquer, epoxy resin, fats, plastics, water glass, 
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Fig. 22 The influence of linear energy on the shape of the weld faces in the cladding process of 
HPDL laser with additional material as powder wire EnDOtec DO*60 with 1.2 mm diameter 
cladded in the shortest side of laser beam focus 


Fig. 23 View of the weld 
face and surfaces prepared to 
wear test according to the 
G65 standard of samples 
cladded using a powder wire 
EnDOtec DO*60 with a 
diameter of 1.2 mm using 
high-power diode laser 
(HPDL) 

silicate adhesive, shoemaker’s glue, borax solution with acetone, isopropyl 
alcohol, alcohol solution of rosin, nitrocellulose and rubber adhesives, “Super 
cement” glue) added to the powders (aimed to ensure an appropriate adhesion to 
the substrate), evaporated during the heating with laser radiation and periodi- 
cally shielding the pool, which, in effect, brings about nonuniform melting of the 
substrate. Since this process requires that a coat of plating material should be first 
deposited on the substrate (in the form of powder layer, foil, thin plate or thick 
coat), therefore it can be here referred to as coating process. Each form of plating 
material requires specific parameters of process realization, principally the fed 
power density and surfacing speed. Parameters higher than optimal bring about 










75 Laser Surface Treatment in Manufacturing 


2697 



Fig. 24 Padding weld area performed on the hot work tool steel X37CrMoV5-l adjacent to the 
weld face, mag. 200 x 


too deep melting of the substrate, and lower parameters yield droplet-like form 
of surfacing layer. In the case of powder or paste materials, burning out of the 
filler on the sides of the padding weld is taking place, which frequently neces- 
sitates that the powder or paste must be applied again when more than one 
padding weld is deposited. Therefore, the surfacing with the use of powder 
coatings is frequently carried out in the form of single padding welds. 

• Single-stage process during laser treatment, making use of the coating material 
in the form of powder or rod (tape). The powder is applied by spilling (inertial or 
vibratory), or it is transported by means of indifferent gas (e.g., air) or inert gas 
(nitrogen, helium, argon) with respect to the coating material. 

In a similar way as with feeding, the powder can be fed by means of active gas, 
which, in effect of exothermic reaction, can intensify the surfacing process. The rod 
is fed mechanically in a continuous mode. In the case of powder surfacing, the 
process is referred to as injection surfacing, and in the case of rod surfacing, it is 
called cladding surfacing. The powder or powder mixture blown into the laser beam 
zone is being melted and in the melted form falls upon the substrate surface. The 
surfacing layer is formed, therefore, from the bottom, due to the solidification of 
the melted material on the surface of substrate material. This method necessitates 
the application of a jet coupled with the laser head which provides the powder in 
accordance with or against the movement direction of the feed and requires the 
application of powders of appropriate granulation. The most frequently applied 
granulation of powder is 40-80 pm; powder consumption does not exceed 
1 g/s. Before the application the powders should be dried (Bonek 2013a; 
Dobrzanska-Danikiewicz 2012; Steen and Mazumder 2010; Crafer and Oakley 
1992; Dobrzanski et al. 2008; Dobrzanski and Drygala 2012; Dobrzanski 
et al. 2010, 2011a, b, 2012; Landolt and Bomstein 2004). 

Powder surfacing is characterized by low energy consumption as compared to 
other laser technologies. The fabrication of a single padding weld of the thickness 
1 mm using the surfacing of the fed powder requires the unit energy of 30-50 
J/mm , and the surfacing method of the formerly deposited powder paste consumes 
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60-90 J/mm , whereas the melting of the formerly deposited plasma- sprayed 
coating as much as 180-350 J/mirT (Bonek 2013a; Dobrzanska-Danikiewicz 
2012; Steen and Mazumder 2010; Crafer and Oakley 1992; Dobrzanski 
et al. 2008; Dobrzanski and Drygala 2012; Dobrzanski et al. 2010, 2011a, b, 
2012; Landolt and Bornstein 2004). 

A rod from the coating material can be surfaced on cold substrate or on the 
preheated one (as with standard surfacing) or on the substrate heated simulta- 
neously in order to avoid the generation of too high eigen-stresses and resulting 
from the cracks. In the second case the substrate can be subjected to direct resistive 
heating, arch heating, or some other kind of heating. The fragmentation of the 
structure and the dissolution of hard carbide phases as well as the formation of 
supersaturated old solutions are obtained with laser surfacing. It considerably 
improves the performance properties of substrate materials, increasing their resis- 
tance to wear, principally with high unit pressures being applied. A very high 
strength of the metallurgical connection is also obtained between the surfaced 
coating or padding weld and the substrate. The resistance to the impact of high 
temperatures is also increasing. A significant defect of the surfaced coatings is their 
tendency to cracking. It can be prevented by preheating the substrate to the 
temperatures of 300-400°C and appropriate selection of coating materials (Bonek 
2013a; Dobrzanska-Danikiewicz 2012; Steen and Mazumder 2010; Crafer and 
Oakley 1992; Dobrzanski et al. 2008; Dobrzanski and Drygala 2012; Dobrzanski 
et al. 2010, 2011a, b, 2012; Landolt and Bornstein 2004). 

Surfacing processes have found application principally in two groups of mate- 
rials: materials resistant to corrosion, also in high temperatures, and in heavily 
loaded materials resistant to abrasion, principally tool materials. Unique properties 
of surfacing materials suggest their application in the conditions of high loading and 
temperatures as well as when subjected to erosive and corrosive hazards, e.g., for 
coating the sealing surfaces of valve seats or valves of internal combustion engines, 
water, gas or vapor separators, and parts of metallurgical equipment (Crafer and 
Oakley 1992). 

The resistance to abrasion or corrosion is ensured by the layers Co-Cr-Mo-Si: the 
presence of hard precipitations of intermetallic phases in the matrix, having the 
composition ranging from CoMoSi to Co 3 Mo 2 Si, ensures tribological properties, 
and the presence of chromium in the matrix provides anticorrosion properties. 
Similar resistance is ensured by the coatings Cr-Ni-B-Si-Fe. There are known 
examples involving the plating of acid-resistant stainless steels to raise their 
resistance to abrasion, as well as heat-resistant materials or resistant to high 
temperatures, e.g., nimonic. The operations typical for laser surfacing involve the 
coating of austenitic steels with wolfram or cobalt carbides (Bonek 2013a; 
Dobrzanska-Danikiewicz 2012; Crafer and Oakley 1992; Dobrzanski et al. 2008; 
Dobrzanski and Drygala 2012; Dobrzanski et al. 2010, 2011a, b, 2012. 

The laser surfacing process is applied to deposit heat-resistant layers which are 
resistant to abrasive and erosive wear, especially in high temperatures. Steel can be 
plated with cobalt or titanium alloys, or with the following mixtures: Cr-Ni, 
Cr-B-Ni, Fe-Cr-Mn-C, C-Cr-Mn, C-CrW, Mo-Cr-CrC-Ni-Si, Mo-Ni, 
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TiC-Al 2 0 3 -Al, TiC-Al 2 0 3 -B 4 C-Al, aluminum, carbides WC, TiC, B 4 C, SiC, 
nitrides, including BN, chromium or aluminum oxides, and others. Cobalt alloys 
can be plated with nickel alloys to obtain resistance to high-temperature erosion 
so-called superalloys, titanium alloys with boron nitrides, and silumins with silicon. 
Aluminum and copper can be coated with the mixture of 91 % Zr0 2 and 9 % Y 2 0 3 
or Zr0 2 -CaO. Frequently, the mixture Cr-Ni-B-Fe is applied for laser regeneration 
of worn surfaces, infrequently with the admixture of C and Si. It is believed that the 
plated glazed Si02 coatings deposited on heat-resistant alloys are promising in 
terms of future application potential. They can be deposited on heating equipment 
working at the temperatures of 900-1, 000° C in strongly oxidizing, carburizing, and 
sulfiding environments (Bonek 2013a; Dobrzanska-Danikiewicz 2012; Crafer and 
Oakley 1992; Dobrzanski et al. 2008; Dobrzanski and Drygala 2012; Dobrzanski 
et al. 2010, 2011a, b, 2012). 

Coating compositions of hard and difficult to melt materials comprise the 
powder mixtures of these materials with the powders of coated steel or with iron 
powders used as binder. With respect to the composition WC + Fe (or WC + Co, 
WC + NiCr), due to high speed of the surfacing process which disables the diffusive 
transformation WC^Fe, the hardness of carbides of the order of 11,000 MPa is 
maintained, equal to the obtainable hardness of wolfram carbides present in the 
matrix of tool steel containing wolfram after the conventional thermal treatment. 

Tool steels are laser plated with stellits to ensure their protection against abrasive 
wear. The heat-resistant alloy Nimonic 8A (over 70 % of Ni, 20 % of Cr, admix- 
tures of Al, Co, Ni), applied on the blades of gas turbines, laser stellited, demon- 
strates almost a 100-fold rise of resistance to abrasive wear. Laser stelliting is more 
advantageous than welding stelliting (using the TiG and plasma methods) since 
higher hardness and finer structure is obtained. Chromium oxide Cr 2 0 3 , laser 
surfaced on the austenitic steel 1H18N9T results in a 2-fold rise of its creep 
resistance. In the automobile industry, laser surfacing of valve seats and flanges 
has been applied for a long time (Bonek 2013a; Dobrzanska-Danikiewicz 2012; 
Crafer and Oakley 1992; Dobrzanski et al. 2008; Dobrzanski and Drygala 2012; 
Dobrzanski et al. 2010, 2011a, b, 2012; Landolt and Bornstein 2004). 

The thickness of surfacing coatings is higher than that of alloyed layers and can 
reach several millimeters. The width of a single padding weld (with the laser beam 
scanning with the frequency of 10-300 Hz along the direction transverse to the 
movement direction of the feed) can exceed 10 mm. Surfacing efficiency is from 
several to over 100 mm/s. The quality of the obtained surfaced layers (density, 
binding with the substrate, hardness) is better than the quality of layers subjected to 
thermal spraying, including also plasma ones. Laser surfacing can bring about the 
decrease of fatigue limit of structural steel in the way depending on the type of 
coating; coating materials which are more difficult to melt have stronger impact on 
this decrease (Bonek 2013a; Dobrzanska-Danikiewicz 2012; Crafer and Oakley 
1992; Dobrzanski et al. 2008; Dobrzanski and Drygala 2012; Dobrzanski 
et al. 2012; Dobrzanski et al. 2010, 2011a, b, 2012; Landolt and Bornstein 2004). 

The issue of surface roughness seems to be more problematic with the surfacing 
process as compared to alloying. Surface roughness of the coating is progressively 
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higher with the rise of melting temperature of the surfaced material. Therefore, for 
the surfacing the mixtures of materials are frequently applied, most commonly 
powders of high and low melting temperature, e.g., TiC + A1 2 0 3 + Al + B 4 C, which, 
however, have a negative impact on the hardness of the coatings (Bonek 2013a; 
Dobrzanska-Danikiewicz 2012; Crafer and Oakley 1992; Dobrzanski et al. 2008; 
Dobrzanski and Drygala 2012; Dobrzanski et al. 2010, 2011a, b, 2012; Landolt and 
Bornstein 2004). 


Rapid Prototyping Technologies with the Use of Laser Beam 

RP techniques, thanks to which, owing to the three-dimensional virtual model 
CAD-3D, it is possible to fabricate physical models, master parts, and prototypes, 
are more widely applied in product development techniques, and they contribute to 
the fast fabrication of prototypes and the whole product and to the preparation of its 
manufacturing process. 

The application range of these techniques is still rising, which is particularly 
visible in the automobile industry, where their share reaches even 25 % of the 
product development time. The opinion that in the present day realities this share is 
relatively small can be accepted. But on the other hand, it is known that with the 
application of traditional methods to build a prototype (most frequently using 
machining methods), the fabrication time of a prototype in some cases used to be 
even 60 % of the total time devoted to the development and designing works 
(Dobrzanska-Danikiewicz 2012; Ion 2005; Steen and Mazumder 2010; Crafer 
and Oakley 1992; Landolt and Bornstein 2004; Pham and Dimov 2001). 

The essential application areas of rapid prototyping techniques (Chlebus 2003; 
Oczos 1997): 

• Assessment and analysis of assembly processes 

• Assessment and analysis of processes related to production 

• Market appraisal of newly developed products 

• Modelling and creating implants used in medicine 

• Models used for casting 

• Models used for plastic working 

• Ergonomy and design workshops 

Despite numerous advantages deriving from the application of RP techniques in 
the production process of models, there are also a number of disadvantages condi- 
tioning the use of such processes not for all applications. The main advantages and 
disadvantages of RP processes are given in Table 1. 

The laser melting processes are engineered to create homogeneous solid metal 
components from metal powder, using high-powered laser energy to manufacture 
parts directly from CAD-3D data (Figs. 25 and 26; Dobrzanska-Danikiewicz 2012; 
Ion 2005; Steen and Mazumder 2010; Crafer and Oakley 1992; Landolt and 
Bornstein 2004; Pham and Dimov 2001). 
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Table 1 Main advantages and disadvantages of rapid prototyping technology 


Advantages of RP technology 

Disadvantages of RP technology 

Fast creation of physical models 

Limited dimensions of created objects 

Suitable for creating parts with complex 
geometry and surfaces with free shapes 

Limited range of materials 

Ability to formulate different materials and 
any item shape can be obtained 

It is often necessary to apply additional mass 
finishing treatment 

Low production costs versus traditional 
methods (mainly for fewer units) 

Parts produced meet mechanical requirements 
to some degree only 

The model element is available already during 
construction development 

Limited accuracy; surface quality conditioned 
by production technique employed 

Different methods can be applied using the 
whole array of RP processes 

Supports are needed in some techniques which 
then have to be removed 

It is easy to automate the process 

Porosity of items produced with some 
techniques 



Fig. 25 The view of models manufactured by selective laser melting process 


Fig. 26 Operating principle 
of stereolithographic device 
(SLA) 


Scanning 

system 
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table 


Photopolymer 




Produced 
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There are presently many rapid prototyping techniques. They are being devel- 
oped, and a variety of new, related techniques are being worked out. The most 
popular and most commonly applied techniques are as follows (Dobrzanska- 
Danikiewicz 2012; Ion 2005; Steen and Mazumder 2010; Crafer and Oakley 
1992; Landolt and Bornstein 2004; Pham and Dimov 2001): 

• Stereolithography - SL, SLA 

• Selective laser sintering - SLS 

• Three-dimensional printing - 3DP 

• Ink j et printing - IJP 

• Laser engineered net shaping - LENS 

Stereolithography (SLA). It is the oldest and most thoroughly examined RP 
method introduced in 1987 by C. Hull and presented for the first time by an 
American company 3D Systems Inc. at the AUTOFACT fair in Detroit. The 
technology was called 3D Systems’ Stereolithography Apparatus (SLA). In the 
first phase of the method, the geometrical modelling of an item in a CAD-3D 
system is carried out, then geometrical data is processed with special software 
dividing a three-dimensional model in an X/Y plane to approx. 0. 1 mm thick layer 
(layer thickness depends on the set modelling accuracy and laser power). A laser 
beam control program is created based on the created group of layers. Special 
support elements need to be designed and made for parts produced with this 
method, so-called supports, ensuring stability when irradiating the subsequent 
layers of the working material. The supports form an integral part of the object 
being developed and are attached to a three-dimensional model before it is divided 
into individual sections. The solids need to be removed after production. The parts 
produced with the SLA are often undergoing additional treatment of surface 
polishing or such improving general strength parameters. The laser beam scanning 
time depends on surface complexity of the individual layers of the object, laser 
travel speed, as well as the time of polymer hardening and joining with the previous 
layer. The surface of liquid polymer is levelled with a scraper (Fig. 23) to prevent an 
uneven height of individual layers. 

Selective Laser Sintering (SLS). The SLS (selective laser sintering) method was 
developed by the Austin University in the United States. A thin layer of thermoplastic 
powder is subjected to spot sintering with a continuous beam of laser with the 
capacity of 50-100 W. The previously laid layer is then melted; hence, a uniform 
body of a model created is obtained. Another layer of powder is applied after local 
sintering with laser and the cycle is repeated. A material in this method is transiting 
from the solid state (powder) through the liquid state, and again into the solid state 
(sinter). Powder is not removed from places outside the section and can act as a 
support. Materials used in the SLS method are plastics, waxes, metal powders 
(FE-Cu), and mixtures of metal powders and ceramic powders (Fig. 27). A material 
becomes porous as a result of sintering. It needs to be soaked (infiltrated) with a 
strengthening substance in some cases. The biggest disadvantage of the technology is 
the long cooling time of parts with a complicated internal structure. 
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Fig. 27 W orking scheme of 
the selective sintering device 

(SLS) 


Laser source 


Laser Engineered Net Shaping (LENS). This method allows to obtain a multi- 
material structure due to a possibility of changing the type of powders when 
creating a solid. Parts fabricated with laser engineered net shaping (LENS) are 
characterized by high metallurgical properties while maintaining quite high preci- 
sion of surface representation. LENS’ additional advantage is high functionality: it 
can be used not only for fabricating parts but also for repairing them. The LENS 
method is very similar to the laser cladding process. It is usually made with the 
high-power diode laser (HPDL). The worked item (sheet) is cladded with metal 
powders in both methods (Chlebus 2003; Chlebus 2000). 

Laminated Object Manufacturing (LOM). The LOM (laminated object 
manufacturing) method - multilayer fabrication of objects, developed by an Amer- 
ican company Helisys. Models are made using subsequent layers of heat-sealing 
paper or foil. A shape of the layer is cut with a laser (in the classical LOM method) 
or a blade (in the modification developed by Kira called paper lamination technol- 
ogy - PLT). The LOM technology allows to use plastic foils, ceramics, and metals. 
The result of LOM devices’ work is usually a cuboid whose unnecessary fragments 
should be removed (a laser beam cuts distinctive squares on the pieces of foil not 
used for building the model which, after sealing, produces prisms easily removable 
from the internal space after sealing). Objects made with the LOM require no 
supports. The LOM method is not quite suitable for building elements with a 
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complex internal structure due to difficulties with removing unnecessary fragments 
later. The surfaces of solids created with the LOM have the lowest strength and 
quality as compared to those fabricated with the technologies described above and 
make it useful as a visualization technology only, type RM (Chlebus 2003; Oczos 
1997; Chlebus 2000; Zi^tek 2005; Webb and Jones 2004). 


Application of Laser Radiation for the Synthesis of Materials 

The processes of laser synthesis of materials encompass the technologies making 
use of laser beam to bring about layers consisting of pure metals or compounds on 
the surfaces subjected to treatment. 

The said techniques include (Bonek 2013a; Dobrzariska-Danikiewicz 2012; 
O’Shea; Landolt and Bornstein 2004; Pham and Dimov 2001; Pauleau 2002; 
Conde and Silvestre 2004): 

• LAPVD (laser-assisted physical vapor deposition - deposition of vapors with the 
use of laser beam) 

• LACVD (laser-assisted chemical vapor deposition) 

• Laser synthesis of liquids and gases 

Pulsed Laser Deposition 

A laser beam in material engineering is used, in particular, for pulsed layer 
deposition where high-energy laser radiation quanta reduce the energy of bonds 
between molecules which enables to remove atomic layers one after another. Pulsed 
layer deposition is often associated with evaporation where transition takes place 
from the solid state to the gaseous state with the liquid phase being omitted under 
specific temperature and pressure conditions. The essence of pulsed layer deposi- 
tion is sputtering where a typical material removal speed reaches the value of 
several monolayers per impulse, and the surface is modified structurally or its 
composition is modified at a mesoscopic scale. High-energy laser radiation quanta 
reduce the energy of bonds between molecules which enables to remove atomic 
layers one after another. The basic application areas of pulsed layer deposition: 

• Material evaporation from the target and deposition of its vapors onto a specific 
pad 

• Removal of a redundant material to clean the contaminated surface may be done 
in vacuum, atmosphere of air or in the presence of inert gases, the removal of 
part of material by way of explosive evaporation, and strengthening the 
remaining material using the shock wave phenomenon 

Layers may be fabricated on different substrates made of vapors of metals 
produced as a result of interaction of a stream of laser light photons on solid bodies 
(evaporation and pulsed laser deposition), gases, and liquids (pyrolysis and photol- 
ysis). The synthesis of materials is one of the latest and constantly improved 


75 Laser Surface Treatment in Manufacturing 


2705 


Laser radiation 


Radiation 

transmitting 

window 



Vacuum 

system 





Heated or cooled 
substrate 


Power supply 
system 



Gas input 


Rotating target 



Target rotation system 


Fig. 28 Scheme of the device for laser ablation 


techniques (Fig. 28; Dobrzanska-Danikiewicz 2012; Klimpel et al. 2002; O’Shea; 
Landolt and Bomstein 2004; Pham and Dimov 2001; Pauleau 2002; Conde and 
Silvestre 2004). 

The interaction of a beam with the material leading to its evaporation (pulsed 
laser deposition) should be mentioned for the processes taking place during layers’ 
deposition. This can be divided into four stages: 

1 . Laser beam absorption by material (target) 

2. Material (target) heating 

3. Pulsed laser deposition of material (rapid evaporation) 

4. Ionization of vapors and formation of plasma 

5. Explosive propagation of plasma in a work chamber, mainly in the direction of 
the coated item (substrate) 

LAPVD: Ablative Deposition of Coatings 

The synthesis of materials by means of laser evaporation of materials, referred to as 
LAPVD technique, is a process in which an impulse laser beam is used to evaporate 
material from a disc and to deposit the fabricated vapors in the form of a thin layer 
on a specific pad. The vapors condense, and the growth of the deposited layer is 
taking place, and its structure depends on the structure and properties of the 
substrate as well as on the properties of the evaporated material. This process is 
often referred to as ablative deposition of coatings (Dobrzanski et al. 2011a; Tanski 
2013; Pauleau 2002; Conde and Silvestre 2004). 

The basic elements of the apparatus for laser ablation are as follows 
(Dobrzanska-Danikiewicz 2012; Ion 2005; O’Shea): 
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• Laser 

• Working chamber with probes and with the mounting setup and rotating system 
of disc and substrate 

• Power supply system 

• Vacuum generation system 

• Supply system of process gas 

• Control system of the process 

A laser beam is delivered to a vacuum chamber by a window made of ZnSe and is 
falling on a target at a particular angle (evaporated material prepared, e.g., as a 
homogeneous sinter), causing rapid evaporation (pulsed laser disposition). With the 
appropriate arrangement of the laser beam-target-substrate system, the vapors of the 
evaporated material are distributed immediately (explosively) in a vacuum chamber 

z: 

(at the pressure of about 10 torr) and are mainly deposited in the substrate. A layer 
is growing with strictly controlled thickness as a result of subsequent laser actions 
(layer deposition speed of several tenths of nanometer per impulse). By heating a 
substrate to the set temperature, adhesion is facilitated between the deposited coating 
and the substrate. Moreover, additional substrate cooling allows to obtain an amor- 
phous or nanocrystalline structure. Laser beam interaction with the substrate material 
occurs depending on the length of the applied laser radiation, impulse energy, and its 
duration. The deposition of layers with the laser- assisted physical vapor deposition 
(LAPVD) technique is carried out with lasers emitting ultraviolet radiation due to the 
highest absorption of this range of radiation by materials. 

The following factors make the LAPVD process unique: 

• A possibility of depositing (with stoichiometry maintained) complex, 
multicomponent materials using laser irradiation of a material (target). 

• Easy adjustment/setting of pressure in the chamber, distance between the sub- 
strate and the target, and its orientation in relation to the laser beam. 

• A possibility of obtaining higher target efficiency (of the irradiated material) as 
compared to other techniques as most of the evaporated material is reverse 
guided; hence, it can be collected more efficiently. 

• A possibility of producing multilayer materials as a result of fast replacement of 
targets inserted in the laser beam interaction area. 

• Scheme repeatability can be achieved, i.e., the evaporated and deposited mate- 
rial form a new substrate for the development of next evaporated layers. 

• Hard layers can be produced resistant to abrasion, notably TiC or TiN and thin 
layers for the purpose of microelectronics. 

LACVD: Laser-Assisted Chemical Vapor Deposition 

The process of laser-assisted chemical vapor deposition (LACVD) is similar to 
LAPVD process. Instead of a disc made of solid body, as a material to fabricate 
plasma with the use of laser beam, a specific gas is applied. The laser is applied as 
an energy source used for the decomposition (breakdown) of gas particles and 
to raise the temperature of substrate on which the material is being deposited. 
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This method is used to deposit different materials, such as metals, superconductors, 
and layers of nonconductive properties (Dobrzanska-Danikiewicz 2012; Crafer and 
Oakley 1992; National Academy of Engineering et al. 1987; Landolt and Bomstein 
2004; Pham and Dimov 2001; Pauleau 2002; Conde and Silvestre 2004). 

The breakdown of interatomic bonds in gas particles can be effected by pyrolysis 
or photolysis. In the case of photolysis, the substrate is heated to a specific 
temperature by means of laser beam; by controlling the power of the laser and 
sometimes the impact of laser light on the material, the temperature of the surface 
of the material being heated can be controlled, and the gas undergoes decomposi- 
tion when it gets into contact with the hot surface of the substrate. The technique 
has the following advantages (Dobrzanska-Danikiewicz 2012; Crafer and Oakley 
1992; National Academy of Engineering et al. 1987; Landolt and Bomstein 2004; 
Pham and Dimov 2001; Pauleau 2002; Conde and Silvestre 2004): 

• Localized deposition of thin layers: owing to the application of the LACVD 
technique, localized deposition of thin layers is taking place; such a deposition is 
not feasible to accomplish with the application of the conventional CVD tech- 
nique, since the possibility to localize heating is limited. 

• High deposition speed: using the LACVD technique, thin layers of the thickness 
1 pm can be deposited in the time of about 0.1 s; the research carried out to date 
on the reactions taking place during the LACVD process demonstrates that the 
speed of the reactions is from two to four times higher as compared to the 
reactions taking place during the conventional CVD processes. 

• Cooling and heating speed: the LACVD technique offers a wide range of heating 
or cooling speeds; the acquisition of high heating or cooling speeds is possible 
through changing the way and time of the reaction of laser radiation. It enables to 
regulate grain size in the thin layer; for example, very fine-grained layers (also 
amorphous) can be obtained using short laser impulses. Also, the CVD technique 
can render the acquisition of coarse-grained coatings due to the fact that the time 
of maintaining high temperature of the material is sufficiently long. 

• Speed of gas flow: in the conventional CVD technique, the deposition speed of 
the layer and its properties considerably depend on the speed of gas flow and in 
consequence on the shape of the object and chamber size in which the process is 
carried out. Owing to localized heating and high deposition speeds offered by the 
LACVD technique, the efficiency of the process is practically independent of the 
speed of gas flow. 

The most relevant independent parameters of the process comprise the following 
(Dobrzanska-Danikiewicz 2012; Ion 2005; Klimpel et al. 2002; Steen and 
Mazumder 2010; Landolt and Bomstein 2004; Pham and Dimov 2001; Pauleau 
2002; Conde and Silvestre 2004; McGeough 2002): 

• Output power of the laser 

• Beam diameter 

• Wavelength of laser radiation 
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Table 2 The key independent parameters of the LACVD process and their corresponding 
dependent variables 

LACVD process parameters 


Independent 

Dependent 

Laser output power 

Rate of deposition 

Beam diameter 

Chemical composition and micro structure of the layer 

X-ray radiation wavelength 

Layer geometry 

Scanning speed or heating time 

Layer deposition thickness. 

Surface reflectivity 

Substrate temperature 

Heat properties of the substrate 


• Scanning speed or heating time 

• Reflexivity of the surface 

• Substrate temperature 

• Thermal properties of the substrate 

The corresponding dependent variables involve the following (Bonek 2013a; 
O’Shea; Landolt and Bornstein 2004): 

• Deposition speed 

• Chemical composition and microstructure of the layer 

• Geometry of the layer 

• Thickness of the deposited layer (Table 2) 

Laser Synthesis of Materials from the Liquid Phase 

The laser synthesis of materials from the phase in a form of layers or nanoparticles 
is possible by using the laser irradiation of the body surface. It is a prerequisite for 
initiating the process to submerge a solid body in liquid having sufficient perme- 
ability for a beam of laser light with specific length. Liquid permeability for laser 
light is a significant factor. By absorbing the energy of a laser beam, a surface layer 
of the substrate material is rapidly heated and then cooled in liquid. This creates 
the condition of certain imbalance in liquid on the boundary with the substrate (near 
the solid body - liquid boundary). The cleavage of bonds occurs as a result of the 
molecules of the liquid phase leading to the production of metastable phases. 
Wavelength, energy, and laser impulse duration have decisive influence on the 
progress of the processes of materials’ synthesis from specific liquids and their 
deposition on substrates (Fig. 29; Yang 2007; Burakowski and Napadlek 2007). 


Application of Laser in Photovoltaics 

The application of laser technology in the manufacturing process of solar 
cells of high conversion efficiency of solar radiation into electrical energy is 
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Copper substrate 


The sample holder 


Fig. 29 


Schematic diagram of the process of synthesis of laser material from the liquid phase 


Fig. 30 Influence of surface 
texture on light absorption 


hv Reflected light 



becoming an indispensable element of current photovoltaic technology. In photo- 
voltaic s, lasers are most commonly applied for texturing surfaces, electrode 
insulation, producing buried contact, burning out contacts, doping, drilling, and 
cutting (Dobrzanski et al. 2008; Dunsky 2007). The losses caused by the reflection 
of solar radiation from the front surface of cells lead to intensive global 
research into reduction of such effect. One of the basic stages of photovoltaic cell 
fabrication is texturing its surface (Fig. 30; Dunsky 2007; Ali et al. 2012; 
Gangopadhyay et al. 2007). Considering that the distribution of crystallographic 
orientation of grains in poly crystalline silicon is random, difficulties are encoun- 
tered with producing a uniform texture on its entire surface, and the use of the laser 
technique is one of the most promising methods of polycrystalline silicon 
surface texturization (Figs. 31 and 32; Dobrzanski et al. 2008; Dobrzanski and 
Dry gala 2012). 

A dynamic development of photovoltaics is enforcing a redefinition of 
requirements involving solar cell technologies and in particular the improvement 
of their efficiency, and laser techniques are classed as being most prospective 
(Dunsky 2007). 
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Fig. 31 SEM micrograph of laser texture corresponding to a grid of grooves produced with the 
spacing of 0.05 mm (Dobrzanski et al. 2008; Dobrzanski and Drygala 2012) 



Fig. 32 SEM micrograph of laser texture corresponding to a grid of grooves produced with the 
spacing of 0.05 mm after the removal of a distorted layer of the thickness 40 pm (Dobrzanski 
et al. 2008; Dobrzanski and Drygala 2012) 


Laser Coating Removal and Cleaning 

The technique is applied principally for conservation works, such as cleaning of 
surfaces (stone, ceramic, metallic, etc.), removing dirt buildup, paint coatings, 
varnish, and corrosion, but also as a repair technique which enables to clean 
corroded surfaces of elements (Fig. 33; Dobrzariska-Danikiewicz 2012; Jankowska 
et al. 2008; Landolt and Bomstein 2004; Napadlek 2009). 

Advantages of laser cleaning: 

• Contactless operation 

• No thermal impact on the surface 

• Precision and selective impact of laser beam 

• Full control of the process 
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Fig. 33 Schematic diagram of the cleaning (Napadlek 2009) 


• Unrivaled for the conservation of elements of complicated shape 

• Perfect supplementation of traditional cleaning methods. 


Cutting, Laser Perforating and Engraving, and Laser Marking 

In principle, there are two methods of laser cutting: melting and sublimation. 
During laser cutting by means of melting, a material, e.g., acryl, is melted or 
evaporated. With the sublimation, a material, e.g., wood, is evaporated instantly, 
but the liquid phase is ignored. Laser can be also used to perforate very small holes 
in very hard materials, e.g., in diamond, and also in very brittle materials, e.g., 
ceramics. The holes are made at high speed and have repeatable shape 
(Dobrzanska-Danikiewicz 2012; Ion 2005; Sulaiman et al. 2007; Havrilla and 
Anthony 1999; Pfeifer et al. 2010; Landolt and Bornstein 2004; Dunsky 2007). 

During engraving the material is evaporated. As a result, a conical groove 
effected by the profile of a laser beam and thermal conductivity of the material 
can be obtained. 

Laser marking is a thermal process where a focused beam of laser light is used, 
having high concentration (intensity) level in order to effect a visible change on the 
surface of a given material. The beam brings about a high change of temperature on 
the surface of the marked material until there is a visible change in coloring of the 
material and until the material has been removed by evaporation (Ion 2005; 
Havrilla and Anthony 1999; Landolt and Bornstein 2004). 
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Fig. 34 Laser marking 
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Basing laser marking methods are among others as follows (Landolt and 
Bornstein 2004): 

• Laser marking with a mask (Fig. 34) 

• Marking with a dot matrix method 

• Vector-based marking 


Laser Enhancement of the Roughness 

Laser modification of surfaces of plastics consists in the generation of thermal 
effects, which leads to melting of microareas of the surface layer. The effectiveness 
of this method to a large extent depends on the absorption coefficient of laser 
radiation through the plastic material. Hence, this method proves effective for 
polyamides, polyesters, and aramid materials. Changes taking place in effect of 
laser modification consist primarily in the enhancement of roughness of the surface 
and removal of its fragments. When the process is carried out in the presence of 
oxygen, the oxidation of surface layer is also taking place (Hecht 2005; Crafer and 
Oakley 1992; Landolt and Bornstein 2004). 

Laser Micromachining 

Endeavors have been made in all the fields to achieve the highest miniaturization of 
products while ensuring their greatest functionality. Modern lasers offer a more 
precise material modification technology usually beyond the reach of other 
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technologies. Cutting, marking, text, and graphics printing on such materials as 
glass, metal, or plastic is possible with the laser technology (McGeough 2002). 

The term laser micromachining is used with reference to a technique employing 
a laser beam to remove small quantities or thin layers of the machined material in a 
pulsed layer deposition process. Lasers with the duration of impulses of nanosec- 
onds and femtoseconds are usually employed in such processes. The basic laser 
micromachining processes include cutting, boring, and surface formation. 

Laser impulses are focused on the surface of the machined material during 
micromachining. A part of laser radiation is reflected and the remainder absorbed 
by a material. This depends on the physio-optical properties of the material and 
laser radiation wavelength. In nontransparent materials, radiation absorption occurs 
in a thin layer only just underneath the material surface, and the energy absorbed is 
transported deep inside the material in a thermal conductivity process. The material 
situated in a so-called heat-affected zone (HAZ) is partially melted and then 
evaporated. This, however, is usually associated with the degradation of the treated 
material due to its damage within the micromachining area and is exhibited by 
deformation, sintering, or decolorization of the material edge. If the duration of 
impulses is short, and the density of laser beam energy sufficiently high, a phe- 
nomenon may occur of abrupt “ejection” of the liquid phase from the 
micromachining area. Pieces of material can be removed with this phenomenon 
in a way lowering the thermal degradation of the material. The both processes are 
grouped into pulsed layer deposition permitting, in particular, to produce small 
parts with complicated shapes (Webb and Jones 2004; McGeough 2002; Dahotre 
et al. 1994; Tanski et al. 2010). 

The following advantages of laser micromachining should be listed: 

• A wide array of materials may be micromachined: metals, polymers, ceramics, 
glass, insulators, and conductors 

• Fast process speed and low complexity 

• High flexibility when designing the geometry of parts 

The disadvantages of laser micromachining: 

• Relatively small accuracy of about dozen or so micrometers 

• A risk of thermal damage to the machined material in the heat-affected zone 
(HAZ) 

Main Trends and Development Directions 

Presently, worldwide trends involving the application of laser technologies in 
surface engineering are principally based on automation processes using laser 
beam for the treatment of surface layers of materials. By the application of a 
coherent laser of simple design and a fiber waveguide, a laser beam can be guided 
to a treated place of any surface shape, even along the distance of several dozen 
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meters, and when it affords the possibility to use one laser source for many 
technological processes, it can minimize overall manufacturing costs. Laser 
manufacturing techniques, as it can be implied from the forecasts highlighting the 
factors of global development of economy, are the most prospective and effective 
factors ensuring further development in many branches of industry where material 
processing is dominating. It is believed that future conditions of particular econo- 
mies worldwide can be only competitive if they implement laser technologies to the 
possibly greatest extent. In research institutes, the development of no other tech- 
nology involving the treatment of materials enjoys so much attention and expendi- 
tures as the development of laser technologies. 

Materials engineering is one of prime scientific disciplines in terms of historical 
aspects, needs of the society, and the development of technological civilization. 
And the laser technologies are viewed currently as the most dynamic branch in the 
fields of telecommunications, optoelectronics, photonics, and materials engineer- 
ing. Their rapid development is a response to a growing demand for specialists who 
on the one hand have extensive knowledge on commonly understood laser tech- 
niques, and on the other hand, they are aware of the applicability potential of lasers 
in the field of materials science. 


Over the decades, a number of technologies shaping the structure of material 
surface layers have been worked out. More and more frequently, in various 
branches of industry, the properties of particular elements fabricated from various 
groups of engineering materials are being adapted to performance requirements, 
and the fabrication of surface layers which meet the expected requirements has a 
considerable impact on the applicability potential of many products, which consid- 
erably contributes to the change of their usability properties. 

The last decades have witnessed a considerable rise in the demand for modern 


methods of material surface treatment, which might improve the mechanical and 
usability properties of surface layers, so that they can satisfy the growing demand 
posed by manufacturers and ultimately also by consumers. 

According to the forecasts involving the factors contributing to global economic 
development, laser fabrication techniques are currently the most promising and 
effective to ensure further development in many branches of industry comprising a 
commonly understood notion of materials processing. Surface treatment with the 
use of laser is one of the most dynamically developing sectors of the economy. 

The best development perspectives in surface engineering when laser techniques 
are considered are rendered by laser alloying/feeding and laser ablation. The 
advisable action to be undertaken with respect to the said technology groups 
involves making them more attractive, more modem, and automated as well as 
computerized, supported by appropriate promotion, and taking advantage of a good 
economic situation on the market. With respect to laser ablation, further research 
works should be carried out, and in the case of good economic trends on the market, 
it should be modernized and invested. Basing on the analysis of future development 
trends of particular groups of laser surface treatment, the notion that the laser 
incremental production, laser treatment of functional materials, and laser treatment 
of biomaterials belong to the group of technologies whereof significance will be 
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rising can be accepted. And the highest uncertainty from among the investigated 
technology groups is attributed to laser surfacing, whereof significance is changing 
as compared to other groups, which means that it can increase, decrease, or 
maintain the current level. 


Summary 

There has been a growing demand witnessed recently for modem methods of 
surface treatment which would enable to improve the mechanical and usability 
properties of surface layers of various engineering materials. These growing ten- 
dencies enforcing more efficient application of modern laser techniques in materials 
engineering are motivating the scientific environment to make all possible effort to 
find new solutions involving not only the manufacturing engineering but also, or in 
particular, the engineering of material surface treatment, a new generation of layers 
which would satisfy the rising requirements voiced by manufacturers and ulti- 
mately also by consumers (Dobrzanska-Danikiewicz 2012; Kannatey-Asibu 
2009; O’Shea; Crafer and Oakley 1992; Landolt and Bomstein 2004). 

From among numerous techniques enhancing the durability of engineering 
materials, a significant role in the industrial or laboratory practice is played by 
laser technologies with the use of high-power lasers which enable a full control over 
the composition, structure, and properties. Owing to the properties of laser radia- 
tion, the laser treatment is becoming more and more popular in terms of industrial 
applications. At present, with respect to the engineering of material surfaces, laser 
techniques are most commonly applied for quench hardening, annealing, temper- 
ing, glazing, melting, alloying, fusing, and surfacing as well as laser cutting and 
marking. 

Laser radiation can facilitate the fabrication of many precision technological 
operations on various types of materials and surfaces of engineering elements, with 
the output and accuracy exceeding traditional methods, yet some difficulties and 
technological problems connected with it cannot be ignored (Dobrzanska- 
Danikiewicz 2012; Kannatey-Asibu 2009; O’Shea; Crafer and Oakley 1992; 
Landolt and Bornstein 2004). 

Versatile advantages involving the treatment of materials with the use of laser 
are applied in many areas. High purity of the treatment place is ensured by 
contactless work conditions, where the laser beam is remote controlled through 
translucent shields, in the conditions of vacuum, gas atmosphere, and under water. 
The acquisition of extremely high powers can be accomplished by focusing the 
laser beam to unbelievably small size, even of the order of a micrometer. Moreover, 
it affords a possibility of selective impact of the beam on precisely chosen areas of 
the material, e.g., at places which are difficult to reach, without any concerns to 
deform the material. The fact that the treatment time is short and a great amount of 
energy is delivered to the place of treatment and the process can be carried out 
without burning contributes to the minimization of chemical pollution and the 
elimination of oxidizing process. The time and power duration of a radiation 
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impulse generated by modern lasers can be regulated, which means that the 
machine tool with such a laser has the properties of a universal device. Therefore, 
by an appropriate selection of the values of power density and radiation, many 
different technological processes can be realized. One of the flaws of the laser 
treatment of materials in surface engineering still involves the high cost of laser 
purchase and the training of highly qualified engineering staff needed to handle the 
equipment (Dobrzanska-Danikiewicz 2012; Kannatey-Asibu 2009; O’Shea; Crafer 
and Oakley 1992; Landolt and Bornstein 2004). 
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Abstract 

Progress in the manufacturing and improvement of the operational strength of 
structural elements and tools applied in the diverse areas of life is mainly 
achieved by the increasingly widespread use of deposition techniques of thin 
layers made of hard, wear-resistant ceramic materials. A wide variety of coating 
types presently available and coating deposition technologies have resulted from 
the growing demand seen in recent years for cutting edge methods of material 
modification and surface protection. Among the numerous techniques enhancing 
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the life of materials, physical vapour deposition (PVD) methods now play a 
significant role in the industrial practice. 

This chapter presents physical vapour deposition (PVD) methods for fabri- 
cating hard coatings on engineering materials and in particular: deposition 
methods, PVD coatings synthesis and substrate surface preparation, and also 
the applications of PVD coatings, as well as an overview of research methods 
and coating properties. 


Introduction 

The physical vapour deposition (PVD) process is an atomic deposition process in 
which a material is vaporised from a solid or fluid source such as molecules or 
atoms and transported as vapour through a plasma or vacuum environment to the 
substrate where it is condensed. Thin PVD films and coatings are applied in 
structural bulk materials in order to improve the desired properties of the surface, 
such as corrosion resistance, wear resistance, hardness, friction or the required 
colour. The research topics concerning the fabrication of PVD coatings are one of 
the most crucial directions of surface engineering development, ensuring the 
manufacture of coatings with high functional properties in terms of mechanical 
characteristics and wear resistance (Zhang et al. 2007; Lukaszkowicz et al. 2004; 
Dobrzanski and Lukaszkowicz 2004; Dobrzanski 2006; Mattox 2010). 


PVD Deposition Methods 

This chapter presents different methods available for the preparation of PVD 
coatings. The most promising methods for further research and industrial activity 
include magnetron sputtering, vacuum arc evaporation, laser ablation and hybrid 
techniques consisting of a combination of the aforementioned techniques (Table 1). 

It is characteristic for PVD coating deposition methods that metals and alloys are 
evaporated or sputtered in a vacuum and gases and vapours of metals are ionised at 
a reduced pressure and at a lower temperature than in CVD methods, with the use of 
various physical processes. In most of the cases, coatings are fabricated in a PVD 
process in three stages (Dobrzanski 2006; Mattox 2010; Sree Harsha 2006): 

• Obtaining the vapours of the material deposited; 

• Transporting the (neutral or ionised) vapours onto the substrate material; 

• Condensing the vapours of the deposited material onto the substrate and coating 
growth. 

The said stages of physical vapour deposition, depending on the method, may be 
supported by additional physical processes, such as: electrical ionisation of the 
metal vapours obtained and gases supplied, crystallisation of metal or gas phase 
from the plasma obtained. 
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Table 1 Key PVD processing techniques (Zhang et al. 2007) 


Subgroup 

Methods 

Thermal evaporation 

Pulsed laser deposition 

Electron beam deposition 

Ion implantation 

Ion beam deposition 

Sputter deposition 

High-power pulsed magnetron sputtering 

Reactive magnetron sputtering 

Ion beam sputtering 

Arc vapour deposition 

Pulsed arc deposition 

Vacuum arc deposition 

Filtered arc deposition 


Nearly in all PVD techniques, the deposited coating is created from a stream of 
ionised plasma directed onto the substrate by electrical discharges. For this reason, 
some plasma deposition techniques (using ions) are called plasma assisted PVD 
(PAPVD) or ion assisted PVD (IAPVD). The PAPVD processes are characterised 
by several specific features that do not occur during non-ionised vapour 
crystallisation processes. This particularly relates to the presence of not only neutral 
particles in plasma, but also electrically charged particles (ions, electrons) reaching 
a high kinetic energy when accelerated in the electric field. They are employed 
especially for the deposition of high melting point materials such as nitrides, 
carbides, borides and transition metals oxides. Plasma, as a non-independent 
state, requires external energy sources to sustain. The complex and multi-stage 
nature of the inter-particle energy exchange processes causing reactions in the 
plasma may be generally divided into three stages (Dobrzanski 2006): 

• Energy supply to free electrons by the electric field; 

• Energy supply to other plasma particles by electrons, in order to increase their 
internal energy; 

• Progress of chemical reactions occurring between the particles under increased 
internal energy states. 

Plasma techniques have become increasingly important for achieving improved 
substrate surface cleanliness and better adhesion of the deposited coating to 
the base. 

Considering the diversity of methods applied and their varieties and modifica- 
tions as well as the lack of a harmonised nomenclature, many authors, depending on 
the criteria assumed, introduce their own classifications, making it difficult to make 
any clear classification. The authors tend to agree to use of one of the following 
classifications according to (Dobrzanski 2006; Mattox 2010; Sree Harsha 2006): 

• The location of the zone where vapours of the material deposited are obtained 
and ionised; 

• The method of obtaining and ionising the vapours of metals or phases, through: 
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- Thermal evaporation with particles of the material deposited obtained as a 
result; 

- Sublimation where the particles of the material deposited are obtained in a 
high current continuous or impulse arc discharge; 

- Sputtering where the particles of the material deposited are obtained as a 
result of bombarding the target surface of the material deposited with neutral 
gas ions, most frequently high energy argon or under abnormal discharge in 
crossed magnetic and electric fields; 

• The non-existence or existence of deposition process intensification by: 

- Reactive methods related to the application of reactive gases (e.g. N 2 , hydro- 
carbons, 0 2 , NH3) allow production of an appropriate phase featuring high 
hardness (e.g. TiN, A1 2 0 3 ) as a result of a chemical reaction with metal 
vapours; 

- Activated methods related to the activation of the gas ionisation and metal 
vapours process by means of additional glow discharges, constant or variable 
electric fields, magnetic fields, additional sources of electron emission or 
heating the substrate; 

- Mixed methods related to a various combination of the methods given above; 

• And the most significant classification according to the method of metal vapour 

deposition through: 

- Arc evaporation where the particles of the material deposited are obtained in 
high current electric (arc) discharge; 

- Sputter deposition where the particles of the material deposited are obtained 
by bombarding the surface with ions, mostly with high energy argon ions; 

- Ion plating where the particles of the material deposited are obtained through 
thermal evaporation or temperature sublimation. 

High-current low-pressure arc discharge occurring on the evaporated material 
(cathode) surface is employed for arc evaporation processes. As a result of the arc 
discharge taking place between two electrodes with a high current intensity, 
so-called cathode spots with their diameter of up to 100 pm are formed on the 
cathode surface, being the source of highly ionised material vapours. Local evap- 
oration from the cathode material area occurs as a result of high arc energy 
concentration on a very small area, resulting in the rapid growth of power of up 
to several kilowatts. The degree of ionisation depends on the type of material 
evaporated and ranges between 30 % and 95 %. The use of electrostatic screens 
or electromagnetic systems enables to control the size, speed rate and movement 
direction of cathode spots. This allows to reduce the progress of erosion on the 
cathode surface. The arc evaporation process is one of the most popular methods 
due to the presence of multiple ions with a high kinetic energy (10-100 eV), the 
possibility of the ion cleaning of the substrate and base surface and due to evapo- 
ration in the atmosphere of a mixture of reactive gases. The applicability of this 
method is sometimes limited because droplets of the evaporated material taking 
part in coating production are present in plasma. Such droplets are formed through 
the collision of ions with the cathode surface in a liquid state with material droplets 
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being spalled as a result; however, there are methods available to constrain this 
unfavourable phenomenon. Coating deposition devices for arc evaporation pro- 
cesses may feature different designs due to a very small area of the cathode made 
from a solid-state, arc-melted material and of the anode that may be, e.g. a vacuum 
chamber. It is feasible to apply several or more than a dozen of cathodes at the same 
time to deposit large items. This group of methods includes, in particular: pulse 
plasma method, laser beam evaporation. 

The particles of the coating material deposited in ion sputter deposition methods 
are removed from the solid body surface, so-called target, as a result of bombarding 
with ions with 100-1,000 eV energy. The source of sputtering ions are direct 
current (DC) glow discharges, or alternate current (AC) radio frequencies 
(RF) generated by feeding high 1-10 kV voltage between the electrodes at a 
reduced pressure of 1-10 Pa. The particles of the coating material sputtered by 
the ions are ionised while penetrating the plasma area. Subsequently, by reacting 
with the molecules of the reactive gas introduced into the plasma area, they are 
deposited on the coated charge material as a chemical compound. Physical grounds 
for this process may be explained based on sputtering in a diode system. A glow 
discharge in this process takes place between an anode, representing an earthed 
base, and a cathode, made of the sputtered material. The ions of neutral gas 
produced in the discharge are accelerated in a cathode drop and, while bombarding 
the target (cathode), are removing atoms from its surface. Such atoms, following 
transition through the plasma area, are deposited onto the coated charge material. A 
speed rate of sputtering depends on the kinetic energy of the bombarding gas ions, 
on the ion current density with its value depending on interelectrode voltage and on 
gas pressure. The growth of voltage is, however, limited by transition from a glow 
discharge into an arc discharge, while the growth of pressure above 10 Pa is limited 
by a free path of gas atmosphere particles. A major disadvantage of conventional 
ion sputtering, despite the high simplicity of the process, is an insufficient speed 
rate of deposition due to a limited density of ions bombarding the target. This 
method, therefore, can only be applied for depositing layers at a relatively low 
speed rate, mainly wherever the application of relatively thin coatings is required. 

By enriching a gas environment with argon, the intensity of target sputtering is 
enhanced by high-energy ions of the gas. This, in turn, improves a growth rate of the 
coating deposited onto the base. The process is effectively improved also by placing 
magnets underneath the target, which allows formation of an appropriate magnetic 
field (magnetron sputtering). Sputtering in the crossed electric and magnetic fields 
is extending the free path of electrons as a result of which current density is reached 
several dozen times higher than in conventional sputtering. Pressure in a furnace 
chamber is also lowered thus ensuring the improved purity of coatings deposited. 
The magnetron method, consisting in trapping the discharge plasma over the target 
surface, enables significantly higher rates of sputtering speed, hence a higher 
coating deposition rate (by one to two orders of magnitude). It also permits 
evaporation of practically all the materials with a broad melting point range, such 
as: Ti, Al, and provides extensive possibilities of coating deposition by introducing 
appropriate reactive gases, e.g.: Ar + N 2 , Ar + C 2 H 2 . A high energy of the coating 
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material particles ensures the good adhesion of the coatings at a relatively low 
temperature of the base. Multiple magnetron design solutions are available. Their 
common characteristic is the shape of the plasma trapping area accurately 
representing the shape of the magnetic field set by a gap between the poles of the 
magnetron magnetic system. Magnetrons are generally classified in the literature 
according to the cathode shape (flat, cylindrical and angular cathodes). The mag- 
netron method, combining a simple design and easy process control, offers the 
possibility of depositing coatings on very large areas and of depositing multilayer, 
submicrometre coatings. An additional advantage of this method is the possibility 
to accomplish high process efficiency at relatively low costs as compared to other 
thin coatings deposition techniques. Due to this fact, magnetron sputtering is one of 
the most commonly employed coating deposition methods. In addition to the 
mentioned diode sputtering and magnetron sputtering, the group of ion sputtering 
methods may also include triode sputtering, electron cyclotron resonance and 
hollow cathode sputtering. 

In ion plating processes, coatings are deposited by directing a beam of 
low-energy ions directly onto the base material or indirectly by using the vapours 
of the material being deposited. The vapours, after passing through gas glow 
discharges, are ionised and reach the base in such condition. The indirect method 
is currently the most popular method of coatings deposition in ion plating pro- 
cesses. The method allows to utilise fully the ion beam energy in high vacuum 

o 

conditions of 10 -10 Pa. The vapours of the deposited material may be 
obtained, notably, by means of resistance, electron, induction and thermal-ion 
heating. Resistance heating enjoys smaller popularity, however, as it is difficult to 
use materials with a high melting point and due to low efficiency of the vapours. 
Activated reactive evaporation (ARE) is an example of a method applying ion 
plating using an electron beam for producing vapours of the deposited material. By 
applying an additional electrode with positive potential, gas ionisation is seen by 
accelerating the electrons emitted from the melted metal surface. In this area, 
electrons, in inelastic collisions, transfer some of their energy to neutral particles, 
thus either exciting or ionising such particles. Reactive gas is introduced into the 
plasma produced in such conditions as it is highly probable that reactive collisions 
will take place between the particles of the evaporated material and the reactive gas 
leading, as a result of the reaction, to the crystallisation of the formed phase on the 
base surface. A full equilibrium concentration of the deposited coatings and their 
good adhesion to the base is not ensured due to a low kinetic energy of the particles 
deposited onto the base. A higher degree of ionisation and kinetic energy of the 
deposited material particles and higher adhesion can be attained by employing the 
bias activated reactive evaporation (BARE) method. Base polarisation with the 
negative electric potential of 2,000-3,000 V was applied in this method. A degree 
of gas ionisation can be increased to approx. 50 % by introducing additional 
modifications elevating the degree of vapours ionisation by using a magnetic field 
forming a column of plasma or additional electron emitters and by using a positively 



76 Physical Vapor Deposition in Manufacturing 


polarised cathode. Apart from the methods mentioned above, thermoionic arc evap- 
oration and hot hollow cathode discharge are also present in ionic plating processes. 

It can be concluded by analysing PVD coatings deposition and crystallisation 
processes that the role of ions in such processes is critically influencing the structure 
of the layers, their phase composition and adhesion to the base. The ions colliding 
with the base are transferring to the base the energy, momentum and charge 
which, in turn, impact the elementary processes taking place on the base. The 
ions falling - with their respective high energy - on the base material, are removing 
any impurities from the surface and are supportive to the production of transition 
coatings. They also contribute to the improved and accelerated diffusion of 
particles of the material deposited as a result of the growing density of structure 
defects on the base surface. Coatings are attained with high adhesivity to the base 
and good physical properties due to nucleation centres with high density developed 
under the influence of ions, higher surface mobility of atoms and chemical activity 
of the surface. Ions in the processes of physical vapour deposition are heterogenic 
nucleation centres, and their charge may change the vapour pressure over the 
developing nucleus and its growth rate. 

The vacuum arc evaporation technology features high ionisation (>90%) but is 
limited by the material. Magnetron sputtering is a low temperature technology 
where various materials may be deposited, but its ionisation is low. In a magnetron 
sputtering process, bombardment with energy ions (several hundreds to a few 
thousands of electronovolts) is used to vaporise the target. In a magnetron 
sputtering process, the sputtered atoms carry more energy than the evaporated 
atoms; therefore sputter-grown films usually have a higher density. Moreover, ion 
bombardment of the growing film may restrain the growth of grain and permit the 
formation of nanocrystallites. In an arc evaporation process, atoms are removed 
from the source by thermal or electron means. Thermal evaporation has certain 
limitations for multicomponent materials since one of the metallic elements 
typically evaporates before the other. A high deposition rate and fully ionised 
plasma are the advantages of the deposition of coatings by means of arc in a 
vacuum. Disadvantages include the emission of droplets of molten metal from 
the cathodic spot and the uncontrolled movement of the cathodic spot over the 
surface. Nowadays, LARC Technology (Fig. 1) is one of the coating instruments 
commonly used for large-scale industrial manufacturing. The key advantages of 
the LARC Technology derive from rotating cathodes and their lateral position. 
The cathodes are in permanent rotation; a magnetic field is generated by coils and 
permanent magnets controlled both, vertically and radially (Cselle et al. 2002; 
Rosso et al. 2008). 

Hybrid deposition techniques are used in order to enhance the strengths of the 
above-mentioned technologies and to avoid their weaknesses. Hybrid physical 
vapour deposition systems are often used combining ion plating and sputter plating 
to modify the properties of coatings, and to improve control over ion energy and 
the manageable coatings component or the pulsed laser ablation (PLD) of graphite 
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Fig. 1 Coating equipment with the LARC Technology (Lateral Rotating Arc-Cathodes) (Cselle 
et al. 2002) 
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Fig. 2 Schematic of deposition arrangements to produce DLC and CNx coatings by (a) pulsed laser 
deposition and (b) filtered cathodic arc deposition. A Ti magnetron was used to grow a functionally 
graded Ti-TiC-C interface by either a hybrid of laser ablation and magnetron sputtering (a) or a 
hybrid of filtered cathodic arc and magnetron sputtering (b) (Voevodin et al. 2005a) 


in combination with magnetron sputtering of titanium and filtered cathodic arc 
deposition (FCAD) of graphite with magnetron sputtering of titanium (Fig. 2) in 
order to prepare the unhydrogenated amorphous diamond-like carbon (DLC) and 
fullerene-like carbon nitride (Voevodin et al. 2005a). 
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Synthesis of PVD Coatings 

The main objective of the present section is to describe the particular formation 
stages of physical vapour deposition and to address the most common problems 
occurring in PVD coating deposition methods, as well as to outline the classical 
coating deposition models and phenomena taking place during deposition. 

Physical vapour deposition consists in the deposition of coatings under a reduced 
pressure with the presence of a gaseous phase (vapour) obtained as a result of 
physical phenomena. In general, there are a number of successive stages in forming 
a coating in a PVD process: 

• Vapours of the deposited material are produced; 

• Electrical ionisation of the provided gases and vapours of the deposited material 

takes place and the vapours developed are transported along the source-substrate 

path; 

• A coating is grown from the particles absorbed on the substrate; 

• The above process is enhanced, if needed. 

The physical deposition stages mentioned above can assume different forms and 
various intensities. The structure of coatings produced in PVD processes depends 
on multiple factors such as the kinetic energy of particles falling on the substrate, 
ionisation level, substrate temperature or pressure decisive for the coating growth 
mechanism. A nucleation process is connected with the adsorption of particles on 
the substrate surface and their migration at a different speed along the substrate, 
leading to the formation of so-called clusters. When moving along the surface, the 
clusters come across other clusters and join them until a critical cluster is formed. 
Then, the critical clusters become nuclei-islands which expand and form a contin- 
uous coating. The growth speed of particular grains is not the same, but it depends 
on the initial crystallographic orientation of the grain. This pattern corresponds to 
the vacuum deposition processes where a coating is formed from non-ionised 
particles. In such processes, the temperature of the substrate and the pressure are 
the parameters having primary influence on the coating structure. 

Three characteristic structure zones can be distinguished in a coating deposition 
model using classical vacuum deposition proposed by Movchan and Demchishin 
depending on the substrate temperature (Tp) referred to as the melting temperature 
of the film/layer (Tt). The structure of the first type is seen for with Tp/Tt <0.3, 
characterised by porosity convexity and irregular crystallites. At a higher substrate 
temperature for 0.3 < Tp/Tt < 0.5, crystallites assume a distinguished columnar 
structure, exhibiting surface microroughness. Finally, for Tp/Tt <0.5, a dense 
structure of the third type is observed, with large equiaxial grains, similar to the 
recrystallised structure. The properties of a coating in the classical deposition 
method can only be controlled by changing the substrate temperature. 

In a modified structural zone model of coatings fabricated by cathodic 
sputtering, the presence of the particular zones was connected not only with the 
temperature ratio of Tp/Tt but also with pressure in the sputtering zone (Fig. 3). 
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Fig. 3 Thorton structure zone model 


The first type structure has a porous micro structure with grains separated by voids, 
due to the low surface diffusion of atoms. In the second zone, the structure of 
columnar grains is present. Zone 3 possesses large grains, being the result of 
recrystallisation processes. There is a transition zone 4 (zone T - Thorton zone) 
of fine condensed fibre crystallites between zones 1 and 2, turning into columnar 
grains as the temperature grows (Fig. 4). The structure in this zone has the desired 
properties: high strength and hardness with low plasticity and high surface smooth- 
ness and favourable compressive stresses occur there. 

Physical vapour deposition processes with the presence of plasma possess many 
specific features which are not present in non-ionized gas crystallisation processes. 
The difference between these processes stem principally from the fact that not only 
uncharged particles but also charged ones (ions, electrons) are present in plasma. 
Such particles can be accelerated in the electric field, thus reaching a high kinetic 
energy. Coatings with pre-defined properties are developed, respectively, by chang- 
ing the pressure and energy of ions within a wide range of substrate temperature. 

Transition metals (generally Ti, V, Ta, Zr, Cr, Mo, W, Nb), reactive gases 
(nitrogen, oxygen) or vapours (e.g. of boron, silicon) as well as elements obtained 
from various chemical compounds (carbon) - forming nitrides, carbides, borides, 
sulphides and oxides with them with a high melting point - are used for manufactur- 
ing hard, anti wear coatings with PVD methods, designed for tribological applica- 
tions (Cheng et al. 2010; Dobrzanski and Golombek 2005; Dobrzanski et al. 2004, 
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2007, 2008a, 2010; Golombek et al. 2004; Mo et al. 2007; Veprek and Veprek- 
Heijman 2008). The compounds forming various types of coatings feature high 
hardness, high melting point, resistance to wear or corrosion and brittleness, as well 
as varying chemical composition, causing structure changes. 

Three groups of hard coating materials (metallic, covalent and ionic hard 
materials) can be listed here, differing by the nature of atomic bonds in them. 
The materials forming the coatings have mixed bonds and no pure forms of bonds 
occur in them. The mixed bonds present a complex combination of interactions in 
metal-metal, metal -nonmetal and nonmetal-nonmetal systems. The specific prop- 
erties of hard coating materials are determined by the type of the atomic bond 
dominating in the specific materials, which has been schematically presented in 
Fig. 5 as a triangle of bonds. Table 2 shows the physicochemical properties of hard 
coating materials depending on the type of atomic bonds. The table suggests that 
none of the listed material groups fully ensure the required properties sufficient for 
developing a coating with good versatile properties. Materials with a metallic bond 
have the properties closest to them, and are therefore used most widely (Dobrzanski 
2006). 

The coatings obtained in a PVD process can be divided into two basic groups: 

• Simple coatings - termed single-layer or mono-layer coatings, composed of a 
single material (metal, e.g. Al, Cr, Cu or phase, e.g. TiN, TiC); 

• Compound coatings, composed of more than one material where such materials 
occupy various positions in the coating produced. 
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Multi-component, multi-layer, multi-phase, gradient, composite and metastable 
coatings can be classified as modern, compound coatings deposited with the PVD 
method (Lukaszkowicz et al. 2010a, b, 2012a, b; Lukaszkowicz and Dobrzanski 
2008; Dobrzanski et al. 2005; Antonov et al. 2009): 

• Multi-component coatings where a subnet of one element is partly filled with 
another element. In a wide group of ternary compounds of carbon and nitrogen 
with transition metals, the solutions of TiN, VN, ZrN, TaN, CrN, HfN nitrides 
with carbides of the same elements have been investigated most extensively. 
Carbides and nitrides together form continuous solid (ternary or quaternary) 
solutions with better properties, especially tribological properties, than simple 
coatings; such properties may also be controlled using a wide range of relative 
solubility, distinctive for such solutions; 

• Multilayer coatings, also referred to as multilayers, are produced by alternately 
depositing the layers of various materials, mostly simple coatings with various 
properties; the individual layers forming a multilayer coating should provide the 
desired properties. According to their location, while creating transition zones 
between each other - they should guarantee fluent transition between properties 
that are frequently different. The inner layer, closest to the base being coated, 
should ensure appropriate adhesion to the base, the intermediate layer or layers 
should be hard and strong, while the outer layer should impart good tribological, 
anticorrosive or decorative properties; 

• Multi-phase coatings, representing a mixture of various phases, characterised by 
high resistance to abrasive wear; 

• Gradient coatings, constituting a variant of multilayer coatings with their chem- 
ical composition and properties changing seamlessly along their thickness; 
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• Composite coatings, being a particular type of multi-phase coatings, also 
representing a mixture in which one phase is dispersed in another occurring in 
a continuous way; 

• Metastable coatings, combining diverse properties of metallic materials with 
covalent ones, are formed as a result of synthesis of metastable phases, 
e.g. crystallising in a cubic system AIN, SiC (when their hexagonal varieties 
are at equilibrium) enabling to produce solution-enhanced coatings, e.g. (Ti,Al) 
N, (Hf,Al)N, (Ti,Si)C and (Ti,Al,Si)N. 


Another method of classification for coatings fabricated with PVD processes is 
by grouping them into: 


First-generation coatings represented by titanium nitride TiN, TiC; 
Second-generation coatings represented by titanium carbon nitride Ti(C,N), 
aluminium nitride and titanium nitride (Ti, A1)N, chromium nitride CrN, and 
some diamond like carbon (DLC) coatings; generally, second- generation coat- 
ings possess better functional properties for various applications as compared to 
first-generation coatings; 

Third-generation coatings represented by multi-component and/or multilayer, 
gradient coatings, e.g. TiZrN, TiCrN, (TiAlV)N, (TiAlSi)N; 

Fourth-generation coatings represented by nanostructure, nanocomposite and 
adaptive coatings. 


A new generation of coatings, with multilayer structures modulated at a nano 
scale with varied chemical composition, was achieved by decreasing the thickness 
of layers forming part of multilayer coatings and by refining their structure more 
intensively. Such coatings are generally composed of nitrides, carbide-nitrides or 
carbides, as well as oxides representing binary, ternary or quaternary systems in 
various combinations of elements, such as: Ti, V, Cr, Zr, Nb, Mo, Hf, Ta, W, and 
Al, Si, and sometimes B and C (e.g. cBN or CNx). Frequently, such coatings 
contain elements that do not form nitrides or carbides, such as Cu, Ag, Au or Pd, 
their task being, for example, to refine the structure. The substances forming part of 
multilayer coatings at a micro and nano scale have a crystalline structure (are in the 
state of thermodynamic equilibrium) or are in the amorphous form (in the 
non-equilibrium state); the coating is also constituted of a mixture of amorphous 
and crystalline components. When depositing hard coatings with the PVD method, 
phases are encountered that are formed in them, with different relationships 
between them, such as: total mutual solubility (e.g. TiN-TiC, TiC-WC), total lack 
of miscibility (e.g. TiC/Ag, WC/Ag) and coexistence of crystalline and amorphous 
phases (nanocrystalline grains of titanium nitride with an intergranular amorphous 
a-SiNx phase). The coatings are called nanolayer or nano structural coatings when 
the thickness of the layers forming part of the coatings or their grain size exceeds 
100 nm. The formation of phases and their separation in multiphase and multilayer 
coatings, taking place during the coating deposition process itself, lead to unusual 
properties of such coatings, such as, for example, their extremely high hardness 
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Fig. 6 Schematic illustration of materials hardness versus grain size 


reaching even 70 GPa, much higher than the hardness of the layers or phases 
forming part of the coatings, which may be true especially for nano coatings 
(Voevodin et al. 2005b; Dobrzanski et al. 2006; Musil 2012; PalDey and Deevi 
2003; Vavra et al. 2012; Wang et al. 2012; Zou et al. 2010). 

Similar to conventional polycrystalline materials, according to the Hall-Petch 
relationship, as the grain size decreases so increases coating hardness. The same is 
true for nanocomposite coatings. This phenomenon is especially important for 
decreasing grains to a critical value of about 10 nm. For a structure with a grain 
size smaller than the critical value, a material’s hardness lowers as a new deforma- 
tion mechanism emerges, i.e. a reverse Hall-Petch relationship (Fig. 6) (Zhang 
et al. 2007; Voevodin et al. 2005b; Koch et al. 2007). 

According to the Hall-Petch equation, the strength properties of the material rise 
along with the reduction of the grain size. For coatings deposited with PVD 
processes, the resulting structures with the grain size of ~10 nm, ensure maximum 
mechanical properties. Coatings with such structure present very high hardness of 
>40 GPa, and ductility and stability at high temperatures, etc. (Tjong and Chen 
2004; Veprek et al. 2006). 

The dependency known between hardness and abrasion resistance has become a 
basis for developing increasingly harder coating materials. Progress in the field of 
coatings fabrication by physical vapour deposition and/or chemical vapour depo- 
sition allows production of coatings with a nanocrystalline structure with high 
mechanical and functional properties. Coatings with such structure can maintain a 
low friction coefficient (self-lubricating coatings) in numerous working environ- 
ments, while maintaining high hardness and elevated resistance (Voevodin 
et al. 2005b; Donnet and Erdemir 2004). 

According to the model proposed by Veprek (1997), nanocomposite coatings 
comprise at least two phases, a nanocrystalline phase and a matrix phase, where the 


2734 


LA. Dobrzariski et al. 



Fig. 7 Model and TEM image of nanocomposite structure (Source: PLATIT AG) 

matrix can be either a nanocrystalline or amorphous phase. In this system, the 
nanocrystalline grains must be ~10 nm in size and be separated by 1 A- 3 nm within 
an amorphous phase as shown in Fig. 7. The main concept for achieving the high 
hardness of nanostructure coatings and good mechanical properties and the high 
related strength, particularly in the case of nanocomposite coatings, is to limit the 
origin and movement of dislocations. The high hardness and strength of 
nanocomposite coatings derive from the fact that the movement of dislocations is 
suppressed at small grains and in the spaces between them, causing incoherent 
deformations. When the grain size is reduced to that of nanometres, the activity of 
dislocations - as the source of material ductility - is limited. This type of coating is 
also characterised by a large number of grain boundaries with crystalline/amor- 
phous transition across the grain-matrix interfaces, preventing the formation and 
development of cracks. Such mechanism explains the resistance of nanocomposite 
coatings to brittle cracking. At the same time, equiaxial shapes of grains, high angle 
grain boundaries, low surface energy and the presence of the amorphous boundary 
phase facilitating the slide along the grain boundaries ensure the high plasticity of 
the nanocomposite coatings (Voevodin et al. 2005b; Rafaja et al. 2006). 

Nanocomposite coatings are usually formed from a ternary or quaternary system 
with nanocrystalline (nc-) grains of hard transition metal -nitrides (e.g. TiN, TiAIN, 
CrN, BN, VN, ZrN, W 2 N, etc.), carbides (e.g. TiC, VC, WC, etc.), borides 
(e.g. TiB 2 , TiB, CrB 2 , etc.), oxides (e.g. A1 2 0 3 , Ti0 2 , B 2 0 3 , Si0 2 , etc.), or silicides 
(e.g. TiSi 2 , CrSi 2 , ZrSi 2 , etc.) surrounded by amorphous (a-) matrices (e.g. Si 3 N 4 , 
BN, C, etc.). There are many different nanocrystalline grain/amorphous matrix 
(nc-/a-) systems, such as: nc-TiN/a-Si 3 N 4 , nc-TiN/a-Si 3 N 4 /a-&nc-TiSi 2 , nc-CrN/a- 
Si 3 N 4 , nc-W 2 N/a-Si 3 N 4 , nc-TiAlN/a-Si 3 N 4 , nc-TiN/a-BN, nc-TiC/a-C, nc-WC/a-C 
(Zhang et al. 2007; Cheng et al. 2010; Veprek and Veprek-Heijman 2008; 
Voevodin et al. 2005b; Musil 2012; Zou et al. 2010). The synthesis of such 
nanocomposite coatings depends on the ability to co-deposit both phases. 

Based on the nanocomposite coatings created, the adaptive coatings, where the 
previously achieved nanocomposite layers were put together with layers in the form 
of a diffusion barrier and a self-adapting surface was achieved as a result, resistant 
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to cyclic temperature changes. New materials were subsequently developed resis- 
tant to wear, as a combination of nanocrystalline-based carbides (TiC, WC), 
ceramics (YSZ and AlON), bisulphites (MoS 2 i WS 2 ) and amorphous DLC for 
achieving the nanocomposite structure of the coating. A composite coating pro- 
duced within the W-C-S system consisting of 1 -E 2 nm WC and 5 10 nm WS 2 

grains embedded in the amorphous DLC matrix is an example of “chameleon” 
coatings. A WC/DLC/WS2 nanocomposite is self-adaptive to tribological condi- 
tions existing in an aerospace system (Voevodin et al. 2005b). Figure 8 shows a 
scheme of a “chameleon” coating of the YSZ/Au/DLC/MoS 2 type as well of the 
WC/DLC/WS 2 type, where an amorphous matrix and hard nanocrystalline phases 
(YSZ or WC) were used for achieving optimum mechanical properties. 


Preparation of Substrate Surface 

Physical vapour deposition method uses such physical phenomena as the evapora- 
tion of metals or alloys, or vacuum cathodic sputtering and gas and metal vapour 
ionisation with the use of various physical processes. Their common characteristic 
is the crystallisation of metal vapours or phases from plasma. Coatings are depos- 
ited on a base heated to 200-500 °C or a cold base, which allows coating of 
hardened and tempered bases without worrying about reducing their hardness. 
The coating-base combination is of adhesive or adhesive-diffusion nature, which 
is proven by the concentration of elements in the matrix and coating, changing in 
the transition zone. The combination of the coating-substrate is stronger when the 
coated surface is cleaner. In order to obtain the appropriate cleanness of the base 
surface, the latter is prepared, cleaned and activated prior to coating deposition. 

The physicochemical properties, morphology and mechanical properties of the 
substrate surface considerably influence the adhesion and formation process of 
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coatings and, consequently, their final properties. In some cases the surface must be 
cleaned while elsewhere it may be modified by chemical, thermal or other means to 
provide a more desirable surface. In the majority of cases, the desirable condition of 
the substrate surface is obtained through cleaning to remove undesirable impurities. 
Cleaning can be grouped into ex situ cleaning carried out outside the coating 
deposition system, such as stripping, chemical etching, fluxing, deburring, solvent 
cleaning, reactive cleaning, ultrasonic cleaning, rinsing, outgassing, and into 
cleaning carried out inside the deposition system (in situ), such as: plasma cleaning, 
sputter cleaning, laser cleaning, electron desorption. 

In all the coating deposition methods, the good adhesion of the coating to the 
base, and the consequent functional properties of coatings depend on the appropri- 
ate preparation of the base surface, which in turn depends on the item coated and its 
intended use. If the coating is to satisfy its purpose, the surface of the coated item 
should have proper: 

• Hardness achieved in thermal treatment (e.g. hardening and tempering) or 
thermochemical treatment (e.g. nitriding, chroming), less frequently mechanical 
treatment; 

• Smoothness, the surface should be smooth (ground or polished to Ra < 0.8) 
without burrs on peripheries; 

• Cleanness - no particles of mechanical impurities (dust, dirt), organic impurities 
(greases, lubricants, maintenance agents, sweat), chemical reaction products 
(corrosion products, e.g. oxides, sulphides) may be present on the surface. 

The surface preparation process consists of the two main phases: 

• A preliminary phase where mechanical, organic and chemical impurities are 
removed from the surface of the material to be coated, carried out through the 
following types of cleaning: 

- Mechanical cleaning, e.g. removal of mill scale and permanent tarnishes with 
sand-blasting, barrelling, abrasion methods; 

- Chemical cleaning, consisting in the removal - in washing baths - of, 
e.g. organic grease with the method of saponification in base or acid baths, 
in organic solvents and the removal of thin surface layers (oxides, sulphides, 
corrosion products); 

- Physical cleaning by removing impurities with solving or emulsification 
methods in washing baths; 

- Physical and chemical cleaning by removing more persistent impurities in wash- 
ing baths (e.g. organic solvents, liquid freon) subjected to ultrasonic vibrations; 

• Final (ionic) cleaning carried out in a chamber directly prior to the coating 
deposition process to clean the surface thoroughly, activate it and heat an item 
to the desired temperature - such process is performed through ion etching. 

The morphology of the substrate surface, and in particular waviness and rough- 
ness, are considerably influencing the properties of the deposited coatings. 
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Appropriate substrate hardness also has a decisive impact on the behaviour and 
durability of the deposited coatings, in particular when subjected to high loads. 
Adequate chemical and/or phase composition of the substrate surface is essential to 
achieve the high adhesion of the coating to the substrate material. The properties of 
coatings manufactured with PVD processes are greatly influenced by all kinds of 
substrate surface modification processes aimed at improving its physicochemical 
properties such as: polishing, surface hardening by thermochemical treatment 
(nitriding, cyaniding) and surface activation using plasma. 


Applications of PVD Coatings 

PVD methods are applied to fabricate coatings for diverse applications (Table 3) 
(Vavra et al. 2012; Lukaszkowicz and Pancielejko 2012; Bobzin et al. 2009; 
Dobrzanski et al. 2008b; Kim and Choe 2012; Lecis et al. 2012; Mishra 
et al. 2012; Subramanian et al. 2013). The basic group of PVD coatings are those 
resistant to wear on tools and on structural parts. This group essentially comprises 
nitrides and carbides as well as oxides of transition metals and diamond coatings, as 
well. Such coatings, due to their unique properties, in particular very high hardness 
and resistance to abrasion, substantially contribute to the improved life of tools and 
structural parts coated with them. 

Another group of coatings are PVD coatings employed in biomedical engineer- 
ing for medical implants and surgical tools and instruments. The main purpose of 
the coatings is to enhance corrosive resistance to reduce a friction factor in 
acetabulums in osseous system implant coatings. Such coating materials must 
exhibit high biocompatibility. Apart from metallic coatings, this group also 
embraces carbon or hydroxyapatite coatings used for simulating bone growth in 
less severe cases of bone loss and for covering implants (e.g. hip osteoporosis). 

Moreover, PVD coatings have been used in electronics, optoelectronics and 
photovoltaics as antireflective layers produced from titanium oxide and as 
so-called top and bottom metal plating of photoelements produced from pure Ag. 

Intense advancements in the deposition techniques of hard, antiwear coatings have 
nearly removed the impediments related to machining, such as: machining of mate- 
rials hard in machining, particularly metallic-matrix composites, superalloys based on 
nickel and cobalt, materials used in the space and military industry, precise formation 
of hardened steels and cast iron and dry treatment at high machining rates. The main 
benefits of the deposition of antiwear coatings on machining tools include: 

• Reduced manufacturing costs due to extended tool life; 

• Enhanced machining efficiency related to higher machining rates and feed rates 
and limited service downtimes for necessary tool replacement; 

• Improved quality of the treated materials. 

Adequate coating material selection is conditioned by the limitation of wear, 
thermal, mechanical or corrosion processes which are decisive for tool damage. 


2738 


L.A. Dobrzanski et al. 


Table 3 Applications of PVD coatings 


Tribological 

applications 

Biomedical 

applications 

Photovoltaic and optoelectronic 
applications 

Decorative 

applications 

Cutting tools 

Surgical tools 

Antireflective layers 

Imitation 

Plastic forming 

Biomedical 

Metal plating 

jewellery 

tools 

Structural parts 

implants 

Semiconductor layers 



The most popular coatings currently manufactured on machining tools include: 
TiN, TiCN and TiAlN. Clearly improved functional properties of tools deposited 
with hard coatings result from the following characteristics: appropriate hardness, 
oxidation resistance, high chemical stability, high abrasion and erosion resistance 
and low friction coefficient in contact with the treated material. A major factor 
conditioning the properties of tools with antiwear coatings applied is the adhesion 
of hard coatings to the base. High coating adhesion is primarily the outcome of an 
adequate base cleaning process in an argon glow discharge. The surface topography 
of the coated tools is also paramount for their life as this affects the crystallisation 
and nucleation process of thin films related to the so-called masking effect. The 
effect consists of limiting the surface diffusion of the absorbed atoms with insuf- 
ficient energy required for overcoming surface irregularities. A porous, coarse 
coating with adverse properties, e.g. low hardness, may be developed due to this 
effect. The applicational efficiency of coatings deposited with the PVD method is 
best evidenced by the outcomes of comparative tests results carried out simulta- 
neously for coated and uncoated tools. 

In addition to TiN, TiCN and TiAlN coatings that have been in common use, 
very good functional properties should be expected for multilayer coatings, espe- 
cially for a so-called poly crystalline superlattice, e.g. TiN/VN, TiN/Si 3 N 4 deposited 
on tools. The hardness of the TiN/VN-type superlattice may even be 60 GPa with 
higher impact strength than for single TiN or VN coatings. Another group of 
modern coating materials deposited on tools are coatings obtained based on meta- 
stable solid solutions of multi- component and multi-phase composition, 
e.g. TiC/TiB 2 , TiAlVN, TiAlSiN and TiAlN-TiNbN. 

The application of hard, antiwear coatings deposited onto tool materials with 
physical vapour deposition (PVD) methods is regarded to be one of the crucial 
developments in recent years relating to the improvement of operating characterises 
of tools. By depositing anti wear coatings on the investigated sintered carbides and 
tool cermetals, the strength of the tools and quality of the treated surface can be 
clearly improved, treatment costs can be lowered and working fluids eliminated in 
machining. The widespread application of sintered carbides and cermetals in 
machining technology with compound coatings deposited on them in PVD pro- 
cesses has been contributing to increased attention to the state-of-the-art near-net- 
shape technology, i.e. the manufacture of semi-products with their shape and 
dimensions as close to the ready product as possible. 

The relative significance of each property defining a coating can be characterised 
and assessed in a different way, depending on whether the coating is investigated in 
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Table 4 The most 
important properties of 
coatings 


Ranking 

Production 

Application 

1 

Adhesion 

Tribological properties 

2 

Thickness 

Chemical composition 

3 

External appearance 

Adhesion 

4 

Resistance to corrosion 

Resistance to corrosion 

5 

Tribological properties 

Internal stresses 

6 

Hardness 

External appearance 

7 

Porosity 

Structure/morphology 

8 

Chemical composition 

Resistance to fractures 

9 

Internal stresses 

Thickness 

10 

Structure/morphology 

Porosity 


terms of its fabrication or research, or its application. In the first case, the main 
emphasis is placed on quality control and product repeatability, while in the second 
case the key objectives are the properties of coatings and the impact of various 
process parameters on developing a coating meeting the desirable functional 
properties. The most important properties of coatings in terms of production 
process or research and their application are presented in Table 4. 


Properties of PVD Coatings and Testing Methods 

In order to ensure a coating satisfying the desired requirements, appropriate instru- 
ments and procedures must be put in place to determine its structure, chemical and 
phase composition determining the desirable application properties of the coating, 
and, consequently, the properties of the tool covered by such a coating. 

The present section is devoted to the characterisation of the structure and 
properties of coatings deposited in a physical vapour deposition process. In order 
to identify the structure and properties of coatings deposited in a physical vapour 
deposition process, very sublime research methods are applied. The present chapter 
describes the most important research methods (Table 5) for characterising the 
structure of coating materials and coatings properties. 

Rapid developments in PVD processes have contributed to the commercial 
application of coatings with specific properties. Various coatings obtained in the 
physical vapour deposition process are more and more frequently applied to 
improve the functionality of various materials. PVD coatings have been applied 
in optic and microelectronic engineering, biomedicine, aeronautics and aerospace 
industry, power engineering, automobile industry, civil engineering and machinery 
sector. Higher strength, reduced wear rate, improved resistance to high tempera- 
tures, low thermal conductivity and reduced oxidation or corrosion processes have 
all considerably contributed to the application of such coatings. 

When considering the application of coatings fabricated with PVD methods, 
the desirable requirements involve principally the maintenance of mechanical 
properties of the substrate by the coating and the improvement of tribological and 
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Table 5 Investigations methods of PVD coatings 


Chemical 

composition 

Phase composition 

Microstructure 

Properties 

EDS - energy- 

XRD - X-ray 

SEM - scanning electron 

Adhesion 

dispersive X-ray 

diffraction 

microscope 

Microhardness 

spectroscopy 

GIXRD - grazing 

AFM - atomic force 

Residual 

AES - Auger 

incident X-ray 

microscopy 

stresses 

electron 
spectroscopy 
XPS - X-ray 
photoelectron 
spectroscopy 
RBS - Rutheford 
backscattering 
spectroscopy 

diffraction 
EBSD - electron 
backscatter 
diffraction 

TEM/HRTEM - high- 
resolution transmission 
electron microscopy 

Roughness 


anticorrosion properties, depending on the intended application of the coatings. 
The correlation between damage type of the coating and the external factors acting 
on the coating permits development of new coatings satisfying the requirements of 
“an ideal coating” (Lukaszkowicz and Dobrzanski 2008; Lukaszkowicz and 
Pancielejko 2012). 

When discussing the properties produced in PVD processes, particular attention 
should be paid to the problems connected with the following: 

• Mechanical properties (adhesion, hardness, internal stresses, longitudinal mod- 
ulus of elasticity, etc.); 

• Physical properties (density, thermal conductivity, coefficient of thermal expan- 
sion, melting temperature, friction factor, etc.); 

• Resistance to abrasive wear (tribological properties); 

• Anti-corrosion, diffusion and thermal protection; 

• Structure, chemical composition and coating thickness. 

Adhesion is one of the most important properties of coatings. When it is 
inappropriate, the entire functionality of a coating can be compromised. It is 
difficult to explicitly define the adhesion of coatings to the substrate material 
influenced by material microstructure, external load and environmental aspects of 
the surroundings. Many techniques have been established to characterise the adhe- 
sion of a coating to a substrate. 

Scratch test is one of the most popular techniques used to identify the charac- 
teristics and to control mechanical-adhesive and cohesive strength of hard coatings 
obtained in PVD processes. The method consists of scratching a sample surface 
with a diamond blade by applying a gradual or continuous load, called normal 
critical load L c , causing damage to the sample. The diamond blade is performing 
frictional work while scratching, which can be expressed by means of the following 
formula: 
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where: F f - friction force, F N - normal force. 

A critical load registered during an adhesion test is not only dependent upon 
mechanical strength (adhesion, cohesion) of the coating - substrate configuration, 
but also on the various related coating - substrate parameters (hardness and 
roughness of base, hardness and roughness of coating, coating thickness, a friction 
coefficient between the coating and the indenter, indenter tip radius and its wear). 
An interface between the coating and the base is, therefore, important for 
establishing a reliable and durable coating. The adhesion of the relevant coating 
is improved by applying a thin intermediate layer, e.g. Ti or Cr because a soft 
titanium or chromium layer reduces stresses and prevents the propagation of cracks. 

A load at which the first coating damages occur is termed in the literature as the 
first critical load (L C i). The value of the first critical load is coupled with cohesion 
damages related to the spalling of the material inside the coating. Such damage is 
represented by a weak acoustic emission signal and is practically unnoticeable on 
the chart. However, an experienced observer using a light microscope may detect 
such damages. The critical load L C2 may be determined with the three methods: 


• Optical observation with a microscope - a point needs to be found in which more 
than 50 % of the coating is removed from the base; identifying the nature of the 
coating damage; 

• Sound emission analysis - rapid drop of sound emission on the chart; 

• Analysis of the friction force curve - a load characterised by total coating 
damage is considered to be a bending point of a rising friction force curve on 
the chart; such point corresponds to the first contact of the diamond indenter with 
the base, when extensive coating spalling occurs. 


While investigating coating adhesion with the scratch method, damages are 
produced that Burnett divided as follows (Burnett and Rickerby 1987): 


Spalling failure; 
Buckling failure; 
Chipping failure; 
Conformal cracking; 
Tensile cracking. 


Making a scratch is not of the destructive nature, hence tests can be carried out 
directly on the coated items. A scratching test can be considered a sum of the three 
components: 

• State of stresses when forcing in; 

• Internal stresses; 

• Friction. 
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Fig. 9 Map of Huber’s reduced stresses 


Hardness (microhardness) is the basic property of thin coatings. It is particularly 
important during the process of abrasive wear taking place during the contact of the 
coating with hard materials. Hardness can be easily measured with various methods 
and is connected with other properties of coatings, such as: strength, plasticity, 
internal stresses and structure. Usually it is defined as the resistance of the material 
to a local plastic deformation. 

Internal stresses are an important issue connected with thin coatings produced 
with PVD processes (Fig. 9). Mechanical properties such as hardness, adhesion and 
tribological properties of the deposited coatings strongly depend on the value and 
spatial distribution of internal stresses. 

The said stresses are formed due to a misfit developed between the coating and 
the substrate material due to temperature gradient (thermomechanical stresses) and 
due to growing stresses originating from rapid solidification and due to the intensity 
of bombarding with ions. Generally it is assumed that internal stresses consist of 
constituent internal and external stresses formed due to the difference between the 
coefficients of thermal expansion of the coating and substrate. 

Excessive tensile stresses in coatings may cause the formation of fractures 
spreading across the coatings, while high compressive stresses may delaminate 
coatings from the substrate surface on which they have been deposited. Hence the 
necessity to define internal stresses in coatings is becoming a very important factor 
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and involves quality control for deposited coatings. In most of the cases, internal 
stresses can be measured with diffraction methods or by measuring a mechanical 
curvature of the specimen. 

One of the overall objectives of coating deposition with PVD methods is to 
develop a protective film on the substrate surface with abrasive resistance higher 
than the substrate. Reduced friction due to a lower friction coefficient of the 
coatings alone brings substantial economic benefits relating to the use of items 
coated with such protective coatings. Tribology, investigating the phenomena 
taking place between the interacting surfaces being in relative motion, is important 
for recognising the mechanism of material wear and for identifying methods of 
counteracting this negative phenomenon. A pin-on-disc test is used most frequently 
to define resistance to abrasive wear. 

Particularly useful for further improvements in a coatings deposition process and 
for identifying the cause of their damage or faults is different information 
connected with the structure of coatings. The structure of coatings is strongly 
correlated with their physical properties. The knowledge of the topography of 
coating surfaces, their morphology, definition of the grain size and shape, faults 
of structure, texture, crystallographic orientation, definition of phases, etc. appears 
to be of particular importance. X-ray diffraction research (XRD), tests with the 
transmission electron microscope (TEM), scanning electron microscope (SEM) and 
the atomic forces microscope are the ways to make a complex analysis allowing 
recognition of interesting details involving the crystalline structure of coatings. A 
chemical composition analysis allowing to identify chemical elements and to 
determine their concentration, both, on the surface and deep into the material, 
using, for example, the Auger’s electron spectrometry method, tests with the 
X-ray photoelectron spectroscope (XPS) or glow discharge optical spectroscope 
(GDOS) (Fig. 10), are closely linked to this. 

The high resolution transmission electron microscope (HRTEM) has become a 
powerful tool for characterising coatings produced by PVD techniques (Fig. 11) 
due to its high imaging spatial resolution (~0.1 nm). High-resolution image char- 
acteristics can be interpreted directly according to the projections of individual 
atomic positions using adequately prepared specimens (a specimen material needs 
to be specially prepared to obtain thickness allowing electrons to be transmitted 
through) and appropriate operating conditions. By working with the HRTEM, the 
atomic-scale micro structure of lattice defects and other inhomogeneities can be 
determined. Additional information can be obtained from HRTEM studies, includ- 
ing unique observations into the controlling impact of structural discontinuities on 
the range of physical and chemical processes such as phase transformations, 
oxidation reactions and epitaxial growth (Materials Evaluation and Engineering, 
Inc. 2010; Smith 2005). 

Scanning electron microscopy (SEM) is a very useful technique for 
characterising thin films and undoubtedly the most popular one in terms of all 
electron-beam instruments. SEM is a method for high-resolution imaging of sur- 
faces (Fig. 12). The advantages of SEM include high magnification (> 1,000, 000 x) 
and great depth of field. The modem scanning electron microscopes typically have 
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Fig. 10 Changes of 
constituent concentration of 
the CrAlSiN + DLC and the 
substrate materials (GDOS) 



Fig. 11 TEM bright-held 
images and electron 
diffraction patterns of 
CrAlSiN coating 



a resolution of <10 nm (image resolution of about 0.5 nm can now be achieved in 
the last-generation field-emission-gun SEM). The incident electron beam scans the 
specimen surface to create a SEM image. When an electron beam interacts with a 
bulk specimen, different signals may be generated. Figure 13 shows the various 
types of electrons and electromagnetic radiation produced when a high-energy 
electron beam interacts with a bulk specimen (Liu 2005). 

X-ray photoelectron spectroscopy (XPS) is a useful analysis technique 
for obtaining chemical information about different elements at the coating surfaces. 
In XPS, a sample is placed in a high- vacuum environment and a beam of low-energy 
X-rays (or electron) irradiates the sample and produces photo-ionisation. 
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Fig. 12 Fracture of the 
Ti/CrN x 15 coating 
deposited on the CuZn40Pb2 
substrate 
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Fig. 13 Schematic view of 
the different signals generated 
after an electron beam 
interacts with a bulk specimen 



The energy of the ejected photoelectrons is the function of its binding energy and is 
a characteristic of the element from which it was emitted. Important information 
about the examined samples can be acquired based on the binding energy (Fig. 14); 
its constituent elements, a relative quantity of each element and the chemical state 
of the elements. 

Auger electron spectroscopy (AES) is an analytical technique providing infor- 
mation about the chemical composition of the outermost part of the coating material 
(Fig. 15). The advantages of the AES include excellent spatial resolution, surface 
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Fig. 14 Typical XPS core-level spectra of C 1 s from CrAlSiN + DLC films 


sensitivity (~20 A), and detection of light elements (Materials Evaluation and 
Engineering, Inc 2010). Table 6 presents different penetrations and accuracies of 
the selected methods of composition analysis. 

Raman spectroscopy is a chemical analysis method based on the inelastic 
scattering of monochromatic light, usually from a laser source. The wavelengths 
and intensities of the scattered light can be used to identify functional groups in a 
molecule (Fig. 16). Raman spectroscopy can be used for establishing the crystalline 
quality of DLC films. 

Atomic force microscopy (AFM) has become a powerful tool for detecting 
structural changes at nanoscale. AFM has been mainly applied for high-resolution 
examinations of coating surface topography (Fig. 17) and for examining 
local thermal, magnetic, electric and mechanical properties (Magonov and 
Yerina 2005). 

One of the overriding functions of a PVD coating is to produce a protective film 
on the base surface with wear resistance higher than the base. As compared to 
uncoated metallic materials, items with hard coatings with metal nitrides deposited 
onto them possess higher durability, not only in contact with hard materials but also 
in a chemically aggressive environment. Such outcome is achieved with smaller 
grain size and an amorphous structure of coatings. Hard coatings, provided they 
have adequate thickness, are usually chemically resistant in low temperatures, are 
seal-tight and have no columnar structure. However, most PVD coatings have a 
high defect density in the form of pores and columnar micro structures with the 
aggressive factor penetrating into the material through the coating. An important 
group of coatings applied for corrosion protection is represented by new methods of 
manufacturing surface coating layers, including multilayer coatings, produced by 
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Fig. 15 SEM picture of 
transition zone of sputter 
crater (a, b) and line profile of 
component TiCN coating (c) 
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Table 6 Composition analysis methods (Zhang et al. 2007) 


Analysis 

method 

Element 

range 

Detection limits (at.%) 

Spatial 

resolution 

Penetration 

XPS 

Li-U 

0.1 : 1 

100 pm 

1.5 nm 

AES 

Li-U 

0.5 

10 nm 

0.5 : 7.5 

nm 

RBS 

Li-U 

1.0 

1 4 mm 

2 -7- 30 nm 

EDX 

Be-U 

0.1 

0.5 A 2.0 pm 

1 A 3 pm 
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Fig. 16 Raman spectra of the DLC him 
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Fig. 1 7 AFM topography of the CrN/DLC coating 
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depositing various materials onto each other, most frequently simple coatings 
with various properties. A multilayer coating deposition system offers greater 
flexibility in preventing the causes of corrosion such as pores or gaps. Each such 
defect appearing on a single film deposited in a coating deposition process can be 
neutralised by the subsequently deposited layers. This allows extending or 
constraining the path of the corrosive factor. As regards electrochemical corrosion, 
apart from creating an antidiffusion barrier, other properties of coatings are impor- 
tant, including an electrochemical potential in relation to the base. The majority 
of coatings manufactured in a PVD process usually have a positive potential in 
water environments as compared to the protected base. Such coatings are classified 
within the group of cathodic coatings and may fulfil their task providing they 
are seal-tight and, in addition to their own chemical stability, have the physical 
capacity to prevent the penetration of an aggressive solution into the base. 
In analysing the progress of electrochemical corrosion it is especially helpful 
to determine anodic polarisation curves. Apart from their anticorrosive properties 
of the coating, PVD coatings ensure an effective diffusion and thermal barrier. 
Coatings containing Mo, Zr and their compounds have particularly robust proper- 
ties at high temperatures. 

Hard PVD coatings are significantly increasing the abrasive wear resistance of 
the materials deposited. Reduced friction due to a lower friction factor of such 
coatings brings increasingly higher economic benefits related to the use and oper- 
ation of items covered with protective coatings. Wear is defined as a process of the 
gradual loss of material from the coating surface as a result of a mechanical load, 
relative permanent motion, an interworking fluid or gas material or environment. 
Important for identifying the mechanisms of material wear and methods of 
preventing this adverse phenomenon is tribology, investigating the phenomena 
between the mutually reacting surfaces being in relative motion. Wear is also 
associated with the transfer of material and changes in its properties depending 
on friction. The roughness of the coating surface affects abrasive wear greatly. 
A group of physical and chemical processes taking place in the outer layers of an 
item being in contact with another item is defined as wear mechanisms. There are 
several wear mechanisms: 

• Abrasive wear - a process where hard erodent particles penetrate a relatively 
softer surface consequently causing material scratches and removal, resistance to 
abrasive wear is growing as material hardness increases; 

• Adhesion wear - occurs during the contact of two materials when relative 
pressure is so high that it causes local plastic deformations and microadhesion 
as a result of which material is transferred from a soft body to a hard body; 

• Fatigue wear - occurs when the contacting surfaces are subjected to the cycles of 
variable loads and relief; 

• Diffusion wear - takes place by creating a high temperature within the range of 
700-1,200 °C, most frequently in high-speed mechanical treatment, and the rate 
of diffusion wear depends on the ambient and chemical conditions of the 
properties of the contacting materials; 
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• Erosion wear - is a variant of abrasive wear where material loss from the surface 

takes place as a result of relative motion with liquid or gas containing solid 

particles. 

There are a number of different tribological tests carried out in several research 
laboratories. However, there are just a few studies used as standards or representing 
standard tests. A pin-on-disc test is the most popular test for determining resistance 
to abrasive wear. An erosion test with the use of powder erodent is another 
relatively simple and popular method for determining material strength, operating 
properties and class. 

Epitaxy seems to be a significant term describing an immensely important 
phenomenon presented by thin coatings. A single coating crystal is developed on 
the top of the crystals forming the base in this phenomenon. There are two types 
of epitaxy having great scientific and technical importance. The first type, 
called homoepitaxy, is observed when a coating and base are made from the 
same material while the other type, so-called heteroepitaxy, is when a coating 
and base are made of different material. When the crystal structure of coating 
and base crystals have identical parameters, no stresses are present near the 
surface. As for heteroepitaxy, three types of epitaxy can be distinguished 
depending on different parameters of the crystal structure of the coating and 
the base subject to the unmatched area. If the unmatched area of the crystal 
structure is very small, heterobonding near the surface is similar to that of 
homoepitaxy. In this case it is defined as matched epitaxy. However, differences 
in chemical composition and thermal expansion coefficient may affect the 
quality of the joint between the coating and the base. When the parameters 
of the crystal structure of the coating material and base vary largely, strained 
epitaxy or relaxed epitaxy is distinguished. Most frequently, epitaxy is examined 
with the use of the scanning tunnel microscope (STM) or transmission electron 
microscope. 

Thickness is the first parameter characterising a coating. This suggests that the 
protective, technical or decorative properties of coatings depend on their thickness. 
Growing thicknesses, in turn, are beneficial for protective properties and wear 
resistance while impact resistance and elasticity are decreasing. There are many 
methods for determining coating thickness. Optical microscopic measurements and 
mechanical methods are among the most popular ones, including the most popular 
method called “calotest.” 

The relative importance of each property describing a coating can be 
characterised and assessed in various ways, depending on whether a coating 
is considered in terms of production, research or application. In the former 
case, the main stress is put on quality and product repeatability control, 
while in the latter - specific properties of coatings and the influence of various 
process parameters on obtaining coatings with desired usable properties is clearly 
highlighted. 
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Future Trends 

Further research activity in the field of PVD films and coatings should be concen- 
trated on the following problems: 

• Development of films with controlled grains size; 

• Development of a new technology system for the production of coatings in new 
physical and/or chemical conditions; 

• Higher utilisation of coatings in multiple types of applications including those in 
aerospace and automotive industries; 

• Development of protective coatings with oxidation resistance above 2,000 °C; 

• Development of hybrid coatings with nanophase biomaterials for biomedical 
applications; 

• Nanocrystallisation of amorphous materials; 

• Development and scale-up of deposition processes; 

• Development of coatings with high fracture strength; 

• Development of new advanced coatings with unique physical and functional 
properties. 

Hybrid methods are presented in numerous research studies and practical appli- 
cations combining the characteristics of not only PVD and CVD technologies but 
also of structure and surface properties formation technologies of engineering 
materials, including, e.g. thermochemical treatment methods. Hybrid technologies 
merge at least two treatment methods, e.g. nitriding combined with physical vapour 
deposition (PVD) and carburising combined with chemical vapour deposition 
(CVD). The methods allow to produce different types of coatings, from simple to 
compound coatings, and to implement several surface treatment methods, hence 
assuring repeatable properties of coatings and a highly efficient coating deposition 
process, appear to be most promising. Rapid advancements in hybrid technologies 
may be expected with the use of perspective plasma (ionic, glow) nitrification 
coupled with the physical vapour deposition technology. 


Summary 

A wide selection of the currently available types of coatings and deposition 
technologies derives from a growing demand, in recent years, for the state-of-art 
material surface modification and protection methods. From among a myriad of 
techniques enhancing the strength of materials, PVD (physical vapour deposition) 
methods play an important role in industrial practise. 

Many new PVD coating materials have emerged due to dynamic development 
seen in surface structure and properties formation for parts of machines and tools, 
including monolayer, multilayer, multicomponent, graded, nanocrystalline and 



L.A. Dobrzanski et al. 


nanocomposite and adaptation coatings. An innovative approach in surface treat- 
ment is represented by the application of hybrid technologies offering a broad 
spectrum of types of the associated processes permitting to fabricate a whole 
array of materials with their unique properties, for precisely defined applications, 
unachievable with the standard surface treatment methods. 

The importance of a coating deposition technology as a technology of structure 
and surface properties formation of engineering materials, including PVD 
methods, has been rising in the modern industry due to the extensive applicability 
of the technologies for different groups of engineering materials and due to 
their high efficiency, environmental values and for economic considerations. 
In general, industrial expectations concerning new types of coatings on various 
base materials are still on higher strength and resistance to abrasive wear and 
improved thermal resistance. In addition, resistance to external factors is also 
expected, including, in particular, good corrosion resistance at elevated temper- 
atures, resistance to erosion wear and high-temperature creep (for elements of 
aircraft engines). For this reason, the main issue of PVD coating manufacturing is 
to ensure their robust mechanical properties and high resistance to abrasive wear. 
Although difficulties in selecting a coating material often comes from the fact that 
many properties expected for an ideal coating cannot be feasibly achieved at the 
same time, the mechanical and operating properties of the coatings can be 
influenced by selecting adequate type and chemical composition of the surface 
layer and by optimising the conditions of its deposition process. As a higher level 
operation issue, attention should be paid to the application of PVD techniques 
that do not contaminate the environment and can be easily automated and 
customised to series production. 
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Abstract 

This chapter presents CVD technology (chemical vapor deposition), used prin- 
cipally for the formation of surface layers with special structure and properties 
on engineering and biomedical materials and applied also for the fabrication of 
nanostructures. The main emphasis has been placed on modern CVD methods 
applied in many branches of industry, from automotive through electronic to 
medical. The following methods are discussed: high- temperature chemical 
vapor deposition (CVD) processes, plasma-assisted chemical vapor deposition 
(PACVD), metalorganic CVD process (MOCVD), laser CVD (LCVD), 
hot-filament CVD (HFCVD), vapor-phase epitaxy (VPE), atomic layer deposi- 
tion (ALD), as well as low-temperature CVD for the fabrication of polymer 
layers. The methods to produce carbon nanotubes are also disclosed. The 
following sections focus on the structure, properties, and the application of the 
selected CVD coatings such as diamond and DLC coatings, nitride, carbide, and 
oxide ones with high resistance to abrasion and corrosion. 


Introduction 

A dynamic development of industry, engineering, and technology necessitates higher 
requirements to be satisfied by engineering materials in terms of mechanical proper- 
ties, wear resistance, resistance to corrosion, etc. Functional properties of many 
products and their components depend not only on the ability to transfer mechanical 
loading across the full active section of the element made from the applied material or 
on their physicochemical properties but very often, if not mainly, on the structure and 
properties of surface layers. The enrichment technologies of surface layers ensuring 
the required functional properties with simultaneous application of possibly inexpen- 
sive materials for the core of the element, for which rather lower functional properties 
are required, are also vital in view of economic reasons. 

In most cases surface layers are produced on objects before their operation starts. 
Technological surface layers are effected by subjecting an object to processing. 
Surface layers are fabricated on objects during their operation life only in some 
specific circumstances, e.g., in the process of low-wear friction. Depending on the 
type of processes used for the fabrication of surface layers, the fabrication methods 
can be divided into six groups: mechanical, thermomechanical, thermal, thermo- 
chemical, electrochemical, as well as chemical and physical. 

There has been a notable demand recently for modem modification or protection 
methods of material surfaces, which in consequence brought about a great supply of 
available new coatings and coating deposition technologies. From among the 
many techniques which increase the durability of engineering materials, including 
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PVD CVD 



Fig. 1 PVD physical vapor deposition, CVD chemical vapor deposition 


principally sintered tool materials, the following methods play a significant role in 
the industrial environment (Fig. 1 ; Dobrzanski 2006; Dobrzanski and Dobrzanska- 
Danikiewicz 2011; Bunshah 1994, 2002; Pierson Hugh 1999; Tracton Arthur 2005; 
Boxman Raymond et al. 1997; Pakula et al. 2004, 2010; Pakula 2011; Dobrzanski 
et al. 2004, 2006, 2007, 2008, 2009, 2010; Sokovic et al. 2006; Dobrzanski and 
Pakula 2005, 2006; Dobrzanska-Danikiewicz et al. 2011; Cunha et al. 2008): 

• Chemical vapor deposition (CVD) 

• Physical vapor deposition (PVD) 

PVD techniques and in some cases CVD processes are closely connected with 
vacuum technique. The expectations of the market involving the quality of the 
deposited surfaces and the parameters of the applied working gases or their 
mixtures (composition, flow rate, partial pressures of mixture components, total 
pressure) require the application of maximally clean, unpolluted vacuum and 
the maintenance of appropriate gaseous atmosphere suitable for a given techno- 
logical process. These factors, apart from the geometry of the system 
source, specimen, type of plasma source, or temperature, determine the speed 
of deposition, structure, and homogeneity of the obtained coatings. In order 
to ensure the required quality, the said factors must be precisely maintained 
and controlled. 

The expectations of the industry with respect to new types of coatings or layers as 
surface engineering products on the one hand involve higher strength properties, i.e., 
higher mechanical strength, higher resistance to friction-effected wear, or higher 
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thermal resistance, but on the other hand, they also focus on “multifunctionality,” i.e., 
effective resistance to many external factors at the same time, e.g.: 

• Good mechanical properties and at the same time good resistance to corrosion in 
raised temperatures - tools for liquid glass treatment 

• Resistance to erosive wear and high-temperature creep resistance - elements of 
aircraft engines 

• Good mechanical properties and resistance to abrasive wear - tools and machine 
parts subjected to high loading 


CVD methods enable the fabrication of surface layers of diversified phase com- 
position and thickness up to 15 mm, and they are applied extensively for the treatment 
of such materials as sintered carbides or for the treatment of machine elements which 
should be characterized by high resistance to friction wear without being subjected to 
high dynamic loading. They ensure a considerable rise of durability of the treated 
items. CVD methods are applied also for the fabrication of hard and resistant to 
abrasion and corrosion coatings of carbides, nitrides, carbonitrides, as well as metal 
oxides on steel and ceramic substrates or on high-melting metals. Such coatings are 
widely applied for covering cutting tools and hot and cold plastic working tools, 
contributing to a multiple rise of their durability. 

CVD methods are more and more frequently applied for the production of 
modern nano structural materials, nanocoatings, and composites for the application 
in nanoelectronics or photovoltaic s, thanks to the possibility to obtain a consider- 
able number of nanostructures in one production cycle. 


Historical Background 

The years 1880-1890 witnessed a rapid rise in the interest of CVD methods 
instigated initially by the will to apply this method to increase the resistance and 
strength of carbon fibers applied in electrical bulbs. Early experiments with the use 
of CVD method were carried out at the atmospheric pressure, followed by vacuum 
pumping in order to eliminate undesirable gaseous elements. It was not until 1973 
that the first local heating of the coated surfaces was carried out with the use of laser 
(this method is still applied currently, mainly for heating waveguide fibers before 
they are coated in a CVD reactor). Hard, abrasion-resistant layers with the use of 
CVD method were not obtained until the late 1940s in the United States. And the 
first commercial tools from sintered carbides coated with TiC and TiN films 
obtained with the application of these methods appeared on the market in the late 
1960s. Glow discharge treatment (glow discharge nitriding) was introduced to 
industry in the 1970s. Currently, works are being carried out - practically in all 
processes of thermochemical treatment - on the phenomenon of glow discharge in 
terms of their activation, ensuring full control of the process, forming the micro- 
structure, chemical and phase composition, and state of residual stresses of the 
surface zone of the machined items, so in other words shaping not only the 
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constructional but also the functional properties of materials, i.e., metallic, ceramic, 
and polymer ones. These are such processes as boronizing, oxycarbonitriding, 
oxynitriding, carbonitriding, and also plasma-assisted chemical vapor deposition 
(PACVD) with gaseous environment activated with the direct-current glow dis- 
charge, radiofrequency currents (RFCVD), or also microwave currents (MWCVD), 
which are more and more frequently applied nowadays in various fields of industry, 
such as automotive industry, tool production, chemical, or aerospace, or the pro- 
duction of biomaterials and materials for electronics (Bunshah 1994, 2002; Pierson 
Hugh 1999; Tracton Arthur 2005; Boxman Raymond et al. 1997; Jung et al. 2006; 
Mattox 2003; Mitterer et al. 2003; Burakowski and Wierzchon 1995; Merlo 2003; 
Hyett and Parkin 2007; Tu et al. 2006; Allah et al. 2010; Uhlmann and Koenig 
2009; Olszyna and Smolik 2004; Eskildsen et al. 1999; Martinho et al. 2007; 
Kwiatkowska et al. 2008; Mitura 2007; Mitura et al. 2006; Butler et al. 1993; 
May 2000; Suntola 1994; Ritala and Leskela 2001; Wojcik 2008; Schulte 
et al. 2012; Fave et al. 2004; Burk et al. 1999; Jones et al. 2007; Bessonov 
et al. 2008; Huang et al. 2011; Hajduk et al. 2008, 2009; Sugimoto 2006; Dupuis 
1997; Neumayer and Ekerdt 1996; Sobiecki and Wierzchon 2010; Juda et al. 2006; 
Jun 2007; Liszka et al. 2010; Kumar and Ando 2010; Lo and Wei 1997). 

At the turn of the twentieth and twenty-first centuries, apart from PACVD tech- 
nology, we can observe a dynamic development of CVD methods which are used for 
the acquisition of the so-called nanomaterials in the form of thin layers, nanotubes, or 
nanorods. At present one of the most promising acquisition methods of nanolayers is 
the “atomic layer deposition method” (ALD), which is a type of CVD where a single 
layer of molecules is deposited during one loop. The thickness of the layer depends on 
the number of performed loops. This technique is applied for the deposition of high- 
quality coatings. Another distinctive advantage of the ALD method is the possibility to 
ensure uniform coating of the geometrically complex surfaces, including porous 
materials and 3-D objects. ALD layers are applied in optics, electronics, optoelec- 
tronics, photovoltaic s, and surface engineering (e.g., protection against corrosion). 


Characteristic of CVD Process 

A wide range of the coated elements and a wide range of coating materials which 
can be produced with the use of chemical vapor deposition techniques, from 
amorphous to epitaxial layers of high matching and purity level, engender a 
profound interest in CVD techniques for industrial applications. The synergy in 
CVD technology development is being achieved due to the integration of several 
fields of science, including thermodynamics, plasma physics, kinetics, fluid 
mechanics, and chemistry (Dobrzanski 2006). 

What is characteristic of CVD methods is the chemical reaction of the components 
of gaseous atmosphere on the surface subjected to treatment in effect of which a solid 
phase is formed. The major chemical reactions which take part in CVD processes are 
pyrolysis, reduction, hydrolysis, disproportioning, oxidizing, carbonizing, and 
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nitriding. The courses of the chemical reactions can be activated thermally, 
plasmatically, and/or with photons. In the traditional approach, the so-called 
unsupported CVD methods are the processes carried out at atmospheric pressure in 
gaseous atmospheres, containing the vapors of chemical compounds, being a basic 
substrate of the produced layer (carbide, nitride, boride, or oxide layer). The atmo- 
spheres frequently contain volatile halide of the diffusing element and hydrocarbon, 
nitrogen, hydrogen, or inert gas, e.g., argon. Metal atoms (e.g., boron, chromium, 
aluminum, titanium, tantalum) are liberated from the compound (e.g., BC1 3 , Cr, Cl 2 , 
A1 2 C1 3 , TiCl 4 , TaCl 4 ) in effect of a chemical reaction taking place on the surface of the 
charge. The second component of the layer can come from the substrate (e.g., carbon in 
carbide layers) or from the atmosphere (e.g., nitrogen or oxygen in nitride or oxide 
layers). For example, in the case of TiC and TiN coatings, the idea of CVD method 
consists in passing the gaseous mixture containing titanium, nitrogen, or hydrocarbon 
at the temperature of about 1,000-1,100 °C and in the decomposition of the molecules 
of these gases and finally in the synthesis of the compound into the form of coating. The 
said reactions can be written in the following symbolic way (Dobrzanski and 
Dobrzanska-Danikiewicz 2011; Bunshah 1994; Pierson Hugh 1999): 


2TiCl 4 + N 2 + 5H 2 -> 2TiN + 8HC1 + H 2 

1,000 °c 



TiCl 4 + CH 4 + H 2 TiC + 4HC1 + H 2 (2) 

1,000 °c 

Rointan F. Bunshah in his paper (Bunshah 2002) presents the classification of 
CVD process reactions containing four general categories. 

The first category of chemical reactions occurring in CVD processes covers 
thermal decomposition (pyrolysis) reactions, generally described by the reaction: 


AX(g) A(s) + X(g) (3) 

where AX is the gaseous compound, A is the solid material, and X is the gaseous 
product of reaction. 

The second category includes reduction reactions in which the gaseous com- 
pound is reduced by a reductant: 


2AX(g) + H 2 (g) 2A(s) + 2HX(g) (4) 

The third category is the replacement reaction in the AX molecule; X is replaced 
by the B element as in the reaction: 

AX(g) + B(g) — »■ AB(s) + X(g) (5) 

The fourth group of reactions contains the disproportionation reaction: 


2GeI 2 (g) - Ge(s) + Gel 4 (g) 


(6) 
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Fig. 2 Diagram of coating CVD stages: (7) diffusion in of reactants through boundary layer, (2) 
adsorption of reactants on substrate, (2) chemical reaction takes place, ( 4 ) desorption of adsorbed 
species, (5) diffusion of by-products 

TiCl 2 (g) — * Ti(s) + TiCl 4 (g) (7) 

Usually, several types of reactions occur simultaneously during the CVD 
process. 

The rate of the formation and growth of CVD coatings is constrained by the rate 
of the slowest process during the coating deposition. 

The stages of CVD process (Fig. 2; Dobrzanski 2006; Dobrzanski and 
Dobrzanska-Danikiewicz 2011; Bunshah 1994, 2002; Pierson Hugh 1999; Tracton 
Arthur 2005; Boxman Raymond et al. 1997) are as follows: 

• Supply of reactant gases to the reactor 

• Mass transport - diffusion of gases onto the substrate 

• Adsorption of reactants to the substrate surface 

• Reaction between gases and the surface and the formation of coating material 

• Desorption of reaction by-products and their disposal from the substrate 


CVD Processes 

CVD can be realized with the use of different processes and equipment 
configurations: 

• Classical, in which chemical reactions are activated thermally 

• CVD under atmospheric pressure (APCVD) 

• CVD under lower pressure (LPCVD) 

• Plasma chemical - plasma-assisted CVD (PACVD) with the use of physical 
fields: microwaves (MWCVD) and radio frequency (RFCVD) 
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• Laser-activated CVD (LCVD) 

• CVD activated with UV beam (photo CVD) 

• CVD with the use of metalorganic precursors (MOCVD) 

• CVD realized in fluidized bed (fluidized bed CVD) 

• Chemical vapor infiltration (CVI) 

Depending on substrate temperature, CVD processes can be divided into the 
following way (Pierson Hugh 1999): 

• High-temperature CVD 850-1,200 °C 

Exemplary reactions: 

TiCl 4 + CH 4 -> TiC + 4HCL (8) 

TiCl 4 + 2H 2 + 1 /2N 2 TiN + 4HCL (9) 

2A1C1 4 + 3C0 2 + 3H 2 -> A1 2 0 3 + 3CO + 6HCL (10) 

• Medium-temperature CVD 700-900 °C 

Exemplary reactions: 


(TiCl 4 , CH 3 CN, H 2 ) - (TiCxNi-x, HCL) (1 1) 

• Low-temperature CVD 300-600 °C 

Exemplary reactions: 

(WF6, C 6 H 6 , H 2 ) - (W 2 C, HF) (12) 

• Plasma CVD 3,000-6,000 °C 

Exemplary reactions: 

(TiCl 4 , H 2 , N 2 ,Ar) -> (TiN, HCL, NH 3 , Ar) (13) 


High-Temperature CVD (Classical) Processes 

Thermally activated CVD processes of coating deposition can be realized as: 

• Chemical vapor deposition under atmospheric pressure (APCVD) 

• Chemical vapor deposition under low pressure (LPCVD) 

In APCVD and LPCVD processes, thermal activation of gaseous environment is 
taking place. The APCVD process is carried out at atmospheric pressure, 
and LPCVD at the pressure of several to between 10 and 20 hectopascals (Figs. 3 
and 4). 
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Fig. 3 Main deposition 
methods of thin coatings 
using CVD method 
(Dobrzanski 2006). 



Fig. 4 Installation diagram 
for APCVD process: (7), 
working chamber; (2), 
charge; ( 3 ), resistance heating 
elements 


CH 4 — [X} 



evaporator 


filter 


Traditional CVD processes necessitate the application of high temperature 
necessary to effect the decomposition of gaseous reagents (900-1,100 °C), which 
limits their applicability. The deposited layers are connected with the substrate in a 
diffusive way. In the contact zone from the surface of the coatings, there is a 
concentration rise of elements that are components of the substrate with 
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simultaneous decrease of the concentration of elements which are components of 
the coating. This fact can be caused by the presence of a transit zone of diffusive 
character between the substrate material and the coating. 

The development of CVD methods consists in the modifications of traditional 
processes toward lowering the process temperature to 600-500 °C through: 

• The selection of appropriate gaseous atmospheres and the application of com- 
pounds having lower temperature of chemical reactions, e.g., metal organic 
compounds (MOCVD). 

• Lowering the pressure to the value of 500-10 hPa - this is the so-called 
low-pressure CVD method (referred to as LPCVD). 

• Electrical activation of gaseous environment by means of glow discharge or 
high-frequency currents - these are the so-called low-temperature plasma- 
enhanced CVD or plasma-activated CVD (PACVD) methods. 

By lowering the pressure, the diffusion of gases increases, which results in faster 
formation of a diffusive layer at lower partial pressure of reaction gases and at lower 
process temperature. The coatings are obtained as single or composite ones, i.e., 
made of several different layers, e.g., titanium carbide (TiC) and titanium nitride 
(TiN). Their thickness usually does not exceed 10-20 pm. They permit to increase 
the durability of cutting tools or plastic working tools approximately by 1.5-5 
times. Table 1 presents the differences between the discussed methods in glow 
discharge conditions, basing on the example of titanium nitride coatings. 


PACVD 

Very attractive and prospective seem to be the deposition methods of thin layers 
with the use of plasma - PACVD (plasma-assisted chemical vapor deposition) or 
PECVD (plasma-enhanced chemical vapor deposition). 

The first experiments with the application of plasma as the carrier of chemical 
vapors can be dated back to 1911. Plasma-assisted CVD is a very attractive 
technique due to low process temperature, a possibility to deposit nonequilibrium 
phases, and better control over stoichiometry and deposition purity. Low temper- 
ature of coating deposition process is obtained through the excitation of gaseous 
mixture particles with plasma to the energy matching thermal excitation. Then, the 
nonequilibrium reaction, in effect of which a given product is deposited, can take 
place at much lower temperature (below 600 °C) (Fig. 5). 

Plasma is generated in the electrical field between two parallel plates where one 
is cathode and the other, on which substrates are placed, is earthed. In PACVD 
processes the state of plasma can be maintained with direct-current or alternated- 
current (of radio frequency) glow discharge. In this process, chemical reactions are 
taking place in the conditions of electrical activation of gaseous environment. 
Generally speaking, the PACVD technique has the advantages of CVD, such as 
uniform deposition of layers on different materials, eliminating at the same time its 
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Fig. 5 Processes taking 
place in PACVD technologies 
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disadvantages (high temperature at which the CVD process is carried out). Glow 
discharge can be used to enhance and accelerate the chemical processes applied in 
CVD processes, and besides, the deposition process is advantageous in terms of 
energy expenditure, since the low-pressure plasma consumes less energy as com- 
pared to thermal CVD methods. 

In diffusive fluorescent treatment, and in particular in PACVD method, two 
types of working chambers are applied (Dobrzanski 2006; Dobrzanski and 
Dobrzariska-Danikiewicz 2011; Bunshah 1994, 2002; Pierson Hugh 1999; Mitterer 
et al. 2003; Eskildsen et al. 1999; Martinho et al. 2007; Kwiatkowska et al. 2008; 
Mitura 2007; Mitura et al. 2006; Butler et al. 1993; May 2000): 

• With a cold (cooled) anode (chamber walls), in which the charge (cathode) is 
heated with glow discharge. 

• With a hot anode, i.e., with additional heating of chamber walls in a so-called 
retort solution, which ensures the acquisition of better gas flow conditions in the 
chamber and the application of polarizations of the charge other than cathodic 
one. It also enables the realization of thermochemical treatments at lowered 
pressure. 

Currently PACVD method uses two techniques most - the radio -frequency 
plasma-assisted CVD (RF PACVD) and the microwave plasma-assisted CVD 
(MW PACVD) (Fig. 6). There are a lot of studies devoted to structural solutions 
combining several fabrication methods. This encouraged the researchers from Fodz 
University of Technology and other universities to develop a MW/RF PACVD 
(microwave/radio-frequency plasma-assisted CVD) setup. It was furnished with 


77 Chemical Vapor Deposition in Manufacturing 


2767 


Fig. 6 MWCVD systems for 
the deposition diamond 
coatings 
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two independent plasma excitation sources. The first energy source of the frequency 
of 13.56 MHz was connected via the matching circuit with the electrode with a 
specially designed holder with the substrates being coated. The other 2.45 GHz 
energy source was connected via the circulator and matching circuit with a reso- 
nance chamber located on the opposite side of the electrode, in the upper part of the 
setup. Such a solution, depending on the parameters applied, enabled the fabrication 
of coatings of widely controllable properties. They may be used for the synthesis of 
coatings from DLC to UNCD. The plasma excited with two frequencies, micro- 
wave and radio ones, has its individual properties unknown in the conventional RF 
or MW. In the MW/RF systems, microwave plasma characterized by high-density 
ion streams controls the generation and extent of the ionization of active particles, 
while radio-frequency plasma controls the stream and energy of ions. Dual fre- 
quency circuits are successfully applied for the production of very thin films: from 
DLC to SiNli 3 and Si0 2 . The technology was initiated by the works of Martinu and 
Klemberg-Sapieha, and its dynamic development has been observed for years 
(Uhlmann and Koenig 2009; Eskildsen et al. 1999; Kwiatkowska et al. 2008; 
Mitura 2007; Mitura et al. 2006; Butler et al. 1993; May 2000). 


MOCVD 

The MOCVD process (Metalorganic CVD) (Fig. 7) was for the first time presented 
in 1960, when, thanks to of the application of organic compounds, the coatings of 
indium phosphide and indium antimonide were deposited. The experiment demon- 
strated that it was possible to deposit semiconductor layers at the temperature 
considerably lower than in the standard CVD process. We can observe a continuous 
development from that time on with respect to both the applied equipment and 
the compounds and their purity (Pierson Hugh 1999; Jones et al. 2007; 
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Pressure sensor 


Fig. 7 Schematic of a typical MOCVD system 


Bessonov et al. 2008; Huang et al. 2011; Sugimoto 2006; Dupuis 1997; Neumayer 
and Ekerdt 1996; Sobiecki and Wierzchon 2010; Juda et al. 2006). 

Initially, MOCVD was applied in optoelectronics and in the production of 
semiconductors. Later on the applicability scope of this technique was broadened, 
covering also the fabrication of layers resistant to corrosion or enhancing the 
resistance to abrasive wear (Pierson Hugh 1999; Jones et al. 2007; Bessonov 
et al. 2008; Huang et al. 2011). 

The MOCVD technique is most commonly applied in electronics for the depo- 
sition of semiconductor layers in optoelectronics. What characterizes this technique 
is the fact that the precursors are made up of metalorganic compounds such as 
alkyls (methyls and ethyls of group III metals) or hydrides which decompose at 
relatively low temperatures, from 300 °C to 800 °C, and at variable pressure, from 
below 133 Pa to the atmospheric pressure. It should be emphasized that although 
the name contains the prefix “metalo,” we also apply in this method nonmetallic 
elements such as silicon, selenium, tellurium, and arsenic. The layers obtained in 
this way are very thin, within the range from 1 to 10 nm, and are usually epitaxial. 
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The low temperature of MOCVD process is useful when depositing abrasion- or 
corrosion-resistant compounds on steel substrates. The method has become one of 
the major methods of producing A1 2 0 3 and Si0 2 layers as chemical barriers. As 
precursors in the MOCVD method, mainly alkyls, cyclic hydrocarbons, aryl groups 
derived from aromatic hydrocarbons and acetylacetone (Table 2) are used. Such 
precursors are characterized by low decomposition temperatures (below 500 °C) 
with good stability at room temperature. Vapor pressures of the precursors are 
directly connected with their atomic masses - lower atomic mass means higher 
volatility. Table 3 contains a list of metalorganic precursors used in the MOCVD 
technologies for producing coatings with A1 and its compounds (Pierson Hugh 

1999). 

Nitrides of group III A fabricated with the MOCVD technique are widely 
applied in various electronic and optoelectronic equipment, such as blue and 
green luminescence diodes or semiconductor lasers. 

The kinetics and thermodynamics of reactions taking place during the growth of 
nitrides by MOCVD depend largely both on the kind of precursors and substrate 
and on pressure, temperature, transport gas, and geometry of the reactor. In a 
MOCVD process, pyrolysis reaction takes place, which can be generally presented 
as follows: 


R 3 M(g) + NH 3 (g) - MN(s) + 3RH(g) T (14) 

where M stands for metal from group III A (Ga, Al, or In) and R for organic radical 
(usually CH 3 or C 2 H 5 ). 

High-quality nitride layers of metals from group III were initially fabricated with 
the use of RF reactor at the atmospheric pressure of AP-MOCVD. Later on, the 
LP-MOCVD method (low-pressure MOCVD) was applied for the fabrication of 
nitride layers (Fig. 8; Dupuis 1997). 

As the precursors in the growth process of nitrides of metals from group III A, 
simple alkyls of the group III metals are applied most frequently (derivatives of 
methyl and ethyl), due to relatively high vapor pressure, their easy transport by 
means of hydrogen, and the possibility to work with the source at the temperature 
close to ambient one. The most commonly applied sources for GaN are such as 
TMGa and NH 3 . Hydrogen purified with palladium is the commonly applied carrier 
gas in the MOCVD method. 

In order to reduce the temperature of the fabrication process of GaN based on 
NH 3 , new precursors have been introduced, such as dimethylgallium azide, 
[(CH 3 ) 2 GaN 3 ] n , and (CH 3 ) 2 GaNH(N(CH 3 ) 2 ) 2 , which are used to fabricate the 
layers of single GaN crystals in effect of the pyrolysis at low pressure (10 -5 
Torr) in H 2 at the temperature of 600 °C. 

The biggest barrier in the fabrication of high-quality layers on the basis of GaN 
was the lack of substrate materials with appropriately matched crystal lattice. For 
heteroepitaxial fabrication of GaN, various crystals were applied as substrate, such 
as Si, 6H-SiC, 3C-SiC, MgO, ZnO, GaAs, and sapphire A1 2 0 3 . Sapphire of various 
crystallographic orientation was the most popular, but the main problem of its 
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Table 2 Characteristics of precursor groups in MOCVD (Pierson Hugh 1999) 


Group of 
precursors 

Structural formula 

Chemical reactions 
with metal 

Properties 

Alkyl group 

H 

1 

H—C— metal 
1 

H 

Metal alkyls are 
formed via the 
reaction of alkyl 
group (C„H 27?+ r) or 
alkyl halide 
(C„H 2 „ +1 X) with 
metal. Alkyl group 
is formed by the 
abstraction of one 
atom H from alkane 

Alkyl groups are 
nonpolar volatile 
liquids. The 
decomposition of 
methyl compounds 
is taking place at 
200 °C, and that of 
ethyl compounds 
already at 1 10 °C 

Alicyclic 

hydrocarbons 

CH, 

/ \ 

ch 2 ch 2 

\ / 

CH— CH 2 

In effect of the 
reaction, metal 
atoms substitute 
hydrogen atoms in 
cyclic hydrocarbons, 
e.g., in cyclopentane 
(C 5 H 10 ), yielding tin 
cyclopentadienyl - 
(C 2 H 5 ) 2 Sn 

Nonpolar volatile 
liquid 

Aryl group 

metal 

/ c 
'/ \ 

HC CH 

HC CH 

1 

H 

Metal atoms join the 
group corresponding 
to aromatic 
hydrocarbon, 
substituting in it 
atom H 


Acetylacetone 

/CHa 

/ 0_C \ 
metal CH? 

\ / 

N 0 — C 

^CHg 

Metal atom 
combines with 
acetylacetone 
(C 5 H 8 0 2 ) by 

cleaving the bonds 
between oxygen 
atoms and carbon 
atoms and by the 
formation of bonds 
between the metal 
and oxygen atoms 

Easily wettable in 
organic compounds, 
stable in air. Solid 
body 


application resulted from a big difference involving lattice parameter and the 
coefficient of thermal expansion. SiC substrates have better lattice matching, but 
they are expensive and they have worse quality of structure as compared to the 
commonly available sapphire. In general, the growth of nitrides of III A metal group 


77 Chemical Vapor Deposition in Manufacturing 


2771 


Table 3 Example of metalorganic compounds applied in MOCYD method [by M. Betiuk et al.] 


Metalorganic compound 


Formula 

Name 

Obtained coating material 

[(CH 3 ) 2 A1H] 

Dimethylaluminum hydride 

Al 

[(i-C 4 H 9 ) 3 Al] 

Triisobutylaluminum 

Al 

[A1H 3 N(CH 3 ) 2 C 2 H 5 ] 

Dimethylethylamine alane 

Al 

[A1 2 (N(CH 3 ) 2 ) 6 ] 

Hexakis dimethylamido aluminum 

TiAIN 

[A1(0 2 C 5 H 7 ) 3 ] 

Aluminum acetylacetonate 

AI 2 O 3 

[AlH 3 *Net 3 ] 

T rihy bry do(triety lamin) aluminum III 

AI 2 O 3 

A1(CH 3 ) 3 

T rimethy laluminum 

Al, AIN, A1 2 0 3 
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Fig. 8 Schematic of the reactor LP-MOCVD with a rotary disk (RDR) 


on the substrates having weak lattice matching brings about the formation of 
poly crystalline layers, which often break after being cooled in effect of the created 
thermal stresses. 

The application of a buffer layer considerably improves the morphology of the 
coating as well as the electrical and optical properties. The quality of crystal, 
morphology of surface, and electrical and optical properties strongly depend on 
the thickness and quality of buffer coating (Fig. 9; Neumayer and Ekerdt 1996). 
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Fig. 9 (a) Concentration of carriers and (b) mobility of electrons for the 4 mm GaN layer 
depending on the thickness of buffer coating GaN and AIN on sapphire substrate (0001) 


It can be connected with the appearance of various defects or impurities from the 
buffer layer during the growth or with the morphology of buffer coating. 

Metalorganic coatings are also successfully applied as wear- and corrosion- 
resistant coatings on steel substrates. Most frequently, these coatings contain 
chromium, rare-earth elements, and aluminum oxide. The properties of MOCVD 
coatings resistant to wear and corrosion depend principally on process parameters 
and on the structure of the coating. 

Chromium nitrides are commonly fabricated with PVD processes, such as 
reactive magnetron sputtering or cathodic arc evaporation. Yet the layers fabricated 
in this way are often metastable, and in effect of heating, there occur changes in 
micro structure and phase composition, which result in the deterioration of mechan- 
ical properties. In the case of coatings deposited by cathodic arc evaporation, 
macroparticles are formed in the structure which worsen the properties of layers 
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Fig. 10 Coating structure 
after the application of 
heating in nitrogen-hydrogen 
plasma 



5 urn 



and in particular the resistance to corrosion. The said particles can also lead to 
higher porosity of the layer. 

As a good alternative, we can apply a hybrid method combining glow discharge 
nitriding and PACVD with the use of metalorganic compounds (PAMOCVD). The 
layers deposited with this method have better properties as compared to coatings 
deposited with other methods. Moreover, this method enables the coating on 
objects with complex shapes. 

In the PAMOCVD process, chromium (III) 2-ethylhexanol is applied as precur- 
sor, which enables the deposition of a layer having the thickness of about 10 mm at 
the temperature 550 °C. Then the layer is subjected to heating in glow discharge 
conditions in the atmosphere of nitrogen and hydrogen at the temperature 650 °C in 
order to transform the surface layer into chromium nitride. The obtained layer is 
presented in Fig. 10 (Sobiecki and Wierzchori 2010). 

Another example illustrating the application of MOCVD method involves the 
coating of cutting tools made of sintered carbides with aluminum oxide A1 2 0 3 in 
order to prolong their operation life. The tool surface is covered by a multilayer 
coating, using the MOCVD method with aluminum acetylacetonate as precursor and 
argon and air as carrier gases. The first coating of the thickness from 0.06 to 0.07 mm 
is deposited in argon atmosphere and contains carbon which is the residue of the 
organic decomposition of substrate. The function of this layer is to suppress the 
diffusion of cobalt from substrate material to the external layer to ensure adhesion of 
the whole coating to the substrate and to protect the substrate against oxidation during 
the synthesis of external layer. The next layer deposited in the atmosphere of air 



2774 


LA. Dobrzanski et al. 


consists of A1 2 0 3 . Then the layers are synthesized at 800 °C and then further heat 
treated at 1,000 °C, which results in the crystallization of phase a-Al 2 0 3 , which 
contributes to the rise of microhardness and better adhesion of the layer to the 
substrate. The average operation time of a cutting tool covered with such a coating 
is three times longer as compared to the uncoated one (Juda et al. 2006). 


LCVD 


LCVD (laser CVD) is principally used in microelectronics. The excitation of 
gaseous components is effected by means of an incident laser beam. There are 
two types of processes in the LCVD techniques - pyrolytic and photolytic. In the 
pyrolytic LCVD method, the multiatomic gas molecules dissociate within the area 
of the gas-substrate interface in effect of high temperature of the substrate heated by 
the laser. The reactions take place when the applied gas is transparent for the laser 
beam, and the substrate absorbs laser radiation. In the photolytic LCVD method, the 
molecules undergo dissociation near the substrate as a result of a photochemical 
reaction. Photodissociation takes place when gas reagents absorb laser radiation of 
the applied laser beam. The energy of laser beam is absorbed by the whole volume 
of reactive gases located over the substrate. The excited particles of gaseous 
mixture undergo ionization when passing to the lower energy state, bringing 
about the formation of highly active components. Chemical reactions taking place 
in the gas excited in this way are carried out at considerably lower temperature. The 
LCVD techniques are applied for the synthesis of polycrystalline silicon and 
germanium as well as aluminum oxide (Dobrzanski and Dobrzanska-Danikiewicz 
2011; Bunshah 1994, 2002; Pierson Hugh 1999; Tracton Arthur 2005; Boxman 
Raymond et al. 1997; Jun 2007). 

The LCVD device presented in Fig. 1 1 has a 100 W laser of the wavelength of 10.6 
mm. There is only one heat source in this system. The casing of the system consists of 
two chambers - upper chamber, which either can be statically filled or is affected by 
continuous flow of reagents, and bottom chamber in which there are moving parts. The 
two chambers are separated from each other to ensure prevention against potentially 
caustic reagents and reactions. The reactor works on the basis of stationary beam and 
optics. The holder has 4° of freedom. The seals and vacuum system ensure proper 
functioning within the pressure range from 50 to 1,000 Torr (Jun 2007). 

Before the deposition of coating material, the chamber is cleaned and filled with 
the mixture of H 2 , N 2 , and TiCl 4 . The laser beam is moving over the surface with 
constant velocity. The initial temperature of the substrate Ti is lower than the 
temperature of the chemical reaction. The deposition starts when the temperature 
of the surface reaches the temperature of the reaction process. The chemical 
reaction which is taking place on the upper surface of the substrate consumes a 
part of laser energy and uses up TiCl 4 . The difference in concentrations becomes 
the driving power of the process. The physical model of the LCVD process involves 
natural convection, heat exchange in the substrate and in gases, chemical reactions, 
and movement of gases (Jun 2007). 
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Fig. 1 1 Schematic cross section of LCVD chamber 


HFCVD 


In the hot-filament CVD method, we apply hot fiber, most commonly wolfram fiber 
heated to 2,000 °C or more, placed near the substrate for plasma generation and for 
the activation of chemical reactions. At present the method is mainly used for the 
production of diamond coatings, including NCD (nanocrystalline diamond) coat- 
ings in which hydrocarbon radicals (C x H y for x = 1,2 and y = 0,. . .,6) are pro- 
duced as a result of thermal decomposition of CH 4 through thermal activation of 
working gas with the hot fiber. The heated fiber is also used to bring about the 
decomposition of molecular hydrogen H 2 into atomic hydrogen. Atomic hydrogen 

Q 

plays a key role in the creation of “diamond” atomic bonds with sp hybridization, 
and it etches the graphitoid phase at the rate 20 times higher than the diamond 
phase, enabling the nucleation and growth of the diamond layer. The substrate 
temperature should fit within the range from 800 °C to 1,000 °C, and cooling may 
be necessary. The gas composition and other deposition parameters are close to 
those applied in the MPCVD (microwave plasma) process. The coating growth rate 
is low and fits within the limits from 0.5 to 1 pm/h (Fig. 12; Pierson Hugh 1999; 
Bunshah 2002). 
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Fig. 12 HFCVD system for 
the deposition of diamond 
coatings 





VPE 

VPE (vapor-phase epitaxy) occupies a special place among CVD processes due to 
the required structural perfection of the layer. As an option of epitaxial growth, it 
consists in the deposition of oriented crystalline layer on monocrystalline substrate. 
This technique has found applications mainly in electronics for the deposition of 
semiconductor materials. The VPE (vapor-phase epitaxy) reactors are used for the 
production of epitaxial layers, including, among others, SiC, GaN, etc. In the 
reactors, most often made from materials resistant to high temperature, such as 
quartz, graphite, and SiC, usually water cooled, we use reagents in the form of, e.g., 
propane or silan, and hydrogen is used as carrier gas. The process temperature 
reaches even 1,600 °C. The reactors used in VPE technologies are most frequently 
horizontal reactors, vertical reactors with a base and reactors with rotating 
multi substrate base disk, barrel reactors, pancake-type reactors, and horizontal 
multi substrate reactors. Exemplary diagrams of VPE reactors are presented in 
Fig. 13 (Pierson Hugh 1999; Schulte et al. 2012; Fave et al. 2004; Burk et al. 1999). 

The layer growth rate in the VPE techniques depends mainly on the reaction 
kinetics, mass transport, and process thermodynamics. If surface reactions are 
taking place very fast, the reagents are used up faster at the interface boundary 
solid/gas than in the parent gas phase. In this case, the growth depends on mass 
transport efficiency, i.e., convective and diffusive flux. However, when the mass 
transport mechanisms are very efficient or chemical reactions are very slow due to 
the limitations of reaction kinetics, the growth rate will depend on the rate of 
surface reactions or homogeneous reactions. The VPE process is then 
limited by reaction kinetics, while the details involve the fact which of the 
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Fig. 13 Schematic drawings of (a) cylindrical horizontal (with inductive coil), (b) barrel, (c) 
rapidly rotating vertical, (d) planetary horizontal SiC-YPE reactors 


reactions - homogeneous reactions in gas phase or heterogeneous ones at the 
interface boundary solid/gas - will be slower. In the processes limited by mass 
transport, reagents are rapidly used up on the surface of a crystal in chemical 
reactions. The concentrations of reagents are close to (minimum) equilibrium 
values. In the processes limited by the kinetics of chemical reaction, the reagents 
are fed to the interface boundary solid/gas much faster than they can be used up in 
chemical reactions - their concentrations are close to the concentrations in the 
parent gas phase. One of the variations of the VPE method is MOVPE method 
where metalorganic compounds are used as the sources of elements (Schulte 
et al. 2012; Fave et al. 2004; Burk et al. 1999). 


Atomic Layer Deposition (ALD) 

Atomic layer deposition is a variation of CVD method which is characterized by a 
cyclic application of alternating pulses of precursors, between which a rinsing 
process of the chamber with inert gas is taking place. In such a solution it is possible 
to apply precursors of high reactivity. Thanks to the application of highly reactive 
precursors, which immediately react with the substrate, forming a monolayer and 
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ALD Process cycle forming a monolayer (— 1 A) 

Fig. 14 ALD method 

suppressing further reaction, each cycle brings about the rise of layer thickness by a 
strictly defined value within the range of 0.01-0.3 nm. The number of cycles 
depends on the expected layer thickness. ALD method can be applied for the 
deposition of: 

• Single elements, e.g., from group IV of the periodic table 

• Two-component compounds, e.g., metal oxides 

• Multicomponent compounds, e.g., hydroxyapatite 

ALD is a real nanotechnology. Ultrathin layers of the order of several nanome- 
ters are deposited in a strictly controlled way. In the coating fabrication process 
using ALD method, precisely one atomic layer is deposited in each cycle, due to 
which the deposition process is fully controlled in nanometric scale. Precursors 
(gaseous, liquid, or solid) are successively introduced into gaseous state in the 
heated reaction chamber where the substrates are placed. The precursors generate 
strong chemical bonds. The temperature must be high enough to facilitate the 
reaction between the precursors but also low enough to avoid decomposition 
(Fig. 14). The temperature range in ALD processes is from 25 °C to 500 °C. 

In the atomic layer deposition method, the coatings are obtained as a result of a 
growth of monomolecular adsorption layers. The precursors are dosed into the 
reaction chamber in the so-called pulses. Furthermore, the reaction by-products 
and unadsorbed molecules from the reaction chamber are removed after each pulse 
by means of inert gas. In consequence, the precursors are not in contact with one 
another anywhere, except for the adsorption layer. The elementary ALD cycle 
leading to the production of monolayer is as follows: “one reaction partner, rinsing, 
next reaction partner, rinsing.” The cycle is repeated a number of times (even up to 
several dozen thousand) in order to obtain the required thickness of the target 
material (Suntola 1994; Ritala and Leskela 2001; Wojcik 2008). 

Available precursors are applied, often similar to those used in CVD or 
MOCVD. The coatings deposited with the use of ALD have uniform thickness on 
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all walls and holes of the deposited substrate, even in the case of very complicated 
geometric shape. The exemplary coating materials fabricated with the application 
of ALD method are as follows: 


• Oxides: A1 2 0 3 , Ti0 2 , Ta 2 0 5 , Nb 2 0 5 , Zro 2 , Hf0 2 , 
MgO, La 2 0 3 , Y 2 0 3 , Ce0 2 , Sc 2 0 3 , Cr 2 0 3 , Er 2 0 3 , 
Fe 2 0 3 , NiO, Ga 2 0 3 , W0 3 , etc. 


Si0 2 , Sn0 2 , ln 2 0 2 , 
V0 2 , B 2 0 3 , Co 2 0 3 , 


ZnO, 

CuO, 


Nitrides: AIN, TaNx, NbN, TiN, MoN, ZrN, HfN, GaN, WxN, InN, etc. 
Carbides: TiC, NbC, TaC, etc. 

Metals: Pt, Ru, Ir, Pd, Cu, Fe, Co, Ni, W, etc. 

Sulfides: ZnS, SrS, CaS, PbS, etc. 

Fluorides: CaF 2 , SrF 2 , ZnF 2 , etc. 

Biomaterials: Ti0 2 , Ca 10 (PO 4 ) 6 (OH) 2 (hydroxyapatite) 

Polymers: (PMDA-ODA PMDA-DAH) 

Doping: ZnO/Al, ZnS/Mn, SrS/Ce, Al 2 0 3 /Er, YSZ, etc. 

Nanolaminates: Hf0 2 /Ta 2 0 5 , Ti0 2 /Ta 2 0 5 , Ti0 2 /Al 2 0 3 , ZnS/Al 2 0 3 , etc. 
Multicomponent: TiAIN, TaAIN, ATO (AlTiO) TiCrOx, etc. 


The AFD method offers the possibility to obtain high-quality materials at low 
temperatures. With appropriate matching of the conditions, the growth process 
takes place under so-called saturation. In such a case the growth process is stable, 
and the thickness growth is the same in each deposition cycle. Such a self-limiting 
growth mechanism enables the acquisition of thin layers of planned thickness on 
large surfaces as well as the acquisition of multilayer structures. The limitations 
brought about by “atomic layer deposition” technology involve principally a slow 
growth rate. At best, we obtain one monolayer per cycle. And the typical rate of 
deposition is 100-300 nm per hour. The maximum growth that can be achieved is 
1-2 pm per hour, and only specially designed reactors enable the acquisition of 
such a growth rate. This disadvantage is compensated by the possibility to carry out 
processes with substrates having large areas. Fow growth rate does not have to 
mean low efficiency, because the latter characteristic is considered in terms of the 
volume (thickness x area) of the obtained material (Suntola 1994; Ritala and 
Feskela 2001; Wojcik 2008). 

The major advantages of AFD technologies involve the following (Wojcik 2008): 

• Possibility to use high-reactivity precursors, because the reagents get into con- 
tact only near the substrate, which eliminates the reactions in gas phase. 

• Easy control of coating thickness through self-limiting growth process, which 
implies that the layer thickness only depends on the number of cycles. 

• Possibility to create multilayer structures in a continuous process with high 
efficiency of the material. 

• The growth rate does not depend on the homogeneity of the precursor stream in 
time or around the substrate - therefore, sources in the form of solids, which 
frequently have slow sublimation rate, are easier to be applied in the AFD 
technique than in other similar techniques. 
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Nowadays, the ALD method is used to produce among other things hydroxyap- 
atite layers in biomedical engineering (in stomatology), moreover Si0 2 ; Ti0 2 , 
Si 3 N 4 as antireflection layers in optics and photovoltaics, which reduce the reflec- 
tion of light from the surface; as well A1 2 0 3 as anticorrosive layers in surface 
engineering. 


Low-Temperature CVD for the Fabrication of Polymer Layers 

Low-temperature chemical vapor deposition is one of CVD methods for the fabri- 
cation of poly( 1 .4-phenylenemethylidynenitrilo- 1 .4-phenylenenitrilomethylidyne) 
layers, referred to as PPI, obtained in effect of the poly condensation of terephthalic 
aldehyde (TPA) with p-phenylenediamine (PPDA). The PPI layers fabricated in 
different technological conditions of the low-temperature CVD process have very 
high unintentional technical purity, and they are free from impurities. It has been 
demonstrated that surface morphology and optical properties of thin PPI layers are 
closely dependent on the technological conditions of the CVD process such as: 

• Flow intensity of the stream Ar 

• Temperature of p-phenylenediamine (PPDA) 

• Temperature of terephthalic aldehyde (TPA) 

• Pressure and deposition time 

Vacuum technological system for the acquisition of thin layers of 
polyazomethines, using the method of chemical transport in gas phase, based on 
the poly condensation process consists of four basic elements: 

(I) Gas feeding system 

(II) Reaction chamber 

(III) Vacuum system 

(IV) Control system 

The first step in the deposition process involves the heating of crucibles and 
aluminum shield. After opening the gas valve, argon flows through a T-pipe where 
it is separated into three equal streams. Each of the three streams is flowing round a 
separate crucible (the outlets of the streams are beneath the crucibles). The flow of 
gas streams can cause temporary temperature fluctuations, and therefore it is 
recommended to delay the start of the deposition process until the temperatures 
of the crucibles are stable. Particles of monomers subliming in effect of temperature 
are carried away by the stream of argon and transported to a collimator in which 
mixing of all three streams is taking place, and one stream of gas carrying the 
monomers leaves it. As the beam stop is opened, the stream is falling directly on the 
substrate, flowing around it at the same time. The transported particles are absorbed 
directly on the substrate surface, and they move around jumpwise in the adsorption 
layer on the substrate, colliding with other particles. It also means that apart from 
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Fig. 15 Polycondensation 
reaction of PPI 



-2(n-1)H 2 0 




particles which flow round the substrate, there are also particles which fall perpen- 
dicular to substrate surface, diffusing through the stream layer which is falling on 
the substrate. 

The collision of aldehyde particles with amine particles may bring about a 
polycondensation reaction consisting in the formation of the bond C=N. The higher 
the frequency of such collisions, the higher the probability of chemical reaction, in 
effect of which the growth of polymer chain is taking place. In such reactions, water 
particle is the by-product. According to Flory’s theory, end groups demonstrate the 
same chemical activity, irrespective of the length of chain which they join. It means 
that on the substrate surface, there are many centers on which, with equal proba- 
bility, the polycondensation process can take place. Water, being the by-product of 
the reaction, is disposed of outside by means of a rotation pump. 

Thin layers of poly(1.4-phenylenemethylidynenitrilo-1.4-phenylenenitrilo- 
methylidyne), referred to as PPI, obtained in effect of the polycondensation of 
terephthalic aldehyde (TPA) with p-phenylenediamine (PPDA), taking place in 
accordance with the polycondensation reaction of polyazomethine PPI, are 
presented in Fig. 15. Each reaction of the pair of monomers or joining a next 
monomer to the chain is connected with the creation of one particle of water. The 
same process is taking place in the polycondensation reaction of polyazomethine 
PPI2. Thin layers of polyazomethine PPI2 were obtained in effect of the reaction of 
1 ,4-dioxybenzaldehyde (ODBA) with 1,4 phenylenediamine (PPDA), as presented 
in Fig. 16. 


Application of CVD Method for the Fabrication of Carbon Nanotubes 

One of the fabrication methods of single-wall and multiwall carbon nanotubes is 
CVD method. In this method, temperature distribution of hydrocarbon is taking 
place (e.g., methane, ethylene, acetylene, benzene) or that of alcohol vapors with 
other gases (e.g., argon, helium) over the surface of metallic catalyst. Catalysts can 
be transient metals such as iron, nickel, and cobalt, which are deposited on solid 
substrate, most frequently silicon one. Depending on the factors, for example, type 
of catalyst, composition of gas phase, gas flow rate, or process temperature, various 
carbon nanomaterials can be obtained, including carbon nanotubes. 
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Fig. 16 Polycondensation of polyazomethine PPI2 


In effect of the decomposition of hydrocarbons, mainly on the upper surface of 
the metal nanoparticle being the catalyst, carbon is diffusing down into the nano- 
particle, and hydrogen is carried away from the chamber. When carbon reaches 
the threshold solubility in the nanoparticle, it starts to precipitate on its surface, and 
the growth of nanotube begins. The growth of nanotubes can be carried out in 
effect of one or two mechanisms. The first mechanism known as “tip-growth 
model” (Fig. 17a) is taking place when the cohesion of the catalyst with the 
substrate is weak. The nanotube is growing then on the bottom part of the metal 
nanoparticle, pushing it upward. When the cohesion of the catalyst with the 
substrate is strong, then the second mechanism “base-growth model” is used 
(Fig. 17b), in which the nanotube is growing in the upper part of the nanoparticle 
of the catalyst. 

The equipment and apparatus used for the fabrication of diverse nanostructures 
(carbon nanotubes, oxide and nitride nanowires, graphene) with the use of CVD 
method are new generation devices furnished with a control panel and 
computer software, enabling the selection of appropriate parameters, control of 
the processes, and data logging. Due to the control of technological processes, they 
can be optimized. These systems must satisfy very strict safety standards, 
eliminating in this way contraindications involving the application of toxic or 
pyrophoric gases or precursors. In the operating setup, a so-called furnace, there 
is a quartz pipe (Fig. 18) to which one or two gases can be supplied, depending on 
the type of device. In the device of this type, the applied systems must ensure the 
possibility of fast heating or cooling of silicon substrates by moving them effi- 
ciently between the chamber where they have been loaded and the working 
chamber. Due to the fast heating process, high-quality products are obtained. A 
standard fabrication process of carbon nanotubes is carried out under reduced 
pressure (100-700 Torr) at the temperature of about 1,100 °C and takes approxi- 
mately 1 h. 

Also other auxiliary devices can be joined to the system, such as gas cabinets (for 
hazardous gases) and air conditioning (for the disposal of gases and others) 
(Fig. 19). The following gases are applied in the device: Ar, H 2 , V, and C 2 H 4 . 
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Fig. 17 Widely accepted growth mechanisms for CNTs: (a) tip-growth model, (b) base-growth 
model 


Fig. 18 Working chamber 
with quartz pipe (Promotion 
materials of First Nano, Inc.) 



Nowadays, the method is most frequently applied for the acquisition of modern 
nano structural materials, nanocoatings, and composites for the application, among 
others, in nanoelectronics or photovoltaic s, due to the possible acquisition of a 
considerable number of nanostructures in one cycle (Figs. 20 and 21). 









2784 


LA. Dobrzanski et al. 


Gas Cabinet 


Gas Panels 


Process Equipment 

Vent/Pure Gas Lines 


Gas Abatement 


Communication/Safety Interface 




Process Gas Lines 


Vacuum Process 
Exhaust Pump 
Exhaust 






Fig. 19 System EasyTube 2000 with auxiliary devices manufactured by First Nano, Inc. 
(promotion materials of First Nano, Inc.) 
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Fig. 20 SEM image of carbon nanotubes obtained with CVD technique (Promotion materials of 
First Nano, Inc.) 


Properties and Application of CVD Coatings 


One of most significant applications of CVD coatings is the modification of the 
surface of machines or tool elements applied in the toolmaking and machine- 
building industries. The application in this sector involves principally diamond 
layers, diamond-like carbon layers, as well as carbide, nitride, and boride layers 
characterized by high hardness, resistance to mechanical (abrasive) wear, and high 
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Fig. 21 TEM image of 
carbon nanotubes obtained 
with CYD technique 





resistance to erosion or to the impact of aggressive chemical media in raised 
temperatures. CVD methods have been principally applied for the fabrication of 
anti-abrasive and anticorrosion layers. 

Layers of ceramic materials obtained with CVD methods are widely applied in 
medicine also. The titanium nitride and carbide coatings are particularly favored 
since they are used as coatings of surgical tools; nitride layers are also promising in 
perspective to modify the surface of the prostheses of the moving parts of living 
organisms. 


Diamond and DLC Coatings 

Synthetic diamond for commercial purposes has been fabricated in laboratories for 
over 40 years, with HPHT (high-pressure, high-temperature) method being applied 
for that purpose. Under high pressure, 50-100 kbar, and high temperature of about 
2,000 K, diamond is obtained from carbon melted in metal. Yet the size of crystals 
obtained in this way is relatively small (several millimeters), and therefore their 
applicability scope is rather limited. Therefore, the development of methods to 
fabricate thin layers or diamond coatings of considerable size (several centimeters) 
on various substrates has offered new applicability potentials in many other areas. 
One of the methods in question is CVD method, in which diamond layers are made 
from plasma generated from hydrogen- and carbon-containing gases. The layers 
obtained in this way have all the advantages demonstrated by diamond, such as 
extraordinary hardness, mechanical and chemical resistance, high temperature 
conductivity, close to zero electrical conductivity, and useful optical properties, 
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thus getting a well accepted name as diamond-like carbon (DLC) (Dobrzanski and 
Dobrzanska-Danikiewicz 2011; Bunshah 1994; Pierson Hugh 1999; Bunshah 2002; 
Mitterer et al. 2003; Eskildsen et al. 1999; Kwiatkowska et al. 2008; Mitura 2007; 
Mitura et al. 2006; Butler et al. 1993; May 2000). 

The fabrication mechanism of DLC coatings by PACVD process consists of a 
number of chemical and physical processes taking place both in the gas phase and at 
the interface boundary solid-gas (Fig. 5). The main stages of diamond layer growth 
can be divided into transport and mixing of gas reagents (CH 4 and H 2 ), activation 
(decomposition of gas reagents into atoms and reactive radicals), reactions in gas 
phase, adsorption of radicals on substrate surface, reactions between adsorbed 
radicals or between adsorbed radicals and gas phase atoms, nucleation and coating 
growth, and finally desorption into gas phase or diffusion near the surface. 

Atomic hydrogen, being the motor of chemical processes, is a significant 
component of gas phase. Atomic hydrogen is produced in plasma as a result of 
the dissociation due to a collision with an electron (the recombination of hydrogen 
atoms takes place slowly at the working pressure in the reactor, i.e., below 1 atm). 
Then, chemical reactions between hydrogen and hydrocarbons and between hydro- 
carbons alone take place in the plasma produced in the activation process. 

The plasma activating the deposition process is not in contact with the surface on 
which the deposition takes place. The transport from the activation area to the 
substrate surface is carried out through diffusion and/or convection. Outside the 
activation area, the reactive fragments still mix, and before they get into contact 
with the substrate surface, they participate in a number of chemical reactions. Upon 
collision with the substrate surface, they may be adsorbed or enter into reactions, 
desorb back to gas phase, or diffuse into other areas on the surface until they find an 
appropriate place for reaction. One of the options is the reaction leading to diamond 
formation. Diamond growth can be also obtained from gas precursors other than 
methane (such as ethane, propane, butane, or acetylene). However, there is a 
presumption that in most cases atomic hydrogen is an indispensable component 
of gas mixture since it takes part in key reactions of the process. The exact 
mechanism of diamond production in the CVD method is complex and depends 
on several factors. The literature materials present only the models of this process. 
The primary and critical step is the nucleation of diamond on the surface. The 
control over this step is immensely important in terms of the optimization of 
diamond layer properties, such as adhesion, grain size, transparency, or smooth- 
ness. The factor that has a tremendous influence on the nucleation process involves, 
without limitation, the substrate type and its appropriate preparation. 

The building process of a DLC coating can be roughly described in the following 
way. While growing, the diamond surface is almost totally saturated with hydrogen, 
which reduces the number of places in which hydrocarbons (most probably CH 3 ) 
can be adsorbed, and limits the migration possibility of the already adsorbed 
radicals. Surface hydrogen is then separated by gaseous atomic hydrogen, forming 
H 2 with it and leaving behind a reactive place on the surface. This place can be 
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taken up by CH 3 , effectively adding carbon to the surface. If a similar process takes 
place in an adjacent place and another atomic hydrogen separates hydrogen from 
any of the carbon groups, producing a radical, a reaction between the radical and the 
nearest carbon group can take place, closing the nucleus and simultaneously 
bonding two carbons in the diamond lattice. In diamond, carbon is bonded only 

Q 

with sp bonds, but in the process described above, on the substrate surface, also 
graphite sp~ type bonds are formed. And here again the role of hydrogen is 
significant. During the dynamic CVD process, the material is deposited on the 
surface and simultaneously etched by the atomic hydrogen back to gas phase. 
However, the sp~ bonds are etched faster than sp bonds, which enables the 
nucleation and growth of the diamond layer. 

In the mixtures containing oxygen, the OH group plays a similar role to that of 
hydrogen, though it is more efficient in the removal of the graphite form of carbon, 
increasing in this way the growth rate and quality of the diamond layer. 

DLC coatings fabricated with different types of PVD or CVD methods have 
recently raised a lot of interest. High hardness, high resistance to chemical factors, 
low friction coefficient, and low wear make them attractive in terms of protective, 
decorative, and tribological applications. They are applied in food production 
sector, medicine, and various fields of technology (parts of vehicle engines, cutting 
edge tools). Depending on deposition parameters (composition, pressure), DLC 
coatings of different electrical properties can be fabricated, from transparent insu- 
lators to black conductors. From the tribological viewpoint, DLC coatings are 
interesting as they can be applied in tribological nodes, operating in the conditions 
of limited lubrication and dry friction. 

On a broader scope, carbon coatings producible by CVD methods are not limited 
to DLC. They include DLC (diamond-like carbon), NCD (nanocrystalline dia- 
mond), and UNCD (ultrananocrystalline diamond). Each of them is characterized 
by different properties, specific for individual structures and phase composition, 
which determine their applications. 

Carbon coatings are applied in numerous fields of life and are characterized by 
numerous properties which directly result from the applied process parameters and 
their production method. High thermal conductivity of the coatings, high hardness, 
and resistance to abrasive wear are the characteristics used for numerous applica- 
tions, including coatings for cutting tools. Biocompatibility and chemical resistance 
provide a great applicability potential of carbon coatings in medicine. The change 
of properties of electrical carbon coatings subjected to the influence of certain 
compounds is now used for the production of biosensors. The content ratio of two 
basic phases present in biosensors, graphite and diamond, determines the properties 

Q 

of carbon coatings. The type of sp bonds that occurs in such coatings determines 
their high hardness, thermal conductivity, chemical inertness, and perfect wear 
resistance. The graphite phase, in turn, determines their good electrical conductivity 
and low friction coefficient. The proper control over the carbon coating production 
process offers great applicability potential for such coatings. 
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Table 4 Relevant wear and corrosion properties of CVD coating materials (at 25 °C) (by Hugh of 
Pierson) 


Type of 
coating 

Hardness 

kg/mm 2 

Thermal 

Conductivity 

W/cmK 

Coeff. of thermal 
expansion 
m/m x °C.10 6 

Remarks 

TiC 

3,200 

0.17 

7.6 

High wear and 
abrasion resistance 
low friction 

TiN 

2,100 

0.33 

9.5 

High lubricity stable 
and inert 

Ti(C,N) 

2,500-3,000 

0 . 2 - 0 . 3 

App. 0.8 

Stable lubricant 

Cr 7 c 3 

2,250 

0.11 

10 

Resists oxidation to 

900 °C 

SiC 

2,800 

1.25 

3.9 

High conductivity 
shock resistant 

TiB 2 

3,370 

0.25 

6.6 

High hardness and 
wear resistance 

AI 2 O 3 

1,910 

0.34 

8.3 

Oxidation resistant 
high stability 

DLC 

3,000-5,000 

2.0 


High hardness, high 
thermal conductivity 


Wear-Resistant Coatings 

The number of CVD coatings resistant principally to abrasive wear and corrosion is 

relatively small. The most important ones include (Table 4; Pierson Hugh 1999; 

Bunshah 2002; Sokovic et al. 2006; Pakula et al. 2004; Dobrzanski and Pakula 

2005; Dobrzanski et al. 2006; Dobrzanski et al. 2007; Dobrzanski and Pakula 2006; 

Dobrzanska-Danikiewicz et al. 2011; Dobrzanski et al. 2010): 

• Titanium nitride (TiN) - crystallizes in a regular NaCl lattice type, having a 
typical interstitial phase with a very wide homogeneity range from 30 to 50 at. % 
of nitrogen. The microhardness of this phase grows proportionally to nitrogen 
concentration. TiN layers are highly resistant to abrasion, but they oxidize in air 
atmosphere at temperature 670-870 K. 

• Titanium carbide (TiC) - has high hardness and abrasion resistance; 
however, it does not make a good diffusion barrier - as it has a low chemical 
resistance. 

• Carbonitride titanium (Ti (C, N)) - is a solid solution of phases TiC and TiN, 
combining the properties of both. It perfectly protects against abrasion and has 
good slipping properties. It is used for coating tools applied principally in the 
treatment of ceramic, graphite, and polymer materials. 

• Chromium carbide (Cr 7 C 3 ) - has good corrosive resistance and high oxidation 
resistance. It is rarely applied as a single layer; however, it finds application 
when combined with TiN and TiC layers. 
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• Silicon carbide (SiC) - has low thermal expansion, high hardness, and good 
oxidation resistance. It is widely used as reinforcing phase in composite coatings 
with graphite matrix in order to provide resistance to abrasion and oxidation. 

• Titanium boride (TiB 2 ) -is very hard and corrosion resistant and provides good 
protection against abrasion. 

• Aluminum oxide (A1 2 0 3 ) - has high strength and perfect stability and oxidation 
resistance in high temperature. 

• Diamond-like carbon - i.e., carbon with diamond properties. These are self- 
lubricating coatings. Their stmcture resembles diamond, but they have graphite 
stmcture at phase boundaries, which facilitates slipping and reduces friction 
between the surfaces. Their sensitivity to humidity is their significant disadvantage. 

Apart from the coatings mentioned above, the following coatings should also be 
classified to this group of materials, (Ti,Al)N, B 4 C, HfN, MoS 2 , WSe 2 , and 
polycrystalline diamond, which are in the development phase and also significantly 
increase the resistance to abrasion, erosion, or corrosion. Out of the said materials, 
molybdenum disulfide (MoS 2 ) has very interesting properties. It is used for coating 
elements exposed to strong friction and, in effect, accelerated wear and high 
temperatures (so-called dry lubricants). They reduce the friction coefficient and 
simultaneously reduce the heat produced at the contact point between the abrading 
surfaces (Dobrzanski 2006; Dobrzanski and Dobrzanska-Danikiewicz 2011; 
B unshah 1994; Pierson Hugh 1999; Bunshah 2002; Tracton Arthur 2005; Boxman 
Raymond et al. 1997). 


Selected CVD Coatings on Sintered Tool Materials 

Substantial amounts of financial resources are spent every year in the machining 
sector, and the amounts estimated in billions of US dollars are still growing along 
with the increasing use of hard and difficult to process materials, e.g., composites 
and refractory materials applied in machine-building industry and mechanical 
engineering. This cost represents a significant part of the total production cost, 
and in some industry sectors, principally in aircraft industry, it is a major cost 
factor. This means that there is still an immense demand for modern machining 
equipment and the need for highly efficient tool materials. A very important role of 
cost reduction in the tool material sector is played by abrasion-resistant CVD 
coatings, which increase the life of a cutting edge by up to several hundred percent, 
as compared to uncoated material. Titanium nitride (TiN) coating was used on 
sintered carbides for the first time in the 1960s, and the application of CVD coatings 
has been successfully growing ever since. A significant percentage of machining 
tools today has either PVD or CVD coatings, depending on the type of tool 
(Sokovic et al. 2006; Pakula et al. 2004; Dobrzanski and Pakula 2005; Dobrzanski 
et al. 2006; Dobrzanski et al. 2007; Dobrzanski and Pakula 2006; Dobrzanska- 
Danikiewicz et al. 2011; Dobrzanski et al. 2010; Hajduk et al. 2009; Hajduk 
et al. 2008; Sugimoto 2006). 
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Fig. 22 Fracture of the Ti(C,N) + A1 2 0 3 + TiN coating deposited onto the sialon ceramics 
substrate 


CVD coatings are most frequently applied to fabricate hard and abrasion- or 
corrosion-resistant carbide, nitride, carbonitride, and metal oxide coatings on steels, 
ceramics, and high-melting point metals. Such coatings have been widely applied to 
cover cutting edges as well as hot and cold plastic working tools - contributing to a 
multiple increase of their durability. 

One of the application examples of CVD coatings involves multilayer and 
multicomponent coatings used on sintered tool materials, including mainly sintered 
carbides, nitride tool ceramics Si 3 N 4 , and sialon ceramics. 

Numerous studies on the application of wear-resistant coatings on tool ceramics 
have verified the existing opinion that depositing coatings on tool ceramics is 
unreasonable due to its generic high hardness. It has been proven that coating 
ceramic tools is by all means justified, because first of all it adds to the increasing 
durability of machining tool cutting edges as a result of the reduction of the heat 
released in the machining process, thanks to the reduction of friction force on rake 
face. Secondly it has been observed that the coatings covering the pores on tool 
ceramic surfaces eliminate chipping places. Moreover, protective coatings cause a 
delay in the diffusive wear process. The studies included, without limitation, the 
application of CVD coatings, being the combination of isomorphic layers with 
titanium nitride, including TiN, Ti(C,N), and A1 2 0 3 layers (Figs. 22 and 23). The 
research studies demonstrate that the coatings produced on the cutting edges made 
from nitride and sialon ceramics cause a significant increase of durability of the 
cutting edges coated with them, largely thanks to their good adhesive-diffusive 
adherence to the ceramic substrate. Multilayer coatings demonstrate compact 
structure, without pores or discontinuities. A very important role in such coating 
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Fig. 23 Surface topography of the Ti(C,N) + A1 2 0 3 + TiN coating (AFM) 


systems is played by the transitional zones between the substrate and the coating 
and between particular layers. The presence of such zones in TiN + A1 2 0 3 and 
TiN + A1 2 0 3 + TiN + A1 2 0 3 + TiN was observed upon the examination of thin 
foils (cross section) in a transmission electron microscope (Figs. 24 and 25). The 
transitional zones facilitate good adhesion between individual layers, which, com- 
bined with high hardness of the coatings, contributes to high resistance to abrasion 
wear of the obtained surface layer (Table 5). 

The examination results of a thin foil from the cross section of TiN + A1 2 0 3 
coating show that the TiN layer is of columnar structure, while the A1 2 0 3 layer is of 
coarse-grained structure. Between the TiN and A1 2 0 3 layers, there is a transitional 
zone in which fine grains of these two phases can be found. Also a small amount of 
fine A1 2 0 3 grains of monoclinic structure was found in this zone, in contrast to the 
typical structure of A1 2 0 3 phase with rhombohedral lattice which occurs outside the 
boundary area across the entire width of the layer. The analysis of thin foils 
carried out in the perpendicular and parallel plane of the investigated coating 
TiN + A1 2 0 3 + TiN + A1 2 0 3 + TiN confirms that the nitride coating containing 
TiN- and Al 2 0 3 -type phases was deposited on the substrate from tool ceramics. The 
structure of A1 2 0 3 layer of the rhombohedral lattice is characterized in the parallel 
section by fine-grained structure with the grain size not exceeding 500 nm. In the 
same section of thin foils, between the grains of A1 2 0 3 phase, we can also see a fine- 
crystalline structure of TiN phase having a regular lattice (Fig. 24). The analysis of 
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Fig. 24 (a) Structure of TiN + A1 2 0 3 coating: thin foil from cross section of the layer surface 
(TEM). (b) Light held, (c) Diffraction pattern from the area as in figure b. (d) Solution of the 
diffraction pattern from Fig. 24c 


thin foils carried out along the cross section of the investigated coating allows to 
examine the structure of the separation boundary between the substrate created by 
phase P-Si 3 N 4 with hexagonal lattice and TiN layer of regular lattice. The TiN layer 
consists of evenly arranged grains in the form of oblong columns whose maximum 
width is ca. 500 nm. The A1 2 0 3 layer of the rhombohedral lattice in the cross section 
of the investigated coating TiN + A1 2 0 3 + TiN + A1 2 0 3 + TiN demonstrates the 
structure similar to the columnar one (Fig. 25). 

The combination of various layers of multilayer coatings ensures a number of 
beneficial properties, such as good adherence to the substrate, which is one of basic 
properties a CVD coating on cutting edges should have, and furthermore, high hardness 
and abrasion resistance as well as diffusion barrier provided by the A1 2 0 3 layer. 


Anticorrosion CVD Coatings on Steel Substrate 

Metalorganic coatings are successfully applied as corrosion-resistant coatings for 
steel materials. Most frequently, these are coatings containing chromium but also 
rare-earth elements or aluminum oxide. The properties of MOCVD protective 
coatings are to a large extent dependent on process parameters and on the structure 
of the coating alone (Lo and Wei 1997; Kim et al. 2002). 
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Fig. 25 (a) Structure of TiN + AI2O3 + TiN + AI2O3 + TiN coating: thin foil from cross section 
of the layer surface (TEM). (b) Diffraction pattern from the area XI as in figure a. (c) Diffraction 
pattern from the area X 2 as in figure a. (d) Solution of the diffraction pattern from figure b. 
(e) Solution of the diffraction pattern from figure c 

The influence of process parameters on the obtained coatings has been presented 
based on the example of chromium and molybdenum 6-carbonile - (Cr:MO)(CO)6 
- deposited on steel substrate (steel X5CrNil8-10) within the temperature range 
175-450 °C. In the research studies presented in the work (Lo and Wei 1997), the 
influence of process temperature on morphology, density, and chemical composi- 
tion of the coating was determined. The coating was obtained by the application of 
the precursors of chromium 6-carbonile and molybdenum 6-carbonile mixed 
together in the reactor chamber. The coatings were deposited at different temper- 
atures but at constant pressure of 133 Pa. The structures of particular coatings are 
presented in Fig. 26. 







2794 


LA. Dobrzanski et al. 


Table 5 The results of studies into the structure and properties of sintered tool materials with the CVD coatings deposited (Dobrzariska-Danikiewicz 
et al. 2011) 
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Fig. 26 SEM microphotographs presenting the structure of coatings deposited with MOCVD 
method at different temperatures: (a) 225C, (b) 350C, (c) 450C 


Fig. 27 Diagram of 
corrosion current density as 
dependent on the temperature 
of coating deposition process 
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The coatings obtained in lower temperatures are characterized by fine-crystalline 
structure, and the coating obtained at the temperature of 450 °C has amorphous 
structure. The coatings fabricated at the temperatures of 250 °C and 350 °C ensure 
very good resistance to corrosion in contrast to the coating fabricated at the 
temperature of 450 °C (Fig. 27). 

The coatings obtained at 350 °C were characterized by the highest density. The 
research studies have demonstrated how much the obtained coatings are dependent 
on the temperature of their deposition. By changing this parameter alone, we can 
obtain coatings of very different properties. 
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Fig. 28 Diagram for the formation of pitting corrosion in passive layers Fe 2 0 3 -Cr 2 0 2 


Another example (Kim et al. 2002) illustrating the application of MOCVD 
method for the fabrication of coatings resistant to corrosion involves the fabrication 
of artificial passive layers. Such coatings are characterized by better resistance to 
corrosion, as compared to passive layers formed in the natural way on steel surface. 
It has been proved that the deposited passive coatings (Fe 2 03 -Cr 2 03 ) have better 
corrosive resistance than natural coatings having the same chemical composition. It 
is connected with different crystalline structure of particular coatings. Natural 
coatings are characterized by amorphous structure and by high content of M-OH 
bonds, which are responsible for pitting corrosion (Fig. 28). Coatings obtained at 
the temperature 350 °C are characterized by crystalline structure and high number 
of M-0 bonds. 
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For the fabrication of passive coatings, iron acetylacetonate (decomposition 
temperature —140 °C) and chromium acetylacetonate (decomposition 

temperature —150 °C) were applied as precursors. The process is carried 
out in oxygen atmosphere, and nitrogen is used as carrier gas. The pressure is 
1 kPa. 

A very good influence on corrosive resistance of chromium coatings at high 
temperature is obtained by adding rare-earth elements. It is worthwhile mentioning 
that as the competition for chromium oxides, also protective layers based on 
aluminum oxide can be applied. The application of aluminum and zirconium 
oxide (Al 2 0 3 -Zr0 2 ) enables the acquisition of layers resistant to the impact of 
both acids and bases. 


Future Trends 

The leading direction involving the development of the discussed 
CVD methods involves the reduction of process temperature, among others, 
through : 

• Electrical activation of gaseous environment and the worked surface by means 
of glow discharge (e.g., PACVD process - plasma-assisted chemical vapor 
deposition) 

• Application of gaseous atmospheres containing metalorganic compounds (e.g., 
MOCVD process - metalorganic CVD) 

• Application of nanolayers deposition “atom after atom” (ALD - atomic layer 
deposition) 

The prospective direction involving the development of such methods comprises 
new fabrication ways of multilayer coatings - e.g., Ti(OCN) and Ti(CN) - through 
the application of gaseous atmospheres containing metalorganic titanium com- 
pounds (Ti(OC 3 H 7 ) 4 or Ti[N(CH 2 CH 3 ) 2 ] 4 vapors) or multilayer coatings produced 
through the combination of various processing methods, including, among others, 
ion nitriding and PACVD method, diffusion boronizing and PACVD method, or 
chemical nickel plating using the electroless method with ion boronizing process, 
which guarantees the improvement of the usability properties of the processed 
objects, particularly in terms of the rise of their resistance to corrosion and 
abrasive wear. 

The basic development directions of vacuum technologies of surface layer 
modification can be classified within four major groups (Dobrzanski 2006; 
Dobrzanski and Dobrzanska-Danikiewicz 2011; Bunshah 1994, 2002; Pierson 
Hugh 1999; Tracton Arthur 2005; Boxman Raymond et al. 1997; Jung 
et al. 2006; Mattox 2003; Mitterer et al. 2003; Burakowski and Wierzchon 1995; 
Merlo 2003; Hyett and Parkin 2007; Tu et al. 2006; Allah et al. 2010; Uhlmann and 
Koenig 2009; Olszyna and Smolik 2004; Eskildsen et al. 1999): 
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• The optimization of thermal-chemical processing carried out with the use of 
vacuum methods in terms of layer growth kinetics and expected usability 
properties, including: 

1. New generation systems for automatic control of gas nitriding processes, 
using innovative control systems for nitriding atmosphere and process run, 
e.g., a magnetic sensor 

2. Low-temperature diffusion processing, e.g., nitriding and ion oxynitriding 
processes of austenitic steels for aircraft and aerospace industry, nuclear 
power generation, and surgical instruments 

3. Anticorrosion diffusion layers, being an alternative for cadmium coating of 
steel elements for automotive industry 

4. Special diffusion processes to be used in “duplex” processes 

• The formation of the structure of surface coatings and the modification of their 
chemical and phase composition through the application of duplex-type tech- 
nologies, combining thermal-chemical processing with electroplating or with 
PVD/CVD techniques or with other techniques, including: 

1 . “Duplex” technologies associating the nitrided layer of required morphology 
and phase composition with PAPVD or other surface modification tech- 
niques, e.g., ion implantation, laser processing, and mechanical processing 

2. The fabrication of multicomponent and composite surface layers with the 
application of the so-called multiplex processes consisting in combining - in 
the way grounded in terms of economics - several different surface engineer- 
ing methods (formation of the structure and properties of titanium, magne- 
sium, aluminum alloys for aircraft, automotive, power generation industries, 
medicine) 

3. The use of the combination of CVD (MWCVD, RFCVD) and PVD processes 
(e.g., PLD method) in the formation of the properties of plastics and polymer 
composites for the purposes of medicine, automotive, and aircraft industries 

4. The formation of usability properties of sintered materials, metal-ceramic 
composites with hybrid surface engineering techniques 

• The development of mathematical models of processes, based on experimental 
data enabling the simulation of their runs, and the development of computer 
control systems over the technological processes with the use of artificial 
intelligence elements, including: 

1. The creation of central database comprising data on layers and coatings in 
connection with process parameters and the development of methods of its 
supplementation and use 

2. Designing diffusion processing methods with the use of artificial intelligence 
elements, neuron lattices, and fuzzy logic, based on the empirical data sets 
and layers to be used in the duplex processes 
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3. The development of model designing systems of hybrid processes of surface 
processing for dedicated special applications, e.g., drive transmission 
mechanisms 

• The development of industrial vacuum fabrication technologies of special mate- 
rial layers, including: 

1. Nanocomposite antiwear and friction-reducing coatings, based on 
multicomponent nitrides and carbides of transition metals, to be used under 
dry friction and raised temperature conditions 

2. Ultrahard nanocomposite nanocrystalline-amorphous coatings 

3. Multilayer hard coatings of superlattice structure (type TiN/CrN) 

4. Hard coatings of raised resistance to high-temperature corrosion (type 
CrN/TiAIN) 

5. Bioactive coatings produced on the basis of calcium and phosphorus com- 
pounds for endoprosthetic purposes 

The studies on modern coating materials, obtained with the use of PACVD 
technology, subjected to high mechanical loads, high temperatures, and corrosive 
conditions, involve the following fundamental issues: 

• The development of the synthesis method of the coatings of carbide and nitride 
monometallic (MeC, MeN) and multimetallic (Me!Me 2 N) phases 

• The optimization of coating structure in order to obtain the required operating 
properties 

• The development of new plasma sources and machine groups including electri- 
cal power supply packs, control systems ensuring high-quality of the obtained 
structures, and the economics of the fabrication process of large-lot and small-lot 
production 

A considerable advantage of the synthesis of PACVD coating materials involves 
the easy acquisition of substrates in the form of metal ions which are formed in 
plasma environment. The conditions of coatings’ synthesis responsible for the 
phase structure and morphological structure of the coatings are formed by 
the energy and chemical composition of plasma. These basic parameters can be 
controlled by electrical fields, magnetic fields, pressure, and the ratio of reactive 
gases. 

In the industrial coating process of glass packaging, tin compounds are com- 
monly used in order to obtain a thin film of tin oxide on the surface, which forms a 
protective coating against mechanical damage, including also defects and micro- 
cracks on glass surface. The coating reduces friction and scratching risk, enabling 
highly efficient filling of glass packaging. Furthermore, tin oxide in thin layers is 
characterized by synergy as compared to PVC or PE layers “cold-applied” (at the 
end of the annealing process) as the next protective coat, reducing the possibility of 
damaging the packaging during filling, storage, or transportation. 
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There have been numerous reports recently, particularly patent claims, concerning 
the refinement of packaging glass surface, also fiberglass, technical glass, float glass, 
cellular glass, and other types of glass, using CVD method through vapor deposition 
sputtering of aluminum oxide, A1C1 3 , or hydrated aluminum chlorohydrate, A1C1 3 
x 6 H 2 0. These compounds sublime, respectively, at the temperature of about 
1 83 °C and 195 °C. The vapors are brought to the hot glass surface of the temperature 
by about 150 °C higher than the transformation temperature. In effect, it brings about 
a considerable rise of microhardness, bending strength, and, in the case of fiberglass, 
a particularly high increase of tensile strength. 


Summary 

CVD technologies and, in particular, the coatings obtained by means of them have 
been an inseparable part of our everyday life for years. Practically in every sector of 
manufacturing industry, their role and importance are unquestionable - high- 
precision machining tools without carbide or nitride coatings - improving their 
operating parameters to the level unobtainable for uncoated tools are unimaginable. 
The contemporary electronics, particularly microchip electronics, would never 
have achieved development so dynamic in such a short time without the support 
of CVD, just like optics whereof development was substantially assisted by the 
priceless protective and anti-reflexive surfaces obtained in this way. 

The application potentials of PVD and CVD technologies seem to be unlimited, 
especially in the case of VPE and ALD techniques. The two totally different techniques 
offer absolutely new possibilities in their areas. The ALD technique, facilitating the 
acquisition of coatings, having one atomic layer deposition thickness, provides the first 
successful nanoscopic stmctures that have become the basis for NEMS (nanoelectrome- 
chanical systems), the first attempts to build real nanomachines, yielding the constmction 
of the smallest engine in the world, its size being of the order of 500 nm. The VPE 
technique, in turn, offers extraordinary potentials in the fabrication process of new 
materials. It is this technology that enables the fabrication of graphene - the most amazing 
substance known to humankind. Graphenes are the strongest materials known to people. 
They have the highest thermal and electrical conductivity, and one of their fabrication 
methods involves their epitaxial growth on the surface of silicon substrate. 

The possibility of creating completely new materials of much better properties 
than those met in the majority of the currently applied materials, based also on CVD 
technologies, will be still interesting in the years to come. 
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Abstract 

Thermal spray refers to a group of coating techniques whereby droplets of 
molten or partially molten materials are sprayed onto a solid substrate to 
develop the coating. Based on the applied heat source and the process 
characteristics, a large number of thermal spray techniques are commercially 
available, enabling a wide range of materials to be coated. In thermal spray, 
the basic bonding mechanism is mechanical interlocking, and the bonding 
between splats can be improved by increasing temperature or particle 
velocities during particle impact. However, for coating of metallic materials 
or composites, high processing temperatures can increase the amount of 
oxides embedded in the coating and, therefore, reducing their performance 
for structural application. Cold spray is another solid-state spraying process 
in which the coating materials are not melted in the spray gun; instead, the 
kinetic energy of fast-traveling solid particles is converted into heat, and 
there is interfacial deformation upon impact with the substrate, producing a 
combination of mechanical interlocking and metallurgical bonding. In the 
present contribution, a detailed overview of the thermal spray and cold 
spray techniques on coating of materials is presented. Finally, the future 
scope of the application of thermal spray and cold spray techniques is 
presented. 
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Introduction 

Surface coating aims at tailoring the micro structure and/or composition of the near- 
surface region of components by applying another layer on the surface to improve 
surface-dependent engineering properties (Budinski 1998). Figure 1 shows a brief 
classification of surface coating techniques used in practice. The coating techniques 
may broadly be classified in to physical, chemical, electrochemical, and thermal. 
From the subclassifications of the coating techniques, it may be summarized that 
the coating techniques applied in practice include electrodeposition/electroless 
deposition, sol-gel coating, electrophoretic deposition, chemical vapor deposition 
(CVD), physical vapor deposition (PVD), weld overlaying, laser surface engineer- 
ing, and thermal spray (TS) (Budinski 1998). Both the initial equipment costs and 
the operating costs for the electrodeposition/electroless/electrophoretic deposition 
are relatively low. However, the by-products are considered to be highly toxic and, 
hence, not environment friendly. Though the equipment costs for CVD are moder- 
ate, the precursor gas mixtures used for CVD are expensive (Budinski 1998). 
Furthermore, a precise control of the process, cleaning of the system at a regular 
interval, and neutralization of the output gases need to be undertaken, which 
increases the overall cost of the process. The cost of PVD coating equipment is 
very high, due to the requirement of high vacuum chamber of sufficient volume to 
make the process cost-effective. Furthermore, all the coating techniques as 
described above are suitable for thin film deposition. The bonding mechanism is 
predominantly van der Waals force with very weak bond strength. Weld overlaying 
is a technique applied for the deposition of a thick layer by melting the deposited 
materials using the heat source as in welding and applying it on the surface of 


Coating Technique 



Fig. 1 A brief classification of surface coating techniques used in practice (Budinski 1998) 
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substrate (Budinski 1998). Inability to produce precision coating thickness, signif- 
icant dilution, and a wide heat-affected zone are the disadvantages associated with 
weld overlaying (Budinski 1998). Laser surface engineering involves application of 
high-power laser as a source of heat to melt the material in the form of wire or 
powder and, subsequently, depositing it on the surface of a substrate (Draper and 
Poate 1985; Steen 2003; Dutta Majumdar and Manna 2011). The high installation 
cost of laser and the necessity of optimization of process parameters for the 
development of defect-free and homogeneous microstructure are the major chal- 
lenges for industrialization of laser surface engineering techniques. On the other 
hand, thermal spray is a generic term applied for a group of processes in which 
metallic, ceramic, cermet, and polymeric materials may be coated on the surface by 
heating them in the form of powder, wire, or rods to near or somewhat above their 
melting point, accelerating the resulting molten or semi-molten droplets of material 
in a gas stream and projecting against the surface to be coated (i.e., the substrate) 
(Thorpe 1993). On impact, the droplets flow into thin lamellar particles adhering to 
the surface, overlapping and interlocking as they solidify. The total coating thick- 
ness is usually generated in multiple passes of the coating device. The advantages of 
thermal spray deposition over other coating techniques include the ability to coat a 
wide variety of materials, no distortion of the substrate, no thickness limitations, 
precision, environment friendliness, ability to apply the process to repair worn/ 
damaged part, and relatively higher bond strength as compared to the electro/ 
electroless deposition and physical-/chemical-based coating techniques. The 
major disadvantage of the process is the line-of-sight nature of these deposition 
processes. Furthermore, there are also size limitations prohibiting the coating of 
small, deep cavities into which a torch or gun will not fit. 

Cold spray is another solid-state spray process in which the coating materials are 
not melted in the spray gun (such as in conventional thermal spray); instead, the 
kinetic energy of fast-traveling solid particles is converted into localized heat upon 
impact with the substrate and causes interfacial deformation, producing a combi- 
nation of mechanical interlocking and metallurgical bonding (Singh et al. 2012). It 
is generally accepted that sprayable materials require a critical amount of energy 
(related to the velocity of the particles) and impact temperature for effective 
bonding to occur. Around the particle-substrate collision interface, a high-strain- 
rate deformation occurs producing microscopic protrusion of material and localized 
heating which may lead to metallurgical bonding. In the present contribution, a 
detailed overview of the thermal spray and cold spray techniques for coating of 
materials is presented. Future application of thermal spray and cold spray for the 
coating of real components is also discussed. 


Brief History of Thermal Spraying 

The published patents on thermal spray deposition technique date back in 1882, 
and since then few patents are available till 1911 (Thorpe 1993). The first applica- 
tion of thermal spray deposition was by Schoop in Switzerland on deposition 
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Fig. 2 Chronological development of thermal spray deposition techniques (Turner and 
Budgen 1926) 


of tin and lead coatings by flame spraying to enhance corrosion performance 
(Schoop and Guenther 1917). Figure 2 shows the chronological development of 
thermal spray technology schematically (Turner and Budgen 1926). From Fig. 2 it 
is evident that the progress in process development is initially very slow after the 
inception by Schoop in 191 1, following which it increases at a moderate rate until the 
late 1950s. In this regard, it may be noted that the detonation-gun coatings developed 
and applied by Union Carbide Corporation gained popularity in aerospace sector, and 
subsequently, development of stabilized zirconia-based thermal barrier coating by 
plasma spray deposition technique occupied a largest share. From Fig. 2 it is observed 
that the next stage of development occurred in the 1980s by the invention of the 
vacuum plasma spraying (VPS) and the Jet Kote/high-velocity oxy-fuel coating 
(HVOF) techniques. Subsequently, there was development of improved online real- 
time feedback control system with close-loop strategies, intelligent statistical process 
control (iSPC), design of new equipment and spray powders, as well as 3D process 
modeling and improved understanding of the complex nonlinear physics underlying 
the plasma spray process. Fauchais et al. (2006) concluded that present developments 
of thermal spraying focused on four main points: 


1 . Process online control requiring correlation of particle in-flight characteristic as 
well as substrate and coating temperatures to thermomechanical and other 
in-service coating properties 

2. Study of arc root fluctuations and their causes by experiments and 3D modeling 
and relating them to structure and properties of coatings 

3. Study of splat formation and layering mechanisms and their effect on the 
properties of splat- substrate and splat-splat interfaces 
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4. Development of novel spray technique to produce nanostructured coatings, in 

particular using suspension and solution plasma spraying 

In the last three to four decades, there has been increasing interest in thermal 
spray coatings by the military and the commercial sectors. The updated research 
efforts on thermal spraying are reported in Journal of Thermal Spray Technology 
and also reported in journals devoted to the entire realm of surface engineering, for 
example, Surface and Coating Technology , Thin Solid Films , Surface Technology , 
Acta Materialia , Materials Science and Engineering , Advanced Materials , and 
Journal of Materials Science. 

The main research and development activities on thermal spray deposition are 
centered around improving spray equipment, process control through statistical 
design of experiments (SDE), intelligent statistical process control (iSPC), and 
quality function deployment (QFD) techniques; designing new control devices, 
online real-time feedback looping, mass flow controllers, powder metering equip- 
ment, manipulators, and robots; development of novel spray technologies, e.g., 
reactive plasma spraying, suspension plasma spraying, thermal plasma chemical 
vapor deposition, reactive laser plasma coating, high-velocity pulsed plasma 
spraying, high-velocity suspension flame spraying, and electromagnetically accel- 
erated plasma spraying; effective innovation and technology transfer from research 
organizations to small- and medium-sized enterprises (SMEs); and development of 
databases and expert systems as well as process maps (Tucker 1994; Crawmer and 
Thermal Spray Technologies Inc 2004). 


Basics of Thermal Spray 


Thermal spray deposition may be defined as the process where the material to be 
coated, in the form of powder, wire, or rod, is heated to semi-molten or molten state 
and is accelerated toward the substrate by a high-velocity gas stream (Thorpe 1993; 
Chattopadhyay 2004). Figure 3 illustrates the principle of thermal spray deposition 
technique schematically (Chattopadhyay 2004). From Fig. 3 it may be noted that 
during thermal spraying, powder is delivered at the tip of the heat source through a 
powder delivery feeder, melted and, subsequently, sprayed using a spray nozzle 
followed by fragmentation of molten pool and its acceleration. The velocity of the 
molten/semi-molten droplets causes deformation of the particles upon impact and 
builds up a lamellar structure with entrapped inclusions and porosities characteris- 
tics of the thermal spray deposition process (Fongo and Fongo Associates 2005). 

Figure 4 shows the spray pattern illustrating particle deposition during thermal 
spraying and the distribution of its thickness and densities along radial direction 
(Schneider et al. 2006). From Fig. 4, it may be noted that the fastest and densest 
deposits will build up at the center of the jet, where most particles are entrained and 
the highest degree of melting occurs. On the other hand, the porosity content 
increases moving radially out from the center (where fewer particles are entrained 
which tend to be coarser and perhaps semi-molten). Porosity content also increases 
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Fig. 3 Schematic of thermal spray deposition technique (Chattopadhyay 2004) 
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Fig. 4 Schematic diagram illustrating the spray pattern along the cross section in thermal 
spraying and the thickness and porosity distribution (Schneider et al. 2006) 
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Fig. 5 Schematic showing the effect of particle melting on shape and structure of coating 
(Schneider et al. 2006) 


when the impact angle is less than 90°. At the jet periphery, fine particles oxidize 
readily due to entrained air from the surrounding atmosphere and deposit as debris. 
The mass fraction of oxide inclusions increases with reduction in the size of the 
particles as they oxidize rapidly, sometimes completely, to form the major source of 
oxide inclusions in sprayed coatings (Schneider et al. 2006). 

Figure 5 shows the effect of the degree of particle melting at or just before 
impact on the structure of coating (Schneider et al. 2006). The classic lamellar splat 
morphology is created when fully melted particles at or just above the melting point 
impact, flow, and flatten after arriving at the substrate (Fig. 5a). Particle material 
spreads (Fig. 5b) and cools rapidly as the substrate acts as self-quenchant. Lamellar 
particle thickness depends on particle velocity, substrate surface tension, starting 
particle size, and substrate temperature. Superheated particles may splatter on 
impact, throwing out radial splashes of fine spherical or triangular shapes which 
deposit as satellites (Fig. 5c). Debris formed from splatter differs from debris 
produced in the jet in that splatter lodges in the coating, building up at the first 
ridge that stops its radial travel. Air blasting does not remove splatter debris. During 
thermal spraying, superheating should be avoided. The resolidified particles in 
flight may not deposit and may appear as unmelted particles but with clearly 
discernable oxide layers on the particle surfaces (Fig. 5a). 

The important characteristic features which distinguish thermal spray coating 
from other techniques include the lamellar or layered splat structure, entrapped 
unmelted or resolidified particles, porosities, oxide inclusions, grains, phases, 
cracks, and the coating -substrate interfaces. Furthermore, they contain partially 
molten particles that solidify before impacting the surface or react with gas from the 
atmosphere. The aspect ratio of individual splat depends upon the nature of coating 
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Fig. 6 Schematic diagram of coating characteristics and the micro structure of thermal spray 
deposition (Chattopadhyay 2004) 


materials, melting temperature, and the velocity of impact. Figure 6 illustrates the 
characteristics of the thermal spray deposited coating and the representative 
deposit/coating micro structure schematically (Chattopadhyay 2004). The spray 
pattern developed following thermal spraying is an important parameter to deter- 
mine the properties of coating. The effectiveness of the process is termed as 
deposition efficiency (DE), which is the ratio of the amount of powder used to the 
spraying time and, hence, may be calculated if the parameters are known. The 
deposition efficiency may be categorized as process deposition efficiency and target 
deposition efficiency (Crawmer 2005). The process deposition efficiency may be 
defined as the ratio between the weight of the spray pattern deposited on a big flat 
plate and the weight of powder injected, which is a characteristic of the process and 
is influenced by gun design and settings, the powder properties and the standoff 
distance (distance between the gun and substrate), substrate temperature, and the 
spray angle (which is the angle between the plate and the torch axis). On the other 
hand, the target deposition efficiency depends on the component shape and size and 
on the way the torch movement is programmed (Crawmer 2005). 


Parameters in Thermal Spray Deposition 

The performance of thermal spray deposited coating is controlled by a large number 
of variables (Crawmer 2005). For achieving the best properties in the coating, 
optimum combinations of coating parameters are essential. The overall parameters 
in thermal spray deposition may be categorized into (a) spray parameters, 
(b) manipulation variables, (c) substrate parameters, and (d) coating parameters. 
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Spray Variables 

Spray parameters include the kind of heat source used and the associated parame- 
ters, i.e., fuel mixture composition, current and voltage of arc, gun design (e.g., bore 
geometry and exit diameter); total arc/flame gas flow, that is, primary plus second- 
ary gas flows (fuel-to-oxygen ratio being a significant parameter in flame spray 
processes); and arc gas characteristics. 


Manipulation Variables 

Manipulation variables include standoff distance (the distance from the face of the 
gun to the part being coated), surface speed (the relative velocity between the gun 
and the part), pitch or increment (the distance that the gun moved with each 
subsequent stroke of the gun manipulation or revolution of the part), and angle of 
impingement (the variation of the spray stream from normal or 90° to the surface of 
the part). 


Substrate Variables 

The substrate parameters influencing the quality of coating include the physical and 
thermal properties of the substrates like the thermal properties of materials, surface 
roughness, bond coat parameters, and temperature. 


Coating Variables 

Coating variables include the nature and shape of the precursor materials, particles 
size, shape and distribution, flow behavior, feed properties, and thermal/mechani- 
cal/electrochemical properties of coating. 


Mechanism of Bonding 

The main mechanism of bonding of coating with the substrate is mechanical 
interlocking which offers the bond strength of the coating (Guide to Engineered 
Materials 2001). Mechanical interlocking can play a part in coating adhesion and 
cohesion when the surfaces being coated have features that allow molten material to 
flow into and fill negative relief (cf. Fig. 7) (Guide to Engineered Materials 2001). 
In this case, the bonding between impacting particles and the surface is established 
largely through the impact of particles that flow and solidify around the substrate 
surface asperities. Substrate asperities with negative relief can be formed prior to 
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Fig. 7 Schematic illustration 
of the interface developed 
after the thermal spray 
deposition (NN 2001) 



Table 1 Summary of reported minimum bond strength values for a number of coating technol- 
ogies and materials (Smolka 1985) 


Spray materials 

Processes 

Wire 

flame 

spraying 

Powder 

flame 

spraying 

Arc 

spraying 

Plasma 

spraying 

High-velocity 

oxy-fuel 

spraying 

Ferrous 

14 

28 

41 

>34 

62 

Nonferrous 

21 

21 

>41 

>34 

70 

Self-fluxing alloys 


>69 



62 

Ceramics 


14-34 


>21 


Cermets 


34-48 


55-69 

>83 


coating by grit blasting and other mechanical surface preparation techniques. Apart 
from mechanical interlocking, the bond strength of the coating to the substrate and 
cohesion between consecutive splats is also affected by the residual stresses within 
the coating, melting, and localized alloying at the contact surfaces between parti- 
cles and between the substrate and adjoining particles, diffusion of elemental 
species across splat boundaries, atomic-level attractive forces (van der Waals 
forces), and occurrence of local adiabatic shear instabilities at the particle-substrate 
and particle-particle interfaces due to thermal softening. 

Table 1 gives the summary of reported minimum bond strength values for a 
number of coating technologies and materials (Smolka 1985). The bond strength of 
a coating is measured using a tensile adhesive strength test according to DIN EN 582. 
This test method determines the bond strength of thermal- sprayed coatings under 
tensile stress perpendicular to the interface surface. Traction-adhesive strength tests 
reveal the influence of substrate material, coating material, pretreatment of work 
piece surface, and coating conditions on tensile adhesive strength and, furthermore, 
can control metal spray work (DIN EN 1994; Smith 2004). 
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Defects in Thermal Spray Deposition and Its Remedies/Control 

The typical coating defects in thermal spray deposition include porosities, oxide 
inclusions, and residual stress (Davis 2004). Figure 8 shows the coating defects 
generated in thermal spray deposited coating schematically (Davis 2004). From 
Fig. 8 it is evident that the visible defects usually observed in the thermal spray 
deposited coating are porosities, oxide inclusions, and unmelted particles. 


Porosities 

Porosities are usually created due to trapping of a large number of unmelted or 
resolidified particles in the coating (cf. Fig. 9) and deteriorate wear and corrosion 
resistance property by creating poor coating cohesion (Davis 2004). Porosities may 
also be generated due to the following reasons: (a) material shrinkage on cooling 
from the liquid state; (b) entrapped unmelted, partially melted, or resolidified 
particles, leading to void formation; (c) shadowing from these “unmelts,” creating 


Fig. 8 Defects generated in 
thermal spray deposition 
(Davis 2004) 
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Fig. 9 Genesis of porosity development in thermal spray deposition (Davis 2004) 
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line-of-sight voids that are not filled by the spreading droplets; (d) poor intra-splat 
cohesion, leading to separation of splats; (e) poor wetting onto adjacent surfaces or 
splats that can result from cool or slow-moving particles or surface tension phe- 
nomena; (f) inter-splat or intra-splat cracking; (g) high deposition angles that lead 
to shadowing or “snow drifting”; and (h) porosities inherent to the feedstock 
powder manufacturing process. From Fig. 9 it is evident that liquid-state particles 
flow easily and fill most voids. However, among solid particles, few are reflected 
from the solid surface and may adhere locally to, or become trapped in, increasing 
the roughness of the coatings. These “overspray” particles are not well bonded, nor 
are they in intimate contact with the underlying splat, which creates voids that are 
not open to direct line of sight. As stated before, thermal spray is a line-of-sight 
process; hence, the next arriving particles cannot fill voids adjacent to trapped solid 
particles (Fig. 9). In other cases, even in the same instant, some solid + liquid (i.e., 
partially molten) particles may deform and be completely densified, provided 
sufficient liquid is available to fill any voids that form around the small amount 
of existing solid core. It is also possible that partially melted particles act similar to 
solid particles, leaving trapped voids around their solid portions. 

The angle of impingement of the spray stream is another source of porosity, 
because of “shadowing.” Figure 10 illustrates the type of shadowing generally 
associated with coatings when sprayed at angles below 45° from the optimal 
“normal” angle of incidence (Davis 2004). Coating porosity decreases (i.e., density 
increases) as the angle of spray approaches 90°, that is, normal to the surface being 
coated (Davis 2004). It follows that more advanced coating systems are sprayed 
with tighter tolerances on fixture alignment. Plasma coatings may be sprayed at an 
angle of approximately 15°, whereas low-end coatings may be sprayed at an 
angle of 30°. Figure 10 shows the surface protrusions build up and then shadow 
interstices or voids adjacent to, and behind, the protrusions schematically. These 
protrusions produce even more shadowing, particularly as the angle of incidence is 
lower than 90° . 

Poor splat or particle cohesion also leads to premature coating cracking, 
delamination, or spalling. Open porosities may be interconnected up to the 



deposit direction 


Shadow 
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Fig. 10 Shadowing effect associated with thermal spray deposition (Davis 2004) 
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coating -substrate interface, enabling corroding or oxidizing elements to attack 
the base material. Porosity can thus “short-circuit” the inherent corrosion 
resistance of a coating. For hard-facing or wear-resistant coatings, porosities 
decrease coating hardness and contribute to poor surface finishes, thus decreasing 
wear resistance. 

The porosities present in the thermal spray deposited structure may be controlled 
by controlling the size, shape, and method of powder manufacture, the degree of 
melting of the sprayed droplets, and their angle of impact. The degree of melting 
of particle is further controlled by (a) jet temperature and enthalpy distribution, 
(b) jet/particle heat-transfer effectiveness and jet/gas properties, (c) particle size 
and its distribution, (d) particle morphology, (e) particle thermal properties, 
(f) particle dwell time, (g) particle trajectory distributions, and (h) voltage and 
current of arc. 


Oxide Inclusions 

Most metallic coatings undergo oxidation during thermal spraying by particle/atmo- 
sphere interaction and/or heating of the coating surface during deposition. Longer 
dwell times and higher particle temperatures increase the thickness of the oxide or 
nitride layers on the particles, producing higher concentrations of oxide stringers 
within the coating. The oxide inclusions (stringers) in metallic coatings are generally 
seen as dark, elongated phases that appear as strings in the coating cross section, 
parallel to the substrate As the particles spread on impact, surface films fracture with 
the flowing metal and become a part of the deposit as the droplets solidify into splats. 
However, dispersion of the oxide inclusion may increase the hardness of the coating, 
since oxides have hardness exceeding 1 ,000 DPH. Therefore, the composite mixture 
of metal and metal oxides becomes harder than the metal coating itself. This 
increased hardness can lead to brittle coatings, because oxides fracture easily. A 
higher fraction of oxides may interfere with splat-to-splat cohesion, leading to 
decreased cohesive strength of the coating. However, in some applications oxide 
inclusions may contribute positively as a homogeneous distribution of fine oxides 
may improve wear resistance and also decreases thermal conductivity. However, the 
presence of oxides in coatings may deteriorate the toughness, strength, machinability, 
and corrosion resistance properties of coating. 

Inclusion content in the coating may be reduced by (a) proper inert gas 
shrouding; (b) reducing the average temperature of particles; (c) reducing the 
dwell time of the particles by minimizing spray distances or increasing velocities; 
(d) reducing substrate/coating surface temperature using cooling air jets or increas- 
ing the speed of the thermal spray deposition, thus minimizing oxidation at the 
coating surface; and (e) using the proper feedstock powder particle size. Larger 
particles have a lower surface area to volume ratio, which minimizes overall oxide 
content. 
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Residual Stress in Coating 


Rapid quenching of most materials during thermal spraying is accompanied by 
contraction or shrinkage. As particles strike on the substrate surface, they rapidly 
cool and solidify generating a residual tensile stress within the particle and a 
compressive stress within the surface of the substrate. As the coatings built up, 
the tensile stress in the coating increases, reaching a threshold thickness, where if 
the tensile stresses will exceed that of the bond strength or cohesive strength of 
coating, failure will occur. As a result, thin coating is more durable than thick 
coating. Figure 1 1 illustrates the residual stress generation process schematically. 
The residual stress introduced in coating during thermal spray deposition depends 
on the thermal expansion coefficient of the coating materials, technique employed 
for the development of coating, and density of coating. Materials with a high 
coefficient of thermal expansion like austenitic stainless steels are prone to high 
levels of stress buildup and thus have low thickness limitations. Dense coatings are 
generally more stressed than porous coatings. The deposition techniques using very 
high kinetic energy and low thermal energy (HVOF, HEP, cold spray) can produce 
relatively stress-free coatings though they are extremely dense. This is thought to be 
due to compressive stresses developed due to mechanical deformation (similar to 
shot peening) during particle impact counteracting the tensile shrinkage stresses 
caused by solidification and cooling. 

The residual stress developed in coating may be controlled by a proper 
(a) preheating and (b) postheating operation prior to and after deposition, respec- 
tively. However, optimization of process parameters is essential for achieving best 
results in this regard. 
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Fig. 1 1 Schematic showing 
the genesis of residual stress 
generation during thermal 
spray deposition (Davis 2004) 
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Classification of Thermal Spray Deposition 

Figure 12 summarizes the major classification of thermal spray deposition tech- 
nique (Davis 2004; Walser 2004). Thermal spray deposition technique may be 
classified into four main coating processes: flame spraying, plasma-arc spraying, 
electric-arc spraying, and kinetic spraying with many of their subclasses. Each of 
this process encompasses many more subsets. Each process is having its own 
characteristics and, hence, develops coating of its own characteristics. The coating 
characteristics are defined by its bond strength, thickness, micro structure, hardness, 
compositional homogeneity, porosity, and inclusions content. Appropriate coating 
technique needs to be selected for the desired coating characteristics for the specific 
application. In the present section, a detailed overview of individual technique 
would be presented in details. 


Flame Spray 

Flame spray is a thermal spray deposition process which involves combustible gas 
as a source of heat to melt the coating material (Knight and Smith 1998; Pawlowski 
2008; Heimann 2008). Figure 13 illustrates the flame spray deposition technique 
schematically (Knight and Smith 1998). Flame spray guns are designed based on 



Fig. 12 Classification of thermal spray deposition techniques (Walser 2004) 
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Fig. 13 Schematic illustration of the flame spray deposition technique with (a) powder and 
(b) wire feeder (Knight and Smith 1998) 


the kind of precursor coating materials used for spraying, i.e., in either rod, wire, or 
powder form. The combustible gases used as fuel in flame spraying are acetylene, 
propane, methyl-acetylene-propadiene (MAPP) gas, and hydrogen, along with 
oxygen. Flame temperatures and characteristics depend on the oxygen-to-fuel gas 
ratio and pressure. Figure 13a and b shows the spray gun used for powder and wire 
flame spraying process (Knight and Smith 1998). The design of spray gun used for 
rod and wire guns is similar. However, it is necessary to change the nozzle and/or 
air cap to adapt the gun to different alloys, wire sizes, or gases. 

The advantages of flame spray process include a low equipment and processing 
costs, high efficiency, low maintenance, and easy to operate. However, relatively 
poor bond strength, higher porosities, and a narrower working temperature range as 
compared to other thermal spray deposition techniques are the disadvantages 
associated with flame spray deposition process. The coating thickness achieved 
by flame spray deposition can vary from 100 to 2,500 mm for single-pass and multi- 
pass coating, respectively. The porosity content varies from 10 % to 20 %. The 
bond strength determined using tensile adhesion test usually varies from 15 to 
30 MPa for metals and alloys. 

Flame spray deposition is a popular technique for repairing of worn-out parts. 
Commonly used materials for this application include nickel-based alloy, monel, 
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Fig. 14 Schematic of wire- 
arc spraying process (Knight 
and Smith 1998) 


Insulated 



brass, etc. Zinc coating is commonly applied for corrosion resistance application by 
flame spraying in large structures. 


The Electric-Arc (Wire-Arc) Spray 

The electric-arc (wire-arc) spray process uses precursor coating in the form of wire, 
and heating/melting occurs by generating arc between them at the interaction. 
Spraying is done by atomization of molten metal and propelling it onto the substrate 
by a stream of compressed air. Figure 14 shows the schematic of wire-arc spraying 
process (Knight and Smith 1998). The advantages of electric-arc spraying over 
flame spray process include a lower operating cost; higher bond strengths, in excess 
of 69 MPa (10,000 psi); a higher deposition rate of up to 55 kg/h (120 lb/h) and 
deposition efficiency; and lower heating of the substrate and cleaner process. The 
thickness of the coatings is in the range 100-2,000 mm and the porosity content in 
the coating ranges from 10 % to 20 %. The materials used for coating by wire-arc 
spraying are relatively ductile, electrically conductive wire of pure metal or alloys 
of about 1.5 mm (0.060 in.) in diameter. However, the recent development of cored 
wires permits the deposition of some composite coatings containing carbides or 
oxides. Electric-arc coatings are widely used in high-volume, low-cost applications 
such as deposition of zinc and aluminum for corrosion resistance application or 
composite coatings for wear resistance application. 


Plasma Spraying 

Figure 15 schematically describes the plasma spray deposition technique (Knight 
and Smith 1998). In plasma spray deposition, an electric arc is initiated between a 
tungsten cathode and a water-cooled copper anode using a high-frequency dis- 
charge and then sustained using dc power, and a gas (usually argon, but may be any 
other gas including nitrogen, hydrogen, or helium) is allowed to flow to generate a 
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high-pressure gas plasma with a resulting increase in gas temperature, which may 
exceed 30,000 °C, which in turn increases the volume of the gas and, hence, its 
pressure and velocity as it exits the nozzle. Argon is usually chosen for plasma 
generation because of its chemical inertness and ionization characteristics. The 
enthalpy of the gas can be increased by adding the diatomic gases, like hydrogen 
or nitrogen. Power levels in plasma spray torches are usually in the range of 
30-80 kW, but they can be as high as 120 kW. 

During plasma spray deposition, powder is introduced into the gas stream either 
just outside the torch or in the diverging exit region of the nozzle (anode), heated and 
accelerated by the high-temperature, high-velocity plasma gas stream. Torch design 
and operating parameters are critical in determining the temperature and velocity 
achieved by the powder particles. The operating parameters include the gas used for 
generation of plasma, power level of arc used to generate plasma, powder feed rate, 
carrier gas flow, distance from the torch to or from the substrate (standoff distance), 
and the angle of deposition. The size and morphology of powder particles strongly 
influence their rate of heating and acceleration and, hence, the efficiency of deposi- 
tion and quality of the coating. The powder velocity in plasma spray deposition 
usually varies from 300 to 550 m/s. The density of plasma spray coatings is usually 
much higher than that of flame spray coatings and is typically in the range of 
80-95 %. The thickness of plasma spray deposited coating usually ranges from 
about 0.05 to 0.50 mm (0.002-0.020 in.) but may be much thicker for some 
applications. Bond strength may vary from 34 MPa (5,000 psi) to 69 MPa 
(10,000 psi). The bond strength of plasma- sprayed ceramics on metallic substrates 
is in the range 15-25 MPa. Plasma- sprayed bonding alloys (NiAl or NiCrAl) or 
metals (Mo) may reach a strength of 70 MPa or even higher. Generally, a “perfect” 
plasma system should satisfy at least the following major requirements like (a) long- 
term stability of plasma parameters, (b) minimum or no pulsing of plasma parame- 
ters, (c) minimum erosion of electrodes, (d) consistent powder feeding and injection, 
(e) high deposition efficiency, (f) flexibility in operation and (g) a lower investment 
and maintenance costs. The disadvantages of plasma spray include a higher installa- 
tion cost, operating costs and complexity in operation. Furthermore, a long-term 
stability of plasma parameters is needed during production providing long-term 
reproducibility of coating properties and keeping constant deposition efficiency. 
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composite deposit 

Fig. 16 Schematic of plasma spray deposition using reactive precursors (Heimann 2008) 


Plasma spray deposition is mostly applied to develop coatings for wear, erosion, 
corrosion, and high-temperature oxidation resistance application. The substrate 
may be of any size, shape and geometry. Materials for deposition may be metallic 
or ceramic. A polymer dispersion aerosol can be injected into the plasma discharge 
to create a grafting of this polymer on to a substrate surface to modify the surface 
chemistry of polymer (Leroux et al. 2008). 


Reactive Plasma Spraying 

Reactive plasma spraying, an evolution of conventional plasma spraying is a 
viable method for producing a wide range of advanced materials and it allows 
reactive precursors (may be liquids, gases, or mixtures of solid reactants) to be 
injected into the particulate and/or hot gas streams. These reactive precursors 
may be liquids, gases, or mixtures of solid reactants (cf. Fig. 16). On contact with 
the high-temperature plasma jet, they decompose or dissociate to form highly 
reactive and ionic species that can then react with other heated materials within 
the plasma jet to form new compounds. Figure 16 schematically describes the 
reactive plasma spray deposition technique (Heimann 2008). The technique may 
be applied for the development of carbides such as TiC and WC, or under 
certain conditions, diamond or diamond like carbon (DLC) films. The primary 
requirements are that the precursors dissociate into reactive species and that 
the reaction times and temperatures are sufficiently long for the desired products 
or phases to form. 
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The Transferred Plasma-Arc Process 

Transferred plasma-arc process is a technique where a secondary arc current is 
established through the plasma and substrate that controls surface melting and 
depth of penetration by inducing the possibility of substrate heating during opera- 
tion. Figure 17 shows the schematic of transferred plasma-arc welding process 
(Knight and Smith 1998). The advantages associated with transferred plasma-arc 
coating involve an improved metallurgical bonding, high-density coatings, high 
deposition rates, and high thicknesses per pass. Coating thicknesses of 
0.50-6.35 mm (0.020-0.250 in.) and widths up to 32 mm (1.25 in.) can be achieved 
in a single pass at powder feed rates of 9 kg/h (20 lb/h). The process is more 
efficient as much less electrical power is required than with nontransferred arc 
processes. The method of heating and heat transfer in the transferred plasma-arc 
process eliminates many of the problems related to using powders with wide 
particle-size distributions or large particle sizes. Larger-particle-size powders, for 
example, in the 50-mesh range, tend to be less expensive than closely classified 
325 -mesh powders. Some limitations of the process include an alteration of sub- 
strate microstructure during coating. As a result, the applications of the coating are 
limited to substrates that are electrically conductive and can withstand some 
melting. The transferred plasma-arc process is used in hard-facing applications 
such as valve seats, plowshares, oil field components, and mining machinery. 


High-Velocity Oxy-Fuel Flame (HVOF) 

High-velocity oxy-fuel (HVOF) devices are a subset of flame spraying which, 
however, utilizes confined combustion and an extended nozzle to heat and accel- 


Fig. 17 Schematic of 
transferred plasma- arc 
welding process (Knight and 
Smith 1998) 
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Fig. 18 Schematic of a high-velocity oxy-fuel (HVOF) coating device (Knight and Smith 1998) 


erate the powdered coating material. Typical HVOF devices operate at hypersonic 
gas velocities, i.e., greater than MACH 5. The extreme velocities provide kinetic 
energy which help in producing coatings that are very dense and very well adhered 
in the as-sprayed condition. 

Figure 18 shows the schematic of a high-velocity oxy-fuel (HVOF) device 
(Knight and Smith 1998). The fuels used for HVOF spraying are usually propane, 
propylene, MAPP, or hydrogen and is mixed with oxygen and burned in a chamber. 
In other cases, liquid kerosene may be used as a fuel and air as the oxidizer. The 
products of the combustion are allowed to expand through a nozzle, where the gas 
velocities may become supersonic. Powder is introduced, usually axially, in the 
nozzle and is heated and accelerated. The powder is usually fully or partially melted 
and achieves velocities of up to about 550 m/s. Because the powder is exposed to 
the products of combustion, they may be melted in either an oxidizing or reducing 
environment, and significant oxidation of metallics and carbides is possible. With 
appropriate equipment, operating parameters, and choice of powder, coatings with 
high density and with bond strengths frequently exceeding 69 MPa (10,000 psi) can 
be achieved. Coating thicknesses are usually in the range of 0.05-0.50 mm 
(0.002-0.020 in.), but substantially thicker coatings can occasionally be used 
when necessary with some materials. HVOF processes can produce coatings of 
virtually any metallic or cermet material and, for some HVOF processes, most 
ceramics. Those few HVOF systems that use acetylene as a fuel are necessary to 
apply the highest-melting-point ceramics such as zirconia or some carbides. HVOF 
coatings have primarily been used for wear resistance to date, but their field of 
applications is expanding. 

HVOF is most commonly used to produce wear-resistant coatings such as 
cermets (ceramic and metal mixes) like tungsten-carbide cobalt. Coatings of this 
type have wear resistance similar to sintered carbide materials. Because HVOF 
produces very dense coatings (porosity levels typically less than 0.5 %), it can be 
used to produce very good corrosion-resistant coatings made from materials such as 
Inconel®, Stellite®, stainless steel, and ceramics. Some unique coatings produced 
by TST using HVOF technology are ultrahigh-density ceramics that provide 
excellent dielectric strength. 
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Detonation-Gun Spraying 

Figure 19 shows the schematic of a detonation-gun process, where a mixture of 
oxygen and acetylene, along with a pulse of powder, is introduced into a barrel and 
detonated using a spark (Knight and Smith 1998). The high-temperature, high- 
pressure detonation wave moving down the barrel heats the powder particles to 
their melting points or above and accelerates them to a velocity of about 
750 m/s. By changing the fuel gas and some other parameters, the Super D-Gun 
process achieves velocities of about 1,000 m/s. Detonation-gun spraying is a cyclic 
process, and after each detonation the barrel is purged with nitrogen and the process 
is repeated at up to about 10 times per second. Instead of a continuous layer of 
coating as in the other thermal spray processes, a circle of coating about 25 mm 
(1 in.) in diameter and a few micrometers thick is deposited with each detonation. A 
uniform coating thickness on the part is achieved by precisely overlapping the 
circles of coating in many layers. Typical coating thicknesses are in the range of 
0.05-0.50 mm (0.002-0.02 in.), but thinner and much thicker coatings can be used. 

The detonation-gun coatings have some of the highest bond strengths (usually 
exceeding the epoxy strength of the test, i.e., 69 MPa) and lowest porosities (usually 
less than 2 %) which made the process a benchmark against which the other 
coatings have been measured for years. For metallic coating shrouding atmosphere 
should be carefully maintained to avoid oxidation. The extremely high velocities 
and consequent kinetic energy of the particles in the D-Gun process make the 
process flexible for coating of any materials to be deposited with residual compres- 
sive stress, rather than tensile stress as is typical of most of the other thermal spray 
coatings. This is particularly important relative to coating thickness limitations and 
the effect of the coating on the fatigue properties of the substrate. 

Virtually all metallic, ceramic, and cermet materials can be deposited using 
detonation-gun deposition. Detonation-gun coatings are used extensively for wear 
and corrosion resistance as well as for many other types of applications. They are 
frequently specified for the most demanding applications but often can be also the 
most economical choice because of their long life. 
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Fig. 19 Schematic of a detonation-gun process (Knight and Smith 1998) 
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Cold Spray Coating Process 

Cold spray (CS) is a process where powder particles are accelerated by the 
supersonic gas jet at a temperature that is always lower than the melting point of 
the material, resulting in coating formation from particle in the solid state (Singh 
et al. 2012). Cold spraying is also traditionally known by different names like cold 
gas dynamic spraying, kinetic spraying, high-velocity particle consolidation 
(HVPC), high-velocity powder deposition, and supersonic particle/powder deposi- 
tion (SPD) (Karthikeyan 2004). In cold spray deposition, a high-velocity 
(300-1,200 m/s) gas jet is developed using a converging/diverging nozzle to 
accelerate powder particles and spray them onto a substrate. The carrier gas used 
in spraying may be compressed air, helium, or nitrogen, mixture of He and N 2 and 
dry air (79 % N 2 , 21 % 0 2 ). Usually the carrier gases have good aerodynamic 
properties. During the process, the gas accelerates particles at a pressure of 
1-3 MPa. The particle size of precursor powder ranges between 2 and 150 mm. 
The distance between the substrate and the nozzle is approximately 25 mm. The 
particles are heated by HVOF torch and enter the spray nozzle to increase the 
velocity and ductility which allow an easy buildup of coating. As the particles 
impact on the surface, they bond plastically and deposit on substrates. The coated 
particles are plastically deformed on impact due to the high kinetic energy of the 
particles and form splats. As coating is carried out at low temperature, the kinetic 
energy of the particles is responsible for the bonding of coating with substrate. The 
advantages of cold spraying over conventional thermal spray deposition include 
minimization of high-temperature oxidation, retention of chemistry and microstruc- 
ture of coating, minimum loss of coating materials due to melting and evaporation, 
reduced residual stress in coating, minimum distortion of substrate, development of 
high density, high hardness cold-worked structure, and an increased operation 
safety because of the absence of high-temperature gas jet, radiation, and explosive 
gases. The process may be applied for the coating of materials with metastable 
micro structure including amorphous and nano structured coatings. 

The footprint of the cold spray beam is very narrow typically around 5 mm 
diameter due to small size of the nozzle (10-15 mnr) and spray distance 
(5-25 mm), yielding a high-density particle beam, which results in precise control 
on the area of deposition over the substrate surface. This process is similar to a 
micro shot peening, and hence, the coatings are produced with compressive 
stresses, rather than tensile stresses, which results in dense and ultra-thick 
(5-50 mm) coatings without adhesion failure. The low-temperature formation of 
coating leads to oxides and other inclusion-free coatings with wrought-like micro- 
structure (Choudhuri et al. 2008). 

Figure 20 shows the schematic of cold spray deposition process (Tinashe 
Sanyangare 2010). The main components of CS system include (a) powder feeder 
(powders used are in the range of 1-50 pm in diameter); (b) source of a compressed 
gas; (c) gas heater to preheat the gas, to compensate for the cooling due to rapid 
expansion in nozzle; (d) supersonic nozzle; (e) spraying chamber with a motion 
system; and (f) system for monitoring and controlling spray parameters. 
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Fig. 20 Schematic illustration of the cold spray deposition process (Tinashe Sanyangare 2010) 


Dr. Antolli Papyrin and colleagues at the Russian Academy of Sciences were the 
first to demonstrate the cold spray process in the mid-1980s. The cold spray process 
uses the energy stored in high-pressure compressed gas to propel fine powder 
particles at very high velocities (500-1,500 m/s). Compressed gas (usually helium) 
is fed via a heating unit to the gun where the gas exits through a specially designed 
nozzle at a very high velocity. Compressed gas is also fed via a high-pressure 
powder feeder to introduce powder material into the high-velocity gas jet. The 
powder particles are accelerated and moderately heated to a certain velocity and 
temperature where, on impact with a substrate, they deform and bond to form a 
coating. The parameters influencing the quality of coating include particle size, 
density, temperature, and velocity. 

Cold spray technique has a wide scope of application in development of high- 
performance coating for structural application. Typical examples of coating include 
the following. 


Development of Coating for Wear, Corrosion, and 
High-Temperature Oxidation Resistance 

Cold spraying has been successfully applied to improve erosion and corrosion 
resistance of components in marine ship propeller shaft, for development of 
antiseizure coating in the screws at oil well tubing and for increasing the erosion 
and high-temperature corrosion resistance for components in power generation 
plant, wear resistance and cavitation wear resistance of turbine blades, and the 
water pump housing, impeller fins, impeller seal section, and wear rings (Singh 
et al. 2012). The materials used for wear resistance application include tungsten 
carbide, chromium oxide, and Ni-Cr coatings (Singh et al. 2012). It is also an 
effective route for the development of antifriction coatings like Sn, Pb-Sn, and 
Al-Sn alloys to be used as bearing materials in automobile and ship building 
industry. Al-5Sn and Al-lOSn coatings with low porosity and good bonding were 
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successfully developed by high-pressure cold spraying with nitrogen and by 
low-pressure cold spray using helium as carrier gas, respectively (Singh 
et al. 2012). Cold spray is also a useful technique for the development of CuCrAl 
and NiCrAlY coatings with high-temperature oxidation resistance and high- 
temperature creep and fatigue properties for rocket engine liner applications 
(Singh et al. 2012). 


Repairing and Refurbishing 

Repairing and refurbishing is equally important in reducing the cost of material loss 
due to wear in medical, aerospace, electronics, automotive, and petrochemical 
industries. Cold spraying is a versatile technique which is used to remove defects 
created at the surface of rolls during production of paper and polyethylene film, 
repair of the casting defects and machining defects, repair and retrieval of parts and 
plate stocks used in aircraft structures, repair/refurbishment of casings in gas 
turbines, shape restoration, and defect elimination at small automotive workshops, 
like repair of vehicle platforms (Singh et al. 2012). 


Coating for Functional Application 

Cold spraying can also be used for the fabrication of complex conductive patterns in 
solar cells, to enhance surface performance in components made of advanced 
polymer-matrix composites in wind power generation. 


Development of Coating for Bio-implant Application 

Cold spray technique has been successfully demonstrated to the development of 
hydroxyapatite (HAP) (Ca 10 (PO 4 ) 6 (OH) 2 ) coating to be used in dental and ortho- 
pedic implants. Though it may be applied by plasma spray technique, dissociation 
of structure was observed which may be eliminated when used cold spray coating 
(Singh et al. 2012). 


Freeform Fabrication of Components 

Cold spray deposition may be used for fabrication of components with metastable/ 
novel micro structure with improved properties and seamless components. It is also 
used for spraying of copper alloys on “touch” surfaces which are frequently in 
human contact, such as door knobs, light switches, faucets, bed rails, food prepa- 
ration areas, and other hardware (Singh et al. 2012). 
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Mechanism of Cold Spray Coating 

The true bonding mechanism in cold spray process is poorly understood (Ghelichi 
and Guagliano 2009; Dickinson and Yamada 2010). It is, however, accepted that 
during impact, the solid particles undergo plastic deformation and disrupt thin 
surface films (oxides), and in turn, intimate conformal contact is achieved and 
combined with high contact pressure, promoting bonding with the target surface 
(Lima and Marple 2007; Lima et al. 2002a). The common phenomena that have 
been observed during spraying onto various substrates are substrate and particle 
deformation and substrate melting as there is evidence for the formation of a metal 
jet (Ghelichi and Guagliano 2009; Champagne et al. 2005). Figure 21 shows the 
mechanism of bonding of copper sphere of 20 mm diameter impacting on alumi- 
num plate at a speed of 650 m/s (Champagne et al. 2005). From Fig. 21 it is evident 
that the material which is adjacent to the interface behaves as a viscous fluidlike, 
resulting in the formation of interfacial waves, roll-ups, and vortices (Champagne 

et al. 2005). 

It has been suggested that the adhesion strength of the particles is dependent on 
their kinetic energy at impact, which is typically much less than the energy required 
to melt the particle, and hence, cold spray is a solid-state process (Dickinson and 
Yamada 2010; Lima and Marple 2007). It is also accepted that a minimum particle 
velocity is necessary to achieve deposition, because sufficient kinetic energy must 
be available to plastically deform the solid material (Lima and Marple 2007 ; Lima 
et al. 2002a). An empirical model by Champagne et al. (2005) shows that interfacial 
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Fig. 21 Schematic showing the mechanism of bonding of copper sphere impacting on aluminum 
plate during cold spraying (Champagne et al. 2005) 
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intermixing depends on the substrate hardness, coating material density, and the 
particle velocity (m/s) Vp needed for the attainment of interfacial mixing, as 
Vp = [(7.5 x 104)(B/p)]0.5, where B is the substrate Brinell hardness number 

Q 

and p is the particle density (kg/m ). Lupoi and Neill (2010) examined that during 
cold spray process, Cu particles create more erosion of substrate as compared to A1 
and Sn, due to their low specific weight as compared to Cu. 


Particle Velocity and Critical Velocity 

It has been widely accepted that particle velocity (Vp) prior to impact is one of the 
most important parameters in cold spraying which determines whether deposition 
of a particle or erosion of a substrate occurs on the impact of a spray particle. 
Generally, for a given material, there exists a minimum particle velocity commonly 
known as critical velocity (Vc) which must be achieved for transition from erosion 
of the substrate to deposition of the particle. Only those particles achieving a 
velocity higher than the critical one can be deposited to produce a coating 
(Karthikeyan 2004; Ghelichi and Guagliano 2009; Li and Li 2003). The cold 
spraying process consists of three main stages. Figure 22 summarizes the stages 
of cold spray deposition process schematically (Singh et al. 2012). The 1st stage 
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Fig. 22 Stages of cold spray deposition process schematically (Singh et al. 2012) 
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refers to the induction time or delay time which is defined as the time between the 
beginning of surface treatment by the flow of particles and the beginning of particle 
attachment to the surface (Ghelichi and Guagliano 2009). If the particle 
velocity (Vp) is less than (<) the critical velocity (Vc), particles simply reflect 
(bounce) off the surface (Karthikeyan 2004). On the other hand, if the particle 
velocity (Vp) is equal to the critical velocity (Vc), Vp = Vc, then solid particle 
erosion of the surface occurs without any deposition. Vc depends upon the combi- 
nations of spray materials and the substrates (Li et al. 2010). In the second stage 
particles plastically deform and adhere to the substrate, and a first thin layer of the 
particle material is formed (Ghelichi and Guagliano 2009). Finally, in the third 
stage, the particles interact with the surface formed by previously incident particles 
in second stage. It is a buildup stage, characterized by the growing thickness of 
the coating layer (Davis 2004). Coating builds up only if Vp less than up-limit 
velocity. With most spray materials the up-limit velocity is higher than 1,000 m/s 
(Li et al. 2010). 

The critical particle velocity, as reported by many authors (Karthikeyan 2004; 
Ghelichi and Guagliano 2009; Li and Li 2003; Li et al. 2010; Li et al. 2005), 
changes with the spray material, approximately 560-580, 620-640, 620-640, and 
680-700 m/s, for Cu, Fe, Ni, and Al, respectively. However, a variation in critical 
velocity is also reported in Cu (Champagne et al. 2005; Li et al. 2006). This 
variation of Vc is due to its dependence on many factors mostly on the thermome- 
chanical properties of the substrate and the spray powder material, and the particle 
velocity Vp is a function of the spray conditions, including physical properties or 
nature of the driving gas, its operating temperature and pressure, and nozzle design 
of the spray gun and material properties, such as particle diameter, size distribution 
of particles in powder, and density and morphology of powder (Karthikeyan 2004; 
Ghelichi and Guagliano 2009; Li and Li 2003; Li et al. 2005, 2010). 


Post Coating Operation 

Due to the presence of porosities in the coating developed by thermal spray 
deposition, coatings are often sealed with organic materials that penetrate and fill 
the pores (Walser 2004). Sealants may be applied by brushing, spraying, and 
dipping and should have a low viscosity to penetrate the coating to a satisfactory 
depth. The sealant is then cured, which effectively creates a barrier for penetration 
of unwanted materials. The curing parameters (time and temperature) depend on 
the kind of sealant used for curing. Due to the application of sealant, the coating 
quality is improved due to (a) improved corrosion resistance by restricting the 
corrosive species (liquids and gases) from penetrating the coating and attacking the 
substrate, (b) enhanced interparticle cohesion, and (c) inducing hydrophobicity on 
the surface. Table 2 summarizes the corrosion sealant used in sealing of thermal 
spray deposition (Walser 2004). 
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Table 2 Commonly 
applied sealant used for 
sealing of thermal spray 
coating. Summary of the 
corrosion sealants used in 
thermal spray deposition 
(Walser 2004) 


Materials 

Characteristics 

Paints 

Water or solvent solubles 

Waxes 

Low-temperature melts 

Phenolic s 

Air or heat cure 

Epoxy phenolic s 

Air or heat cure 

Epoxy resins 

One part catalyst 

Polyesters 

Air or heat cure or one part catalyst 

Silicones 

Heat cure 

Polyurethanes 

Air dry or one part catalyst 

Linseed oil 

Air dry 

Polymides 

Heat cure 

Coal tars 

Air dry 

Anaerobic s 

Core in absence of air 


Coating Characterization and Testing 

Followed by coating, a detailed characterization of coating in terms of microstruc- 
ture, composition, and phases is usually undertaken by optical, scanning, and 
transmission electron microscopies, X-ray diffraction technique, and energy- 
dispersive spectroscopic analysis (Walser 2004). Specifically, for the thermal 
barrier coating, the following parameters need to be measured. 


Measuring Deposition Efficiency 

Deposition efficiency (DE) is the measure of the percentage of particles introduced 
into a spray jet that actually deposits onto a flat substrate. Deposition efficiency may 
be determined from the percentage weight gain due to the deposition of materials 
with a specific spray technique for a fixed powder feed rate. It may also be 
determined by sampling from the coating and, subsequently, a detailed microstruc- 
tural study of the cross section of coating. Alternatively, the thickness can also be 
measured by ultrasonic thickness monitor gage. 


Bond Strength of Thermal Spray Coatings 

The coating must adhere to the surface so as to perform under loading (Walter 
2005). Adhesion is a property of major concern for thermal spray coatings so as to 
achieve improved performance. The standard coating/fixture assembly used for 
tensile adhesion testing of thermal spray coatings is adapted from ASTM C 
633-01. For adhesion testing, a coating is applied on to a 25 mm (1 in.) diameter 
cylindrical bar followed by which an adhesive bonding agent is applied to the 
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surface and subjected to curing at temperatures ranging from 120 °C to 175 °C. 
After curing, the coating is subjected to increasing tensile load until failure. 

From the test, the force required to separate a thermal spray coating from its 
substrate is determined. Depending on its bond strength, the coating can fail at any 
of several locations including the coating-substrate interface, within the coating, 
and at the interface between the coating and the bonding agent. For some coatings 
(such as HVOF WC-Co), the adhesive strength of the coating is typically greater 
than the strength of the bonding agent. The two most common bonding agents are 
FM-1000 and EC-2214. FM-1000 comes in the form of a wafer, while EC-2214 is a 
liquid. 


Rolling-Contact Fatigue Failure Modes in Thermal Spray Coatings 

Rolling-contact fatigue (RCF) behavior of thermal spray deposited coating is 
measured by rolling-contact fatigue testing. Figure 23 shows the classification of 
the modes of failure by rolling-contact fatigue failure (Ahmed 2002). From Fig. 23 
it may be noted that the rolling-contact fatigue failure of TS coatings is generally 
categorized in four main modes, i.e., abrasion, delamination, (cc) bulk failure, and 
(d) spalling (M1-M4), as indicated in the diagram. The characteristics of each mode 
and failure type are summarized in Fig. 23 (Ahmed 2002). 


Residual Stress Determination in Thermal Spray Coatings 

Residual stress in the coating is the main problem associated with thermal spray 
deposition. Residual stress developed in thermal spray deposited surface is mea- 
sured by hole-drilling method. During the test, a shallow hole (to a hole diameter 
ranging from 0.8 to 5.0 mm) is drilled through the test specimen to a depth 
approximately equal to the hole diameter. The creation of the hole redistributes 
the stresses in the material surrounding the hole which is measured by a specially 
designed three-element strain-gage rosette. The in-plane residual stresses that 
originally existed at the hole location can then be calculated from the measured 
strain reliefs using the method described in ASTM E 837, “Measurements of 
Residual Stresses by Hole-Drilling Strain-Gage Method” (Walter 2005). 


Present Status and Scope of Application of Thermal Spray 
Coatings 

Among all thermal spraying techniques, flame spraying and wire -arc spraying are 
the most popular processes for the development of coating employed for wear and 
corrosion resistance purpose, due to their low capital cost and flexibility of opera- 
tion. Research and development (R&D) activities on thermal spray coatings are 
carried out by different groups in universities and research institutions which are 
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Failure mode 


Failure mechanism 
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initiation 

Crack propagation due to cyclic 
loading 
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(>millimetre in dimension) 


Macropitting/spallat 

ion 


Fig. 23 


Classification of the modes of failure by rolling-contact fatigue failure (Ahmed 2002) 


supported by government departments or industries worldwide. In India, the 
departments which support fundamental research on thermal spray deposition 
techniques are the Department of Science and Technology, Defense Research and 
Development Organization, Department of Atomic Energy, etc. The coating devel- 
opment activities, employing mostly plasma spraying, span a wide range of mate- 
rials from traditional protective materials to those with added functions. Studies for 
the development of high-performance thermal barrier coating (TBCs) have been 
recently performed in many national institutes and universities. The functional 
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coatings include different bioactive coatings (HA, Ti, etc.), photocatalytic coatings 
(Ti0 2 ), electrolyte (YSZ), and catalytic coatings (LSM, YSZ-Ni) for the compo- 
nents used in solid oxide fuel cells (SOFCs) and dye-sensitized solar cells. More- 
over, nano structured coating development is primarily focused on nano structured 
cermets (e.g., WC-Co), nanoceramics (e.g., YSZ, Al 2 0 3 -Ti0 2 , Ti0 2 ), and nano- 
structured nickel-based alloys. Additionally, cold spraying is emerging in the field 
of thermal spray R&D. 


Coating Development Activities 

Silicon coating was deposited on titanium alloy substrates by vacuum plasma 
spraying technology. The morphologies and phase composition of the coatings 
were analyzed by field-emission scanning electron microscopy and X-ray diffrac- 
tion. The thermal expansion coefficient of silicon coating was measured to be about 

f\ 1 

3.70 x 10 K . The bond strength of coating was approximately 20.6 MPa. The 
density, open porosity, roughness, and Young’s modulus of silicon coating were 
also measured. The as-sprayed silicon coating was treated by de-ionized water at 
60 °C, 80 °C, and 100 °C and soaked in simulated body fluids to evaluate its 
bioactivity. The results showed that the water-treated coating could induce apatite 
to precipitate on its surface in simulated body fluid, indicating that the bioactivity of 
silicon coating was improved. The increase of temperature and duration of water 
treatment had a positive effect on the bioactivity of silicon coatings. 


Development of Wear and Corrosion Resistance Coating 

One of the most important uses of thermal spray coatings is for wear resistance 
improvement to resist virtually all forms of wear, including abrasive, erosive, and 
adhesive, in virtually every type of industry. The materials used range from soft 
metals to hard metal alloys to carbide-based cermets to oxides. The coating 
materials usually used for wear resistance application with a good bond strength 
with the substrate include white irons (alloys irons with a hardness of about 
60 HRC); nickel/chromium/boron alloys (these materials are similar to the 
nickel-based hard-facing consumables hardness in the range of 35-60 HRC); 
cobalt-based alloy (similar to the cobalt-based hard-facing consumables; hard- 
facing in the range of 45-60 HRC); and composite materials (nickels or similar 
matrixes containing up to 50 % carbides). The bond strength of coating with the 
substrate is enhanced by boron addition which makes these materials self-fluxing. 
The white iron has abrasion resistance similar to the iron/chromium hard-facing 
consumables, and the cobalt- and nickel-based alloys have wear properties similar 
to the hard- facing alloys that have comparable composition and hardness. The 
composite alloys are somewhat different than the composite welding consumables 
in that carbide addition must be designed so that they do not concentrate at the 
coating -substrate interface during centrifugal casting. Since tungsten carbides are 



2838 


J.D. Majumdar 


much heavier than the matrix metals, the centrifugal force of the casting alloys can 
make the carbides concentrate at the lining/substrate interface instead of at the ID of 
the cylinder, where they are needed for wear resistance. This problem is sometimes 
overcome by using mixed carbides; titanium carbides are lighter than the matrix 
metal and will not migrate to the lining interface. 

The largest areas of application of thermal spraying technologies are in the steel 
industry for coating steel sheet processing rolls for improving wear resistance, 
corrosion resistance, gripping properties, and thermal-shock resistance. One com- 
mon example of roll coatings includes cermet coating (WC-12Co) by HVOF 
spraying applied to rolls in a continuous galvanizing line. Because of an increased 
productivity and demand, thermal spraying are mostly utilized for (1) anti-buildup 
sprayed coatings for hearth rolls in continuous annealing line and (2) coatings 
supporting the service life enhancement for parts used in the molten metal bath in 
continuous galvanizing lines. 

A significant improvement in wear resistance was reported on AISI 304 stainless 
steel by HVOF spraying of Ni-based self-fluxing alloy (Ni, 68.4 wt.%; Cr, 17 wt.%; 
B, 3.9 wt.%; Si, 4.9 wt.%; and Fe, 5.8 wt.%) (Sharma and Dutta Majumdar 2013). 
Deposition was conducted on sandblasted AISI 304 stainless steel substrate 
using Ni-based alloy (Ni, 68.4 wt.%; Cr, 17 wt.%; B, 3.9 wt.%; Si, 4.9 wt.%; and 
Fe, 5.8 wt.%) of particle size 45-60 mm as precursor powder under optimum 
process parameters. Figure 24 shows the scanning electron micrograph of the top 
surface of NiCrBSi coating developed on AISI 304 stainless steel by HVOF 
spraying technique (Sharma and Dutta Majumdar 2013). From Fig. 24 it is evident 
that the top surface of the coating consists of very fine solidified droplets strongly 
adhered to the surface with a low area fraction of microporosities (1-2 %, shown by 
arrowheads). Due to a very high velocity of carrier gas (compressed air) during 
HVOF spraying, the droplets dimensions were smaller with a very smooth surface. 
The coating density increased due to combined influence of high speed of impact and 
tiny droplets dimension. The X-ray diffraction (XRD) profile of the coating confirmed 
the presence of Ni 3 B and Cr 2 B and gamma-nickel (g-Ni) matrix. A detailed study of 


Fig. 24 Scanning electron 
micrograph of the top surface 
of NiCrBSi coating 
developed on AISI 
304 stainless steel by HVOF 
spraying technique (Sharma 
and Dutta Majumdar 2013) 
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Fig. 25 Kinetics of wear in 
terms of cumulative depth of 
wear as a function of time for 
(7) as-received and (2) HVOF 
sprayed NiCrBSi coated AISI 
304 stainless steel against WC 
ball under fretting wear 
condition (with a stroke 
length of 1 mm and frequency 
is 10 Hz) at an applied load of 
(a) 10 N and (b) 20 N 
(Sharma and Dutta Maiumdar 

2013) 




Time (Sec.) 



microhardness shows that the average microhardness of the coating is significantly 
improved to 900 VHN as compared to that of as-received substrate. Furthermore, the 
microhardness decreases with depth. The increase in microhardness of the coating is 
attributed to the presence of nanostructured borides (of Cr 2 B and Ni 3 B) and refine- 
ment of y-Ni matrix. The marginal decrease in microhardness with depth is attributed 
to coarsening of the microstmcture (both the precipitates) with depth. 

Figure 25a and b shows the kinetics of wear in terms of cumulative depth of wear 
as a function of time for (1) as-received and (2) HVOF sprayed NiCrBSi coated 
AISI 304 stainless steel against WC ball with a diameter of 5.2 mm, under fretting 
wear condition (with a stroke length of 1 mm and frequency is 10 Hz) at an applied 
load of (a) 10N and (b) 20 N (Sharma and Dutta Majumdar 2013). From Fig. 25 it is 
evident that there is a significant decrease in kinetics and magnitude of wear in 
HVOF sprayed surface (plot 2) as compared to the as-received AISI 304 stainless 
steel (plot 1). The significant decrease in wear kinetics in HVOF sprayed surface is 
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Fig. 26 Variation of 
coefficient of friction with 
time for ( 1 ) as-received and 
(2) HVOF sprayed NiCrBSi 
coated AISI 304 stainless 
steel against WC ball under 
fretting wear condition (with 
a stroke length of 1 mm and 
frequency is 10 Hz) at an 
applied load of 10 N (Sharma 
and Dutta Majumdar 2013) 



due to presence of nano-sized borides and, hence, improvement in hardness of the 
surface. A close look of the wear behavior shows that wear rate during initial stage 
of wear is faster following which it decreases in both the as-received and the coated 
surface. The initial high rate of wear is due to removal of materials from the surface 
by the abrasive action of WC surface. Gradually, the accumulated worn-out parti- 
cles at the interface change the mechanism of wear from two-body abrasive to 
three-body wear and also reduce the effective abrasive action of WC ball by acting 
as a barrier. As a result, the wear rate gradually decreases. The wear rate, worn-out 
volume, and coefficient of friction were also carefully measured after the end of 

Q 

wear testing. It was observed that the average wear rate (0.0028 mm /mm) and the 

Q 

wear volume (0.0020 mm ) of coated surface are significantly lower as compared to 

Q Q 

the average wear rate (0.0050 mm /mm) and the wear volume (0.0032 mm ) of the 
as-received substrate. A significantly reduced coefficient of friction may also be 
noted on the coated surface (0.15) as compared to the as-received substrate (0.22). 
A detailed study of the coefficient of friction showed that in as-received AISI 
304 stainless steel, the coefficient of friction initially increased from 0.185 to a 
value of 0.275 and remained constant after that (cf. Fig. 26). The increase in 
coefficient of friction with time is attributed to the gradual roughening of the 
surface during wear. The reduced coefficient of friction in deposited surface is 
attributed to possibly a lower rate of wear and, hence, a nominal change in surface 
roughness with time. In addition, the change in microstructure and phase compo- 
sition of the surface also contributed to a reduced coefficient of friction on depos- 
ited surface. It was concluded that a reduced coefficient of friction of the deposited 
surface also decreased the wear rate leading to a decreased wear volume. A detailed 
study of the microstructures of the wom-out debris was undertaken to understand 
the mechanism of wear. Figure 27a and b shows the pictorial views of the real-time 
applications of thermal spray deposition showing (a) tungsten carbide coatings on 
rolls and (b) HVOF coating on boiler tubes (http://www.emcpl.in/). 
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Fig. 27 Pictorial views of the real-time applications of thermal spray deposition showing (a) 
tungsten carbide coatings on rolls and (b) HVOF coating on boiler tubes (http://www.emcpl.in/) 


Development of Coating for Oxidation Protection 

Thermal spray coatings are extensively used by industry to protect steel components 
and structures from heat oxidation at surface temperatures of 1,095 °C (2,000 °F). 
Notable example in this regard includes thermal barrier coatings which are 
essentially ceramic -based coatings applied for providing the thermal insulating 
barrier and high-temperature oxidation resistance properties of the component to be 
used for high-temperature application (Padture et al. 2002). Typical thermal barrier 
coatings consist of two layers: the first one is bond coat of MCrAlY type, where “M” 
may be replaced with Ni, Co, or Fe depending on the material composition of the base 
structure or Pt-aluminide coating, and the second one is usually yttria- stabilized 
zirconia (YSZ) which provides good thermal resistance. Though thermal barrier 
coating is extremely important for improving the lifetime of the component, the 
major concern in thermal barrier coating is induction of large thermal stress during 
thermal cycles in an oxidizing environment due to the thermal expansion mismatch 
between the ceramic top coat and metallic bond coat (Padture et al. 2002). Thermal 
barrier coating may be developed by some processing techniques such as by plasma 
spraying, powder metallurgy, and in situ synthesis (Busso et al. 2007). Plasma spray 
coatings involve successive accumulation of molten or semi-molten droplets spread- 
ing on the substrate surface, and forming thin coating upon solidification remains a 
popular technique for the development of TBC on large structures (Kim et al. 2003). 
Though there are several reports on thermal properties of plasma- sprayed duplex 
coating, however, studies on oxidation resistance of functionally graded thermal 
barrier coatings are limited. Khor and Gu (2000) showed that the thermal cycling 
resistance of YSZ and NiCoCrAlY functionally graded coatings was five times better 
than that of the duplex coating with a similar thickness. The failure modes of the 
coatings under thermal cycling conditions were surface cracking and delamination, 
which were caused by large residual radial/tangential stresses and axial/shear 
stresses, respectively. Vassen et al. (2010) showed that plasma spray deposited 
thermal barrier coatings made of La 2 Zr 2 0 7 and Gd 2 Zr 2 0 7 pyrochlores and SrZr0 3 
perovskites with a double layer have a longer cycling lifetime compared to the single 
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Fig. 28 Scanning electron micrographs of the cross section of the (a) duplex coating and (b) 
graded coating developed by plasma spray deposition technique on top of CoNiCrAlY bond coat 
developed by high-velocity oxy-fuel (HVOF) spray deposition technique (Nath et al. 2013) 


layer systems. In this regard, it is relevant to mention that plasma spray coating 
technique can be employed to achieve specific properties (low thermal conductivity, 
longer thermal cycling life, etc.) of the top coat and bond coat. However, application 
of HVOF spraying technique for the deposition of bond coating has been recently 
reported to form a dense coating with an increased surface roughness which is 
comparable to the same developed by vacuum plasma spraying technique 
(Rajasekaran et al. 2011). However, a detailed study of the oxidation behavior of 
the same has not been undertaken. 

Figure 28a and b shows the scanning electron micrographs of the cross section of 
the (a) duplex coating and (b) graded coating developed by plasma spray deposition 
technique on top of CoNiCrAlY bond coat developed by high-velocity oxy-fuel 
(HVOF) spray deposition technique (Nath et al. 2013). From Fig. 28a, it is evident 
that both the top coat-bond coat and bond coat- substrate interfaces are defect-free. 
Furthermore, there is a deeper penetration of bond coat inside the substrate due to 
coating with a very high velocity in HVOF spraying. Figure 28b, on the other hand, 
shows that the graded thermal barrier coating layer is developed by continuous 
change in the composition with 100 % CoNiCrAlY bond coating (of a thickness of 
80 mm) on the substrate, followed by second layer of 70 % CoNiCrAlY + 30 % 
YSZ (of a thickness of 80 mm), third layer of 50 % CoNiCrAlY + 50 % YSZ (of a 
thickness of 80 pm), fourth layer of 30 % CoNiCrAlY + 70 % YSZ (of a thickness 
of 80 pm), and 100 % YSZ top layer (of a thickness of 100 mm). However, the 
presence of microporosities was observed in both the top coatings (duplex and 
graded) and bond coating with the average area fraction of porosities ranging from 
3 to 8 % at the top coating and 2-3 % at the bond coating, respectively. 

A detailed study of the residual stress developed due to the spraying shows that 
on the surface of both the duplex and graded coating, there is presence of residual 
stress of very low magnitude, i.e., 28.3 MPa in duplex coating and 15.5 MPa in 
graded coating, and are tensile in nature. The presence of tensile residual stress of a 
very low magnitude at the surface is attributed to the thermal quenching effect from 
the molten state during plasma spraying operations. In graded coating, the 
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magnitude and nature of stress, however, varies with depth. Initially it increases to 
34.5 MPa in the second layer (70 % YSZ + 30 % CoNiCrAlY) and 49.8 MPa in the 
third layer (50 % YSZ + 50 % CoNiCrAlY) and then decreases with change of its 
magnitude to a compressive stress at a value of —31.7 MPa (30 % YSZ + 70 % 
CoNiCrAlY) at the near-interface region. The change in stress from tensile to 
compressive is possibly attributed to the application of large velocity particles 
during spraying and the presence of transformational stress while transforming 
from tetragonal to monoclinic zirconia. 

Oxidation proceeds by three stages in as-received Inconel 718: stage I oxidation 
with activation energy of 162 kJ/mol, followed by increased activation energy of 
186 kJ/mol due to the formation of adherent Cr 2 0 3 film on the surface. During stage 
III oxidation, the activation energy of oxidation decreases to 163 kJ/mol due to 
volatilization of Cr 2 0 3 layer at high temperature and formation of other oxides such 
as NiCr 2 0 4 , FeCr 2 0 4 , Fe 2 0 3 , and CrNb0 4 . 

The kinetics of non-isothermal oxidation of graded thermal barrier coating 
shows low value of activation energy at initial stage of oxidation (stage I) due to 
the formation of alumina scales throughout the coating thickness. During stage II 
oxidation, the activation energy increases to 187 kJ/mol which in stage III further 
increases to 419 kJ/mol due to the presence of increased alumina scale which 
hinders oxygen ion diffusion. The non-isothermal oxidation of duplex thermal 
barrier coating follows two stages with activation energy of 167 kJ/mol during 
stage I oxidation followed by 2 1 2 kJ/mol at stage II oxidation due to the formation 
of TGO layer (Nath et al. 2013). 


Repairing 

Thermal spray coatings are popular and user-friendly techniques for repairing 
damaged structures. The damaged parts are often built by thermal spraying with 
improved performance by proper choice of thermal spray parameters and coating 
composition to ensure a minimum residual stress and improved mechanical prop- 
erties including a graded composition. In addition, steps should be carefully 
followed to minimize the possibilities of introduction of defects in the structures. 
For rebuilding or repairing of damaged parts, the following steps are followed: 

(a) Cleaning of damaged parts: The damaged parts may be cleaned by solvent 
degreasing or by mechanical treatment like sandblasting based on the degree of 
contamination at the damaged surface. Any approved industrial solvent may be 
used to completely remove grease or oil from the surface. 

(b) Pretreatment: Finishing may be carried out to remove any major scoring on 
spring tips blending to form a uniform concentric base. 

(c) Masking: The areas surrounding the damaged parts are carefully masked using 
a heavy duty masking tape. 

(d) Inspection: The damaged area is thoroughly inspected to ensure a 
contamination -free surface. 
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(e) Development of bond coat: A thin layer of coating with the near equivalent 
composition of the damaged component is initially developed so that a good 
bonding is achieved between the substrate and the tailored top layer. The 
thickness of the bond coat usually ranges from 0.05 to 0.15 mm. The spray 
stream is at 90° to the surface being coated and traversed by hand to give an 
even coverage over area being refurbished. 


Development of Top Coating 

Finally, the top coat is developed by applying the same parameters as that of bond 
coat, and the coating thickness is dependent on the depth and dimension of damaged 
areas. Typical examples of application of repairing include (a) refurbishing of worn 
clutch diaphragm spring on its tip ends at the point where it comes into constant 
contact with the thrust bearing: the cost involved in removing the old spring from its 
housing is expensive, and hence, metalizing may be applied to repair the damaged 
part of diaphragm spring without removing it from the housing, (b) The reclamation 
of crankshaft is achieved by electric-arc spraying which can produce almost 50-90 
% savings over replacement cost. The need for reclamation may be to correct 
manufacturing errors and to repair an obsolete part or a part that has worn exces- 
sively in service. Thermal spray may be applied for repairing of mill rolls and also 
as a finishing step of mill rolls to improve its performance by (a) providing the 
necessary surface roughness, (b) providing necessary hardness to withstand the 
rolling forces, and (c) applying coating with a high bond strength to the parent metal 
in order to prevent any failure of the coating. 


Electrical Applications 

As with thermal properties, the electrical conductivity of thermal spray materials is 
anisotropic and is reduced compared to their wrought or sintered counterparts due 
to their lamellar microstructure and porosity. Metallic or conductive cermet coat- 
ings are, however, used as electrical conductors where wear resistance must be 
combined with electrical conductivity. For an improved wear and corrosion resis- 
tance, coating needs to be sealed to prevent moisture, even from the air, from 
penetrating the coating and reducing its insulating capability. Thermal spray coat- 
ings have also been used to produce high-temperature thermocouples and strain 
gages. Electromagnetic or radio -frequency shielding can also be provided by flame 
or electric-arc sprayed layers of zinc, tin, or other metals. 


Dielectric Coatings 

Materials such as oxide ceramics are used to provide dielectric properties to several 
electronic components. Dielectric coatings with dielectric strengths of 1 ,000 V per mil 
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(40 V per micrometer) are routinely produced. Dielectric coatings are applied in 
semiconductor heat sinks, in corona suppression in high-voltage systems (includ- 
ing PVD processing), in high-temperature strain gages, for fabrication of thermal 
spray resistance heaters, as substrates for sprayed electrical conductors, and in 
free-standing electrical insulators. The commonly used thermal spray dielectric 
materials are oxide ceramics and polymers. The commonly used oxide ceramics 
are aluminum oxide (A1 2 0 3 ), titanium dioxide (Ti0 2 ), yttria (Y 2 0 3 ), and mag- 
nesium aluminate (MgA 12 0 4 ). Ceramics are more durable, are wear and corro- 
sion resistant, and have higher dielectric strengths than polymers. Dielectric 
coatings may be applied by plasma spray deposition and HVOF spray deposition 
techniques on metallic or composite substrates. However, process parameters 
need to be optimized for achieving a defect-free coating with a minimum stress 
at the interface. 


Thermal Spray Polymer Coatings 

Polymer spraying is a one-coat process that serves as both the primer and the sealer, 
with no additional cure times. Polymer powders are specified by their chemistry, 
morphology, molecular weight distribution or melt-flow index, and particle-size 
distribution. Polymeric coatings by thermal spray deposition technique are gaining 
increasing importance in automotive, aerospace, and petrochemical sectors for 
protecting the surface of components against general corrosion and humidity. 
Polymeric coatings may be applied for reducing coefficient of friction, for electrical 
insulation, and for tailoring electrical and magnetic properties. The advantages 
associated with thermal spray deposition include (a) processing without volatile 
organic solvents, (b) ability to coat large substrates, and (c) ability to coat high- 
viscosity polymers or composite coatings. In addition the coating process is a single 
step. For polymeric coating flame spray, plasma spray and HVOF spray are 
commonly used. The feedstock material is usually in the form of powder with a 
particle size ranging from 45 to 1 80 pm and with a narrow particle size distribution. 
The feedstock powder may be injected internally or externally. Flame spray is an 
attractive cost-effective technique for the spraying of low melting point polymer 
commonly applied in ladder racking, bridges, river tug boat hulls and barges, 
wastewater clarifiers, wastewater surge tanks, and lighting posts and coating of an 
exhaust systems in submarines. However, because of a low flame temperature of 
flame spraying, substrate preheating varying from 90° to 200° is essential for an 
improved bonding of the coating with the substrate. The polymers used are mainly 
polyetheretherketone (PEEK) and polyphenylene sulfide by applying external pow- 
der feeding. Plasma spray deposition is employed to develop polymeric coatings 
like nylons, phenolic, phenolic-epoxy blends, polyvinyl fluoride (PVDF), ethylene- 
acrylic acid copolymer (EAA), ultrahigh molecular weight PE, and epoxy enamels 
to provide corrosion protection of metals and metal reinforced polytetrafluor- 
oethylene (PTFE) coating on rolls and drums in paper, textile, and packaging 
industries, food processing, and plastic molding. It is also used as wear resistance 
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coating, consisting of WC-Co or Mg-Zr filler containing nylons, fluoropolymers, 
and copolymers. However, for each coating chemistry and technique, spray param- 
eters must be optimized to accommodate each particular polymer formulation. A 
detailed review of literature on the polymeric coating by thermal spray deposition 
technique is reported by Petrovicova and Schadler (2002). 


Development of Functionally Gradient Coatings 

Functionally gradient materials (FGMs) are emerging materials with an improved 
service life and are having a promising future for the production of (a) improved 
materials and devices for applications subject to large thermal gradients, (b) lower- 
cost clad materials for combinations of corrosion and strength or wear resistance, 
and (c) perhaps improved electronic material structures for batteries, fuel cells, and 
thermoelectric energy conversion devices (Heimann 2008). Thermal spray forming 
of such graded structures has been proposed because of the unique ability of 
depositing thin, individual layers of a wide range of materials - metals, interme- 
tallics, and ceramics - thereby enabling layered or continuously graded structures to 
be produced. 

The most immediate application for FGMs is in thermal barrier coating, where a 
large thermal stresses could be minimized and component lifetimes can be 
improved by “tailoring” the coefficients of thermal expansion, thermal conductiv- 
ity, and oxidation resistance. These FGMs could, and indeed are, find application in 
turbine components, rocket nozzles, chemical reactor tubes, incinerator burner 
nozzles, or other critical furnace components. 


Development of Nanostructured Coating for Structural 
and Functional Application 

Nanostructured materials possess micro structure in the length scale of 1-200 nm. 
Nanostructured coatings have achieved increased attention in recent years due 
to their outstanding properties like higher strength, hardness, and corrosion 
resistance as compared to those of micron-size counterparts. The improved 
mechanical properties of nanocrystalline materials is attributed to refined grain 
size and the subsequent improvement in hardness as predicted by Hall-Petch theory 
and its continuous modifications based on different dislocation-grain boundary 
interaction mechanisms; nanostructured coating may be developed by physical, 
chemical, and electrochemical deposition techniques. However, adhesion of the 
coating at the interface is very poor. Thermal spraying has commonly been 
accepted as the most effective and economic method for the fabrication process 
of the nanostructured coatings and thus has contributed in a wide variety of high- 
performance applications requiring improved wear, erosion, corrosion, and thermal 
insulation resistance (Lau and Lavemia 1999; Lau et al. 1998). However, the main 
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difficulty in development of nanostructured coating by thermal spraying route 
includes the problem of agglomeration (when the precursor powders are of low 
particle size), coarsening of microstructure, and reaction with environment (due to 
its high specific surface area and reactivity). Due to decreasing particle size, the 
fluidization of the spray powder gets more and more challenging; hence, it is 
difficult to develop coating by thermal spraying. Among all thermal spray deposi- 
tion technique, only cold spray is an effective route for development of nanostruc- 
tured coating without compromising their beneficial micro structure (Lima 
et al. 2002b; Singh et al. 2010; Li et al. 2007). Nanostructured WC-Co cermets 
coatings show a superior wear resistance property due to the increased hardness of 
coatings by the decrease in the particle size of the feedstock powder (He and 
Schoenung 2002). Cold spray deposited nanostructured Cr 3 C 2 -25(Ni20Cr) 
exhibited a 20.5 % increase in microhardness of 1,020 HV300 as compared with 
the corresponding conventional coating of 846 HV300 (Lima et al. 2002a). Nano- 
structured WC-12Co coatings are deposited successfully by cold spraying using a 
nanostructured feedstock with 1,800 Hv0.3 microhardness of sprayed coating, with 
reported critical velocities of about 915 m/s, by nozzle of downstream length 
100 mm, throat diameter of 2 mm and exit diameter of 4 mm, and standoff distance 
20 mm and using helium gas at 2 MPa pressure and 600 °C temperature in the 
prechamber (Champagne et al. 2005). Lima et al. (2002b) produce pure and well- 
bonded nanostructured WC-12%Co coatings on low carbon steel substrate with a 
density of 1 ,225 kgf/mm 2 and Knoop microhardness of around 42 kgf/mm 2 by cold 
spray processing. 

Summary 

The present contribution gives a brief overview of fundamentals of thermal spray 
deposition, history, its classification, and the current understanding of thermally 
sprayed coatings. The major activities of thermal spray deposition concern devel- 
opment of thick coating for structural applications. The most industrially applied 
techniques include wire-arc spraying for corrosion resistance application and 
plasma spraying and HVOF spraying for wear and corrosion resistance application. 
Challenges lie in the development novel coatings by reaction synthesis and com- 
positionally graded coating. Cold spray offers an excellent scope of development of 
nanostructured and amorphous coating. However, detailed studies need to be 
undertaken to optimize the coating parameters. Development of novel coatings by 
hybrid coating technique, where, thermal spraying may act as a base for thick 
coating which subsequently acts as the substrate, for the precision deposition by 
thin film deposition route. The development of a cost-effective thermal 
spraying equipments needs due attention for reducing the cost of the sprayed 
components. In addition, industries should have their research wing in the academic 
institutions to utilize the fruits of the basic research worn in the field of thermal 
spray deposition. 
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Abstract 

Owing to its unique advantages of cost-effectiveness, reliability, and also the 
atom-by-atom replication on the given substrate surface profile, electrochemical 
process has been widely used for high-quality coatings. The present chapter 
presents a review and summary of the principles of electrochemical processes; 
the micro structure and composition control of metals, metal oxides, silicon, 
non-oxides, and conductive polymers in electrochemical process; and also 
their recent applications in microfabrication, energy conversation and storage, 
self-protection as well as drug delivery, etc. 
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Introduction 

An electrochemical process is a chemical reaction caused by the applied electrical 
current (Tsang et al. 2009; Shinomiya et al. 2006; Wu et al. 2010a). It involves 
oxidation-reduction reactions where an atom or molecule is formed with the gain or 
loss of electrons by the charged ions (Sonnenfeld and Schardt 1986). In electro- 
chemical reactions, the electric current is produced as a result of electron transfer 
from the atoms or molecules present in the reaction. Many natural phenomena 
involve electrochemical processes, such as the corrosion of metals (Revie and Uhlig 
2008; Wessling and Posdorfer 1999), the ability of some sea creatures to generate 
electrical fields, and the signal processing and sensing of the nervous systems in 
humans and animals. 

In modern manufacturing technologies, electrochemical processes have been 
widely applied in various fields including electrical power storage of batteries (Gao 
et al. 2004; Trevor et al. 1989; Ocko et al. 1990), electrolysis (Nozik 1975; Bak 
et al. 2002; Rossmeisl et al. 2007), and, most importantly, electrochemical 
deposition. 

This chapter summarizes the recent progress in the electrochemical processes for 
manufacturing various materials, including metals and metal oxides as well as 
electrical conducting polymers. The aim is to present a general understanding in 
the growth mechanisms of particular materials in the electrochemical processes and 
also their applications. 

The main text of this article is divided into three sections. The first section 
discusses the process parameters’ (i.e., potential, current, temperature, and 
precursors) effects on the morphology, sizes, compositions, and physical prop- 
erties of the materials in electrochemical processing. The applications of 
electrochemical process will be reviewed in the following part. Finally, this 
review concludes with the perspectives on the future research directions of 
this area. 


Manipulation of Micro-/Nanostructure 

Electrochemical deposition is used for manufacturing various products of our 
modern life, as a process by which a film is deposited from a solution of ions 
onto an electrically conducting surface. 

During the electrochemical process, the ions in a solution are driven toward a 
substrate or electrode by an applied electrical field and then coated on the electrode. 
The simplest structure of an electrochemical cell is of a two-electrode system that 
consists of a cathode and an anode. Such a cell works well under the equilibrium 
conditions. However, in the nonequilibrium conditions, measuring the potential 
between the two electrodes is inaccurate due to the redox potentials and 
overpotential. To tackle this problem, a third electrode needs to be used as the 
reference potential. In general, the normalized hydrogen electrode (NHE), which is 
defined to be zero potential, is widely used as the reference. In addition, silver 
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chloride electrode is another common reference electrode for electrochemical 
measurements. 

The electrochemical processes can be summarized to consist of following steps 
(Paunovic 2007): 

• Ions migration: The hydrated metal ions in the solution migrate to the cathode 
under the influence of applied potential. 

• Electron transfer: Near the cathode surface, a hydrated metal ion enters the 
diffused double layer where the water/solvent molecules of the hydrated ion 
are aligned. Subsequently the metal ion enters the Helmholtz double layer where 
it is deprived of its hydrate envelope. 

• The dehydrated ion is neutralized and adsorbed on the cathode surface. 

• The adsorbed atom then migrates or diffuses to the growth point on the cathode 
surface. 


Metals 

The electrochemical deposition process has been widely used for coating thin layers 
of different metals (Tsang et al. 2009; Magnussen et al. 1990; Ogata et al. 2006) in 
many traditional technologies. For example, copper plating is used to prevent case/ 
surface hardening of steel on the specified parts of the components (McCullough 
and Reiff 1924), silver plating on tableware and electrical contacts along with 
engine bearings(Faust and Thomas 1939), gold platting on jewelry and watch 
cases, and zinc coating to prevent the corrosion of steel articles (Hinton and Wilson 
1989; Marder 2000; Srivastava and Mukerjee 1976), while nickel and chromium 
plating are used on automobile wheel rims and household appliances. In the modern 
electronic industry, the electrochemical deposition also has many applications such 
as the printed circuit boards, through-hole plating, multilayer read/write heads and 
thin-film magnetic recording media, etc. 

Principle 

A substrate is immersed in a solution (the bath) containing the desired metal in the 
oxidized form either as an aquatic cation or as a complex ion. The anode is usually a 
bar of the metal being plated. During electrolysis, the metal is deposited onto the 
working electrode/substrate by following reactions as: 

At cathode : Mz + (aq) + ze » M(s) 

At anode : M(s) — > Mz + (aq) + ze— 

The amount of metal deposited on the substrate can be estimated by Faraday’s 
laws of electrolysis. 

Structures : The structures of the as-deposited metal layers, e.g., morphology, 
crystallinity, thickness, etc., can be controlled by varying the process parameters 
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Fig. 1 Schematic figure for 
the synthesis of bunch-like 
bismuth (Zhang et al 2009, 
copyrights) 



b 




such as voltage, deposition temperature, and adding agents (Wang et al. 2005). For 
example, the metal nanowires with high length/diameter ratio can be prepared by 
electrochemical deposition with the addition of porous aluminum oxide (PAO) 
template (Gelves et al. 2006). The self-organized morphology of metal nano wires 
can also be prepared by two-step template synthesis using electrodeposition in 
which the anodic alumina membrane (A AM) was used. In the first step, the metal 
nanowires embedded in the AAM were fabricated by pulsed electrodeposition. 
After the growth of nanowires embedded in the AAM, these nanowires array can 
act as nano-electrode array or more specifically as cathode for the following 
deposition process. With applied electrical field vertically to the metal (Bi)-nano- 
electrode array, the nanowires may bend down, which cause their ends assembled 
or knitted together from different orientations, thus generating bunches and the 
nests of the bunch-like metal nanostructure (Zhang et al. 2009). The schematic 
diagram for the process is illustrated in Fig. 1. 

The metal nanowires prepared by the electrochemical deposition are usually 
poly crystalline in nature. In a study carried by Tian et al., the simultaneous growth 
of single and poly crystalline metal nanowires, such as Ag, Au, Ni, Cu, Co, and Rh, 
was reported via the aforementioned electrochemical deposition route (Tian 
et al. 2003). Different growth models have been proposed to understand the growth 
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mechanism of the noble metal nanowire. The single crystals of low melting point 
metallic nanowires, such as Au, Ag, and Cu, in the electrodeposition are grown with 
the 2D-like nucleation mechanism (Tian et al. 2003). Such finding revealed that the 
new grains grew when the size of an initial cluster exceeds the critical dimension N c 
during the deposition. The larger N c facilitates the single crystal to grow from a 
previously nucleated seed grain. The critical dimension N c for a 2D-like nucleus 
can be expressed as N c = bse /(Zerj) , where s is the area occupied by one atom on 
the surface of the nucleus, e the edge energy, Z the effective electron number, r| the 
over potential, and b a constant, respectively. 

The electrochemical deposition process has also been applied to prepare 
transition-metal nano wires (Pan et al. 2005). In a study, Pan et al. reported that 
[220] is the favorable growing orientation of Ni nano wire. It was found that the 
single crystal structure of the nanowires can be changed to poly crystalline structure 
at lower potential or higher processing temperature. The texture of thicker Ni layer 
is a result of competitive growth among adjacent grains at the stage of grains 
formation. The grains grow faster along the normal of the planes that have low 
facet index due to their lower surface energy, resulting in the increase of grain size 
and facilitation of the columnar grain formation. In addition, the confinement of the 
nanoporous structure in the anodic aluminum oxide (AAO) template can also 
promote the formation of columnar grains and the single-crystalline nanowires. 
Chu et al. (2002) demonstrated a novel approach for the fabrication of metal 
nanostructure arrays on the conducting glass substrate. During the growth process, 
a film of aluminum was firstly deposited by sputtering on tin-doped indium oxide 
(ITO) film. With oxidizing anodically, alumina template with 5- to 150-nm pores 
diameter were obtained, and then 0.5- to 2.6-pm-long Ni nanowires were grown 
within the alumina template pores. So the size and orientation of the nano wires 
could be controlled by the porous alumina films. 

A number of morphologies, such as nanowires, decahedral and icosahedral 
shapes, porous nanowires, multipods, nanobrushes, and even snowflake-shaped 
structures, have been successfully prepared with the electrochemical deposition 
technique. Even noble metal -based cubic nanobox-like structure can also be pre- 
pared with the electrochemical deposition technique (Peng et al. 2010). In order to 
successfully obtain cubic nanobox-like structure, various electrodeposition param- 
eters including potential and the concentration of solution with the supporting 
electrolyte and surfactants were optimized. 

Figure 2 schematically illustrated that the Au nanotubes can be fabricated on the 
template of polycarbonate tract-etching (PCTE) membrane with electroless depo- 
sition technique. By slowing down the electrochemical deposition, Te was depos- 
ited on the surfaces of Au nanotubes to form Au-Te nanocables. The final product 
Au-Te nanocables in which Au acts as shell and Te as core was found to have radial 
consistent dimensions with starting templates. 

Nano structured copper (Cu) particles were prepared electrochemically on 
polypyrrole (PPy) film (Zhou et al. 2004). The PPy film was deposited on a 
sputter-coated thin gold film grown on a Si (100) substrate. The deposition condi- 
tions were optimized by altering several deposition parameters, such as film 
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Fig. 2 Fabrication steps: (a) bare nanoporous PCTE membrane, (b) Au nanotube membrane 
obtained after electroless Au deposition, (c) Au-Te nanocable membrane obtained after slow 
electrochemical deposition of Te (Ku et al. 2004) 


thicknesses of gold and PPy, potential and Cu concentration to tune the particle 
density, and crystal size of copper. It was also observed that the gold and PPy film 
thicknesses are critical to exclude gold-silicon interface and to control shape 
uniformity. It was found that the deposition potential was important to control the 
size and density of nanocrystals. 

Cobalt nanoplate array on copper substrate using the electrochemical deposition 
without using any template was fabricated by Xu et al. (2012). The length and 
height of the vertically aligned nanoplates were found to be few micrometers and 
350 nm respectively. The dimensions of nanoplates were controlled by manipulat- 
ing the electroplating conditions. 

From the above discussion, the electrochemical deposition had been proved an 
effective route to fabricate metal micro-/nanostructures. By tuning the reaction 
parameters or using the additional template, the desired morphology can be 
achieved. 

Corrosion of Metals 

Corrosion of metals is the spontaneous chemical destruction processing of materials 
in the corrosive environment. Most often it follows an electrochemical mechanism, 
where the anodic dissolution (oxidation) of the metal and cathodic reduction of an 
oxidizing agent occur simultaneously. It causes enormous losses in our economy. 
Rusting weakens the metal structures of buildings and bridges, the equipment of 
chemical and metallurgical plants, underground pipelines, river and sea vessels, and 
other structures. 

Direct losses attributable to the corrosion include expenditures for the replace- 
ment of individual parts, units, entire lines, or plants and for various preventive and 
protective tasks (such as the application of coatings for corrosion protection). 
Indirect losses arise when the corroded equipment leads to the defective products 
that must be rejected; they also arise during downtime required for the preventive 
maintenance or repair of equipment. About 30 % of all steel and cast iron are lost 
due to the corrosion. A part of the corroded metals can be reprocessed as scrap, but 
about 10 % is irrevocably lost. In this case, corrosion protection has become very 
critical in recent years, for example, (1) the use of thin metallic support structures 
that are designed to reduce the overall metal content, (2) the use of technologically 
advanced equipment operated under extreme conditions such as nuclear reactors 
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and jet and rocket engines, and (3) the mass-scale development of products having 
extremely thin metal films, such as printed circuit boards and integrated circuits. 

Corrosion Protection 

The electrochemical and non-electrochemical ways to protect metals against cor- 
rosion can be distinguished. The non-electrochemical processing includes dense 
protective films that isolate the metal from the medium by using paint, polymer, 
bitumen, enamel, etc. There is a general shortcoming of such a technique. When the 
coating is scratched mechanically, the protection loses and the local corrosion 
activity arises. 

The electrochemical methods of protection rely on different perceptions: 
(1) electroplating of the corroding metal with a thin protective layer of a more 
corrosion-resistant metals, (2) the electrochemical oxidation of the surface, and 
(3) polarization control of the corroding metal (the position and shape of its 
polarization curves). The polarization of a corroding metal can be controlled by 
various additives to the solution, called corrosion inhibitors , which adsorb on the 
metal to lower the rates of the cathodic and/or anodic reaction. Inhibitors are used 
primarily for the acidic electrolyte solutions, sometimes also for neutral solutions. 
Various organic compounds with -OH, -SH, -NH 2 , -COOH, and so on as the 
functional groups are used as best-known inhibitors. 

In summary, metal deposition from solution has attracted renewed interest in 
microelectronics and the related areas as a simple and no versatile technique. On the 
other hand, the metal deposition from solution remains a fascinating area of 
fundamental research, which studies the important aspects including electrocrys- 
tallization, solid-solid interactions, surface dynamics and diffusion, epitaxial 
growth, and atom to solid transitions. 


Silicon 

Silicon is a widely used semiconductor for electronic and photovoltaic devices 
because of its earth abundance, chemical stability, and tunable electrical properties 
by doping. The production of pure silicon films by simple and inexpensive methods 
has been the topic of many investigations. The demand for low-cost silicon-based 
photovoltaic devices has encouraged the quest for solar-grade silicon production 
through processes alternative to the currently used Czochralski process or other 
processes. The electrodeposition of silicon offers a clean, effective, and inexpen- 
sive alternative processing over the conventional silicon processing. The following 
sections will discuss the reaction solvents and crystallinity control of the products. 

Solvents: The electrodeposition of silicon is usually processed in a nonaqueous 
medium due to its very large reduction potential and high reactivity to water. It has 
been studied more than 30 years, and a number of solution media, such as molten 
salts (LiF-KF-K 2 SiF 6 and Ba0-Si0 2 -BaF 2 ), organic solvents (acetonitrile, tetrahy- 
drofuran), and room-temperature ionic liquids, have been employed. The key factor 
of the silicon electrodeposition is the purity of the deposited silicon. The Si used for 
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the photovoltaic devices is in the solar grade of >99.9999 % (6N), while the 
electronic grade of Si needs a purity of 1 IN for electronic devices. In most of the 
cases, the electrodeposited silicon does not meet these requirements without further 
purification. 

Crystallinity: Silica or fluorosilicates have been used as the solutes to obtain 
either the amorphous silicon or metallic silicides. Gu et al. (2013) offer one possible 
means for the direct electrodeposition of crystalline Si. The initial stage involved 
electroreduction of an oxidized precursor (SiCl 4 ) to the fully reduced state (Si) at 
the electrode-electrolyte interface. The electroreduction of SiCl 4 at the solid 
electrodes has been investigated previously and found to produce purely amorphous 
Si. In the electrochemical liquid-liquid-solid (ec-LLS) scheme, the initially 
reduced Si could be partitioned into the liquid gallium phase. The solubility of Si 
in Ga between room temperature and 100 °C as determined by extrapolations of the 

o z 

published metallurgical data for the Ga-Si system ranges from 10 to 10 at.%. 

Although it is very low, the solubility of Si in Ga at 100 °C is comparable to the 
solubility of Ge in Hg at room temperature. The dissolved Si in Ga(/) could then 
reach saturation and supersaturation conditions if SiCl 4 is continually reduced at the 
electrode-electrolyte interface. When a critical supersaturation condition is 
reached, a phase separation of Si (s) from Ga(/) followed by crystal growth occurs, 
as shown in Fig. 3 (Gu et al. 2013). 

Most recently, Mallet et al. (2013) demonstrated a promising technique for 
synthesizing silicon nanotubes via electrodeposition technique. In this study, poly- 
carbonate (PC) membrane was used as an insulated nanoporous template in which 
electrodeposition was carried out by using room-temperature ionic liquids (RTILs). 
A fine adjustment of electrochemical parameters influencing ionic diffusion inside 
the nanopores of the template was used to grow the Si nanotubes at the expense of 
Si nano wires. This indicates that the electrodeposition process in the presence of 
RTILs is competitive to grow the nanostructures with a high surface to volume ratio 
with low cost in a large scale. 

In addition, Salman et al. also reported the room-temperature template (poly- 
carbonate membranes (PC))-assisted synthesis of germanium and silicon nanowires 
with the electrochemical deposition technique (Al- Salman et al. 2008). In the 


Fig. 3 Schematic depiction 
(not drawn to scale) of an 
electrochemical 
liquid-liquid-solid (ec-LLS) 
process yielding crystalline Si 
with a liquid Ga [Ga(l)] 
electrode (Gu et al. 2013) 
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synthesis process, the air- and water-stable ionic liquid 1 -butyl- 1 -methylpyrro- 
lidinium bis(trifluoromethylsulfonyl) amide ([Pyi, 4 ]Tf 2 N) containing GeCl 4 and 
SiCl 4 as a Ge and Si source was used. 

The electrochemical synthesis of amorphous silicon in two non-oxygenated 
organic solvents, acetonitrile and dichloromethane, under controlled atmosphere 
was reported by Bechelany et al. (2012). In both solvents, tetraethylammonium 
chloride had been used as a supporting electrolyte and silicon tetrachloride as a 
silicon precursor. By tuning solvent concentrations and experimental parameters, 
the formation from dense to highly porous Si deposits can be observed. A heat 
treatment under hydrogen was performed to enhance the stability of the deposits 
prior to air exposure. 

Amorphous silicon has also been successfully electrodeposited on copper using 
a SiCl 4 -based organic electrolyte under galvanostatic conditions (Epur et al. 2012). 
The deposited films were tested for their possible application as anodes for Li-ion 
battery. The results indicate that this binder-free amorphous silicon anode exhibits a 
reversible capacity of ~ 1,300 mAh g -1 with a coulombic efficiency of >99.5 % up 
to 100 cycles. Impedance measurements at the end of each charge cycle show a 
non-variable charge transfer resistance which contributes to the excellent 
cyclability over 100 cycles observed for the films. This approach of developing 
thin amorphous silicon films directly on copper eliminates the use of binders and 
conducting additives, rendering the process simple, facile, and easily amenable for 
large-scale manufacturing (Fig. 4). 

The electrochemical deposition of Si films on Ni substrate for lithium-ion battery 
anodes was reported by Zhao and co-workers (2012). Their findings revealed that 
the Li + diffusion coefficient of the Si film was much higher than the pristine Si 
nanoparticles (Fig. 5). 




Fig. 4 Capacity and coulombic efficiency plots cycled at 400 mA g 1 between 1.2 and 0.02 V for 
100 cycles copyright (Epur et al. 2012) 
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Fig. 5 Impedance study of 
the pristine Si nanoparticle 
anode and the Si him anode: 
(a) Nyquist plots, (b) the 
relationships between Z re and 
co 1/2 at low-frequency region 
(Zhao et al. 2012) 


a aoo 

■ 

• 

■ 

M 200 

a 

o 

■ 

* 

S 

^Ni 

a 

_ * 

100 

■ 

• • / * Sittn 

0 



0 


100 



300 


400 


500 


Z (ohm) 



It was proposed that the Si film supported on 3D conductive framework could 
accommodate the volume change in Li + insertion/extraction effectively, which 
caused a high capacity and a stable cycle performance, and that Li + ions exhibited 
fine diffusion kinetics in the Si film. 

In summary, electrodeposition is a promising environmental friendly, 
low-temperature, and low-cost approach to deposit both amorphous and crystalline 
silicon for industry applications, such as anode materials for solar cells and thin film 
transistors, etc. It is prospected that in the near future, the electrodeposition of 
silicon from low-toxic, inexpensive solvents will have great potential for large- 
scale production applications. 


Metal Oxides 

The electrochemical deposition of metal oxides is mainly performed in alkaline 
solutions containing metal precursors. The deposition of metal oxides can be 
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carried out under oxidizing conditions as well as reducing conditions from the 
alkaline solutions. In both cases, the metal ion dissociates from the precursor and 
precipitates on the electrode as the oxide (Azaceta et al. 2012; Schrebler et al. 2006; 
Tench and Warren 1983; Stevenson et al. 1999; Wang et al. 2010, 2011; Qu 
et al. 2006; Anees et al. 2011; Fernandes et al. 2009). What controls the ability to 
deposit an oxide is the stability of oxide under the experimental conditions, i.e., the 
potential, temperature, and local pH values near the electrode surface (Dube 
et al. 1995; Kendig et al. 1994; Wu et al. 2010b). 

Principle 

Metal oxide can be obtained from the decomposition of metal hydroxide with 
thermal treatment produced by metal and oxygen precursor during the electrochem- 
ical deposition. Hydroxyl ions are used as the oxygen precursor and can be obtained 
by two methods. One is directly changing the solution pH value by adding alkali. 
But this method usually requires complex agent to stabilize the metal ion in high pH 
value solution. Once the metal ion gets electrons from substrate, the valence of the 
metal ion will be changed, and then the metal ion becomes unstable with the 
complex agent and comes out to react with hydroxyl ions to form metal hydroxide. 
The other method is using water or other oxygen compounds, 0 2 , H 2 0 2 , or N0 3 , to 
generate hydroxyl ions by getting electrons. The local pH increases and the metal 
ions become metal hydroxide precipitated on the substrate electrode. 


N0 3 “ +H 2 0 + 2e“ - 

> N0 2 “ + 20H - 

(i) 

H 2 0 2 + 2e 

■* 20H" 

(2) 

0 2 + 2H 2 0 + 4e 

' -► 40H“ 

(3) 


Deposition of the binary oxides under the reductive conditions includes the 
reduction of metal ions that form stable oxides. On the other hand, the cathodic 
deposition of metal oxides relies on precipitations due to the electrochemically 
induced local pH alterations. The reduction of the dissolved oxygen or nitrate 
ions results in the local production of hydroxide ions, which, for example, 
enables the deposition of CdO and ZnO. However, the hydroxide ions are also 
produced upon the dissociation of complexes with ligands like lactic, tartaric, 
and citric acid containing hydroxyl groups. During the recent years, there have 
been tremendous reports in the nano structural morphology control of metal 
oxide thin films with the electrochemical process because of the morphology- 
dependent physical and chemical properties of metal oxide thin films. Thus, the 
following sections will discuss recent progress in shape and size control of these 
oxide nanostructures. 

One-dimensional nanostructures: One-dimensional (ID) metal oxide 

nanostructures, including nanowires, nanorods, nanotubes, and nanoribbons, have 
been intensively studied owing to their unique physical properties and fascinating 
potential applications in future nanoelectronic devices. Their superior physical 
properties including superconductivity, enhanced magnetic coercivity, and the 
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unusual magnetic state of some ID nanostructure have been theoretically predicted 
and experimentally demonstrated. Various nanodevices, such as sensors, lasers, 
field effect transistors, optical waveguides (Revie and Uhlig 2008), cantilever 
resonators, and generators, have been fabricated by using the ID nanostructure as 
the building blocks. 

Generally, there are two strategies to produce the ID metal oxide nanostructures 
through the electrochemical process. One is the template-assisted electrodeposition, 
where the ID anisotropic growth is realized by using various templates to confine 
the growth space of the electrodeposits. The template-assisted electrodeposition has 
been demonstrated to be a versatile approach for the preparation of the ID nano- 
structure of numerous materials, including metals, semiconductors, and conductive 
polymers. Another strategy is the template-free electrodeposition, where the ID 
anisotropic growth is achieved by utilizing the intrinsic anisotropic crystallographic 
structure of a targeted material. 

Binary metal oxide nanowires can be fabricated via the electrodeposition by 

o 

dissolving molecular ions such as Cl , S0 4 “ , and CH 3 COO in solutions having 
different anions (Elias et al. 2008a). The nature of anions in electrolyte solution was 
found to be critical in tuning length and diameter of nano wires. The nano wires with 
higher aspect ratio can be obtained in acetate solution for preparing binary oxide 
nano wires. For example, the single-crystalline ZnO nano tubes with an external 
diameter ranged 200-500 nm and length 1-5 pm were prepared electrochemically 
and shown in Fig. 12 (Elias et al. 2008b). The whole process was completed in three 
steps with combination of electrochemical and chemical approaches. In the first 
step, the ZnO nanowires were prepared by the electrochemical deposition. Second, 
the selective etching of ZnO nano wires in KC1 solution was carried out to transform 
them into nanotubular architecture. The KC1 concentration and temperature with 
immersion time were critical during the whole etching process. The evolution of the 
etching (dissolution) starts from the tip of the nanowires toward their bottom. The 
dissolution affects the nanowire core preferentially, leaving the lateral faces, 
resulting in a tubular structure. Degradation of the lateral walls starts when the 
core is fully dissolved, inducing a decrease of the length of the nanotubes until the 
complete dissolution of the ZnO nanostructures. 

The electrochemical synthesis of Ti0 2 nanotubes is one of the research focuses 
in the electrodeposition of metal oxides (Roy et al. 2011), which has been widely 
used in dye-sensitized solar cells. The first self-organized anodic oxides on titanium 
were reported for the anodization in chromic acid electrolytes containing 
hydrofluoric acid by Zwilling and co-workers in 1999 (Z willing et al. 1999). The 
500-nm-thick nanotube structure was not highly organized and the tubes showed 
considerable sidewall inhomogeneity. However, it was recognized that small addi- 
tions of fluoride ions to an electrolyte are the key to form these self-organized oxide 
structures. 

Kelly (1979) explored the influence of fluorides on the passivity of titanium and 
concluded that porous oxide layers formed for low fluoride concentrations. Unfor- 
tunately, the (likely) presence of self-organized Ti0 2 nanotube layers was not 
verified by performing high-resolution electron microscopy (Fig. 6). 
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Fig. 6 TEM photo of a film 
formed at 10.0 V (see) in 10 -2 
M NaF solution (Kelly 1979) 



Most crucial improvements to the geometry of the tubes were established by 
Macak et al. (2005a). It demonstrated that the pH plays a crucial role in improving 
the tube layer thickness, that is, at neutral pH values, much longer tubes could be 
grown (Macak et al. 2005b). In nonaqueous electrolytes, smooth tubes without 
sidewall inhomogeneity (ripples) can be grown to much higher aspect ratios and 
show a strongly improved ordering. 

Wang et al. reported highly ordered Ti0 2 nanotube arrays via the electrochem- 
ical anodization of high-purity Ti foil and Ti thin film-coated indium tin oxide 
(ITO) glass in fluorine containing electrolytes (both aqueous and nonaqueous). 
The formation of the well-aligned Ti0 2 nanotube arrays was affected by the 
electrolyte temperature and the applied anodization potential. In the aqueous 
electrolyte, the anodization potential exerted significant influence on the formation 
of Ti0 2 nanotube arrays, while the limited effect from the electrolyte temperature 
was observed. In the nonaqueous electrolyte, the electrolyte temperature markedly 
affected the Ti0 2 nanotube dimensions while the anodization potential exhibited 
slight influence in this regard. As a consequence, Ti0 2 nanotube arrays with tube 
diameters ranging from 20 to 90 nm and film thicknesses ranging from several 
hundred nanometers to several micrometers were obtained. 

Two-dimensional nanostructures: Two-dimensional (2D) nanostructures, 
especially nanosheets, which possess atomic or molecular thickness and infinite 
planar dimensions, are regarded as the thinnest functional nanomaterials. To date, 
the synthesis of 0D (e.g., nanoparticles and nanodots), ID (e.g., nanotubes and 
nanowires), and 3D nanomaterials (e.g., nanocubes) has been well documented. In 
contrast, 2D nanomaterials have remained conspicuously absent until the recent 
emergence of processes for producing graphene and inorganic nanosheets. How- 
ever, graphene is a conductor, and electronic technology also requires insulators, 
which are essential for many devices such as memories, supercapacitors, and gate 
dielectrics. One reason why 2D nanomaterials have been less thoroughly studied 
may be related to the special synthetic process, involving the delamination of 
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Fig. 7 C 03 O 4 nanosheets deposited on the ITO substrate, (a) X-ray diffraction pattern {inset, 
cross-sectional image), (b) TEM image of the mesoporous sheets {inset, HRTEM with lattice 
spacing), (c) Energy-dispersive X-ray spectroscopy {inset, surface morphology) (Macak 
et al. 2005b) 


layered compounds into single layers. There are many known layered compounds 
with strong in-plane bonds and weak coupling between layers. 

Two-dimensional nanosheets are considered to be excellent candidates for future 
nanoelectronic applications. Such nanostructures and their electronic states play an 
important role in realizing the innovative electronic, optical, and magnetic func- 
tionalities. For example, the operation of almost all semiconducting devices relies 
on the application of two-dimensional interfaces. To date, various nanosheets have 
attracted increasingly fundamental research interest because of their potential to be 
used for different applications like electrochemical capacitors and supercapacitors 
For example, it has been reported that Co 3 0 4 thin films with nanosheet structure 
were prepared with a facile electrochemical deposition method, as shown in Fig. 7. 

Other oxides, e.g., manganese dioxide (Mn0 2 ) nanosheets on flexible carbon 
cloth (CC) via anodic electrodeposition technique, have also been reported. Petal- 
shaped Mn0 2 , having sheet thickness of a few nm and typical width of 100 nm, with 
a strong adhesion on CC has been observed. Besides, freestanding ZnO nanosheets 
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have been electrodeposited on an indium tin oxide substrate in zinc nitrate solution. 
It suggests that the crystallization of the ZnO nanosheet below room temperature 
slows down the whole process. 

The electrochemical deposition of metal oxide nanosheets represents a facile 
approach to fabricate oxide nanomaterials with high surface area, especially suit- 
able for energy conversion and storage applications. However, there are still several 
challenges: (1) the stability of as-prepared nanosheet structure. As the nanosheets 
are weak-bonded, the reliability, durability, and chemical and thermal stabilities are 
still a concern for practical applications. (2) Up to date, most reported nanosheets 
are in the category of binary metal oxides. This is because the complex oxide 
nanosheets, such as perovskite oxides that are unique in both structural diversity 
and electronic properties, are difficult to be fabricated for their potential applica- 
tions in electronics. (3) As these nanosheets are easily agglomerated, it is crucial to 
obtain stable nanosheet suspension solution for thin film and device applications. 

Three-dimensional nanostructures: The development of well-defined porous 
metal oxides with nanometer-scale periodicity is of intense interest for a number of 
applications including photonics, sensing and detection arrays, catalysis, separa- 
tions, microfluidics, and low dielectric constant (k) thin films in the microelectron- 
ics. To date, many of these materials have been prepared by the electrodeposition 
approach with the aid of different templates. For instance, the single-crystalline 
ZnO nanorod/amorphous and Mn0 2 nanoporous shell composites were prepared by 
electrochemical deposition, as shown in Fig. 8 (He et al. 2011). 

A graphene/Mn0 2 nanocomposite-based porous nanofibers in which graphene 
nanosheets were coated on the surface were also reported via electrodeposition 
route (Yu et al. 2011). These porous nanofibers facilitate both loading of active 
electrode and easy approach of electrolyte to the electrodes materials. The dip and 
dry method was evolved first to coat porous fibers on graphene nanosheets and then 
Mn0 2 was deposited electrochemically on afore prepared porous fibers on graphene 
nanosheets. It was also suggested that these architectures could be promising 
candidates for multifunctional designs. 

In summary, nano structured metal oxides with various morphologies from 
electrochemical process have been extensively studied during the last two decades 
due to their interesting electronic, magnetic, electrochemical, and optical properties 
and potential use as catalytic and electrode materials in various devices. In partic- 
ular, the electrodeposition of metal oxide nanostructure with high surface area and 
high chemical activity may be widely used as electrodes for the electrochemical 
energy storage device applications. 


Non-oxides 

To date, there are only a few non-oxides, such as copper indium diselenide 
(Hu et al. 2012; Sovannary et al. 2007) (CuInSe 2 , CIS) and bismuth tellurides 
(Magri et al. 1996; Chen et al. 2010), reported that can be deposited by electro- 
chemical processes. Copper indium diselenide (CuInSe2 or CIS) and related 
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Fig. 8 SEM images of ZnO nanorod/Mn02 shell composites with different MnC >2 shell thick- 
nesses prepared with different deposition times: (a) 30 s, (b) 2 min, (c) 3 min, (d) 15 min. TEM 
images of the different shell thicknesses at different growth times: (e) 1.5 min, (f) 4 min, (g) 6 min, 
(h) 10 min. (i) The curve of shell thickness vs. growth time (He et al. 2011) 


compounds are being used in low-cost photovoltaic applications. Conversion effi- 
ciencies of these solar cells are now approaching close to 20 %. CuInSe 2 has large 
optical absorption coefficient, which results from a direct energy gap. CIS solar 
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cells have already surpassed the conversion efficiency of 19.5 % based on a 
multistep process using evaporation. The evaporation technology is excellent for 
good quality film growth, but difficult to scale up. Currently, a great deal of effort is 
directed to a large-scale, high-quality, and low-cost technology for preparing CIS 
thin films. Electrodeposition is highly suitable to achieve this goal. The record 
efficiency of 11.3 % was reported for a cell using electrodeposition route. 

In order to obtain high-quality CIS thin films, the optimization of 
electrodeposited precursor film composition and morphology becomes extremely 
important. Sovannary et al. (2007) in their report presented vertically aligned arrays 
of CuInSe 2 (CIS) nanowires with controllable diameter and length by pulse 
cathodic electrodeposition from a novel acidic electrolyte solution into anodized 
alumina (AAO) templates, followed by annealing at 220 °C in vacuum. 

Bismuth telluride and its derivatives doped with antimony and selenium are 
the best-known materials for room-temperature thermoelectric applications. 
Electrodeposition has been recognized as an effective method for preparing 
these materials due to its unique features: simple, fast, low cost, and capable 
of selective deposition. The number of publications on electrodeposition of 
thermoelectric materials, particularly bismuth telluride-based materials, has 
increased by a factor of three in the last decade. Most of these studies dealt 
with the electrodeposition of bismuth telluride from acidic solutions using 1 M 
HN0 3 that can dissolve only about 20 mM Te species, which in turn limits the 
Bi concentration in order to obtain the correct alloy composition (Bi 2 Te 3 ). The 
low metal ion concentration causes composition inhomogeneities across the 
film thickness due to the unbalanced depletion of Bi and Te species in plating 
bath. The rather low mass transport in solution also limits the maximum 
deposition current and hence growth rate that can be attained. Magri 
et al. (1996) presented electrochemically prepared bismuth telluride alloy 


films of uniform thickness from a solution containing Bi 3+ and HTe0 2 ^ ions 

Q 

in 1 mol dm nitric acid (pH=0) on stainless steel. The electrodeposited 
films were found to be monophasic, poly crystalline structure and their stoichi- 
ometries vary according to the solution composition. 

In another report (Chen et al. 2010), the bismuth telluride film and nano wires 
array were fabricated by potentiostatic electrodeposition, as shown in Fig. 9. Both 
materials were found to exhibit slightly Te-rich, n-type Bi 2 Te 3 with preferred 
orientation in rhombohedral structure, and they were found to show much better 
thermoelectric performance than their counterparts in bulk. 

Nonaqueous electrolytes often provide advantages such as wide electrochemical 
window, good thermal stability, and high metal ion solubility. Tellurium and 
tellurium alloy films have been electrochemically prepared from dimethyl sulfoxide 
(DMSO) and dimethyl formamide (DMF). Bismuth antimony thermoelectric 
nano wires were prepared from DMSO (Paunovic 2007). Li and co-workers 
used DMSO for the electrodeposition of bismuth telluride and bismuth antimony 
telluride materials. 

In summary, the electrochemical deposition is an effective way to fabricate 
non-oxides with the controlled morphologies for energy storage applications. 
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Fig. 9 Scanning electron micrographs of Bi 2 Te 3 films: (a) planar and (b) cross-sectional view 
of as-prepared film and (c) side view of Bi 2 Te 3 nano wires array partially embedded in AAM. 
(d) Separated nano wires after AAM being dissolved (Chen et al. 2010) 


In particular, the combination of electrochemical deposition and solvothermal can 
result in high-crystalline, high-quality non-oxide nanomaterials with improved 
performance. 


Conducting Polymers 

There are several unique properties of conducting polymers (CPs), which distin- 
guish them from traditional organic coatings. Like metals, CPs are electronically 
conducting, so galvanic coupling with the metal is expected, with concomitant 
alteration of corrosion behavior. CPs are redox active materials, with potentials that 
are positive of iron and aluminum. Thus, as with chromate, interesting and poten- 
tially beneficial redox interactions of CPs with active metal alloys such as steel and 
aluminum are anticipated, again with concomitant alteration of their corrosion 
behavior. The ability of a CP to function as a reservoir for the release of dopant 
ions that can act as corrosion inhibitors may also be an important distinguishing 
feature (Tallman et al. 2004). 

One factor that limits the application of CPs as corrosion control coatings is the 
difficulty in casting such polymers as films or coatings. Since most common 
conjugated conducting polymers (e.g., poly aniline, polypyrrole, or polythiophene) 
are insoluble in environmentally friendly solvents such as water or alcohol, pro- 
cessability is a key issue. Solvent-soluble polymers can be synthesized by 
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polymerization of derivatized monomers containing either an ionic side chain (for 
water solubility) or an alkyl side chain (for organic solvent solubility). For example, 
a long-term corrosion protection of an aluminum alloy (A1 2024-T3) by a poly 
(3 -octyl pyrrole) primer with a polyurethane topcoat was reported by Gelling 
et al. (2001), where the CP was cast from a carbon tetrachloride/ 
dichloromethane/acetonitrile solvent mixture. 

The direct electrodeposition of CPs onto active metals is appealing but is 
generally not feasible due to the positive potentials required for polymeriza- 
tion. At such potentials, the metal oxidizes (corrodes) and an adherent, con- 
tinuous film is not deposited unless special surface preparation and electrolytes 
are used, and even then only at high monomer concentrations (Beck and 
Hiilser 1990). 

Electron-conductive polymers were first reported in 1971 by Shirakawa and 
co-workers. They synthesized conducting poly acetylene and found that it had a 

Q 1 

considerably high conductivity relative to other organic compounds, 10 Scm . 
Since 1971, various conducting polymers and their synthesis mechanisms have 
been studied actively by many researchers. Basically, the conducting polymers 
have electrons that are delocalized in p-conjugated systems along the whole 
polymer chain; their electron conductivity is much higher than that of other poly- 
mers that do not have a p-conjugated system. 

Furthermore, in 1979, Diaz and co-workers synthesized conductive polypyrrole 
film by electropolymerization. Electropolymerization is similar to electrodeposi- 
tion of metal films. Its polymerization reactions occur electrochemically; that is, 
electrons go through the electrode-electrolyte interface when a potential is applied 
to an electrode. In the case of an insoluble polymer generated on the electrode, a 
film of conducting polymer is thus obtained easily. Essentially, regarding electron 
transfer, all electrochemical reactions are classifiable into oxidative or reductive 
reactions. Electrochemical polymerizations can be classified in a similar way. In 
almost all cases, electropolymerized polymers have been synthesized by 
electrooxidation. It seems, however, that films obtained by this method contain 
branching or cross-linking defects. On the other hand, electroreductive polymeri- 
zation performed by using a zerovalent nickel complex is sensitive to the halogen 
atom in the monomer, and therefore it yields polymers with well-defined linkages 
among the monomer units. 

Principle 

Very recently, Armel et al. (2012) proposed two different methods to deposit 
metallic and semiconducting nanoparticles inside a conducting polymer (CP) by 
controlling the diffusion of an active ion within the film (Fig. 10). The pulsed 
electrodeposition was used as primary method, where the “off’ intervals allow the 
ions to flow into the CPs as they reoxidize. The second method involved poly 
(ethylene glycol) (PEG) or ionic liquid (IL) to assist the transportation of the ion 
within the CP. Various nanoparticles such as Pt, Ni, Cu, and CdS were successfully 
deposited inside a conductive film of poly(3,4-ethylenedioxythiophene) (PEDOT) 
by adopting the aforementioned techniques. 
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Fig. 11 Images of electrodeposited Pt nanoparticles in PEDOT obtained by applying pulsed 
reduction current of —600 pA for five cycles in an aqueous solution containing 5 mM H 2 PtCl 6 and 
1 M H 2 S0 4 (amount of charge is 0.12C cm -2 ) at 0.21 V vs. SCE. (a) Cross-section SEM of 
PEDOT-Pt film, (b) cross-section SEM of PEDOT:PEG-Pt, and (c) EDX images confirming Pt 
nanoparticles present throughout PEDOT:PEG-Pt film. Scale bar for (a) and (b) is 100 nm and for 
(c) is 400 nm. In each case, the composite film is below the FTO layer to which it is attached 
(Armel et al. 2012) 


The morphological examination of prepared structures revealed that the 
nanoparticles were well dispersed throughout the films and formed the open and 
layer structures in the presence of PEG as shown in Fig. 11. 

Using a phosphonium-based (triisobutylmethylphosphonium tosylate) IL and 
heat during electrodeposition results in the incorporation of CdS inside a layered 
polymer scaffold as shown in Fig. 12. The presence of the nanoparticles throughout 
the film was confirmed by ED AX measurements. This study provided facile routes 
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Fig. 12 Cross-section and surface SEM image of PEDOT:PEG-CdS fabricated using chronopo- 
tentiometry (method 1). Scale bar is 100 nm. The potential was —0.795 V (Armel et al. 2012) 

to functionalize CP electrodes with either organic or inorganic catalysts by assum- 
ing the crucial interconnection between the catalysts and the polymer. 

In another case study, Luo and Cui (201 1) presented a conducting polymer -based 
smart coating with magnesium (Mg) implant to improve the corrosion resistance of 
Mg and also release drug in a controllable fashion. By considering the facts that the 
ionic liquids are highly conductive and stable solvent with a very wide electrochem- 
ical window, they directly electrodeposited conducting polymer coatings poly 
(3, 4-ethylenedioxy thiophene) (PEDOT) on the active metal Mg in ionic liquid at 
mild conditions, and Mg remained considerably stable during the electrodeposition. 
The coating remained uniform and significantly improved the corrosion resistance 
of Mg as shown in Fig. 13. 

In addition, the PEDOT coatings can load the anti-inflammatory drug dexameth- 
asone during the electrodeposition which can be subsequently released upon elec- 
trical stimulation, which is illustrated in Fig. 14. 

From this study, it is expected that the proposed CPCs could be electrodeposited 
on other active metals and alloys besides pure Mg, and such CPCs with drug- 
releasing properties may find applications in Mg-based implantable devices. 

Electrodes coated with conducting polymer films have attracted considerable 
interest in the last two decades (Tallman et al. 2004; Luo and Cui 2011; Murotani 
et al. 2008; Li and Albery 1992). A multitude of reviews and monographs have been 
written on the subject. Many electrochemical works were performed with aromatic 
or conjugated compounds yielding insulating polymer too. But few studies were 
made up to now with aliphatic molecules concerning their electropolymerization 
behavior. For example, poly(3,4-ethylenedioxythiophene) (PEDOT) is one of most 
attractive electrochromic materials. Cho et al. (2005) presented nanotube structure 
of PEDOT by using electrochemical deposition in the pores of the alumina template 
film. The detailed mechanism of nanotube formation was also presented in these 
studies shown in Fig. 15. 

In summary, the conducting polymers can combine the electronic characteristics 
of metals with the engineering properties of polymers. In the past 40 years, 
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Fig. 13 SEM images (a and 
b) of PEDOT/IL coating 
electrodeposited on Mg using 
cyclic voltammetry (Luo and 

Cui 2011) 
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considerable knowledge of such materials has been gained and this is reflected in a 
number of texts and early reviews. It is notable that novel electrolyte and ionic 
liquid have important applications for depositing conducting polymers. As the ionic 
liquid is a highly conductive and stable solvent with a very wide electrochemical 
window, the conducting polymer coatings can be directly electrodeposited on the 
substrates in ionic liquid under mild conditions. 


Recently Reported Applications of Electrochemical Process 

Microfabrication 

Electrochemical microfabrication offers some unique advantages over competing 
technologies and therefore finds increasing applications in the electronics and 
microsystem industries (Datta and Landolt 2000). In the mid of the nineteenth 
century, electroforming was applied to precision manufacturing of printing plates 
for bank notes. Electrochemical technology entered the electronics industry some 
50 years ago as a manufacturing process for low-end printed circuit boards. Today, 
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Fig. 14 (a) Electrically 
controlled drug release from 
different systems in 
comparison to diffusion. 
PEDOT/IL/Dex/Mg, PEDOT 
coating with Dex 
electrodeposited on Mg; 
PEDOT/IL/Mg, PEDOT 
coating without Dex 
electrodeposited on Mg; Mg, 
bare Mg electrode, (b) 
Accumulated drug release of 
the PEDOT/IL/Dex/Mg upon 
multiple electrical 
stimulation. The error bar 
represents the standard error 
of the mean ( n = 6) (Luo and 

Cui 2011) 
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Fig. 15 Schematic of fabricating PEDOT nanotubes (Cho et al. 2005) 


electrochemical technology is employed widely for the processing of advanced 
microelectronic components, including high-end packages and interconnects, thin- 
film magnetic heads, and microelectromechanical systems (MEMS). 

Electrochemical microfabrication technology includes cathodic, anodic, and 
open-circuit processes. The latter processes are often referred to as chemical 
processes but they involve cathodic and anodic partial reactions which follow 
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Table 1 Electrochemical processes used in microfabrication (Datta and Landolt 2000) 


Type 

Process 

Description 

Cathodic 

Electrodeposition, electroplating 

Electrochemical deposition of metals, 
alloys or semiconductors with direct or 
pulsating current 

Cathodic 

Electroforming 

Fabrication of three-dimensional shapes 
by electrodeposition 

Cathodic 

Through-mask electroplating, micro- 
electroforming 

Selective electrodeposition of micro- or 
nano structures using lithography or 
maskless techniques for defining the 
shapes 

Cathodic 

Electrochemical cleaning 

Surface cleaning by cathodic hydrogen 
evolution 

Anodic 

Electrochemical micromachining, 
electroetching, anodic etching, 
electrochemical photoetching 

Selective anodic dissolution of a 
patterned electrode serving for 
precision shaping and surface 
structuring 

Anodic 

Electropolishing 

Anodic dissolution leading to leveling 
and brightening of the dissolving 
surface 

Anodic 

Anodization 

Growth of an oxide him by anodic 
polarization 

Anodic 
or open 
circuit 

Tunnel etching 

Formation of deep narrow tunnels on 
aluminum used for increasing the 
surface area of electrolytic capacitors 

Open 

circuit 

Electroless deposition, electroless 
plating 

Chemical deposition of a metal or alloy 
by reaction of dissolved ions with a 
reducing agent 

Open 

circuit 

Chemical polishing 

Chemical dissolution in presence of an 
oxidizing agent leading to leveling and 
brightening of the dissolving surface 

Open 

circuit 

Chemical-mechanical polishing (CMP) 

Simultaneous mechanical and chemical 
metal removal used for planarization in 
multilevel chip manufacturing 

Open 

circuit 

Chemical milling, chemical etching, 
chemical micromachining 

Selective chemical dissolution of a 
patterned work-piece serving for 
precision shaping and surface 
structuring 


electrochemical laws. An overview of different processes used in electrochemical 
microfabrication is presented in Table 1. 

Electrochemical microfabrication is a relatively new technology which is rap- 
idly evolving as a technology of choice in the electronics and microsystem indus- 
tries. The successful implementation of electrochemical processes in the 
manufacturing industry has become possible thanks to progress achieved through 
fundamental research in electrochemistry, electrochemical materials science, and 
electrochemical engineering. 
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An improvement of the acoustic liquid manipulation (ALM)-assisted 
electroplating process by a potential tool-less fabrication (TLF) technique for 
microfabrication is reported by Gadkari and Nayfeh (2008). A plated dot size of 

0. 031. in., using highly backed ultrasonic transducers, was successfully achieved. At 
the start of electroplating process, an ion-depleted layer was formed at the cathode 
which acts as a natural mask. A fine stream of fresh electrolyte formed by acoustic 
streaming by an ultrasonic transducer was also directed to refresh the ion-depleted 
layer at the cathode which caused the plating process to continue at a localized 
point. 

In their work, several key factors that can affect the quality of plating process 
were identified as (Gadkari and Nayfeh 2008): 

1. Transducer type: It was determined that the minimum dot size in the focal plane 
was a function of the size of the transducer element, the size of the focal point, 
and the position of the target relative to the focal plane. To this end, a smaller 
element size transducer was used (0.25 in.) with a focal distance of 0.5 in. in the 
electrolyte fluid which resulted in a smaller-diameter stream along with reducing 
the stream spread. 

2. Streaming power level: It was determined that the streaming power is critical to 
the process; higher power causes turbulence and stirring in the tank and results in 
a less-coherent geometry of the plated area. To this end, a highly damped 
transducer was used instead of the free ringing ones used in the previous work. 
This resulted in a reduced forward acoustic power and reduced the harmonics 
which both served to make the stream more laminar and reduced the beam 
spread. 

3. Plating current and ion concentration levels: The relation between concentration 
of the ions in the electrolyte and the plating current is an important factor 
affecting the plating size and quality. Low ion concentration coupled with higher 
plating currents yielded the best plating results. The ideal situation would be 
where the rate of deposition of the copper ions on the target plate equaled the rate 
of replenishment of the fresh ions to the target plate. 

4. Time of plating: As the time of plating increases, the plating thickness increases. 
This factor is important for building plating structures with thickness. The aspect 
ratio of structures by varying the time of plating could not be controlled. 

Finally, although this work demonstrated plating copper on steel, however, the 
same process can be duplicated for plating other materials. 

Three-dimensional (3D) micromachined objects are difficult to make by using 
current techniques; there are few alternatives to using a large number of process 
steps and masks. A facile approach to generate 3D microfabricated structures using 
very few steps and a single photolithographic mask was presented by Lavan and 
co-workers (2003). This new approach relies on a conductive template, which can 
be produced using conventional lift-off microfabrication, or by other means such as 
self-assembly or printing and then an electrodeposition step to produce the full 
structure. At first, gaps between regions of the template were intentionally 
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Fig. 16 Schematic representation of the deposition technique for polypyrrole (the method is 
similar for metal films, but they are deposited on the cathode), (a) Starting with isolated conductive 
patterns on a surface, the deposition initiates at region(s) connected to the anode, illustrated here 
with a wire; (b) the deposited film grows horizontally and vertically from the initial region(s); (c) 
the deposited film bridges to new regions, with deposition continuing over the larger surface; (d) 
the deposited film bridges to another region; the relative heights of each region are determined by 
the spaces between them (LaVan et al. 2003) 


introduced, and then the design of the conductive template determined the full 3D 
structure. As material is deposited, it expands both vertically and horizontally; the 
horizontal expansion bridges the spaces between the conductive regions. Once that 
space is bridged, the electrodeposited material forms an electrical connection with 
the new region and deposition continues on the existing structure, as well as 
initiating at the newly connected region. Figure 16 provides a schematic illustration 
of this process. If a small difference in height were desired between adjacent 
structures (regions), the gap would be small; a large difference in height is created 
with a larger gap. 

Two different materials, polypyrrole (PPy) and Ni with a noticeable difference 
in growth rates and growth behavior, were prepared. The rate of electropoly- 
merization for PPy is 780 nm min -1 vertically and 1,000 nm min -1 horizontally, 
while nickel had a deposition rate of 105 nm min -1 , both vertically and horizon- 
tally. The 4:3 lateral-to-vertical growth ratio for PPy creates a well-defined faceted 
profile with the polymer growing slightly faster outward than upward (Fig. 17a). 
The nickel grows uniformly in all directions, which results in the edges of the 
pattern having a radius equal to the film thickness (Fig. 17b). Depending on the 
application, one or the other of the materials may be more desirable. The PPy 
surface appears smoother than the nickel surface, and it deposits more rapidly. The 
sidewalls of the nickel structures, however, are closer to vertical than the 
corresponding PPy patterns. Vertical sidewalls, or a limit on horizontal expansion, 
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Fig. 17 Test patterns in the 
form of an array of unattached 
circles. The original patterns 
are 200 pm in diameter and 
10 pm apart. In each case, 
deposition originated in the 
lower-left corner of the 
image, from the circle 

attached to the anode, (a) PPy 

2 

pattern grown at 3 mA cm 
for 48 min (the middle 
column has bridged eight 
gaps); (b) nickel structure 

O 

electroplated at 3 mA cm 
for 14 h (the middle column 
has bridged six gaps) (LaVan 
et al. 2003) 



could be obtained by introducing a sacrificial boundary, such as one made from 
photoresist, around the conductive patterns. 

The abovementioned technique opened the possibility of new methods to fabri- 
cate multilevel structures such as electrodes, interconnects, gratings, and photonic 
lattices. The next step could be to develop approaches to augment these structures 
with additional films patterned over the first layer. An additional area to explore is 
the ability of this method to bridge gaps to form low-impedance connections 
between devices and substrates produced by self-assembly or fluidic self-assembly. 
Finally, this approach could be used to modify the LIGA process (electroforming 
and molding process (referred to by its German acronym, Lithography, 
Galvanoformung, Abformung, or LIGA) of microelectromechanical systems 
(MEMS)) where regions of a microdevice are deposited to regulate the effect of 
current densities or to create composite or gradient materials from a single mold or 
pattern. 

Chip-Interconnect Metallization : A paradigm shift in interconnect technology 
was announced by IBM in 1997 when the vacuum-deposited Al was changed to 
electroplated Cu. Since then, most of the leading chip manufacturers have 
converted to electroplated Cu technology. Relative to comparable Al interconnect, 
Cu interconnect has advantages of significantly low resistance, higher current 
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Fig. 18 Cross-sectional view of the fabrication process of Cu interconnects 


carrying capability, and increased scalability (Datta 2003). For similar dimensions, 
electromigration lifetime of Cu interconnects is more than 100 times longer than for 
Al lines. Cu metallization, therefore, supports much higher current density speci- 
fications and makes it extendible to finer dimensions and pitches. Figure 18 shows a 
typical fabrication process of Cu interconnects. 

Further miniaturization trends of the microelectronics and microsystem indus- 
tries provide new opportunities and challenges which include issues related to 
nanoscale structuring, fabrication of high aspect ratio structures, new functional 
alloys, multidimensional interconnects, and automated large-scale processes 
including additive control and recycling of electrolytes. To successfully overcome 
such challenges, fundamental as well as technology-oriented research will be 
needed and interactions between the two should be stimulated. 

Several techniques have been developed to fabricate MEMS -based through- 
wafer interconnections on silicon or silicon-on-insulator (SOI) substrates. In gen- 
eral, cavities formed by wet anisotropic etching or vertical through-wafer vias 
formed using the Bosch process, or the combination of the two, are used in the 
fabrication of through-wafer interconnects. The electrical connection from the front 
side of the substrate to the backside is typically made by metal or polycrystalline 
silicon layers formed on the cavity surface or through metal-filled vias. 

One of the common techniques is to fabricate a separate wafer with through- 
wafer interconnects and bond the die with circuitry or interconnections on the front 
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and backside of the wafer with interconnects. Typically, as the perforations for the 
via holes are formed on the substrate, a hole sealing process or separate substrate 
with seed layer is required to have a void-free fully metal-filled through-wafer 
interconnect. The vertical through-wafer via whose height equals the substrate 
thickness can be fabricated with deep silicon etch using the Bosch process and 
subsequent conductive material deposition inside via holes, but the aspect ratio is 
limited by the inherent characteristics of the Bosch process itself. For vertical 
trenches with a closed bottom surface, a void-free bottom-up fill with electroplated 
metal can be obtained with the aid of additive control, fountain plating, reverse 
pulse plating, and dissolve oxygen enrichment. 

Magnetic heads : The electrodeposition of soft, high magnetic moment alloys 
has become the critical fabrication step in manufacturing of magnetic recording 
heads. The need for ultimately high magnetic moment alloys and the electrode- 
position process capable of delivering magnetic structures with dimension in the 
range of several tens of nanometers are the future trends for next-generation 
magnetic recording devices. Films of CoNiFe ternary alloy with high saturation 
magnetic flux density B s and low coercivity, H c were successfully produced by 
electrodeposition. Osaka reported a typical film, designed as “HB -CoNiFe,” had 
the composition of Co 65 Ni 12 Fe 2 3 (at.%) with B s = 2.0-2. 1 T and H < 2 Oe 
(Osaka 2000). The key to the success in obtaining low H c with high B s was to 
form film with very fine crystals. The inclusion of small amount of sulfur was 
found to be essential for producing such a film with the desired magnetic 
properties. The film has been applied to the construction of a new type of 
merged-GMR head, which is considered as a breakthrough for materializing 
ultrahigh-density magnetic recording. The schematic diagram and morphology 
of GMR is presented in Fig. 19. 

Microelectromechanical systems ( MEMS ): Electrodeposited metal films may 
play an important role in MEMS technology, as it provides an easy, simple, and 
low-cost technology for MEMS device development with comparable mechanical 
properties to the silicon-based material set, while offering an enhanced set of 
properties for transducers. Sensors and actuators based on plated metals have 
been fabricated and studied, including thermal actuators, microcoils, micromotors, 
and pneumatic actuators. 

Ni and NiFe are the electroplated metals most commonly used for MEMS 
devices. Usually, electroplated nickel film can be used to vacuum seal a MEMS 
structure at the wafer level. The package is fabricated in a low-temperature 
(<250 °C) 3-mask process by electroplating a 40-pm-thick nickel film over an 
8-pm sacrificial photoresist that is removed prior to package sealing. A large fluidic 
access port enables an 800 x 800-pm package to be released in less than 
3 h. MEMS device release is performed after the formation of the first-level 
package. The maximum fabrication temperature of 250 °C represents the lowest 
temperature ever reported for thin-film packages (previous low ~400 °C). Imple- 
mentation of electrical feedthroughs in this process requires no planarization. 
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Fig. 19 SEM image and 
schematic illustration of 
GMR head structure (Osaka 
2000) 



CoNiFe Film 



GMR element Lower shield (NiFe) 


Energy Conversion and Storage 

Electrodeposition of metals, alloys, metal oxides, and conductive polymers plays a 
pivotal role in fabrication processes of some recently developed energy devices, 
such as fuel cells, supercapacitors, solar cells, and batteries. 

Fuel cells : Porous platinum, bringing together high catalytic activity and a high 
catalyst surface area, is of high interest for fuel cells, as electrodes for 
electrooxidation of hydrogen, methanol, ethanol, formic acid, and glucose as well 
as for reduction of oxygen consist of platinum or platinum-alloy catalysts. Several 
strategies exist for the fabrication of porous platinum electrodes ranging from the 
use of nanoparticles, over electrodeposition, to dealloying. 

Electrodeposition enables a facile fabrication of strongly adherent 
electrocatalysts on a conductive substrate within only one step. Techniques such 
as pulsed galvanostatic deposition (alternation of deposition and concentration 
relaxation) enable the generation of dendritic structures. Sacrificial templates 
such as microspheres, liquid crystals, polymers, or mesoporous structures (e.g., 
silica or alumina membranes) can be used to additionally control/design the pore 
structure on the corresponding scale (50 nm to 10 pm) (Kloke et al. 2012). 

Supercapacitors : Electrochemical supercapacitors play an important role in 
power source applications such as hybrid electrical vehicles, computers, and 
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Fig. 20 The structure of a 
supercapacitor 
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Fig. 21 SEM images of mesoporous Co 3 0 4 nanosheets on carbon foam for supercapacitor 
application 


short-term power sources for mobile electronic devices. They can be classified into 
two types based on their charge storage mechanisms: (i) electrical double-layer 
capacitors (EDLCs) and (ii) redox supercapacitors. Compared to the EDLC-based 
capacitors, redox capacitors based on transition-metal oxides or conducting poly- 
mers such as Ru0 2 , Mn0 2 , NiO, Co 3 0 4 , V 2 0 5 , and polyaniline may provide much 
higher specific capacitances up to 1,000 Fg _1 of the active material. For example, 
C 03 O 4 nanosheets/carbon foam with excellent supercapacitor characteristics was 
successfully fabricated without using metal substrates. The experimental results 
demonstrate that the as-deposited Co 3 0 4 nanosheets (Figs. 20, 21) exhibited an 
ideal capacitive behavior with a maximum specific capacitance of 106 F/g in 1 M 
NaOH solution at a scan rate of 0.1 V/s. 

Among the existing synthetic approaches to the transition-metal oxides elec- 
trodes for supercapacitors, electrochemical techniques are of great interest due to 
their unique principles and flexibility in the control of the structure and morphology 
of the film materials. The main advantage of the electrodeposition technique is its 
relatively easy and accurate control of the surface micro structure of metal oxides 
with high surface area by changing deposition variables, such as electrolyte, 
deposition potential, and bathing temperature. 
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Recently, graphene-metal oxide hybrid supercapacitors have attracted much 
research interest. As graphene is an ideal building block in composite materials 
combined with a variety of inorganic compounds, it is expected that fabrication 
of graphene/metal oxide hybrid nanomaterials will be an effective and practical 
method to overcome the abovementioned challenge. In such hybrid materials, 
the graphene acts as 2-dimensional conductive template or 3 -dimensional 
conductive porous network for improving the poor electrical properties and 
charge transfer pathways of pure oxides. In addition, graphene provides 2D 
support for uniformly anchoring or dispersing metal oxides with well-defined 
size, shapes, and crystallinity; these metal oxides not only provide a high 
capacity but also suppress the agglomeration and re-stacking of graphene and 
thus to increase the effective surface area of the graphene, resulting in high 
electrochemical activity. 

In situ electrodeposition can be applied to prepare the metal oxide 
nanostructures on graphene. The good conductive graphene will be used for the 
direct electrodeposition of nanostructural metal oxides without using other binders 
or conductive substrates. In addition, the electrodeposition is a simple approach to 
fabricate well-defined metal oxide nanostructures with excellent adhesion on the 
substrates in ambient temperature and atmosphere. In particular, the metal oxide 
nanostructural arrays, such as nanorod and nanotube arrays, offer several important 
advantages for the electrodes: as they possess abundant active sites and large 
surface areas, each nanorod/nanotube acts as a direct ID pathway for the electron 
transport, more favorable for the enhancement of the specific capacitance and rate 
performance of the supercapacitors. 

Solar cells : Electrodeposition has been shown to provide improved benefits in 
several solar cell technologies. It is one of the most selective processes because 
deposition only occurs at positions on a substrate where the substrate conductivity 
is highest. To date, significant progress has been made toward the scaling up of 
electrodeposition processes (both electrolytic and electroless) in microelectronics 
for printed circuit boards and for semiconductor wafers. The scale of substrates for 
the solar industry varies between 156 mm" to m and within the range of the 
microelectronics processing. This scaling has been done on both rigid and metallic 
flexible substrates for different solar cell applications. 

In a study, perylene bis(phenethylimide) (PPEI) was deposited on nonporous 
Ti0 2 electrode (Zaban and Diamant 2000). PPEI layer has a wide spectrum 
throughout the visible range resembling the polycrystalline form of this organic 
semiconductor, therefore the high optical density solar cell can be fabricated by 
only a thin layer of the deposited resistive PPEI film. It was found that liquid 
electrolyte is a quencher; the new electrode could operate as a front electrode in a 
high surface area bilayer OSC solar cell. The schematic view of solar cell is shown. 

Wei et al. exhibited three kinds of Cu 2 0/ZnO hetero structure solar cell synthe- 
sized by electrochemical deposition (Wei et al. 2011). In their synthesis process, 
ZnO film, nanowires, and nanotubes were grown on the FTO first and the Cu 2 0 was 
further coated on the ZnO. The absorption spectra of FTO glass, ZnO film, and 
Cu 2 0 film are shown in the Fig. 22. The absorption spectrum of the Cu 2 0/ZnO 
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Fig. 22 Structure schemes of the three kinds of solar cells based on Cu 2 0/ZnO heterojunctions 
(Wei et al. 2011) 


bilayer exhibits two absorption edges at 380 nm and 600 nm, which correspond to 
the band gap absorption of ZnO and Cu 2 0 films, respectively. 

Another example of optimizing electrochemical conditions for fabricating Cu 2 0 
with a designed surface structure was explored to improve the photovoltaic perfor- 
mance of solar cells (Wei et al. 2012). The enhanced conversion efficiency of Cu 2 0 
film solar cell was carried out by tuning the homojunction interface crystal orien- 
tation and forming a pyramid-like textured surface. By tuning the n-Cu 2 0 film 
growth orientation from [100] to [111] to match the p-Cu 2 0 orientation, the 
formation of interface states during the homojunction epitaxial growth can be 
restrained, which can effectively prevent recombination of electrons in n-Cu 2 0 
with holes in p-Cu 2 0 at the interface region and enhance the built-in potential. 

Li-ion battery : For electrochemical energy storage device, one effective strategy 
to achieve high reversible capacities is to prepare nano structured electrodes with 
sufficient buffering space, e.g., porous, ID structures to alleviate the volume 
swings. Electrochemical process has been widely applied to fabricate nanostruc- 
tured electrodes. For example, interconnected nanoflakes of ZnSb for Li-ion battery 
anode applications were prepared by a template-free electrochemical deposition 
process under high overpotential conditions (Saadat et al. 2011). In the synthesis 
part, copper substrate was immersed into the solution containing ethylene glycol, 
ZnCl 2 , and SbCl 3 under the applied potential in the range of 3-9 V at room 
temperature without any stirring or bubbling inert gas. 


Drug Delivery 

Electrochemical methods, including electroplating and electroless plating methods, 
can be used to create micrometer-scale surface morphologies to induce specific 
types of cellular differentiation or to attract and/or repel a specific type of cell that 
can be used to increase the radiopacity of the device, to produce nanocomposite 
coatings on a device, to enhance or control drug delivery from a device, or to 
accommodate a nonmetallic coating. They can even be used to create a complete 
medical device. Normally, the electrochemical corrosive processes tend to desta- 
bilize surfaces and can undermine the mechanical stability of a device. 
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Micro- and nanofabrication technologies provide the possibility of designing 
small-scale drug carriers capable of delivering precise doses of drugs as near the 
target as possible. Ideally, drug delivery systems should be sufficiently miniaturized 
to reach the disease site but with enough volume to accommodate a relevant dose of 
drug (tens to hundreds of micrograms to be comparable with the current doses of 
potent) systemic drugs (LaVan 2003). In addition, they should satisfy biocompat- 
ibility and nontoxicity standards throughout their desired time of action. Finally, the 
device should be traceable and able to release drugs on demand based on external 
commands or local biochemical changes. 

The intelligence of these devices can be defined during fabrication by choosing the 
appropriate combination of materials and methods. Chitosan, a polycationic saccha- 
ride derived from naturally available chitin, is an ideal candidate for medical and 
pharmaceutical applications due to its intrinsic properties. It is abundant, biocompat- 
ible, biodegradable (being metabolized by certain human enzymes), and bioadhesive 
and appears to act as a penetration enhancer by opening tight epithelial junctions. The 
pKa value of its primary amine groups (around 6.5) allows for the formation of 
pH-responsive and functionalizable hydrogel films for a large variety of applications, 
such as dmg delivery systems, scaffolds for tissue engineering, and targeted radio- 
therapy. These functions can be incorporated into lab-on-a-chip devices by different 
fabrication methods, including casting, printing, and self-assembly. However, only 
electrodeposition provides the unique ability to create three-dimensional structures of 
various size and shape at a low cost and can conformally coat structures using local 
pH gradients on negatively charged electrodes. 

In a recent study, Fusco and co-workers (2013) presented co-electrodeposition of 
drug-loaded chitosan hydrogels to functionalize steerable magnetic microdevices. 
The characteristics of the polymer matrix have been investigated in terms of drug 
loading, and release responses are shown in Table 2 and Fig. 23. 

The effect of the environmental conditions on the dmg release from as-deposited 
chitosan films was also investigated and shown in Fig. 22. When immersed at pH 6.0, 
the hydrogels release almost the entire loaded dmgs in only 72 h. This amount was 
five times higher than the release over the same period at pH 7.4, shown in Fig. 24. 

This work provides a method to incorporate functional smart materials onto 
wirelessly controlled magnetic microdevices using electrodeposition. In the final 
design, magnetic microcylinders form the core of the device, while chitosan 
hydrogels form the pH-sensitive drug delivery device. Chemically mild and 
straightforward dip coating processes tailor the release profiles of BG, an ionic 
model drug used as a topical antiseptic, from the chitosan gels. The platform and the 

Table 2 Comparison between the amounts of BG loaded in the non-treated and treated 
electrodeposited chitosan hydrogels as measured after full release at pH 6 by spectrophotometry 


(Fusco et al. 2013) 

Condition 

2 

Amount of drug loaded (pg/cm ) 

As deposited 

77.83 d= 10.21 

Crosslinked pH 5, 30 min 

42.15 d= 9.4 

Crosslinked pH 8.5, 30 min 

80.33 d= 6.32 
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Fig. 23 Cumulative release 
vs. time from the different 
hydrogel layers maintained in 
PBS at pH 7.4. The data 
represent an average of a 
minimum of five samples 
coming from different sets of 
experiments, analyzed over 
3 weeks. Electrodeposited 
chitosan (7) behavior can be 
modified by ionic cross- 
linking at acid (2) or basic pH 
( 3 ) (Fusco et al. 2013) 



Fig. 24 Effect of pH on the 
drug release from 
as-electrodeposited and not 
neutralized chitosan layers. 
BG is fully released from the 
matrix in approximately 
3 days, with a kinetics that 
differs from the one at pH 7.4 
(Fusco et al. 2013) 
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study proposed here represented new insight into creation of complex microdevices 
able to sense, act, and react to a complex environment such as the human body. 

There is no doubt that the field of electrochemistry and its continual progress will 
have substantial potentials for the future of medical devices and would be viewed as 
an economical, simple, yet powerful technique to modify and create biomimetic 
surfaces and medical devices. 


Summary 

In this chapter, an overview of work directed toward the electrochemical process in 
manufacturing was presented. Different types of materials, including metals, metal 
oxides, non-oxides, as well as conducting polymers, are addressed. For metals, their 
micro structure, particularly in low dimension, as well as mechanical properties and 
optical properties were addressed. Besides, the morphology control and growth 
behaviors of metal oxide thin layers under electrochemical process were described 
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in detail. In addition, the electrochemical process in fabricating silicon, non-oxides, 
and conductive polymers was also discussed. Finally, applications of electrochem- 
ical process in various areas, such as microfabrication, energy conversion and 
storage, and drug delivery, were discussed. In future, the investigation of electro- 
chemical process will be continuously exciting and highly rewarding and there is a 
lot of space for innovating and developing: 

1 . Electrochemical process is a low-energy process and therefore uniquely suited to 
applications that need to deal with modification of soft matter of various types. It 
could be combined with self-assembled templates to prepare nanomaterials with 
exciting properties. 

2. Great achievements with the manufacturing of a number of materials and their 
assemblies have been realized, while significant challenges still exist in low-cost 
deposition process, especially preparation of uniform arrays at large scale. 

3. There are still huge challenges in the delivering magnetic structures with 
dimension in the range of several tens of nanometers. These challenges could 
be addressed by full control of the conditions at the electrochemical interface for 
additive adsorption and incorporation, iron hydroxide formation and incorpora- 
tion, and by careful design of the bath chemistries and processes that would yield 
the high moment alloys with the optimum magnetic properties. 

4. Electrochemical process for solar cells is an active area of research, and it has 
been shown to provide improved benefits in several solar technologies. For the 
silicon solar cell contacts, fabrication of metal contacts may result in solar cell 
efficiency improvement due to a lower processing temperature with better yield 
than screen printing, improved contact resistance, lower overall resistance, and 
the ability to fabricate small contacts that reflect the sunlight less. 

5. The process of electrochemical deposition can be deployed to prepare simple, 
economical, and effective medical devices. 
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Abstract 

Electrochemical deposition is one of the effective and inexpensive processes in 
surface coating technology. It is of great interest to produce electrochemical 
deposits with dense structure and good mechanical properties. The electrochem- 
ical deposited nickel and nickel-cobalt are the most common materials used in 
decorative coatings and micro-fabrications. In this chapter, the authors will 
briefly review characteristics of nickel and nickel-cobalt deposits under differ- 
ent electrochemical treatments and introduce two effective procedures to 
enhance the strength of deposits: one is to vary the applying potential and the 
other to alter the temperature of the electrochemical cell. Both experimental and 
simulation results show that pulse electrodeposition leads to higher concentra- 
tion of ions at cathodic surface and better penetration ability of ions than that by 
direct current (DC) one. More compact and hard deposits are formed during 
pulse electrodeposition compared to DC one. Moreover, usually electrodeposi- 
tion is difficult to perform at low temperature due to ineffective mass transfer. 
Through careful control of the output power to the electrochemical cell, the 
resulting deposit at low temperature presents higher strength than those 
electrodeposited at relatively high temperatures. The possible mechanism of 
strengthening and the corresponding electrochemical phenomena are also 
introduced. 


Introduction: Electrochemical Deposition and Application 

Electrochemical deposition is always an important coating technology for provid- 
ing high deposition rate and low production cost in industrial applications such as 
functional or decorative coatings (DiBari 1996). As microelectromechanical sys- 
tem (MEMS) attracts great interests in academic research and industrial applica- 
tions, electrochemical deposition becomes one crucial option to produce thick 
micro metal structure. Nickel is commonly chosen for electroforming micro struc- 
tures in MEMS due to its good mechanical properties and corrosion resistance 
(DiBari 1996). In 1916, O.P. Watts (1916) proposed the famous Watts bath to break 
through the difficulties in nickel plating. The Watts bath plating overcomes the low 
deposition rate of nickel with high current density of 5 A/dnr which is ten times 
larger than in traditional baths. However, the mechanical properties of Watts bath 
plated deposits are not able to support various demanding applications. As alterna- 
tives, various sulfamate baths have been developed which provide lower internal 
stress and compact structure in deposits than that by means of Watts bath. In recent 
years, researchers continue to study how to enhance the mechanical properties of 
electroformed nickel microstructures. 

One of the effective approaches for strength enhancements in nickel-based 
plating is through alloy codeposition with addition of Co or Fe ions into the 
electrolyte (Chung and Chang 2007; Duch et al. 2002; Zhuang and Podlaha 2002; 
Fan and Piron 1996; Bai and Hu 2002; Qiao et al. 2006; Wang et al. 2005). When 
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two or more elements are codeposited, the effect of the so-called anomalous 
codeposition behavior would appear (Duch et al. 2002; Zhuang and Podlaha 
2002; Fan and Piron 1996; Bai and Hu 2002; Qiao et al. 2006; Wang et al. 2005; 
Dolati et al. 2007; Afshar et al. 2002; Fabio and Lucia 2006; Lupi and Pilone 2001; 
Chung et al. 201 la; Ahadian et al. 2007). That is, the composition of alloy deposits 
is different from that in electrolyte and is related to the current density, temperature, 
pH, and other variables in electrolyte (Afshar et al. 2002; Fabio and Lucia 2006; 
Lupi and Pilone 2001; Chung et al. 2011a). The addition of different anionic 
(Golodnitsky et al. 2000) or cationic surfactant (Gomez et al. 2006) even nanopar- 
ticle (Shi et al. 2005) in the electrolyte could also modify the composition and 
morphology of deposited films. For example, the current density would greatly 
affect Co content of films as well as their properties. Therefore, it is of great 
importance to control the current density during the deposition process to achieve 
the desired Co content in the Ni-Co alloy film. 

Another advantageous approach to get smooth morphology and good mechan- 
ical properties of nickel deposits is to apply pulse waveforms in the electrochemical 
cell. The electrochemical behavior under pulse excitation attracts great interests in 
academic research and industrial applications (Hansal et al. 2006; Chandrasekar 
and Pushpavanam 2008; Dixit and Miao 2006; Quemper et al. 2000; Tsai 
et al. 2002). The roughness of electroplated Ni could be improved by applying 
pulse waveform (Wong et al. 1999) or pulse reverse current (Qu et al. 1997). At the 
same current density, pulse period, and a constant deposition time, the best surface 
finishing was obtained by a rectangular waveform with enough relaxation time 
compared to the ramp and triangular waveforms (Wong et al. 1999). The roughness 
of a film by a rectangular waveform with relaxation time under various current 
density was about 50-170 nm (Wong et al. 1999). Better surface finishing was 
obtained by pulse reverse current than that by pulse current at the same average 
current density (Qu et al. 1997), and the reduction in the internal stress could be 
achieved (Chan et al. 1997). 

Also, the compressive residual stress of thin films plays an important role in film 
strength (Chung et al. 2007, 2011b; Oettel and Wiedemann 1995). Hardness could 
increase at low-temperature electrodeposition due to residual stress enhanced 
strengthening. Low- temperature electrochemistry is characteristic of low diffusion 
coefficient, reduced voltammetric current and high solution resistance (Chung 
et al. 2011b; Opallo and Prokopowicz 2003; Yu et al. 2003; Dolati and Mahshid 
2008; Osetrova et al. 1998). It is of thus interest to use electroplating at relative low 
temperature for improving strength of coatings and to understand the possible 
strengthening mechanism. 

In this chapter various electrodepositions in nickel sulfamate bath are introduced 
such as pulse current electrodeposition, anomalous codeposition, and 
low-temperature electrodeposition for compact deposits. In the next section, the 
conventional DC electrodeposition is presented, with special emphasis on pure Ni 
electrodeposition and Ni-Co anomalous codeposition. The electrochemical behav- 
ior between Ni and Ni-Co deposition under DC excitation is addressed. In the 
section “Enhancement of Mechanical Properties of Deposits by Pulse 
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Electrodeposition,” pulse electrodepositions of Ni and Ni-Co are presented. 
Numerical simulation was carried out on the evolution of ion concentration near 
cathode surface during electrodeposition. Afterward, the morphology and 
nanomechanical properties were evaluated and correlated with the results of numer- 
ical simulation. In the section “Electrochemical Deposit at Low Temperature,” 
electrodepositions at low electrolytic temperature are introduced. In 
low-temperature electrodeposition, mass transfer is more difficult than that in 
relative high-temperature environment. Consequently, the cathodic current 
decreases with decreasing electrolytic temperature. The low-temperature 
electrocrystallization, phase identification, morphologies, and nanomechanical 
properties of deposits are addressed. An overall brief conclusion is given in the 
last section. 

Electrochemical Deposit by Direct-Current Potential 

Electrochemical deposition through direct-current (DC) potential is a common 
technique for decorative coatings. DC electroplating has many attractive advan- 
tages such as low cost and high throughput. Although Watts nickel plating bath can 
deposit both bright and semi-bright nickel at higher deposition rate for decorative 
purposes and corrosion protection, electrodeposition of nickel in sulfamate bath is 
also used for many engineering applications. It is because sulfamate nickel baths 
can provide lower internal stress and compact structure in deposits compared to 
Watts bath. Other plating solutions based on all-chloride, sulfate-chloride, or 
all-sulfate can also be used for various engineering requirements (Sahari 
et al. 2008). A hard nickel solution with an addition of ammonium chloride is 
used when a high hardness deposit is required. 

Over the past decades, electroplating of pure Ni metal has widely been used in 
microfabrication (DiBari 1996) as a coating or a metal mold in LIGA/LIGA-like 
process. In microfabrication, more researchers attempt to enhance the mechanical 
properties of nickel deposits by alloying nickel with an addition of hard elements 
such as cobalt. The addition of Co to Ni can enhance the magnetic strength, 
hardness, and wear resistance (Duch et al. 2002; Zhuang and Podlaha 2002; Fan 
and Piron 1996; Bai and Hu 2002; Qiao et al. 2006; Wang et al. 2005). It is of 
importance to control the Co content of the Ni-Co alloy film codeposited in 
electrolyte for the desired property. Some researchers reported the codeposition 
of Ni-Co films prepared in the simple chloride or sulfate baths (Fan and Piron 1996; 
Bai and Hu 2002), or in the complex chloride-ammonia hydroxide (Fan and Piron 
1996) or chloride-sulfate baths (Qiao et al. 2006; Wang et al. 2005) by adjusting the 
current density and cyclic voltages. 

When two or more elements were codeposited simultaneously, the effect of the 
so-called anomalous codeposition behavior would appear (Zhuang and Podlaha 
2002; Fan and Piron 1996; Wang et al. 2005). The current density would greatly 
affect Co content of films as well as their properties. The addition of different 
anionic (Golodnitsky et al. 2000) or cationic surfactant (Gomez et al. 2006) even 
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Table 1 Bath composition of sulfamate chloride Ni-Co electrolyte 


Bath compositions 

Standard composition 

Nickel sulfamate (Ni(NH 2 S0 3 ) 2 4H 2 0) 

500 ml/1 

Cobalt sulfamate (Co(NH 2 S0 3 ) 2 ) 

20 ml /1 

Nickel chloride NiCl 2 -6H 2 0 

4 ± 1 g/1 

Boric acid (H 3 BO 3 ) 

42 ± 2 g/1 

Wetting agent 

1 ml /1 

Stress agent 

15 ml /1 


nanoparticle (Shi et al. 2005) in the electrolyte could also modify the composition 
and morphology of deposited films. In this section, the characteristics of Ni-Co 
deposits electroplated under DC potential in sulfamate-chloride bath are presented. 
The anomalous codepostion behavior and the corresponding properties of deposits 
at different current densities and temperatures were comparatively introduced. 


Preparing Ni and Ni-Based Alloys by DC Electroplating 

Electroplating of Ni with a codeposition of Fe or Co is commonly used for the 
applications of magnetic materials. In the case of Ni-Fe, the chloride-based elec- 
trolyte bath containing iron chlorides is a popular choice (Afshar et al. 2002; Fabio 
and Fucia 2006). However, electrolytes with Ni sulfate and Co sulfate are used to 
prepare Ni-Co deposits (Dolati et al. 2007). The properties of deposits are altered 
with the ratio of Ni and Co in the electrolyte bath. One typical example for the 
electroplating of Ni and Ni alloys is presented here. In the electroplating, Ni-Co 
alloy films were codeposited on a thin 100 nm Au-coated silicon substrates. The 
bath compositions and its function were listed in Table 1. Codeposition was 
controlled at pH = 4 =b 0.2, in stirred acidic sulfamate-chloride electrolyte. Nickel 
and cobalt sulfamates were the sources of nickel and cobalt ions for deposition, 
respectively. Nickel chloride was used to increase the conductivity of electrolyte. 
Boric acid was a buffer to maintain the pH level. Sodium dodecyl sulfate is used as 
wetting agent which also helps to reduce bubble formation to achieve better 
smoothness. The main composition of stress agent is sodium allyl sulfonate. Stress 
agent was used to decrease the internal stress or to provide a stress-free deposited 
film. Current density was varied from 1 to 5 ASD (A/dm ) while temperatures were 
controlled from 35 °C to 65 °C at a fixed current density of 3 ASD. Thickness was 
controlled at about 3 pm for all the samples and measured by alpha-step (a-step) 
profiler. The composition of films was determined by energy dispersive spectro- 
scope (EDS) attached to field-emission scanning electron microscope (FE-SEM). 
FE-SEM was also used for the characterization of surface morphology of Ni-Co 
films. The phase and crystallinity of the films were characterized by a grazing 
incidence X-ray diffractometer (GIXRD) using Cu Ka radiation of 0.15 nm. The 
microhardness was measured by Vickers hardness tester with a load of 10 g. Six 
indentations were made to get the average value for each sample. 
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Fig. 1 The relationship 
between the thickness of 
Ni-Co films and deposition 
time under different current 
density of 1-5 ASD at 45 °C 



Time (minute) 


Fig. 2 Normalized graph of 
deposition rate and current 
density with the unit of 
pm/ASD*min 



Performance of Electrochemical Deposition Under Direct-Current 
Potential 

Figure 1 shows the relationship between the thickness of Ni-Co alloy films and 
deposition time under a different current density of 1-5 ASD at a fixed temperature 
of 45 °C. The deposition rate calculated from the average thickness at different 
deposition times ranges from 0.312 to 1.540 pm/min for 1-5 ASD which roughly 
increases with current density. Moreover, to elucidate the current efficiency of 
deposition system, the deposition rate is normalized from different current density 
of 1-5 ASD as shown in Fig. 2. By dividing the deposition rate to each current 
density with the unit of pm/min* ASD, the normalized deposition rate results in a 
steady-state condition around 0.305 =b 0.015 pm/min* ASD. It indicates that this 
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electroforming process has a high current efficiency in a stable deposition system. 
From this result, the desired thickness can be obtained precisely by calculating the 
time and current density that are going to be utilized. 

The high current efficiency is related to the high concentration of ions for 
conduction in electrolyte. In electroplating, mass transfer, that is, the movement 
of material from one location in solution to another, arising from differences in 
either electrical potential, diffusion, or convection is governed by the 
Nernst-Planck equation. It is written for one-dimensional mass transfer along the 
x-axis as 





+ Cjv(x) 



where / z (x) represents flux of species i with the dimension and are 

concentration gradient and electrode potential gradient at distance x, respectively; 

Dj is diffusion coefficient of species i with the dimension ^0; anc * z u Cu v(x), F, R, 

o 

and T are the charge (dimensionless), concentration of species i (mol/cm ), velocity 
with which solution moves along the x-axis (cm/s), Faraday constant, gas constant, 
and temperature respectively. The three terms on the right-hand side represent the 
contribution of diffusion, migration, and convection, respectively, to the flux of 
species (/ z (x)), and the minus sign arises in this equation because the direction of the 
flux opposes the direction of increasing potential and diffusion gradient. In fact, 
effect of convection in the case of electroplating thin films is negligible anyway. 
Therefore, the faradic current can be separated into diffusion and migration to the 
flux of the electroactive species at the surface. In order to reach proper diffusion 
rate, the current density that was used in our experiment was relatively small (1-5 
ASD), and the supporting electrolyte (NiCl 2 -6H 2 0) serves important function for 
high conductance. The presence of a high concentration of ions decreases the 
solution resistance, and hence improves conductivity in the bulk of the solution to 
reduce the electrical power dissipated in the cell. Moreover, it often acts as a buffer 
to ensure that the double layer remains thin with respect to the diffusion layer which 
could affect the rates of metal ions approaching to the electrode. Therefore, it 
establishes a uniform ionic strength throughout the solution, even when ions are 
produced or consumed at the electrodes. 


Behavior of Anomalous Codeposition Under Direct-Current 
Potential 

The electroplating of Ni-Co alloys has been recognized as an anomalous 
codeposition, i.e. cobalt content (atomic percentage, at%) in codeposited films is 
found more than that in its electrolyte composition. Figure 3a and b shows the 
dependency of Co at% in the codeposited Ni-Co films under different current 
density of 1 ~ 5 ASD at 45 °C and temperature of 35 ~ 65 °C at 3 ASD, respectively. 
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Current Density (ASD) Temperature (°C) 

Fig. 3 The dependency of Co content (at%) in the 3 pm thick Ni-Co alloy films codeposited 
under different: (a) current density of 1-5 ASD at 45 °C and (b) temperature of 35-65 °C at 3 ASD 


The ratio of Co and Ni content is around 1 :4 in the deposited films formed at 1 ASD 
and 35 °C although the Co content in electrolyte was only 1-25 Ni due to the high 
exchange current density of Co (high activity intrinsically). The cobalt content 
gradually decreased with increasing current density (Fig. 3a) and the same as 
temperature effect (Fig. 3b). Currently there are two different mechanisms 
viable to describe the anomalous codeposition behavior, i.e., two-step adsorption 
and hydroxide mechanism. The two-step adsorption mechanism assumes 
that metal deposition occurs in two-step manner via an adsorb intermediate as 
follows: 


M 2+ + e“ -> M + (ad) 

(2) 

M+(ad) +e“ -> M(s) 

(3) 


o ■ ■ 

One M ion is assumed to be reduced to a M (ad) ion, which adsorbs on the 
electrode surface and is further reduced to the metal M(s). That is, the metal ions 

o ■ 

(M ) in the bulk electrolyte do not directly form metal deposit on arriving at 
cathode surface. Instead, other reactions of M + (ad) adsorption can occur and will 
influence the deposition of Ni metal formation. The standard electrode potential 
(at 25 °C) of Ni 2+ /Ni and Co 2+ /Co are —0.250 and —0.277 V versus standard neutral 
hydrogen electrode (NHE), respectively. According to electromotive force poten- 
tial (emf) stated above, less noble metal (Co) is preferentially to be deposited on the 
cathode resulting in anomalous behavior. The larger concentration gradient is, the 
higher anomalous deposition. 

In electroplating process, besides metal is adsorbed on electrode surface, hydro- 
gen was also reduced simultaneously. The hydroxide mechanism says that the pH 
value at the cathode surface assumed to rise during the process due to H + ions 
consumption at the electrode resulting in hydroxide ions. It is expressed in chemical 
reactions below: 
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2H+ + 2e 


H 


( 4 ) 


2H 2 0 + 2e 


H 2 + 20H 


( 5 ) 


Under these conditions, the surface pH could rise sufficiently high so that metal 
ions would undergo hydrolysis reaction with OH - . The formation of the hydroxide 
precipitate of the less noble metal (Co in this case) at the cathode caused by a local 
increase of pH and its hydroxide may suppress deposition of the noble metal (Ni in 
this case). Hereby, increasing current density, metal ions was consumed rapidly 
accompanied with the deposition time to reduce the concentration gradient of metal 
ions. It would result in the decrease of anomalous behavior near the cathode 
surface. 

With regard to the effect of temperature on anomalous behavior, temperature can 
influence kinetics of metal ions deposition and decrease in cathode polarization. An 
increase of temperature increases the rate of diffusion and ionic mobility together 
with the conductivity of the bath. Because of plentiful metallic ions diffuse into 
cathode surface for reducing concentration gradient, the anomalous codeposition is 
reduced with increasing temperature. In other words, Co content in the deposited 
films could be pushed toward the ratio within the electrolyte composition. How- 
ever, because of high diffusion of metal ions with fast deposition rate, it would 
cause the rough surface morphology, large grain size, and less dense arrangement of 
atoms. Therefore, the suitable temperature is controlled at 45 °C for different 
current density deposition. 


Characteristics of the Direct-Current Ni-Co Deposits 


Figure 4 shows the GIXRD patterns of Ni-Co alloy films codeposited at various 
current density of 1-5 ASD at 45 °C. The crystalline structure of Ni-Co alloys with 
different Co content is examined according to JCPDS card No 03-1051 for pure Ni 
(face-centered cubic, fee structure) and No 01-1278 for pure Co (hexagonal close- 
packed, hep structure), respectively. It is clearly observed that the strong intensity 
of peak at 26 = 44.7° exhibits a Ni(l 11) growth orientation together with other two 
significant peaks Ni(200) and Ni(220) at approximately 52.0° and 77.0° for the 
polycrystalline phase formation. The tiny Co(112) peak at 26 = 39.0° is found at 
1 ASD with the highest Co at% and this peak is not found at higher current density 
(lower Co content). This implies that the Ni-Co deposit formation follows the 
fcc-Ni crystalline structure during codeposition. 

Figure 5a-e presents typical SEM micrographs of surface morphology of Ni-Co 
alloy films codeposited at 45 °C and different current density of 1-5 ASD, respec- 
tively. The surface is smooth at low current density (Fig. 5a) due to smaller grains 
resulted from higher Co at%. Accordingly, the decrease of Co content resulted from 
increasing current density is beneficial for the larger grain growth which leads to the 
rougher morphology. Besides, increasing current density for higher growth rate 
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Fig. 4 GIXRD patterns of Ni-Co alloy films codeposited at various current density of 1-5 
ASD at 45 °C 


results in less compact films. Figure 6a and b shows the Vickers microhardness of 
Ni-Co alloy films of 3 pm thickness as a function of the applied current density of 
1-5 ASD at 45 °C and temperature of 45-60 °C at 3 ASD, respectively. It reveals 
that the average Vickers microhardness of Ni-Co alloy films decreases with 
increasing current density from 556.2 to 413.1 H v due to less Co content in the 
Ni-Co solid solution. From viewpoint of material science in solid solution strength- 
ening, alloys consisting of the same base metal are almost always harder than high- 
purity metals and their hardness increases with the impurity concentration. Here, the 
base metal is Ni and the hardness of Ni-Co alloy can be enhanced by Co content. 
Also, increasing current density leads to higher deposition rate and larger grain size. It 
may contribute to the reduction of microhardness through the Hall-Petch effect for 
larger grains weakening the hardness. In brief, the microhardness of Ni-Co films is 
related to the composition and crystallinity. In addition, the effect of temperature on 
Vickers microhardness has the similar effect. It reduces from 507.4 H v (45 °C) to 
462.7 H v (60 °C). The higher the temperature, the less the Co content or the larger the 
grain, the lower the harness of Ni-Co films is. 


Section Summary 

In this section, one of the effective ways to improve the hardness of deposits by adding 
the hard Co element into electrolyte is introduced. The effect of controlled parameters 
on the anomalous behavior and property of the codeposited nickel-cobalt (Ni-Co) 
films in the complex sulfamate-chloride bath is presented. The Co at% in codeposited 
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Fig. 5 SEM micrographs of surface morphology of Ni-Co alloy films codeposited at 45 °C and 
different current density: (a) 1 ASD, (b) 2 ASD, (c) 3 ASD, (d) 4 ASD, and (e) 5 ASD 


films gradually decreased with increasing current density and the same tendency to 
temperature. The variation of current densities and temperature resulted in different Co 
atomic percentage (Co at%) in the deposited films as well as the morphology and 
microhardness. Co at% in Ni-Co films gradually decreased with increasing current 
density and temperatures with variation from approximately 25 % to 15 %. The Co 
codeposition could also inhibit grain growth of the deposited films. The Ni-Co him 
codeposited at lower current density or temperature would have the higher Co content 
and smaller grain which results in the higher microhamess and smooth morphology. 
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Fig. 6 Vickers microhardness of Ni-Co alloy films of 3 |im thickness as a function of (a) current 
density and (b) temperature 


Enhancement of Mechanical Properties of Deposits by Pulse 
Electrodeposition 

It is very important to get smooth morphology and good mechanical properties through 
adjusting process parameters of electroplated films for more demanding applications. 
Ibanez and Fatas (2005) published the microhardness evolution of electroplated Cu 
under a different DC current density or pulse waveform. Thin copper films with better 
mechanical property were obtained by a DC rectangular wave and the average rough- 
ness of Cu films was generally larger than 100 nm. The roughness of electroplated Ni 
could be improved by pulse waveform (Wong et al. 1999) or pulse reverse current 
addition (Qu et al. 1997). At the same current density and pulse period for a constant 
electrodeposition time, the best surface finishing was obtained by a rectangular wave- 
form with enough relaxation time (Wong et al. 1999). A film roughness of 50-170 nm 
was achieved with a rectangular waveform with relaxation time under various current 
density (Wong et al. 1999). Better surface finishing was obtained by pulse reverse 
current than that by pulse current at the same average current density (Qu et al. 1997), 
and the reduction in the internal stress was also achieved (Chan et al. 1997). But the 
current efficiency of pulse reverse current electroplating is lower than that of pulse 
current. Yeh et al. (2004) reported that pulse electroplating for Ni-Fe alloys was able to 
suppress grain growth and induced the nanocrystalline microstmcture and improve the 
mechanical properties. This section will introduce the electrochemical phenomena 
under pulse excitation and the properties of the deposits. 


Method of Pulse Electrodeposition 

Here, pulse electrodeposition is introduced, with a specific experimental study, 
to improve the hardness of electroplated films (Chung and Chang 2007). 
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Fig. 7 The schematic 
diagram of the experimental 
setup which consists of a 
pump, an electrolyte bath, a 
PC controlled power supply, 
and a temperature unit 


Flow meter 



pump 


Filter 


Figure 7 shows the schematic diagram of the electroplating instrument 
which can be used for DC and pulse electrodeposition. It consists of an electrolyte 
bath, a personal computer (PC) controlled pulse power supply, a temperature 
control unit, a pump, a flow meter, and filters. The substrates used were 
1.5 x 1.5 cm chips diced out from p-type Si(100) wafers. A Cr/Au seed layers 
of about 10/100 nm thick were deposited on the Si substrates with an electron beam 
evaporator. Then they were cleaned with acetone and isopropanol in ultrasonic bath 
for 15 min, rinsed with deionized (DI) water, followed by acid-cleaning with 5 M 
H 2 S0 4 solution for 2 min, and rinsed with DI water. After cleaning, the 
silicon substrates were immersed in the electrolyte for plating. The composition of 
the bath solution for Ni electroplating was nickel sulfamate of 450 g/1, nickel 
chloride of 3-5 g/1, and boric acid 40 g/1. The temperature was kept at 45 =b 1 °C. 
The initial pH of the electrolyte was controlled at about 4.2. In the other experi- 
ments, the nickel sulfamate electrolyte was used for Ni-Co electroplating by mixing 
with cobalt sulfamate at concentration of 20 ± 1 ml/1. The power supply was 
controlled by potentiostat/galvanostat. Pulse frequency was varied from 0 Hz 
(DC), 1, 10, 100, 200 to 500 Hz with duty ratio (T on /T off ) of 1. The waveform of 

o 

pulse current is shown in Fig. 8. The current density was fixed at 3 Adm 
deposition time was 10 min. 

The surface morphology was observed by field-emission scanning electron 
microscope (FE-SEM), operating at 10 KV. Nanohardness was measured by MTS 
nanoindenter. In MTS indenter, CSM offers a direct measure of dynamic contact 
stiffness during the loading portion of an indentation test and allows an accurate 
observation of small volume deformation. The surface roughness and film thickness 
were measured by alpha-step (a-step) profiler. The composition was evaluated via 
energy -dispersive spectrometer (EDS). And the phase identification and crystallin- 
ity of the films were characterized by a grazing incidence X-ray diffractometer 
(GIXRD) using Cu Ka radiation of 0.15 nm. 


and 
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Fig. 8 The waveform of the 
pulse current 



t 



Fig. 9 The relationship between the him thickness and pulse frequencies 


Dependence of Deposition Rate on Pulse Frequency 

Figure 9 shows the relationship between the thickness of Ni films and the pulse 
frequency after 10 min deposition. The deposition rate decreases with the increas- 
ing frequency to 200 Hz and then leveling of at a rate of 0.04 pm/min. This may be 
because the surface concentration of depositing ions does not vary with time at 
sufficiently high frequencies. According to the Fick’s law, pulse current 
electroplating can be described by the following equations (Wong et al. 2000; 
B ard and Larry 200 1 ) 



dt 


( 6 ) 
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dC 

dx 


x=o 



i(t) 

zFD 


C(0, x) = C 0 
C(t, S) = c 0 


i(t) = ip for nT <t < 

n = 0 , 1 , 2, 3 


n \ 

r + 1 




( 7 ) 

( 8 ) 

( 9 ) 


where C is the concentration, z number of electrons, D diffusion coefficient, 
F Faraday constant, and T period. 

The thickness of the convective-diffusion layer is given by 



3/MD5 

T 



where / is the coordinate in the direction of current flow measured from the edge of 
the electrode, v the kinematic viscosity, D the diffusion coefficient, and u is the 
velocity of the electrolyte. In the experimental parameter substituting / = 10 cm, 

o o c o 

v - 1.492 x 10 cm /s, D 10 cm /s, u = 45 cm/s the thickness of the diffusion 
layer can be found: 


S = 151 pm 

For x = 0, n . — » oo, at the end of cathodic pulses, the surface concentration C c is 
given by: 



If pulse frequency is sufficiently high, then the period T is small: 



lim C c 

T — ’() 




It reveals that the surface concentration of ions does not vary with time at a 
sufficiently high frequency. The deposition rate of Ni film will approach to a stable 
value. The stable concentration of ion at such high frequency will lead to much 
more nuclei formed on the surface of cathode at a limited growth rate. It could result 
in the formation of small grains and suppress the growth of protrusion to get smooth 
morphology. 
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Morphology and Hardness of the Pulse Electroplated Ni Films 

The micro structure and properties of electrodeposited Ni films are quite dependent 
on the frequency of the pulse current. Figure lOa-f shows the SEM micrographs of 
the electroplated Ni films at frequency from 0 (DC), 1, 10, 100, 200, to 500 Hz, 
respectively. Insets in Fig. lOa-f are line profile of SEM micrographs. The line 
profile can be used to quantify the surface morphology of the deposits, and the 
peaks in line profile can be considered as protrusions. Higher peak intensity in 
Fig. 10a and b is observed than that in Fig. lOc-f. In other words, the protrusions of 
deposits produced by DC electrodeposition (i.e., 0 ~ 1 Hz) are higher than that by 
pulse electrodepostion at 10 ~ 500 Hz. The morphology becomes smoother at 
higher frequency because protrusions can be avoided in a controlled pulse 
electroplating. Figure 11 shows the relationship between the average roughness 
of Ni films and the pulse frequency. The average roughness decreases with the 
increasing frequency. It becomes very smooth with an average roughness of 
21-16.5 nm as the frequency increases to 200-500 Hz. The improvement of 
smoothness with increasing pulse frequency could be due to more new nuclei 
formed on the surface of the cathode which results in a refinement in grain size 
and relaxation time enhanced adsorption of inhibiting species for blocking of grains 
growth at high frequency. 

Figure 12 shows the relationship between the hardness of Ni films and the pulse 
frequency. The hardness measured by nanoindentation initially decreases rapidly 
from 3.9 GPa at 0 Hz to 1.18 GPa at 10 Hz, then increases with frequency to about 
4.87 GPa at 100-200 Hz, and then decreases slowly to 4.0 GPa at 500 Hz. Figure 13 
shows load-displacement curves of nanoindentation for nickel films under various 
pulse frequency. At the same load of 1,000 pN, the deformation is about 80 nm at 
0 Hz while it is 52 nm at 100 Hz and 67 nm at 200 Hz. Smaller deformation means 
better stiffness. It indicates that the stiffness of Ni films electroplated by pulse 
current is higher than that by dc electroplating. It is attributed to nickel ions able to 
diffuse into the surface defects at the off time (T 0 ff) of high frequencies (over 
100 Hz). The stiffness obtained at 100 Hz is better than that at 200 Hz. It reveals 
that the Ni films obtained by pulse electroplating have smoother morphology and 
better mechanical properties than that by dc electroplating. The compromised Ni 
film with smooth morphology, good hardness, and proper deposition rate is 
obtained at frequency of 100-200 Hz under the current density of 3 Adnr. 


The Anomalous Behavior and Properties of Ni-Co Films 

Figure 14a and b shows the effect of the pulse frequency at 3 ASD and the DC 
current density on the anomalous Co content of the codeposited Ni-Co films, 
respectively. The frequency varies from 0 Hz (DC) to 100 Hz and current density 
varies from 1 ASD to 20 ASD. Compared to about 4 % Co content in electrolyte, the 
anomalous Co composition in the Ni-Co films is around 13-21 % due to the high 
exchange current density of Co. In terms of pulse frequency, the Co content 
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Fig 10 (a-f) SEM micrographs of the electroplated Ni films at frequencies of 0 (DC), 1, 10, 
100, 200-500 Hz, respectively. The insets are their line pro hies. The morphology becomes 
smoother at higher pulse frequency 


decreases from 21.20 % (0 Hz) to 20.16 % (100 Hz) at constant 3 ASD. It indicates 
that pulse deposition modes have no significant effects on the composition. 
However, the Co content distinctly decreases with increasing current density 
from 21.18 % (1 ASD) to 13.19 % (20 ASD) at DC codeposition. 
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Fig. 11 The relationship 
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Fig. 1 2 Variation of Ni film 
hardness with the pulse 
frequency. The hardness 
measured by nanoindentation 
initially decreases rapidly 
from 3.9 GPa (0 Hz) to 1.18 
GPa (10 Hz), then increases 
with frequency to about 4.87 
GPa (100-200 Hz), and then 
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(500 Hz) 




1 




Pulse frequency (Hz) 


Although the mechanisms of anomalous codeposition are not well understood, 
the general explanation of anomalous codeposition behavior is depicted below. In 
electrodeposition process, aside from metal being adsorbed on electrode surface, 
hydrogen ion was also reduced simultaneously. The other hydroxide mechanism is 
that the pH value at the cathode surface during process is assumed to rise due to H + 
ions consumption at the electrode and resulting in hydroxide ions. It is expressed in 
chemical reactions below: 


2H + + 2e~ -► H 2 
2H 2 0 + 2e~ -> H 2 + 20H" 



Under these conditions, the surface pH could rise sufficiently high so that metal 
ions would undergo hydrolysis reaction with OH - . The local increase of pH results 
in the formation of hydroxide precipitate of the low electrode potential metal at the 
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Fig. 13 The load-displacement curves of nanoindentation for nickel films under the various pulse 
frequency. Smaller deformation means better stiffness. The stiffness of Ni films electroplated by 
pulse current is higher than that by DC electroplating 


cathode and its hydroxide (Co(OH) + ) may suppress deposition of the high electrode 
potential metal (Ni). Therefore, increasing current density may consume metal ions 
rapidly to reduce the concentration gradient of metal ions with the deposition time 
and leads to the decrease of anomalous behavior near cathode surface with reduced 
diffusion. 


Phase Identification, Morphology, and Hardness of Codeposited 
Ni-Co Films 

Figure 15a and b shows the GIXRD patterns of Ni-Co alloy films codeposited at 
frequencies of 0-100 Hz at constant 3 ASD and DC current density of 1-20 ASD, 
respectively. The strong intensity of peak at 2 6 = 44.7° are from Ni(lll) plane 
together with other two distinct peaks Ni(200) and Ni(220) at approximately 20 = 
52° and 77° for the polycrystalline phase characteristics. The peaks marked with * 
symbol are diffracted from the Au seed layer. The Ni(l 1 1) peak intensity increases 
with increasing current density due to high Ni or low Co content in the Ni-Co film. 
It implies that the highest packing density of fcc-Ni(lll) plane dominates the 
growth orientation of film during codeposition. Figure 16 shows the SEM micro- 
graphs of the electroplated Ni-Co films at pulse frequencies of 0 Hz (DC), 10, 50 
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Fig. 14 The effect of: (a) pulse frequency at constant 3 ASD and (b) DC current density on the 
anomalous Co content of the codeposited Ni-Co films 

and 100 Hz. The morphology becomes smoother at higher pulse frequency. This is 
due to more new nuclei formed on the surface of the cathode to result in a refined 
grain size as well as relaxation time enhanced adsorption of inhibiting species for 
blocking of grain growth at high-frequency pulse deposition. Moreover, Fig. 17 
shows the SEM micrographs of the electroplated Ni-Co films at DC current 
densities of 1, 3, 5, 10, and 20 ASD. The morphology becomes rougher at higher 
current density. This is similar to a common phenomenon of a rougher deposit 
obtained for higher current density deposition with higher deposition rate. 

Figure 1 8a and b shows the hardness of Ni-Co films as a function of the pulse 
frequency and current density, respectively. The hardness of all Ni-Co films is 
higher than pure Ni (4GPa) due to the addition of hard Co content. In terms of pulse 
frequency, the mean hardness at 10-100 Hz is close to 6.67-6.82 GPa (10-100 Hz) 
while that at 0 Hz (DC deposited film) is high for 8.45 GPa. Since the Co content is 
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Fig. 15 GIXRD patterns of 
Ni-Co alloy films 
codeposited: (a) at various 
pulse frequency of 0-100 Hz 
at constant 3 ASD. (b) At 
various DC current densities 
of 1-20 ASD. The diffraction 
peaks (*) come from the Au 
seed layer 
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not significantly affected by varying the deposition mode from DC to pulse plating, 
hardness of Ni-Co deposits should be related to grain size, roughness, and/or 
internal stress. The smaller grain size is beneficial to hardness enhancement. The 
calculated mean grain size of Ni-Co deposits slightly decreases with frequency 
from 12.3 to 10.5 nm. Also, the internal stress of the electrodeposited film is less 
than GPa which can have little contribution to the film hardness. Therefore, 
roughness effect is the most likely reason for the DC deposit with higher hardness 
than pulse deposits. Because the hardness by nanoindentation is calculated from the 
ratio of loading force to the indentation area, the rougher surface of the DC deposit 
may remain the underestimated area for the higher calculated hardness value. In 
terms of DC current density, the mean hardness decreases a little from 8.7 GPa 
(1 ASD) to 7.13 GPa (20 ASD) due to less amount of hard Co content. The effect of 
current density on the trend of hardness variation of Ni-Co films is similar to that of 
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Fig. 16 SEM micrographs of the electroplated Ni-Co films at frequencies of 0 Hz (DC), 10, 50, 
and 100 Hz. The morphology becomes smoother at higher pulse frequency 


Co content. And the morphology becomes rougher at high current density. Overall, 
reducing current density and increasing pulse frequency will be beneficial for the 
formation of good Ni-Co films with high hardness and smooth morphology. 


Section Summary 

This section presented with a case study the effect of pulse current with different 
frequencies on the morphology and mechanical properties of Ni films deposited by 
electroplating. The average roughness of Ni film decreases with increasing fre- 
quency. Results showed that the smoothest Ni surface with average roughness of 
16.5 nm is achieved at the frequency of 500 Hz. Meanwhile the nanohardness of Ni 
film initially decreases rapidly from 3.9 to 1.18 GPa at 0 to 10 Hz, respectively, then 
increases to about 4.87 GPa within the range of 100-200 Hz, and decreases slowly 
at 200-500 Hz. In pulse electroplating, the cathode provides an off time that induce 
nickel atoms to diffuse into surface defects. Because of that, pulse electroplating 
renders better surface stability characteristics and mechanical properties. 

The composition, surface morphology, and nanomechanical properties of the 
anomalous Ni-Co alloy films have been investigated at various pulse frequencies 
and current densities. The anomalous behavior of electrodeposited Ni-Co films is 
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Fig. 17 SEM micrographs of the electroplated Ni-Co films at DC current densities of 1-20 ASD. 
The morphology becomes rougher at higher current density 


due to the high exchange current density of Co content with low electrode potential 
and preferred adsorption of more active Co(OH) + on the deposit compared to Ni 
content. The Co content decreases from 22.53 % (1 ASD) to 13.39 % (20 ASD) at 
DC codeposition but has no pronounced variation for the pulse deposition modes. 
The Co content of Ni-Co alloys is nearly constant under pulse electroplating and 
results in similar hardness. The surface morphology of films becomes smoother at 
high pulse frequency and low current density. The reduced Co content in Ni-Co 
films can result in lower hardness which decreases from 8.72 GPa (1 ASD) to 7.13 
GPa (20 ASD) at DC current. 
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Fig. 18 (a) The hardness of 
Ni-Co films as a function of 
pulse frequency at constant 
3 ASD, and (b) the hardness 
of Ni-Co films as a function 
of DC current density 



Current Density (A/dm ) 


Electrochemical Deposit at Low Temperature 

Coatings with high strength and good smoothness are always desirable in either 
decorative or functional coatings (Orinakova et al. 2006). Two common ways are 
used to improve strength of coatings in electrodeposition; one is through pulse 
voltage and the other is alloyed with hard elements. It is noted that the compressive 
residual stress of thin films from ion bombardment or thermal annealing can 
enhance hardness of coatings (Chung et al. 2007; Oettel and Wiedemann 1995). 
So, hardness might be enhanced at low-temperature electrodeposition due to resid- 
ual stress enhanced strengthening. Low-temperature electrochemistry is character- 
istic of low diffusion coefficient, reduced voltammetric current, and high solution 
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resistance (Opallo and Prokopowicz 2003; Yu et al. 2003; Dolati and Mahshid 
2008; Osetrova et al. 1998). Here, an interesting approach of electroplating at 
relative low temperature to improve strength of coatings is introduced through a 
case study. 


Experimental Details of the Low-Temperature Deposits 

Ni films were deposited on thin Cr/Au-coated silicon (Si) substrates at low electro- 
lytic bath temperatures which range from 20 °C to 5 °C. p-type Si(100) wafers were 
used and diced into 0.5 x 0.5 cm chips. A Cr/Au seed layers about 10/100 nm thick 
were deposited on Si substrates by electron beam evaporator. Then substrates were 
cleaned with acetone (ACE) and isopropanol (IPA) in an ultrasonic bath for 1 5 min, 
rinsed with deionized (DI) water, followed by acid-cleaning in 5 M H 2 S0 4 solution 
for 2 min, and rinsed with DI water. After cleaning, the Si substrates were immersed 
in the electrolyte for electroplating. The electroplating system consists of a three- 
electrode electrochemical cell with the anode, the cathode, and the saturated 
calomel electrode as a reference electrode, electrolytic bath, potentiostat, and 
cooling system inside the isothermal container. An isothermal container was used 
to maintain the low electrolytic bath temperatures at 5-20 °C. Both potentiostatic 
and galvanostatic modes were used for electroplating. Galvanostatic mode is 
capable of keeping the current through an electrolytic cell, disregarding changes 
in the load itself, whereas potentiostatic mode is capable of keeping the loaded 
potential through an electrolytic cell, disregarding changes in the current. The 
composition of the bath solution was nickel sulfamate 450 g/1, nickel chloride 
4 g/1 and boric acid 40 g/1. The pH of the electrolyte was controlled at 4.2. The 
applying potential was — 1 V in the potentiostatic mode, and electroplating duration 
was fixed at 30 min. 


Difference Between Potentiostatic and Galvanostatic 

Figure 19a and b shows the electrodeposited films at 5 °C by means of potentiostatic 
and galvanostatic modes, respectively. The golden triangular area at the corner is 
used as the sacrificial area to connect the conducting wire, and the silver area is for 
the deposits. The deposits performed by potentiostatic mode have relatively good 
morphology, whereas the other one precipitates unfavorable sulfonic crystal. In 
potentiostatic mode, a constant voltage applied between anode and cathode func- 
tions as a driving force, and electrochemical reaction occurs depending on the 
conductivity and mass transfer efficiency of the electrolyte. However, mass transfer 
is more difficult at low electrolytic temperature environment than that at room 
temperature or higher. Accordingly, the diffusion-limited current at low tempera- 
ture is quite small and difficult to measure by galvanostatic mode. The sulfonic 
crystal precipitates when the setting current is larger than diffusion-limited current 
by galvanostatic mode, because mass transfer can not satisfy the setting reaction 
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Fig. 19 The photo of deposits performed by (a) potentiostatic mode and (b) galvanostatic mode. 
The size of specimen is 0.5 x 0.5 cm 


rate. It reveals that deposition at low temperature can be easier in potentiostatic 
mode than in galvanostatic mode. 


Chronoamperometry 

The characteristics of nucleation and diffusion behavior at various electrolytic 
temperatures can be traced by recording the evolution of current with time, i.e., 
chronoamperometry. In chronoamperometry, potential was stepped up from open 
circuit to 1.3 V at which reduction of nickel ions proceeds. Figure 20a and b shows 
the current (I) versus deposition time (t) curves and I versus t -1/2 curves, respec- 
tively. The evolution of current at temperatures of 5 ~ 20 °C can clearly be seen in 
Fig. 20a. The magnitude of current is linked to the diffusion coefficient, bulk 
concentration, surface area of electrode, and time and can be governed by Cottrell 
equation (Bard and Larry 2001): 



nFAD±C 0 




where n is the number of electrons, F the Faradic constant, A the surface area of 
electrode, D the diffusion coefficient, C Q the bulk concentration, and t the time. 
Cottrell equation describes change of current with respect to time in a controlled 
potential experiment. The current decreases with decreasing electrolytic tempera- 
ture, and the estimated current densities at5°C, 10°C, 15 °C, and 20 °C are 4.377 
=b 0.139, 6.629 d= 0.323, 9.371 =b 0.323, and 11.200 =b 0.323 mA/cm 2 , respectively. 
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Fig. 20 Chronoam- 
perometry curves for 
determining the diffusion 
coefficient: (a) the current (I) 
versus deposition time (t) 

i /o 

curves and (b) I versus t 
curves 
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It reveals that the current is limited in the low -temperature electrolyte. For realizing 
the current evolution at relatively low-temperature environment, the mass transfer 
behavior under the electroplating process should be considered. Chronoam- 
perometry was used for estimating diffusion coefficient. The diffusion coefficient 
can be obtained by substituting all known parameters into the calculated slope of 

j u 

I—t “ curves, as shown as Fig. 20b, and can be governed as the following 
expression: 


t nFAD'Co 

slope = T 

n 2 



Therefore, the diffusion coefficients at 5 °C, 10 °C, 15 °C, and 20 °C are 0.31 x 
10 -8 , 0.77 x 10 -8 , 1.13 x 10 -8 , and 1.23 x 10 -8 cnr/s, respectively. It decreases 
with decreasing electrolytic temperatures and results in the decreased current. That 
is, the reduction current is limited at low electrolytic temperature in aqueous 
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electrochemical cell due to low diffusion rate. Diffusion coefficient is a 
temperature-dependent proportionality constant correlated to mass flux due to 
molecular diffusion and concentration gradient of species. The lower diffusion 
coefficient at low temperature leads to the lower reduction current. The rate of 
electrode kinetics is always faster than mass transfer. Therefore, mass transfer plays 
as a rate determinant step in electrochemical reaction especially at low temperature. 
For linear mass transfer, the flux can be governed by the Nernst-Planck equation 
linked to mass transfer subsequently in terms of diffusion, migration, and convec- 
tion (Bard and Larry 2001): 





+ CMx) 



where J, is the flux of current density of charged species /, D, the diffusion 
coefficient, C/ the concentration, z, the number of electrons, F Faraday constant, 
R the gas constant, T the temperature, <p the voltage drop and v the stirring velocity. 
Two kinds of convection are involved in aqueous electrochemical system, one is 
natural convection due to the density gradient and the other is artificial stir. The 
effect of convection can be ignored in a stirred electrolyte. Migration in bulk 
solution (i.e., away from the electrode) can be ignored, too. Then the current near 
the electrode surface is dominant mainly by diffusion due to high concentration 
gradients. Therefore, diffusion dominates the total mass transfer for electroplating 
to be a rate determinant step. Therefore, the decreased diffusion coefficient at low 
temperature results in low reduction current during low-temperature electroplating. 
Reaction current in electroplating corresponds to deposition rate. High current 
density leads to high deposition rate. The temperature-dependent deposition rates 

at 5 °C, 10 °C, 15 °C, and 20 °C are 0.087 =b 0.017, 0.147 =b 0.029, 0.170 =b 0.034, 
and 0.223 =b 0.045 pm/min, respectively. Low-temperature electrodeposition pro- 
duces relatively low deposition rate due to limited mass transfer and may alter 
characteristics of deposits. 


Electrocrystallization at Low Electrolytic Temperatures 

The current-time curve, Fig. 20a, also provides information of electrocrystalli- 
zationat temperatures of 5 ~ 20 °C. In Fig. 20a, there exists a rising current to a 
maximum peak (7 max ) at a specific critical time (f max )- It correlates with either 
growth of new phase or the amount of nuclei. Formation of each nickel nucleus has 
individual diffusion zone, and overlap of these zones results in a peak which is 
characteristic of nucleation and growth. Shifting the peak at low electrolytic 
temperature to longer time reveals the prolonged time required for overlapping 
the localized diffusion zones. It indicates that the decrease of growth rate occurs at 
low temperature (Chung et al. 2007) due to the limited mass transfer. Whereas the 
current does not decline much after rising to peak, it may be attributed to the occurrence 
of Ni deposit with hydrogen reduction meanwhile (Dolati and Mahshid 2008; 
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Fig. 21 GIXRD patterns of 
Ni films at electrolytic 
temperatures of 5 ~ 20 °C 
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Sahari et al. 2008) to affect the i-t transients significantly. Dolati and Mahshid (2008) 
reported that even though the i-t transient was not obvious, the following analysis of 
nucleation behavior can also be done. Furthermore, nickel subsequent replenish- 
ment at the electrode surface is high enough to compensate consumption as a kinetic- 
controlled process (Dolati and Mahshid 2008; Sahari et al. 2008; Burstein and 
Wright 1974; Correia et al. 2000). 


Phase Identification and Hardness of Low-Temperature 
Deposited Ni Films 

The polycrystalline phase characteristics of low-temperature electroplated Ni 
deposits were examined by means of GIXRD patterns as shown in Fig. 21. The 
Ni films (JCPDS card No. 040850, face-centered cubic, fee structure) are polycrys- 
talline with three distinct diffraction peaks from Ni(lll), (200), and (220) planes. 
The strong intensity of peak at 26 = 44.7° is from Ni(lll) plane while other two 
distinct peaks from Ni(200) and Ni(220) are at approximately 20 = 52° and 77°. 
The peaks marked with * symbol are from the Au seed layer. It implies that the 
highest packing density of fcc-Ni(lll) plane dominates the growth orientation of 
film during electroplating. The mean grain size of the deposits can be calculated 

from the well-known Scherrer’s formula ( t = B ^ 6h , where B represents full width 

at half maximum (FWHM) of GIXRD patterns in radian, 6 b is 26 of the peak in 
degrees, and A is the wavelength of X-ray in Angstrom or nm). The calculated mean 
grain sizes of Ni films at 5 ~ 20 °C are about 23-25 nm, which decreases with 
decreasing temperature. 

The hardness of the deposits was measured by MTS Nano Indenter® with 
continuous stiffness measurement (CSM) technique. Figure 22 shows the 
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Fig. 22 Load-displacement 
curves of Ni films at 
electrolytic temperatures of 
5-20 °C using CSM 
nanoindentation 
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load-displacement curves of Ni films at electrolytic temperatures of 5-20 °C using 
CSM nanoindentation. The hardness via nanoindentation is defined as the indenta- 
tion load divided by the projected contact area of the indentation. It is the mean 
pressure under the indenter (Oliver and Pharr 1992): 




where P max is the maximum applied force and obtained directly from the force- 
displacement curve and A is the projected contact area of indenter tip with the 
material. All the indentations of low -temperature Ni films are under a constant 
indented depth of 1,000 nm; therefore hard deposits have greater ability to against 
the deformation than soft one and results in high imposed loading. The imposed 
loading during the maximum indented depth at5°C, 10 °C, 15 °C, and 20 °C are 
128.6, 102.3, 93.0, and 90.2 pN, respectively. It indicates that hardness of deposit 
can be enhanced at lower electrolytic temperature. Figure 23 shows hardness of Ni 
films as a function of electrolytic temperatures. The average hardness of deposits at 
5 °C, 10 °C, 15 °C, and 20 °C are 6.18, 5.30, 4.21, and 4.01 GPa, respectively. The 
hardness increases with decreasing electrolytic temperatures. Note that hardness of 
pure Ni films is in the range of 3.90-4.87 GPa (DiBari 1996), and Ni-Co alloy films 
are around 8.72 GPa (Chung and Chang 2009). It evidences that hardness of Ni 
films can be much enhanced by low -temperature electroplating without additions of 
hard element. 


Strengthening Mechanism 

Two main factors are concerned with hardness enhancement. The first is the dense 
film which can be obtained at low electrolytic temperature, and the second is the 
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Fig. 23 Hardness of Ni films 
is as a function of electrolytic 
temperatures 
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Fig. 24 AFM images of Ni films at electrolytic temperatures of 5 ~ 20 °C 


compressive residual stress strengthening. Because the cavity or voids in films can 
be more easily filled during fine grains growth and packing, the compact film 
commonly exhibits higher hardness. Figure 24a-d shows AFM images of the Ni 
films at5°C, 10 °C, 15 °C, and 20 °C, respectively. The smoother morphology is 
obtained at low temperature than that at high temperature. Smooth morphology of 
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Fig. 25 (a) The schematic heterogeneous semi-spherical nucleation at the cathode surface within 
liquid electrolyte and (b) the schematic homogeneous spherical nucleation with free energy 
change versus embryo/nucleus radius curves together with volume free energy and surface free 
energy at both high and low temperatures, on which show the critical free energy change (AG*) 
and the critical nucleus radius (r*) 


the film at 5 °C has compact clusters all over the scanning area. Moreover, 
Fig. 24b-d shows rough morphologies with much larger clusters arrangement. The 
average roughness of deposits at 5 °C, 10 °C, 15 °C, and 20 °C are 30.43 =b 3.38, 
66.17 =b 3.59, 78.62 =b 7.62, and 76.43 =b 11.52 nm, respectively. The lowest 
roughness is achieved at 5 °C and it drastically increases at 10-20 °C with high 
growth rate for large clusters. The smooth morphology of the compact film is 
beneficial for higher hardness. It is because the compact micro structure at low 
temperature (Fig. 24a) usually has fewer defects of cavity or voids in films, which 
can be formed during fine grain growth and arrangement for enhancing the resistance 
of plastic deformation to exhibit higher hardness compared to that at high tempera- 
ture. The compact films with smooth morphology obtained at low electrolytic 
temperature can be explained by the theory of nucleation and growth (Callister 2005). 

As ions migrate onto the cathode surface, the electrocrystallization begins 
immediately and regards as heterogeneous nucleation. Figure 25a shows the sche- 
matic heterogeneous nucleation at the cathode surface within liquid electrolyte. The 
cathode-nucleus electrolyte ( y CN ), nucleus-liquid electrolyte (ynl), and 
cathode-liquid electrolyte (/cl) interfacial energies are represented by vectors, 
and the wetting angle (6) is also shown. Let us consider the nucleation on a flat 
electrode surface and a solid particle from a liquid phase. There are three interfacial 
energies which exist at two-phase boundaries as well as the wetting angle (6). 
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Taking a surface tension force balance in the plane of the flat surface leads to the 
following expression (Callister 2005): 

Ycl = Ycn + Ynl cos 0 (1 8) 

According to Eq. 18, the geometry of nucleus depends on these three surface 
tensions, that is, 6 depends on y CN , y NL , and y CL . On the other hand, the phase 
transformation (i.e., liquid solution to solid phase) will occur when free energy 
change (AG) has a negative value, and each nucleus has a mean radius r. The model 
of heterogeneous semispherical liquid-solid phase transformation can be modified 
from homogeneous spherical one as shown in Fig. 25b. The scheme displays the 
homogeneous spherical liquid-to-solid free energy change versus embryo/nucleus 
radius curves together with volume free energy and surface free energy at both high 
and low temperatures, on which show the critical free energy change (AG*) and the 
critical nucleus radius (r*). The total free energy change is contributed from two 
terms. The first is the free energy change between the solid and liquid phases, i.e., 
the volume free energy, A G v . Its value is negative during the spontaneous liquid-to- 
solid phase transformation and the magnitude of its contribution is the product of 
AG V and the volume of homogeneous spherical nucleus (i.e., |/rr 3 ). The second is 

the formation of solid-liquid phase boundary during the transformation or the 
surface free energy, y. Its value is positive and the magnitude of its contribution 
is the product of y and the surface area of the nucleus (i.e., 4 nr ). The total free 
energy change (AG) is equal to the sum of these two contributions. In the case of 
heterogeneous semi- spherical nucleation as shown in Fig. 25a, the shape factor S(6) 
should be added and can be presented as (Callister 2005) 

A G = 0 nr 3 A G v + 4nr 2 yj S(0) (1 9) 

The values of S(0) are approximately from 0.01 to 0.5 for 0 angles of 30° to 90°. 

According to Eq. 19, the free energy change (AG) of the curve corresponding to 
the first term (i.e.,|/rr 3 AG V , A G v is negative) decreases with r 3 . And the second term 

9 

(i.e., 4 nr y) of Eq. 19 are positive and increase with r . Thus, the total free energy 
change increases first, pass through a maximum and finally decreases. It reveals that 
as a solid semi-particle starts to form as atoms in the liquid cluster together, its free 
energy first increases to reach a critical value, AG*. If this cluster reaches a size 
corresponding to the critical radius, r*, the cluster will grow with decreasing free 
energy. Comparing the conditions at both high and low temperatures, as shown in 
Fig. 25b, both of the critical radius, r*, and the activation free energy, AG*, decrease 
with decreasing temperature, i.e., AG# > AG2 and /# > f L . In physical sense, this 
means that nucleation occurs more readily with decreasing temperature (Callister 
2005). And this result can be supported with other literatures (Callister 2005; Ramirez 
et al. 2004). The increased nuclei at low temperature favor for the formation of fine- 
grain film. Therefore, the compact deposits with fine grains can thus be obtained at 
low temperature, and it is favorable for hardness enhancement. 
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Fig. 26 The schematic diagram of compressive stress generation in low-temperature 
electrodeposition 


The compact deposits produced in low-temperature environment improve the 
hardness, but the grain size variation is within small range (23-25 nm) at temper- 
atures of 5-20 °C. Therefore, the other contribution in hardness enhancement 
should be considered seriously, that is, the thermal-induced compressive residual 
stress. The compressive residual stress of thin films from thermal annealing can 
enhance hardness of coatings (Chung et al. 2007). Figure 26 shows the schematic 
diagram of thermal stress generation in low-temperature electrodeposition: 
(a) coefficient of thermal expansion (CTE) of deposits (Ni, 13 x 10 -6 °C _1 ) is 

z 1 

higher than substrate (Si, 2.6 x 10 °C ); (b) electroplating proceeded at low 
temperature and both deposits and substrate are shrunk; (c) after electroplating, the 
specimens recover to room temperature environment and both deposits and sub- 
strate expand; the magnitude of Ni expansion is higher than Si due to higher CTE; 
and (d) in fact, the deposits are constrained onto the substrate; thus deposit endures 
a residual compressive stress and substrate is tensile stress. The residual compres- 
sive stress tends to be against the external loadings for hardening. The thermal 
stress (< 7 th ) in deposits can be estimated by the formula 

Ot h = -j— ^ — (Ar)(Aa) (20) 

1 - Vf 

where Ef and Vf are the Young’s modulus and Poisson’s ratio of deposits, respec- 
tively. AT and Aa are the difference in temperature and CTE between deposits and 
substrate. The larger temperature difference at low electrolytic temperature causes 
larger compressive thermal stress for higher hardness than that at high temperature 
as shown in Fig. 23. In brief, low-temperature electrodeposition is advantageous for 
enhancement of Ni deposits hardness due to the smooth compact film and com- 
pressive residual stress. 
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Section Summary 

A study of the characteristics and hardening of Ni films formed by low-temperature 
electrodeposition have been presented. Chronoamperometry was used to study the 
nucleation and growth of films as well as the estimation of diffusion coefficient. 
The diffusion coefficient and growth rates decrease with decreasing temperature. 
The reduction current and deposition rate decrease with decreasing electrolytic 
temperature due to insufficient mass transfer. Smooth and compact films can be 
obtained at low temperature of 5 °C due to more nuclei and low deposition rate 
during deposition. The compressive residual stress is generated in films during 
low-temperature electrodeposition and it tends to be against the external loadings 
for hardness enhancement. Both the smooth compact morphology and compressive 
residual stress of the Ni deposit at low-temperature electrodeposition enhance 
hardness up to 6.18 GPa at 5 °C. 


Summary 

This chapter presented, through case studies, the characteristics of deposits under 
different electrochemical treatments and the effective approaches to enhance the 
strength of deposits. Pulse waveform will lead to much more nuclei formed on the 
surface of cathode at a limited growth rate to get small grains for smooth morphol- 
ogy. The stiffness of Ni films electroplated by pulse current is higher than that by 
DC electroplating. It is attributed to Ni atoms able to diffuse to the unoccupied sites 
at the off time (T 0 jf) of high frequencies. The anomalous behavior of 
electrodeposited Ni-Co films is due to the high exchange current density of Co 
content with low electrode potential and preferred adsorption of more active 
Co(OH) + on the deposit compared to Ni-containing ions. The surface morphology 
of films becomes smoother at high pulse frequency and low current density. The 
reduced Co content in Ni-Co films can result in lower hardness. Codeposition of 
Ni-Co alloy at higher temperature leads to less Co content, larger grains and lower 
harness. 

Compressive residual stress is generated in films during low-temperature 
electrodeposition, and it tends to go against the external loadings for hardness 
enhancement. The diffusion coefficient and growth rates decrease with decreasing 
temperature. The reduction current and deposition rate decrease with 
decreasing electrolytic temperature due to insufficient mass transfer. Smooth 
and compact films can be obtained at low temperature of 5 °C due to more nuclei 
and low deposition rate during deposition. Both the smooth compact 
morphology and compressive residual stress of the Ni deposit at 
low-temperature electrodeposition enhance hardness of the films, up to 6.18 
GPa for deposition at 5 °C. 
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Abstract 

This overview article will deal with a special application of the sputtering 
process, namely, the “magnetron sputtering technique.” In order to under- 
stand the underlying physical processes behind magnetron sputtering, a brief 
recapitulation of the sputtering phenomena will be given as a primary 
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contribution of this article. Furthermore, the special concept of magnetron 
sputtering, from both a physical and a technical point of view, will be 
presented followed by some industrial applications, practical aspects, com- 
parison with other PVD (“physical vapor deposition”) techniques, advantages 
and disadvantages, and finally some of the latest developments in the field, 
including future trends. 


Introduction 

The so-called magnetron sputtering technique has been used as a coating method 
since as early as around the 1970s. Long before the physics of the sputtering process 
was really known, it was called “cathode sputtering,” “Kathodenzerstaubung” 
(German), or “pulverisation de cathodique” (French). It has mainly been used for 
surface studies such as ion-induced radiation damage, surface analysis, surface 
cleaning, ion-beam etching, the behavior of gas-discharge tubes, ion accelerator 
technology, development of vacuum facilities, first wall problems in fusion energy 
devices, space research, etc. One of the first observations was performed by W. R. 
Grove (1852), who used a glow discharge tube which provided a qualitative picture 
of the sputtering process. 

It was not before the sputtering process itself was combined with magnetic fields 
(hence the expression “magnetron”), in order to enhance the “sputtering yield,” that 
the technique became used worldwide in a broader field of industrial applications 
such as the deposition of thin films onto various materials. This enables surface 
engineers of today to tailor desired surface properties while retaining the bulk 
properties of the original material in question, a situation which long had been 
the dream of many surface engineers and scientists. 


Fundamentals of the Sputtering Process 

Sputtering is defined as the ejection of particles (atoms, ions, and clusters) from a 
surface which is bombarded by energetic ions. Sometimes the term “sputtering” is 
used more generally to denote erosion processes induced by the impact of atoms, 
molecules, neutrons, or electrons. One should also distinguish between “chemical” 
and “physical” sputtering. 

Chemical sputtering involves chemical reactions between the incoming beam 
particles and the target material, as, e.g., the formation of compounds on the 
surface, which are subsequently sputtered as a result of the continuous particle 
irradiation. 

The concept of physical sputtering applies to the process in which the energetic 
particles penetrate a certain distance into the solid, where they will give rise to 
collision cascades or thermal spikes, eventually resulting in the ejection of target 
material. This is schematically shown in Fig. 1. Usually “sputtering” means phys- 
ical sputtering and this is how it is used in this context. 
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Fig. 1 The upper impinging ion is implanted into the bulk material where it gives rise to a 
collision sequence leading to the ejection of a substrate atom, i.e., sputtering. The lower impinging 
ion is reflected against the substrate atoms. Note the radiation damage in the bulk target material in 
the form of atom displacements which is induced by the ion irradiation 


The progress of high- and ultrahigh vacuum technology during the last decades 
and the access to mono-energetic mass-separated ion beams in ion accelerators 
have made it possible to study the nature of the sputtering process quite accurately. 
Some classical review articles, which have discussed the topics of the sputtering 
phenomena, are given in references (Behrisch 1964; Kaminsky 1965; Carter and 
Colligon 1968; McDonald 1970; Carter et al. 1971). Based on the findings 
presented in these references, a short description of the main features of the 
sputtering process is presented below. 

As indicated in Fig. 1, sputtering is caused by collision cascades taking place 
inside the solid, resulting in a flow of target atoms, some of which are moving 
toward the surface. The fraction of atoms which intersect the surface with energies 
exceeding the surface binding energy will then be ejected. Some of the incident ions 
will scatter off the target surface, and thus they will not contribute to the sputtering 
process. In this context the “sputtering yield” contains a great deal of information. 
This quantity is defined as the number of sputtered atoms (including the material 
ejected as ions, molecules, and clusters) per incoming particle, i.e., it is a measure 
of the sputtering efficiency of the target material in question for the respective 
incoming particle mass. 
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Many measurements have been carried out to determine the sputtering yield as a 
function of ion mass and energy, angle of incidence, target mass and temperature, 
bombardment dose, etc. These experiments have been performed for different 
ion-target combinations, surface conditions, and crystallographic orientations 
using single crystals (McCracken 1975; Wehner 1956; Molchanov and Chicherov 
1961; Nelson 1965; Onderdenlinden 1966; Benninghoven 1969; Andersen and Bay 
1972, 1973, 1974, 1975a, b; Roth et al. 1976; Behrisch et al. 1976). Summarized 
below, some of the factors affecting the sputtering yield are presented. 

Since 1926 (Von Hippel 1926), several theories about the sputtering mechanism 
have been put forward (Lehman and Sigmund 1966; Thompson 1968; Sigmund 
1968). It was long believed that sputtering was a pure evaporation phenomenon, 
i.e., a thermal spike or “hot spot” is created as the incident ions penetrate and lose 
their energies in the target lattice. Not until G. K. Wehner, who used single crystals, 
found that there are preferential directions from which particles are sputtered was it 
realized that collision energy transfer processes are involved (Wehner 1956, 1957, 
1959, 1968). This conclusion was confirmed by the discovery that the average and 
maximum ejection velocities are too high (a factor of ten or more) to be explained 
by pure evaporation mechanisms which give rise to ejection velocities 
corresponding to thermal energies (Wehner 1956; Nelson 1965; Thompson 1968; 
Benninghoven 1965; Oechsner and Reichert 1966; Chapman et al. 1972). 

One of the more successful sputtering theories has been developed by 
P. Sigmund (1968, 1969, 1973, 1974). In his formulation crystallographic effects 
were ignored, i.e., only amorphous materials were considered. According to 
P. Sigmund, a sputtering yield calculation consists of a number of steps defined 
by the determination of: 

• The energy deposited by the impact ion and recoil particles close to the target 
surface 

• The amount of low-energy recoil atoms this energy transfer gives rise to 

• The number of recoiling atoms reaching the surface region 

• The fraction of recoiling atoms having enough energy to overcome the surface 
binding energy 

P. Sigmund showed that the sputtering yield S(E) for an energetic ion having the 
energy E exceeding 1 KeV can be written as: 

S(E) = 3/4ji 2 * l/C 0 U 0 *a*S n (E) (1) 

where C G is a factor determined by the interaction potential of the target atoms at 
recoil energies, U Q is the surface potential barrier (binding energy of surface atoms), 
a is a measure of efficiency of momentum transfer in the collisions, and S n (E) is the 
elastic stopping power (i.e., the energy loss per unit length of the penetrating 
particle due to nuclear collisions) given by J. Lindhard et al. (1968). 

For lower impinging energies, up to about 1 KeV, the sputtering yield is 
given by: 
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S(E) = 3/4ji (i) 2 * a * T m /U 0 



where 


T m = 4M 1 M 2 /(M 1 +M 2 )*E 

i.e., the maximum energy transfer. Mi and M 2 are the masses of the incoming ion 
and the target, respectively. 

P. Sigmund has also given an expression for the variation of the sputtering yield 
with the angle of ion incidence 9 to the target surface normal (Sigmund 1968): 

S(E,h)/S(E, 1) = (cos0) -f (3) 

where h is the direction cosine, S(E,1) the sputtering yield for normal ion incidence, 
and f a number which lies between 1 and 2. Equation 3 applies to moderate mass 
ratios and for incidence angles up to about 70°. For very oblique incidence, ion 
scattering at the surface lowers the probability of sputtering. 

According to theory it can be shown that the kinetic energy E and the angular 
distribution 0 of sputtered atoms from the surface may be expressed as: 

F(E,0) = C 0 S * E/(E + U 0 ) 3 * cos (0) (4) 

Obviously the maximum in the energy distribution occurs at E = U 0 /2. 

Important information about the sputtering process is also obtained from mea- 
suring the energy distribution of the sputtered particles. The energy spectrum of the 
ejected atoms has a peak at an energy of a few eV. The precise position of this peak 
depends mostly on the surface binding energy and to a lesser degree on the 
incoming beam energy (Thompson 1968). At higher energies of the secondary 

o 

particles, the flux density decreases as E . This behavior is consistent with the 
collision cascade model, assuming an isotropic distribution of the recoil velocity. 


The Concept of Magnetron Sputtering 
Physical Aspects 

‘‘Normal” sputtering occurs when a target material is bombarded with energetic 
ions without any help of magnetic fields (see Fig. 1). These energetic ions can be 
created mainly in two different ways, namely: 

(i) Gas ionization in some kind of ion source followed by extraction and post 
acceleration through one or several electric fields to obtain their final energy 
before hitting the target in question. 

(ii) The target to be hit by the energetic ions is held on a chosen negative potential 

(bias). When the gas (e.g., Ar) is introduced into the system, some of the gas 
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Table 1 presents the 
sputtering yield data for 
some elements when argon 
is used for the sputtering 
gas which is ionized and 
accelerated to the energy of 
0.5 KeV onto the target 
material in question (From 
Vossen and Cuomo 1978, 
Chapman 1980) 


Sputtered material 

Sputtering yield (atoms/ion) 

Ag 

3.12 

A1 

1.05 

Au 

2.40 

Be 

0.51 

C 

0.12 

Co 

1.22 

Cr 

1.18 

Cu 

2.35 

Fe 

1.10 

Ge 

1.10 

Mo 

0.80 

Ni 

1.45 

Pt 

1.40 

Si 

0.50 

Ta 

0.57 

Ti 

0.51 

W 

0.57 


atoms are ionized by this electric field, i.e., a discharge has been created. As the 
ions are positively charged, they will now be accelerated toward the negatively 
charged target surface and give rise to sputtering of the target material. The 
sputtered material is then deposited on a given substrate, which consequently is 
coated by the target material. This is part of the concept (for the moment 
disregarding any magnetic fields) which is used in the magnetron sputtering 
method and will be discussed henceforth. 

In the case of “normal” sputtering, the deposition rates are decided by the 
sputtering yields at which a certain ion-substrate combination is coated. Table 1 
shows sputtering yields for different metals when argon ions (which is the mostly 
used gas element in sputter deposition systems) with the energy of 0.5 KeV are 
bombarding different target materials. These sputtering yield values may be used as 
guidance when different target materials are to be used in a sputtering deposition 
device. For conventional sputter conditions, i.e., when the target only is bombarded 
with energetic ions without introducing any extra magnetic fields, these rates are far 
too low to be used as practical means of thin film production, as the time it takes to 
coat layers of around 1 or several micrometers becomes unrealistically long (days 
or even weeks). Therefore, in order to be able to use the sputtering procedure at an 
industrial scale, the “magnetron sputtering” concept was developed during the last 
40 decades. Note that the argon ion energy (here 0.5 KeV) corresponds to the target 
potential U T (here 500 V) as illustrated in Fig. 2. 

The main principles of the magnetron sputtering process are presented in Fig. 2, 
which shows the side view of a magnetron. The idea is to enhance the ionization of 
the atoms to hit the target so that the number of sputtered material is increased 
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Fig. 2 Side view of a “magnetron” ( left side of the figure) illustrating the principles of magnetron 
sputtering by showing the arrangement of the magnets (usually permanent magnets) behind the 
target which is held at a negative potential — U T . This gives rise to the electric field E and the 
magnetic field lines, where B p is the magnetic field perpendicular to the electric field E. The two 
hollowed areas shown on the target indicate the racetrack. See the text and Fig. 6 for details. The 
right side of the figure illustrates the substrate, deposited with a layer of target material and held at 
a lower negative potential biasing -U B than the target 


compared to conventional sputtering. This is achieved by introducing magnetic 
fields close to the target surface with the help of horseshoe magnets arranged at the 
back side of the magnetron, as shown in Fig. 2. The discharge which is created in 
front of the target is often denoted as the magnetron “plasma” as ions and electrons 
are separated here. The simplest magnetrons consist of a circular- shaped geometry, 
but square magnetrons and other forms are also quite common. Cylindrical-shaped 
targets which are rotating around an inner array of permanent magnets do also exist 
on the market of today. Normally, permanent magnets are used, but electromagnets 
do the same work and can even be more flexible in an attempt to arrange the wanted 
magnetic field configuration, although the mechanical construction of the magne- 
tron will be more complex. 

In Fig. 2 the magnetic field lines start from the two outer north poles and end at 
the central arranged south pole. In addition to the magnetic field lines, an electric 
field E, perpendicular to the target surface, is created by applying a negative 
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potential — U T at the target. The magnetic field lines parallel to the target surface 
and perpendicular to the electric field E are denoted by B p . 

Consider a magnetron and substrate configuration similar to that shown in Fig. 2, 
where the components are held in a vacuum chamber (not shown here) with a base 

Q 

pressure of about 10 Pa. The different steps for the magnetron to start a sputtering 
process are the following: 

• In order to ignite the magnetron discharge, a DC high voltage in the range of 
1,000-2,000 V is applied to the target. Then this voltage is lowered to 
400-800 V for continuous operation. It is also possible to connect an alternating 
potential from an RF (radio-frequency) or HF (high-frequency) supply onto the 
target. This will be mentioned further below. 

• A suitable gas, normally argon, is introduced in the vacuum chamber with the 
help of a gas regulating system. 

• Due to the high voltage, an ionization process of the argon gas will take place 
(“glow discharge”) creating a plasma where the ions and electrons are separated. 

• The free electrons will be affected by the force F derived from the electric field 
E and the magnetic field lines B p , namely: 

F = -q * (E + v x B p ) (5) 


where q is the charge and v the velocity, respectively, of an electron. For electrons 
emitted perpendicular to the target (so-called secondary electrons) and parallel to 
both E and B (i.e., the magnetic field perpendicular to B p ), the term v x B in Eq. 5 
vanishes, i.e., these electrons are only influenced by the electric field E. Initially, 
these electrons move away from the negatively charged target surface and are thus 
accelerated away from the cathode, exhibiting a helical motion. 

Now consider the situation where the magnetic field lines B p are oriented 
parallel to the target and perpendicular to the electric field E as shown in Fig. 2. 
At this point they are bent back to the target again by performing a helical motion. 
In this way the electrons are trapped in the E x B p field above the target surface 
where each electron is capable of ionizing an enormous amount of argon gas 
atoms. This area is called “dark space,” and its existence is the reason for the 
drastically enhanced ionization of the argon gas. Once the ionization of the 
argon gas has started, the positively charged argon ions are accelerated to the 
negatively charged target, thus giving rise to sputtering of the target material. 
The sputtered material will thereafter be transported and deposited onto the 
substrate. 

The above discussed trapping of electrons, which is the reason for the enhance- 
ment of the sputtering efficiency, can induce increased sputtering rates of several 
magnitudes (up to ca. 1 pm/min.) compared to conventional sputtering and is 
observed on the target surface as a “racetrack” (see Fig. 6). 

Some of the sputtered target particles will leave the surface in an ionized state, 
normally 5-10 %. These positively charged ions are accelerated against the 
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negatively charged substrate and will thus have energies exceeding the neutral 
sputtered atoms. As will be discussed further below, the sputtered ions will have a 
positive influence on the quality of the deposited film. Some means by which the 
amount of sputtered ions can be controlled will be presented below in the section 
“Technical Description.” 

In Fig. 2 the magnets mounted on the back side of the target are all of equal 
strength. This type of configuration is called “balanced magnetron sputtering.” A 
somewhat modified concept is the so-called unbalanced magnetron sputtering; see, 
e.g., Kelly and Arnell (2000). In this version, the center magnet is weaker than the 
outer magnets. This will change the magnetic field line configuration so as to give 
rise to that some of the magnetic field lines at the end of the magnetron are open, 
i.e., they do not follow a closed loop of magnetic lines. The reason for this will, 
however, not be discussed here the result of this concept will be a denser plasma 
close to the target surface which thereby leads to enhancement of the rate of 
sputtered ions which are leaving the surface and accelerated toward the substrate. 
There are available computer programs with the help of which one can calculate the 
optimum wanted magnetic configuration. Normally an increased number of the 
sputtered ions will create denser and harder coatings compared to when less 
sputtered ions hit the substrate material. 

With the abovementioned prerequisites, all electrically conductive coatings such 
as metals can be fabricated. A special application of the magnetron sputtering 
technique is the concept of “reactive sputtering” in order to create compound 
coatings. Reactive sputtering involves the normal sputtering procedure described 
above, i.e., one element sputtering with the help of the argon driving gas, combined 
with the introduction of some reactive gas mostly oxygen-, nitrogen-, or any 
carbon-containing gas. If the right parameters during the deposition process are 
maintained, such as gas pressure, substrate temperature, rate, and energy of the ions 
bombarding the substrate, a compound material is formed on the substrate surface 
by means of condensation and chemical reactions taking place on the substrate 
surface. Some of the more commonly used compounds which can be produced in 
this way are presented below. 

Nitrides, carried out with the help of nitrogen or ammonia gas: 

TiN, AIN, TiAIN, CrN, TaN, NbN, SiN x , etc. 

Carbides, carried out with the help of, e.g., methane, acetylene, or propane gas: 

TiC, SiC, WC, DLC (“diamond-like carbon”), etc. 

Oxides, carried out with the help of oxygen gas: 

A1 2 0 3 , Si0 2 , ln 2 0, Sn0 2 , Ta 2 0 5 , etc. 

Other kind of compounds, carried out with the help of more than one reactive gas: 

TiCN, SiCN, etc. 

In later years one of the above presented coatings, namely, DLC, has found an 
increasingly greater demand as a coating for the improved characteristics observed 
in a number of applications (see below). A DLC coating is originally defined as 
consisting of an amorphous material with islands of diamond- structured crystals 
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evenly distributed in the film. However, the denotation “DLC” is also used in a 
wider interpretation as any film containing carbon compounds without any crystal- 
lographic structure. 


Key Factors 


Figure 3 illustrates some of the consequences of ion bombardment during deposi- 
tion. The figure shows both the sputtered incoming atoms and the ions which are 
deposited on the substrate surface. In the ideal case, one desires to have as many 
energetic ions as possible, say in the range of 1 KeV (corresponding to a bias 
potential -U B of —1 KV) in the initial stage of the deposition process. This is the 
case shown in Fig. 3a. Here the ions, due to atomic knock on collisions, will result 
in interfusion of the sputtered target material and the bulk atoms, a process 
sometimes denoted “ion-beam mixing” which thus can be considered as one type 
of radiation damage. Some of the incoming ions will be implanted at a certain depth 
into the bulk material, depending on the ion energies. 

If the deposition procedure is continued, but now with lower ion energies, let us 
say 30-90 eV (corresponding to a bias potential -U B of —30 to —90 V), to prevent 
“ion-beam mixing” and radiation damage, a more or less homogeneous film is 
created on top of the substrate surface as the relatively low ion energies give rise to 
a close packing procedure of the particles arriving at the substrate surface. This is 
shown in Fig. 3b. Furthermore, the interface now consists of the intermixing region 
between the film and the substrate material. This concentration gradient of the film 
atoms will drastically improve the adhesion properties of the deposited film as, e.g., 
compared to if pure evaporation deposition was performed. In the latter case the 
deposited atoms have thermal energies, in the range of 0. 1-0.4 eV, and thus cannot 
induce any “ion-beam mixing” effects. 

Another aspect in attempting to explain the main important features of the 
magnetron sputtering technique is to point out the influence of the substrate 
temperature during deposition. At too low temperatures, the atomic packing of 
the atoms arriving at the substrate surface will not lead to good coating properties. 
The reason is that the deposited atoms are more strongly bound to each other than to 
the substrate, i.e., they are not very laterally movable across the surface. This may 
create the so-called island formation and finally give rise to voids in the deposited 
layer, which in turn negatively affects the film properties such as density, hardness, 
and adhesion. 

For higher substrate temperatures, the lateral movement of the deposited atoms 
will increase due to vibrational energy, thereby minimizing the abovementioned 
destructive effects. The lack of enough vibrational energy may also be solved by 
enhancing the intensity of impinging ions onto the substrate surface. What the 
temperature does not do is now done by the energetic ions. However, the ion 


energies should not exceed around 100 eV (U B 


100 V), as otherwise the 


deposited layer starts to sputter away and will give rise to unwanted radiation 
damage of the sputtered layer. It is also known that ion bombardment, as well as 
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O = sputtered impinging ion 
o = sputtered impinging atom 
Q = bulk atom 


Fig. 3 (a) Is a schematic illustration of the initial stage of a deposition event, when the bias 
potential U B is held at around 1 KV. The impinging ions may consist of charged sputtered ionized 
target material or resputtered argon ions. One observes that deposited and bulk atoms are mingled 
close to the surface area due to the so-called “ion mixing” phenomena, (b) Shows the situation at a 
later stage of the deposition. Now the bias potential U B is reduced to a value between 30 and 90 V, 
which prevents “ion mixing.” The result is a deposited layer with high quality and good adhesion 
as the sputtered material is sewn onto the bulk material at the interface between the coating and 
bulk region (see text for detailed information) 


the substrate temperature during deposition, will influence the film structure (Kelly 
and Arnell 2000; Sundgren 1982; Sundgren and Hultman 1995; Growenor 
et al. 1984; Movchan and Demchishin 1969; Thornton 1977). The abovementioned 
facts illustrate the importance of keeping the right temperature and ion intensity as 
well as ion energy under control during the deposition procedure. 
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A prerequisite condition for optimal coating performances to be considered is the 
base pressure of the vacuum vessel in which the deposition procedure takes place. This 

/T Q 

base pressure should not exceed the value of 10 mbar (10 Pa) for normal 
conditions. If the base pressure is too high, impurities such as oxygen, hydrocarbons, 
or other gases may influence the deposition results, especially for reactive sputtering. 
If, e.g., TiN is to be coated and the impurity content in the form of oxygen in the 
vacuum chamber is too high with respect to the deposition rate, some of the freshly 
sputtered titanium material will be oxidized to form some titanium oxide instead of 
titanium nitride. The maximum allowed vacuum pressure is determined by the 
sputtering rate (power delivered onto the magnetron) as well as the maximum allowed 
impurity content of the sputtered coating. For higher sputtering rates, the surface is 
kept clean from impurities in a higher degree than for lower sputtering rates. 

One factor in addition to what have been discussed above is the existence of 
intrinsic stresses which acts on the deposited film, especially when reactive 
sputtering is performed. These stresses come about due to the different properties 
of the film and bulk materials, the most important of which is thermal stress, i.e., 
due to differences in the coefficient of thermal expansion (CTE) of the two 
materials. Most depositions take place under elevated temperatures, and when 
cooled down to room temperatures, the film will undergo stress buildup effects. 

Other origins of film stresses are the incorporation of impurities in the film, 
chemical reactions taking place on the surface, differences of the lattice spacing of 
the film and substrate, recrystallization processes, phase transformations, and other 
microscopic lattice defects. The result of excessive built-up stresses may be that the 
deposited film will exfoliate or partially peel off the substrate surface. The way to 
overcome most of these difficulties is to introduce an intermediate layer between 
the final film and the substrate surface in order to pick up formed stresses and 
thereby reduce, e.g., lateral stresses. If TiN is to be sputtered on a metal substrate, 
one might sputter a thin layer of pure metallic Ti prior to the deposition of the actual 
TiN. This will reduce the lateral stresses, thus enhancing the adhesion and thereby 
preventing exfoliation. 

An important aspect when applying reactive sputtering is that the sputtering rate 
may decrease in this case compared to when normal argon sputtering is used. The 
reason is eventual compound formation, not only at the substrate surface but also at 
the target surface. This will decrease the sputtering yield mainly due to the 
enhanced surface binding energy U G which is evident from Eq. 2. As the sputtering 
yields for compounds normally are lower compared to pure metals, this will result 
in a lower deposition rate. Decreases of more than 50 % can occur. That may lead to 
the so-called ‘‘poisoning” effect of the target surface, which now consists of a less 
conductive compound which may change the sputtering conditions drastically. 
During a deposition process though, poisoning has to be balanced out by keeping 
the reactive gas flow below the threshold value in question. In addition to this, there 
are other possible ways to overcome this problem, e.g., by pulsing the magnetron 
bias supply which also can be combined with so-called twin target systems. 

Whenever DC sputtering is applied, the target has to be electric conductive. 
However, if an electric insulator material is to be sputtered such as SiN, Si0 2 , or 
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A1 2 0 3 , it is not longer possible to create a negative target potential (U T ). In this case 
one uses a HF power in the frequency region of megahertz (usually around 1 3 MHz) 
connected to the target. In this scenario, due to their mass, the ions are not 
influenced by the high-frequency field close to the target surface as much that 
they can be accelerated onto the surface. An electron, however, due to that its mass 
is about a thousand times smaller than the ion mass, now has time enough to be 
attracted to the target surface during the short time the target is held at a positive 
potential. The region where this happens is called sheath potential. The result of the 
electron bombardment on the target is that its surface is held for a longer time at a 
negative than at a positive potential and thus the ions are able to be accelerated 
toward the target surface where they consequently give rise to sputtering events. 

Technical Description 

Practically, a magnetron sputtering device may look as the one shown in Fig. 4. The 
magnetrons are supplied with a negative voltage (— U T ) with the help of power 
supply devices. The applied power value is determined by the magnetron size and 
the desired sputtering rate. In the initial stage, i.e., to “ignite” the magnetron, a 
rather high negative voltage has to be applied in the range of — 1 ,000 V or higher. 
After this “ignition” state, the magnetron voltage is kept at around —200 to —600 V 
for normal sputtering conditions. This voltage regulation is usually taken care of 
automatically with modern power supplies, which also takes care of eventual 
electric breakdown on the target surface (arcing) with the help of an automatic 
electronic arc handling system. 

Figure 4 shows an example of a PVD system containing two opposite mounted 
magnetrons. Systems with other arrangements exist, e.g., with four magnetrons 
arranged in a special way. For two oppositely arranged magnetrons, one has the option 
to arrange the magnets into two fundamentally different ways. For a strong magnetic 
coupling, which will give high ion currents, the magnets are arranged in a north-south 
configuration through the deposition chamber. In order to obtain a comparably weak 
magnetic field, the magnets are arranged in a north-north or south-south way. 

The substrate has to be kept at a negative potential between —30 and —100 V, 
which will allow the samples to be bombarded by the most profitable ion energies 
(see discussion above). In some cases, the substrate is held at a floating potential, 
which will induce ion bombardment with ca. 20-30 eV energies, due to the 
so-called “self-biasing” potential held up by the plasma discharge. One important 
factor, for optimal coating thickness homogeneity, is that the individual specimens 
with more or less complicated geometries should be evenly deposited all over their 
surfaces. This is accomplished by rotating the substrate holder during deposition. 
Often a two- or even threefold rotation is necessary (not shown in Fig. 4). 

Normally, the vacuum vessel walls will be kept at earth potential, but other 
configurations also exist. In some cases one introduces an anode arrangement which 
picks up free electrons in order to obtain a good discharge without any “runaway” 
plasma. 


2942 


M. Braun 


turbo pump 


gas flow regulation system 


gas flow regulation system 



magnetron at -Uy 


magnetron at -Uy 


rotating substrate holder at ~Ue bias potential 


Fig. 4 Showing the main features of a magnetron sputtering deposition device, with two opposite 
mounted magnetrons. The high vacuum condition in the chamber is obtained by a forepump (not 
shown in the figure) and a turbopump. The gas inlet flow of argon and eventual reactive gases is 
achieved by mass flowmeters. In order to obtain uniform coatings, the samples are mounted on an 
electrically isolated rotating substrate holder kept at a bias potential of -U B 


The whole deposition system has to be held under a vacuum, with a base 

/T Q 

pressure of about 10 mbar (10 Pa) (see above). The vacuum is kept constant 
at this low pressure with forepumps and turbopumps. This makes it possible to 
introduce the driving gas, normally argon, which is kept at a pressure of about 

n o 

10 -5 x 10 mbar (1-5 Pa) during the deposition via a gas flow regulation 
system. The gas regulation consists of an extremely accurate mass flowmeter. For 
reactive sputtering additional mass flowmeters have to be installed to regulate the 
different gases independently from each other. Due to hysteresis effects depending 
on the pumping efficiency and the fact that the sputtering rate is not constant 
especially for reactive processes, the gas flow rate (or rates) has (have) to be 
adjusted throughout the deposition process. In modern concepts this is achieved 
by automatic computer process-controlled systems. 

Another quantity to be controlled in magnetron sputtering is the substrate 
temperature before and during deposition. Before, because in many applications 
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one needs to outgas the samples to be coated prior to the deposition. All metallic 
materials contain intrinsic trapped gases in the form of, e.g., oxygen, hydrogen, and 
nitrogen. These gases may be released during the deposition due to increased 
temperatures and ion-induced desorption processes leading to unwanted properties 
of the coated film such as lower adhesion, failed chemical surface reactions, etc. To 
prevent this situation, outgassing in high vacuum conditions is necessary. 
Depending on the type of coating and the material to be coated, temperatures in 
the range of 100-400 °C should be used. For too high temperatures, there is always 
the risk of geometric deformation of, e.g., a tool to be coated. 

The temperature during deposition has also to be monitored and kept as constant 
as possible. This is important for the right chemical reaction rate and film quality to 
occur. Usually this temperature is lower than the outgassing temperature, i.e., 
around 100-300 °C. There are several ways of regulating the substrate temperature, 
some of which are (i) direct ohmic heating in the chamber; (ii) plasma heating, i.e., 
the emitted and picked up energy by the ions (and eventually electrons) hitting the 
substrate during deposition; (iii) infrared heating with the help of in situ arranged 
infrared radiators or through infrared-transmitting windows; and (iv) RF (radio- 
frequency) techniques. None of these methods are indicated in Fig. 4. 

It should also be mentioned that for a coating cycle to be successful, one has to 
apply good cleaning procedures prior to the coating (Mattox 1996). This is impor- 
tant because any impurities on the surface, e.g., in form of grease, dust, or other 
particles, will prevent from obtaining the desired coating properties. Surface 
cleaning is normally carried out with the help of acids or basic solutions, often in 
combination with ultrasound washing. Once the samples are mounted and the 
deposition chamber is evacuated, a so-called “plasma cleaning” procedure is 
applied. By holding the substrates at a high potential, preferentially at several KV 
and simultaneous argon gas inlet, the samples are cleaned by the sputtering process, 
i.e., the reverse process takes place compared with the coating stage. 

The technique of manufacturing “multilayers” with the help of magnetron 
sputtering has become an increasingly growing interest in many fields of applica- 
tions (Petrov et al. 1997). Multilayer coatings are fabricated by the use of several 
magnetrons, mounted in front of the substrates to be coated. One way is to turn on 
and off the magnetrons successively to produce the desired multilayers. Another 
way is to use a rotating substrate holder, such as that shown in Fig. 4. The use of 
different target materials on the magnetrons then gives rise to the multilayered 
structure of the obtained coatings. A simpler but more material limited method is to 
vary the gas concentration during the deposition (Deng and Braun 1995). There are 
also advanced application-tailored coatings by means of nanoscale multilayers with 
so-called superlattice structures which have been developed (Hovsepian and Miinz 
2002). In principle many different targets could be used in combination with 
reactive sputtering to produce a manifold of multi-coating variations. 

Figure 5 demonstrates how a DC plasma may look during the deposition phase. 
The photo is taken through a viewing port of the vacuum deposition chamber, 
showing some tools (traction lugs) which are deposited with TiN by reactive 
magnetron sputtering. The reddish color of the plasma arrives from the nitrogen 
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gas inlet which is necessary for the compound formation (in this case TiN). Here the 
exited and partially ionized nitrogen atoms are emitting optical radiation mostly in 
the red part of the atomic line spectrum mixed with the bluish lines coming from 
exited argon atoms and ions. Note also the brighter region in the background which 
originates from the sputtering magnetron. 

Figure 6 is a picture of a rectangular-shaped metal target, taken after being used 
for many hours in magnetron sputtering implementation. It can be seen from the 
figure that comparatively large amounts of sputtered material has been eroded away 



Fig. 5 This photo demonstrates how a DC plasma may look during the deposition of TiN on some 
tools. Note the brighter background which originates from the sputtering magnetron (With 
courtesy of Micromy AB) 


Fig. 6 Showing a picture of a 
rectangular- shaped metal 
target, after being used for 
many hours in magnetron 
sputtering implementation. 
Note the distinct eroded 
region, specially the so-called 
racetrack observed as a cavity 
around the target surface 
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all over the target surface. Also seen is the so-called racetrack observed as a cavity 
region around the target. This craterlike racetrack is created by the enhanced 
sputtering rate arising from the magnetic and electric field lines which interact 
according to Eq. 5 above. Obviously the erosion rate is very high here. 


Industrial Applications 

Magnetron sputtering has become a worldwide spread coating technique in many 
different fields of industrial applications. One may distinguish between two main 
segments of applications, namely, functional coatings and decorative coatings. It is 
the last mentioned branch which so far has become the biggest. However, the 
applications of functional coatings are growing fast, not least because of the solar 
energy where magnetron sputtering has become a new promising method of 
manufacturing multilayer coatings. For a lot of cases, the coatings have both 
decorative and functional values. 


Decorative Applications 


As already mentioned, this is the largest field of PVD applications (Miinz 
et al. 1982; Fleicher et al. 1998). Some examples where decorative coatings are 
found are: 

• All kind of armatures 

• Interior and exterior home decorations 

• Watchcases 

• Spectacle frames 

• Computer and mobile telephone casings 

• All different kinds of plastic goods 

• Nautical products 

• Firearms 

• Musical instruments 

• Automotive fittings 

• Knives and scissors 

• Jewelry 

• All kinds of luxury goods 

Some examples of decorative coatings are shown in Fig. 7. Due to economical 
reasons, the thickness of decorative coatings is only a few tenth of a micrometer, 
say, 0. 1-0.3 pm. This of course reduces the well-known advantage of a PVD 
coating of having excellent wear- and corrosion-resistant properties, as these 
kinds of thin layers are relatively quickly worn out or contain large amounts of 
pin holes, respectively. However, for a well-polished surface, a coating performed 
by magnetron sputtering techniques will have an outstanding brilliant deep shining 
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Fig. 7 Some examples of products in which the magnetron sputtering deposition technique is 
applied for decorative purposes (With courtesy of Micromy AB) 


metallic color appearance which cannot be achieved by any other means. Figure 7 
presents some examples of items which have been coated with the magnetron 
sputtering technique. For a pure metallic appearance, i.e., without any special 
coloring, the sputtering process is carried out with a metal target. 

The coloring itself from a specific target material, such as Zr, Ti, Cr, Nb, or A1 
alloys, is achieved by reactive sputtering where the gas mixture, mostly the 
driving gas combined with nitrogen, oxygen, or any hydrocarbon gas, is deciding 
the color of the coating. As the exact hue of any color is very sensitive to the actual 
gas flow rate, it is essential to control any fluctuation of the gas inlet during 
deposition. In fact, even if the gas flow control is good, the special gas distribution 
inside the deposition chamber may vary and cause differences in the color hue. This 
problem gets more serious the bigger the deposition chamber is. However, with 
wise automatic computer control systems in combination with in situ spectropho- 
tometer detectors, these concerns can be handled satisfactorily today (Sullivan 
et al. 2002). 
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Industrial Tools 

Up to date, the most important application of the magnetron sputtering tech- 
nique has been coating of different kind of tools. The beneficial properties 
which are searched for here are essentially hardness, low wear and friction 
(i.e., tribological performances), and enhanced corrosion and oxidation resis- 
tance. The largest application in this field is certainly cutting tools such as 
drills, mills, knives, and other tools with sharp edges (Lefler et al. 1998; 
Holmberg and Matthews 2009). The aim of the coating is here to prolong 
the lifetime of a cutting tool before the edge gets blunt or dull. In this context 
the magnetron sputtering technique is particularly suited for very sharp edges 
such as surgical knives and hairdressing scissors, because in these kinds of 
applications, the deposition must be carried out in a very controlled way in 
order not to destroy the sharp edges by heat, resputtering, or rough deposition 
layers. For such applications, magnetron sputtering has been proven to be by 
far the best deposition technique of all available PVD methods (see the section 
“Comparison with Other PVD Techniques” below). 

Other examples in which extremely and outstanding smooth surface topogra- 
phies can be obtained by magnetron sputtering are injection molding tools and dies 
where extremely low surface roughness factors and defects are needed. Figure 8 
shows a pressing tool for optical storing which is coated with TiN. A defect-free 
surface coating corresponding to an R a value (roughness) down to 0.002 pm and a 
flatness within an optical wavelength are observed, i.e., topographic dimensions 
down to almost atomistic scales. 

Further areas in which magnetron sputtering coating is applied in a 
larger industrial scale are punching, forming, mechanical drives in all kind of 
engines including especially the automotive section, textile and food processing 
machines. 


Fig. 8 Showing a pressing 
tool for optical storing DVD 
which has been coated with 
TiN by magnetron sputtering 
techniques (With courtesy of 
Micromy AB) 
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Optical and Photovoltaic Coatings 

Since more than 50 years, optical components, made of glass or any polymer 
material, have been coated with the purpose of tailoring lenses, filters, and mirrors 
with respect to shaping spectral light in the visible, infrared, and UV wavelength 
region. Typical applications of optical coatings are antireflection, high-reflection, 
beam splitters, long- and short-wave passes, monochromatic absorption and shap- 
ing filters, and polarizing filters (including beam splitters). Today optical coatings 
are also applied to, e.g., collector and laser mirrors with high reflectance values at 
short wavelengths in the deep UV region. In most cases multilayers are fabricated, 
where each layer may act as antireflective, light absorbing, front protecting, adhe- 
sive, buffering, or barrier in the actual optical component (Sproul et al. 1995; Meng 
and Santos 1997). 

Similar techniques which are used for optical coatings are also applied to 
produce photovoltaic coatings for solar energy (Carlsson and Wronski 1976; 
Mellaud et al. 2012) and different kinds of biosensors. Not to forget about the 
semiconductor industry in which the magnetron sputtering technique has played an 
essential role, such as for the fabrication of VLSI (“very large-scale integration”) 
circuits. For some of these last mentioned applications, the substrates to be depos- 
ited consist of silicon instead of glass or any polymer material which is used for 
optical coatings. 

Some of the more frequently utilized materials in the optical and photovoltaic 
fields are ZnO, Zr0 2 , Si, Si0 2 , SiO x , Sn0 2 , Ti0 2 , Hf0 2 Ta 2 0 5 Si 3 N 4 In, Cu-Ga, 
Mo, Cr-Al, Se, etc. As a diversity of elemental materials are involved in the 
production of these multilayered films, any coating device must contain several 
independent magnetron sputtering targets, which act in a successive way. Figure 9 



Fig. 9 Illustrating a large-scale and high-throughput industrial horizontal in-line photovoltaic 
sputter production coating system (With courtesy of Leybold Optics GmbH) 
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shows a large-scale and high-throughput industrial horizontal in-line photovoltaic 
sputter production coating system. In this roll-on device, the substrates are hori- 
zontally moved along different deposition stages, each section being differentially 
pumped. A good review of optical and photovoltaic coatings is given in (www. 
leyboldoptics.com). 


Medical Applications 

One of the more interesting applications of the magnetron sputtering technique is 
given by the creation of medical coatings. It has been found that many of the 
conventionally fabricated coatings have biocompatible properties, i.e., they are 
tissue and blood compatible. Thus the activities in this field have been focused on 
the biocompatibility of medical implants such as heart valves, stents, artificial 
arteries and veins, orthopedic and dental implants, etc. Practically all kind of metals 
and their compounds can be produced as coatings on top of every metal bulk 
material. The most commonly used compound coatings for medical applications 
are TiN, TiCN, TiC, DLC, TiAIN, ZrN, etc. Which type of coating is to be used 
depends on the application in question. 

For TiN and DLC coatings, it has been found that the effect of blood clotting is 
reduced drastically. This enhanced hemocompatibility has in practice been proven 
on artificial heart valves and coronary stents (Windecker et al. 2005). 

Orthopedic implants such as artificial hip and knee joints have been coated with 
different kinds of compound materials with the aim to enhance the wear perfor- 
mance by improved tribological properties (Narayan et al. 1994). One such exam- 
ple can be seen in Fig. 10, which shows a knee joint which is surface treated with 
TiN deposited using magnetron sputtering techniques. For the same reason, surgical 
instruments, such as scissors, scalpel blades, forceps, needle holders, pliers, etc., 
have been coated for a few years now. In addition to the improved tribological 


Fig. 10 Showing an artificial 
knee joint, which has been 
deposited with a 
biocompatible TiN coating 
{goldish color) by magnetron 
sputtering techniques (With 
courtesy of Micromy AB) 
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properties, the surface treatments of these products also yield beneficial character- 
istics of, e.g., non-allergy, noncorrosive, scratch resistant, easy to sterilize, less 
disturbing light reflection, and prolonged lifetimes. 

Furthermore, abutments of dental implants are coated today with biocompatible 
layers such as TiN or ZrN, which give a warm, gold-shaded color under the 
gingival. This elicits a natural look of the dental implant material. It has also 
been scientifically proven that titanium material coated with TiN or ZrN essentially 
decreases the possibility of bacterial growth compared to untreated titanium sur- 
faces. This means that the risk of periimplantitis for abutments made of pure 
titanium material, or alloys hereof, minimizes by the TiN or ZrN coating. 


Comparison with Other PVD Techniques 

Although the concept of PVD is normally used as an unambiguous concept, indeed 
it consists of a variety of methods. Some of the more frequently used PVD 
techniques are discussed below and compared with the magnetron sputtering 
method. 


Evaporation 

The oldest applied technique is certainly pure evaporation of a metal, but also 
reactive evaporation has been used for thin film production. The material to be 
coated is evaporated by means of ohmic heating, causing the material in question to 
melt and thus be evaporated onto the substrate. It is also possible to achieve material 
melting by directing an electron beam onto the target, thereby generating the 
evaporation. For very high-temperature melting materials, the electron beam 
method is best suited. 

The main difference between magnetron sputtering and evaporation is the 
energy of atoms or ions that impinge on the substrate to be coated. Whereas only 
thermal energies (ca. 0.1 eV) of ejected atoms emerge from the target in an 
evaporation system, magnetron sputtering gives rise to much higher ejected ener- 
gies (in the range of 2-30 eV). This difference in bombardment energies leads to an 
increased adhesion, denser packing of deposited particles and more pronounced 
grain orientations for the magnetron sputtering technique. The comparatively low 
ejection energies of the atoms which are established in evaporation also leads to the 
deposition process being carried out in a relative low-pressure environment, i.e., in 

Z Q O Z 

the range of 10 -10 mbar (10 -10 Pa), depending on the prescribed purity 
of the film, whereas in magnetron sputtering the pressure at which deposition still is 
possible may be as high as 10 -2 mbar (10 Pa). This is so because at lower energies, 
the ejected atoms are stopped more easily by collisions in the vacuum chamber than 
atoms with higher energies. Table 2 below presents some comparing characteristic 
features of the film properties obtained by magnetron sputtering versus evaporation 
techniques. 
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Table 2 Presenting and comparing some important film properties deposited by magnetron 
sputtering and evaporation 


Magnetron sputtering 

Evaporation 

Moderate deposition rates 

High deposition rates 

Good adhesion quality 

Bad adhesion quality 

Extremely low surface roughness values (0.002 mm) 

Quite low surface roughness values 

High film density 

Low film density 

Low pinhole frequency 

High pinhole frequency 

Directional deposition (line-of-sight) 

Directional deposition (line-of-sight) 

Most target materials available 

Limited target materials available 

Some gas incorporation (argon) 

Low gas incorporation 

Columnar growth possible 

Less columnar growth 


Ion Plating 

Ion plating, which was evolved in the early 1970s by Mattox, is a development of 
evaporation as a coating technique. Here, the substrate is bombarded by energetic 
ions giving rise to sputtering prior to and during an electron beam evaporation 
procedure. Before deposition, the substrate is held at a comparable high negative 
voltage in the range of 2-5 kV, causing surface cleaning by sputtering. During 
deposition this voltage is lowered to minimize the sputtering effect. The rate of 
deposition is then balanced by the difference of the evaporation and sputtering power. 

The main advantage with ion plating is the high degree of ion bombardment 
intensity. This leads to excellent adhesion properties, generally in accordance with 
the process shown in Fig. 3. The particle exposure of the deposited material also has 
a beneficial effect on film density. In addition, the relatively high vacuum pressure 
during evaporation, which can be maintained due to the comparably large energies 
of the ejected ion, leads to a large amount of ion scattering which reduces the line- 
of-sight effect. It should also be mentioned that variations of ion plating exist in the 
form of additional operational arrangements such as external plasma enhancement 
or ion sources. For and against ion plating compared to magnetron sputtering is 
given in Table 3 below. 


Arc Deposition 

Out of the different types of PVD methods, the cathodic arc deposition is the one 
which most resembles the magnetron sputtering technique. Just as in magnetron 
sputtering, magnets although weaker are arranged behind the target which is held at 
a negative potential. However, for the arc to ignite close to the surface, a much 
higher power must be applied to the target. A self-sustaining arc discharge is 
achieved for discharge currents exceeding 1 A which may increase to values up 
to 100 A and more. In this region no bearing gas, such as argon, is needed and a high 
degree of ionization of the target material is created. 
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Table 3 Comparison between magnetron sputtering and ion plating 


Magnetron sputtering 

Ion plating 

Moderate deposition rates 

High deposition rates 

Good adhesion quality 

Excellent adhesion quality 

Extremely low surface roughness values (0.002 
mm) 

Quite low surface roughness values 

High him density 

High him density 

Low pinhole frequency 

Low pinhole frequency 

Directional deposition (line-on- sight) 

Nondirectional deposition (cavities may be 
coated) 

Most target materials available 

Limited target materials available 

Some gas incorporation (argon) 

Low gas incorporation 

Columnar growth possible 

Less columnar growth 


Furthermore, the energies of emitted ions in arc deposition are higher than in 
magnetron sputtering. However, in the arc deposition method, micro-droplets are 
eroded and ejected from the target surface in the range of 0.1-100 pm. This 
certainly degrades the ambition to produce a defect-free and shiny coating on a 
highly polished substrate surface. There exist several filtering arrangements to 
eliminate these droplets with varying results. The target utilization for arc deposi- 
tion is better than in magnetron sputtering. This is so because the erosion track 
which is observed in magnetron sputtering is not seen in arc deposition due to the 
fact that arc cathode spots move around the target surface in a more or less erratic 
manner. For and against arc deposition compared to magnetron sputtering is given 
in Table 4 below. 


lon-Beam-Assisted Deposition (IBAD) 

This coating technique mainly involves two different configurations. One involves 
the use of an ion-beam source which is aimed at a target in order to induce 
sputtering combined with a second ion-beam source producing the impinging 
ions (from an inert or reactive gas) onto the substrate. The second version consists 
of an ion-beam source aimed at the substrate combined with an evaporation system 
which here acts as the deposition approach. 

The purpose with any IBAD system is evidently to maximize the effect of ion 
bombardment during the deposition process. As discussed in some detail above, this 
type of energy dissipation to the substrate will enhance a variety of improved 
coating characteristics (Harper et al. 1984). For and against ion-beam-assisted 
deposition compared to magnetron sputtering is given in Table 5 below. 

Summarizing the comparisons of the different PVD techniques, one might 
conclude that for applications in which micro -defect-free coatings with extremely 
flat topographies, i.e., with R a values as low as 0.002 pm are wanted, magnetron 
sputtering or IBAD are the most suited techniques to be considered. Taking also 
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Table 4 Comparison between magnetron sputtering and arc deposition 


Magnetron sputtering 

Arc deposition 

Moderate deposition rates 

High deposition rates 

Good adhesion quality 

Excellent adhesion quality 

Extremely low surface roughness values (0.002 
mm) 

High to moderate surface roughness values 

High film density 

High film density 

Low pinhole frequency 

Moderate pinhole frequency 

Directional deposition (line-on- sight) 

Nondirectional deposition (cavities may be 
coated) 

Most target materials available 

Most target materials available 

Some gas incorporation (argon) 

Low gas incorporation 

Columnar growth possible 

Less columnar growth 


Table. 5 Comparison between magnetron sputtering and ion-beam-assisted deposition 


Magnetron sputtering 

Ion-beam-assisted deposition 

Moderate deposition rates 

Low to high deposition rates 

Good adhesion quality 

Excellent adhesion quality 

Extremely low surface roughness values 
(0.002 mm) 

Extremely low surface roughness values 
(0.002 mm) 

High film density 

High film density 

Low pinhole frequency 

Low pinhole frequency 

Directional deposition (line-on- sight) 

Directional deposition (line-on- sight) 

Most target materials available 

Most target materials available 

Some gas incorporation (argon) 

Some gas incorporation (argon or others) 

Columnar growth possible 

Less columnar growth 


economical aspects into account, leaves magnetron sputtering as the only possible 
choice. For applications in which other criteria than, e.g., the surface topography 
will determine which PVD method is to be chosen, one has to consider further 
individual requirements in question. 


Emerging and Future Application Trends 

During the last years, magnetron sputtering has been the subject of an increasingly 
intensified interest in the field of medical application. This yields especially the 
possibility to create tailored coatings with different kinds of improved biocompat- 
ible properties. It has been found in several clinical studies that, e.g., TiN and DLC 
coatings may reduce bacterial growth on different substrate surfaces. Further 
studies are underway, involving different coating materials. 

Another characteristic feature which has been found is the reduction of resteno- 
sis (a kind of blood clotting) on human-implanted coronary stents, treated with 
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magnetron- sputtered DLC or titanium-nitride-oxide coatings. Moreover, the com- 
bination of magnetron- sputtered artificial veins and arteries with a specially 
designed so-called aptamer coating has shown epithelial cell growth on the vein/ 
artery surface. 

Nano-sized particles, e.g., gold, have been coated with different kinds of bio- 
compatible materials with the purpose of clearing calcified veins and arteries, 
tracing cursors and inducing drug transport to various deceased areas in the 
human body. This new type of medical prognoses and treatments is on the way to 
revolutionizing the whole area of human healthcare and will certainly go on for 
many years to come. 

Concepts similar to the ones mentioned above have been studied for some years; 
however deeper insight into the microorganic mechanisms are necessary to get a 
better understanding of the underlying phenomena which play the most important 
role in the so far made observations. All these studies are worldwide continued at 
different laboratories from which mankind certainly will benefit in the future. 

Considerable interest has emerged from the studies of nano -composite coating 
materials. These types of coatings consist of nanocrystalline phases, sometimes 
combined with amorphous phases, and eventually lead to the development of nano- 
composite ceramic coatings. Coating materials such as titanium-, chromium-, or 
aluminum-nitrides deposited simultaneously with silicon nitride have been studied 
lately. As the two separately deposited materials do not mingle, one has created a 
composite material consisting of both crystalline and amorphous phases. The 
interesting part is that such nanometer- sized coatings may possess outstanding 
properties in terms of hardness, toughness and yield strength, i.e., eagerly awaited 
features in material science and possible applications. 

Due to the worldwide energy supply and greenhouse climate problem, non-cost 
consuming and easy to use methods for the fabrication of inter alia solar cells which 
create direct electric energy will be a growing concern for humanity. As already 
mentioned earlier, magnetron sputtering is one of the already used techniques for 
the fabrication of solar cells (see Fig. 9), and there is no doubt that this will be one 
of the more important and growing fields. 

One further prominent development of magnetron sputtering is the so-called 
HiPIMS (“high power impulse magnetron sputtering”) technique. It was proposed 
and developed by Dr. Vladimir Kouznetsov around 1970 at the Royal High School 
of Technology in Stockholm (Kouznetsov et al. 1999). In short, the method is based 
on that the magnetron is driven by a pulsed power supply which is capable of 
delivering a power of several MW with a pulse length of some ms. This causes 
power densities at the target surface of several kW/cm . As a comparison, conven- 
tional magnetron sputtering can give a few W/cnT. One of the consequences of this 
drastically enhanced power intensity is a considerable higher ionization degree of 
the plasma. Another effect, such as the creation of multiply charged ions, will result 
in higher energies of the sputtered particles. 

As the degree of ionization is of great importance, the HiPIMS technique results 
in a higher coating quality than can be achieved in conventional PVD deposition 
processes. Thus HiPIMS may improve hardness, density, corrosion resistance, 
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ductility, adhesion, film stresses, etc. of the produced coatings. Other advantages of 
the HiPIMS technique are that the deposition can be performed at a lower temper- 
ature and that there is a better target utilization than in the case with most 
conventional magnetron sputtering devices. It should also be mentioned that the 
introduction of HiPIMS techniques in an already existing magnetron sputtering 
device is relatively easy, as the only effort is to connect a pulsed power supply 
capable to deliver the right pulse characteristics to the device in question. 

Although HiPIMS has been used for a while to a smaller extent in industry, there 
are still details regarding the technique to be improved further. One disadvantage is 
the relative long deposition time due to the pulsing mode. This fact together with 
the complex electronics of the power supply, which has to deliver short pulses in the 
MW range, makes the HiPIMS technique rather expensive compared to conven- 
tional PVD coating methods. Another problem, which does not seem to have been 
studied to any significant extent, is the substrate biasing and its influence on the 
coating quality. However, at the time being, there are current works going on at 
several high schools and industrial R&D departments all over the world to further 
develop the technique of HiPIMS. 

Due to that magnetron sputtering is a relatively simple concept from a technical 
point of view, one might expect that many different new geometric shapes of the 
magnetrons, including cylindrical rotating targets which already exist in the market, 
will be constructed in the future. New, more effective vacuum pumping systems 
will probably be developed, together with larger and more complex in-line systems, 
capable to increase the quality, efficiency and the economy with respect to the line 
of production. All together this will improve the total magnetron sputtering coating 
concept. 

If one finally let one’s imaginative power to have a free rein, a longer term of 
perspective may implicate that PVD plasma processes may be carried out in space 
laboratories. Here the environment, in principle, would be a perfect dwelling place 
for the manufacture of super high-quality coatings. The a priori given ultimate 
vacuum conditions together with the absence of any necessary vacuum vessel 
materials in space will, in principle, produce totally impurity-free coatings. This 
should especially be of interest for the microelectronic manufacturing section. The 
absence of vacuum pumps and all surrounding equipment as well as the possibility 
to take advantage of solar energy would be the dream of a professional job coater in 
order to facilitate his hassled and tortuous life of thin film production. 


Summary 

This overview article has illuminated the magnetron sputtering technique, both 
from a more fundamental and from an applied point of view. Some physical aspects 
have been given whereby key factors of importance were presented. In addition, a 
technical description of the process in question has been presented to an extent from 
which a nonspecialist in the field is thought to have fairly large exchange. A special 
chapter was devoted to different kinds of industrial applications of today. 
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Comparisons with the magnetron sputtering technique have been made with other 
similar PVD techniques. Last but not least, some recent developments together with 
prospective future trends of magnetron sputtering have been presented and 
discussed. 

In conclusion it may be stated that the magnetron sputtering technique is a very 
versatile implement to produce many different kinds of coatings in a wide area of 
applications. Together with the fact that practically any material can be sputter 
deposited and the outstanding inherent power to create extremely high degrees of 
defect-free and smooth coatings, it certainly is an eminent and compatible method 
compared to other coating technologies. Following the technical improvements 
with regard to necessary electronic and mechanical devices, the magnetron 
sputtering technique will improve coating qualities, as well as the efficiency and 
economic output of the method which will extend its area of application. 
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Abstract 

Atomic layer deposition (ALD) is a thin-film growth technology that is capable 
of depositing conformal, pinhole-free, and uniform films on high-aspect-ratio 
surfaces with atomic precision. It is similar to chemical vapor deposition (CYD), 
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but compared to CVD, it usually produces thin films with better mechanical, 
thermal, and electrical properties. ALD is a rapidly growing field, and it is 
currently introduced in the semiconductor and solar cell industries. 

In this chapter, the basics of chemistry and mechanism of ALD process are 
firstly described. ALD is then compared with other coating processes, such as CVD, 
physical vapor deposition (PVD), and electroless plating, to present its distinct 
advantages and also limitation. The elements of ALD process such as precursors, 
thin-film material, and substrates are also individually discussed. The evolvement 
of the ALD reactors, including plasma-enhanced ALD and the latest high- 
throughput ALD designs, is introduced. A number of existing and potential future 
applications of the ALD process for industry are presented at the end of the chapter. 


Introduction 

ALD (atomic layer deposition), originally known as ALE (atomic layer epitaxy), is a 
special kind of chemical vapor deposition process. It was firstly developed by 
T. Suntola in Finland as early as 1974. It was intended to fabricate poly crystalline 
luminescent ZnS:Mn and amorphous A1 2 0 3 insulator films for electroluminescent flat 
panel displays. Because of the lack of understanding on the surface chemistry, there 
was not much improvement in this area till 1985. This technique then received a huge 
push in the mid-1990s when semiconductor industry was forced to decrease device 
dimensions and increase aspect ratios. In order to achieve faster computational speed 
and smaller compacter size, the thickness of these films has to be controlled precisely 
to the order of a few nanometers and avoid pinholes at the same time. ALD process 
became the logical choice because of its layer-by-layer deposition which produces 
high-quality film of uniform thickness and excellent conformality. 


ALD Theory and Chemistry 

ALD was developed from chemical vapor deposition (CVD) and uses similar pre- 
cursors. For CVD, all the precursors are introduced onto the substrate surface 
simultaneously. But each of precursors in AFD was alternatively exposed on the 
surface. Generally, there are four basic steps for every AFD process as shown in 
Fig. 1 . First, one precursor (A) gas or vapor was introduced onto the substrate surface, 
whereas chemisorption or surface reactions take place. The entire surface is saturated 
with A in a certain amount of time. Secondly, the AFD reactor is purged with inert gas 
like N 2 to remove the unreacted precursor from the surface and ensure that only single 
layer of the precursor A is attached onto the surface. Thirdly, the other precursor (B) is 
released into the AFD reactor, where it reacts with the previous precursor (A) on the 
substrate surface. At last a purging step takes place to remove excessive B at the 
surface to form an atomic layer of targeted thin film (AB). This four-step cycle is 
repeated multiple times to produce thicker film on the surface. 
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Fig. 1 Process flow for ALD 
process 
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Fig. 2 First half reaction of RS-ALD 


The surface reactions in ALD are all self-limiting, which is the unique advantage 
of ALD (Sneh et al. 2002). Because of the self-limiting characteristics, the coating 
thickness can be precisely controlled. Furthermore, for the same reason, every part 
of the substrate surface takes the same amount of the precursor each cycle which 
results in a pinhole-free and conformal thin film. 

One of the self-limiting mechanisms in ALD is sequential surface chemical 
reactions. The film deposition in this process is as a result of the chemical reactions 
between the reactive molecular precursors and the substrate. The first half reaction 
of process sequence is given in Fig. 2. 

Taking the A1 2 0 3 ALD process as example, the top figure shows that at the 
beginning, the substrate surface is activated by OH groups. This surface is then 
exposed to the first metal precursor A1(CH 3 ) 3 (TMA). TMA molecules react with the 
surface OH-reactive species to form 0A1(CH 3 ) 2 groups. The reaction is given in Eq. 1: 


A1(CH 3 ) 3 + -OH -► 0A1(CH 3 ) 2 + CH 4 


( 1 ) 
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Fig. 3 Second half reaction of RS-ALD 


where CH 4 is the by-product, and this reaction self-stopped when all the OH groups 
on the surface are converted to 0A1(CH 3 ) 2 groups. Then the surface is fully passiv- 
ated by TMA. Some of the A1(CH 3 ) 3 molecule reacts with two OH groups on the 
surface and produces two CH 4 molecules. A purging process is then followed to 
remove the first precursor molecules physisorbed on the substrates and on the reactor 
wall. It also removes the precursor molecules which are flowing inside the chamber. 
Once the first precursor (TMA) is cleared, the second precursor, H 2 0, is released onto 
the surface fully covered with 0A1(CH 3 ) 2 . The reaction is given in Eq. 2: 

20A1(CH 3 ) 2 + 3H 2 0 -► 0 3 A1 2 (0H) 2 + 2CH 4 (2) 

As shown in Fig. 3, this reaction self-saturates until all 0A1(CH 3 ) 2 are converted 
to 0 3 A1 2 (0H) 2 . The H 2 0 molecules completely passivate the CH 3 -terminated 
surface to stop further reactions after the saturation. The surface is then converted 
similar to the initial surface, terminated with OH groups. A purging step is followed 
to remove excessive H 2 0 on the surfaces (substrate and reactor wall) and inside the 
chamber to reduce the chance to gas-phase reaction. Until then, the first atomic 
layer of A1 2 0 3 is formed uniformly on the surface. These four steps are called one 
cycle and repeated to grown films layer by layer. Each deposition cycle includes 
two half reactions (Eqs. 1 and 2), and during each half reaction, the surface 
functional groups change from one surface species to another alternatively (George 
and Ott 1996). 

ALD process typically runs in temperature range from 200 °C to 400 °C. If the 
deposition temperature is too high, chemical bonding cannot survive on the surface 
or the density of the chemically reactive site is reduced; thus, the deposition rates 
reduced. If the deposition temperature is too low, thermally activated chemisorption 
is suppressed and film-forming reaction rate decreases. As the deposition temper- 
ature increases, the deposition rate increases to a peak and then starts to drop. The 
temperature operation window for maximum deposition rate is relatively wide 
compared to CVD processes, which is much more temperature sensitive. Figure 4 
illustrates the allowable temperature window for ALD. It shows the stability and 
reactivity of the precursors determined the working temperature of ALD process. 
Furthermore, as a general rule, the higher the process temperature within the 
allowable temperature window, the higher the quality of the thin film as it can 
anneal better at higher temperature. However, the process temperature needed for 
the plasma-enhanced ALD is much lower. The plasma-enhanced ALD integrates 
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Fig. 5 Saturation time for 
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plasma source to the precursor feed and creates very active precursor species such 
as the radicals which react much faster to the surface species. The detailed intro- 
duction will be discussed in section “Plasma ALD.” 

The pulsing time, the duration for a precursor gas to stay in the chamber before 
being purged, of the precursor into the reactor is important because it has to meet 
the minimum saturation time to create a fully covered monolayer. As shown in 
Fig. 5, the reaction rate reaches to maximum after certain length of the pulsing time. 
The reactivity of the precursors to the substrates and to each other plays a major role 
in determining the pulsing time. As the reactivity of precursors increases with the 
temperature increase, shorter pulsing time can be used. The other critical factor is 
the surface geometry of the substrate. The higher the surface area or surface aspect 
ratio, the longer the pulsing time is needed to let the precursor gas reach to every 
location of the substrate surface. 
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Based on its working mechanism, there are several critical elements to have good 
ALD performance: the volatility and stability of the precursor materials, their inter- 
action with the substrate surface and each other, and their pulsing time. The volatility 
of the precursor material determines the working temperature for this ALD process. 
The stability of the precursor material affects the film thickness and uniformity. The 
pulsing time of precursor decides the total process time. The interaction of precursor 
with substrate surface and each other impacts the film quality. This part will be 
discussed in detail in section “Coating Material and Precursor.” 


Advantages of ALD 


Because of its self-limiting growth process, ALD has atomic level control of film 
composition. It enables ALD to produce sharp interface and superlattice. It also makes 
the interface modification possible. Since the film thickness depends only on the 
number of the deposition cycles, ALD is capable of precise, reproducible, and simple 
thickness control. For the reasons that ALD is only driven by interface reactions and 
has no requirement to the consistency of the reactant fluxes, large area and large batch 
deposition becomes practical. At the same time, ALD can achieve great conformality 
over any shape of substrate, and the final coating is not affected by the vaporization 
rate of the precursor as long as the surface is saturated for every step. 

Also because of layer-by-layer growth mechanism of ALD at the surface of the 
substrate, the thin film with great qualities, such as high hardness, high density, and 
low defects, can be produced at lower temperature. The other advantage of ALD is 
that the scaling up in mass production is very straightforward. There are high- 
throughput ALD reactors developed aiming to increase the production yield such as 
the batch ALD reactor and spatial ALD reactor. Both of the reactors will be 
discussed in the section “High Throughput ALD Systems.” ALD is also powerful 
to create special structure materials. Due to the wide process temperature window, 
creating thin film with unique structure, such as nanolaminate, doped thin film, 
graded layer, and ternary thin film, can be done in a continuous process. 

Other advantages include low deposition temperature to produce thin film on the 
thermally sensible substrates and wide range of selection of deposition material 
from oxide, nitride, and fluoride to elemental metal. 

Compared to other methods to produce thin film, as shown in Fig. 6, ALD has 
very distinct advantages on the conformal coverage especially on high-aspect-ratio 
surface. As for liquid-phase coating method, the sol-gel process produces low-cost 
thin film, but the film uniformity is poor. Electroplating and electroless plating 
process is a surface -initiated process similar to ALD and produces relatively good 
coverage for 3D material, but its limitations are obvious. Its deposition speed 
depends on the ion diffusion in liquid phase, which is much slower than that of 
reactant diffusion in gas phase. It is usually for depositing much thicker film and for 
low-aspect-ratio structure. 

As for vapor-phase deposition methods, CVD (chemical vapor deposition) pro- 
vides better coverage than PVD (physical vapor deposition), but both of them have 
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Fig. 6 Thin-film deposition method comparison 


trouble to achieve uniform coating on the surface with very high aspect ratio feature. 
For the most time, the protmding areas are coated with thicker film. PVD operates by 
vaporizing the desired film material and condensates them onto various substrate 
surfaces. Those heavy “vapors” cannot diffuse to the back of substrate to form film. 
Even with the rotation of the substrates, because of the shadowing effect, the coverage 
on the high- aspect-ratio stmcture is still poor. CVD uses similar precursors as ALD to 
create gas-phase reaction, but the distinct difference is that reactions between the 
precursors in CVD occur frequently outside the substrate surface and ALD only 
allows those reactions to take place on the surface. CVD can deposit thin film on 
the shadowed area, but it is much thinner than the area fully exposed to the precursors. 
For both CVD and PVD depositions, pinholes are inevitable and thickness control 
cannot be done as accurately. Those intrinsic drawbacks limit their application on the 
thin-film deposition for the 3D substrates. As shown in Fig. 7 (Ritala et al. 1999), 
ALD is capable to produce ultra-uniform thin films even on extremely high-aspect- 
ratio surface, which are impossible to achieve by other methods. 

Although there are distinct advantages over other thin-film coating methods, the 
limitation of ALD is also obvious. Its deposition speed is slow compared to other 
methods. Thus, the low productivity of ALD limits its application. But there are 
new types of ALD such as batch-type ALD and special ALD designed to improve 
the productivity. The details of such systems are going to be discussed in section 
“ALD Process and Equipment.” 


Coating Material and Precursor 

Precursor chemistry plays an essential role in ALD (George and Ott 1996). Precursors 
must be volatile and thermally stable. They are preferably liquids and gases, but solids 


2966 


W. He 


Fig. 7 Perfect coating on the 
high-aspect-ratio surface 

(Ritala et al. 1999) 



are also used, provided they have no sintering-related problems under high temper- 
ature. They need to be efficiently transported within the reaction chamber, so that the 
ALD process will not be limited by precursor flux. The approximate vapor pressure of 
precursors should be about 0.1 Torr. The reactivities of precursors to the surface and 
between each other are very important. Precursors must chemisorb on the surface very 
quickly or react rapidly with surface groups. The aggressive reactions help to attain 
saturation over the entire surface within short cycle times, complete reactions with 
high film purity, and avoid gas -phase reactions. On the other hand, precursors should 
not etch and/or dissolute into the substrate, as this would prevent the self-limiting film 
growth. Precursors need to be very stable within the allowable temperature window 
and should not undergo self-decomposition. Self-decomposition destroys self- 
limiting film growth, which results in deterioration of thickness uniformity, accuracy, 
and film contamination level. 

Purging is crucial to separate the gaseous contact between the precursor mole- 
cules and remove excessive precursors on the surface and reactor wall. The 
substrate temperature, reactor-wall temperature, and purging time should be fixed 
so that the monolayer remains on the surface and does not desorb during the purging 
period and maximum cleaning effect can be attained (Suntola and Simpson 1990). 

Reported materials deposited by ALD (selected) are given in Table 1, ranging 
from oxides, nitrides to fluorides and others. 

Table 2 shows the corresponding precursor combinations for selected coating 
materials reported in the literature, and its application is listed for reference. 

The process details of typical thin-film materials for each category will be 
discussed in the following sections. 

Inorganic Thin Film 

Metal oxide, metal nitride, and metal chalcogenides are discussed here in detail. 
Other inorganic materials like elemental film will be discussed in section “Plasma 
ALD.” The metal fluoride film will be briefly introduced in section “Optical 
Applications.” More information can be further traced in the review paper of 
Puurunen (Miikkulainen et al. 2013). 
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Table 1 Thin-film materials deposited by ALD (Ritala and Leskela 1999) 


II-VI compounds 

ZnS, ZnSe, ZnTe, ZnS!_ x Se x , CaS, SrS, BaS, SrSi_ x Se x , CdS, 
CdTe, MnTe, HgTe, Hgi_ x Cd x Te, Cd!_ x Mn x Te 

II-VI-based TFEL 
phosphors 

ZnS:M (M=Mn, Tb, Tm), CaS:M (M=Eu, Pb, Ce, Th), SrS:M 
(M=Tb, Pb, Ce, Mn, Cu) 

III-V compounds 

GaAs, AlAs, A1P, InP, GaP, InAs, Alx_ x Ga x As, Gai_ x In x As, 
Gai_ x In x P 


Nitrides 


Semiconductors dielectric 

AIN, GaN, InN, SiN 

Metallic 

TiN, TaN, Ta 3 N 5 , NbN, MoN 


Oxides 


Dielectric 

AI 2 O 3 , Ti02, Ta20 3 , Zr02, Hf02, Nb20 3 , Y20 3 , MgO, Ce02, 
Si0 2 , La 2 0 3 , SrTi0 3 , BaTi0 3 

Transparent conductors/ 
semiconductors 

ln 2 0 3 , In 2 0 3 :Sn, In 2 0 3 :F, In 2 0 3 :Zr, Sn0 2 , Sn0 2 :Sb, ZnO, ZnO: 
Al, Ga 2 0 3 , NiO, Co0 2 

Superconductor 

YBa 2 Cu 3 0 7 _ x 

Other ternaries 

LaCo0 3 , LaNi0 3 

Fluorides 

CaF 2 , MgF 2 , SrF 2 , ZnF 2 

Elements 

Si, Ge, Cu, Mo, Pt, Ru, Ni, Ag, Pb 

Others 

La 2 S 3 , PbS, In 2 S 3 , CuGaS 2 , SiC 


Metal Oxide 

High-k oxides for both transistors and DRAM capacitors are the major driving 
forces for ALD metal oxide research. The research and development on transparent 
conducting oxides, such as ZnO:An, represent the increasing need of organic and 
flexible electronics and displays. The properties of the oxide films strongly depend 
on the crystal structure. 

The property of the thin film is affected by not only the difference between 
amorphous and crystalline phases but also the difference between the different 
crystalline phases of the oxides. Here two kinds of oxide film are discussed to 
explain the ALD process to produce metal oxide thin film and their properties. 

AI2O3 

The working mechanism of A1 2 0 3 deposition was discussed in section “ALD 
Theory and Chemistry,” and it has been developed as model ALD system 
(Puurunen 2005). As shown in Table 2, the metal precursors for A1 2 0 3 include 
A1C1 3 , AlBr 3 , and A1(CH 3 ) 3 (trimethyl aluminum [TMA]). Because of the highly 
reactive nature of TMA, it is now the most used precursor for ALD to synthesize 
A1 2 0 3 thin films. Besides water, ozone is also used with TMA as the other 
precursor. The use of 0 3 instead of H 2 0 is because in some cases 0 3 has higher 
activity in ligand elimination. Since 0 3 does not absorb onto the reactor walls as 
easily as H 2 0, the purging process is more thorough. The focus here will be on ALD 
A1 2 0 3 using TMA and H 2 0. 

The growth of A1 2 0 3 per ALD cycle is much less than one A1 2 0 3 “monolayer.” 

o 

The growth rate of A1 2 0 3 per AB cycle is 1. 1-1.2 A which is estimated using the 
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o 

density of 3.0 g/ cm for A1 2 0 3 ALD films grown at 111 °C. Quartz crystal 
microbalance (QCM) is usually employed to record the mass increase during 
each ALD process to estimate the growth rate (Groner et al. 2004). The monolayer 

o 

thickness calculated from bulk A1 2 0 3 is 3.8 A, which is much thicker than the 


Table 2 ALD precursor pairs for inorganic thin films and its applications 


Film 

Precursors 

Applications 

Full chemical names 

AI 2 O 3 

A1C1 3 /H 2 0 or 0 3 

High-K dielectric 

a TEMAH: tetrakisethyl 
methylaminohafnium, Hf[N 

(C 2 H 5 )(CH 3 )] 4 

AlBr 3 /H 2 0 

Organic light-emitting 
diode (OLED) passivation 

b Ti(OEt) 4 : titanium ethoxide, 

Ti(OC2H5 ) 4 

A1(CH) 3 /H 2 0 or 0 3 

Antireflection and optical 
filters 

c Ti(0 1 Pr) 4 : titanium 
isopropoxide, Ti[OCH 

(CH 3 ) 2 ] 4 

Wear- and corrosion- 
inhibiting layers 

d Zr(OtBu) 4 : tetrakisethyl 

methylaminozirconium, 

Zr[N(CH 3 )(C 2 H 5 )] 4 

Hf0 2 

HfCl 4 /H 2 0 

TEMAH a /H 2 0 

High-K dielectric 

Ti0 2 

TiCl 4 /H 2 0 

High-K dielectric 

e ZnEt 2 : diethylzinc, 

Zn(C 2 H 5 ) 2 

Ti(0Et) 4 b /H 2 0 

Photocatalysis and 
photonic crystals 

Zn(OAc) 2 : zinc acetate 
dihydrate, Zn 

(0 2 CCH 3 ) 2 (H 2 0 ) 2 

Ti(OPr) 4 c 

UV-blocking layer 

g Ta(OEt)5: tantalum ethoxide, 
Ta(OC 2 H 5 ) 5 

Si0 2 

SiCl 4 /H 2 0 

Dielectric 

h La(thd) 3 , where thd is 2, 2,6,6- 

tetramethyl-3,5-heptanedione, 

(CH3) 3 CCOCH 2 COC(CH 3 ) 3 

Zr0 2 

ZrCl 4 /H 2 0 

High-K dielectric 

• 

1 La[N(SiMe 3 ) 2 ] 3 : lanthanum 

tris[bis(trimethylsilyl)amide], 

C 18 H 54 LaN 3 Si 6 

Zr(OtBu) 4 d /H 2 0 

Photocatalysis 

ZrI 4 /H 2 0 2 

Wear- and corrosion- 
inhibiting layers 

• 

J Zr(NMe 2 ) 4 : zirconium(tetra) 
dimethylamide, Zr[N(CH 3 ) 2 ] 4 

ZnO 

ZnEt 2 e /H 2 0 

Piezoelectric layers 

k Ta(NtBu)(NEt 2 ) 3 : tris 

(diethylamido) 

(tertbutylimido)tantalum, 

(CH 3 ) 3 CNTa[N(C 2 H 5 ) 2 ] 3 , 
where NtBu is called 
tertbutylimido group 

Zn(0Ac) 2 f /H 2 0 

UV-blocking layer 

1 Cu(acac) 2 : copper(II) 
acetylacetonate, (C 5 H 7 0 2 ) 2 Cu 

Photocatalysis and 
photonic crystals 

Optical applications (solar 
cells, integrated optics, 
optical coatings, laser) 

m Cu(hfac) 2 *H 2 0: copper 
(II) hexafluoroacetylacetonate 
hydrate, C 10 H 2 CuF 12 O 4 *H 2 O 


(< continued ) 
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Table 2 (continued) 

Film Precursors Applications Full chemical names 

Sn0 2 SnCl 4 /H 2 0 Antireflection and optical 

filters 

SnI 4 /0 2 Sensors (gas sensors, pH 

sensors) 

Ta 2 0 5 TaCl 5 /H 2 0 Antireflection and optical 

filters 

Ta(0Et) 5 s /H 2 0 Sensors (gas sensors, pH 

sensors) 

High-K dielectric 

La 2 0 3 La(thd) 3 h /0 3 High-K dielectric 

La[N(SiMe 3 ) 2 ] 3 7 

H 2 0 

ZnS ZnCl 2 /H 2 S Piezoelectric layers 

Optoelectronic applications 

WS 2 WF 6 /H 2 S Solid lubricant layers 

Zr 3 N 4 Zr(NMe 2 ) 4 j /NH 3 Biomedical coatings 

(biocompatible materials 
for in vivo medical devices 
and instruments) 

Ta 2 N 5 TaCl 5 /H 2 0 Photonic crystals 

Ta(0Et) 5 /H 2 0 

TaN Ta(NtBu)(NEt 2 )k/ Diffusion barrier for Cu 

NH 3 metallization 

AIN AlMe 3 /NH 3 Piezoelectric layers 

A1C1 3 /NH 3 

TiN TiCl 4 or Til 4 /H 2 Diffusion barrier for Cu 

metallization 

Conductive gate electrodes 

Biomedical coatings 
(biocompatible materials 
for in vivo medical devices 
and instruments) 

MgF 2 Mg(thd) 2 /TiF 4 or HF Optical application 

Cu CuC1/H 2 

Cu(thd) 2 /H 2 
C u(acac) 2 '/H 2 
Cu(hfac) 2 x H 2 O m / 

ch 3 oh 

Ni Ni(acac) 2 , 2-step 

process NiO by 0 3 
reduced afterward 
by H 2 

Ta TaCl 5 /H 2 

W WF 6 /B 2 H 6 or Si 2 H 6 

Ti TiCl 4 /H 2 
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growth per AB cycle. The big difference is due to the surface chemistry of A1 2 0 3 
ALD. For every AB cycle, the growth of every layer is not always matched with 
stoichiometry of “monolayer.” For instance, when the ALD process temperature 
increases from 177 °C to 300 °C, the number of surface “OH” or “CH 3 ” species on 
a given area decreases. This in turn decreases the growth rate for every AB cycle 
(Ott et al. 1997). 

A1 2 0 3 films grown below 600 °C are amorphous regardless of the types of the 
substrates. Post-deposition annealing of ALD A1 2 0 3 thin film changes the crystal- 
linity of the film to increase the dielectric constant and hardness. ALD of alumina 
films that are deposited above 600 °C are crystalline when A1C1 3 is employed as the 
precursor. The stability of A1C1 3 at such high temperature is the major concern. 

Because of the continuous and pinhole-free nature of A1 2 0 3 , it presents better 
electrical properties than the film made from other techniques. The ALD A1 2 0 3 
films have a dielectric constant of ~7 and display very low electron leakage 
(Groner et al. 2002), which are distinct signs of the absence of any defects or 
pinholes in the ALD A1 2 0 3 films. Based on these excellent properties, the ALD 
A1 2 0 3 film was used as gate oxides (Huang et al. 2005) and to passivate semicon- 
ductor (Ye et al. 2003). 

ZnO 

ZnEt 2 and H 2 0 are commonly used to produce ZnO ALD film, and two half 
reactions are proposed to occur as (Ferguson et al. 2005) 

-OH + Zn(CH 2 CH 3 ) 2 -> -OZn(CH 2 CH 3 ) + C 2 H 6 (3) 

— OZn(CH 2 CH 3 ) + H 2 0 -» -OZnOH + C 2 H 6 (4) 

where -CH 2 CH 3 and -OH are surface species after each step. The two half 

o 

reactions are repeated to deposit thicker ZnO films. The growth rate is 2. 2-2. 5 A 
per cycle at 100-160 °C. 

ZnO is a versatile material because of its beneficial physical, electrical, and 
chemical properties. ZnO is a semiconductor with a wide bandgap of 3.37 eV and 
large exciton binding energy, 60 meV, which makes it a promising optoelectronic 
material. Zinc oxide thin films made by ALD are typically crystalline even when 
deposited at low temperature. Only on certain materials thin amorphous zinc oxide 
films can be produced by ALD. Crystalline ZnO films produced by ALD are all of 
hexagonal phase. The crystal orientation determines the behavior of hexagonal 
ALD ZnO. The orientation appears to be a function not only of substrate and 
deposition temperature but also of other deposition conditions, such as precursor 
purge time. 

Metal Nitride 

Metal nitrides are hard, chemically resistant, sometimes catalytically active, and 
often electrically conductive. Since the increasing interest in applications as diffu- 
sion barriers and electrodes in microelectronics, nitride deposition by ALD has 
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developed rapidly. Metal chloride precursors or other halides and ammonia are 
mostly used for growing ALD metal nitride thin films. These processes give nitrides 
at relatively high temperatures (typically 350-500 °C) (Satta et al. 2002). The 
residues of the respective halogens are one of the concerns for this precursor. 
Metal-organic alkylamide-based compounds have been targeted for some transition 
metals (TiN, ZrNx, MoNx, HfNx, TaNx, WNx) as the metal precursor in order to 
solve this problem. With the alkylamides, the process temperatures are much lower 
(typically ca. 150-250 °C) than with the halides (Elam et al. 2003). But the biggest 
limitation is that the thermal decomposition temperature of amide compounds is 
very low (even during storage). 


TiN 

TiN is one of the most investigated ALD processes for metal nitride (Miikkulainen 
et al. 2013). TiCl 4 -NH 3 process is the most commonly applied process to deposit 
TiN. Since the oxidation state of Ti in the reactants is +4, whereas it is +3 in the 
desired product nitride, TiN, a reducing agent is needed. In many cases, the nitrogen 
source (e.g., NH 3 ) also serves for this purpose, while in other processes, separate 
reducing agents have been added. Growth-per-cycle values between ca. 0.2 and 0. 4 

o 

A have been reported (Cheng and Lee 2006). Cl and H are the surface species for 
each step. As seen in Eq. 5, the surface -OH group reacts with TiCl 4 to form 
-OTiCl 3 species and generate HC1 as by-product. The surface -OTiCl 3 species then 
encounter NH 3 to form -TiNH 2 (Eq. 6) with NH 3 also reducing Ti from oxidation 
state of +4 to +3. This half reaction terminates at -TiNH 2 . For the next cycle 
(Eq. 7), the surface species are -NH 2 , and they react with TiCl 4 to form another 
layer of TiN: 


-OH + TiCl 4 -► — OTiCl 3 + HC1 (5) 

— TiCl 3 + NH 3 -► — TiNH 2 + HC1 (6) 

-NH 2 + TiCl 4 — NTiCl 3 + HC1 (7) 

In addition to inorganic halides, metal-organic precursors have been used for 
TiN deposition. They have been used for special cases, where the substrates, such 
Cu, are sensitive to the halide residue or where the ALD reactor material, such as 
stainless steel, is not compatible with halide. Growth-per-cycle values are usually 

o 

very high (2 A up to several nanometers per cycle) when alkylamide precursors are 
used. This growth rate equals to several TiN monolayers per cycle which is due to 
the decomposition of the alkylamide precursor. 

The amorphous/crystalline nature of TiN depends on several ALD process 
parameters. When the metal halide reactants are used, the TiN films are mainly 
crystalline, while TiN films deposited from the alkylamide reactants are mainly 
amorphous - both with some exceptions. Temperature and impurities also affect 
crystalline form of the film. 
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Metal Chalcogenides 
ZnS 

From metal chalcogenides, most researchers paid their attention to ZnS because of its 
applications in the thin-film electroluminescence (EL) panel (Suntola and Antson 
1977). ZnCl 2 and H 2 S were first employed as precursors in 1977, but the halide 
residue is detrimental to the performance of the EL panel. Since then, Zn acetates, Zn 
(CH 3 ) 2 , Zn(thd) 2 (thd = 2,2,6,6-tetramethyl-3,5-heptanedione), and Zn(C 2 H 5 ) 2 , have 
been studied as zinc precursors. For ZnCl 2 and Zn(C 2 H 5 ) 2 , similar growth rates of 

o 

about 1 A/cycle can be achieved at 500 °C and 150 °C, respectively. Process 
temperature for alkyl compounds is room temperature which for halides, it needs to 
be over 500 °C. Cl (or CH 2 CH 3 ) and H are the surface species for each step. 

Because it is known that crystalline order is beneficial for light emission, there 
has been no interest in amorphous ZnS films. The phase depends on the temperature 
and at atmospheric pressure. The low -temperature cubic phase transforms to hex- 
agonal form at 1,020 °C. 

Other Special Material 

Polymer 

Other than inorganic materials as metal oxide and nitride, ALD can be used to grow 
organic polymers employing similar self-limiting surface reactions. This is 
described as molecular layer deposition (MLD) or alternating vapor deposition 
polymerization (AVDP) because a molecular fragment is deposited during each 
ALD cycle. Figure 8 illustrates the MLD process (Du and George 2007). Acyl 
chlorides and amines were used to produce various polyamides. For example, poly 
(p-phenylene terephthal amide) (PPTA) MLD layer is made by precursor of 
terephthaloyl chloride (C1C0C 6 H 4 C0C1) and p-phenylenediamine (NH 2 C 6 H 4 NH 2 ) 
(PD). Every precursor is a bifunctional molecule. The surface reactions for PPTA 
MLD are proposed as follows: 

-NH 2 + C10C 6 H 4 C0C1 -NHCOC 6 H 4 COCl + HC1 (8) 

-nhcoc 6 h 4 coci + h 2 nc 6 h 4 nh 2 -► NHCOC 6 H 4 CONH(CH 2 ) 6 NH 2 + HC1 

( 9 ) 

where -NFL and -Cl are the surface species after each reaction. The PPTA MLD 

o 

growth is linear but growth rate between 0.5 and 4.0 A per AB cycle was achieved 
for individual experiments. This variation was attributed to varying numbers of 
“double” reactions between the bifunctional precursor and the surface species 
(Adamcyzk et al. 2008). 

Similar processes have been developed to produce other organic polymers using 
MLD techniques. For example, 1 ,4-phenylene diisocyanate and ethylenediamine 
were used to make MLD polyurea (Kim et al. 2005). Area- selective patterning and 
orientation control (Yoshimura et al. 2006) of polyazomethine MLD have been also 
demonstrated. 
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Fig. 8 Illustrates the MLD process (Reprinted with permission from Du and George (2007). 
© 2014 American Chemical Society) 


Special Substrates 

Because of the advantages of ALD process, researchers have been using it to 
produce uniform films on various substrates from ceramic, metal, to fiber for 
various applications. Among those substrates, three kinds of special substrates for 
ALD process are discussed here to illustrate its versatility. 

Nanoparticles and Other Nano-objects 

ALD has been used to change the surface chemical properties of particles while 
retaining the bulk properties of the original particles. ALD thin layer can serve for 
purposes as the oxidation protection, electrical conduction, optical enhancer, and 
mechanical supporter. It is very natural to produce core/shell structures by ALD. 

Proper agitation in particle ALD is necessary. During agitation, the upward force 
caused by vibration or upward gas pulse to particles bed equals the downward force 
of gravity. The equal forces result in the constant movement of the particles. 
Without agitation, a static particle bed takes long for gaseous ALD precursor to 
infiltrate, similar to the porous substrates. There is also a possibility of “glue effect” 
after the ALD film is formed when the particles are not moving. The so-called glue 
effect is referring to the newly formed ALD thin film that connects the neighboring 
particles and “glues” them together. With agitation, the moving particles are fully 
exposed to the precursors in a much shorter time, and chance for particles to glue 
together is very low. Although particle aggregates still form during agitation, the 
aggregation processes are dynamic and the constant exchange of particles between 
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Fig. 9 TEM images of A1 2 0 3 
on BN particles (Reprinted 
from Powder Technology, 
142, Jeffrey R. Wank, Steven 
M. George, Alan W. Weimer, 
Nanocoating individual 
cohesive boron nitride 
particles in a fluidized bed by 
ALD, 59-69, © 2014, with 

permission from Elsevier) 



the aggregates prevents the particles from being “glued” during ALD. The specially 
designed fluidized and rotary ALD reactors are going to be discussed in the ALD 
reactor section. 

Ultra-uniform coatings of ALD A1 2 0 3 were observed on BN particles with a 
platelet shape as in Fig. 9 (Wank et al. 2004). Producing ALD thin film on other 
large quantity of nano-objects such as nanotubes and nanowires also faces the same 
challenges as to nanoparticles. The fluidized or rotary ALD reactor is needed to 
agitate these nano-objects to obtain efficient surface reactions (Lee et al. 2003). 

Polymer 

Because of the thermal stability of the polymer material, low-temperature deposi- 
tion is needed to coat thin film. ALD has produced thin film on polymers to alter its 
surface functional groups, to create unique inorganic/organic polymer composites, 
and to deposit gas diffusion barriers. 

As low as 33 °C as the deposition temperature for ALD A1 2 0 3 has been 
developed. The resulting thin film has very low surface roughness values, low 

o 

leakage currents, high dielectric constants, and growth rates in excess of 1 A/ 
cycle. These properties are comparable with the properties of ALD A1 2 0 3 films 
grown at higher temperatures of 177 °C. However, decreasing densities and 
increasing hydrogen concentrations were observed at lower temperatures. 

Since most polymers do not contain the necessary surface chemical species to 
initiate ALD, it has a different growth mechanism compared to ALD on other 
substrates. For ALD A1 2 0 3 on polyethylene using TMA and H 2 0 as precursors, the 
TMA precursor initiates the reaction by firstly diffusing into the polyethylene and 
then reacting with the subsequent exposure H 2 0 to form -AlOH species. These 
-AlOH species then serve as the bed to grow more -AlOH cluster in the near- 
surface region of the polyethylene. With more -AlOH clusters forming on the 
surface, they start to coalesce and cover the whole surface to stop the diffusion 
process. A normal A1 2 0 3 ALD process is then followed after that. This mechanism 
is illustrated in Fig. 10 (Wilson et al. 2005). 
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Fig. 10 The mechanism of ALD on polymer (Reprinted with permission from Wilson 
et al. (2005). © 2014 American Chemical Society) 


ALD thin film on polymer is mostly used to serve as the water barrier and gas 
barrier. Detailed discussion is in the application on Organic light- emitting diode 
(OLED). ALD A1 2 0 3 has also been effective as a capping layer and a gate dielectric 
for polymer-based transistors. Surface modification of natural fiber and woven 
fabric materials has utilized ALD A1 2 0 3 . 


Biological Template 

Low-temperature ALD also enables uniform coating on biological templates. One 
of the first demonstrations of ALD on biological templates was ALD Ti0 2 and ALD 
A1 2 0 3 on tobacco mosaic virus (TMV) and ferritin (Knez et al. 2006). The outer 
sheath of the virus is replicated by a thin film made of Ti0 2 . The duplication of the 
shape of butterfly wing by ALD A1 2 0 3 was also accomplished (Huang et al. 2006). 

Selective-Area ALD 

To realize the selective-area ALD, a mask has to tightly cover the surface and block 
the diffusion of the precursor gases to the protected areas. Lor example, the 
patterned self-assembled monolayer (SAM) is a very good mask layers because 
of the strong chemical bond between the monolayer and the surface. The SAM 
passivate the surface against the ALD growth, and ALD film is deposited only on 
areas without an SAM. Alkyl silane SAM on hydrogen-terminated silicon (Larm 
et al. 2008) and alkyl thiol SAM on metals (Larm et al. 2012) helped to form 
patterned ALD layer. Unreactive and thermally stable polymer films have also been 
used as mask layers for selective-area ALD (Gay 2010). A lift-off process is carried 
out to selectively remove the unwanted areas. 
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ALD Process and Equipment 
Process Classification 
Thermal 

Thermal ALD process is defined as using thermal energy to activate the surface 
ALD reactions. These surface chemistries need to have net negative heats of 
reaction. They occur spontaneously at various temperatures, for example, a growth 
per AB cycle of ~0.4 A from 150 °C to 600 °C for Ti0 2 ALD (Ritala et al. 1993) 
and 2. 2-2. 5 A from 100 °C to 160 °C for ZnO ALD (Yamada et al. 1997). These 
processes are referred to as thermal because they can be performed without the aid 
of plasma or radical assistance. 

The most common thermal ALD chemistry is binary metal oxides such as A1 2 0 3 , 
Ti0 2 , ZnO, Zr0 2 , Hf0 2 , and Ta 2 0 5 . Other common thermal ALD systems are 
binary metal nitrides such as TiN, TaN, and W 2 N. Thermal ALD systems also exist 
for sulfides such as ZnS and CdS and phosphides such as GaP and InP. 

Plasma ALD 

Plasma ALD has several advantages over thermal ALD and other vapor-phase 
deposition techniques. Plasma-enhanced ALD produces species with high reactiv- 
ity to the substrate surface, which results in lower reaction temperature and better 
film quality. The processing condition is more tolerable for the precursor and 
substrates. A wider range of thin-film materials can be produced. 

For single-element ALD materials, such as metals and Si, they are very difficult 
to deposit using thermal ALD processes. Plasma-enhanced ALD is able to accom- 
plish such task (Kim 2003). The radicals or other energetic species in the plasma 
help to trigger reactions that are not easy to occur with thermal energy alone. 
Plasma sources and hot tungsten filament have been used to produce hydrogen 
radicals. An H 2 plasma ALD was first demonstrated for Ti ALD (Rossnagel 
et al. 2000). Ta ALD is another ALD system that has been studied extensively 
using hydrogen radicals from H 2 plasmas. The precursors for Ta ALD are TaCl 5 
and hydrogen radicals (Kim et al. 2002). The surface chemistry for Ta ALD can be 
expressed as 


Ta + TaCl 5 TaTaCl 5 



TaCl 5 + 5 FI- — > Ta + 5HC1 (1 1) 

Firstly, TaCl 5 is attached to the surface. Subsequently, the hydrogen radicals 
reduce the Ta atoms and replace the chlorine from the surface. Because of the steric 
hindrance of the large TaCl 5 admolecule on the surface, the growth during Ta ALD 

o 

is only 0.08 A per AB cycle. The ALD Ta films show good Cu barrier property. But 
they are not conformal in trenches with a high aspect ratio of 40:1. Inside the 
trenches, the recombination reactions, between the highly reactive hydrogen 
radicals, occur before they can reach to the bottom of the trench (Grubbs and 
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George 2006). Radical recombination effect limits the general utility of plasma 
ALD in high-aspect-ratio structures. 

Other single-element metals like Pt can be made by plasma-enhanced ALD with 
(methylcyclopentadienyl)trimethylplatinum as the metal precursor (Knoops 
et al. 2009). In addition, plasma-enhanced Ru ALD has been accomplished using 
bis(ethylcyclopentadienyl)ruthenium and NH 3 plasma (Kwon et al. 2004a). These 
plasma-enhanced Ru ALD films have potential as adhesion layers for copper 
interconnects (Kwon et al. 2004b). 

The ALD of single-element semiconductors such as Si can also be deposited 
using SiH 2 Cl 2 and hydrogen radicals. Firstly, the substrate is coated with chlorine- 
containing silicon precursor such as SiH 2 Cl 2 . With enough thermal energy, H 2 and 
HC1 desorb upon SiH 2 Cl 2 adsorption, but SiCl 2 does not desorb from the silicon 
surface. The buildup of chlorine on the surface limits more adsorption of SiH 2 Cl 2 . 
The hydrogen radicals recombine with surface chlorine upon contact to desorb as 
HC1 or with other surface hydrogen atoms to desorb as H 2 . The hydrogen radical 
flux eventually removes all the surface chlorine. Studies of Si ALD using SiH 2 Cl 2 
and H radicals have demonstrated the self-limiting nature of Si ALD growth versus 
both SiH 2 Cl 2 and hydrogen radical exposures (Hasunuma et al. 1998). A Si ALD 

o 

growth per cycle of ~1.6 A was observed between 550 °C and 610 °C. At higher 
temperatures, the Si ALD growth per cycle increases as a result of Si CVD. Because 
of the incomplete surface reactions, at lower temperatures, the Si ALD growth per 
cycle decreased. 

One important advantage of plasma-enhanced ALD is that it can deposit films at 
much lower temperatures than thermal ALD. For example, plasma-enhanced ALD 
can produce A1 2 0 3 film using TMA and 0 2 plasma at temperatures as low as room 
temperature (Heil et al. 2006). This is very useful when the low deposition 
temperature is needed for the thermally sensitive substrates such as polymer 
(Langereis et al. 2006). It was found that the A1 2 0 3 films made by plasma-enhanced 
ALD have better electrical properties compared with thermal ALD A1 2 0 3 and lead 
to excellent passivation of silicon substrates (Hoex et al. 2006a). 

Plasma-enhanced ALD has also been primarily used to deposit metal nitrides, 
such as TiN and TaN, which generally require high temperature with thermal ALD. 
ALD TaN has been achieved using organometallic tantalum precursors such as 
tertbutylimidotris(diethylamido)tantalum (TBTDET) and H 2 radicals. TaN films 
made by plasma-enhanced process have much lower electrical resistivity and higher 
density than ALD TaN films grown using thermal TaN ALD (Park et al. 2001). 


ALD Reactor 

The first few generations of ALD reactors did not use a carrier gas. As seen in 
Fig. 11, the two precursors feed the ALD chamber which is connected to a pump. 
The arrows indicate the flowing direction. Valve #1 is first opened for precursor A 
to enter the ALD reactor to saturate the sample surface. Then precursor A is 
removed by switching off valve #1 and opening valve #3 to the pump reactor. 
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Fig. 1 1 ALD reactor design without carrier gas 
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Fig. 12 ALD reactor design with carrier gas 


After closing the valve #3, the same process is done to pressure B (controlled by 
valve #2) to expose sample to precursor B. Because of the long residence times of 
the precursors in the reactor, these exposures can utilize precursors very efficiently. 
However, the evacuation time for these ALD reactors is slow in the absence of a 
purge gas. 

In order to shorten the ALD cycle time, the carrier gas was then introduced 
(Fig. 12). The carrier gas is in viscous flow and flows continuously to the pump. 
Precursors can be mixed into the carrier gas stream if they have sufficient vapor 
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pressure. Alternatively, when they have lower vapor pressures, the carrier gas can 
flow over a solid or liquid precursor or through the liquid precursor. Because of the 
carrier gas, shorter residence time in the reactor is achieved. As shown in Fig. 12, 
showerhead is often used to distribute the gas evenly onto the sample. 

Most ALD reactors now operate with an inert carrier gas in viscous flow. The 
optimum pressure for viscous flow reactors is around ~ 133.3 Pa. This optimum 
pressure is a trade-off between gas interdiffusion and entrainment. The diffusion of 
precursor vapor in N 2 gas at 1 Torr is sufficient for removal of precursors and 
products from stagnant gas in ALD reactor in a reasonable time. The pressure of 
1 Torr is also high enough for the N 2 to be an effective carrier gas. At pressure lower 
than 1 Torr, the gas mean free paths are longer and entrainment of the stagnant gas 
is less effective. Synchronously modulated flow and draw (SMFD) was created then 
to optimize reaction time and purging time for ALD reactors (Sneh 2005). The 
synchronized modulation of the inert flowing gas between the reactor inlet and the 
reactor outlet enables high-speed gas flow switching, for example, <1 s for ALD 
cycle times for ALD systems such as A1 2 0 3 ALD. 

Other classification of ALD reactor is based on the temperature of the reactor’s 
wall. In “hot wall” reactors, the walls, gas, and substrates in the reactor are all 
heated to the temperature of the walls. In “cold wall” reactors, only the substrate is 
heated, and the walls remain at room temperature or are only warmed slightly. 

Because of the distinct advantages, plasma-enhanced ALD becomes increas- 
ingly important. Inductively coupled plasma (ICP) is frequently now used for 
plasma source. Plasmas usually operate at pressures of ~ 13-65 Pa. Plasma- 
enhanced ALD is not performed with an inert carrier gas during the plasma reaction 
cycle. However, the plasma reaction cycle may alternate with a conventional 
reactant ALD cycle using an inert carrier gas. A commercial plasma ALD reactor 
for 200 mm wafers was also recently described for TiN and Hf0 2 ALD (Fig. 13) 
(Kim et al. 2002). 

Special design has to be done to create fluidized particle ALD system. As seen in 
Fig. 14, during fluidization (Hakin et al. 2005), the upward force of gas to float the 
particles equals the downward force of gravity. A rotary ALD reactor (Fig. 15) that 
tumbles the particles in a porous metal cylinder has also been used to prevent 
agglomeration of the particles (Cavanagh et al. 2009). Compared to the fluidized 
bed reactor, the rotary reactor uses static exposures because the rotation of the 
reactor results in the constant movement of particles. All the special features of 
fluidized particle ALD systems aim to minimize the aggregation of particles while 
making core-shell structures. 

The gas flow for the wafer-based ALD reactors is more versatile compared to 
other ALD reactors. Some wafer-based ALD reactors use the “cross-flow” reactors 
which have parallel gas flows across the wafer surface. On the other hand, the 
“showerhead” reactor design introduces the gas into the reactor perpendicular to the 
wafer surface through a distributor plate. The gas then flows radially across the 
wafer surface. Modules for plasma-enhanced ALD and ultrahigh-aspect-ratio depo- 
sition are designed to be integrated to the wafer-based ALD reactor to create 
different thin films on various substrates. 
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Fig. 13 Plasma ALD reactor 
(Kim et al. 2002). © IOP 
Publishing. Reproduced with 
permission. A11 rights 
reserved 
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High-Throughput ALD Systems 


When the ALD technique was in the research and development phase for applica- 
tions, almost all investigations were carried out on single-wafer systems as all the 
experiments were focusing on the short overhead time and wafer turnaround time. 
Economic considerations such as throughput and chemical cost were yet as sec- 
ondary concern to device and process development. 

ALD is currently being used commercially to form metal/high-k gate oxide 
stacks for field-effect transistors and capacitors for dynamic random access mem- 
ory devices in semiconductor industry. It is also used by thin-film magnetic head 
industry to form gap dielectrics. Solar cell industries also utilize ALD to create 
uniform passivation layer. The buffer layers in CuInGa(Se,S) (CIGS) and moisture 
diffusion barrier layers for OLED also can be made by ALD to achieve better 
properties. These applications require high-throughput and low-cost production 
techniques to make them economically viable. Unfortunately, traditional ALD is 
considered as a slow process because of the long separation time of the precursor 
flows and the long saturation time for every cycle. However, two kinds of new ALD 
designs, batch ALD and spatial ALD, have been developed to increase the produc- 
tivity for ALD application for the industry. 


Batch ALD Reactor 

The reactors that can deposit on many samples simultaneously, opposite to the 
reactor to treat single wafer, are known as batch reactors. The batch ALD reactors 
have much bigger chamber than normal ALD reactor and can coat multiple samples 
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Fig. 14 Fluidized ALD reactor (Hakin et al. 2005. © IOP Publishing. Reproduced with permis- 
sion. A11 rights reserved) 


at the same time. The chemical cost and process time are dramatically reduced. 
Because of larger reactor volumes and higher surface area, the pulsing time (and 
exposure time) for every precursor and purging time constants are longer in batch 
reactors. However, the average cost is still more economic. 

As shown in Fig. 16 (Granneman et al. 2007), the precursor gases are introduced 
through a single-point injector at the top of the reactor and removed through an 
exhaust in the bottom of the reactor. The gases transport from top to bottom through 
convection, and the gas movement in between adjacent wafers is through diffusion. 
The number of wafers usually for a batch reactor is in the range 50-150. 

When thicker films are needed, the overall (single-wafer) process time becomes 
long, which makes it attractive to process 50-100 wafers in a large batch. When 
large volumes of identical sample are to be processed, batch tools are also attractive 
from an economic point of view. This is generally true for commodity product such 
as DRAM and/or flash memory devices. For example, of a 10 nm Hf0 2 ALD 
process, batch and single-wafer systems have throughputs of 16 and 7 wafers/h, 
respectively. Batch-type systems generally have longer uptimes than their single- 
wafer counterparts which translate into a lower cost of ownership. 
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Fig. 15 Rotary ALD reactor (Cavanagh et al. 2009. © IOP Publishing. Reproduced with 
permission. A11 rights reserved) 


Fig. 16 Illustration of batch 
ALD (Granneman et al. 2007) 
(Reprinted with permission 
from Kim et al. (2002). 

© 2014, American Vacuum 
Society) 
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Fig. 17 Picosun R-300 Advanced ALD with plasma generator and robot loader (left) and roll-to-roll 
module (right) 

It has to be addressed that for some cases the single-wafer ALD reactor is needed 
instead of batch ALD reactor. Some substrates are sensitive to air and need a 
protective film before ALD. Some substrates have extreme high-aspect-ratio fea- 
tures and need very long precursor exposure time to saturate. 

Roll-to-roll (R2R) process is to produce devices on rolls of the flexible material, 
such as polymer and metal film. Most of the R2R ALD employ spatial ALD 
principle, which will be discussed in the next section, but Picosun® has developed 
an R2R module to introduce the flexible film to the single-wafer ALD reactor 
(R200), as seen in Fig. 17. Work has been done to integrate this module to the 
batch-type ALD reactor. The detailed mechanism is still not disclosed, but from the 
open information, this module follows the traditional principle to grow ALD layer 
on the flexible substrate. The module with two reels is designed to transfer the 
flexible substrates into and out of the ALD chamber. The thickness depends on the 
total time of the flexible substrate inside the chamber and number of cycles within 
this period of time. In order to get thicker ALD layer, a slower reeling speed is 
needed with the same cycle duration. Generally speaking, the throughput is not as 
big as the R2R ALD employing the spatial principle. 

Spatial ALD Reactor 

The batch ALD system is not always compatible with the in-line processes which 
are often used in the applications mentioned before. Several “new” ALD schemes 
have been recently proposed and developed to meet this challenge. Many of these 
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Fig. 18 Schematic 
representation of the spatial 
ALD process with the 
position of the substrate 
( black line ) and precursors 
0 colored blocks) with respect 
to the process time (Reprinted 
with permission from Poodt 

et al. (2011). © 2014, 
American Vacuum Society) 



Time 



approaches employ spatial separation of the half reactions and can be summarized 
under the term “spatial ALD” (Poodta et al. 2012). 

As shown in Fig. 18, in spatial ALD, there is no pulse control, and the precursor 
gases are supplied continuously in different physical locations (Poodta et al. 2012). 
Thus, there are (at least) two zones where a half reaction can take place as the 
substrate is present. For every zone, the duration of the substrate exposure needs to 
be long enough to form a saturated monolayer. The substrate moves to both zones to 
complete one ALD cycle and form one ALD monolayer. The substrate still has to 
move through an inert gas zone in between the two precursor gas zones to clear the 
unreacted precursors. Alternatively, the substrate location is fixed and the gas 
supplies are moved, or some combination of the two. Thicker films are obtained 
by repeating the desired number of cycles. 

The main distinct feature of spatial ALD is the omission of the purge steps. Since 
there is no individual cycle time limitation, the deposition rate solely depends on 
the moving speed of the substrates between the two precursor zones. As a conse- 
quence, the deposition rate is drastically improved. Though the ultimate limit 
depends on the kinetics of the specific surface reaction involved, for most of the 
ALD process, the monolayer forms within the order of a few milliseconds when 
planar substrates are used. For substrates with high-aspect-ratio features, longer 
exposure times in each zone are needed, similar to conventional ALD. 

The most critical issue of spatial ALD reactor design is to avoid the contact 
between two precursors, which means decoupling the half-reaction zones. For 
conventional ALD, this is solved through the use of purge steps in between the 
precursor pulses. In spatial ALD, the precursor gases are separated by a combina- 
tion of physical barriers and continuously flowing purge streams. 

In spatial ALD, the complete separation of the reaction zones ensures that 
precursors never meet to produce CVD-like product. Therefore, unlike conven- 
tional ALD, there is no deposition on the reactor walls and only on the desired 
substrates. The precursor consumption and the cost of ownership can be reduced. 
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Fig. 19 Principle of a close proximity spatial ALD reactor concept (Reprinted with permission 
from Poodt et al. (2011). © 2014, American Vacuum Society) 


The vacuum system to the traditional ALD reactor is very bulky and limits the 
production of the system. The newest development of the spatial ALD design to be 
operated under atmospheric press is called “close proximity spatial ALD” which 
follows the concept as shown in Fig. 19 (Poodta et al. 2012). This approach ensures 
precursor separation by using a very close proximity of the substrate to the 
precursor injector (<100 pm). These precursor injectors create the half-reaction 
zones alternatively, and substrate moves underneath the injectors. Between and 
surround the precursor injectors, there are injectors for the inert gas to serve as the 
gas shield. Another important function of the gas shields is to act as gas bearings, 
facilitating virtually frictionless movement between injectors and substrate. By 
proper design and gas flow rate, the distance between the injector and the substrate 
is well controlled at the order of several tens of micrometers. These small gap 
heights also help to create excellent diffusion barriers. Furthermore, gas bearings 
completely seal off the reaction zone, which makes the reactor completely inde- 
pendent of the environment. 

Here three kinds of spatial ALD reactors are introduced to show the latest 
development in this area to improve the throughput and productivity. As seen in 
Fig. 20, two half-reaction zones are fixed in a round reactor head intercepted by two 
exhaust zones (Fig. 20a) (Poodta et al. 2012). Gas-bearing planes are designed to 
surround these zones. A rotating substrate table sits under the reactor head 
(Fig. 20b). Round substrates with diameters of 150 or 200 mm and (semi)square 
15.6 x 15.6 cnr wafers can be mounted on the substrate table. A convection oven is 
used to host the entire construction. The growth rates of 1 .2 nm s -1 can be achieved 
with ALD A1 2 0 3 , which also shows excellent surface passivation. This implies a 
revolutionary breakthrough in industrial throughput ALD of A1 2 0 3 passivation of 
silicon solar cells. 

Levitech®, a spin-off from ASM International, developed a spatial ALD tool, 
called Levitrack, for the passivation of silicon solar cells with A1 2 0 3 layers as in 
Fig. 21, top. Silicon solar cell wafers are transported in a straight track crossing 
areas with precursor gas inlets. Two gas bearings are used to support the wafers 
during transportation. They also protect the backside of the wafer against A1 2 0 3 
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Fig. 20 Schematic of the bottom side of TNO’s rotating spatial ALD reactor head (Reprinted with 
permission from Poodt et al. (2011). © 2014, American Vacuum Society) 


deposition. Wafers pass through the gas inlets in one direction. Every 4 gas inlets 
(TMA/N 2 /H 2 0/N 2 ) work as a group and produce one ALD layer of A1 2 0 3 . Every 
ALD Levitrack cell is composed of many groups of those gas inlets to produce one 
nanometer thick of A1 2 0 3 , and the total number of cells determines the final 
thickness of the ALD thin film. Wafers pass such a cell within 1 s which results 
in a throughput of 3,600 wafers/h. The gas flow rate was set to meet the track 
dimensions so that no gas mixes inside the track when empty or during processing 

o 

of (156 x 156 mm ) wafers. Heat transportation is very efficient through top and 
bottom wall. There is almost no TMA wasted because of the proper tuning of the 
TMA vapor pressure. 

Levitrack reactor (as shown in the bottom of Lig. 21) (Poodta et al. 2012) 
produces passivation layers on 156 x 156 mm wafers which are compatible 
with existing industrial technologies (optimized for SiN x and SiO x passivation). 
Superior results were obtained with the ALD A1 2 0 3 layers with thicknesses in 
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Fig. 21 In-line wafer spatial 
ALD systems with two 
double-sided gas bearings 
used by Levitech (Reprinted 
with permission from Poodt 

et al. (2011). © 2014, 
American Vacuum Society) 




the range 5-10 nm. The highest efficiency obtained for n-type crystalline silicon 
cell was 18.3 %. 

The first two spacial ALD reactor are designed for the wafer samples and there 
are great need for ALD thin film on flexible substrates. TNO developed a roll-to-roll 
spatial ALD reactor (Omega 300) for flexible substrate. It consists of a central drum 
that contains combinations of TMA and water nozzles (Fig. 22, left side) (Autho- 
rized material from TNO). These nozzles are separated and surrounded by nitrogen 
gas bearings. The gas bearing helps to maintain the short separation (typically tens 
to hundreds of micrometers) between the flexible film and the drum when the film is 
pulled through and isolate the system with outside environment. Therefore, parti- 
cles from the CVD-like process and outside environment are blocked from con- 
taminating the film, which leads to pinhole-free surfaces. 

The total thickness is determined by the rotation frequency of the drum and the 
pulling speed of the film. When the drum stands still, the thickness is constant since 
the number of the precursor zone pairs is the same. The thickness increases as the 
drum is rotated in the opposite direction of the movement of the film. As the pulling 
speed of the film increase, the reaction time decrease and the thickness reduces. The 
throughput is controlled by the pulling speed of the film. The high deposition rate is 
achievable with 50 nm ALD A1 2 0 3 film at 3 m/min pulling speed. The maximum 
width for the flexible film this machine can take is 300 mm. 
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Fig. 22 The structure illustration of central drum (left side) of Omega 300 and its actual machine 
(right side) with gas delivery and wheels assemble (Courtesy of TNO the Netherlands) 


ALD Applications 

As stated before, the application of ALD in industry right now is limited within the 
semiconductor and solar cell areas. Because of the superb quality of ALD thin film, 
its applications on other areas are very promising. Here in this section, the areas of 
potential usage of ALD are introduced. 
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Semiconductor 

Applications in semiconductor industry have been the main driving forces for the 
recent development of ALD technologies. The International Technology 
Roadmap for Semiconductors (ITRS) has included ALD for high-dielectric- 
constant gate oxides in the MOSFET structure and for copper diffusion barriers 
in back end interconnects. Ultrathin high-K dielectric films with precise control of 
the thickness are a major bottleneck for the semiconductor industry. Traditional 
CVD and PVD processes are not capable to realize such task. The unique 
capability of ALD to engineer composite films helped to solve this problem 
(Sneh et al. 2002). 

Most semiconductor ALD applications aim at the thickness range from 20 to 

o 

200 A. With the help of batch and spatial ALD, industry standard throughput of 
10-20 wafers/h/module is not very difficult to achieve. Temperature uniformity is 
another key requirement because the deposition rate depends on temperature. 
Right now the temperature control across the wafer by the batch-type ALD 
is good enough to have the max thickness inhomogeneity in wafer and in batch 
within 1 %. 


MEMS 

The ongoing reduction in the dimensions of MEMS devices and components 
dramatically increases the reliability requirement of every part within. The existing 
techniques such as CVD and PVD are not able to meet the requirement for some 
MEMS devices. ALD is a very promising replacement process to accomplish such 
goal since the coatings made by ALD are usually denser and smoother and with less 
defects. Researchers have used ALD process to build multiple parts of MEMS such 
as protective layers for biocompatible coating, high-dielectric-constant layers in RF 
MEMS, low-temperature insulating layers, lubrication layer of moving parts, or 
controlled gap filling (de Groot et al. 2009). Since ALD process can be easily 
integrated into the standard CMOS processing, there are not much obstacles for 
commercial MEMS fabrication lines to use it in the future. 

For example, the sliding elements of MEMS devices are prone to have wear 
failures as they were rubbed constantly. A protective interface to MEMS structures 
helps to reduce such failures. It has been shown that with a 10-nm-thick A1 2 0 3 
coating on polysilicon structure via ALD (Mayer et al. 2003), less wear particle is 
generated than a native oxide layer in a MEMS microengine. 

TiN can be used as diffusion barrier layers for through- silicon vias (TSV) in the 
MEMS industry, but the high deposition temperature (>300 °C) in conventional 
CVD or PVD processes limits its application in this field. PEALD can produce 
high-quality TiN film at low temperature (<200 °C) which is compatible with 
MEMS fabrication processes. The surface morphology, stress, electrical, and wet 
etching properties of the thin film are well satisfied the requirement of the barrier 
layer of as a ruthenium (Ru) preseed liner (Samal et al. 2013). 
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Corrosion and Wear Resistant 

The ability of ALD thin film to cover high-aspect-ratio surface without pinhole 
enables its excellent corrosion-resistant capability. For the other coating methods, 
such as CVD, PVD, and sol-gel, the anticorrosion coating cannot uniformly cover 
the entire surface, and the defects are almost inevitable. Since every part of the 
surface can be covered with conformal and almost defect- free film, there is no weak 
point against corrosive species in an ALD process part. For example, the conformal, 
adherent, and defect-free Ti0 2 /Al 2 0 3 coatings by ALD strongly improve the 
corrosion resistance of stainless steels in chloride-containing media. At least two 
orders of magnitude the passive current density decreases, and breakdown poten- 
tials shift toward more positive values (Marin et al. 2011). 

The metal oxide layer made by ALD is also used for solid lubrication purpose 
since, compared to bulk oxide ceramics, the nanocrystalline oxide ceramics have 
better mechanical and tribological properties. It has been shown that oxide 
nanolaminates (Zn0/Al 2 0 3 /Zr0 2 ) made by ALD on porous carbon-carbon com- 
posites (CCCs) and graphite foams have better thermal stability and sliding wear 
resistance (Mohseni and Scharf 2012). A significant reduction in the sliding wear 
factor (2.3 x 10 -5 to 4.8 x 10 -6 mm 3 /Nm) and friction coefficient (0.22-0.15) 
were observed with the ALD nanolaminate in comparison to uncoated CCCs. 


Optical Applications 

Metal fluoride thin films have excellent light transmission in a broad wavelength 
region and have been used for many optical applications. Mainly physical vapor 
deposition (PVD) method has been used to deposit these metal fluoride films before. 
However, the main problems of evaporated fluoride films are their porous nature 
and low packing densities. The thickness control of the multilayer stacks is also 
very challenging. The intrinsic advantages of ALD can solve these issues and 
produce high-quality films. 

The metal precursors to produce ALD metal fluorides are metal-thd (2, 2, 6, 6- 
tetramethyl-3,5-heptanedione) compounds. HF, by thermally decomposing NH 4 F, 
was used as the fluorine precursor as in ALD for depositing CaF 2 , ZnF 2 , and SrF 2 
films. However, HF is not an ideal choice for ALD because in addition to its unsafe 
chemical nature, it etches glass and many oxides. TiF 4 is used then to improve the 
quality and productivity in ALD (Pilvi et al. 2007). 


Organic light-emitting diode (OLED) 

Most organic devices are extremely sensitive to moisture, and its moisture-driven 
degradation can be only prevented by proper encapsulation. OLED encapsulation 

c o 

has rigorous requirement of the water vapor transfer rate (<10 g/nT/day). 
However, flexible polymer substrates are highly permeable to all atmospheric 
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gases. The water and gas diffusion barrier quality is directly responsible for the 
quality and lifetime of products with polymer materials as substrates. The encap- 
sulation of OLEDs today remains heavily dominated by relatively simple cover 
glass technologies using epoxies to seal the edges. But OLED device market is 
changing to larger and flexible format and has longer lifetime requirements. The 
existing encapsulation methods are not suitable for future development. Thus, the 
industry is shifting toward simpler, better, and less expensive barrier technologies 
in order to unlock the huge market potentials. 

For example, ALD A1 2 0 3 film as thin as 10 nm on polyethylene naphthalate 
(PEN) and Kapton reduces the water vapor transmission rate (WVTR) over three 

O A O 

orders of magnitude to 10 -10 g/m / day (Carcia et al. 2006). Even lower 
WVTRs were achieved by bilayer or multilayer barriers using ALD A1 2 0 3 and 
SiN by plasma-enhanced CVD (Groner et al. 2006), ALD A1 2 0 3 and rapid ALD 
Si0 2 (Carcia et al. 2009), and ALD A1 2 0 3 and ALD Zr0 2 (Dameron et al. 2008). 
ALD offers the potential for deposition of pristine, pinhole-free thin layers of 
encapsulation material with excellent barrier performance which could provide 
the solution to the requirement of the OLED industry. 


Lithium Ion Battery 

The capacity fading and potential safety risks are the major challenges for lithium 
ion batteries (LIBs). The origins of these challenges lie in the unwanted side 
reactions during charge-discharge cycles, which cause electrolyte decomposition, 
solid electrolyte interphase formation, and active material dissolution. Surface 
modification of LIB electrodes is a viable strategy to address these issues. 

The solution-based methods are unable to modify prefabricated electrode surface 
evenly. ALD was demonstrated to be highly effective in enhancing the electrochem- 
ical performance of LIB electrodes, for both the cathode and the anode. It is a cost- 
and time-saving coating technique for LIBs. A number of coating materials (A1 2 0 3 , 
Ti0 2 , ZnO, and TiN) were investigated (Meng et al. 2012). The ultrathin ALD films 
down to subnanometers on the electrodes are uniform and conformal with very facile 
operation. They can be successfully deposited at low temperatures (typically less than 
200 °C with possibility of going down to RT). Specifically, ALD thin films serves to 
fasten micro- and nano-sized particles to electrodes, protect metal surfaces against 
diffusion and corrosion, tailor the composition of porous material surface passivation, 
and passivate electrode surface in general. 


Solar Cell 

ALD has unique features which can be used to face processing challenges in 
different types of solar cells. The uniformity and the pinhole-free nature make the 
ALD an attractive approach to deposit CdS-free buffer layers in CulnS 2 (CIS) and 
Cu(ln,Ga)S 2 (CIGS) thin-film solar cells. For a-Si:H solar cells, ALD is also used to 
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make ALD ZnO thin film as front contact. For c-Si solar cell, ALD produces 
excellent surface passivation layer made of A1 2 0 3 films on both low resistivity 
n-type and p-type Si (Hoex et al. 2006b). 

ALD is also used to make ZnO nanotubes and nanowires for dye- synthesized 
solar cells. The ALD A1 2 0 3 passivation film also helps to improve the efficiency of 
the dye-synthesized solar cells. Since only ultrathin films are needed and the 
emerging of the batch ALD and spatial ALD reactors, the solar cell manufacturing 
is adopting this method to make the passivation layer. 


Packaging Material 

Biopolymers (modified natural polymers and biodegradable synthetic polymers of 
bio-based monomers) have been considered to be the environmentally friendly 
solution for packaging materials in the future to replace the nonbiodegradable 
polymers like polystyrene, polypropylene, and polyethylene terephthalate. How- 
ever, similar to OLED, poor barrier properties, especially of natural polymers, and 
sensitivity to moisture remain to be the block to wider use of these materials. One 
way to improve the barrier properties of biopolymers is to coat them with a thin 
inorganic layer. Suitable depositing methods need to deposit this thin film at very 
low temperature but with low thickness and high quality. Low-temperature ALD is 
able to achieve this goal. The developed coatings with ALD on biodegradable 
polymers have excellent gas and water vapor permeation resistance (Hirvikorpi 
et al. 2010), which reached the level required in commercial packaging applications 
for dry food and pharmaceuticals. 


Summary and Future Development 

From the presented overview of the research and development on ALD processes, it 
is clear that ALD is a technique of high interest for anyone who wants a high- 
quality thin film. The unique depositing mechanism ensures the uniform and 
pinhole-free thin films. Because of the layer-by-layer deposition nature, the crys- 
talline property is much improved which results in better chemical, thermal, 
electrical, and mechanical performance. ALD builds up thin film not from 
gas-phase reactions but through surface reactions, and its reaction speed is con- 
trolled mostly by the gas diffusion speed. Therefore, even on very high-aspect-ratio 
structure surfaces such as nanoparticles, nanowires, and 3D structures, ALD is able 
to produce uniform and conformal coating. Plasma-enhanced ALD uses plasma to 
activate precursor molecules to achieve higher reactivity on the surface, which 
reduces the temperature to form thin film. It enables deposition of ALD thin films 
on thermally sensitive substrates such as biodegradable polymer and biological 
template. ALD is used to produce various inorganic materials ranging from metal 
oxide, nitride, sulfide to fluoride. Metal thin film is also obtained by ALD with 
reducing agent used during the process. 
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The development of ALD reactors is another focus of ALD research. The 
introduction of the flow gas to the ALD reactor marks the first productivity leap 
of. The integration of the plasma power sources to ALD reactors versatile the usage 
of ALD and greatly shortens the gap the real application of ALD system. The first 
application of ALD system in industry occurs in producing high-quality insulating 
layer for semiconductor and solar cell sections. Because of the high equipment cost, 
the relative low throughput hindered the expansion of the ALD to other industry 
field. This driving force pushed the emergence of the recently high-throughput 
reactor designs such as batch ALD reactor and spatial ALD reactor. Other deriva- 
tive like roll-to-roll ALD reactor is also invented to meet the needs of mass 
production. 

Increasing the versatility and productivity is the main theme for future develop- 
ment of ALD systems. To increase the versatility, researchers in ALD field focus on 
growing different functional thin films to different substrates for novel applications. 
To increase the productivity, more exploration is needed to coat faster and on 
bigger substrates. The research and development of batch ALD and spatial ALD 
system are going to be paid more attention too. 

New energy exciting methods are equipped to ALD to promote the surface 
reaction. Since the conventional low-pressure plasmas cannot be used for the spatial 
ALD, atmospheric pressure plasma is a logical choice to be used to the spatial ALD 
in the future. The exciting methods used now include thermal and plasma energy. 
Photons, laser, and microwave can be the next exciting method too for ALD 
application. The research on the precursor for ALD is also crucial as the surface 
chemistry determines the ultimate speed of ALD process. More user-friendly pre- 
cursors are needed for ALD process in the future for new application. 

ALD is capable to produce superior property thin him with precise control, and it 
provides an alternative solution other than traditional CVD and PVD for special 
coating needs. The industry applications of ALD are going to be expanded from 
existing area like semiconductor and solar cell to other fields like oil and gas, 
marine, precision engineering, medical device, food processing, optical device, and 
others. With the continuous demands in top-quality coating for products from every 
aspect of the life, ALD will become more and more popular. 
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Abstract 

Integration of high-quality functional thin layer of oxides on semiconductor, in 
particular wide-bandgap silicon carbide, is of extreme importance in order to 
realize near future generation of metal-oxide-semiconductor (MOS)-based 
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devices for high-power, high- temperature, and/or high-radiation applications. 
Although nitrided Si0 2 on SiC produced acceptable results, limitations and 
issues have been reported. Therefore, evolution and justification of changing 
this type of oxide to high dielectric constant oxide (high-K) on SiC are being 
reviewed. This chapter presents the current understanding of simultaneous 
thermal oxidation and nitridation of sputtered Zr- semiconductor interfaces as 
the most promising technique for achieving device-quality interfaces required 
for commercial applications. It is mainly focused on the technological methods 
of producing oxidized/nitrided Zr on SiC. An exceptional section is devoted to 
the recent developments of nitrided high-K gate dielectrics on SiC. It starts with a 
detailed discussion of high-K gate dielectric characteristics and the current 
knowledge of simultaneously oxidized and nitrided Zr film as high-K dielectric 
on SiC. Via this technique, the role of N 2 0 gas ambient on oxidizing and 
nitriding Zr film on SiC, coupling with physical and electric characteristics of 
oxidized/nitrided Zr film on SiC, is discussed. A growth mechanism of simulta- 
neous thermal oxidation and nitridation of Zr film on SiC is subsequently 
presented. Finally, the properties of oxidized/nitrided Zr thin films based on Si 
and SiC substrates are compared. 

Introduction 

The advent of technology boom has motivated the use of wide-bandgap (WBG) 
semiconductors as alternative substrates to replace silicon for high-power, high- 
temperature, and/or high-radiation metal-oxide-semiconductor (MOS)-based 
switching devices (Dimitrijev et al. 2004). Of WBG semiconductors, silicon carbide 
(SiC) has developed into one of the leading contenders due to its commercial 
availability and ability to grow native oxide (Si0 2 ) (Dimitrijev et al. 2004). SiC 
exists in different polytypes (Casady and Johnson 1996). 4H-SiC is one of the 
polytypes that provides a wide bandgap of 3.26 eV, high breakdown field strength 

I r ~j i 

of ~3 MV cm , high saturation electron drift velocity of ~2 x 10 cm s , and high 
thermal conductivity of ~3.7 W cm -1 °C (Afanas’ev et al. 2004). These properties 
promisingly enable operation of SiC-based devices under harsh conditions. 

In order to realize next-generation SiC-based MOS devices, a high-quality gate 
oxide must be deposited or grown between a metal electrode and the semiconductor 
substrate to sustain a high transverse electric field and a low gate leakage current in 
the devices (Cheong et al. 2008a). To date, thermally nitrided Si0 2 is regarded as 
the best gate oxide on SiC due to its low interface and slow trap densities and high 
reliability (Jamet and Dimitrijev 2001; Jamet et al. 2001; Cheong et al. 2003, 2007, 
2008b, 2010). Nonetheless, there are problems associated with the Si0 2 thin film or 
gate based on SiC. In SiC-based MOS devices, the 1ow-k value of 3.9 in Si0 2 
compared to SiC with k value of 10 limits the permissible electric field in SiC-based 
devices (Lipkin and Palmour 1999). As a result, the gate oxide may electrically 
break down prior to the SiC substrate, thus defeating the purpose of using SiC as the 
primer substrate for high-power and high-temperature applications. In short, the 
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integration of high-K oxides on SiC is to lower the electric field being imposed on 
the gate oxide itself. 

Although nitrided Si0 2 on SiC produced acceptable results, limitations and 
issues have been reported. Evolution and justification of changing this type of 
oxide to high dielectric constant oxide (high-K) on SiC and recent development of 
nitrided high-K gate dielectrics on SiC are being reviewed. The recent develop- 
ments of nitrided high-K gate dielectrics, in particular on nitride Zr0 2 , on SiC, with 
a detailed discussion of high-K gate dielectric characteristics and the current 
knowledge of simultaneously oxidized and nitrided Zr film as high-K dielectric on 
SiC are presented. Through this technique, the role of N 2 0 gas ambient on oxidiz- 
ing and nitriding Zr film on SiC, coupling with physical and electric characteristics 
of oxidized/nitrided Zr film on SiC, is reviewed. The final section presents a growth 
mechanism of simultaneous thermal oxidation and nitridation of Zr film on SiC. 


Nitrided High-K Dielectric on SiC 

Up-to-date, nitrided Si0 2 on SiC is regarded as the first choice gate dielectric owing 
to its acceptable low interface and slow trap density, high reliability, and low 
leakage current (Dimitrijev et al. 2004; Jamet and Dimitrijev 2001; Jamet 
et al. 2001; Cheong et al. 2003; 2010; Wong and Cheong 2012a). Nevertheless, 
there are woes associated with nitrided Si0 2 as gate dielectric based on SiC, 
namely, insufficient stability at elevated temperature and high electric field (Lipkin 
and Palmour 1999). The low dielectric constant (k) value of nitrided Si0 2 of 3.9, as 
compared to SiC with k value of 10, could cause oxide breakdown and reliability 
issues. As stated by Gauss law (Lipkin and Palmour 1999; Wong and Cheong 
2012a): 


E 


OX 


K 


ox 


EsiC * K SiC 


( 1 ) 


The high electric field strength in SiC of ~3 MV cm 1 would give a high electric 
field of ~7.69 MV cm -1 in Si0 2 , in which it is a multiple of ~2.56 times higher 
electric field being imposed on the gate oxide than on the SiC. As a result, the 1ow-k 
gate oxide may electrically break down much earlier than the SiC substrate, thus 
diminishing the purpose of using SiC as the potential substrate for high-power, 
high- temperature, and/or high-radiation applications (Wong and Cheong 2012a). 
Therefore, in order to lower the electric being imposed on the gate oxide, super- 
seding the 1ow-k nitrided gate oxide with high-K nitrided gate oxide on SiC is an 
approach to circumvent the problem (Choyke et al. 2004). 


High-K Gate Dielectric Characteristics 

Unlike Si-based devices, downscaling of device dimensions is not the driving force 
for development of SiC-based power devices. Instead, the motivation for the 
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investigation of alternative high-K dielectrics on SiC is to extend the performance 
capabilities in high-power, high- temperature, and/or high-radiation applications. 
To select a promising high-K material to replace Si0 2 as dielectric based on SiC, 
some criteria in terms of dielectric constant, band alignment, interface quality, and 
thermodynamic and kinetic stabilities need to be satisfied (Robertson 2004; Wong 
and Iwai 2006; Wilk et al. 2001a; Wong and Cheong 2010). Different deposition 
techniques produce different characteristics and quality of high-K gate dielectric. 
By taking an example of Zr0 2 , as a potential high-K gate dielectric, the crucial 
general characteristics and quality of Zr0 2 thin film produced by different deposi- 
tion techniques will be elucidated. 


Dielectric Constant 

The k value is the first criteria to be taken into account to select a material for gate 
dielectric application. An MOS structure can be modeled as a parallel -plate capac- 
itor. Hence, its capacitance is governed by the dielectric constant of the oxide layer 
and can be related by the following equation (Robertson 2004; Wong and Iwai 
2006; Wilk et al. 2001a; Wong and Cheong 2010): 



where C ox represents capacitance of oxide layer, s 0 is permittivity of free space 
(8.85 xlO Tm ), A is area of capacitor, and t ox is thickness of oxide layer. 

As mentioned earlier, Gauss law (Eq. 1) demonstrates that the electric field 
scales with the materials dielectric constant and requires that E • k be in equiva- 
lence at the interface of two adjacent materials. When effective electric field in Si0 2 
is ~2.56 times higher than that in SiC, consequently, the SiC substrate cannot be 
deployed to its maximum potential, deteriorating its advantage in high-power, high- 
temperature, and/or high-radiation applications. Alternatively, a material with k 
value equal to or higher than that of SiC is desirable. According to Lipkin 
et al. (1999), E 0 • k is a critical measure of a materiaTs applicability for high- 
power devices, where E 0 is the highest safe operating field for the dielectric. 
However, E 0 is typically limited to about 20 % of the critical field by electrons 
tunneling from the substrate into the gate dielectric. The safe operating field in Si0 2 
is less than 2 MV cm -1 , which corresponds to a field in SiC of 0.78 MV cm -1 , 
about 3.85 times less than its avalanche breakdown field. A device’s blocking 
voltage scales with electric field squared, thereby resulting in a tenfold drop in 
blocking voltage capability owing to the limitations of the thermal Si0 2 . Therefore, 
as a promising candidate for gate dielectric application, a material must have a 
higher k value, preferably greater than 10. 

Numerous researchers across the world are intensively looking for an alternative 
oxide with high k properties for gate dielectric to supersede Si0 2 in Si-based MOS 
devices. As the device dimensions scale down, it is necessary to reduce the 
thickness of the gate oxide in order to maintain the capacitance. However, electron 
tunneling increases exponentially as the oxide thickness reduces. Superior proper- 
ties of Si0 2 allowed the fabrication of properly working MOS devices with Si0 2 
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gate layers as thin as 1.2 nm (Wong and Cheong 2010; International Technology 
Roadmap for Semiconductors 2011); thinner than that will be problematic, for 
instance, (1) the occurrence of relatively huge leakage current through the oxide, 
due to direct tunneling of carriers, is highly possible; thus, the device performance 
will be substantially degraded; (2) difficulty in fabricating such a thin layer of film; 
and (3) reliability of Si0 2 films against electric breakdown declines in thin films 
(Robertson 2004; Wong and Iwai 2006; Wong and Cheong 2010). Hence, the use of 
high-K gate oxide enables a physically thicker dielectric layer to attain the similar or 
better MOS characteristics that are equivalent to the Si0 2 . Referring to Eq. 2, it tells 
that, in order to attain the same capacitance for the same area, the ratios of the 
dielectric constants to the thicknesses of two materials must be equal. In MOS 
structure, all dimensions scale proportionally without affecting the electric designs 
(Robertson 2004; Wong and Cheong 2010). Therefore, it is convenient to define an 
equivalent oxide thickness (EOT), which allows direct comparison of high-K oxide 
and Si0 2 films based on their dielectric constants (Wilk et al. 2001a; Wong and 
Cheong 2010): 


EOT = t high . K = ?sio 2 ) (3) 

where t hlgh _ K and tsio 2 are the thicknesses of high-/c dielectric material and Si0 2 , 
respectively, while and KsiOo are the dielectric constants of the high-K 

dielectric material and Si0 2 , respectively. In short, a hypothetical high-K oxide 
film with a much thinner EOT can be achieved. Although the aforementioned 
principles are commonly applied in Si-based MOS technology, they can be appli- 
cable for SiC-based MOS devices as well and serve as an essential platform to the 
understanding of the potential of high-K materials. Though many transition metal 
oxides offer significantly higher k values than Si0 2 , however, not every high-K 
oxide can be employed as dielectric in MOS device. If the k value of a material is 
too high, for example, Ti0 2 , which offers its k value of 80 (Robertson 2004; Wong 
and Iwai 2006; Wilk et al. 2001a; Wong and Cheong 2010), it will cause fringing 
field from the drain through the gate dielectric. This fringing field can deteriorate 
the source -to- channel potential barrier, thus giving undesirably poor subthreshold 
performance (Wong and Iwai 2006; Wong and Cheong 2010; Mohapatra 
et al. 2002). As a promising candidate for gate dielectric application, Zr0 2 provides 
high enough k value of about 22-25 (Robertson 2004; Wong and Iwai 2006; Wilk 
et al. 2001a; Wong and Cheong 2010, 2011a, b, c, 2012b; Kurniawan et al. 2011; 
Wong and Cheong 2012c), to be used for a reasonable number of years of scaling. 

Band Alignment 

A potential candidate for high-K dielectrics must have large band offsets. Band 
offset between an oxide and semiconductor substrate can be defined as the barrier 
for holes or electrons to be injected into the oxide, and it is closely related to 
bandgap of a material, whereby a material with larger bandgap will correspond to 
a larger band offset (Wong and Cheong 2010; Robertson and Peacock 2004). 
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With large band offsets (high barriers) at the oxide/semiconductor interface, the 
carrier generation and conduction can be minimized (Wilk et al. 2001a; Wong and 
Cheong 2010), thus suppressing the leakage current (Wong and Iwai 2006). According 
to the periodic table of elements that have been known, as the atomic number of an 
element increases, the ionic size (radius) of an element increases but the ionic bonding 
force (cohesive force) decreases. This leads to a high dielectric constant but a narrow 
bandgap (Wong and Cheong 2010; Boer 2002). Subsequently, material with higher k 
value will possess undesirable smaller band offset value. Based on Clausius-Mossotti 
equation, the inverse relationship between energy bandgap and dielectric constant can 
be depicted (Wong and Cheong 2010; He et al. 1999): 

E ‘ * 20 (jTi) 2 (4) 

where E g defines energy bandgap and k defines dielectric constant value. 

Integration of gate dielectric in Si-based MOS devices requires valence and 
conduction band offsets at the oxide/semiconductor interface of larger than 1 eV to 
minimize carrier injection into its bands thus reducing leakage current (Wong and 
Cheong 2010). If this criterion is applied to 4H-SiC, with its E g of 3.26 eV 
(Afanas’ev et al. 2004; Wong and Cheong 2012b; Kurniawan et al. 2011), by 
assuming a symmetric band alignment, the high-K oxide bandgap must be of 
minimum 5.3 eV. 

Zr0 2 has reasonably high band offsets of 5. 8-7. 8 eV (Robertson 2004; Wong 
and Cheong 2010; Wilk et al. 2001b), with its electron offset or conduction band 
offset (AE C ) values of 1.3-2. 3 eV, by assuming a symmetric band alignment. 
Hence, Zr0 2 , with A E c > 1 eV, can avoid carrier generation (electron transport), 
either from enhanced Schottky emission, thermal emission, or tunneling, thus 
minimizing leakage current (Wong and Cheong 2010). Figure 1 illustrates a simple 
schematic of bandgap and band offsets of Zr0 2 by assuming a symmetric band 
alignment carrier injection mechanism in its band states. 

Interface Quality 

In an MOS device, charge carriers induced by the metal gate electrodes are 
significantly influenced within the nanometers of the interface of the oxide and 
the semiconductor (Robertson 2004; Wong and Cheong 2010). This has pointed 
that the quality of the interface of the oxide and the semiconductor is the determin- 
ing factor for carrier mobility, as well as device functionality. Hence, the oxides 
must form a good quality interface with semiconductor, with minimal or no 
interface states or defect states within the semiconductor bandgap (Robertson and 
Peacock 2004). With a good quality of interface, an MOS device can perform very 
well electrically as the current can flow in the semiconductor channel next to the 
interface . 

To date, exponentially higher interface trap densities have been observed at 
energy levels beyond the Si bandgap yet within the SiC bandgap. It would be 

i i o i 

desirable to achieve interface trap density (D it ) of less than 10 cm ~ eV for 
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Fig. 1 Simple schematic of 
bandgap and band offsets of 
ZrC>2 and carrier injection 
mechanism in its band states. 
CB conduction band, VB 
valence band 



\ 





5. 8-7. 8 eV 





high-K gate dielectric based on SiC MOS devices. As compared to the state of the 
art for Si0 2 on SiC, gains in field strength would make the high-K oxide, for 
example, Zr0 2 , a preferable dielectric for SiC power devices. 


Thermodynamic and Kinetic Stabilities 

Basically, the objective of thermodynamics is to provide a description of a system 
of interest (high-K oxide/SiC) in order to investigate the nature and extent of 
changes in the state of that system as it undergoes spontaneous change toward 
equilibrium and interacts with its surroundings. On the other hand, the goal of 
kinetics is to describe the rate of a reaction to reach equilibrium, which means input 
of energy (sufficient activation energy) is necessary to enable the reaction to 
proceed in the forward direction (turning reactants to products) (Wong and Cheong 
2010 ). 

Ideally, thermodynamic stability is achieved if there is no reaction between 
high-K oxide and SiC under equilibrium condition to form an undesirable interfacial 
layer. On the contrary, if there are interface reactions between the oxide and the 
semiconductor substrate driven by thermodynamics, these reactions will lead to 
substrate oxidation and the formation of hybrid compounds, which are generally 
detrimental to interface quality and device performance. The interfacial layer, in 
general, has a lower k value than that of the bulk oxide due to the formation of 
silicates. A high-K oxide is considered as kinetically stable on SiC if it is able to 
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Table 1 The properties of 
Zr0 2 compared with other 
high-K oxides 


Oxides 

Dielectric constant (k) values 

Bandgap (E g ) 

ai 2 o 3 

9-10 

8.8 

Ce0 2 

26 

6.0 

Gd 2 0 3 

12 

5.0 

Hf0 2 

20 

5. 6-5. 7 

La 2 0 3 

27 

5.8 

Pr 2 0 3 

26-30 

3.9 

Y 2 o 3 

15 

6.0 

Zr0 2 

22-25 

5. 8-7. 8 


Data from Wong and Iwai (2006), Wong and Cheong (2010), Lim 
et al. (2010), and Nigro et al. (2006) 


perform operation at high temperature and withstand the rigors of MOS device 
processing, which typically requires temperatures of 900-1,000 °C for drive-in 
annealing, field oxidation, and dopant activation. The high-K oxide must not 
degrade or react with the substrate. In addition, for high-temperature applications, 
there must be minimal impurity diffusion or electric conduction through the oxide 
during prolonged use. Of several high-K oxides, Zr0 2 is stable with underlying Si 
and 4H-SiC substrates up to a certain high temperature (900-1,000 °C), thus 
making it to be considered as a potential candidate. Table 1 demonstrates the 
properties of Zr0 2 as a potential candidate compared with other high-K oxides. 


Deposition Techniques 

Several thin-film deposition techniques have been investigated for the growth of 
gate oxide, such as thermal evaporation, electron beam evaporation (EBE), pulsed 
laser deposition (PLD), cathodic arc deposition, atomic layer deposition (ALD), 
metal-organic chemical vapor deposition (MOCVD), and sputtering. 

Sputtering is the preferred deposition technique of high-quality gate oxide on Si 
and SiC. This deposition technique involves ejection of material from the surface of 
a solid due to the momentum exchange associated with surface bombardment by 
energetic particles (Wong and Cheong 2010; Dwbrowski and Mussig 2000). A 
source of deposited material, either Zr or Zr0 2 target, is placed into a vacuum 
chamber along with the substrates, and the chamber is evacuated to a pressure 

A r ~l 

typically in the range 5 x 10“" to 5 x 10 Torr. The bombarding species are 
generally ions of a heavy inert gas. Argon is most commonly used. Direct current 
(DC) sputtering and radio frequency (RF) sputtering are two most common modes 
in sputtering. DC sputtering is generally used when the target material is a good 
electric conductor, for instance, Zr target. RF sputtering is used when the target 
material is a poor conductor or an insulator, for example, Zr0 2 target (Wong and 
Cheong 2010). 

Sputtering offers high homogeneity (Wong and Cheong 2010; Ben Amor 
et al. 1998), good uniformity (Wong and Cheong 2010; Ben Amor et al. 1998), 
and low impurity contents (Wong and Cheong 2010; He et al. 2005) in the sputtered 
film. Additionally, it favorably offers high deposition rate (Wong and Cheong 2010; 
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Hembram et al. 2007). Thus, employment of this technique can easily be scaled up 
from the small-sized substrates used in laboratory experiments to large-scale 
industrial applications. Sputter deposition of Zr0 2 thin films on Si and SiC can be 
achieved in three routes: (1) by direct sputtering when the substance to be sputtered 
is Zr0 2 , (2) by reactive sputtering when Zr metal is sputtered in the presence of an 
oxygen atmosphere with an inert gas (Ar gas is commonly employed), and (3) by 
nonreactive sputtering when Zr metal is sputtered in an inert gas ambient (Ar gas is 
commonly employed), followed by an oxidation process. Of these routes, the third 
route enables control of the stoichiometry of the deposited films (Wong and Cheong 
2010, 2011a, b, c, 2012b, c; Hembram et al. 2007; Wong and Cheong 201 Id). On 
the contrary, when metallic target atoms react with the active components of the gas 
(oxygen), the film stoichiometry is difficult to control and generally presents 
numerous pinholes and low density (Wong and Cheong 2010; Chaneliere 
et al. 1998). 


Nitridation Process 

Si- and SiC-based devices tend to suffer from severe degradation of the electron 
mobility due to high density of interface traps close to the conduction band edge at 
the high-K oxide/Si and high-K oxide/SiC interface. It has been found that 
nitridation can greatly reduce the interface traps and effectively passivate the 
oxygen vacancies in the high-K oxide (Dimitrijev et al. 2004; Chen et al. 2010). 
Therefore, nitridation process is advantageous and regarded as one of the effective 
processes to improve the interface properties of Si- and SiC-based MOS devices. 
Nitridation can be defined as the formation of nitride by the high-temperature 
exposure of a surface to nitrogen-contained environment (Zant 2004). In semicon- 
ductor industry, this process consists of an oxidation or a reoxidation in a mixture 
of O- and N-containing gases such as NO and N 2 0 and is also termed 
oxy nitridation. This process is analogous to the diffusion of oxygen in the 
classical Deal-Grove picture (Wong and Cheong 2010); nitrogen diffuses into a 
film, with or without interacting with the oxide network. Nitrogen can react at the 
near-interface region with unoxidized or partially oxidized Si atoms from Si and/or 
SiC substrates. 

Gupta et al. (1998) and Enta et al. (2006) reported on the formation of 
Si-oxynitride in N 2 0, and they proposed that at elevated temperature, N 2 0 can 
decompose into oxygen, nitrogen, and oxygen-nitrogen compounds, and the 
nitrogen-related compound acts as the nitridation source. As proposed by Gupta 
et al. (1998), the decomposition of N 2 0 is based on a 5-step reaction. The initial 
decomposition step of N 2 0 is stated in Eq. 5: 

N 2 0 -> N 2 + O (5) 

The atomic oxygen reacts further with N 2 0 based on the following steps in 
Eqs. 6 and 7: 
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N 2 0 + o -> 2NO (6) 

N 2 0 + O N 2 + 0 2 (7) 

In addition to these reactions, the other essential reactions are stated in Eqs. 8 
and 9: 


NO + O 


N0 2 


( 8 ) 


N0 2 T O — ■> NO -{- 0 2 


( 9 ) 


Miller and Grassian (1997) reported that Zr0 2 is an effective catalyst for the 
decomposition of N 2 0, whereby N 2 0 will decompose exclusively to N and O 
compounds. The same study also found that Zr cations were involved in the 
decomposition reaction of N 2 0. Zhu et al. (2005) reported that a complete catalytic 
decomposition of N 2 0 on Zr0 2 happens at a temperature of 700 °C, while Petrucci 
et al. (2002) reported that there is no free energy change -temperature dependence in 
the decomposition of N 2 0 into N and O compounds. Therefore, the decomposition 
is spontaneous at all reaction temperatures. 

In nitrogen-contained environment, Si atoms from Si and/or SiC substrates can 
react with nitrogen and form silicon nitride (Si-N) (Jamet and Dimitrijev 2001; 
Jamet et al. 2001; Cheong et al. 2003; Cheong et al. 2010). Meanwhile, as reported 
by Reddy et al. (2007), it is reported that the zirconium nitride (Zr-N) can be formed 
by reacting nitrogen with sub-stoichiometric Zr0 2 . Ngaruiya et al. (2004) reported 
that the nitrogen atoms can occupy oxygen sites within crystalline phase of Zr-O, 
resulting in an almost unchanged Zr0 2 structure with a sevenfold metal-nonmetal 
coordination and interatomic distances around 2.04-2.27 A. The reported 
interatomic distances have assured Zr tetravalency when nitrogen has been incor- 
porated into stoichiometric Zr-0 layer. Therefore, Zr-0 compound remains stoi- 
chiometric at the topmost surface. In this study, Si-N, Zr-N, and/or Zr-O-N were 
formed, in the bulk of the film and/or at the interface region sandwiched between 
the film and the substrate when Zr films were subjected to N 2 0 gases at higher than 

700 °C. 


Role of N 2 0 Gas Ambient on Oxidizing and Nitriding of Zr 
Film on SiC 


In general, the interfaces of high-K gate oxide/semiconductor are often regarded as 
nonideal due to the presence of interface defects. These defects may be originated 
from surface-structural defects, oxidation-induced defects, or radiation-induced 
defects (Wong and Cheong 2011b, 2012b; Schroder 2006). The interface trap has 
a significant influence on the reliability and lifetime of an MOS device because it is 
a main source causing leakage current besides the characteristics of the oxide itself. 
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It is therefore crucial to control the thickness and composition of interfacial layer in 
between the gate oxide and semiconductor, so that the interface trap density can be 
minimized to an acceptable level and good electric characteristics of an MOS 
device can be achieved (Wong and Cheong 2011b, 2012b). 

Numerous studies have been carried out to improve the MOS characteristics and 
the oxide -semiconductor interface quality. Researchers have reported on the influ- 
ence of pretreated Si substrate with N 2 0 and NH 3 gas in order to suppress formation 
of interfacial defect by controlling the growth of IL thickness (Chen et al. 2007; 
Choi et al. 2011). Moreover, post-deposition annealing of Zr0 2 film in nitrogen 
ambient was also studied in order to improve its electric properties (Lin et al. 2003). 
It was found that incorporation of nitrogen into the film may retard the growth of 
interfacial layer and improved electric properties of the film. This improvement 
may be attributed to two factors (Wong and Cheong 2011a). Firstly, the presence of 
nitrogen is postulated to passivate the oxygen vacancies by forming Zr-N, Si-N, 
and/or Si-O-N bonds, thus reducing amount of interfacial traps. Additionally, these 
particular bonds are believed to effectively suppress crystallization that may 
enhance thermal stability of the film. Secondly, dangling bonds of Si surface may 
be passivated by the nitrogen-rich species generated during nitridation process 
(de Almeida and Baumvol 2003). In short, incorporation of nitrogen in the film is 
a possible solution. 

In order to improve the SiC-based MOS characteristics and the oxide-SiC 
interface quality, it has been found that oxidation and/or post-oxidation annealing 
in a nitrogen-containing ambient has two beneficial effects, i.e., enhanced removal 
of carbon and passivation of silicon dangling bonds (Dimitrijev et al. 2004). The 
utilization of nitrous oxide (N 2 0) gas has contributed to the growth of thermally 
nitrided Si0 2 as gate oxide on SiC with low leakage current and low interface trap 
density (Jamet and Dimitrijev 2001; Cheong et al. 2010). At high temperature, N 2 0 
gas is decomposed exclusively into N and O compounds, which are indispensable in 
performing oxidation and nitridation processes on the gate oxide (Cheong 
et al. 2010; Wong and Cheong 2011a). In general, the beneficial effects of utilizing 
N 2 0 gas in growing thermally nitrided Si0 2 gate on SiC-based MOS structure have 
been highlighted. 

According to the accumulated knowledge (Dimitrijev et al. 2004; Jamet and 
Dimitrijev 2001; Jamet et al. 2001; Cheong et al. 2003, 2007, 2008b, 2010) from 
thermally nitrided and oxidized Si on SiC to form nitrided Si0 2 gate, nitrogen 
source may enhance removal of carbon that has been accumulated at the interface 
during thermal oxidation and may improve passivation of dangling bonds on 
surface of the semiconductor depending on the equilibrium rate of oxidation and 
nitridation that has been achieved by diluting the oxidation and nitrogen sources 
(Dimitrijev et al. 2004). The most effective gases in achieving these effects are 
nitrous oxide (N 2 0) and nitric oxide (NO) (Dimitrijev et al. 2004). The former gas 
is more preferable due to its nontoxic property (Cheong et al. 2003, 2010; Wong 
and Cheong 2011a; Enta et al. 2006). 
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A Case Study: General Manufacturing Processes in Laboratory Scale 
Substrate Cleaning Process 

First of all, Si and SiC wafers must undergo a cleaning process. This process 
consists of ultrasonic cleaning, RCA cleaning, and HF dipping. A step-by-step 
cleaning process is depicted as follows: 

Step 1 : Ultrasonic cleaning 

Objective: To remove small particles from the substrate surface. 

Procedures: 

(i) Substrates were placed in a Teflon holder and placed in a container 
containing deionized (DI) water. 

(ii) Then, the container was placed into ultrasonic bath for 5 min. 

(iii) After that, substrates were rinsed with DI water. 

Step 2: RCA-1 

Objective: To remove organic residues from the substrate. 

Procedures: 

(i) Ingredients for RCA-1 (NH 4 OH: H 2 02:H 2 0) were prepared with the ratio 
of 1:1:5. 

(ii) 100 ml of DI water was placed in a beaker and heated to 90 °C on a hot 
plate. 

(iii) 20 ml of NH 4 OH was added into heated DI water at 90 °C. 

(iv) 20 ml of H 2 0 2 was then added into the mixture, and the temperature of the 
solution was decreased to 80 °C. 

Step 3: HF dipping 

Objective: To remove native oxide from the substrate. 

Procedures: 

(i) HF was prepared in DI water with ratio 1:50 (HF:H 2 0). 

(ii) Substrates in Teflon holder were immersed in HF solution for 10 s and then 
cleaned with DI water. 

Step 4: RCA-2 

Objective: To remove the metal ions from the substrate. 

Procedures: 

(i) Ingredients for RCA-2 (HC1: H 2 0 2 :H 2 0) were prepared with the ratio of 
1 : 1 : 6 . 

(ii) 120 ml of DI water was placed in a beaker and heated to 90 °C on a hot 
plate. 
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(iii) 20 ml of HC1 was added into heated DI water at 90 °C. 

(iv) 20 ml of H 2 0 2 was then added into the mixture and the temperature of the 
solution was decreased to 80 °C. 

(v) It was observed that the solution started to bubble vigorously after about a 
minute. 

(vi) The substrates were then soaked in the solution for 15 min and the 
temperature was maintained at 80 °C on the hot plate. 

(vii) After 15 min, the substrates were cleaned with DI water and dried with 

air gun. 

Sputtering Process in Laboratory Scale 

All Zr thin films were sputter-deposited by a magnetron-assisted RF sputtering 
system (Edwards Auto 500) in a high vacuum chamber with working pressure of 
1.6 x 10 -7 mbar and RF power of 170 W. 

Prior to sputtering process, sputtering machine parts were cleaned by using 
Decon 90 and acetone. After cleaning, the parts were fixed into the machine. Zr 
target was fixed at the target holder, while Si and SiC substrates were attached to the 
rotating substrate holder for deposition. At atmospheric pressure, after the target 
and substrates were loaded into the chamber and locked, the chamber was pumped 

n 

to high vacuum (low pressure), i.e., 1.6 x 10 mbar within 30 min. When the 
desired pressure (working pressure) was reached, an inert Ar gas plasma was struck 
using an RF power source (170 W), causing the gas to become ionized. At this time, 
pre- sputtering was done for about 2 min to remove the native oxide on the Zr target 
surface. The deposition rate was controlled at 2 A s” . After pre -sputtering, the 
substrate holder was rotated so that a uniform film can be formed on the substrate. 
Shutter was then opened and allowed the ions to accelerate toward the surface of the 
target, causing atoms of the source material to be ejected from the target in vapor 
form and condensed on Si and SiC substrate surfaces. A 5-nm-thick Zr thin film was 
formed. 

Simultaneous Oxidation and Nitridation Process 

Simultaneous oxidation and nitridation process was carried out in N 2 0 ambient in 
the quartz tube placed in the horizontal tube furnace. Beforehand, quartz tube, quart 
boat, gas inlet, and gas outlet must be cleaned with HF solution (1 HF:9 H 2 0) in 
order to remove the undesirable impurities and particles to avoid samples from 
contamination. After the apparatuses had been washed, quartz tube was first 
inserted into a horizontal tube furnace. Samples were then arranged on a quartz 
boat and placed into the quartz tube, as shown in Fig. 2. 

The following describes the design of experiments (DOE): 

(i) Effects of oxidation/nitridation durations on sputtered Zr based on Si substrate 
Samples were heated up from room temperature to 700 °C in an Ar flow 
ambient and the heating rate was set at 10 °C min -1 . Once the set temperature 
was achieved, N 2 0 gas was purged in with a flow rate of 150 ml min -1 for a 
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Horizontal tube furnace 


Fig. 2 Setup for oxidation/nitridation process 


set of durations (5, 10, 15, and 20 min). The samples were withdrawn from the 
furnace after the furnace was cooled down to room temperature in an Ar 
ambient. 

(ii) Effects of oxidation/nitridation temperatures on sputtered Zr based on Si 
substrate 

Samples were heated up from room temperature to a set of temperatures 
(500 °C, 700 °C, 900 °C, and 1,100 °C) in an Ar flow ambient, and the heating 
rate was constant at 10 °C min -1 . Once the set temperature was reached, N 2 0 
gas was then purged in with a flow rate of 150 ml min -1 for 15 min. Samples 
were eventually taken out at room temperature after the furnace was cooled 
down to room temperature in an Ar ambient. 

(iii) Effects of oxidation/nitridation temperatures on sputtered Zr based on SiC 
substrate 

Samples were heated up from room temperature to a set of temperatures 
(400 °C, 500 °C, 700 °C, and 900 °C) in an Ar flow ambient, and the heating 
rate was set constant at 10 °C min -1 . Once the set temperature was achieved, 
N 2 0 gas was then introduced with a flow rate of 150 ml min -1 for 15 min. 
Samples were then taken out at room temperature after the furnace was cooled 
down to room temperature in an Ar ambient. 

(iv) Oxidation/nitridation of sputtered Zr based on SiC substrate in diluted N 2 0 
Samples were then inserted into a horizontal tube furnace at atmospheric 
pressure and heated up from room temperature to 500 °C in an Ar flow ambient, 
and the heating rate was set constant at 10 °C min -1 . Once the set temperature 
was achieved, a set of different concentrations of N 2 0 gas was introduced, i.e., 
10 %, 30 %, 70 %, and 100 % N 2 0 mixed with 90 %, 70 %, 30 %, and 0 % of 
high-purity N 2 gas, for duration of 15 min, with a flow rate of 150 ml min -1 . 
Once completed, the furnace was cooled down to room temperature in an Ar 
ambient, and the samples were withdrawn from the furnace at room 
temperature. 

Metallization and Photolithography Processes 

For the purpose of electric characterization, the oxidized/nitrided Zr films were 

fabricated into MOS capacitors. A1 layer of ~100 nm thick was first thermally 

evaporated on top of the him as gate electrode by using a turbo thermal evaporator 
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K950X. W wire was made into a coil and fixed at the particular holder. After that, 
samples were located on the sample holder and ready for thermal evaporation. The 

/r 

chamber was locked and pumped to high vacuum condition at ~6 x 10 mbar. The 
vacuum is required to allow the atoms (Al) to evaporate freely in the chamber and 
subsequently condense on sample surfaces. Once the particular condition has been 
reached, current was regulated at about 20 ampere (A). This is done in order to heat 
the tungsten coil. Source material (Al) was concurrently heated as well. When the 
source material was heated to its boiling point, it started to evaporate and eventually 
condensed on the sample surfaces. 

Photolithography was conducted after Al was successfully evaporated (depos- 
ited) on top of the oxidized/nitrided Zr films. A layer of primer solution was coated 
on the Al surface at 2,500 rotation per min (rpm) by using a spinner, followed by 
coating of a layer of photoresist solution on the Al surface by using the same 
spinner and same rotation speed. After that, a soft-bake in an oven at 100 °C for 
15 min was performed before the ultraviolet (UV) exposure. A patterned mask 

o o 

with 11 by 11 squared cells, defined at 2.5 x 10 cnr each cell, was used and 
aligned on top of Al surface. It was then exposed under UV light for 30 s. After the 
exposure, the sample was immersed in developer solution until all the defined 
windows (cells) were clearly seen, followed by a thorough DI water rinse. 
Sample was then blow-dried with an air gun and brought to hard-bake in an 
oven at 120 °C for 20 min. The Al gate electrode was etched with 
aluminum etchant. The aluminum etchant was prepared by mixing phosphoric 
acid (H3PO4), acetic acid (CH 3 COOH), and nitric acid (HN 0 3 ) together, with 
ratio of 20:4:1. The etching was done within 10 s as the etching rate was high. 
The photoresist was eventually stripped by photoresist remover. The capacitor area 
was 2.5 x 1CT 3 cm 2 . 

For back contact fabrication, a ~100-nm-thick Al layer was thermally 
evaporated onto the backside of the Si substrate. The formation of back contact 
was similar to the process adopted for front contact as discussed in the earlier 
paragraph. In order to obtain a better back contact (Ohmic contact), the native oxide 
on the Si backside surface was removed by diluted HF solution (1 HF:50 H 2 0). 
Figure 3 shows an overview process of metallization (front and back contact) and 
photolithography applied in this work. 


Physical Characteristics of Oxidized/Nitrided Zr Film on SiC 

Compositional and depth profile analysis is a formidable challenge to retrieve 
compositional information in ultrathin high-K dielectric films, of homogeneous 
layer, of multilayers, or of layers with a gradient in their composition. None of 
the characterization techniques covers the whole range of sensitivity or depth 
resolution needed. In order to determine the composition and/or composition 
variation in an ultrathin film up to a few nanometers, only a few characterization 
techniques have the potential to achieve the needed depth resolution, i.e., time-of- 
flight secondary ion mass spectrometry (TOF-SIMS), elastic recoil detection 
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Fig. 3 Overview process of 
metallization (front and back 
contact) and photolithography 
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analysis (ERDA), medium-energy ion scattering (MEIS), and X-ray photoelectron 
spectroscopy (XPS) (Conard et al. 2007). 

Of these techniques, XPS was used to determine the composition and depth 
profile of thin oxidized/nitrided sputtered Zr film of approximately 20 nm based on 
SiC substrate, as reported by Wong and Cheong (201 Id). Figure 4 shows an atomic 
composition depth profile obtained by XPS. It depicts that Zr-oxynitride was 
formed with nitrogen incorporated into the Zr-0 network (Zr0 2 ) of the film. Due 
to the vast growth kinetics of thin oxide of Zr on SiC, it is of importance to 
determine the amount of nitrogen and its distribution related to its interface, as 
shown in Fig. 4e. From this figure, it is demonstrated that the nitrogen in the sample 
oxidized/nitrided at 500 °C is distributed more uniformly throughout the entire film, 
including interfacial layer, as compared to other samples. 

Figures 5, 6, and 7 elucidate the Zr 3d , Si 2 p, and N Is spectra, respectively, as a 
function of etching time for different oxidation/nitridation temperatures of 400-900 
°C, obtained from XPS analysis. From the analysis, it was identified that 
Zr-oxynitride film of Zr-O, Zr-N, and/or Zr-O-N and its interfacial layer comprised 
of mixed Zr-O, Zr-N, Zr-O-N, Zr-Si-O, Si-N, and/or C-N compounds were formed 
on SiC substrate. 

To attain high-quality high-K dielectric films with very low EOT and high 
dielectric constant, one of the critical issues being focused is by controlling the 
interfacial layer. Ideally, no interfacial layer should be present, as this will affect the 
available capacitance budget for the high-K dielectric film. However, as reported by 
some researchers (Conard et al. 2007; Choi et al. 2005), the presence of interfacial 
layer can provide positive effects on the growth of high-K dielectric film and on the 
enhanced mobility obtained with the stack. Therefore, thickness and composition of 
interfacial layer are regarded as a crucial part of high-K dielectric film analysis. 
Transmission electron microscopy (TEM) was utilized to observe the cross section 
of the oxidized/nitrided Zr films, as displayed in Fig. 8. Patches of lattice fringes 
with interplanar spacing, d, of 0.258-0.305 nm, indicate that polycrystalline struc- 
ture of bulk oxide was formed and the range of d values matched with Zr0 2 (Wong 
and Cheong 2011a). By comparing the results obtained from XPS and TEM, it is 
inferred that Zr-N and Zr-O-N may be in amorphous structure which embedded in 
the polycrystalline Zr0 2 . Besides, it is also inferred that interfacial layer of mixture 
of sub- or nonstoichiometric Zr-O, Zr-N, Zr-O-N, Zr-Si-O, Si-N, and C-N, which 
are in amorphous structure. 

In monitoring any deposition or growth process, either in a production line or for 
process development, thickness control is a key parameter (Conard et al. 2007). 
This is practically true for gate dielectric thickness monitoring, as it is a fundamen- 
tal parameter to control to attain high device performances. TEM provides a direct 
measure of the physical thickness. The physical thickness of the bulk Zr-oxynitride 
decreases as the oxidation/nitridation temperature increases (Fig. 9). Conversely, 
interfacial layer thickness that formed in between the bulk Zr-oxynitride and SiC 
substrate increases with the increasing oxidation/nitridation temperature. In gen- 
eral, the total thickness of the combined oxide (bulk and interfacial layer) reduces 
with the increase of oxidation/nitridation temperature. Qualitatively, from the 
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Fig. 4 Atomic percentage of 
Zr, Si, O, N, and C as a 
function of etching time for 
sample oxidized/nitrided 
samples at (a) 400 °C, (b) 

500 °C, (c) 700 °C, (d) 

900 °C, and (e) nitrogen depth 
profile of all investigated 
samples 
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Fig. 5 Evolution of Zr 3 d 
core level XPS spectra as a 
function of etching time for 
different oxidation/nitridation 
temperature samples: (a) 

400 °C, (b) 500 °C, (c) 

700 °C, and 900 °C 
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Fig. 6 Evolution of Si 2 p 
core level XPS spectra as a 
function of etching time for 
different oxidation/nitridation 
temperature samples: (a) 

400 °C, (b) 500 °C, (c) 

700 °C, and 900 °C 
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Fig. 7 Evolution of N Is 
core level XPS spectra as a 
function of etching time for 
different oxidation/nitridation 
temperature samples: (a) 

400 °C, (b) 500 °C, (c) 

700 °C, and 900 °C 
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Fig. 8 Cross-sectional EFTEM images of the investigated samples at different oxidation/ 
nitridation temperatures: (a) 400 °C, (b) 500 °C, (c) 700 °C, and (d) 900 °C. Lower magnification 
of each of the investigated samples is shown on the right side respectively 








83 Surface Modification of Semiconductor by Simultaneous Thermal. . . 


3019 


Fig. 9 Average thicknesses 
of Zr- oxynitride, IL, and total 
(Zr-oxynitride + IL) for the 
investigated samples. Error 
bars define the maximum and 
minimum thicknesses 
obtained 



Oxidation/Nitridation Temperature, T (°C) 


Fig. 10 Arrhenius plots of 
Zr-oxynitride, IL, and total 
(Zr-oxynitride + IL) growth 
in the N 2 0 ambient 



images with lower magnification (Fig. 8), the oxidized/nitrided Zr film at 500 °C 
has the smoothest surface. Quantitatively, it has the smoothest surface with RMS 
roughness value of 2.45 nm with wavy topography, as reported by Wong and 
Cheong (2012b). 

It is known that Zr-oxynitride thin film, along with its interfacial layer, can be 
formed on SiC substrate at processing temperatures of 400-900 °C for a duration of 
15 min. However, the activation energies for the formation of Zr-oxynitride thin 
film and its interfacial layer still remain a question. In order to answer this question, 
Wong and Cheong (2012b) extrapolated Arrhenius plots of Zr-oxynitride, interfa- 
cial layer, and total (Zr-oxynitride + interfacial layer) growth in the N 2 0 ambient, 
as shown in Fig. 10. The calculated E a for the Zr-oxynitride, interfacial layer, and 
total growth are -0.0335, 0.1543, and -0.0078 eV, respectively. The negative signs 
of E a indicate that the rate of reaction decreases as the oxidation/nitridation 
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Voltage, V (V) 

Fig. 1 1 C-V characteristics of MOS capacitors with Zr-oxynitride oxidized/nitrided at various 
temperatures (400-900 °C). The inset shows effective dielectric constant of the investigated 
samples deduced from the C-V measurement as a function of oxidation/nitridation temperatures 

(400-900 °C) 


temperature increases. In a nutshell, the densification happens with increasing 
temperature. The positive sign of E a indicates an increment of reaction rate as the 
temperature increases. It may indicate an expansion of the investigated layer 
happens as the temperature increases. On the other hand, the tendency of a reaction 
or growth to happen can be determined by the magnitude: a small magnitude 
indicates a fast growth, while a large magnitude indicates a slow growth. 

Complementarily, the aforementioned physical characteristics of oxidized/ 
nitrided Zr film on SiC substrate can be further supported by XRD and Raman 
analyses, as reported by Wong and Cheong (2012b). From both XRD and Raman 
analyses, they concluded that Zr0 2 was in tetragonal phase, while the remaining 
undetected compounds are in amorphous structure. 


Electric Characteristics of Oxidized/Nitrided Zr Film on SiC 

After discussing the physical effects of simultaneous oxidation and nitridation 
technique on sputtered Zr film on SiC, it is now to explore the practicality and 
functionality of the film as high-K dielectric in MOS devices. In order to examine 
the main MOS characteristics, the capacitance-voltage ( C-V) and leakage current 
density-electric field (. J-E ) measurements are the most commonly applied tech- 
niques. According to the report by Wong and Cheong (Wong and Cheong 2012b), 
from a correctly obtained high-frequency C-V curve, as demonstrated in Fig. 11, 
several parameters such as dielectric constant, flat-band voltage, gate accumulation, 
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Electric Field, E (MV/cm) 


Fig. 12 J-E characteristics of investigated samples oxidized/nitrided at various temperatures 

(400-900 °C) 


gate depletion, effective oxide charge, slow trap density, and interface trap density 
can be extracted. On the other hand, the J-E characteristics of the oxidized/nitrided 
Zr films on SiC were investigated, as shown in Fig. 12. It verifies that 500 °C 
oxidized/nitrided Zr film on SiC yielded the best result, owing to the highest electric 

1 z o 

breakdown field of 5.05 MV cm at 10 A cm . This is ascribed to the reduction 
in interface trap density, total interface trap density, effective oxide charge, and 
increment of barrier height between conduction band edge of the film and semi- 
conductor. The oxidized/nitrided Zr film on SiC yielded its capacitance of 2,147 pF, 

1 O O 1 o o 

effective oxide charge of -5 x 10 cm , slow trap density of 6 x 10 cm , total 

1 o ^ 

interface trap density of 3.7 x 10 cm - , and dielectric constant of 49.68. 


Growth Mechanism of Simultaneous Thermal Oxidation 
and Nitridation of Zr Film on SiC 

Based on the report by Wong and Cheong (201 Id), a possible model of oxidation 
and nitridation mechanisms of sputtered Zr on SiC substrate using N 2 0 gas at 
temperatures of 400-900 °C can be deduced and is summarized as follows. Due to 
the fast initial oxidation of Zr at room temperature, a very thin monolayer (~1 nm) 
of Zr0 2 is formed on the topmost surface. Since Zr cations in Zr0 2 is an 
effective catalyst for the decomposition of N 2 0 above 350 °C, hence, N 2 0 may 
decompose exclusively to N and O compounds. In the decomposition of N 2 0 into N 
and O compounds, there is no free energy change as a function of temperature. 
Therefore, the decomposition is a spontaneous reaction at those investigated 
temperatures. 
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At 400 °C, Zr cations as catalytic ions help to decompose N 2 0 into N and O 
compounds. As oxygen atoms are easily being sensed and absorbed by metallic Zr, 
the absorbed O atoms may react with Zr to form Zr-0 compound. This process may 
proceed further inward, depending on the availability of oxygen produced at this 
temperature. A stoichiometric Zr-0 (Zr0 2 ) is formed when sufficiently high con- 
centration of oxygen is supplied. As the thickness of Zr0 2 increases, less oxygen 
may diffuse in and cause an incomplete reaction between Zr and O. Thus, a 
sub- stoichiometric Zr-0 compound is formed toward the interfacial layer between 
the stoichiometric Zr-0 (Zr0 2 ) compound and the SiC substrate. 

Simultaneously, nitridation process happens. Since N atoms are smaller than O 
atoms, N atoms may diffuse faster than O atoms at the same processing temperature 
and duration. Since Zr-0 involved is sub- or nonstoichiometric, the formation of 
Zr-N is more favorable. N atoms may diffuse further inside and react with 
sub- stoichiometric Zr-0 to form Zr-N compound by releasing oxygen or react 
with unreacted Zr to form Zr-N compound. The Zr-N compound may be embedded 
in the Zr-0 layer toward the interfacial layer. Hence, Zr-oxynitride layer consists of 
Zr-0 and Zr-N, with Zr-O-rich compound located at the topmost surface and Zr-N- 
rich compound located at the near-interface region. The released O atoms from the 
formation of Zr-N compound and an inward diffusion of O atoms from the ambient 
may react with the unreacted Zr and bridge to the topmost Si layer (dangling Si 
bonds) of the bulk-terminated SiC surface, thus forming Zr-Si-0 compound. Alter- 
natively, O atoms in the sub -stoichiometric Zr-0 may react and bridge to the Si 
dangling bonds of the bulk-terminated SiC surface, forming Zr-Si-0 compound. In 
other words, the O atom in Zr-Si-0 compound helps to passivate the dangling Si 
bonds and functions as a connector to connect Zr-oxynitride network with SiC 
substrate. 

The excessive N atoms which diffuse further inside toward the interfacial layer 
also help to passivate the dangling Si bonds by forming Si-N compound. In 
addition, minute amount of Si atoms from the SiC substrate may diffuse out to 
form Si-N compound as well. 

Due to the catalytic effect offered by Zr and/or Zr0 2 , SiC may be slightly 
decomposed at low temperature of 400 °C. As a result from the minute 
out-diffusion of Si atoms from SiC substrate, there are minute C atoms that diffuse 
out as well. These out-diffusing C atoms will react with N atoms in the interfacial 
layer, forming C-N bonds. The formation of C-N bonds essentially removes the 
carbon and the formed carbon clusters. 

As the oxidation/nitridation temperature is elevated at 500-900 °C, stoichiomet- 
ric Zr-0 (Zr0 2 ) can also be formed when sufficiently high concentration of oxygen 
is supplied. Moreover, more O atoms are produced and able to diffuse further 
inside; hence, more Zr-0 bonds (sub- stoichiometric Zr-O) are formed toward the 
interfacial layer. Concurrently, more N atoms are able to diffuse in and react with 
sub- stoichiometric Zr-0 to form Zr-N compound by releasing oxygen or react with 
unreacted Zr to form Zr-N compound toward the interfacial layer. Excessive N 
atoms may also diffuse further inside toward the interfacial layer to passivate the 
dangling Si bonds by forming Si-N compound. Minute out-diffusing Si and C atoms 
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from the SiC substrate may react with inward-diffusing N atoms to form Si-N bonds 
and C-N bonds, respectively. On the other hand, bridging O atoms in Zr-Si-0 
compound are formed at elevated temperatures of 700-900 °C. 

At a specific temperature of 500 °C, it is postulated that a mixture of 
sub- stoichiometric Zr-0 and Zr-N compounds is in the midst of exchanging and 
balancing the O and N atoms among themselves in order to achieve thermodynamic 
equilibrium condition. As a result, Zr-O-N compound is formed. During the process 
of exchanging O and N atoms among Zr-0 and Zr-N compounds, there are also N 
atoms from the ambient that diffuse in and incorporate substitutionally and/or 
interstitially into stoichiometric Zr-0 (Zr0 2 ) region, forming Zr-O-N compound. 
Hence, Zr-oxynitride layer formed at this temperature consists of Zr-0 and Zr-O-N, 
with Zr-O-rich compound located at the topmost surface and Zr-O-N-rich com- 
pound located at the near-interface region. 


Comparison of Oxidized/Nitrided Zr Thin Films on Si and SiC 
Substrates 

After the experiments of simultaneous thermal oxidation and nitridation of 
sputtered Zr on Si and SiC substrates in 100 % N 2 0 were conducted, the optimized 
parameters in terms of oxidation/nitridation temperatures and duration were 
obtained. The optimized parameters are tabulated in Table 2. For Si-based samples, 
the oxidation/nitridation temperature was optimized at 700 °C, whereas for 
SiC-based samples, the temperature was optimized at 500 °C, with their durations 
of 15 min for both samples. 


Structural and Chemical Properties 

Structural and chemical properties of oxidized/nitrided Zr thin films on Si and SiC 
substrates in 100 % N 2 0 are compared in Table 3. It shows that Zr0 2 was formed 
with IL of Zr-silicate oxynitride (ZrSiON) on Si-based substrate, while 
Zr-oxynitride (ZrON) was formed with an IL comprised of ZrSiON and carbon 
nitride (CN) on SiC-based substrate. In both Si-based and SiC-based samples, IL of 
ZrSiON was produced with different combinations of mixed compounds. Distribu- 
tion of nitrogen in the SiC-based sample is more uniform, with higher maximum 
atomic percent, as compared to Si-based sample. However, the conduction band 
offsets (AE C ) of SiC-based sample is lower than Si-based sample, owing to the 


Table 2 Optimized parameters of simultaneous oxidation and nitridation of sputtered Zr on 
different semiconductor substrates of Si and SiC in 100 % N 2 0 


Base substrate 

Optimized temperature (°C) 

Optimized duration (min) 

Si 

700 

15 

SiC 

500 

15 
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Table 3 Comparison of structural and chemical properties of oxidized/nitrided sputtered Zr based 
on Si and SiC substrates 


Characterization 

tools 

Properties 

Unit 

Si based 

SiC based 

XPS 

Chemical 

composition of bulk 

— 

Zr0 2 (Zr-O) 

Zr-oxynitride (Zr-O 
and Zr-O-N) 

Chemical 
composition of IL 


Zr-O, Zr-N, 
Zr-Si-O, and 
Si-N 

Zr-O, Zr-N, Zr-O-N, 
Si-N, and C-N 

Distribution profile of 
nitrogen 

— 

Relatively less 
uniform 

Relatively more 
uniform 

Maximum atomic 
percent of nitrogen 

at% 

2.20 

2.71 

E a of bulk extracted 
from XPS 

eV 

6.40 

5.00 

E g of IL extracted 
from XPS 

eV 

8.80 

8.50 

AE C extracted from 

XPS 

eV 

3.40 

2.00 

EFTEM 

Thickness of bulk 

nm 

6.00 

16.90 

Thickness of IL 

nm 

4.00 

1.86 

Total thickness 
(Bulk + IL) 

nm 

10.00 

18.76 

AFM 

RMS value 

nm 

0.50 

2.45 

XRD 

t-Zr0 2 planes 

— 

(101) and (002) 

(Oil) and (002) 


combination of relatively narrower bandgaps of Zr-oxynitride and IL. This could 
lead to a reduced breakdown field of the sample. Through EFTEM analysis (Fig. 13 
and Table 3), a thicker Zr-oxynitride, with a thinner IL, was produced on SiC-based 
substrate as compared to Si-based substrate. This could help in enhancing the oxide 
capacitance (C ox ) and k value of the film on SiC-based sample. There are two 
different two-dimensional (2D) surface topographies of Zr0 2 and Zr-oxynitride on 
Si and SiC substrates, respectively, as shown in Fig. 14. The produced 
Zr-oxynitride is in wavy topography on SiC substrate, while Zr0 2 is in non-wavy 
topography. With that, it was found that the Zr0 2 thin film on Si substrate has 
smoother surface, with lower RMS value, as compared to Zr-oxynitride thin film on 
SiC substrate. The crystalline planes of t-Zr0 2 on Si-based sample are (101) and 
(002), whereas on SiC-based sample are (Oil) and (002). 


Electric Properties 

Table 4 compares the electric properties of oxidized/nitrided Zr thin films on Si and 
SiC in 100 % N 2 0, in terms of C-V and leakage current density-electric field ( J-E ) 
characteristics. It shows that SiC-based sample has higher C ox than Si-based 
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Fig. 13 Cross-sectional EFTEM images of (a) Zr0 2 on Si substrate and (b) Zr-oxynitride on SiC 
substrate 



Fig. 14 Two-dimensional AFM surface topography of (a) Zr0 2 on Si substrate and (b) 
Zr-oxynitride on SiC substrate 


sample. This is attributed to the higher k value and/or reduced IL thickness. As 
possessed in the table, the k value of the film in SiC-based sample is double the k 
value of the film in Si-based sample, while the IL thickness in the SiC-based sample 
is half the IL thickness in the Si-based sample. SiC-based sample showed an 
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Table 4 Comparison of electric properties of oxidized/nitrided sputtered Zr based on Si and SiC 
substrates 


Measurements 

Properties 

Unit 

Si based 

SiC based 

C-V 

Cox 

P F 

1,731 

2,147 

Qeff 

cm -2 

+4.50 x 10 11 

-5.00 x 10 13 

STD 

cm -2 

4.62 x 10 11 

+6.00 x 10 12 

D it at (E c -E) — 0.15-0.25 
eV 

eV' 1 

cm -2 

Mid 

10 12 -10 13 

10 14 

D total 

cm -2 

7.00 x 10 12 

3.70 x 10 13 

K 

— 

21.82 

49.68 

J-E 

Ehdb 

MV cm -1 

13.60 

5.05 

J 

A cm -2 

10 -6 

10 -6 

(pB 

eV 

1.33 

1.67 


enormous C-V curve shift to the positive voltage. This has been affirmed by the 
presence of enormous amount of negative effective oxide charge (Q e ff) in the oxide. 
As for Si-based sample, the C-V is shifted slightly to the negative voltage; thus, a 
lower positive <2^ was recorded. A relatively larger hysteresis, with higher STD of 
one order of magnitude, was identified as compared in SiC-based sample to 
Si-based sample. Interface trap density ( D it ) and total interface trap density (A otal ) 
in SiC-based sample are higher than in Si-based sample. According to J-E charac- 
teristics, SiC-based sample has lower breakdown field ( E HDB ) than Si-based sam- 
ple. This could be due to higher Q e ff, STD, D it , Z) total , and lower AE C (Table 3) in the 
SiC-based sample. The barrier height between the oxide and the semiconductor 
(< j ) B ) obtained in SiC-based sample is higher than in Si-based sample. The differ- 
ence in A E c and <fi B may be ascribed to the following reasons and assumptions 
(de Almeida and Baumvol 2003). First, by using Fowler-Nordheim (FN) plot, the 
calculated <fi B takes into account the barrier height lowering and quantization of 
electrons at the semiconductor surface, and it is not strictly constant. Second, it 
could be attributed to the electron and oxide effective masses. 


Summary 

Evolution and justification of substituting nitrided-Si0 2 to high-K oxide on SiC 
have been reviewed, with a detailed discussion of high-K gate dielectric character- 
istics and the current knowledge of simultaneously oxidized and nitrided Zr film as 
high-K dielectric on SiC. Via this technique, the role of N 2 0 gas ambient on 
oxidizing and nitriding Zr film on SiC, coupling with physical and electric charac- 
teristics of oxidized/nitrided Zr film on SiC, has been discussed. Finally, a growth 
mechanism of simultaneous thermal oxidation and nitridation of Zr film on SiC is 
presented. Physical and electric properties of oxidized/nitrided Zr thin film on 
4H-SiC substrate in N 2 0 ambient for different oxidation/nitridation temperatures 
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(400-900 °C) were presented. Bulk Zr-oxynitride was formed, which consists of a 
tetragonal phase of poly crystalline Zr0 2 with embedment of amorphous Zr-N 
and/or Zr-O-N. The IL may be comprised of sub- or nonstoichiometric Zr-O, 
Zr-N, Zr-O-N, Zr-Si-O, Si-N, and C-N, which are in amorphous structure. These 
properties were subsequently related to the electric characteristics of the films 
evaluated by MOS structures. From the electric characteristics, it was verified 
that 500 °C oxidized/nitrided sample gave the best result, owing to the highest 

1 z o 

electric breakdown field of 5.05 MV cm at 10 A cm . This is attributed to the 
reduction in interface trap density, total interface trap density, effective oxide 
charge, and increment of barrier height between conduction band edge of the film 
and the semiconductor. 
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Abstract 

Magnesium and its alloys have been found to have a variety of industrial 
applications owning to their high strength-to- weight ratio. The strength of 
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magnesium alloys is comparable with that of aluminum alloys or steels; how- 
ever, their corrosion resistance when exposed to severe conditions is relatively 
weak. Surface treatments are applied to magnesium and magnesium alloy 
articles to enhance their corrosion resistance and appearance. This chapter 
covers two classes of treatment for magnesium alloy surfaces for moderate to 
severely corrosive environments: (1) chemical treatments and (2) anodic treat- 
ments, followed by sealing and organic coatings. Necessary cleaning proce- 
dures, including mechanical cleaning, chemical cleaning, pickling, and fluoride 
anodizing, are described. 


Introduction 

Magnesium and its alloys have found a variety of industrial applications. They are 
emerging to be one of the most basic types of choice for light structural and 

Q 

constructional materials. With a density of 1.7 g/ cm , it is easy to see why 
magnesium is attractive in automotive and aerospace industries. Unfortunately, 
bare magnesium alloys corrode rapidly when subjected to industrial atmosphere 
exposure and other severe corrosion environment, unless the alloy surface is 
properly treated and protected. Tables 1 and 2 indicate that magnesium has the 
lowest density among the light metal materials but the most active electrochemical 
property as compared with other typical structural alloys. Therefore, the competi- 
tiveness of magnesium alloys in industrial applications much relies on the surface 
treatment technologies to improve the resistance against oxidation or corrosion. 
Now the application of magnesium alloys is expanding, and they have been ranked 
the third most commonly used structural metals, following steel and aluminum 
alloys. This should be attributed to the development of effective and reliable surface 
treatment technologies, as well as a comprehensive understanding on the protection 
scheme of selecting appropriate surface treatment methods (Gray and Luan 2002; 
Chen et al. 2011a; Avedesian and Baker 1999; Kainer and Kaiser 2003; Friedrich 
and Mordike 2006; Hawkins 1993; Hillis 2005). 

An optimal protection scheme provides magnesium alloys with reasonable and 
adequate resistance against corrosion and avoids unnecessary cost. Standard prac- 
tices for preparation of magnesium alloy surfaces for moderate to severe environ- 
ment cover two classes of treatment, as shown below (ASTM D1732 2007). 


Table 1 Comparison between magnesium and other light metal materials 


Light 

metal 

Density 

(g/cm 3 ) 

Corrosion 

reaction 

Standard reduction potential 
(V vs. SHE) 

Equivalent weight 
(g/equ) 

Mg 

1.7 

Mg 2+ + 2e~ <-> 
MgCO 

-2.372 

12 

Al 

2.7 

Al 3+ + 3e~ <-> 
Al(s) 

-1.662 

9 

Ti 

4.5 

Ti 2+ + 2e~ «-> 
Ti(j) 

-1.63 

24 
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Table 2 Electrochemical 
series for structural alloys 


Reduction potential series for structural alloys 

Magnesium alloys 

Active 

Aluminum alloys 

> 


Carbon steel 



Cast iron 



Copper 



Titanium alloys 



Nickel alloys 

> 

f 

Stainless steels 

Passive 


Recommended standard for moderate environment 

Cleaning - chemical conversion coating - primer - top paint 

Recommended standard for severe environment 

Cleaning - anodizing - sealing - primer - top paint 

The chart in Fig. 1 demonstrates a protection scheme of surface treatment 
sequences for magnesium alloys according to different classes of surface treatment. 
Generally, chemical conversion coating is adequate for moderate protection 
scheme, while anodizing and sealing are recommended for severely corrosive 
environment. Details of each step will be discussed in the next sections. 


Cleaning 

Although certain anodizing processes simultaneously remove contamination from 
and produce anodic films on magnesium alloy surface, generally the chemical and 
anodic treatments require the article surface to be free from all contaminations and 
impurities. Appropriate cleaning methods can effectively remove impurities on the 
surface, such as oxide rust, grease, rolling scale, silicate sands, as well as other 
metal debris or intermetallic compounds. Failure of removing these contamina- 
tions, particularly heavy metal impurities, will lead to galvanic corrosion and 
coating delamination, hence the reduced protective value. Therefore, cleaning 
process is crucial to obtain good corrosion protection. For magnesium alloys, the 
commonly used cleaning processes include mechanical pretreatment, solvent 
cleaning or alkaline cleaning, and acid pickling or an etching step (Avedesian and 
Baker 1999; Kainer and Kaiser 2003; Friedrich and Mordike 2006). 


Mechanical Cleaning 

Surface cleaning by mechanical methods has been around for almost all types of 
metal. Mechanical cleaning is usually taken as a preliminary step on raw magnesium 
alloys. According to the surface quality and geometry of the magnesium alloy 
articles, appropriate mechanical treatments include blasting, grinding, and polishing. 
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Fig. 1 Step-by-step protection scheme of magnesium alloys for moderate to severe environment 

These mechanical cleaning methods can effectively remove majority of contaminants 
on article surface; nevertheless, some metal loss is involved during these processes, 
which could affect the surface quality and geometry of precise die casting parts. 

It has to be noted that mechanical cleaning in dry conditions should be avoided 
whenever possible. Magnesium debris is flammable, for example, the mixture of 
magnesium dust with air at a certain ratio is highly explosive; burning of larger 
debris particles can be causes of fire and other accidents (Kainer and Kaiser 2003; 
Herbert 2011; Dickman 2011; ASM International 1996; Kostilnik 2005). 
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Sandblasting 

Sandblasting is operated under wet condition, known as air blast and wet blast 
(Kainer and Kaiser 2003; Hillis 2005; Herbert 2011; Kostilnik 2005). Sandblasting 
under dry condition is reported, but not recommended. Purified sand (Si0 2 ) is 
typical for blasting magnesium alloy surface; other sands such as glass beads and 
corundum (A1 2 0 3 ) are also used. It should be noted that the blasting sands must be 
free of any metal contamination, particularly iron, copper, and other heavy metals, 
as these impurities can cause galvanic corrosion on the magnesium surface and 
reduce the resistance against corrosion dramatically. Therefore, the use of metal grit 
is not recommended. On the other hand, magnesium alloy materials are relatively 
soft than Si0 2 or A1 2 0 3 ; the blasting pressure needs to be adjusted to avoid sand 
embedment into the material surface. 

Grinding 

Grinding is commonly used on magnesium alloys for the removal of flash and 
surface imperfections from die castings or scratches from surfaces of extrusions 
(Kainer and Kaiser 2003; Hillis 2005; ASM International 1996; Kostilnik 2005). 
Typical speed of grinding on belts or disks is in the range from 900 to 1,800 m/min. 
Silicon carbide and corundum at 60-400 mesh are recommended for use as grinding 
agents. Grinding magnesium alloys at dry conditions should be avoided, because 
the mixture of magnesium debris with air at a certain ratio is highly explosive. 
Magnesium debris or dust must not be allowed to accumulate, because any 
accidental sparks produced from contacting between the grinding disk and metal 
parts or other components of the grinding machine can lead to explosion. For 
this reason, wetting and dust suction device are required for grinding magnesium 
alloys. 

Polishing 

Magnesium alloys are also suitable for polishing. Cotton-covered polishing disk is 
recommended. Typical speed of polishing is in the range from 1,800 to 2,500 m/min 
(Kainer and Kaiser 2003; Hillis 2005; Dickman 2011; Kostilnik 2005). However, 
polishing of magnesium alloys is usually not necessary in standard practices for 
preparation of magnesium alloy surfaces for moderate to severe environment, as the 
subsequent chemical treatment and anodic treatment rendered will roughen the 
surface again (ASTM D1732 2007; ASTM B879 2003). 

Chemical Cleaning 

After the preliminary mechanical cleaning, some impurities may still remain on 
magnesium alloy surfaces. These contaminations can be removed by acid pickling 
or fluoride anodizing. In other cases, the raw magnesium alloy materials are often 
received with temporary protective grease or wax layer over the surface, very 
similar to the case of cold roll steel. Therefore, alkaline or solvent degreasing is 
required to expose the metal prior to acid pickling or fluoride anodizing. In addition, 
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Table 3 Typical composition of alkaline degreaser (Hillis 2005; Cormier 2005) 


Name 

Chemistry 

Condition 

Basic caustic soda a 

2-5 % NaOH in water 

5-30 min 100 °C 

Modified caustic 
soda a 

2-5 % NaOH, 1 % Na 3 P0 4 , 0.06 % surfactant 

5-20 min 80-90 °C 

Simple soak bath 

1.5 % NaOH, 2.2 % Na 2 C0 3 • 10H 2 O, 0.06 % 
surfactant 

5-15 min 80-100 °C 

Milk alkaline 

1 % Na 3 P0 4 , 2.5 % Na 2 C0 3 • 10H 2 O, 0.06 % 
surfactant 

5-15 min 100 °C 


a Cleaners containing more than 2 % NaOH 


magnesium alloy articles that have been mechanically treated are also 
recommended to go through a degrease process to avoid any leftover machine oil. 

The following standards cover processes which are needed for the preparation of 
metal surfaces prior to the application of surface pretreatments and coatings: 

ISO 27831-2:2008 Metallic and other inorganic coatings - Cleaning and prepara- 
tion of metal surfaces - Part 2: Non-ferrous metals and alloys (ISO 27831- 
2:2008) 

ASTM D1732 Standard Practices for Preparation of Magnesium Alloy Surfaces for 
Painting (ASTM D1732 2007) 

ASTM B879 Standard Practices for Applying Non-Electrolytic Conversion Coat- 
ings on Magnesium and Magnesium Alloys (ASTM B879 2003) 

ASTM B322 Practice for Cleaning Metals Prior to Electroplating (ASTM B322 
2003) 

Alkaline Degrease 

Magnesium alloys have a good resistance to common alkalis except pyrophos- 
phates and some polyphosphates. The level of metal removal in common alkaline 
degreaser above a pH of 12 is negligible. Typically, alkaline degreaser for 
low-carbon steel can effectively remove the oil or grease present on the magnesium 
surface without causing any significant corrosion (see Table 3) (Avedesian and 
Baker 1999; Kainer and Kaiser 2003; Friedrich and Mordike 2006; Hillis 2005; 
Cormier 2005). Alkaline degreaser containing soap-like surfactants, such as 
octylphenol ethoxylates, nonylphenol ethoxylates, alcohol ethoxylates, secondary 
alcohol ethoxylates, and alkylamine ethoxylates, is recommended. 

The alkaline degreasing process can also be accelerated by applying a moderate 
cathodic potential on the article, known as cathodic cleaning. This electro-assisted 
method generates hydrogen on the magnesium alloys and promotes the removal 
efficiency of the impurities on the surface. The current density on the magnesium 
alloy articles is recommended to be controlled within 5 A/dnr, and normally, the 
potential required to sustain the current is approximately —4 to —8 V (Friedrich and 
Mordike 2006; Hillis 2005). By means of cathodic cleaning, the entire degreasing 
process can be effectively reduced to 3-5 min. It should be noted that the 
corresponding anode electrode has to be carefully chosen, because any heavy 
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Table 4 List of solvent degrease agent (Rupp and Surprenant 2005) 


Name 

Chemistry 

Condition 

Solvent immersion 

Paraffin or white spirit liquid 

5 min at RT 

Vapor treating 

Trichloroethylene vapor or trichloroethane vapor 

5 min in vapor 


metal ions that dissolve from the anode would eventually deposit on the magnesium 
alloy surface, resulting in galvanic problems. 

Solvent Degrease 

Organic solvents are also used to remove grease, oil contaminants, and protective 
wax. Normally, the solvent degrease process involves both solvent immersion and 
solvent vapor treating; the latter is required for completely cleaning the magnesium 
alloy surface (Avedesian and Baker 1999; Friedrich and Mordike 2006; Hillis 2005; 
Rupp and Surprenant 2005). The combination of immersion and vapor treating can 
effectively clean the surface without any metal removal, as shown in Table 4. 
Trichloroethylene and perchloroethylene are the most often used, and methylene 
chloride is effective in removing the excess organic resin. Methanol should not be 
used, as it may react with magnesium surfaces. 


Acid Pickling 

Acid pickling is suitable for removing contamination that is tightly bonded to the 
surface or insoluble in alkalis or solvents (Avedesian and Baker 1999; Friedrich and 
Mordike 2006; Hillis 2005; Hudson 2005). However, acid pickling should be 
applied to magnesium alloys carefully, as the alloy surface can be easily roughened 
and part of the metal would dissolve into the solution. Usually based upon the 
concern of etching effect, acid pickling is limited to die casting parts or those with a 
rough surface. On the other hand, acid pickling is effective on removing heavy 
metal impurities, as well as rust and residual conversion coatings on reused 
magnesium alloys. 

As shown in Table 5, most pickling agents contain chromic acid and 
hydrofluoric acid, which may increase the surface porosity, as different phases on 
the magnesium alloy surface dissolve at different rates. The associated corrosion 
resistance and mechanical strength may be affected. For this reason, the use of acid 
pickling should be carefully considered. 


Fluoride Anodizing 

Fluoride anodizing is applicable to all magnesium alloys and is recommended for 
all magnesium alloy articles prior to protective pretreatments and finishing 
(Table 6). Compared to acid pickling, fluoride anodizing is safer and more effective 
on removing contaminants of heavy metal left from mechanical cleaning, such as 
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Table 5 List of acid pickling agents (Hillis 2005; Hudson 2005) 


Name 

Chemistry 

Condition 

Nitric acid 

5-10 % HN0 3 in water 

Immersion or dipping, RT 

Sulfuric acid 

2-5 % H 2 S0 4 in water 

Immersion or dipping, RT 

Chromic acid a 

10-15 % Cr0 3 in water 

15 s-30 min 100 °C 

Hydrofluoric acid 

10 % HF in water 

30 s-5 min, RT 

Acetic acid/sodium 
nitrate 

20 % HAc, 5 % NaN0 3 in 

water 

3-5 min, RT 

Chromic/sulfuric acid 

10 % Cr0 3 , 0.1 % H 2 S0 4 in 

water 

Wiping 

Nitric acid/hydrofluoric 

(1)5-10% HN0 3 in water 

2 min in HN0 3 , followed by 15 min 

acid 

(2) 10 % HF in water 

in HF 


a A subsequent surface activation in HF or alkali is required prior to chemical conversion coating 


Table 6 Chemical composition and anodizing condition for fluoride anodizing treatment 
(Friedrich and Mordike 2006) 


Name 

Chemistry 

Condition 

Fluoride 

anodizing 

15-25 % NH 4 HF 2 in water 

AC voltage, gradually increase to 90-120 V 


metallic debris from grinding or blasting (Friedrich and Mordike 2006; Hillis 

2005). 

During fluoride anodizing, an AC voltage is applied and gradually increased 
until 90-120 V. Intensive current is generated at first but diminishes rapidly as 
impurities are removed and a barrier layer of MgF 2 forms on the article surface. 
This process is complete when the current drops to blow 0.5 A/dirr, usually within 
10-15 min. The fluoride film produced during the process is a weak base for 
adhesion and needs to be removed in chromic acid rinsing followed by hydrofluoric 
acid rinsing before chemical conversion coatings, but not necessary if the article 
will be subsequently subjected to anodizing. 


Chemical Conversion Coating 

Chemical conversion coatings are a family of chemically bonded superficial layer 
used to replace the native oxide surface and to passivate metal substrates. They are 
formed by contacting a metallic surface with an aqueous solution, which conven- 
tionally contains film-forming agents such as soluble chromate species. Since 
magnesium is highly reactive, once exposed to air, the surface of magnesium 
alloys is spontaneously oxidized, forming a thin oxide layer. However, the passivity 
of the thin oxide layer is low, which only provides limited protection in atmospheric 
exposure. Chemical conversion coating can be applied on magnesium alloys to 
serve as a better passive layer. In contrast to the native magnesium oxide, 
chemical conversion coatings have dense structures and improved corrosion 
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resistance owning to their inert chemical composition, typically trivalent 
chromium oxide, or phosphate. Meanwhile, chemical conversion coatings play 
an important role in improving the adhesion of organic primers and finishing 
paintings. 

The following standards cover processes which are needed for the application of 
chemical conversion coatings: 

ASTM D1732 Standard Practices for Preparation of Magnesium Alloy Surfaces for 
Painting (ASTM D1732 2007) 

ASTM B879 Standard Practices for Applying Non-Electrolytic Conversion Coat- 
ings on Magnesium and Magnesium Alloys (ASTM B879 2003) 


Principles of Chemical Conversion Coatings 

Chemical conversion coatings provide magnesium alloys with a passive surface 
against corrosion and a base layer for further polymer paint (Chen et al. 2011a; Ono 
et al. 1998, 2001; Chong and Shih 2003; Wu et al. 2005; Weng et al. 2006; Zheng 
and Liang 2007; Takai and Takaya 2008; Umehara et al. 2003; Simaranov 
et al. 1989, 1992). Chemical conversion coatings are formed through a series of 
spontaneous chemical reactions, which can be generalized into two stages 
(Simaranov et al. 1989, 1992). During the first immersion stage, the native magne- 
sium oxide and part of the magnesium substrate dissolve into the solution, as the 
conversion coating bath usually has a low pH value. Meanwhile, the conversion 
coating species precipitates and deposits onto the substrate surface, forming a 
gel-like hydroxide layer. In the second stage, the hydroxide layer further undergoes 
dehydration and condensation. Eventually, a dense barrier coating is formed over 
the magnesium alloy substrate with a rough and dendrite-like surface. The actual 
chemical conversion coating formation mechanism on magnesium alloy is more 
complicated and different between each different type; however, the simplified 
model is widely accepted for understanding the coating structure and the chemical 
treatment process. 

Although different versions of chemical conversion coating systems may con- 
tain different compounds such as chromates and phosphates, they are commonly 
characterized by acting as a passive layer that replaces the original magnesium 
oxide protecting the metal substrate. The new conversion coating layer is thicker, 
denser, more chemically stable, and with less permeability to aqueous environment, 
some even containing corrosion-inhibiting compounds as a reservoir of self-healing 
agents. 

Chemical conversion coatings are also known as a good base layer for bonding 
organic primer and finishing paintings owning to their rough and dendrite-like 
surface. It should be noted that the corrosion protection by a chemical conversion 
coating layer itself is only temporary and not considered to be protective finishes. 
These coatings must be used in conjunction with other primers or surface finishing 
coating to exhibit full potential. 
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Chromate Conversion Coatings 

Chromate conversion coatings have been shown to significantly improve the cor- 
rosion resistance on all types of magnesium alloys in moderate service conditions. 
Substrates with fine-grained micro structure best respond to chromate-based con- 
version coatings. The first type of commercial chromate conversion coatings for 
magnesium alloys was known as Dow 1 (ASTM D1732 2007; ASTM B879 2003; 
Simaranov et al. 1989, 1992). Because of the low cost and good performance, this 
type of conversion coating has been used for decades. However, the critical 
disadvantage of chromate conversion coatings is the content of various hexavalent 
chromium compounds, which are toxic and carcinogenic, leading to increasingly 
restricted usage. 

The chromate conversion coating is dominated by Cr 2 0 3 , which is both chem- 
ically inert and physically robust; in addition, the coating also contains some Cr 
(VI) species remained from the solution as well as some magnesium compounds 
such as Mg0 2 , MgF 2 , and NaMgF 3 , among which Cr(VI) species are still chemi- 
cally active, and provides the treated magnesium surface with self-healing capa- 
bility over long-term usage. 

Some other commercial chromate conversion coating solutions contain HF, 
which is known as a surface etch agent or activator (ASTM D1732 2007; ASTM 
B879 2003). HF is added to assist the removal of the native magnesium oxide and 
other metal impurities on the substrate surface. Sometimes, HF activation is 
separated from the conversion coating bath as a pre-step prior to the coating 
deposition. Generally, the addition of HF can improve the efficiency of the coating 
deposition and the quality of the coating structure. 

Dow 1 

Dow 1 conversion coating system, also known as acid chromate treatment or 
chrome pickle developed by Dow Chemical, was the first generation of commer- 
cialized chromate conversion coating systems for magnesium alloys. In Table 7, a 
significant content of HN0 3 is present in the immersion bath, which is designed to 
sustain the pH and strip off the native oxide surface. However, the magnesium alloy 
substrate dissolves rapidly in Dow 1, which may cause approximately 15-25 pm 
loss by surface dissolution. Therefore, the Dow 1 treating process is restricted 
within 2 min and not recommended for finely machined parts or articles with 
tight tolerances. Table 7 gives the standard practices of Dow 1 according to 
ASTM B879 and D1732 (ASTM D1732 2007; ASTM B879 2003). 

Dow 7 

In Dow 7, or known as dichromate treatment, the HN0 3 is removed, and an 
independent HF activation pre-step is added prior to the immersion (Table 8). A 
thorough rinse is required after HF activation, as excessive F" residuals may reduce 
the quality of the final conversion coating, leading to lower corrosion resistance. 
Compared to Dow 1, Dow 7 treatment only causes negligible metal loss on 
magnesium alloy surface and produces film that is excellent for paint base. 
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Table 7 Chemical composition and condition of Dow 1 (ASTM D1732 2007; ASTM B879 2003) 


Name 

Chemistry 

Condition 

Dow 1 

1 8 % Na 2 Cr 2 0 7 • 2H 2 0 

1-2 min immersion at RT 

18 % HN0 3 

Drain for 5-15 s, rinse, and dry 

Dow 1 for 
castings 

1 8 % Na 2 Cr 2 0 7 • 2H 2 0 

15 s to 0.5 min immersion at 70-80 °C 

18 % HN0 3 

Followed by 10 s in Dow 1 again at 50-60 °C, 
rinse and dry 

0.15 % NaHF 2 or KHF 2 or 
NH 4 HF 2 


Table 8 Chemical composition and condition of Dow 7 (ASTM D 1732 2007; ASTM B879 2003) 


Name 

Chemistry 

Condition 

Dow 7 

30 % HF 

1-5 min immersion at RT and rinse thoroughly 

Activation 

pre-step 



Immersion 

12-18 % Na 2 Cr 2 0 7 • 2H 2 0 

30-40 min immersion at 100 °C 


0.25 % MgF 2 or CaF 2 

Rinse in cold water followed by hot water 


4.1 < pH < 5.5 

(<80 °C) and dry 


Chrome Manganese and R.A.E. Bath 

These later developed chromate-based conversion coating systems have a reduced 
content of chromate and a moderate pH value. Metal loss due to etching or dissolution 
during the immersion stage is negligible (4-6). Table 9 gives the chemical compo- 
sition and condition for chrome manganese coating and R.A.E. bath. 

Chromic Acid Brush-On 

Repairing damaged surface on the chromate-treated magnesium alloy articles 
emphasizes an easy but effective method. For this reason, using a simplified but 
similar performed method is much more practical than re-treating the damaged 
articles in the original immersion bath. These simplified systems are designed to fix 
local damages by wiping or swabbing, known as touch-up chromate conversion 
coatings. The rest part of the article surface remains intact. Table 10 gives some 
commercial touch-up products developed by Dow and Henkel. 

Chromate-Free Conversion Coatings 

Recent research has focused on the development of various non-chromate and 
chromium-free systems in response to environmental protection laws, chromate 
minimization policies, and other regulations (Fahrenholtz et al. 2002; Kendig and 
Jeanjaquet 2002; Zhang et al. 2005; Yong et al. 2008; Costa 1997; Costa and Klein 
2006; Hu et al. 2009). Nevertheless, the performance of non-chromate systems is 
still not comparable to chromate-based systems particularly in severe corrosion 
environment. 
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Table 9 Chemical composition and condition of chrome manganese bath and hot half hour 
R.A.E. bath (ASTM D1732 2007; ASTM B879 2003) 


Name 

Chemistry 

Condition 

Chrome manganese 

1 5 % Na 2 Cr 2 0 7 • 2H 2 0 

2 h immersion at RT 

0.5 % MnS0 4 • 5H 2 0 

Rinse and dry 

5 % MgS0 4 • 7H 2 0 

4 < pH < 6 

R.A.E. Bath 

1.5 % Na 2 Cr 2 0 7 • 2H 2 0 

30 min immersion at 100 °C 

1.5 % (NH 4 ) 2 Cr 2 0 7 

Rinse in warm water and dry 

3 % (NH 4 ) 2 S0 4 

pH: 

For Mg-Zr or Mg-Al alloy: 

0.27 % NH 4 OH 

5.90 < pH < 6.05 

For Mg-Mn or Mg-Zn-Mn alloy: 

0.37 % NH 4 OH 

6.00 < pH < 6.10 


Table 10 Chemical composition and condition of Dow 19 (Friedrich and Mordike 2006; ASTM 
B879 2003) 


Name 

Chemistry 

Condition 

Dow 19 touch-up 

1 % Cr0 3 

Apply by wipe for 1-3 min 

5 % CaS0 4 

Rinse and dry 


Other commercial products: Alocrom 1200 touch-up, Alodine 1132 


Generally, these non-chromate replacements rely on phosphate and compounds 
containing transition-metal elements such as Mn, V, Zr, and Mo to serve as the 
main material source of forming the inert barrier layer. Similar to Cr, these elements 
possess multilevel valency stages: (1) at high valency stage, they are very strong 
oxidants, like chromate, and can oxidize magnesium by reducing to a more stable 
state, and (2) at low valency stage, they are less soluble and chemically stable. From 
this point of view, these non-chromate conversion coating systems work in a very 
similar way that the chromate system does on magnesium alloys. The current 
development of non-chromate replacement has identified some promising systems 
that provide comparable corrosion protection in moderate corrosion environment; 
however, the overall performance still needs improvement, such as deposition rate, 
coating firmness, solution stability long-term performance, ease of applying, etc. 


Phosphate-Permanganate Conversion Coatings 

The concept of using permanganate-based system as a conversion coating treatment 
for magnesium alloys is based on its similar chemical properties to chromate 
(Umehara et al. 2001, 2003; Zucchi et al. 2007). However, the corrosion protection 
performance solely based on permanganate treatment is not satisfactory in practical 
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Table 1 1 Chemical composition and condition of phosphate -permanganate conversion coatings 
(Gray and Luan 2002; Friedrich and Mordike 2006) 


Name 

Chemistry 

Condition 

Phosphate-permanganate 

2.0-2. 5 % KMn0 4 

5-10 min immersion at 80 °C 

10-15 % Na 2 HP0 4 

Rinse in warm water and dry 

5 % H 3 P0 4 


application. It has been found that the Mn(IV) species precipitate too fast, resulting 
in a loose structure rather than a dense conversion coating layer. To overcome this 
problem, permanganate is used in combination with phosphate. The phosphate- 
permanganate system works successfully on magnesium alloys (Table 11) (Chen 
et al. 2011a). In addition, it is more environmentally friendly and has been shown to 
have corrosion resistance comparable to chromate treatments. 

Fluorozirconate Conversion Coatings 

Fluorozirconate-based treatments have also shown to be a promising alternative for 
magnesium alloy articles (Chen et al. 2011a; Tomlinson 1995, 1988, 1999). Zirco- 
nium is believed to form continuous three-dimensional polymeric or metalloid- 
oxide matrix in a similar way as chromium ( Atanasy ants and Nikitin 200 1 ; Mogoda 
1999). This makes zirconate-based system an attractive choice for non-chromate 
alternatives. However, disadvantages of this type of treatment are the low stability 
with hard water and the relatively weak corrosion resistance in severe corrosion 
environment. Recent efforts on improving fluorozirconate-based conversion coat- 
ings have been focused on systems that combine other components such as vana- 
date, chromium(III), and cerium(IV) (Zanotto et al. 2011; Wang et al. 2009; Lin and 
Fang 2005; Li et al. 2008; Ardelean et al. 2008). 


Anodizing Methods 

Anodizing methods have been extensively used as reliable protective pretreatments 
for magnesium alloys to achieve a better corrosion resistance for severe environ- 
ment. The anodized magnesium alloy articles also possess higher surface hardness 
and better wear resistance, as well as enhanced paint adhesion. The concept of 
anodizing magnesium alloys was developed in the 1940s (Budiloff 1942; Buzzard 
1943; Jenny 1933; Keelee 1926). Commercialized hard anodizing methods were 
developed in the 1950s and were continuously improved (Evangelides 1955). For 
example, Dow 17 introduced hot aqueous acidic method by modifying its chemical 
conversion coating system (Herbert 1959). Meanwhile, Harry A. Evangelides 
invented an aqueous alkaline system, known as HAE treatment, which is applicable 
to most magnesium alloys (Evangelides 1955, 1959). Dow 17 and HAE treatments 
produce more substantial coatings (5-30 pm) and have been widely used as basis 
for heavy-duty paint systems (Kainer and Kaiser 2003; Hillis 2005). 
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With the development of plasma electrolytic oxidation (PEO) method, oxide 
growth efficiency and film thickness can be significantly improved (Bai and Li 
2004; Blawert et al. 2006; Luo et al. 2006; Wu et al. 2007). Aluminate and borate 
were introduced to replace previous transition metals such as Cr, V, and Mn, 
removing the undesirable dark surface color (Kobayashi et al. 1988; Schmeling 
et al. 1990). Currently commercialized anodizing systems, such as MAGOXID- 
COAT and TAGNITE, are PEO process. 

The following standards cover processes which are needed for the application of 
magnesium anodizing: 

ASTM D1732 Standard Practices for Preparation of Magnesium Alloy Surfaces for 
Painting (ASTM D1732 2007) 

ASTM B893 In Specification for Hard-Coat Anodizing of Magnesium for Engi- 
neering Applications (ASTM B893 2003) 


Principles of Magnesium Anodizing 

The oxide layer on magnesium and magnesium alloy surface can also be enhanced to 
improve the corrosion resistance via anodic polarization in proper electrolyte (Budiloff 
1942; Buzzard 1943; Jenny 1933; Keelee 1926). During the anodizing process, the 
growth of the oxide layer is accelerated and sustained by the electric current. Proper- 
ties of the oxide layer such as thickness, density, and porosity are largely determined 
by the current/potential and polarization time; by contrast, the growth of chemical 
conversion coating is spontaneous and self-terminated. The anodized surface usually 
has a layered stmcture: (1) a thin barrier layer at the metal-coating interface followed 
by (2) a porous thick layer. The porous thick layer can be either ordered cellular or 
irregular stmcture. The ordered stmcture is often observed in hard anodizing coatings, 
while the irregular structure is identified in PEO coatings. 

PEO anodizing produces thicker and denser oxide coatings (Schmeling et al. 1990; 
Ross et al. 1998; Barton et al. 2001; Dolan 2004; Dolan 2005). This process requires a 
higher voltage, and it is characterized by sparking, known as plasma. These sparks are 
discharged near the surface of magnesium alloy and associated with partial melting of 
the amorphous oxide and the metal substrate, which ultimately convert the original 
surface into an oxide ceramic layer. As the oxide ceramic layer builds up, the voltage 
increases accordingly to overcome the dielectric breakdown, hence a linear relation- 
ship between the film thickness and the final voltage. 


Hard Anodizing 
Dow 17 

Dow 17, developed by Dow Chemical, can be applied to all magnesium alloys. The 
anodizing bath employed in this treatment is acidic and consisted of chromate, 
fluoride, and phosphate (see Table 12). Film growth rate can be adjusted by 
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Table 1 2 Chemical composition and anodizing condition of Dow 17 process (Kainer and Kaiser 
2003; Hillis 2005; ASTM D1732 2007) 


Name 

Chemistry 

Condition 

Dow 17 

350 g/L NH 4 HF 2 (for DC) or 250 g/L NH 4 HF 2 (for AC) 

^constant 0.5 5 A/dlll 

100 g/L Na 2 Cr 2 0 7 • 2H 2 0 

v max 1 00 V 

90 g/L H 3 P0 4 (85 %) 

15-25 min 

pH = 5 

T = 70-80 °C 


After anodizing, rinse the articles in cold water, followed by a short immersion in hot water, and 
dry with hot air. Dark green surface, moderate hardness, good throwing power, chromate 
containing 


choosing a proper anodizing current. The anodizing voltage increases with time as 
the film grows up. Normally, the voltage is in the range of 60-70 V after 4-5 min 
anodizing and reaches 90 V after 25 min, when the film can grow up to 40 pm 
(Kainer and Kaiser 2003; Hillis 2005; ASTM D1732 2007). 

Dow 17 process produces a two-layer coating. The first layer is deposited at the 
initial stage as a thin interface with approximately 5 nm, while the bulk part of the 
coating is formed during steady state of film growth at a higher voltage. Similar to 
chromate-based chemical conversion coating, Dow 17 coating has a dark green 
color. 

HAE 

HAE process, named after Harry A. Evangelides, is applicable to all types of 
magnesium alloys (Gray and Luan 2002; Kainer and Kaiser 2003; Hillis 2005; 
ASTM D1732 2007). Different from Dow 17, HAE bath solution is strongly basic, 
containing considerable KOH (see Table 13 below). As the anodizing voltage 
approaching higher, a dark brown film of 30 nm is produced. The color is caused 
by the spinel compounds of Mg, Al, and Mn. The wear resistance of HAE coating is 
higher than Dow 17 and chromate -based conversion coatings. 

Both Dow 17 and HAE coatings require a post-sealing treatment to exhibit their 
full corrosion resistance. 


PEO Process 
TAGNITE 

TAGNITE surface treatment, invented by Technology Applications Group, USA, is 
chromate-free and suited for all types of magnesium alloys (Friedrich and Mordike 
2006; Bartak et al. 1993a, b, c, 1995). TAGNITE treatment is a two-step process, 
which involves (1) an electroless initiation to create a firmly bonded protective 
interface by immersion in an ammonium fluoride bath and (2) a PEO stage, which is 
carried out in a strongly basic bath containing hydroxide, fluoride, and silicate 
compound. This process produces a ceramic coating, mainly MgSi0 3 , which pro- 
vides good hardness and wear resistance (Table 14). The TAGNITE coating has a 
white appearance, due to free of compounds with colors as in Dow 17 and HAE. 
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Table 13 Chemical 
composition and anodizing 
condition of HAE process 
(Kainer and Kaiser 2003; 
Hillis 2005; ASTM D1732 
2007) 


Name 

Chemistry 

Condition 

HAE 

120-160 g/L KOH 

^constant 1*5 2.5 A/dlll 

34 g/L KF 

V 90 V 

y max y 

34 g/L Al(OH ) 3 

10-90 min 

34 g/L Al(OH ) 3 

T = 20-30 °C 

19 g/L K 2 M 11 O 4 

pH = 14 


Dark brown surface, good hardness, moderate throwing power 


Table 14 Chemical composition and anodizing condition of TAGNITE process (Bartak 
et al. 1993a, b, c, 1995) 


Name 

Chemistry 

Condition 

TAGNITE 

20-40 g/L NH 3 F 

30-40 min 

Electroless initiation 

T = 70-80 °C 

PEO stage 

5-7 g/L KOH 

^constant 1 ^ A/dm 

15-20 g/L K 2 S0 3 

V max 400 V 

8-10 g/L KF 

T = 10-20 °C 

pH = 13 


White surface, good hardness, good throwing power 


Compared to Dow 17 and HAE, TAGNITE coating exhibits very good corrosion 
resistance and wear resistance, which can be attributed to the PEO process and the 
more compact ceramic structure. 

M AGOXI D-CO AT 

MAGOXID-COAT, invented by GmbH Ltd., is used on all types of magnesium 
alloys (Friedrich and Mordike 2006). Similar to TAGNITE, MAGOXID-COAT is 
also carried out in a two-step process: (1) HF acid activation, followed by (2) PEO 
in a neutral bath. The bath electrolyte may contain inorganic anions such as 
phosphate, borate, silicate, aluminate, or fluoride. Certain organic acids such as 
citrate, oxalate, and acetate are also employed. Table 15 summarizes the typical 
electrolyte composition and anodizing parameters of MAGOXID-COAT process. 
The coating normally grows at the rate of 1.5 pm/min. MAGOXID-COAT 
consists of three layers: (1) a thin (100 nm) barrier layer at the metal surface, 
followed by (2) an oxide ceramic layer of low pore count, and (3) a porous 
ceramic layer consisting of a mixture of magnesium oxide, fluoride, phosphate, 
and borate on the top. Coating surface has a white appearance, and the color can be 
altered by compounds of transition-metal oxide compounds, such as introducing 
vanadate or molybdate into the anodizing electrolyte. The addition of these ions 
usually renders a smoother surface, improved corrosion resistance, but inevitably a 
dark color. 
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Table 15 Chemical composition and anodizing condition of MAGOXID-COAT process 
(Friedrich and Mordike 2006) 


Name 

Chemistry 

Condition 

MAGOXID- 

COAT 

10 % HF (40 %) 

30-60 s, RT 

Acid activation 



PEO 

30 g/L HF (40 %) 

T 1 2 

^constant L ^ 

A/dm 2 


60 g/L H 3 PO 4 (85 %) 

V max 400 V 


35 g/L H 3 BO 3 

T = 10-20 °C 


360 g/L hexamethylenetetramine or 
hexamethylenediamine 



pH = 7-7.2 



White surface, good hardness, good throwing power 


Sealing and Finishing 
Sealing 

Sealing is a standard subsequent process after anodizing and sometimes is also 
applicable to chemical conversion coating for a better corrosion resistance. 
Although the anodized surface is coated with a thick passive layer, some even 
possessing a ceramic structure, the majority of the coating layer is still not fully 
dense. The corrosion resistance of as-prepared anodized surface would be inevita- 
bly reduced, if the porous structure is open to the moisture or corrosion environ- 
ment. Generally, during the sealing process, these open pores are closed by 
hydrated compound or by the volume expansion caused by the hydration of oxide 
structure. A successful sealing treatment can improve the abrasion and wear 
resistance as well as the corrosion resistance of the magnesium alloy surface. 

Hydrate Sealing 

Hydrate sealing is an effective method to close the open pores on anodized 
magnesium alloy articles. According to the operation temperature, sealing pro- 
cesses for magnesium alloys can be categorized into “hot sealing” and “cold 
sealing.” As an analogous process to the sealing of anodized A1 alloys, “hot 
sealing” using hot or boiling water with a relatively long treatment time is 
employed as a standard for anodized magnesium alloy articles (Table 16) (Basaly 
et al. 1995; Massad et al. 1995). However, hot sealing method is highly energy 
intensive and, due to its duration, may cause a bottleneck in the production line. To 
improve the sealing efficiency and lower the temperature, ammonium difluoride 
and nickel, titanium, or zirconium salts are added into the sealing bath, hence the 
“cold sealing.” However, due to the toxic properties of nickel salts, cold sealing 
implies costly wastewater treatments. 
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Table 16 Chemical composition and condition of Mg sealing methods (Hillis 2005; ASTM 
D1732 2007) 


Name 

Chemistry 

Condition 

Water glass sealing 

50 g/L Na 2 Si0 3 

95-100 °C 

Immersion for 15 min 

Ammonium difluoride sealing 

100 g/L NH 3 HF 2 

20-30 °C 

Dipping and drying without rinsing 


Sol-Gel Sealing 

Based on the concerns of the environmental issue and energy consumption, new 
sealing systems for magnesium alloys by employing hydrosilicons or sol-gel-based 
systems are currently under development (Zanotto et al. 2011; Banerjee and Raman 
2011; Chen et al. 2011b; Hu et al. 2010; Quach et al. 2008; Wojtala 2011; Xue 
et al. 2011; Zhang and Wu 2009; Zucchi et al. 2008; Lamaka et al. 2008). 
Hydrosilicons, a group of silicon-based organic -inorganic chemicals, are emerging 
to be a very promising alternative to conventional toxic nickel, titanium, or zirco- 
nium salts based on cold sealing systems. In light of their chemical properties, the 
hydrosilicon, when attaches to the anodized magnesium alloy surface, spontane- 
ously forms a gel layer that penetrates into the porous structure and converts into a 
barrier layer dominated by silicon after condensation. This barrier layer is chemi- 
cally bonded to the underneath oxide and isolates the substrate from moisture or 
corrosion environment. Dying at an elevated temperature of 80-100 °C can accel- 
erate the condensation rate. 

However, the performance provided by current sol-gel sealing technologies is 
still not as satisfying as the conventional methods. A number of papers related to the 
sol-gel treatments on magnesium alloys, though only on academic level, have 
identified several promising hydrosilicon combinations (Zanotto et al. 2011; 
Banerjee and Raman 2011; Chen et al. 2011b; Hu et al. 2010; Quach et al. 2008; 
Wojtala 2011; Xue et al. 2011; Zhang and Wu 2009; Zucchi et al. 2008). Silicon 
layer with stronger interfacial adhesion and denser structure is the key factor 
that contributes to the better corrosion resistance. With further modification by 
resins, inhibitors, and surfactants, the sol-gel sealing system can provide an envi- 
ronmental and health hazard-free alternative solution for magnesium anodizing and 
sealing. 


Organic Coatings 

Organic finishing is the final stage of the entire surface treatment process. Painting 
of magnesium alloys is similar to that of other metals, including single coating 
system to multilayered system with primers and top paints. These organic coatings 
should be applied over the pretreated magnesium alloys to enhance the surface 
abrasion and wear resistance and achieve decorative purposes as well as superior 
corrosion resistance for long-term protection. 
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Primers 

For magnesium alloys used in moderate to severe corrosion environment, it is appropriate 
to choose multilayered paint systems as the final finish, because the primer layer can 
promote the adhesion to top organic paint. Alkali-resistant and moisture-impermeable 
epoxy resins are strongly recommended as primers for painting magnesium alloys. The 
magnesium alloy substrates must be completely dry to avoid humidity and guarantee a 
good adhesion. Typical primers that have been successfully used on magnesium alloys 
are epoxy-based or epoxy-polyester-based resins. Acrylates, polyvinyl butyral, and 
polyurethane are also used. Pigments, such as chromate, zinc, or titanium dioxide, are 
commonly added in these primers for inhibition or corrosion prevention. Chromate-based 
pigment (> 15 wt% in dry film) is usually employed in the primers for severe conditions. 


Top Paints 

Magnesium alloy articles can be literally coated with any type of top paints, such as 
epoxy, acrylate, polyurethane, and polyester. Epoxy paint offers a superior wear or 
abrasion resistance; polyurethane is more resistant to UV degradation; and others 
with vinyl-based compounds display good thermal stability. For regulations and 
standards of organic coatings, please refer to the technical data sheets from the 
companies supplying paints. 

The following standards cover specifications needed for selecting primers and 
top paints for magnesium alloys: 

ASTM D16 Standard Terminology for Paint, Related Coatings, Materials, and 
Applications 

AMS 3108E Primer, Ocher, Phenolic 

AMS 3116 Primer Coating Epoxy, Chemical & Solvent Resistant, Non-Chromate 
AMS 3116A Primer Coating: Epoxy, Chemical and Solvent Resistant 
Non-Chromated, Corrosion Preventive 
AMS 3130F Vehicle Paint, Glyceryl Phthalate 

AMS 3140 Coating: Urethane, Aliphatic Isocyanate, Polytetrafluoroethylene 
AMS 3143 Powder Coating Materials, Epoxy 
AMS 3143A Powder Coating Materials, Epoxy 
AMS 3145C Paint, Marking Epoxy 

AMS C83231 Coatings, Polyurethane, Rain Erosion Resistant, Aircraft & Missiles 
MIL C-83466 NOT-1 Polyurethane Coatings for Aircraft Applications 
MIL C-85322B(2) Elastomeric Polyurethane Coating for Rain-Erosion 
MIL PRF-22750F Coating Epoxy, High Solids 
MIL PRF-24712AQ) Powder Coatings 

MIL PRF-32239 Coating Systems, Advanced Aerospace Applications 
MIL PRF-81352C Aircraft Touch-up 

MIL PRF-85285D(1) Polyurethane Coating for Aircraft & Support Equipment 
MIL STD-7179 Finishes, Coatings to Protect Aerospace Weapons Systems 
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Testing Methods for Treatment Evaluation 

Typical testing methods for evaluating the quality of the surface treatment and 
coating systems on Mg alloy substrates include salt spray resistance, wet adhesion 
of organic coating, wear resistance, and impact resistance as listed below: 

ASTM D1654 Standard Test Method for Evaluation of Painted or Coated Speci- 
mens Subjected to Corrosive Environments 
ASTM B117 Standard Practice for Operating Salt Spray (Fog) Apparatus 
ASTM D3359 Standard Test Methods for Measuring Adhesion by Tape Test 
ASTM D4060 Standard Test Method for Abrasion Resistance of Organic Coatings 
by the Taber Abraser 

ASTM D2794 Standard Test Method for Resistance of Organic Coatings to the 
Effects of Rapid Deformation (Impact) 

Mg panels used in salt spray corrosion resistance tests are commonly 10 x 15 cm, 
thickness less than 10 mm. Painted surface should be scribed with an “X” prior to salt 
spray exposure. Typical salt spray test (ASTM B117) exposure duration for painted 
surface can last for 1 month, depending on the quality of the samples. Periodically, the 
evaluation in accordance with ASTM D1654 is required during the test. For unpainted 
surface, “X” scribes are not necessary, and commonly accepted exposure duration for 
PEO coating without paint protection is 4 days to 1 week. Please note that Dow 17, HAE, 
and other non-PEO pretreatment layers do not exhibit good corrosion resistance without 
painting, and the exposure duration without corrosion is usually within 2 days. 

Wet adhesion of organic paint is measured by the crosscut tape test in accor- 
dance with ASTM D3359. Please note that the finished samples are required to be 
immersed in DI water at 49 °C for 4 days before testing. 

Wear resistance of unpainted hard surface can be measured by using a Taber Abraser. 
The test is required to be conducted in duplicate in accordance with ASTM D4060. The 
surface coating is normally abraded by CS-17 wheels under an applied load of 1 kg. The 
weight loss is measured after 1,000 abrade cycles as the Taber wear index (mg/1,000 
cycles). Normal Taber wear index of PEO coating is within 50 mg/1,000 cycles. 

Gardner impact tester can be used for evaluating the impact resistance of surface 
treatments on Mg alloy. Impact tests in accordance with ASTM D2794 use an 
indenter of 900 g and 12.7 mm to impact the specimen surface. The adhesion of the 
coating system in the impacted area is evaluated by tape text, ASTM D1654). The 
height of the weight increases until the point of cracking, or loss of adhesion of the 
coating system, to determine the impact failure end point (g-m). 


Summary 

In general, complete mechanical cleaning and chemical cleaning of magnesium 
alloys are essential to good coating adhesion and the final corrosion resistance. It is 
important that special precaution must be taken to avoid sparks during grinding or 
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polishing of magnesium alloys in dry conditions. Owning to the application of 
current or voltage, anodizing processes usually provide denser and thicker coating 
structure and therefore better corrosion protection than the performance of chemical 
conversion coatings. Magnesium anodizing is recommended when a high corrosion 
resistance is desired, which is the case in many automotive, aerospace, and marine 
applications. In response to environmental protection laws, chromate minimization 
policies, and other regulations, chromate-based surface treatments, as specified in 
current standards of chemical conversion coating and anodizing for Mg, are 
increasingly restricted. Research efforts have been made to look for alternative 
processes. The newly developed chromate-free PEO processes are more effective 
and reliable to achieve better paint adhesion and corrosion resistance. 
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Abstract 

This chapter presents a comprehensive review of the thermally induced stress in 
multilayer thin films within both the elastic and elastic-plastic deformation 
ranges and several approaches to determine the thermomechanical properties 
of thin films. The elastic analysis based on the linear strain assumption results in 
the closed-form solutions and approximations (for very thin films). Subse- 
quently, the review is extended into the elastic-plastic deformed films in bilayer 
structures. Closed-form solutions of the maximum, average, and minimum film 
stresses and curvatures are discussed in details for plastically deformed films. 
The difference among the maximum stress, average stress, and Stoney stress in 
films is systematically revealed. As an example, the result of a case study reveals 
that the yield start point may be estimated as a linear function of temperature in 
the elastic-plastic deformation range. 

A newly developed simple approach to determine the values of five 
thermomechanical properties of thin films, namely, the Young’s modulus, the 
coefficient of thermal expansion, yield start stress, strain hardening modulus, 
and Poisson’s ratio, is presented in details, together with some simple and 
generic approaches for the characterization of thin films with nonlinear stress 
versus strain relationship and/or temperature-dependent material properties. In 
the case of very thin films, analytical solutions are available. 

These new approaches and solutions are applied to investigate the moduli of 
metallic films. The film thickness effect on the modulus of Ag films is confirmed. 
The critical role of a compressive stress in thin Ti0 2 layer atop NiTiCu film in 
the reversible trench phenomenon is identified. 


Introduction 

Thin film technology is simultaneously one of the oldest arts and one of the newest 
sciences (Ohring 2002). Thin films are coated atop a substrate by various methods, 
such as, evaporation, sputtering or electrolytic deposition, etc., to construct multi- 
layer structures, which have a wide range of applications. 

The importance of thin films has increased dramatically over the past several 
decades. The micro-electro-mechanical systems (MEMS) industry is rapidly grow- 
ing in quantity and diversity, which raises a range of great opportunities for micro 
mechanical components based on thin film technology (Tada et al. 2000; Townsend 
et al. 1987). Consequently, thin films have attracted more and more attention from 
many engineering/research communities. 

Nowadays, multilayer stmctures have already been widely adopted in applied 
physics research and numerous industrial applications. Thin films can store energy 
and provide activation force and/or large displacement due to, for instance, the 
mismatch in coefficient of thermal expansion (CTE) or phase transformation. 
A number of applications in micro actuators, such as micropumps, microvalves, 
and microsensors (Benard et al. 1998; Krulevitch et al. 1996) etc., have been 
achieved based on these principles. Thin film based surface coating to protect 
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structural materials in harsh environment is another application of commercial 
significance. In addition, thin film technology is also utilized extensively for magnetic 
data storage in hard disk drives (Freund and Suresh 2003). 

Residual stress in thin films may be resulted due to different reasons. The most 
important ones in micro thin films are as follow (Lima et al. 1999; Elshabini-Riad 
and Fred 1998; Liu and Murarka 1992): 

1. Thermal stress 

Thermal stress results from the difference in CTE between thin film and 
substrate. 

2. Interfacial stress 

Interfacial stress is largely caused by the difference in the lattice structural 
properties between thin film and substrate. 

3. Intrinsic stress 

Intrinsic stress in thin films is normally induced during deposition or 
postdeposition treatment, e.g., annealing. 

Residual stress inevitably has an important influence on the physical properties 
and performance of thin films (Maissel and Glang 1970; Roll 1976; Jin et al. 2001). 
Several failure mechanisms associated with the residual stress in thin films in 
integrated circuit (IC) industry have been identified. Consequently, it is an impor- 
tant issue to determine film stress for various purposes. Structural integrity and 
structural dynamic characteristics are two among others (Chen and Ou 2002). 

The determination of thermomechanical properties of thin films has also 
attracted great interest from many communities, since the stability and reliability 
of thin film-based devices are very much dependent on these properties (Huang 
et al. 2004a). 

It is well known that the properties of thin films may be significantly different 
from those of bulk materials (e.g., Lima et al. 1999; Zhao et al. 2000a; Kalkman 
et al. 2001). In addition, the thermomechanical properties of thin films may also 
depend upon the film geometry (e.g., thickness), fabrication techniques, and 
processing history (Fang and Wickert 1995; Fang et al. 1999; Jain et al. 2001). It 
is of great importance to use the actual thermomechanical properties of thin films, 
not those of bulk materials, for a precise understanding of their response. 

On the other hand, finite element analysis (FEA) is often utilized for the 
prediction of the stress and deformation of thin film-based complicated structures. 
For such an analysis, accurate thermomechanical properties of thin films as well as 
those of substrate, for example, the Young’s modulus, CTE, Poisson’s ratio, and 
strain hardening modulus, are necessary as the original input. 

Although the residual stress in thin films has been investigated for decades and 
numerous publications can be found in the literature, for multilayer thin films with 
arbitrary thickness and properties, closed-form solutions were not available until 
1983 (Feng and Liu 1983). However, in these previous methods, the number of both 
unknowns and boundary conditions increases dramatically with the number of 
layers (Townsend et al. 1987; Liu and Murarka 1992). Furthermore, all previous 
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closed-form solutions are limited to within the elastic deformation range for both 
thin films and substrates. 

There are also well-documented approaches in the literature about the charac- 
terization of thin films due to the practical importance. However, most of the 
existing techniques require either very complicated and/or dedicated equipments 
or tedious procedures and have many difficulties in dealing with a few properties by 
just a couple of simple experiments. 

This chapter presents a systematic analysis of thin film stress in multilayer 
structures in the whole elastic-plastic deformation range. In addition to traditional 
approaches, newly developed simple approaches to determine thermomechanical 
properties of thin films deposited atop a substrate are introduced. Applications are 
demonstrated. 

Literature Review 

Stress in Deposited Thin Films 

Interfacial bonding of thin films to each other and to a substrate causes physical 
interaction and maybe chemical interaction as well. That is, the films and substrate 
can be held together under a state of compressive or tensile stress, known as 
interfacial stress, which appears due to differences between structural properties 
of the substrate and thin films (Lima et al. 1999). Besides interfacial stress, there are 
other two sources of stress: intrinsic stress and thermal stress. Intrinsic stress is 
incorporated into a film during deposition or postdeposition treatment, i.e., it is 
intrinsic to these processes and strongly depends on the materials involved, the 
technique applied, and the preparation conditions, such as substrate temperature 
during deposition, growth chamber conditions, etc. (Freund and Suresh 2003; 
Maissel and Glang 1970). Thermal stress results from the difference in the coeffi- 
cients of thermal expansion (CTE) of thin films and substrate when a multilayer 
structure undergoes temperature fluctuation, which might be due to the variation of 
environmental temperature (Lima et al. 1999). The sign of thermal stress depends 
on the relative magnitudes of the respective CTEs. 

Some researchers categorized interfacial stress and intrinsic stress as one, i.e., 
intrinsic stress or growth stress, while treated thermal stress and others arising from 
external influences, such as electrostatic or magnetic forces as extrinsic stress. In 
this chapter, the focus is mainly on the thermally induced stress, while others are 
assumed to be ignorable or can be determined by other technologies. 

Thin film stresses as well as their effects on delamination, cracking, and perfor- 
mance of thin film-based structures were recognized as early as in the nineteenth 
century (Freund and Suresh 2003; Stoney 1909). A thin film stress reveals infor- 
mation about the behavior of the deposition process and affects the integrity of the 
film-substrate system as well as its performance. It may have various beneficial or 
detrimental effects. Whether thin film stress causes any problems in an application 
mainly depends on the stress level. In general, a lower level of compressive stress 
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Fig. 1 Cracking of thin films (a) flat elliptical through crack lying in the film plane loaded 
uniaxially in tension, (b) crack leading to coating fracture due to tensile stress, (c) crack leading to 
coating delamination due to compressive stress, (d) mechanism of metal whisker formation and 
cracking of overlying brittle film layer (Modified from Ohring 2002) 


may effectively strengthen a thin film, because it reduces the chances of a thin film 
being under a high tensile stress which causes fracture in some applications, such as 
tool-bit coatings (Elshabini-Riad and Fred 1998). Corrosion resistance is also 
improved by avoiding tensile stress. A low strain of either sign can improve the 
properties of epitaxial structures in electronic applications (Liu and Murarka 1992). 
High compressive stress in a film can be beneficial to prevent the initiation of 
surface cracks by suppressing crack formation and propagation (Gunnars and 
Wiklund 2002). On the other hand, strain/stress in films may modify the electronic 
transport characteristics of layered semiconductor systems through modification of 
the band structure of the material (Singh 1993). 

However, if the resultant stress is over a certain limit, which is the point of 
catastrophic or long-term failure, it naturally leads to many problems. Tensile stress 
failure is characterized by cracking, which appears as a mosaic pattern when 
viewed from the top (Ohring 2002). The cracked film may then peel away from 
the substrate at the crack edges, where the stress is concentrated. Compressive 
stress failure is characterized by de-adherence and buckling, which from the top 
appears sometimes as domes or bubbles and some other times as an undulating 
meander pattern looking like a mole tunnel (Ohring 2002). Other effects may 
include substrate warpage and/or stress-induced phase transformations (Huang 
et al. 2001). 

As a summary, typical failure modes due to cracking in thin films are illustrated 
in Fig. 1. 


3060 


Y.Y. Hu and W.M. Huang 


Stoney Equation 


As mentioned above, the reliability and stability of multilayer structure are very 
much dependent on the level of stress induced in the structure. As such, attention 
has always been focused on it, in design, and/or optimization of multilayer struc- 
tures. In fact, considerable efforts have been devoted to this for over 100 years. 

The earliest equation to deal with the stress in a thin film coated on a beam is 
developed by Stoney (Stoney 1909). Stoney observed that metallic films deposited 
by electrolysis on a thick substrate might be in a state of tension or compression 
even in the absence of any external loading. And they strained the substrate on 
which they were deposited by bending. Then he suggested a simple method of 
measuring the curvature (the amount of bending) together with the thickness of film 
to determine the stress in the film in 1909. With some minor modifications, the 
equation proposed by Stoney can be expressed as: 



where E s is the substrate’s Young’s modulus, t s and tf are the substrate and film 
thickness, and r is the radius of curvature of the film-substrate structure. As can be 
seen in Eq. 1, film properties were not included in the relationship between radius of 
curvature and film stress. The only information of the film is its thickness. All others 
about the substrate, such as the Young’s modulus and thickness, are assumed to be 
known. Stoney equation is based on a model of the film-substrate structure, which 
involves several assumptions. The main assumptions are as follows (Freund 

et al. 1999): 


1. Both the film and substrate thicknesses are small compared to the lateral 
dimensions. 

2. The film thickness is much less than the substrate thickness. 

3. The substrate material is homogeneous, isotropic, and linearly elastic. 

4. The film material is isotropic. 

5. Edge effects near the periphery of the substrate are inconsequential, and all 
physical quantities are invariant under change in position parallel to the 
interface. 

6. All stress components in the thickness direction vanish through the material. 

7. The strains and rotations are infinitesimally small. 

Stoney equation was derived based on the beam theory. It serves as a cornerstone 
of experimental work in which film stress is obtained from the measured curvature 
(Freund and Suresh 2003). And it has been used without any doubt for several decades 
since it was developed. One of the merits of this equation is that it can be conveniently 
used to determine the residual stress in the thin film based on the measurement of the 
curvature. When Stoney equation is used to evaluate residual stress in thin film, 
Stoney equation should be corrected by subtracting the initial curvature (l/r 0 ) from 
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the measured curvature (1/r) since some of the substrates may have initial curvature. 
So the average film stress can be calculated from the change of the radius of curvature 
of the bilayer structure using the following form of Stoney equation: 



However, apart from the limit that it is only applicable to bilayer structures only, 
it also results in considerable error in thicker films (e.g., Chen and Ou 2002; Klein 
2000a). This is a significant problem in many applications, for instance, in actua- 
tors, where thick films are required for larger forces. 


Recent Modifications of Stoney Equation 

Stoney equation has been modified continuously for years due to its limitations 
(e.g., Kim et al. 1999; Klein 2000b; Hsueh and Paranthaman 2008). In 1925, 
Timoshenko presented the classic solution for the problem of a bimaterial strip 
subjected to uniform heating with various end conditions (Timoshenko 1925). 
Some simplifications were made in this solution: i.e., the difference in CTE remains 
constant during temperature variation; and the width of the strip is small as 
compared with the length of the beam. The analytical approach utilized by Timo- 
shenko has been widely adopted at present. The next remarkable contribution is 
from Brenner and Senderoff (1949). They extended Stoney equation to the case of 
thicker films with various boundary conditions. As discussed later, this extended 
Stoney equation also introduces big error in the case of relatively thicker films. 
Some years later, an integral solution was proposed by Davidenkov in 1961 
(Townsend et al. 1987). In Davidenkov’ s study, the approximations, which are 
applied in Brenner and Senderoff (1949), are not required. 

It was pointed out in Hoffman (1966) that plate theory should be used to study the 
film-substrate structure. He extended the Stoney equation to the case of a thin film 
deposited on a plate by utilizing a biaxial elastic modulus. When the plate theory is 
used, the Young’s modulus of the substrate should be replaced by its biaxial elastic 
modulus, EJ(1 — v s ) , where v s is the Poisson’s ratio of substrate. Subsequently, Saul 
(1969) and Reinhart and Logan (1973) calculated the stresses in particular within 
some regions of one -dimensional composites with multiple film layers coated on a 
substrate. Olsen and Ettenberg (1977) obtained an expression for stresses within 
individual layers of a multilayer composite as a function of position within the 
layer. Their calculation is based on the equilibrium of forces and moments, and 
equal and isotropic elastic constants are also assumed. Alternatively, Roll (1976) 
presented an analysis technique, which departs from the strength of material approach 
and utilizes a field description of the material strain. Many approximate formulas for 
the radius of curvature and thermally induced stress obtained by the above researchers 
have used the assumption that all layers of the multilayer structure have the same 
Young’s modulus and deposition temperature. 
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Although analytical solutions have been proposed for thin films on a thick 
substrate based on the thin film approximation, for multiple layers of arbitrary 
thickness and different properties, e.g., the Young’s modulus and CTE, closed-form 
solutions are only obtained till the 1980s (Townsend et al. 1987; Feng and 
Liu 1983). In 1983, Feng and Liu (1983) derived generalized formulas for the 
radius of curvature and stresses in layers caused by thermally induced strain in 
semiconductor multilayer structure with different elastic moduli and growth 
temperatures. In Townsend et al. (1987), the description of the stress distribution 
and state of elastic strain in a hypothetical composite plate consisting of layers with 
different elastic and/or thermal properties was presented. One example of such 
configuration is a structure resulting from the deposition of films onto a substrate. 
When these layers have CTEs that differ from that of the substrate, changes in the 
temperature lead to differential rates of thermal strain within the plate. However, 
the layers are confined at each interface, and this leads to elastic strains in the films 
and substrate resulting in end forces or applied moments. A description of the state 
of elastic strain and distribution of stresses within the plate must satisfy the 
requirement that total resultant end forces and total resultant bending moments 
must be zero (Townsend et al. 1987). Suo et al. (1999) presented the basic 
mechanics relations for externally forced and thermally induced bending of the 
film-on-foil devices. 

As can be seen, in most of those methods, both the number of unknowns and the 
number of boundary conditions increase with the number of layers in the multilayer 
structures. More recently, Klein (2000a) have extended the Stoney equation to large 
deformations using a potential energy minimization analysis. 

In 1999, Schafer et al. (1999) reported their study of film stress as 



where K is the curvature of the structure. Compared with the classic Stoney 
equation, this equation is modified from Stoney equation with an additional cor- 
rection factor which is 1/(1 + tf/t s ). This correction factor takes the thickness ratio of 
film over substrate into accounts. Hence, it is a great improvement in Stoney-type 
equation, especially since it also does not require information on the film’s elastic 
properties (Klein 2000a). 

Another modified Stoney equation used by Rats et al. (1995), which they 
attributed to Brenner and Senderoff (1949), is 


&BS 






where Ef is the Young’s modulus of the film. In 2000, Klein evaluated the correc- 
tion of Stoney equation and the above two modified versions and concluded that 
(Klein 2000a): 
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1 . The correct equation for the stress in the film of bilayer plates can be expressed 

Q 

in terms of Stoney equation and a correction factor equal to (1 + yA )/(l + A), 
where y designates the ratio of the biaxial moduli of thin film and substrate and A 
is the ratio of the layer thicknesses. 

2. Stoney equation does not cause any serious errors for thickness ratio A < 0.1. 

3. The modification proposed by Atkinson, which does not require information of 
the film’s modulus, is a great improvement which is applicable to a thickness 
ratio up to A 0.4. 

4. Brenner- and Senderoff-type expressions can be very misleading and should be 
avoided. 

More recently, Zhang et al. (2005) systematically investigated the errors 
involved in using some typically approximate solutions, such as Stoney equation 
(Stoney 1909), Brenner-Senderoff modification (Brenner and Senderoff 1949), 
Saul approximation (Saul 1969), Roll approximation (Roll 1976), Vilms-Kerps 
approximation (Vilms and Kerps 1982), and Teixeira approximation (Teixeira 
2001). The comparison is interesting. However, the average film stress was taken 
as reference for comparison. This is only useful provided that the thickness of a film 
is much less than that of the substrate. In the case of relatively thick film, the 
maximum film stress should be more meaningful in engineering application instead 
of the average film stress because the maximum film stress can be much larger than 
the average film stress. 

Hsueh (2002) proposed a model to study the elastic deformation of multilayer 
structure due to residual stresses and external bending. In that model, the strain 
distribution in the multilayer structure, e, is decomposed into a uniform component 
and a bending component, i.e., 


e = c + - — — (5) 

r 

where c is the uniform strain component, dictates the location of the bending axis 
(which is defined as the line in the cross section of a structure where the bending 
strain component is zero), r is the radius of curvature of the structure, and z is the 
coordinate axis which is defined such that the interface between the substrate and 
layer 1 of the film is located at z = 0. Note that the bending axis is different from the 
conventional neutral bending axis, which, in the classic beam bending theory, is 
defined as the line in the cross section of a structure where the normal stress is zero. 
The strain/stress distributions are contingent upon solutions of three parameters, c, 
t h , and r, which can be determined from the following three boundary conditions: 

(a) The resultant force due to the uniform strain component is zero, i.e., 

n 

y, E, ( c - (XiAT)ti = 0 

!= 1 


(6) 
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(b) The resultant force due to the bending strain component is zero, i.e., 





i/ 





(c) The sum of the bending moment with respect to the bending axis is in equilib- 
rium with the applied moment, i.e., 



where E is the Young’s modulus, a is the CTE, AT is the temperature variation, M is 
the applied moment per unit width of the multilayer, and n is the number of layers of 
the multilayer structure, respectively. The subscript denotes the ith layer. 

One of the advantages of this model is that there are always only three unknowns 
in spite of the number of layers of multilayer structure. However, it only focuses on 
the elastic deformation range. Furthermore, only the average film stress has been 
investigated, but not the maximum film stress. As will be seen in section “Elastic 
and Elastic-Plastic Analysis of Multilayer Thin Films,” the difference between the 
average stress and maximum stress may be significant in some applications, such as 
actuators, where the thickness ratio is larger. 

In 2003, Nikishkov (2003) developed a closed-form estimation for the curvature 
in hinged multilayer structures with initial strains based on Hsueh’s three -parameter 
approach. In the case of small width, the hinged multilayer structure is in the plane 
stress state, while in the case of a wide strip with bending constraint in one 
direction, it is in the plane strain state. 


Thermomechanical Properties of Thin Films 

As mentioned above, many micromechanical components have been used in a large 
variety of technological applications. Typically, such micro-components are thin 
film-based plate/beam structures, fabricated by silicon bulk micromachining or 
surface micromachining (Ziebart 1999). Apart from the geometry, the mechanical 
behaviors of these structures are determined by the thermomechanical properties of 
these thin films. Furthermore, the stability and reliability of these structures are very 
much dependent on their thermomechanical properties. 

For example, elastic properties, such as the Young’s modulus and Poisson’s ratio, 
and the residual film stresses determine the static and dynamic mechanical behavior 
of the stmctures (Petern 1978). As another example, the elastic property is one of the 
determinants of the deflection range of a microgripper. In addition, there are several 
problems resulting from the thermal expansion of thin film and substrate. The 
mismatch of thermal expansion strain between thin film and substrate may lead to 
residual stresses in thin film and substrate, and then over-deformation and permanent 
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damage of the structure may occur. On the other hand, the thermal expansion effect 
can be exploited to drive microgrippers and microactuators. To sum up, thermome- 
chanical behavior of micro stmctures can be significantly influenced by the properties 
of materials (Huang et al. 2001; Fang and Wickert 1996). 

Thus, the knowledge of these thermomechanical properties, namely, the Young’s 
modulus, Poisson’s ratio, CTE, yield start stress, and strain hardening modulus, is 
essential for the design, development, and optimization of these structures. 

Bearing in mind, the properties of a thin film may be significantly different from 
those of its bulk material. Some investigations addressed below are examples. 

Lima et al. (1999) observed that the CTE of tetrahedral amorphous thin films 
depends on the network strain in their work. They found that the CTE of tensile 
films is smaller than that of corresponding crystalline semiconductors, but it is 
higher for compressive films, and that the elastic biaxial modulus of the amorphous 
and metallic films is smaller than that of crystalline counterparts. 

Fang et al. (1999) found that CTE of aluminum film differs by up to 66 % when 
the film thickness increases from 0.3 to 1.7 pm and CTE of Ti film differs by up to 
60 % when the film thickness increases from 0. 1 to 0.3 pm. That is, CTE of thin film 
materials is very sensitive to the variation of film thickness. Fang et al. (1999) also 
found that CTE of aluminum film is close to that of bulk aluminum when the film 
thickness is around 1 pm. However, the difference in CTE between the bulk and 
thin film of aluminum increases with the film thickness when it is over 1 pm. 

Tada et al. (2000) suggested that CTE of poly- Si thin films may be significantly 
higher than that of bulks based on their experimental results. 

Zhao et al. (2000a) revealed that the product of the biaxial modulus and CTE of 
film decreases with film thickness and assumed that CTE increases in ultrathin films. 
The modulus was found to be dramatically smaller than that of the bulk material. 

Kalkman et al. (2001) observed that the values of the moduli of poly crystalline 
aluminum and Au films are considerably smaller than the corresponding values of bulk 
materials, but the modulus of polycrystalline W film is about the same as the bulk one. 

Badawi et al. (2002) used a technique coupling in situ tensile testing with X-ray 
diffraction to measure the Young’s modulus and Poisson’s ratio. In their study of 
tungsten thin films, the obtained Young’s modulus was close to that of bulk 
material, whereas the Poisson’s ratio was significantly larger. 

In addition, the mechanical properties of thin films, both the elastic and plastic 
properties, can also depend upon the film thickness, the fabrication processes, and 
processing history (Kalkman et al. 2001; Fang and Wickert 1995; Fang et al. 1999; 
Jain et al. 2001; Taylor et al. 2003). 

Bulk materials usually have Poisson’s ratio in a range 0 < v < 0.5, although a 
negative Poisson’s ratio is not forbidden by thermodynamics (—1 < v < 0.5). 
However, Renault et al. (1998) have shown that the negative Poisson’s ratio 
assumption proposed in Fullerton et al. (1993) and Fartash et al. (1993) was the 
result of an incorrect determination of strains from X-ray diffraction measurements. 

It is therefore not a reliable approach to extrapolate the thermomechanical 
properties of bulk materials to thin films. It is more accurate to in situ characterize 
the properties of thin films, i.e., with substrates untouched. 
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Determination of Thermomechanical Properties of Thin Films 


Thermomechanical properties of thin films have received numerous attentions 
because of their strong influence on the behavior and performance of thin films 
and implications on the reliable operation of thin film based micro-components 
(Chang et al. 2007; Varguez et al. 2008). At present, in MEMS design, elastic and 
elastic-plastic behaviors (although a few examples of anelastic and viscoelastic 
responses exist) are mostly concerned and the mechanical properties of interest are, 
for instance, the Young’s modulus, yield stress, and residual stress (Srikar and 
Spearing 2003). There are several available techniques to measure the thermome- 
chanical properties of bulk materials (e.g., Kinzly 1967; Jacobs et al. 1970; 
Courtney 2000; Aviles et al. 2008). For example, one of the most widely used 
methods to measure CTE of a sample is optical method. This method involves the 
measurement of the change in dimension, due to a variation in temperature, with 
respect to the original size. 

Unfortunately, most of the conventional techniques for the characterization of 
bulk materials have tremendous difficulties in dealing with very thin films since thin 
films are easily deformed and broken. Experimental difficulties in dealing with thin 
films result in that most of conventional techniques become unreliable (Tien 
et al. 2001). 

On-wafer characterization of thermomechanical properties of thin films is an 
active field in both applied physics research and engineering applications (Zhao 
2000). In fact, several methods dedicated to thermomechanical properties of thin 
films have been developed for decades (Nix 1989). Each method has its own 
advantages and disadvantages. 


Bulge-Testing Method 

One of the first methods to determine the Young’s modulus and residual stress of 
thin films with submicron or micron thickness is bulge-testing technique, also 
referred to as membrane load-deflection method (e.g., Kalkman et al. 2001; 
Zheng et al. 2000), which is suitable for free-standing thin films. 

In this method, the thin film is sealed to the end of a hollow cylindrical tube and 
pressurized with gas. The thin film is loaded by a differential pressure, while the 
maximum height of the resulting deflection in the film is detected optically by a 
microscope or interferometer and then converted into strain. The relationship 
between the dome height ( h ) and the applied differential pressure (P d ) can be 
determined by Hoffman (1966), 





Here, the him thickness and specimen radius are tf and r sp , respectively; a is the 
residual stress in the him under a zero pressure differential; Ef and Vf are the 
Young’s modulus and Poisson’s ratio of the thin him, respectively. 

Using above model, the residual stress and a combination of the Young’s modulus 
and Poisson’s ratio can be extracted from the linear and nonlinear membrane 
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response, respectively. And Poisson’s ratio can be extracted by further testing mem- 
branes with square and rectangular geometries. 

Since the nonlinear response varies with the fourth power of the lateral mem- 
brane size, precise sample fabrication and accurate size determination are crucial. 
The bulge test is only sensitive to mechanical properties in the plane of the film. 
This makes the interpretation of the experimental data more straightforward than in 
other methods, such as nanoindentation. 

A further advantage of this method is that precise bulge test samples can be 
fabricated by selective removal of the silicon substrate underneath the film using 
silicon micromachining techniques. This avoids the delicate handling of free- 
standing thin films (Ziebart 1999). Zheng et al. (2000) even used the experimentally 
derived biaxial moduli, together with the thermal stress data of the film samples, to 
estimate CTE of dense and porous poly-ary lethers films. 

However, in bulge test technique, the extraction of elastic modulus involves 
many simplifications. In order to get accurate result, numerical methods, such as 
finite element analysis, are needed. 

Gauge Technique 

Another method to determine the Young’s modulus of thin films is the strain gauge 
technique (Chiu 1990). The formulas for calculating the in-plane elastic modulus 
are derived using beam theory. To apply these formulas, a strain gauge is used to 
measure the surface strain. However, since it is in a contact manner, the strain gauge 
may have impact on the film and may introduce errors to the measurement of elastic 
modulus of thin film. Furthermore, it may be very difficult to bond a gauge onto the 
film, which is crucial to this technique. 

Chae et al. (1999) used microgauge method, which has some merits as compared 
with commonly used optical methods, to measure the CTE of poly-Si. In the 
process of measuring the CTE of thin films, the elastic modulus and CTE of the 
substrate are no longer needed in the microgauge method. CTE of a thin film was 
expressed as a function of the average temperature of the microgauge and the 
measured displacement at the vernier gauge. However, this technique is limited 
to conductive thin film only. 

Nanoindentation Method 

Nanoindentation is another widely used technique to determine the material behav- 
ior of thin films on a substrate, such as the Young’s modulus of thin films (e.g., in 
Oliver and Pharr 1992; Caceres et al. 1999; Kucheyev et al. 2000; Bradby 
et al. 2002; Malzbender and Steinbrech 2004) and even plastic properties of thin 
films (Nowak et al. 1999). 

Nanoindentation is a characterization technique similar to the hardness test com- 
monly used for bulk materials. In this method, a diamond indenter with well-defined 
geometry is pressed into the film (normally, there are two kinds of shapes of indenters, 
namely, spherical and sharp). The indentation load-displacement data obtained at 
each depth are analyzed to determine the hardness, //, and elastic modulus, E. In this 
method, the hardness and modulus are determined from Tsui and Pharr (1999): 


3068 


Y.Y. Hu and W.M. Huang 


H 


P 


max 


A 


E 


1 


v 


2 VA 

P 


1 


-l 


V: 


ind 


Jn S 


E, 


ind 


( 10 ) 

( 11 ) 


where P max is the peak indentation load, A is the indentation contact area, p is a 
constant which depends on the indenter geometry, S is an experimentally measured 
contact stiffness, v is Poisson’s ratio of the specimen, and v ind and E ind are Poisson’s 
ratio and the elastic modulus of the indenter. 

However, plastic and elastic deformations occur simultaneously upon loading. 
Due to this reason, the elastic properties are extracted from the elastic unloading 
curve of the indentation process. In comparison with other methods, this method 
has proven to be a simple and easy technique, and sample preparation is relatively 
simple. The thin film has neither to be removed from the substrate nor to be 
structured. Another merit is that a sample area of a few square micrometers is 
sufficient. 

However, as well known, the stress state underneath the indenter tip is highly 
nonuniform. Thus, the elastic response of the film during unloading is a compli- 
cated function of the possibly anisotropic material properties of the tested sample 

(Ziebart 1999). 

Another intrinsic problem in nanoindentation is that a complicated hydrostatic 
stress is applied instead of a directional stress. Moreover, it fails to provide the 
fundamental data needed for the characterization and modeling of thin film struc- 
tures (Florando and Nix 2005). Also, the substrate influence cannot be negligible 
when the penetration depth of the indenter is greater than 10 % of the film thickness 
(Tsui and Pharr 1999). This makes the extraction of reliable elastic modulus of thin 
films difficult. Like bulge test, there are many simplifications in the process of 
extraction of elastic modulus. In order to get accurate result, a numerical method, 
e.g., finite element analysis, is required. Another possibly fatal drawback of 
nanoindentation method is that what it can determine is the out-of-plane properties, 
while the in-plane properties are normally more of our interest from the application 
point of view. 

In summary, the application of nanoindentation to thin films is complicated and 
affected by several factors, such as indentation size effects, substrate effects, sink-in 
and pileup of material around the indenter, and surface conditions (Su 2004). 
As such, its application is rather limited. 


Brillouin Scattering Technique 

The Brillouin light scattering technique has been exploited in order to study the 
elastic properties of dielectric films used in microelectronics (Carlotti et al. 2002). 
By means of the finite element method, the resultant elastic constants were 
used to reproduce the measured stress versus temperature curve, from which 
estimations of both CTE and the viscosity of films were obtained, as shown in 
Carlotti et al. (2002). 
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However, in the Brillouin light scattering technique, the elastic constants are 
obtained by fitting the experimental data to the dispersion relation deduced from the 
theoretical model. The number of parameters for fitting increases as the film 
thickness decreases. This raises the uncertainty of the determined elastic constants 
for the film. Furthermore, the elastic property obtained from Brillouin scattering is 
actually the high-frequency dynamic response but not the static elastic property. 

X-Ray Diffraction Technique 

X-ray diffraction measures residual stress and the thermomechanical properties of 
thin films by detecting the slight changes in the inter-planar spacing of crystal 
planes in different directions with respect to the specimen surface. These changes 
are revealed as shifts in angular positions of respective diffraction peaks. The 
variation of the crystal plane spacing has a certain relationship with the angle 
made by the normal of the specimen surface and that of the crystal plane. X-ray 
stress measurement devices measure this variation and subsequently determine the 
stress. 

An alternative analytical method has been proposed in Ohring (2002). As an 
example, consider a polycrystalline film with an isotropically distributed biaxial 
tensile stress in the x — y plane ( a z = 0). The film contracts in the z direction by 

£ z — ( V /Ef) i^x T &y) — Vf (^8 X T Sy) (12) 

Here, Vfis Poisson’s ratio, Ef is the Young’s modulus, o is stress, and e is strain 
(x and y denote directions). By measuring the lattice spacing (dj) in a stressed film 
relative to the unstressed bulk lattice (d/ 0 ) in the z direction using X-rays, Bragg’s 
law yields s z [=(d/ — d/ 0 )/d/oL Since o x = a y = o y, one has 



Ef{di — dio ) 




Zhou et al. (1999) proposed two methods to measure CTE of anodic films, which 
are based on the X-ray diffraction. One method is to detect the change of the lattice 
spacing (d/) of crystalline phase as the temperature increases. The measured d/ is 
plotted as a function of temperature. CTE, a, can be evaluated as 


A d L \_ 
AT dio 



where Ad// AT is the slope of the dj — T curve and is the original lattice spacing 
of the specified plane. The other method is to determine the thermal stress in the 
film, which allows CTE to be determined numerically (Zhou et al. 1999). 

Cornelia et al. (1997) used nonsymmetrical X-ray diffraction technique to 
determine Poisson’s ratio of crystalline thin films without any knowledge of the 
elastic properties of the thin film material. 
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Badawi et al. improved this method and proposed a technique coupling X-ray 
diffraction measurements with in situ tensile test to determine the Young’s modulus 
and Poisson’s ratio of a poly crystalline tungsten thin film (Badawi et al. 2002; 
Renault et al. 1998). 

However, there are several drawbacks with X-ray diffraction technique in the 
determination of thermomechanical properties of thin films. First, it needs an 
intense X-ray source to obtain enough signals. Second, only the out-of-plane CTE 
can be worked out by X-ray method. Third, X-ray diffraction is only applicable to 
crystalline films. Unfortunately, many low k dielectric films, which are very popular 
at present, are noncrystalline (Zhao et al. 2000b). Furthermore, it is time consuming 
and only a slow strain rate can be applied (Hommel and Kraft 2001). 

Microtensile Test 

Tensile test is a standard approach to evaluate elastic/plastic response of bulk 
materials. Microtensile test is a miniaturized version. Unlike its bulk counterpart, 
microtensile test on thin films is far from routine. 

Microtensile test on free-standing thin films produces straightforward data to 
determine their stress versus strain relationships without any additional assumptions. 
For thin films coated atop a substrate, the extreme delicacies required in handling and 
detaching thin films from the substrate, gripping and aligning them, applying load, and 
measuring the mechanical response are experimental challenges. Loading is normally 
achieved by an electromagnetic force transducer, and strain is typically measured by 
an optical method in a contactless fashion. Some novel microtensile testing devices 
have been incorporated within electron microscopes, enabling direct observation of 
defects and recording of diffraction patterns during straining (Ohring 2002). 

The main advantage of this method is that the stress state of a sample is nearly 
uniaxial, making the interpretation of the experimental data straightforward. As 
compared with curvature methods, it can not only obtain but also control stress/ 
strain in the film without thermal loading/unloading of the film-substrate system 
(Haque and Saif 2003). 

However, sample fabrication, handling, and alignment in testing preparation are 
crucial. If the sample is not properly aligned, the applied force is not uniformly 
distributed within the film and wrinkling may occur. These problems are partially 
solved in some new sample design and preparation techniques based on silicon 
micromachining (Ziebart 1999). Measurements on a free-standing film can separate 
intrinsic thin film properties from substrate contributions. It should be borne in 
mind that a free-standing film may have different properties from that coated on a 
substrate due to the interface effects. 

Microstructures Technique 

Various techniques based on micro structures to determine properties of thin films as 
well as the residual stress have been developed (e.g., in Fang and Wickert 1995; 
Fang et al. 1999; Fang and Wickert 1996; Godin et al. 2001; Xu et al. 2001). 

Fang et al. exploited single layer and bilayer microcantilevers to determine CTE 
of thin films. The single layer and bilayer cantilevers would exhibit an out-of-plane 
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deflection after subjected to a temperature change. Thus, thermal expansion of thin 
film materials can be determined using optical interferometric technique on ther- 
mally loaded microcantilevers (Fang et al. 1999; Fang and Lo 2000). 

The main inconvenience of this method is that the relationship between 
angular deflection of a microcantilever and the CTE is required to be determined 
by the finite element analysis. In addition, since a microcantilever beam has a 
free end, residual stress in the film may be released through the free end 
before testing (Xu et al. 2001). In general, it is impossible to determine residual 
stresses and the Young’s modulus simultaneously from such a microcantilever 
beam test. 

Beam buckling technique is based on the principle of Euler buckling, which 
depends on compressive stress, and the beam size and shape (Guckel et al. 1985). 
Fabricating an array of beams with different sizes, the critical beam length at which 
a beams starts to buckle can be determined. Subsequently, the film strain is 
obtained. This method is very sensitive to the film stress which is compressive. 
However, it has the disadvantage that a large number of different sized beams are 
required in order to obtain high accuracy (Ziebart 1999). 

Su et al. (2000) used bilayer microbridge method to characterize films. Zhang 
et al. (2000) and Xu et al. (2001) extended the microbridge method to trilayer 
samples, which consist of Si 0 2 /Si 3 N 4 /Si 02 and S^N^SiC^/S^N^ to estimate 
residual stress and the Young’s modulus in each layer. 

Beam Bending Method 

Various beam bending methods (four-point bending, three -point bending, and 
bending a cantilever beam) have also been used to determine the elastic modulus 
(Kim et al. 2007; Tsai et al. 2009). For example, in a four-point bending test (refer 
to Fig. 2), the elastic modulus of a thin film can be determined from the slope of a 
load (P) -deflection (5) curve by (Malzbender and Steinbrech 2004) 



a(3Lx — 3x 2 — a 2 ) A P 

12 7 ~Kd 



where I is the moment of inertia and x indicates the position, a < x < LI 2 at which 
the deflection S is measured. Most likely, <5 is measured either at the center, x = LI 2, 
or at the position where load is applied, x = a. 

Four-point bending is also applicable to bilayer or multilayer structure (refer to 
Fig. 3). The formula to determine elastic modulus is more complicated (Malzbender 
and Steinbrech 2004). 

Florando and Nix (2005) proposed a microbeam bending method to study the 
stress versus strain relationship of metallic thin films deposited atop a silicon 
substrate, in which triangular silicon microbeams were used instead of the common 
rectangular beams. The advantage of triangular beam is that the entire film on 
the top surface of the beam is subjected to a uniform plane strain when the beam 
is deformed. 
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Fig. 2 Four-point bending of 
a monolithic material 



Fig. 3 Four-point bending of 
a bilayer material 



Curvature Measurement Method 

Curvature measurement method is one of the most widely adopted techniques to 
determine the residual stress. In the case of thin films, based on Stoney equation, the 
change in substrate curvature induced either during film deposition or temperature 
variation provides insight into the evolution of residual stress. One merit of 
curvature measurement method is that it does not require the material properties 
of the film. Other advantages of the curvature measurement method are: 

1. It is fast, cheap, and uncomplicated measurement. 

2. It has a high accuracy. 

3. It is applicable to amorphous materials. 

Curvature measurement method can also be utilized to extract the Young’s 
modulus and CTE of films (Lima et al. 1999; Gunnars and Wiklund 2002; Zhao 
et al. 2000b). Apart from the elastic properties of films, curvature measurement 
method also provides useful information about the plastic response of films. The 
main disadvantages of curvature measurement method are: 

1 . Large temperature change is required in the experiment. 

2. Only averaged properties of a film can be obtained (Llorando and Nix 2005). 

3. It is not applicable to free-standing films. 

There are a lot of mature techniques available for curvature measurement. 
Lreund and Suresh (2003) categorized them broadly into four groups, namely, 
mechanical method, capacitance method, X-ray diffraction method, and optical 
method. Among them, mechanical method is usually a contact approach, while the 
rest three are contactless methods. Except X-ray diffraction method, which is 
only applicable to crystalline films, all others measure the out-of-plane deflection 
of a curved bilayer structure. The most commonly used method is optical method, 
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which offers the convenience, accuracy, and flexibility in measuring curvature 
through remote sensing. There are several different types of optical techniques, 
such as scanning laser method, multi-beam optical stress sensor, grid reflection 
method, and coherent gradient sensor method. 

Other Techniques 

Yuan et al. (1998) measured the coefficient of linear thermal expansion of ZnSe thin 
films with a temperature-regulated spectroscopic ellipsometer. They found that the 
coefficient of linear thermal expansion varies linearly within a certain temperature 
range. 

Carmen et al. (2002) used narrow band photoacoustics to characterize mechan- 
ical properties and residual stress of free-standing nanometer- sized thin films. 
However, the results have to be analyzed numerically since in general it is not 
possible to obtain simple analytical expressions for the phase velocities of the 
various modes in terms of the material properties and the waveguide dimensions. 

A technique based on resonant frequency analysis of beam or plate structures has 
been developed (e.g., Liu and Murarka 1992). The structures are excited to vibrate 
and the variation of excitation frequency enables the resonant frequencies to be 
measured. For the evaluation of mechanical properties, the mass density of the 
materials has to be predetermined separately. The Young’s modulus also depends 
strongly on sample geometry. Therefore a precise geometrical characterization of 
the tested sample is crucial. 

Tien et al. (2001) proposed a method associated with an image processing 
technique to determine the mechanical properties of thin films. In that method, 
the change of deflection of a substrate coated with a thin film was obtained by 
digital phase-shifting interferometry. The main drawback of that method is that 
image processing is necessary to determine the properties. 

Electrostatic beam bending is also a popular technique to measure the Young’s 
modulus and residual stress of free-standing and thin structures (Osterberg and 
Senturia 1997). In this technique, it is crucial to measure the gap size and beam 
thickness accurately. Finite element simulation is required to work out the Young’s 
modulus (Srikar and Spearing 2003). 

The recent development in microelectromechanical systems has brought for- 
ward many useful microdevices. The performance and reliability of these 
microdevices are largely dependent on the thermomechanical properties of mate- 
rials involved, especially those of thin films. The decrease in size of these devices 
with features at micron/submicron scale requires new testing methods to determine 
their mechanical properties. 

As a result, many new microscale mechanical testing techniques have been 
developed. Mentioned above are a couple of typical examples. Most of them 
have advantages as well as disadvantages. The selection of a testing technique 
depends on the thermomechanical properties of our concern, such as the Young’s 
modulus, yield stress, CTE, strain hardening modulus, and Poisson’s ratio, and also 
depends on the film structure, loading/supporting conditions, etc. 
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Elastic and Elastic-Plastic Analysis of Multilayer Thin Films 
Multilayer Thin Film on Substrate (Elastic Analysis) 


Assume that a sample is long enough as compared with its total thickness and the 
strains are isotropic and uniform. Consider the case of an n- layer thin film atop a 
substrate as shown in Fig. 4. The thickness, Young’s modulus, and coefficient of the 
thermal expansion of layer i are denoted as t t , E h and a z , and those of substrate are t s . 


E s , and a s , respectively. Note that in the case of biaxial stress, biaxial modulus, E 


l 


(= £//(! 


Vf), where v,- is Poisson’s ratio), should be used instead of E,. For 


convenience, the coordinate system is defined as that shown in Fig. 4a, where the 
bottom of the substrate is located at y = 0, and the interface between the substrate 
and layer 1 of the film is located at y = t s . 

Upon temperature fluctuation, the multilayer structure tends to bend due to the 
difference in CTE. Since the strain, e, is continuous in the thickness direction and 
may be assumed as linearly dependent on y, it can be expressed in terms of the strain 
at the top, s t , and the strain at the bottom, s h (refer to Fig. 4b), i.e., 


&b | r\£L\ 

z = — n y + £b (16) 

U + t s 

i = 1 

k 

For convenience in the expressions followed, ti is defined as h k (k = 1,2... n), 

i= 1 

i.e., the distance from the upper face of the kt h film to the upper face of substrate. 
Additionally define h i} = 0. For small deformation cases, the radius of the curvature, 
r, can be calculated by ( h n + t s )/(s t — s h ). The total strain, s, includes two 
parts, namely, thermal strain s T (=aAT) and elastic strain s e , i.e., s = s T + s e . 
Excluding the thermal strains, the induced normal stresses in the substrate and the 
films, o s and a h can be expressed as 


0 S = E s (e - a s AT) (17) 

a i =E i (s-a i AT) (18) 

where subscripts ‘V’ and stand for the substrate and the zth film, respectively, 
and AT is the variation of temperature. The equilibrium conditions of the multilayer 
structure are that the total resultant in-plane force must be zero and the total 
moment must be null, i.e., 



85 Thermal Stress Analysis and Characterization of Themo-Mechanical. . . 


3075 



Eyi j a n 


E 2 , t2, OC 2 
Ei, t\, a\ 


F t 

^ S ? L Sr> 







Fig. 4 Thin film and substrate, (a) Coordinate system, (b) illustration of strain distribution 


where y e is the location of moment equilibrium axis, which depends only on 
the properties of the structure and is independent of y. Rearranging Eq. 19b 
results in 
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Referring to Eq. 19a, Eq. 19c can be reduced to 
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With Eqs. 16, 17, 18, 19a, and 19d, the explicit expressions for s t and 8 h can be 
obtained. Given e t and s h , 1/r can be conveniently obtained as 
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( 20 ) 


Equation 20 is a closed-form solution, which is identical to that reported in the 
literature solved by other approaches (e.g., Townsend et al. 1987; Feng and Liu 
1983; Hsueh 2002; Nikishkov 2003). 

In the case that the total thickness of thin films is much less than that of the 

n n 

substrate, i.e., £,• « t s and ^^Ejtj « E s t s , s t , s h , and 1/r can be estimated as 

i= 1 i= 1 
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then, the distribution of strain e may be expressed as 
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Equation 21c is identical to, for example, Eq. 9c in Hsueh (2002) and Eq. 31b in 
Townsend et al. (1987). The location of the neutral axis is where s e is zero. 
According to Eq. 22, the neutral axis in the substrate is located at y n s = tj 3, 
where e = a s AT and s e = 0. Similarly, e = a f AT at y = y ", where 



The condition that the neutral axis is in the ith film is t s + h t _\ < yl < t s + h f . 
Equation 23a can also be expressed in terms of radius of curvature and thermal 
mismatch by substituting Eq. 21c into Eq. 23a, i.e., 



r(a i - a s )AT + t s /3 


(23b) 


Thus, the general expression for the location of neutral axis in a layer if 
existing is 


y n = r(a - a,)AT + t s /3 ( a = a t , a s ) (24) 

This expression reveals that in the case of very thin films, the location of neutral 
axis in a layer depends on the radius of curvature and the thermal mismatch 
between the layer and the substrate. 

Upon defining y = 6ATEjtj(aj — a s )/E s t 2 s , based on Eq. 21c, 

l 
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This expression shows that the resultant curvature, 1/r, can be expressed in terms 
of the components contributed by the thermal mismatch between the substrate and 
each thin film layer. Equation 25 has also been reported in Townsend et al. (1987) 
and Hsueh (2002). 


Bilayer Structures (Elastic Analysis) 


Now consider bilayer structures, which consist of only one layer of thin film atop a 
substrate. They are the special case of multilayer thin films. For convenience, 
subscript “/’ is used standing for the thin film. Therefore, the expressions for s t , 
8b, and r can be reduced to 




Equation 26c is identical to Eq. 6 in Freund et al. (1999), which was obtained 
using a potential energy method. With Eqs. 16, 17, 18, 26a, and 26b, the stress 
distribution in the thickness direction can be obtained as 



In some applications, the film is so thin that the average stress in the film, which 
can be obtained by 
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is of the interest. 

Substituting Eq. 27b into Eq. 28 results in that 

E r‘f) ( 

Eflj + E^t^ + 2 EfE s tft s (2tj + 3tft s + 2 
By substituting Eq. 26c into Eq. 29, of ean also can be expressed as 



A TE s E f t s [E s t] + 
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Equations 29 and 30 are identical to those in Hsueh (2002). 

In many other applications, such as microactuators in MEMS, where a large 
deflection is the major concern, the thickness of the film may be of the same order as 
that of the substrate. Hence, it is important to find the maximum stress in the film 
and the location of the maximum stress as well. According to Eq. 27, it is obvious 
that both the maximum stress in the film and the maximum stress in the substrate 
always appear in the interface between them, i.e., 
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max 
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Of y=( 
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Apart from that, according to Eq. 27, the stress at the top surface of film, <7f ° p , can 
be expressed by 




y=tf+t s 




Comparing Eq. 32 with Eq. 31, the condition for the film stress to be of the same 

Q O O 

sign is E s t s — 2Efy — 3Efyt s > 0, which can be rewritten in terms of the Young’s 
modulus ratio and thickness ratio as 
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Figure 5 plots the contour of 1-2^- against the Young’s 

modulus ratio and thickness ratio. If Eq. 33 is satisfied, of p is the minimum film 
stress. The focus will only be on the case that Eq. 33 is satisfied, since for ordinary 
materials and under most of the situations, according to Fig. 5, Eq. 33 should be 
always satisfied. 

For easy comparison, the result of the Stoney equation (Stoney stress) is labeled 
as < 7 0 , i.e., 


oo = -E s t 2 j6t f r 

As such, o} iecm and o f ax in Eqs. 30 and 31a can be rewritten as 
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(35) 

(36) 


Assuming that E s =162 GPa (for silicon) and Ef= 83 GPa (for silver), a™ ean and 
of ax are plotted against the thickness ratio of the film over the substrate (tf/t s ) in 
Fig. 6 using Eqs. 35 and 36. As can be seen, the Stoney equation is a good 
estimation for a thickness ratio up to about 0.1. 

In order to compare the difference between of ean and cf ax , rj = (of ax - of*” 1 )/ 
of ax may be defined as a parameter. Hence, 



3 Eftf/E s t 3 s + 3 Eftj /E s t 2 s 


l+4E f t 3 f /E s E s +3Eft 2 f /E s 



( 37 ) 
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Fig. 6 Normalized stress against thickness ratio (tf/t s ) 


Fig. 7 Contour of rj (%) 
against thickness ratio (t/t s ) 
and modulus ratio ( EJE S ) 



Young's modulus ratio 


r\ is plotted against the thickness ratio and the ratio of Young’s modulus in Fig. 7. 
It reveals that the difference between of ean and <7™ ax can be significant for relatively 
thick films and in particular for hard films. Depending on the precision requirement, 
one may refer to Fig. 7 to determine whether the average stress is good enough in 
the stress analysis. 
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Neutral Axis in Bilayer Structures (Elastic Analysis) 


There are a few different definitions of neutral axis in the literature. For instance, 
Klein (2000c) defined the position where the bending force is zero as neutral axis 
and the position where the normal stress is zero as null axis. In Hsueh (2002), the 
position of zero stress is defined as neutral axis. The latter definition is taken 
hereinafter. 

For convenience, a new coordinate system as illustrated in Fig. 8 is adopted here. 
The interface between the film and substrate is therefore located at y = 0. 

Following the same method as mentioned above, the stress distributions in the 
film and substrate in the thickness direction can be expressed as 




By setting o s = 0 and Of = 0, the locations of neutral axes can be obtained in the 
film and the substrate (y = t^ n and y = t SJ1 ) from Eq. 38 as follows: 



E s t^ + Eftj (4 tf + 3 t s ) 

^Eftf (tf + 


t 


s,) n 


Eftj + E s tj (4 t s + 3 tf) 




(39b) 


The condition for the existence of neutral axis in the film is 0 < tf n < tf -, while 
that in the substrate is —t s < t s n < 0. Equation 39 also reveals that the locations of 
the neutral axes in the film and the substrate are dependent upon the elastic modulus 
and thickness of the film and substrate and independent on CTE and temperature 
variation. 

To show the location of neutral axis, consider a case that an aluminum film is 
deposited atop a silicon substrate. Parameters used in this case study are listed in 
Table 1. 

Figure 9 shows the stress distributions at two temperatures, namely, AT = 10 °C 
and AT = 15 °C. It is clear that the location of the neutral axes is independent on the 
exact temperature variation. 
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Fig. 8 Bilayer structure, (a) 
Coordinate system, (b) 
illustration of strain 
distribution 



Film 


Substrate 







Table 1 Parameters used in neutral axis case study 


Parameter 

Young’s modulus 

CTE 

Thickness 

Aluminum 

70 GPa 

23 x 

10 6 /°c 

5 gm 

Silicon 

1 62 GPa 

2.6 x 

10“ 6 /°c 

15 gm 



Compressive < 


Stress 

(MPa) 


► 


Tensile 


Fig. 9 Locations of neutral axes in film and substrate 


Bilayer Structures (Elastic-Plastic Analysis) 

In the case that the temperature fluctuation is significant, the film may go beyond its 
elastic deformation range partially or even fully. However, in most cases, it might 
be reasonable to assume that the plastic strain is well limited within a regime 
slightly over the yield start point. 
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Fig. 10 Bilinear strain 
versus stress model 



if) 

if) 

0 








C 



Fig. 1 1 Film stress development, (a) Pure elastic, (b) partially in plastic, (c) pure plastic 


For convenience in the discussion hereinafter, it is assumed that the material 
constants are temperature independent and the substrate is always within the elastic 
deformation range. The stress versus strain relationship of most engineering mate- 
rials in a range slightly over the elastic limit may be estimated using a bilinear 
model as illustrated in Fig. 10, where E , Hf , and o Y are the Young’s modulus, strain 
hardening modulus, and yield start stress, respectively. 

Let us start from the stress-free state. With the increase of temperature fluctuation, 
the film stress and the maximum film stress will increase accordingly. At a critical 
temperature, namely, AT^, plastic deformation will appear in the film-substrate 
interface first. A T x can be solved by substituting o f 13 * = o Y into Eq. 31a, i.e., 



With further increase in temperature variation, more part of the film reaches the 
plastic state, until the whole film is in plastic range at another critical temperature, 
denoted by A T 2 (which will be solved later), as illustrated in Fig. 11. 
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Consider the case of pure plastic deformation, i.e., temperature variation >A T 2 . 
The stress versus strain relationship of the film can be expressed as 



Then, the conditions for equilibrium (both the total resultant force and the total 
moment must be zero) are 


■* t s +tf 

a f dy = 0 (42a) 

ts 

a f ydy = 0 (42b) 

ts 

With Eqs. 16, 17, 41, and 42, one has 


o s ydy + 


/ 


o s dy + 


0 


mean 

°f 



a 


max 


/ 



a 


a Y 


+ HfOy) 





Rewriting them in terms of the radius of curvature and <j 0 yields 


a 


mean 


f 


E s t] + Hftf 1 + Hftf /E s t 

= gq — 


6tf(t s + t f )r 


1 + t/j t 


(44a) 


G 


max 


/ 


E s t 3 s + 4 Hft} + 3Hftjt s 1 + 4 Hft 3 f /E s t 3 s + 3H f tj/E s t 

= &() 


6tf(t f + t s )r 


1 + tf / 1 


(44b) 


G 


mm 


E r 

^S l s 


f 


2 H f t) - 3 H f tjt s 1 - 2 H f t) / E s t ] - 3H f tj /E s t 

= 


6tf(t f + t s )r 


1 + tf/t, 


(44c) 
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Fig. 12 Contour of difference ( ° u JL . in %) 
whole film is in plastic deformation 


against Hf/E s and thickness ratio (t/t s ) when the 


where <7 0 is the result of Stoney equation, in which r may be expressed as 



Note that here, subscript “min” stands for minimum. 

Comparing Eqs. 35 and 36 (pure elastic solution) with Eqs. 44a and 44b (pure 
plastic solution), the only difference is that Ef is replaced by Hf. 

Figure 12 shows the difference between <r™ ax and a 0 when the whole film is in 
plastic deformation, while Fig. 13 is for the difference between of ean and <r 0 . 

As mentioned above, the only difference between the pure elastic solution and 
pure plastic solution is that Ef is replaced by Hf if they are presented in terms of 
r (which is different in the elastic or pure plastic analysis). Thus, Figs. 12 and 13 can 
also be used for the pure elastic case after replacing HJE S by E f /E s . Furthermore, 
Fig. 7 can be used in pure plastic deformation for the difference between of ean and 
<7™ ax , after replacing EjJE s by Hj/E s . 

Figures 12 and 13 reveal the limit of the Stoney equation in stress analysis. 
According to Fig. 12, for a difference within ±10 % between a 0 and <7™ ax , the 
thickness ratio is recommended to be about 0.1 or less. 
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Fig. 13 Contour of difference 
plastic deformation 


OQ - O 


mean 


f 


O 


mean 


f 



against HjJE s and tjjt s when the whole him is in 


As shown in Fig. 13, which is identical to Fig. 1 in Klein (2000a) (note: that is for 
elastic analysis), the difference between a 0 and of ean is < 10 % for a thickness ratio 
<0.1, independent on the modulus ratio. Hence, it may be concluded that as 
compared with the Stoney stress, for a precision within ±10 %, the only require- 
ment is that the thickness ratio is less than about 0.1. 

As defined previously, A T 2 is the temperature at which plastic deformation is 
within the whole him. Now substituting <7} mn = o Y into Eq. 43c, one has 





E f E s t s 



Hftj ( 2tf + 3fs) 





At a temperature between AT^ and A T 2 , only part of the him is deformed 
plastically. The condition of equilibrium (the total resultant force and the total 
moment must be zero) requires 



and 
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* ts 

* t s -\-d 

a s ydy + 

J 0 

o' f ydy + 

t s 


' t S + t f 


t x -\-d 


o/ydy = 0 


(47b) 


where d is the location of the yield start point in the film at a certain temperature 
between A T x and A T 2 (refer to Fig. lib). In this case, Of 


y=t s +d 


f 


y=t s +d 


o Y . 


Note that Eqs. 47a and 47b are difficult to solve analytically. Numerical solution 
can be obtained if the values of the parameters are available. 

The difference among all the solutions within this temperature range should be 
between those of the elastic solution and pure plastic solution. 


A Case of Elastic-Plastic Analysis 

To reveal the possible difference in of ax , d} ,eai \ and of 111 as compared with o 0 in 
the whole temperature range, consider the case that an aluminum film is deposited 
atop a silicon substrate. Ignore the possible intermediate layer between them 
formed upon deposition. Most of the parameters listed in Table 2 are taken 
from (Gere and Timoshenko 1991). Three values for Hf, namely, 15 GPa (strain 
hardening), 0 GPa (i.e., ideal elastic-plastic material), and —15 GPa (strain 
softening), are used for analysis and comparison. A7\ and A T 2 of three situations 
are listed in Table 3. 

The stress distributions in the film and substrate at two temperatures of 25 °C and 
35 °C are shown in Fig. 14. 

Film stresses (maximum, average, minimum stress) are plotted against temper- 
ature variation in Fig. 15. It can be seen that the film stresses have linear relation- 
ships against temperature at a AT below A7\ or above A T 2 . Between them, i.e., 
the film is partially in plastic deformation, the relationship is nonlinear. 
When the whole film is in the plastic range, the difference between of ax and of ean , 

j , becomes smaller. However, the difference between 

oq and of ax or of ean becomes larger as Hf is smaller than Ef. 

Note that in the case of Hf = —15 GPa, the stress at the interface is not the 
maximum stress of the film after the yield start stress is reached due to softening. 
For the same reason, the minimum stress is not at the top surface of the film. Hence, 
the minimum and maximum stress presented in Fig. 15c should be read as the film 
stress at the top surface and the interface, respectively, to reflect the true physical 
meaning of them. 

Figure 16 shows the exact value of d in the temperature range from AT^ to A T 2 
together with that determined by a linear estimation, i.e., 

AT - AT i 

xt f 



d 


A T 2 - AT i 


(48) 
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Table 2 Parameters used in elastic-plastic case study 


Parameter 

Young’s 

modulus 

CTE 

Thickness 

Yield start 
stress 

Hf 

Aluminum 

70 GPa 

23 x 10 6 /°C 

5 pm 

20 MPa 

15 GPa 

0 GPa 

-15 

GPa 

Silicon 

162 GPa 

2.6 x 10 6 /° 

C 

15 pm 




Table 3 AGandAGof 
three kinds of film with 
different Hj 



H f = 15 GPa 

H f = 0 GPa 

H f = -15 GPa 

A T, 

(°C) 

22.38 

22.38 

22.38 

£ 

< 

(°C) 

30.08 

29.47 

28.90 


The errors in stress between the exact solution and that of Eq. 48 in percentage 
(taking the exact solution as reference) are shown in Fig. 17. It reveals that the 
maximum error in stress in percentage in all three Hf is less than 0.1 %. Therefore, 
in this particular case d can be estimated as a linear function of AT. 

Figure 1 8 plots the curvature as a function of temperature variation. As can be 
seen, the curvature slope decreases sharply after the film reaches plastic deforma- 
tion. In the case that Hf = 0, it becomes a constant after the whole film is over its 
elastic limit, while in the case that Hf — —15 GPa, it drops down linearly. 


Numerical Simulation 

Consider the same case that an aluminum film is deposited atop a silicon substrate 
as assumed in section “Numerical Simulation.” However, now three values of Hf, 
namely, 15 GPa (strain hardening), 5 GPa (strain hardening), and 0 GPa (ideal 
elastic-plastic material), are used for analysis and comparison, which are different 
from those in section “Numerical Simulation.” The thickness of substrate is 15 pm, 
while that of film is 5 pm except the cases that the effect of film thickness is studied 
in which film thickness varies from 1 to 5 pm. The length of the bilayer structure is 
240 pm. 

Commercial finite element analysis package ANSYS 8.0 was utilized to simulate 
the thermal mismatch-induced stress in a steady-state analysis. The substrate is 
modeled as isotropic and perfectly elastic. The plastic behavior of film is assumed 
to be isotropic. Hence, its stress versus strain relationship is modeled using bilinear 
isotropic hardening (BISO). In isotropic hardening, the yield surface remains 
centered at its initial centerline and expands in size as the plastic strains develop. 
The interface between the film and substrate is assumed to be perfectly bonded at all 
times during simulation. The simulation is simplified as a two-dimensional model 
using eight-node quadrilateral elements (PFANE82 with plane stress option) with 
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Fig. 14 Stress distributions 
in the film and substrate at 
25 °C and 35 °C. 

(a) H f = 15 GPa; (b)H f = 0; 
(c )H f = -15 GPa 
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Fig. 15 Film stresses as 
function of temperature 
variation, (a) Hf= 15 GPa; 
(b) H f = 0; (c) H f = - 15 GPa 
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Fig. 16 Location of d against 
AT in the range between 
and A T 2 . Symbol , exact 
solution; line, determined by 

Eq. 48. (a )H f = 15 GPa; (b) 
H f = 0; (c )H f = -15 GPa 




Temperature variation (°C) 



3092 


Y.Y. Hu and W.M. Huang 


Fig. 17 Difference between 
exact solution and 
approximate solution (in %). 
(a) H f = 15 GPa; (b)H f =0; 
(c )H f = -15 GPa 
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Fig. 18 Curvature as a 
function of temperature 
variation 



two degrees of freedom at each node: translations in the nodal X and Y directions. 
The element size of the film is minimized gradually from top to interface, as 
interface is most likely to be with high stress concentration since both the maximum 
stress in the film and maximum stress in the substrate could appear at the interface 
as found in the previous studies. Due to the fact that film stress is more of our 
interest, coarser mesh is used for the substrate except in the area near the edge 
where stress concentration may also happen. The bilayer strip is symmetrical about 
its midpoint in the length direction, and hence only half of it needs to be modeled. 
Finally, a two-dimensional rectangular mesh for the right half of the bilayer strip 
consists of 576 elements and 1849 nodes (in the case of 5-pm film). 

The boundary and initial conditions are as follows: (a) the middle line in X 
direction (left boundary of the model) is treated as symmetrical axis, and hence all 
nodes on it are constrained from moving in X direction (X direction is along the 
length of the bilayer structure); (b) the node at the lower left corner is fixed to 
prevent from moving in both X and Y directions in a rigid body fashion (Y direction 
is along the thickness of the bilayer structure); (c) all other nodes are free to move; 
(d) the bilayer strip is in free stress state during deposition. The simulation assumes 
a uniform temperature distribution in the strip, and the temperature variation (A T) 
starts from cooling from the deposition temperature. 

Film stresses ( o x ) are plotted against temperature variation in Fig. 19. For finite 
element method (FEM), stress in the area far away from the edge is plotted, as can 
be seen later; film stresses may change significantly at the edge. 

Figure 19 shows that thermal stress (both at top and at interface) increases with 
the increase of temperature variation, and the analytical result agrees well with that 
of FEM. In the range that the film is fully elastic or plastic, the relationships 
between film stresses and temperature variation are linear, which agree well with 


3094 


Y.Y. Hu and W.M. Huang 



Fig. 19 Film stresses o x as function of temperature variation (Hf= 15 GPa) 


the analytical results. As mentioned earlier, a linear variation in film stress over the 
thickness direction occurs when the film is fully elastic state, with the maximum 
film stress located at the interface. The film starts to yield at the interface first 
(in this studied case, at a temperature variation of about 22.4 °C), and then yielding 
part expands along the thickness direction upon further increase in temperature 
variation till the whole film is in plastic state. Therefore, the relationship of film 
stress versus temperature variation is nonlinear when the film is partially in plastic 
deformation. There is no sudden turning point in average film stress versus tem- 
perature curve (for a clear view, average film stress is not plotted in Fig. 19, refer to 
Fig. 15) even if the film has an obvious yield point in its stress versus strain 
relationship. Similar phenomenon has been reported in bending of a bilayer micro- 
beam (Florando and Nix 2005). 

Figure 20 shows FEM results of film stresses both at top and at interface versus 
film thickness when temperature variation is 15 °C and Hf= 15 GPa. Both stress at 
top and stress at interface increase as film thickness decreases. On the other hand, 
the thinner the film is, the closer the stress at top and the stress at interface are. That 
is to say in the case that substrate is much thicker than the film, the film stress is 
approximately uniform over the thickness direction. 

Figure 21 shows the film stress ( o x ) at the interface at three different temperature 
variations, namely, 20 °C, 26 °C, and 35 °C. Note here, at AT = 20 °C, the film is 
in elastic range, while at AT = 35 °C, it is in whole plastic range. Between them, 
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Fig. 20 Film stress o x versus film thickness (temperature variation is 15 °C and Hf = 15 GPa) 
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Fig. 21 Film stress (<7 X ) at the interface at three different temperature variations (Hj — 15 GPa) 
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Fig. 22 Film stress (<7y) at the interface at three different temperature variations (Hf — 15 GPa) 

at AT = 26 °C, it is partially in plastic deformation range. From Fig. 21, it can be 
seen that at a place which is far away from the edge, the FEM results agree well 
with the analytical values. However, FEM results show gradual decrease starting 
from about 100 pm due to the edge effect, and it becomes dramatic near the edge. 
The range of the edge effect in in-plane is around the total thickness of the bilayer 
strip (20 pm). On the other hand, the larger the temperature variation is, the bigger 
the o x is. However, the rate of increase of a x is smaller after yielding. 

Figure 22 shows film stress (a y ) at the interface at three different temperature 
variations. While o y is nearly zero in the area far away from the edge, it increases 
suddenly in the area near the edge till reaching the maximum. For instance, when 
temperature variation is 35 °C, the maximum o y is 22.51 MPa, which is about the 
same as the maximum a x (22.95 MPa). Hence, o y may also cause failure of adhesion 
at the interface and/or spallation of the film. In addition, the range for the edge 
effect is also about 20 pm from the edge. 

Film shear stress ( a xy ) at the interface is shown in Fig. 23. Like o y , o xy is almost 
zero in the part far away from the edge. At the edge, it increases significantly 
following the same trend at three temperature variations. Besides o x , a y and a xy may 
also be the reasons for the failure of film due to the fact that o y and o xy may 
concentrate at the edge or at the area near the edge as shown in Figs. 22 and 23. 

Figure 24 shows film shear stress (<j xy ) distribution through film thickness at 
different locations, namely, (x = 100, 110, and 120 pm), and with different strain 
hardening moduli (Hf= 15, 5, and 0 GPa, respectively) when temperature variation 
is 35 °C. It can be seen that at x = 100 pm from the middle point, the film shear 
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Fig. 23 Film shear stress {o xy ) at the interface at three different temperature variations (Hj =15 GPa) 
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Fig. 24 Film shear stress (o xy ) distribution through him thickness at different locations 
(x = 100, 110, and 120 pm) and with different strain hardening moduli (. Hf= 15, 5, and 0 GPa, 
respectively) when temperature variation is 35 °C 
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stresses are almost zero through thickness of film in all three strain hardening 
moduli cases. At x = 110 pm , the maximum shear stress occurs at the interface, 
but it almost remains the same in three strain hardening moduli cases. While at 
x = 120 pm, i.e., the edge, the lesser the strain hardening modulus is, the smaller the 
maximum shear stress is, which is located at the interface. 


Determination of Thermomechanical Properties 

A lot of dedicated techniques have been developed to determine the thermome- 
chanical properties of thin films, such as the Young’s modulus, coefficient of 
thermal expansion (CTE), Poisson’s ratio, etc., as reviewed briefly in section 
“Literature Review.” Despite of these previous efforts, most of these techniques 
require either a very dedicated equipment (and normally very expensive as well) or 
a complicated procedure in testing and analysis and have some difficulties in 
dealing with a few properties by just a couple of tests. 

In 1980, Retajczyk and Sinha (1980) proposed the so-called thermally induced 
bending method, in which two different materials were used as the substrate, to 
simultaneously determine two in-plane properties of thin films, namely, biaxial 
modulus and CTE. Continuous improvement has been made since then (e.g., Lima 
et al. 1999; Thum and Hughey 2004). For example, Zhao et al. (1999) improved this 
method to determine the Young’s modulus and CTE of thin films by measuring 
thermal stress in thin films deposited atop Si and GaAs substrates. Extensions of 
this technique even allow to estimate the film’s Poisson’s ratio. Tien et al. (2001) 
proposed a way associated with an image processing technique to determine the 
mechanical properties of thin films atop a substrate, in which the change of the 
substrate deflection was obtained by a digital phase -shifting interferometry. At 
present, the Young’s modulus, Poisson’s ratio, and CTE can be determined using 
this method (Zhao et al. 1999, 2000b). 

Thermally induced bending method is based on the variation in curvature of 
bilayer structures upon temperature fluctuation, which can be easily measured by 
many conventional techniques. But most of the methods proposed in the literature 
are derived from simplified solutions and are more applicable in finding only some 
mechanical properties of elastic thin films. 

On the other hand, residual stress, which normally includes two parts, namely, 
the intrinsic stress and thermal stress, in multilayer structures is an important issue 
largely due to its possible strong influence on the performance and reliability of 
structures. Many approaches have been proposed in the past years in order to 
determine the residual stress. However, most of them are not applicable to multi- 
layer structures and/or fail to quantitatively separate the intrinsic stress and thermal 
stress. 

This section reviews simple approaches to determine thermomechanical prop- 
erties of thin films on an elastic substrate. 
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Five Properties in a Bilinear Material 


Simplified Formula in the Literature 

In section “Elastic and Elastic-Plastic Analysis of Multilayer Thin Films,” it has 
been shown that the average stress in the thin film can be expressed as 



The meanings of the symbols are the same as those in section “Elastic and 
Elastic-Plastic Analysis of Multilayer Thin Films.” In the case that tf « t s , i.e., 
very thin film, by ignoring terms with orders higher than tf and replacing o f ean by oy, 
Eq. 49 can be reduced to 


Of = Ef{a s — ay) AT (50) 

From Eq. 50, one can easily determine the stress versus temperature variation 
curve. The slope of the stress versus temperature variation curve can be expressed as 


dof f 

—A = Ef (a s 
dT n 




E 


Note that in the case of biaxial stress, biaxial modulus, Ef (= where Vf is 


Poisson’s ratio), should be used instead of Ef. The simplified formula, i.e., Eq. 51, 
has been used widely by many researchers. 


Determine Thermomechanical Properties of a Thin Film 

Consider a layer of thin film deposited atop a substrate. The properties of this thin 
film are unknown. For simplicity, assume the material constants of the thin film and 
substrate are temperature independent (at least within the testing temperature 
range), the properties of the substrate are known, and the substrate is always in 
the elastic deformation range. The thickness, Young’s modulus, and coefficient of 
thermal expansion of thin film are denoted as tf , Ef, and ay, and those of substrate are 
t S9 E s and a 5 (^ay), respectively, as defined in section “Elastic and Elastic-Plastic 
Analysis of Multilayer Thin Films.” Given a small temperature variation (AT), the 
bilayer structure tends to bend due to the difference in CTE. The stresses in the film 
and substrate are denoted by Of and o s , respectively. 

Based on the equilibrium conditions, the curvature of the bilayer structure, K e , 
can be expressed as (Hsueh 2002; Hu and Huang 2004) 



( 52 ) 
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Fig. 25 Illustration of 
curvature as a function of 
temperature variation. 
Subscripts “1” and “2” stand 
for the results of two 
structures. (A Ti, K p] ) is any 
point in the full plastic range 
(for the him) 



Temperature variation AT 


where the subscript “e” indicates that the thin film is in the elastic range. Upon 
heating or cooling, one can obtain the curvature versus temperature relationship of 
the structure as illustrated in Fig. 25. 

The curvature has a linear relationship with temperature variation if the whole 
film is in the pure elastic deformation range (refer to section “A Case of 
Elastic-Plastic Analysis”). Let dKJdT = l/T^. From Eq. 52, one can obtain that 



If the temperature variation is significant, the film may be over its elastic 
deformation range partially or fully. However, for most cases, it might be reason- 
able to say that the plastic strain is well limited within a regime slightly over the 
yield start point. 

The stress versus strain relationship of most engineering materials in a range slightly 
over the elastic range may be estimated using a bilinear model as illustrated in Fig. 10. 

Hence, the curvature of the bilayer structure after the thin film is fully in the 
plastic range can be expressed as 



where the subscript “p” indicates that the thin film is in the pure plastic range 
(Fig. 25). Our previous investigation in section “A Case of Elastic-Plastic Analy- 
sis” shows that the curvature has a linear relationship with temperature variation 
again if the whole film is in pure plastic deformation. If only part of the film is in 
plastic deformation, the relationship is nonlinear. Let dK p /dT = iP¥ p (Fig. 25). 
From Eq. 54, one can obtain that 
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One may carry out two tests on two structures (for instance, same thin film 
thickness but different substrate thickness, namely, t s i and t s2 ), and two curvature 
versus temperature variation curves can be obtained. Hence, two } ¥ e CP e ,i and ^, 2 ) 
and two 1 and 2 ) are resulted. Provided that E s and a s are known, E f and ay- 

can be obtained by solving two equations in the form of Eq. 53, which results in 



where 


0\ — ¥*,1 (tf + k,i) (2tf + 3t/t s ,2 + ^, 2 ) 
@2 = ^e,2 (tf + ts 9 2 ) ( 2 /y + + 2^0 


^3 — 1^5,1 ((/ “b (s,l) 


#4 = 2^5,2 (*/ + ^,2) 

^5 = + ^,0 

^6 = ^,2^ ? i (// + ^, 2 ) 

Note that in the case of t s , 1 > ^, 2 , the sign of square root in Eq. 56 should be plus; 
otherwise, minus. The expression of ay ( i.e., Eq. 57) can also be expressed in terms 
of t s 2 and *P gj2 . 

After Ef and ay- are obtained, ///and <7y can be resulted by solving Eqs. 54 and 55, 
which results in that 



( 59 ) 
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where 



<?8 = H)t) + E 2 / s + 2 HfEstfts (it 2 + 3 tft s + lA 

and AT*i and K* p X are defined in Fig. 25. 

In the case of tf « t s , i.e., very thin film, after deleting terms about tf with an 
order higher than 1, Eq. 53 can be reduced to 


2 Eftf 2 t 





Subsequently, Ef and otf can be solved as (first-order approximation) 



E 


f 


a f 


Es A - 

- j 

(61) 

4tf ('P e ,2?s,l - 

- WeltJ) 

2(^,1 - 

\ + oc s 

(62) 




¥ 




For the zero-order approximation, any term about tf i s removed in Eq. 53 except 
the one multiplying with x i f e . Thus, Eq. 53 can be reduced to 


dK t 

~dT 


6Eft f (a f 


a 


EJ 


(63) 


Obviously, one cannot determine both Ef and ay from Eq. 63. Note that combin- 


ing Eq. 63 with the well-known Stoney equation (Stoney 1909), which is 


one can obtain that 


a 


E s t]K e 


6 1 


(64) 


/ 


do 

dT 


E f («/ 


a 


(65) 


which is identical to that in, for instance, Lima et al. (1999) and Zhao et al. (1999). 
Similarly, for a first-order approximation, Eq. 55 can be reduced to 


2 Hftf 2 1 


3^ ( 01 f 


a 


+ EJ 


0 


( 66 ) 


Subsequently, Hf and o Y can be obtained as 
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F t 2 

1 






For a zero-order approximation, 




At this point, the Young’s modulus, CTE, yield start stress, and strain hardening 
modulus are obtained based on the curvature versus temperature variation curves 
measured in two tests. One more curvature versus temperature variation curve is 
required to determine Poisson’s ratio. Assume that in the abovementioned two tests, 
the structures are under uniaxial stress state. As such, the additional curve should be 
obtained from a biaxial stress state test. Otherwise, the additional test should be 
under uniaxial stress state instead. 

In the case of biaxial stress, biaxial modulus, E (where stands for the case of 
biaxial stress), should be used instead of uniaxial modulus. Then, 



where Ef = Ef/(l — Vf) and E s = E s /{ 1 — v s ). u^and v s are Poisson’s ratios of the 

film and substrate, respectively. If Ef and ay are available, Vf can be obtained from 
Eq. 71 as 




- 3*P e (t f + t s ) (a/ - a*) 



In the case of tf« t s , Vf can be solved as 





as a first-order approximation. 
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For a zero-order approximation, Vj can be solved as 


°/ = 1 (74) 

The derivation presented above is mainly for the case that the substrates are of 
the same material but in different thickness. In real engineering practice, there are 
many alternatives. For instance, one may consider using substrates made of differ- 
ent materials (same as that in Retajczyk and Sinha (1980)), or different film 
thickness instead of different substrate thickness (provided that the properties of a 
thin film is independent on the thickness). 

The comments in Malzbender (2005) on the above approach (Huang et al. 2004b) 
present a few remarks to further extend our approach as an effective tool to simplify 
the analysis of the mechanical properties of materials. The use of differentiates was 
suggested in Malzbender (2005) for assessing the properties of thin films at a particular 
temperature and applied strain. It was also mentioned that the strain-load (ds/dP) or 
load-deflection differentiation ( dP/dd ) curve in a uniaxial or biaxial bending test can 
be utilized to determine the entire nonlinear stress-strain relationship of a thin film. 
Further extension of this idea into multilayer composites is, at least in theory, possible. 


Determination of Intrinsic Stress and Thermal Stress 

Residual stress is almost unavoidable in multilayer structures. Residual stress may 
cause cracking and/or other damage to the stmctures. For instance, if the residual 
stress is over a certain limit, catastrophic or long-term failure may occur. Contact peel 
off due to a higher tensile stress and de-adherence and/or buckling due to a high 
compressive stress are two typical examples (Ohring 2002). On the other hand, the 
presence of residual stress may sometimes enhance the strength of stmctures. For 
example, a high compressive stress can prevent the initiation of surface cracks. 
Therefore, in order to obtain reliable multilayer structures, it is necessary to determine 
the residual stress. Our literature review reveals that this issue has attracted a lot of 
attention for some years (Janda and Stefan 1984; Janda 1986; Nie et al. 2006). 

It is well known that in the absence of external loads, there are two major 
contributors to residual stress in multilayer structures: intrinsic stress and thermal 
stress. The former is largely due to the lattice mismatch between two adjacent 
layers and growth of strains (in the literature some researchers define the stress due 
to the difference in structural properties between two adjacent layers as interfacial 
stress), while the latter is due to temperature change, which most likely refers to the 
fluctuation of environmental temperature and/or a high film deposition temperature. 
In the past decades, many dedicated methods have been developed in order 
to determine the residual stresses, but without separating the intrinsic stress and 
thermal stress (Nie et al. 2006). Recently, some researchers proposed some 
methods to separate the thermal stress and intrinsic stress in bilayer structures 
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Fig. 26 Schematic sketch of 
a multilayer structure ( n - 
layers atop a thick substrate) 





(Yu et al. 2000; Jeong et al. 2001; Choi et al. 2005), but none of them is for 
multilayer structures. Most of these methods require the thermomechanical prop- 
erties of thin films to be known first (Janda and Stefan 1984). Using the equivalent 
reference temperature (ERT) technique, Nejhad et al. (2003) managed to model the 
intrinsic strain in a trial and error fashion. 

The residual stress can be determined using many advanced technologies. 
Among them, curvature-based method, in which Stoney equation is utilized (Stoney 
1909), might be the most popular one (Gunnars and Wiklund 2002; Mezin 2006). In 
this section, a curvature-based approach, which can determine not only the residual 
stresses but also the intrinsic stress and thermal stress in each layer, is reviewed, 
which only requires the thermomechanical properties of substrate to be known, but 
nothing about the thin film. 

Determination of Residual Stress 

Assume that a multilayer structure is long enough as compared with its thickness and 
its curvature is much larger than the thickness. Consider the case of an n - layer thin film 
on the top of a substrate as illustrated in Fig. 26. The thickness, Young’s modulus, and 
coefficient of thermal expansion of layer z are denoted as t n E h and a h and those of 
substrate are t S9 E s , and a s , respectively. Note that in the case of biaxial stress, biaxial 

modulus Ej (=£)■/( 1 — u z ), where u z is Poisson’s ratio) should be used instead of E t . 

In Fig. 26, each layer suffers force and moment, which can be replaced by a 
statically equivalent combination of an average force and a moment (Ohring 2002) 
as illustrated in Fig. 26, in which P z and M { are in the zth layer and P s and M s in the 
substrate. Force P z can be imagined to be uniformly applied over the whole cross- 
sectional area (t;w, where w is the width of the structure). Hence, the total stress o t 
(including intrinsic stress and thermal stress) in the zth layer can be expressed as 
PiKtiW). The equilibrium conditions for force and moment are (Feng and Fiu 1983) 

n 

<7 s t s w + OjtiW = 0 (75) 

i=i 

n n 

M s + o s t s w{t s / 2 ) + Mj + OjtiW t s + {til 2 ) + tj 

i= 1 i= 1 j<i - 1 


o (76) 
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O Q 

where M t = Eg, w/( 1 2r), M v = E s t s w/( 1 2r), and r is the radius of curvature. In the 

n n 

case of thin films, i.e., t s ti and E s t s Ejti (Klein 2000c), Eq. 76 can be 

i= 1 z=l 

simplified as 

n 

E s t 3 s w/ (12 r) + a s t s w(t s /2) + ^ ait(wt s = 0 (77) 

1=1 

With Eqs. 75 and 77, one has 


) (78) 

i=\ 

In the above equation, the minus sign is due to the sign convention in defining 
the curvature. The residual stress in the nth layer o n can be determined from the 
curvatures measured before and after the removal of the nth layer, i.e., 




where r n is the measured curvature of the multilayer structure before the /?th layer 
is removed, while r n _ i is the curvature after the nth layer is removed, i.e., n—l 
layers are still on the substrate. If the thin films are etched away layer by layer 
from the top layer and the corresponding curvatures are measured, the residual 
stress in each layer can be determined. This sounds conventional (Nie et al. 2006). 
However, a quantitative determination of the intrinsic and thermal stresses is still 
a challenge. 

Determination of Intrinsic Stress and Thermal Stress 

As described above, after film processing (including deposition and postdeposition 
treatment, e.g., annealing), the residual stress has two components, namely, thermal 

t • 

stress <7 which is temperature dependent and intrinsic stress o in which is temper- 
ature independent (i.e., <7/ = of + of ) (Choi et al. 2005). 

In the case of thin layers, the thermal stress in the ith layer can be expressed as 
(Townsend et al. 1987; Hsueh 2002) 



where Tj is the deposition temperature of the ith layer, T) is the temperature at 
which the curvature and residual stress are measured, and of 1 is the thermal stress 
in the ith layer at Tj . 

If two identical multilayer structures are prepared (i.e., produced simultaneously 
in batch or following the same process under the same conditions), and subse- 
quently measured, the curvature of both structures before and after the removal of 
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1 o 

the zth layer at two different (but constant) temperatures T t and T t . Consequently, 
one has the following two equations: 





i 9 

Note that both residual stresses 07 and 07 in the above equations can be 
determined by Eqs. 78 and 79. Referring to Eqs. 81 and 82, the intrinsic stress in 
the zth layer can be determined as 



After the intrinsic stress is determined, thermal stress at a given temperature can be 

• t 

obtained subsequently as 07 = of + of. Repeat the above procedure layer by layer, 
and the intrinsic stress and thermal stress in each layer can be determined. Practically, 
a few identical samples may need to be tested at some different temperatures for data 
fitting, so that more accurate result of intrinsic stress can be obtained. 


Determination of Stress Versus Strain Relationship and Other 
Thermomechanical Properties of Thin Films 

In recent years, many dedicated techniques have been developed for characterizing 
thin films, as discussed briefly in section “Literature Review,” such as X-ray 
diffraction technique (Badawi et al. 2002), membrane deflection methodology 
(Espinosa et al. 2003), nanoindentation (Malzbender and Steinbrech 2003), etc. In 
general, these proposed techniques either require dedicated equipments or are very 
tedious in sample preparation. Therefore, simple approaches for characterizing the 
thermomechanical properties of thin films, including stress versus strain relation- 
ship, coefficient of thermal expansion (CTE), Young’s modulus, and Poisson’s 
ratio, are still highly in demand. 

In section “Determination of Thermomechanical Properties,” a simple technique 
to determine five thermomechanical properties of thin ductile linear strain harden- 
ing films has been proposed. Since it is based on the thermally induced curvature 
approach, it is only applicable to films with temperature-independent material 
properties (at least within the temperature range of interest). 

In this section, we further extend our review to some more generic approaches, 
which can deal with films with nonlinear stress versus strain relationship and/or 
temperature -dependent material properties. 

Thermally induced curvature test and bending test (e.g., four-point bending, three- 
point bending, and bending a cantilever beam) are standard techniques for testing thin 
films atop an elastic substrate for various purposes (Florando and Nix 2005). The 
approaches purposed here involve no other tests but only these conventional ones. 
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Thermally Induced Curvature Approach 

Before determining the nonlinear stress versus strain relationship of a thin film, the 
CTE of the thin film is assumed to be a constant. If it is an unknown, it can be 
determined either from the thermally induced curvature test following the approach 
described in section “Determination of Thermomechanical Properties” or by the 
method mentioned later. 


Nonlinear Stress Versus Strain Relationship of Thin Films 
by Thermal Tests 

Consider a thin film deposited atop an elastic substrate (its material properties are well- 
known) as illustrated in Fig. 27a. Hereinafter, the symbols and nomenclature (thick- 
ness, Young’s modulus, and CTE) are the same as those defined in section “Elastic and 
Elastic-Plastic Analysis of Multilayer Thin Films.” Subscripts /and s stand for the film 
and substrate, respectively. Note that here otf is assumed to be a constant. 

Upon temperature fluctuation (AT), a bilayer structure tends to bend due to the 
difference in CTE between the film and substrate. The strain (e) at a point y away 
from the interface can be expressed in terms of the strain at the interface (e 0 ) 
(Fig. 27b) and curvature of the bilayer structure (A K) by 

£ = A Ky + £ 0 (84) 

Here, £ has two components, namely, thermal strain ( s T = a AT) and mechanical 
strain (s m ), i.e., £ = s m + a AT. A K can be expressed in terms of the strains at the top 
and bottom surfaces (s t and s b ) by 

A K = (e, - e b ) / (t f + t s ) (85) 

At a particular temperature, A K can be measured by many conventional tech- 
niques. Therefore, one can obtain a curvature variation versus temperature variation 
relationship as illustrated in Fig. 28. 

In the first step (<1>), given a very small temperature variation (AT <1> ), a 
linear stress versus strain relationship can be assumed for the film in the very early 
stage. Thus, £q x> (strain at the interface) and H x (slope of the stress versus strain 
relationship of film, refer to Fig. 29) are two unknowns to be determined from 
A K <1> and A T <1> . The equilibrium conditions require that both the total force and 
total moment are zero, i.e., 



( 86 ) 
ds = 0 


( 87 ) 
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Fig. 27 Film and substrate, (a) Coordinate system, (b) illustration of strain distribution 


Fig. 28 Curvature variation 
versus temperature variation 



ak < m> 

AK'' 



Fig. 29 Stress versus strain 
relationship 



Equations 86 and 87 can be 
maximum mechanical strain 


solved for £q l> and H <1> . Subsequently, the 
can be obtained as 



( 88 ) 
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provided that the stress versus strain relationship of film (/ <z> (e^)) has been 
obtained from the previous i steps (Fig. 29). In the next step (</ + 1>), 1 > 

(through ?+1> ) and H <l+l> are determined. The equilibrium conditions are 


,</+i> 

0 


£^ l+]> —AK <i+l> t 


E s (e - a s AT <i+1> )de 


+ 


C+«/ A7 ' 


< / ■ 1 > 


(7 


<z> 


+ 


p </+l> 
b 0 

E< i+l> + AK <i+l> tf 


+ H <i+1> { e - cc f AT <i+1> 


C+“/ Ar 


</+!> 


/ 


<Z> 


s - a f AT <i+1> )de = 0 


<i> 

0, m 



(89) 


,</+ 1 > 

0 


EAs — a 


A7’< i+l> 'l - — 0 


s< i+l> -AK <i+l> t. 




A K 


</+l > 


dz 


+ 




<;+!> 


a 


<i> 


+ H 


</+ 1 > 


+ 


p</+l> 
fc 0 

eJ /+1> +A/(' < ' +l> z / 


a/ AT 


</+l> p</> 

m 


4:i+<*f AT 


</+!> 


/ 


<Z> 


a/AT 


O C </+1> 

</+l>\ b 0 




AK 


</+i > 


ds = 0 


<J + 1 > 

0 




<C/+ 1 > 


Je (90) 


8o ,+1> (then 8o,m !> ) an d // </+1> can be solved from Eqs. 89 and 90. Step by step, one 
can get the whole stress versus strain relationship of the film based on the measured 
points in the curvature variation versus temperature variation curve (Fig. 28). 


Closed-Form Solutions for Very Thin Films 

As technologies are moving continuously toward smaller, thinner, and lighter devices, 
the thickness of films has reduced dramatically, toward micron and even nanometer 
regime (Baker et al. 2003). In this part the review is extended to obtain closed-form 
solutions for very thin films with a nonlinear stress versus strain relationship. 

In the case of very thin film, the film stress Of can be considered as a constant 
(refer to Fig. 7). Similarly, the strain in the film is also approximately a constant (s t ) 
and can be estimated to be the same as the strain atop the substrate. In other words, 
the stress and strain in the film are approximately uniform in the thickness direction 
(Hsueh 2002; Hu and Huang 2004). As such, s t « £ 0 i n Fig- 27b. 

Given a (A K, AT) point in Fig. 28, based on that the equilibrium conditions are 
that the total resultant in-plane force must be zero and the total moment must be 
null, one has 




i/ 



a s AT)dy + 










a s AT)ydy + 
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where 



A Ky + 


8 t t s + Sfjtf 

tf + t s 


Solving Eqs. 91 and 92, one has 



ak :e4 

6ff (tf + t s ) 




•T a s A T 




A Kt s (3tf H- 2t s ) 

6(f/ + 1 s ) 


-f~ a s A T 


(93) 

(94) 

(95) 


The total strain, e, has two parts, namely, thermal strain s T = (aAT) and 
mechanical strain e m , i.e., e = e T + Consequently, the mechanical strain in the 
thin film s m can be expressed as 



s t — ocfAT 






Corresponding to each (AT, AK) point in the curvature versus temperature 
variation relationship, which can be measured experimentally, s m and Of obtained 
from Eqs. 96 and 93 form a point in the stress versus strain relationship of the film. 
Thus, given a series of (AT, AK) point, a series of s m and Of are resulted from Eqs. 96 
and 93. As such, the stress and strain relationship of the thin film is obtained. 

The second term of the right side of Eq. 96 is the total thermally induced 
mismatch. The film accommodates most of the mismatch strain, but not all. The 
first term is normally smaller as compared with the second term and is often ignored 
in many previous papers (Baker et al. 2003). However, a closer look shows that, 
under normal conditions, the first term is about 1/4 of the second term at tj/t s = 1/10. 
On the other hand, given a thickness ratio tjjt s up to 1/10, the variation in film stress 
is less than about 5 % against the average. 

For simplicity, take the linear elastic case as an example for analysis. With 
Eq. 26c, Eq. 96 can be rewritten as 



The second term of the right side of Eq. 96 is denoted by s sec , i.e., 




sec 


( a s — aA AT 


(98) 
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Fig. 30 Contour of rj e (%) against thickness ratio (tjjt s ) and modulus ratio ( Ef/E s ) 


In order to compare the difference between s m and 8 sec , rj s = (8 sec — 8 m )/8 sec may 
be defined as a parameter for measurement. Hence, 


6yA 3 + 9yA 2 + 4yA 
1 + 4yA 3 + 6yA 2 + 4yA + y 2 A 4 



where y is Ef/E s and A is t/t s . r\ & is plotted against y and A in Fig. 30. 

It reveals that rj s may be significant for relatively thick films and in particular for 
hard films. Depending on the precision required and the thickness ratio, one may 
refer to Fig. 30 to determine which set of equations are more suitable in the 
determination of stress versus strain relationship of a film (Eqs. 96 and 93, or 
Stoney equation and Eq. 98). 

The thermomechanical properties of film and substrate listed in Table 1 are 
used to reveal the difference. However, the thicknesses of film and substrate 
are now 5 and 40 pm, respectively. Utilizing Eq. 26c, a “virtual” curvature 
versus temperature variation curve can be obtained. Then Eqs. 96 and 93 
have to be used to determine the stress versus strain relationship of the film. 
For comparison, Stoney equation and Eq. 98 are also used to determine the 
stress versus strain relationship of film based on the curvature versus temperature 
curve. 

Figure 31 shows that there might be some cases in which the approximation 
(Stoney equation) is not precise enough, but it is still valid to assume that the stress 
and strain in the film are uniform. In these cases, Eqs. 93 and 96 are required. 


85 Thermal Stress Analysis and Characterization of Themo-Mechanical. . . 


3113 



35 



25- 





0 


/ 


/ 


S' 

S / 


// 
* 

// 

, r . ' 
# / 


/■ / 
V 

/X / 


/ 


/ 




S' 

s / 


✓/ 

<• / 
// s 

<• ■ 


s' 


S' 


Assumed curve 

Equations 5.10 & 5.13 

— • - Stoney equation 5.15 & 4.44 


// 






0 


0.01 


0.02 


0.03 


0.04 


0.05 


0.06 


Strain (%) 


Fig. 31 Comparison of stress versus strain relationships 


Four-Point Bending Approach 

Thermally induced curvature approach is only applicable to films with temperature- 
independent material properties. In the case that the mechanical properties of a film 
are temperature dependent, bending test at a fixed temperature can be applied as an 
alternative . 

Nonlinear Stress Versus Strain Relationship of Thin Films by 
Bending Test 

Take the four-point bending test illustrated in Fig. 32 as an example. Upon loading 
( P ), the bending moment in the middle span M ( =Pa ) versus displacement (S) curve 
can be measured. Since M in the middle span is a constant, the central region has a 
constant curvature (Gere and Timoshenko 1991), which can be expressed as 

A K = -2 8/(S 2 + 1 2 /4) = (e, - s b )/(t f + t s ) (100) 

Here, for convenience, A K has a minus sign, which is opposite to the conven- 
tional definition. In a bending test, the stress versus strain relationship of the film 
can be derived from the measured moment versus displacement curve in a similar 
manner as that described previously in the thermally induced curvature test. 
Note that here, thermal strain can be deleted, since temperature is a constant. 
Hence, s = s m . Apart from that, in the thermally induced curvature test, the 
maximum mechanical strain in the film is located in the interface. In contrast, in 
the four-point bending test, it is located at the top surface of the film. 
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Fig. 32 Four-point bending 
of bilayer structure 



Closed-Form Solutions for Very Thin Films 

Similarly, closed-form solutions can be obtained for very thin film cases in four- 
point bending test. In the case of very thin films, the equilibrium conditions are 
(refer to Fig. 32) 


o 


E s sdy + 


-t 


* tf 


o 


Gfdy = 0 


o 


E s sydy + 


-t 




o f ydy — M — Pa 


o 


Hence, Of and s t can be solved as 


( 101 ) 

( 102 ) 



6Pa (4<5 2 + l 2 ) + 4 8E s t] 
3ff (tf + t s ) (4 <5 2 + 1 2 ) 


(103) 



(104) 


Thus, from the 8 versus M(=Pa) curve, through Eqs. 103 and 104, the stress 
versus strain relationship of the film can be resulted. 

Similar to the abovementioned four-point bending test, the stress versus strain 
relationship of a film can also be determined by the three-point bending test or 
cantilever test. If the beam is in a particular type of triangular shape, so that the 
bending moment per unit width of the bilayer beam is a constant (Florando and Nix 
2005), after ignoring the biaxial effect, the curvature of the beam is uniform 
throughout the beam. Practically, it is reduced to that of the four-point bending test. 

In addition to above stress versus strain relationship of thin films, using the slope in 
the early unloading part of a bending test, one can obtain the Young’s modulus of the 
film. With it, ctf (if it is a constant) can be derived from the slope in the early thermally 
unloading part of a thermally induced curvature test. Measuring the elastic constants 
in uniaxial and biaxial stress states, one can estimate Poisson’s ratio. 


A Case Study 

This section shows a case study of determination of nonlinear stress versus strain 
relationship of films using thermally induced curvature method. First, a nonlinear 
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Fig. 33 Nonlinear stress 
versus strain relationship 



Table 4 Parameters 


Parameter 

Young’s modulus 

CTE 

Thickness 

Thin film 


23 x 

10 6 /°c 

1 pm 

Silicon 

1 62 GPa 

2.6 x 

10“ 6 /°C 

40 pm 


stress versus strain relationship is assumed. Second, based on the assumed relation- 
ship, two methods are utilized to obtain the curvature versus temperature variation 
curve, namely, analytical method and numerical method (finite element analysis). 
Finally, based on the “calculated” curvature versus temperature variation curve, the 
method developed in the above section is utilized to determine the nonlinear stress 
versus stress relationship, and then the determined nonlinear stress versus stress 
relationship is compared with the assumed one. 

As a demonstration, consider a layer of thin film with a nonlinear stress versus 
strain relationship as (Exadaktylos et al. 2001) 

a(e) = Es + Fe 2 (105) 

It is deposited atop a silicon substrate. Here, E is 70 GPa and F is — 1 12,000 GPa 
(= — 1,600 E). Refer to Fig. 33 for the stress versus strain curve. Other parameters 
are listed in Table 4. The length of bilayer strip is assumed to be 800 pm in the finite 
element analysis. 

The curvature versus temperature variation curve can be calculated by the 
method described in section “Elastic and Elastic-Plastic Analysis of Multilayer 
Thin Films.” For a nonlinear stress versus strain relationship of a film here, it needs 
some minor modifications. The equilibrium equations are 


r* 







a s AT)dy + 


r l f 


o 


E(e - of AT ) + F(e - a f AT) 2 dy = 0 
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Fig. 34 Curvature versus 
temperature variation 
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(107) 


Then, the curvature can be obtained by Eq. 85 after solving Eqs. 106 and 107 as 
shown in Fig. 34. 

In the numerical simulation, commercial finite element analysis package 
ANSYS 8.0 is utilized. The substrate is modeled as isotropic and perfectly elastic. 
The nonlinear stress versus strain relationship of the film is modeled using 
multilinear isotropic hardening principle by von Mises plasticity (MISO). The 
nonlinear stress versus strain relationship of the film in the numerical simulation 
is represented by 20 linear segments, which is about accurate enough. 

The bilayer strip is simplified as a two-dimensional model using eight-node 
quadrilateral plane stress elements PLANE82 with two degrees of freedom at each 
node. Due to the symmetry of bilayer strip, only half of it needs to be calculated. 
When the radius of curvature r is much greater than the length, the curvature of the 
bilayer strip can be estimated as 



(108) 


where L is the length and S is the bow of the bilayer at position L as shown in 
Fig. 35. In the simulation, the bilayer strip changes its curvature upon temperature 
fluctuation. The displacement of the node located at the bottom of substrate in Y 
direction in the simulation is defined as the bow (5). Then through Eq. 108, the 
curvature versus temperature variation curve can be obtained. 

The curvature obtained by numerical simulation is plotted in Fig. 34. Using the 
values at the points with symbols in the curvature versus temperature variation 
curve, the stress versus strain relationship of the thin film is obtained from Eqs. 93 
and 96. As shown in Fig. 36, it is identical to the initially assumed one. 
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Fig. 35 Schematic 
illustration of the bow of the 
bilayer 




Fig. 36 Comparison of the stress versus strain curves 


Applications 

Two example applications are included in this section to demonstrate some poten- 
tial applications. First, the thermally induced curvature method for on- wafer deter- 
mination of the Young’s moduli of the three types of metallic thin films (copper, 
aluminum, and silver films deposited atop a (lOO)-type silicon wafer) is applied. 
Next, the approaches and formulas discussed in section “Determination of Stress 
Versus Strain Relationship and Other Thermomechanical Properties of Thin Films” 
are applied to investigate the underlying mechanism of the reversible trenches atop 
NiTi shape memory thin films. 




3118 


Y.Y. Hu and W.M. Huang 


Characterization of Metallic Thin Films 

Thermally induced curvature method is applied to determine the Young’s moduli of 
the three types of metallic thin films, namely, copper (Cu), aluminum (Al), and 
silver (Ag), deposited on a (100) -type silicon wafer. The thicknesses of both Cu and 
Al films are 1 pm, while Ag films have four different thicknesses, ranging from 
0.4-0. 8 to 1.2-1. 6 pm. 


Experimental Procedure 

All metallic thin films were prepared by sputtering at room temperature 
(Coaxial Magnetron Sputter). Commercially obtained 4-in. (lOO)-type silicon 
wafers were used as the substrate for all metallic thin films. Before 
deposition, the Si substrates were ultrasonically cleaned in acetone, methanol, 
and then deionized water, successively. The base pressure of the main 
chamber was 2.0 x 10~ 5 Pa (1.5 x 10 4 mTorr) before deposition. The substrate- 
to-target distance was 100 mm, and argon gas flow rate was 20 seem and the 
working pressure was 0.5 Pa (4 mTorr). The substrate holder was rotated during 
deposition for good uniformity of thin films. The deposition duration varied 
for Ag films. Thus, Ag films with different thicknesses, as listed in Table 5, 
were obtained. 

Since the deposited thin film and silicon substrate are firmly bonded together, the 
mismatch due to their different CTE will result in the deflection of the bilayer 
structure upon temperature fluctuation. A Tencor FLX-2908 laser system was used 
to measure the curvature of the bilayer structure in situ at different temperatures at a 
heating/cooling rate of 1 °C/min for all metallic films. 

For Cu films, the temperature was first increased from 24 °C to 100 °C and then 
decreased down to 24 °C, followed by heating again from 24 °C to 250 °C. For Al 
films, a thermal cycle was from 24 °C to 100 °C and then to 24 °C. For all Ag films, 
a thermal cycle was conducted from 24 °C to 100 °C and then to 24 °C and repeated 
one more time. 

Based on the curvature variation versus temperature (or temperature variation) 
curve and Eq. 93, one can work out the relationship between the film stress and 
temperature. In the same way, film strain can also be derived using Eq. 96. 

Note that curvature variation (A K) should refer to the initial curvature of wafer 
before deposition, and the deposition temperature (room temperature) is taken as 
the reference point for temperature variation (AT). 

Taking film stress and film strain at the deposition temperature as the starting 
point, the relationship between the relative stress and relative strain of the film can 
be established. The Young’s modulus and Poisson’s ratio of silicon wafer used in 
our calculation are 130.2 GPa and 0.28, respectively (Fu et al. 2006). The CTEs 
used in calculation are listed in Table 6. CTE of Si is from Fu et al. (2006). All 
others are taken from Freund and Suresh (2003). It is noteworthy to point out that 
CTEs of metallic films are slightly temperature dependent. So the average values 
within the testing temperature range are used. 
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Table 5 Thicknesses of 
metallic films 


Films 

Cu 

A1 

Ag 

Thickness (pm) 

1 

1 

0.4 

0.8 

1.2 

1.6 


Table 6 CTEs of materials used in calculation 


Materials 

Si 

Cu 

A1 

Ag 

CTE(10 < 7° C) 

2.6 

17.23 

23.98 

19.17 



Temperature (°C) 


Fig. 37 Curvature variation versus temperature relationship of Cu him deposited on Si substrate 
during heating 


Experimental Results and Comparison 

The measured curvature variation versus temperature relationships of Cu and A1 films 
are plotted in Figs. 37 and 38, respectively. Those of Ag films are plotted in Fig. 39. 

Based on these curvature variation versus temperature relationships, using 
Eqs. 93 and 96, both the stress versus temperature relationship and strain versus 
temperature relationship can be worked out. 

The stress evolution and strain evolution in Cu film are plotted in Fig. 40. Upon 
heating, both Cu film and Si substrate tend to expand. Cu film would expand more 
than Si substrate if they were not bonded together since the CTE of Cu is larger than 
that of Si. Now Cu film is unable to expand freely and subjected to the constraint 
from Si substrate because of the bond between them. As such, Cu film is in 
compressive stress state and the compressive stress will increase as the temperature 
increases. As can be seen in Fig. 40, the initial part of the stress-temperature curve 
of Cu film is almost linear from about 24°Cto 150 °C. 

The stress evolution and strain evolution in A1 film are plotted in Fig. 41. The 
stress-temperature curve of A1 film is almost linear within the temperature range 
from 24 °C to 100 °C. The stress in A1 film is also compressive. 


Curvature variation (1/m) o Curvature variation (1/m) 
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Fig. 38 Curvature variation versus temperature relationship of A1 film deposited on Si substrate 
during cooling 
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Fig. 39 Curvature variation versus temperature relationship of Ag film deposited on Si substrate 
during second cooling (the curves obtained in the first cooling process follow almost the same 
curve) 


3121 


85 Thermal Stress Analysis and Characterization of Themo-Mechanical. . . 


500 










Temperature (°C) 


Fig. 40 Stress evolution and strain evolution in Cu film 



Temperature (°C) 


Fig. 41 Stress evolution and strain evolution in A1 film 


The stress evolution and strain evolution in Ag films are plotted in Fig. 42. 
Figures 43, 44, and 45 plot the relative stress versus relative strain relationships of 
Cu, Al, and Ag films, respectively. 

Based on the slope in these stress versus strain relationships, the moduli of films 
which are listed in Table 7 can be obtained. The results of this work are compared 
with others from references. 

As can be seen in Table 7, the moduli of both Cu film and Al film determined 
here are comparable to those reported in the literature. For Ag films, the modulus 
of 0.4-pm Ag film is 65.8 GPa. At a thickness of 0.8 pm, the modulus decreases to 
50.3 GPa. However, in the case of 1.2-pm thick Ag film, the modulus increases 



Stress (MPa) o Stress (MPa) cr 
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Film thickness: 1.6 |um 



Fig. 42 Stress evolution and strain evolution in Ag films 



Relative compressive strain (%) 


Fig. 43 Relative stress versus relative strain relationship of Cu film 


slightly to 55.2 GPa. For 1.6-pm Ag film, the modulus increases further to 75.9 
GPa, which is about the same as the value reported in Zoo et al. (2006). The 
moduli of different thickness Ag films are plotted in Fig. 46. 

In the case of 0.4-pm Ag film, there might be strong nano-effect on the 
modulus, which results in an increase in modulus. As the thickness of Ag film 
increases, the nano-effect becomes weaker. As such, the modulus decreases in 
0.8-pm Ag film. However, with further increase in thickness, the modulus 
increases from 50.3 to 55.2 GPa and to 75.9 GPa, which is comparable to the 
value reported in the literature. 
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Fig. 44 Relative stress versus relative strain relationship of A1 film 


Reversible Trenches atop NiTi Thin Films 

Reversible Trenches atop Shape Memory Alloy Thin Films 

NiTi-based shape memory thin films (NiTiCu) of about 4 pm thick were prepared at 
350 °C by co-sputtering of a 3-in. TiNi target (RF, 400 W) and a 3-in. pure Cu target 
(DC, 2 W) on a standard commercial 4-in. (100) silicon wafer of 450 pm thickness. 
The films were then left in air for several days before postannealed at 450 °C for 1 h 
inside a vacuum chamber (Wu 2009). 

Observation using a temperature-controlled atom force microscope (AFM) 
reveals that there are reversible trenches atop the surface of thin films upon 
thermal cycling. This phenomenon is repeatable even after many thermal 
cycles. Some subsequent tests reveal that a thin oxide layer (Ti0 2 ), tens 
of nanometers thick, is formed on the surface of NiTi-based thin films. This 
Ti0 2 layer is indeed one of the requirements for the trenching phenomenon 
confirmed in Wu (2009). 


Determination of Curvature and Stress in TiO 

In order to investigate the stress evolution in the Ti0 2 layer, curvature test on 
NiTiCu thin film was conducted to measure curvature evolution upon thermal 
cycling using a Tencor FLX-2908 laser system. A typical result is shown in Fig. 47. 

Based on the curvature versus temperature relationship and Eqs. 93 and 94, one 
can find the stress and strain in the NiTiCu film (denoted by Of and Sf 9 respectively). 
The annealing temperature should be taken as the initial temperature. The material 
properties and other parameters used in the analysis are listed in Table 8. 

The contribution of Ti0 2 layer on the stiffness of the whole structure is 
neglectable, since its thickness is much thinner as compared with those of both 
NiTiCu film and Si substrate. Another point, which needs to be noted, is that the 
CTE of NiTiCu film changes during the phase transformation. The CTE must be 
estimated based on the phase fraction experimentally (Wu 2009; Fu et al. 2006). 
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Fig. 45 Relative stress versus relative strain relationship of Ag films 


Table 7 Moduli of films obtained by stress versus strain relationship slope 






Ag 

Films 


Cu 

Al 

0.4 pm 

0.8 pm 

1.2 pm 

1.6 pm 

Modulus (GPa) 
(this work) 

212.3 

91.3 

65.8 

50.3 

55.2 

75.9 

Reference 

(GPa) 

Film 

160. 8 a 
(1.6 pm) 

62 ± 10 b 
(0.4 pm) 

50 ± 10 b (4.8 nm) 


Bulk 

178. l a 

105. 3 b 


76 

C 



a Zhao (2000) 
b Chae et al. (1999) 
c Zoo et al. (2006) 


After the determination of stress and strain in NiTiCu film, the stress in Ti0 2 
layer (a 0 ) can be worked out based on the following relationship (Wu 2009): 


a 


O 


E 0 (5-1 1+e 



a 


o 



AT 


(109) 


where a 0 and cif are CTEs of Ti0 2 layer and NiTiCu film, respectively. S is the 
length ratio of Ti0 2 line over NiTiCu line, which is found experimentally. E 0 is 
biaxial modulus of Ti0 2 layer. 

The stress evolution in Ti0 2 layer is plotted in Fig. 48 together with stress 
evolution in NiTiCu film. 

It is found that the Ti0 2 layer is always under compression during thermal cycle 
and the reversible trenches in NiTiCu film are indeed the result of buckling of Ti0 2 
layer due to large compressive stress. 
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Fig. 46 Moduli of different thickness Ag films 
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Fig. 47 Curvature evolution upon thermal cycling in a NiTiCu film 


Table 8 Material properties and thickness of each layer used in the analysis (Wu 2009) 



CTE 

E (GPa) 

V 

Thickness 

Ti02 

9 x 10 6 /°C 

282.76 

0.28 

10 ~ 20 nm 

NiTi 

Austenite 

11 x 10“ 6 /°C 

83 

0.33 

4.2 pm 

Martensite 

6.6 x 10“ 6 /°C 

20 

Silicon (100) 

2.6 x 10~ 6 /°C 

130.2 

0.28 

450 pm 
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Fig. 48 Stresses in NiTiCu thin film and Ti0 2 layer upon thermal cycling 
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Summary 

This chapter provides a systematical review of the techniques to study of the 
thermally induced stress in multilayer thin films within both the elastic and 
elastic-plastic deformation ranges and several approaches to determine thermome- 
chanical properties of thin films. 

The elastic analysis based on the linear strain assumption actually results in the 
identical closed-form solutions and approximations (for very thin films) as those 
reported in the literature. 

Closed-form solutions are reviewed for fully plastically deformed films. As 
discussed, the difference between pure elastic solution and pure plastic solution is 
that the Young’s modulus (Ef) is replaced by the strain hardening modulus (Hf) if 
both solutions are expressed in terms of the radius of curvature (r). 

It is also revealed that in most of the cases, the film stress within one layer should 
be of the same sign upon heating or cooling. The location of neutral axes can be 
determined by the Young’s modulus and the thickness of the film and substrate. 

The difference among the maximum stress, average stress, and Stoney stress in 
films against the thickness ratio and Young’s modulus ratio of film and substrate is 
systematically reviewed. We can conclude that taken Stoney stress as reference, for 
an error within ±10 %, it is required that the thickness ratio should be about 0.1 or 
less for whatever elastic or elastic-plastic analysis. 

The result of a case study reveals that the yield start point may be estimated as a 
linear function of temperature in the elastic-plastic deformation range. 

The approach for the cases of multilayer thin films is reviewed in section 
“Elastic and Elastic-Plastic Analysis of Multilayer Thin Films.” 
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A simple approach to determine the values of five thermomechanical properties 
of thin films, namely, the Young’s modulus, coefficient of thermal expansion 
(CTE), yield start stress, strain hardening modulus, and Poisson’s ratio, is reviewed 
in details. The approach is based on the conventional curvature test on bilayer 
structures upon temperature variation and can be easily implemented using many 
conventional techniques. Three tests are enough to determine the values of these 
properties. Apart from that, a curvature-based approach, which can determine not 
only the residual stress but also the intrinsic stress and thermal stress in each layer 
of a multilayer structure, is reviewed. 

Some simple and generic approaches for the characterization of thin films with 
nonlinear stress versus strain relationship and/or temperature-dependent material prop- 
erties are reviewed. In the case of very thin films, analytical solutions are available. 

The potential applications of these solutions are demonstrated in the investiga- 
tion of the moduli of Cu, Al, and Ag films. The moduli of Cu and A1 films obtained 
by analysis are comparable to those reported in the literature. The film thickness 
effect on the moduli of Ag films is confirmed. The approach reviewed here is also 
utilized to identify the critical role of a compressive stress in thin Ti0 2 layer atop 
NiTiCu films in the reversible trench phenomenon. 
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Abstract 

The remanufacturing of machinery is a process of disassembly, where parts are 
inspected, repaired, reconditioned or replaced, recertified, and then reassembled to 
“like -new” condition. In the modem manufacturing industry, remanufacturing 
offers the advantages of sustainable energy development, cost savings, and pol- 
lution reduction, among other benefits. Assessment of the remaining useful life 
(RUL) of machinery is key for remanufacturing to determining what components 
can be shut down and when for off-line remanufacturing. Hence, the accuracy of 
RUL online assessment is critical to remanufacturing industry practice. 

This chapter first discusses the concept and technology of remanufacturing 
and then summarizes the modeling of online RUL assessment, with both phys- 
ical and data-driven models, comparing the advantages and disadvantages of 
each technique. Also discussed are two new specific methods in remaining life 
assessment widely cited in the literature: support vector machine (SVM) method 
and state-space method (SSM). Two case studies of machine tooling are illus- 
trated in detail, using examples and data from real industry applications to 
demonstrate RUL assessment. The chapter ends by raising, deliberating, and 
dispelling some practical concerns with online sensing data modeling and RUL 
assessment. 

Introduction 

In 1983, Robert and other researchers defined remanufacturing as a process 
of disassembling end-life machinery to extract key parts or components to inten- 
sively repair and reassemble to “like-new” condition (Goldhor and Robert 1983). 
The performance and life expectancy of the remanufactured part should equal or 
even surpass that of the original parts. Professor Bin-Shi Xu, from the Armored 
Engineering Institute in China (Xu 2006), defined remanufacturing as a general 
designation of all activities and engineering processes in the functional recovery of 
worn electrical and mechanical products by means of modern and advanced 
manufacturing technology, considering the optimization of quality, efficiency, 
cost, environmental impact, etc., of the product’s entire life cycle. Remanufacturing 
is a natural extension of traditional manufacturing, providing for a circular 
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economy through the reuse of materials and energy savings. Remanufacturing has 
emerged as one of the most promising industrial manufacturing fields. 

The remanufacturing industry has grown over the last 30 years, with applications 
including automobiles, engineering machinery, metallurgy, aerospace and defense, 
railroads, and computers. In 1996, the United States had 73,000 remanufacturing 
companies, with annual sales of $53 billion and 480,000 employees (Lund 1996). 
While the annual export value of remanufacturing is roughly equal to the US 
industries of pharmaceuticals ($67 billion), computer manufacturing ($55 billion), 
and steel ($56 billion), total employment (480,000) is higher than these industries, 
suggesting that remanufacturing can not only create enormous wealth but also 
stimulate the economy by increasing employment (Camci and Chinnam 2010). In 
2005, it was projected that US remanufacturing employment would reach 100 mil- 
lion and sales would exceed $100 billion by 2010 (The US Department of Energy 
Office of Industrial Technologies 1996). In 2009, the United States began to address 
the underlying strategic issues of the remanufacturing industry, such as taxes and 
tariffs on remanufactured products, remanufacturing codes and subjects, govern- 
ment procurement programs for priority remanufactured products, remanufacturing 
information systems, and so on (Hatcher et al. 2013). 

Several US companies have made noteworthy advances in the field. Cummins 
Engine Remanufacturing Company, the biggest auto parts industrial manufacturer 
in the United States, has the special registered trademark “ReCon®” for its engine 
remanufacturing. The company’s commitment to achieving equal quality specifi- 
cations for remanufactured parts and engines has been widely accepted in North 
America. Cummins processes more than 50 million pounds (about 22,700 t) of 
remanufactured parts and supplies every year. The resulting energy savings could 
sustain 10,000 US households and reduce gas emissions by 200 million pounds 
(about 90,700 1). Cummins has factories in Brazil, Australia, and China. Caterpillar, 
the largest engineering machinery manufacturing enterprise in America, 
established remanufacturing facilities in 18 countries and regions (Steinhilper 
et al. 2006). The export value of remanufacturing in Caterpillar reached more 
than 51 % of the total company’s revenue in 2009. 

The remanufacturing industry is also well developed in other countries. Since 
2000, the British remanufacturing company Lister Petter has offered more than 
3,000 remanufactured engines per year for the British and US Army (Bryan 2005). 
Since 2004, Volkswagen has remanufactured 7.48 million auto engines and 2.4 
million transmissions (Xu 2010). The scale of Japan’s remanufacturing industry is 
already very impressive. Of the more than 90,000 used units remanufactured in 
2008, 58 % of them were used in Japan, 34 % were exported to other countries, and 
8 % were sold as supplies (Xu et al. 2005). Overall, the remanufacturing industry in 
the United States, Europe, and Japan has formed a relatively self-contained indus- 
trial system regarding technical standards, production technology, process, supply 
chain, and management, covering used parts collection, remanufacturing, market- 
ing, and after-sales service. 
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Key Concepts and Technologies of Remanufacturing 

Remanufacturing involves a series of specialized processes to revitalize old equip- 
ment and fully recover a product’s specifications and functions. The performance 
specifications of remanufactured parts will meet or exceed those of new products 
from traditional manufacturing. Strictly speaking, machinery remanufacturing is an 
industrial process using high-tech repair and transformation towards renewed 
equipment by remanufacturing key components. The remanufacturing process 
focuses on the corrupted or worn parts which are used as cores. Based on the 
analysis of performance failure and life evaluation of the cores, the 
remanufacturing design and process planning will select and perform a series of 
advanced manufacturing processes to recover the full specifications and functions 
of the core parts. Remanufacturing engineering is classified as either the 
remanufacturing of worn parts with remaining useful life left (remanufacturability) 
or the remanufacturing of obsolete products for function updating. 

The theoretical base of remanufacturing engineering includes product evaluation 
for remanufacturability, failure analysis, and product life prediction. These ele- 
ments are briefly described below. 

1. Product evaluation for remanufacturability. Evaluation and design are the 
primary theoretical issues in remanufacturing. Evaluation of remanufacturability 
of a product usually involves a qualitative and quantitative comparison with an 
information database of existing remanufacturing practice to determine the 
remanufacturing process planning (RPP). Remanufacturing information data- 
bases, however, synthesize all remanufacturing process effects, considering both 
on techniques and economics, including nonlinear and multifactorial models. 

2. Failure analysis. Failure mechanisms of different products are often not the 
same. Failure and failure mechanism, identified through failure analysis, will 
determine the remanufacturing process selected in the design of the RPP. 

3. Product life prediction. The part is inspected for defects of fatigue, cracks, and 
stress concentration by means of nondestructive testing technologies, such as 
eddy current testing and magnetic memory testing. The part’s RUL is then 
estimated based on the failure and defect information. 

4. Remanufacturing technology. Remanufacturing technology is a general desig- 
nation of all techniques used to repair and remanufacture worn parts and 
components. The variety of techniques used in different remanufacturing pro- 
cesses (Zhu and Yao 2011) is depicted in Fig. 1. 

Common key technologies adopted in remanufacturing include the following: 

1. Disassembly 

2. Cleaning (including mechanical, physical, chemical, and electrochemical 
methods) 

3. Parts inspection (including penetration, magnetic particle, ultrasonic, eddy cur- 
rent, and other nondestructive testing) 
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Fig. 1 The general processes adopted in remanufacturing 

4. Parts remanufacturing process (main surface processing, such as nanocomposite 
brush plating, high-efficiency ultrasonic plasma coating, and automatic high- 
velocity arc spraying, and remanufacture machining techniques, such as turning, 
grinding towards remanufacture coating, and nontraditional machining 
techniques) 

5. Remanufacturing assembling 

6. Running-in experimental techniques 

7. Remanufacturing coating for part surface process 

8 . Information-based remanufacturing 

There are many other processes and specific techniques applied for 
remanufacturing. Each has its own advantages and disadvantages in application. 


Environmental Impact and Energy Sustainability 

A manufacturing process is conceptually an EPP/value- added process of translating 
resources into products. Its major side effects are resource consumption and 
environmental impact. Remanufacturing, on the other hand, has demonstrated 
contributions to environmental protection and energy sustainability. 

The aspects of the manufacturing process most harmful to the environment are 
raw material acquisition and processing side effects. Complex techniques for 
processing metal and nonmetallic materials in production (such as mining, 
sintering, and smelting) produce significant air and water pollutants, as well as 
solid wastes. Similarly, the main technologies in manufacturing (casting, forging, 
heat treatment, machining, etc.) generate a lot of fumes and harmful gasses (such as 
C0 2 and S0 2 ) as well as heavy metal pollutants. In addition, energy consumption in 
manufacturing a product’s metal core accounts for nearly 10 % of the entire 
production process, most of which is nonrenewable resources, such as metal 
(iron, aluminum, etc.) and chemical fuel. Meanwhile, the primary technologies 
used in remanufacturing (dismantling, sorting, cleaning, inspection, etc.) produce 
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Fig. 2 The bathtub curve of 
product failure 



significantly less pollution and consume less energy. Studies show that 
remanufacturing of a product, rather than making a new one, will save 50 % in 
total cost, 60 % in manufacturing energy consumption, 70 % in materials, and 
reduce pollution by up to 80 % (Zhu and Yao 2011). 


Remaining Useful Life (RUL) 

There are three distinct failure periods in the technical life of a product: early, 
accidental, and exhaustion. As Fig. 2 depicts, each of these periods can be depicted 
by the “bathtub curve,” and the typical remanufacturing time point is located at the 
end of the second period of the product. 

1 . The Early Failure Period 

This occurs in the stages of product adjustment and commissioning 
during preproduction when product failures are more common and the failure 
rate is higher. With more run-ins and exclusion of failures, the failure 
rate decreases gradually until a stabilized state is achieved. Forms of product 
failure in the early failure period not only reflect the technical quality of product 
design, manufacturing, and installation but also associate with adjustment 
and operation. In this period, products are often under warranty, so the users 
can return the product if there is an unrecoverable failure that needs to be 
repaired. 

2. The Accidental Failure Period 

The early failure stage is followed by the lowest failure rate period. Occa- 
sional failure occurs during normal device operation, when the failure rate of the 
product decreases to a basic constant, but these are randomly occurring 
unpredictable failures without influence from run-time. This is the best working 
time for products and is also known as the useful life of the equipment. At this 
stage, the product and all its parts can maintain long-lasting good working 
condition, according to design requirements. Failure in this period is mainly 
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accidental (such as excessive stress). In cases of accidental retirement, mass 
remanufacture is usually carried out at the end of the occasional failure period. 
3. The Exhaustion Failure Period 

This occurs in the later period of product use. Due to mechanical wear, 
chemical corrosion, and changes in physical properties, the product fault rate 
begins to increase. After the second long stable stage, the product failure rate 
increases rapidly, due primarily to excessive wear and fatigue of parts and 
materials. Remanufacturing is difficult or even unfeasible in this period (less 
remanufacturability) . 

Remanufacturing timing refers to the selection of time to shut down the machin- 
ery for remanufacturing. Remanufacturing is an important means to eliminate 
product wear, both tangible and intangible. Choosing the right time for 
remanufacturing and understanding which components of the machinery need to 
be remanufactured is essential. 


Remanufacturing Philosophy, Technology, and Process 
Key Processes in Machinery Remanufacturing 

For decades, the remanufacturing method has been developed and perfected in 
Europe and the United States. The technical features of this remanufacturing 
method are as follows: (1) replace directly the excessively damaged parts with 
new parts and (2) for the lightly damage parts, change the size of the parts and 
restore the geometric accuracy by turning, grinding, boring, and preforming other 
machining operations then match up the parts with nonstandard new parts with 
increasing size. The largest engine remanufacturing enterprises in England and 
America, Liszt Pat and Cummins, both make use of this remanufacturing method. 
Also the remanufacturing process is usually used for mild surface injury and rarely 
for parts where the surface is severely damaged. To this end, China have developed 
the “surface restoration and performance improvement method” as the primary 
process of remanufacturing technology, named “incremental restoration” method, 
since 1999 when the remanufacturing concept was first formally proposed 
(National Natural Science Foundation of Engineering and Materials Science 
Department 2010). Nondestructive testing theory and technology, surface engineer- 
ing theory and technology, and clad forming theory and technology were first 
adopted in remanufacturing process of “surface restoration and performance 
improvement method.” Remanufacturing has advantages when repairing the size 
tolerance of the damaged part, while consuming less materials with more control by 
using new materials with better wear resistance, corrosion, and fatigue resistance, 
resulting in the improvement of the overall performance of the remanufactured 
parts, as compared to new parts from traditional manufacturing. 

However, there are some major difficulties in the prediction of remanufactured 
product life and safety in the “incremental restoration” method: 
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1. The surface coatings of remanufactured parts have characteristics of complex 
composition and highly disordered structure, which result in high free-energy 
level and multiple interfaces level. 

2. The remanufactured parts have characteristics of a binary and multiphase mate- 
rial system, which make the life evolution very complex and of nonlinear 
dynamics. 

3. The remanufactured parts are of uncertain and uncontrollable quality, and the 
remanufacturing critical timing has diverse selection factors and a diverse target. 


Research on the surface/interface effects and mechanism of the remanufactured 
parts under the designed working conditions reveals the law of the competitive life 
evolution of the remanufactured parts with high free-energy level and the multiple 
interfaces level associated the competition. Understanding the failure mechanisms 
and development of the remanufactured products, under the interactions of the 
strong physical fields such as power, electricity, magnetism, heat, and stochastic 
load, are major scientific challenges to the study of life and service security 
prediction of the remanufactured products. The scientific challenges also include 
how to predict the service life of the remanufactured products accurately and ensure 
the security while in the service. The characteristics of the multi-interface level and 
the reaction of the remanufactured products by “incremental restoration” method 
make the life evolution highly complex. 

Remanufacturing is an interdiscipline of machinery engineering, material engi- 
neering, mechanical engineering, maintenance engineering, surface engineering, 
and environmental protection engineering. Quality control of the remanufacturing 
makes sure the remanufactured products have the same specifications, functions, 
and service life as the new products. Essential to the theoretical basis of the 
remanufacturing engineering is the evaluation for remanufacturability, failure 
analysis, and lifetime prediction. In the United States, Europe, and Japan, the 
remanufacturing industry has formed a relatively complete industrial system from 
technical standards, manufacturing technology, and processing equipment to the 
recycling of waste products, product sales and after-sales service, and so on. 

New manufactured products from raw materials have uniformed qualities and 
standard process planning, while the used products in remanufacturing have differ- 
ent degrees of damage with diverse RUL, namely, the old parts which have already 
served for a life cycle. Therefore, the remanufacturing engineering cannot be 
achieved until the following fundamental problems are solved: 


1. The uncertainty of damage (wear, corrosion, fatigue, etc.) behavior and mech- 
anism for remanufacturing object are determined. 

Various types of cross-scale damage will occur, such as wear, corrosion, and 
fatigue, in an object in a long-term service process. The objects’ damage level 
determines the remaining useful life of the part and whether it can be 
remanufactured. 

2. The surface bond-dissociation/dissociation issues such as oil layer, scale layer, 
rust layer, and diffusion layer of the hardened layer are all identified. 
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The surface and subsurface of the remanufacturing blank tend to form a 
contamination layer or degeneration layer with submicron or even nanometer 
level when in a complex service environment with loads. In order to meet the 
requirements of the remanufacturing bonding, dissociating the contamination 
layers, removing the denatured layer, and recovering the function of the surface 
should be implemented. 

3. Reprocessing or forming problems of damaged portion. 

Reforming technology, another key remanufacturing technology, is a 
material -forming technology used to recover the parts’ original size and 
recover or even enhance its service properties and functions. It involves 
using three-dimensional remanufacturing technology and process development 
on the damaged parts according to the remanufacturing precision forming 
technology and processes. This includes studying noncontact 3D scanning 
reverse measurements, applying the calibration method on each subsystem 
and applying the modeling method of remanufacturing, using the method of 
stratification for space surface, forming path planning, studying deformation 
mechanisms and laws, and controlling form in the process of reshaping the 
remanufactured parts. 

4. The evolution of life and service security issues of remanufactured parts and 
products in a new service cycle. 

Life test and remaining useful life (RUL) assessment of the remanufactured 
parts is key to mastering their quality status and service security when they 
return into service. The failure mechanism should be discovered by system 
physical dynamics and model and life prediction method with commonality 
and guiding significance extracted. Compared to the new parts, the characteris- 
tics of the remanufactured parts are more complex because of the system of dual 
multiphase metastable materials. The life evolution of remanufactured products 
is highly complex and nonlinear that is caused by the complexity of surface 
components, high structural disorder, and high free-energy level and multi- 
interfacial properties, leading to a life assessment and security predication 
difficulty. 

5. Energy consumption assessment and low-carbon manufacturing in 
remanufacturing . 

Assessment of energy consumption and low-carbon manufacturing in the 
remanufacturing is also an essential study. Based on theory and experimentation, 
energy assessment methods for typical remanufacturing processes proposed can 
realize low-carbon remanufacturing optimization. To this end, data inventory of 
resource consumption, environmental emissions, and energy efficiency need to 
be established as a theoretical antithesis for establishing standards for those 
manufacturing processes. 

Despite current efforts to solve these problems, the essential basic theory 
which is regarded as the bottleneck restricting the development of machinery 
and equipment remanufacturing industry has not yet been conquered. Thus, 
there is a significant petition to carry out basic research on remanufacturing of 
machinery and equipment. 
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Challenging in Remanufacture Engineering 


The fundamental research of machinery remanufacturing involves efforts in the 
disciplines of machinery, materials, mechanics, mathematics, physics, chemistry, 
environment, and many others. As a nontraditional manufacturing technology, 
remanufacturing is faced with many new challenges (Xu and Zhang 1999). 

The most important issue for remanufacturing, however, is having an effective 
assessment of the remanufacturability (the possibility to be remanufactured). Dur- 
ing service, some key machinery components or parts always undergo a variety of 
damages, after a certain usage due to alternating thermal/mechanical loads, such as 
macro-/microscale abrasion, corrosion, big cracks fatigue, microscopic, submicro- 
scopic scale micro-crack fatigue, micro-cavity creep, tiny pores, submicroscopic, 
nanoscale lattice distortion, and so on. The complexity of the damage mechanism, 
diversity of damage behavior, and uncertainty about the damage consequence in the 
service process of the machinery create extreme difficulty in the evaluation of the 
remanufacturability. The objective is to assess whether the remanufacturing will 
result in an optimal service renewal, according to a comprehensive quantitative 
evaluation of the effects of technology, economics, energy sustainability, environ- 
mental impact, etc., or to assess whether the components are able to serve for 
another life cycle after the remanufacturing. The major difficulties faced by 
remanufacturability assessment of used parts and components are as follows: 
(1) the multi-coupling cross fields (force field, thermal field, materials, and other 
organizational fields) make the component in service have a strong correlation with 
fatigue, wear, and corrosion damage; (2) the damage (including internal damage) 
mechanism and evolution pattern of component to remanufacture both have inher- 
ent multiscale features, involving a coupling effect and a correlation among differ- 
ent physical mechanisms ranging from nanoscale and submicroscopic scale to the 
macroscopic scale; (3) the damage characteristics of components selected to 
remanufacture vary with service conditions and the damage behavior and develop- 
ment are highly uncertain. In order to effectively evaluate the remanufacturability 
of the part to be remanufactured, there are several basic technological problems that 
must be studied. Fully understanding the mechanism and evolution of the cross- 
scale impact and the damage due to structure, materials, variable loads, etc., is 
necessary. The challenge is how to establish the quantitative model of the surface 
change and the relationship with the internal damage development of the part. Since 
an implicit damage is the incentive for crack initiation and tiny damages are the 
source for crack propagation, it is extremely difficult to give a clear cross-scale 
damage evolution and to establish a quantitative model. Another difficult point is 
how to determine a critical threshold of remanufacturability index for the part 
remanufacturing. The crux of which is the correlative mechanism between the 
remaining life of the part and the subsequent remanufacturing processes to 
undertake. 

The essential issue for basic remanufacturing research is the accurate control of 
shape and properties during the process of bond dissociation/bonding of the blank, 
namely, how to perform an accurate controlled remanufacturing process. The main 
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difficulties of bond dissociation/bonding include (1) the complexity of the compo- 
sition of the remanufacturing blank’s contamination layer and the difficulty 
of efficient, environment-compatible bond dissociation/dissociation brought by 
the variety of combination mechanism of contamination layer and the blank 
surface; (2) complexity of the performance control mechanism resulting from the 
diversity of scale, structure, and composition of the remanufacturing blank’s 
denatured layer; (3) the interaction and diffusion between atoms occurring in 
heterogeneous material caused by transient effects of an energy beam during the 
bonding process, which leads to significant influence on the temperature field and 
stress field of remanufactured parts forming layer, introducing qualitative control 
issues for the high-energy beam. 

Another key issue of remanufacturing research is the reliability property of 
remanufactured product, namely, the life reliability function. Remanufactured 
parts are constituted by the binary material system of blank and forming layer, 
which consists of multiphase structures including the matrix phase, the hard 
dispersion strengthening phase, antifriction solid lubricant phase, and corrosion 
phase. The material -forming layers not only present multicomponent composition 
but also show a high disturbance of organizational structures, with an associative 
competitive effect between high free-energy layer and the multi-interface layer. A 
high free-energy layer can improve the wear resistance, while a multi-interface 
layer tends to induce micro-crack initiation and reduce fatigue life, presenting an 
overall performance of nonlinearity of life evolution and the complexity of life 
prediction of remanufactured parts. In addition, the nonlinear dynamic behavior 
during the service of remanufactured products; the influence mechanism of local 
filling for inherent service characteristics, such as dynamics of the remanufactured 
parts; and the correlation rule between remanufactured parts failures are all emer- 
gent theoretical difficulties to face. 

Prediction of service safety and life assessment of remanufactured products face 
the following technical difficulties: (1) remanufactured parts surface coating has a 
complex composition and structural characteristics of high disorder, namely, that 
parts have a high free-energy layer and multi-interface layer; (2) remanufactured 
parts possess characteristics of a material system with binary and multiphase levels, 
whose life evolution presents high nonlinearity; and (3) the status and remanufac- 
turability of products to be remanufactured are uncertain and all kinds and sorts; in 
addition the factors and objectives for selection of a critical time point for active 
remanufacturing are highly variable. Research on surface/interface effects and its 
mechanism of in-service remanufactured parts reveals the competitive life evolu- 
tion rule of remanufactured parts under the associated competition between the high 
free-energy and the multi-interface layers. The main concern lies in how to accu- 
rately predict remanufactured products’ life and to ensure its service reliability. The 
multi-interface layer of remanufactured products has made its life evolution 
become highly complex. How to build a remanufacturing-oriented full life cycle 
assessment and decision-making criteria for remanufacturing engineering and 
remanufactured part evaluation system is paramount for the remanufacturing 
practice. 
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RUL Assessment for the Time Point to Remanufacture 
Product Lifetime Defined 

In general, according to the current performance state of the actual product in 
service, there are several concepts of the assessment of the product life (Zhu and 

Yao 2009): 

1. Physical life refers to the time period from the beginning to the malfunction 
which prevents the products that operate under the normal conditions from 
maintaining the prescribed functions caused by tangible wear, also known as 
the natural life. The physical life of the product can be extended through the 
normal use, maintenance, and repair of the product. 

2. Technical life is the time period from the time when the product is put into use to 
the time when the product cannot equal to normal use because of the second 
form of invisible wear leading to the backward function. The current rapid 
development of technology promotes the faster technological upgrading, espe- 
cially the electronic products field. By upgrading through the remanufacturing, 
the technical performance of products can be further improved, thus extending 
the technical life of the product. 

3. Economic life refers to the time period from the time when the product is put into 
service to the abortion of product due to the economic weigh results. Once the 
product reaches the extent beyond economic worth, the product must be 
scrapped. 

4. Environmental life refers to the time period from the time when the product is 
put into use to the service end of product due to the product itself or the violation 
of new environmental regulations. 

5. Preference life refers to the time period from the time when the product is put 
into service to the product abortion because of the user’s personal preferences. 

From the above, the physical life is the longest one among the lifetimes defined, 
which is usually expressed as the product life. The retired products that have not 
reached the physical life because of technical, economic, environmental, and 
personal preferences could extend the product life through the corresponding 
remanufacture processes, according to the RUL at the retiring stage of the products. 
The time to retire for remanufacturing depends on RUL and also is elaborated from 
technical, economic, environmental, and physical life aspects. 


Time for Remanufacturing Based on Comprehensive Evaluation 

In recent years, when making a decision on part remanufacturing time, more and 
more scholars consider comprehensively evaluating the changes of product perfor- 
mance from environmental, economic, and technical aspects in order to have an 
optimal time to retire for remanufacturing in favor of environmental protection, 
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maximization of economic benefits, as well as the most technical feasibility (Rao 
and Padmanabhan 2010). 

From an environmental perspective, the method used to analyze the environ- 
mental impacts for products is the full life cycle assessment (LCA). Life cycle 
assessment is a methodology to assess environmental impacts associated with all 
the stages of a product’s life from cradle to grave (i.e., from raw material extraction 
to materials processing, manufacture, distribution, use, repair and maintenance, and 
the end-of-life disposal or recycling). A proper remanufacturing timing schedule 
can help spare raw materials, save energy, reduce pollution, etc. There is an 
optimization apprehension existing. 

From a technical perspective, time to retire for remanufacturing can be determined 
from the feasibility and technological easiness for the process of remanufacturing. 
The feasibility and easiness are in regard to disassembly, cleaning, failure detection 
and classification, remanufacturing process planning, reassembly, etc. It mainly 
focuses on ensuring that the product can be advanced through the use of modem 
manufacturing means, information system, CNC, and automation technology to 
achieve its functional recovery and performance improvement. 

From the viewpoint of economics, LCC (life cycle cost) can be used to analyze 
the economic benefits of products. LCC refers to the total cost paid for the purchase 
and maintenance of a system or device in the whole life cycle. The analysis is 
mainly based on the existing technology and technical level to get full use of its 
added value to obtain the economic benefits. If the higher the added value and the 
lower the manufacturing cost, then the better economic feasibility and the higher 
benefit the remanufacturing will be achieved. So choosing the right 
remanufacturing time point can help realize the maximum economic benefits. 

In order to have a comprehensive decision-making program that covers envi- 
ronment, technology, and total cost, the corresponding evaluation criteria in these 
three areas are necessary to be predesigned, and then the corresponding weight 
(index) can be calculated. The analytic hierarchy process (AHP) is a mathematical 
method for analyzing complex decision problems with multiple criteria. Users of 
the AHP usually decompose their decision problem into a hierarchy of more easily 
comprehended subproblems, such as objectives, criterions, and strategies, based on 
which the qualitative as well as quantitative analysis is carried out to make a 
comparison among alternatives. By making the comparisons, the decision makers 
use their judgments about the elements’ relative meaning and importance, and a 
numerical weight or priority is derived for each element of the hierarchy, which 
provides an effective way to solving complex problems which cannot be solved by 
analytical method in a closed form. 


Time for Remanufacturing Based on Physical Life 

Currently, based on the analysis of the physical life of the product, the determina- 
tion of the time to retire for remanufacturing can be divided into the following 
categories: 
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1 . Remanufacturing at the end of the physical life 

Remanufacturing is selected at the time when the product reaches the end of the 
physical life. Aiming at restoring the performance of the original product, the 
remanufacturing will take after the mechanical products have retired. In order to 
timely complete remanufacturing process and back to service, manpower, tools, 
and spare parts must be fully prepared and available. This is one of the most 
frequently used for remanufacturing time decision, mainly happening after the 
product decommissioning. However, with very high reliability, the life of most 
parts at decommissioning is still in the second stage of the life curve, and they often 
can still be used for a new product life cycle. Therefore, before the retiring for 
remanufacturing, it is necessary to assess the current reliability status and the RUL 
of the functioning part to make the decision on retirement or another usage cycle. 
For those of scrapped parts, displacement and part remanufacturing are obligatory. 

2. Regular remanufacturing scheduled on usage 

Regular remanufacturing belongs to one form of preventive remanufacturing 
strategies, which determines remanufacturing based on service intervals 
according to the product’s usage that is regarded as the time restriction. Once 
it reaches the predetermined time period, remanufacturing will be performed 
regardless of the state of the part performance in the service. This is a mandatory 
prevention remanufacturing (or repairing). Regular remanufacturing allows the 
part to be remanufactured before its end life, thus reducing the waste and 
improving production efficiency. It is an effective way to prevent failures and 
to enhance the reliability. However, this approach mainly bases on the wear law 
for maintenance scheduling; fails to take into account the failure forming and 
development; and lacks flexibility for considering of part actual working 
situation. 

3 . Condition-based remanufacturing 

It refers to the remanufacturing decision during the service of the product, which 
determines the time to retire and the processes to take in remanufacturing based on 
the part performance evaluated from online monitoring and diagnostic. A decision 
has to be made when the online performance monitored cannot meet the requirement 
for performance reliability and quality. Online monitoring includes sensing and data 
collection of performance condition, working status, trends, and other physical vari- 
ables, all of which are conducted online and routinely implemented. The advantages 
of condition-based maintenance (CBM) are that it can fully develop the potential of 
products, improve the effectiveness of the prevention of damage to the product, and 
reduce extra costs in remanufacturing and human error. But it requires a system for 
online data acquisition and modeling and the reliability model for the forecasting. 

4. Irregular retired opportunity remanufacturing 

Opportunity remanufacturing mainly aims at environmental scrapped, prefer- 
ential scrapped products or damaged ones due to accidents. Such remanufacturing 
process mainly occurs in the case of abnormal retired products or the period during 
which the product is in use, but has not yet reached the end of life. Opportunity 
remanufacturing is an effective process when condition-based remanufacturing 
and periodic remanufacturing occur simultaneously. For example, the new car 
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engine, which may run into malfunction due to material failure at the early stage of 
use, cannot be repaired, only to be returned for remanufacturing. Also the irrep- 
arable fault due to abnormal causes will lead to the abnormal product retirement, 
thus leaving the only choice for remanufacturing. 


Currently due to improved detection methods and techniques, diagnostic 
accuracy has been highly improved. In order to maximize the utility value of 
scrap parts, condition-based remanufacturing gets more and more attention. As 
the basis for the implementation of remanufacturing, the remaining life of the 
product becomes an un-negligible problem with respects to time to retire for 
remanufacturing. The prognosis of the remaining life of a product can be divided 
into two categories, according to real-time detection information. (1) For com- 
ponents with failure data readily available, the first step is to determine the 
lifetime distribution model according to the historical failure data, and then 
determine the unknown model parameters based on real-time monitoring of the 
data, and finally calculate the remaining service life and reliability. L R = L M — t , 
where L R is the residual useful life, L M is the useful life time, and t is the current 


OC 

r* 


time that product has served. The reliability at time t is R(t) 


f(t)dt , where /(f) 


t 


is the failure probability density function. (2) For components without historical 
failure data, online monitoring data will be used to estimate parameters of the 
mathematical state model which is modeling the performance state. The reliability 
or the system failure can be defined in terms of the performance state - the probability 
that performance state will not exceed a predefined threshold (in a smaller is better 
case). In recent decades, RUL assessment-based remanufacturing process planning/ 
design has become a hot research topic (Zhang et al. 2013). The main idea is the time 
to retire for remanufacturing of a part is determined by the remaining useful life of the 
part and the expected performance reliability. The scheme for time to retire for 
remanufacturing based on reliability is shown in Fig. 3. 


Fig. 3 The concept of the 
time to retire for 
remanufacturing based on 
reliability assessment 
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Refer to Fig. 3, where R(t 0, R , and R(t 2 ) are for estimated reliabilities of 
performance at times t u T, t 2 . When the reliability value at the current moment t\ 
is larger than the predefined threshold R , that is, t\ < T, R(t i) > R* as shown in the 
figure, and the reliability at the future moment t 2 , estimated by online data and the 
state model, is less than the reliability threshold value, that is, t 2 > T, R(t 2 ) < R*, 
and remanufacturing of the components is needed to increase the reliability. 
However, when both the reliability value at the current time and the estimated 
reliability of the future time are larger than the threshold, the components can be 
continuously in service without parts remanufacturing. When the reliability value at 
the current time and the reliability at the future time are both less than the 
predefined reliability threshold, processing costs and technical feasibility analysis 
need to be further carried out to determine whether the parts are to be 
remanufactured or to be recycled. 

Online deciding the time for a part to retire for manufacturing requires a 
comprehensive model and analysis. Based on the above two types of methods 
discussed, a comprehensive assessment procedure has been proposed to decide 
the optimal time for the part to retire for remanufacturing based on the resultant 
evaluation on cost, performance, technical process, energy sustainability, and 
environmental impact. The flowchart is shown in Fig. 4. 


RUL and the Mathematical Model 

Security issues have been arousing worldwide concern nowadays, since the system 
failure always causes severe accidents, leading to significant economic losses. 
System safety is of great importance for civil and defense security in many areas, 
such as electric power, petrochemical processing, metallurgy, aerospace, weapons, 
vehicles, and other major or important mechanical products and infrastructure. 
Timely and accurate prognostic of the remaining useful life of key components 
for the system can not only reduce accidents and losses but also provide the 
information for preventive maintenance, decision for remanufacturing, and 
remanufacturing process planning. Effective remaining useful life modeling and 
prediction and the online performance reliability assessment have been catching 
more and more research attention. There are two major methods for the remaining 
life modeling - physical model-based method and data-driven statistical method. 
The following describes the development status and trends about these two models. 


Physical Model-Based Method 


It is the life model based on theoretical formulas of physical mechanics and 
dynamics. It is an important research direction for life prediction. From the initial 
stress-based approach to assuming that fatigue life depends on the development 
process of crack initiation and expansion till fracture, the physical model for life 
prediction has gained a long-term development with the application of fracture 
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Fig. 4 The flowchart of the comprehensive assessment for the decision on the time to remanufacture 
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mechanics and crack propagation theory methods. From the 1850s to 1860s, the 
concepts of S-N (stress-rpm) curve and fatigue limit were first introduced by the 
German engineer August Wohler (1867) in his study of damaging fatigue test of 
rail vehicle axle. Later, effects of average stress on life got research attention. 
Americans Paris and Erdogan (1961) proposed the laws of crack spreading, i.e., 
Paris law, using fracture mechanics method in 1963. Paris law provided a new 
method for the fatigue life estimation, developed the “damage tolerance design,” 
and made an amazing breakthrough of the study of fatigue in theory. Then, 
researchers made more meaningful research based on Paris’s work. Ray and 
Tangirala (1996) put forward a nonlinear stochastic model of crack propagation, 
estimating crack length, crack growth rate, and residual life by the measured 
sensor signals. 

The formula about the equivalent theoretical fatigue limit and stress fatigue life 
was derived based on fatigue limit of room temperature by Xiaohua Yang (1998) 
and modified by introducing frequency factor to predict the fatigue life of 2Crl3 
steel at different frequencies. Through fracture mechanics analysis, Feng Zhang 
(1997) estimated the wheel damage tolerance crack size and durability and also 
analyzed the causes of fatigue damage based on Hertz theory. Xiaoyang Zheng 
(1999) proposed a new cumulative damage rate method to estimate fatigue life 
under spectrum loading. Based on the statistical analysis on constant amplitude 
fatigue, logarithmic probability density function of fatigue life under any stress 
condition was derived; thus, the components’ fatigue life under fatigue load spec- 
trum can be estimated with the help of the cumulative failure probability. 

Xiulin Zheng (1994) proposed that the size of load spectrum can be represented 
by overloading caused by maximum load in the load spectrum. Miner’s rule could 
be applied to predicting the cumulative fatigue damage value of the fatigue crack 
initiation (FCI) of elements under variable amplitude loading after the load inter- 
action. Miner’s rule can also be used to predict the fatigue life and the critical value 
of cumulative fatigue damage under variable amplitude loading. 

Glodez (2002) presented a method to calculate the stress cycle numbers needed 
for crack initiation by using the strain-life method. Oppenheimer and Loparo (2002) 
established a physics-based approach for rotor crack diagnostics and prognostics 
using integrated observers and life models. The observers could be used to estimate 
the length of the crack, assuming that the velocity and forces on end shafts could be 
calculated with measured data according to the dynamic model of rotor. And a life 
model based on the Forman crack growth law of linear elastic fracture mechanics 
was developed to determine the remaining cycle numbers of the shaft, until a failure 
occurs. Table 1 shows the comparison and analysis of advantages and disadvan- 
tages among all kinds of physical approaches according to the developing sequence 
of remaining life prediction (Lee et al. 2014). 

In recent years, more in-depth experimental research on physical mechanics 
methods for life prediction is conducted. The new methods are featured in deep 
understanding and development of fatigue theory and comprehensive modeling. 
Experimental data are expected to be more approximated to the actual situation and 
specialized research achievements on vital structures or components are gotten, 
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Table 1 The comparison of physical model-based methods 


Approach 

Advantage 

Disadvantage 

Application 

Life 

prediction 

Simple and easy to 
analyze 

Low accuracy 

The initial design 
estimation 

based on 
stress 

Parameters required 
are few and easy to get 

Poor adaptability for 
the parameters depends 
on geometry and 
loading forms 

Long-life components 
such as the spring shaft 
and gears and other 
high-strength materials 


Rich data sources with 
accumulated 
experience can be 
obtained 

Unable to analyze the 
gap effect without 
considering crack 

Whole-life analysis 
combined with linear 
elastic fracture 
mechanics 

Life 

prediction 

based on 
• 

Direct access to the 
strain parameters by 
means of measurement 

Complex calculations 

Insufficient gap 
analysis 

Fatigue test with strain 
as control condition 

strain 

Able to conduct 
notched fatigue 
analysis 

Only the crack 
initiation is considered 

Situation of high 
temperature, large 
strain, and high stress 
concentration 


Able to judge the 
impact of the loads 
order 

Able to express the 
cyclic stress-strain 
response 


Components with less 
load cycles, large 
plasticity, such as 
low- strength structural 
steel 


Conducive to fatigue - 
creep mixture analysis 


Whole-life analysis 
combined with linear 
elastic fracture 
mechanics 

Life 

prediction 
based on 
accumulative 
fatigue 
damage 

The impact of actual 
magnitude and the 
order of loads has been 
taken into 
consideration 

Only parts of the 
influence factors have 
been considered; failed 
to conduct a 
comprehensive analysis 
of the complexity of life 
prediction 

The material and 
mechanical parts 
subjected to cyclic 
loading 


Matured and widely 
used 

Narrow scope of 
application 

More used in 
engineering 

Life 

prediction 
based on 
fracture 

The mechanism of the 
crack propagation can 
be physically 
interpreted since the 

Difficult to measure and 
estimate the initial 
crack size; no research 
on the initial crack 

Large and important 
parts and structures, 
such as aircraft and 
nuclear reactors 

mechanics 

crack propagation is 
taken into 
consideration 

Uneasy to calculate the 
stress intensity factor of 
the components with 
complex geometry 

Metal materials with 
metallurgical defects in 
themselves and 
components with pores, 


Able to control the 
initial damage, the 
examination period 

Elastic-plastic fracture 
mechanics is used since 
linear elastic fracture 
mechanics can hardly 

slag, and weld defects 
created in welding 
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Table 1 (continued) 


Approach 

Advantage 

Disadvantage 

Application 


and working loads, 
etc., to ensure safety 

meet requirements in 
general cases 


Life 

Good agreement with 

Complex calculation 

Only applied for some 

prediction 

the fatigue mechanism 

and analysis 

metal materials, 

based on 

in experimental 


composite materials, 

damage 

observations 


and concrete materials 

mechanics 

Easy to measure the 
fatigue damage 

Insufficient research on 
damage mechanics of 
some materials and 
components 


Life 

Unified representation 

Insufficient research 

Composite panels of 

prediction 
based on 
energy 

and strong universality 


composite tissue, such 
as alloy laminate, 
coating structure 


Allowed to take the 
mean stress and the 
impact of 

multidirectional load 
into account 

Few applications 

Piezoelectric material 
and the composites 


which thereby greatly expand the scope of the study, leading to more specific 
research results. However, due to the high degree of complexity in the theory of 
the life prediction method based on physical mechanics and dynamics, the model- 
ing becomes more difficult. On the one hand, modelers’ professional backgrounds 
are highly demanded; on the other hand, the model’s prediction error highly 
increases as the increasing of the model complexity. Therefore, the prediction 
based on the theory and physical models hardly meets the life research for 
remanufacturing in spatial state. 

Lifetime prediction in terms of new equipment’s working conditions can help 
formulate a set of reasonable inspection and maintenance standards, enabling the 
equipment to operate at its best condition and to achieve its secured service life 
extension. The assessment process for remanufactured products’ remaining life at a 
given time can be expressed as follows: (1) build a three-dimensional solid model 
according to its geometry, size, and/or the assembly state of products; (2) acquire 
performance data of materials through empirical or experimental data, such as 
material physical performance parameters, fatigue properties, and S-N curve; 

(3) based on steps 1 and 2, conduct the finite element stress and strain analysis; 

(4) make a second service load spectrum analysis based on historical records and the 
measured data of the product, such as remanufacturing residual stress and secondary 
assembly stress; (5) given environmental factors, establish a cumulative damage 
model for life prediction. Because the initial conditions of components will change 
after servings, the model and the prediction for parts in traditional manufacturing 
have difficulties without knowing the changed initial conditions for remanufactured 
parts. Thus, modeling of the initial conditions of remanufactured products plays an 
essential role in the remaining life modeling and prediction. 
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Data-Driven Stochastic Model Method 


Analytical method based on probability statistics is a traditional reliability theory, 
mostly regarding lifetime information as objects for statistical analysis. Researchers 
first get failure data through a large number of tests and then use statistical criterion 
to select the most appropriate statistical distribution model to obtain lifetime 
distribution. Stribeck, R. (1907) used a probabilistic approach to predict the life 
of mechanical parts in 1924. Based on the test results, they suggested that the 
acceptable life should be defined as the lifetime when the sample parts have 10 % 
failure or 90 % survival left. 

Goode noted that the failure process of many devices (such as hot rolling mill 
pump) could be divided into stationary and nonstationary process (Goode 
et al. 2000). Also he proposed statistical control method to distinguish the 
two processes. In the stationary process, life could be predicted by using the 
reliability data (the length distribution of the steady and non- steady -state process). 
While in the nonstationary process, assuming the device’s status monitoring 
data grows exponentially, and using condition monitoring data and reliability data 
together to predict the life. Subsequently, autoregressive moving average 
model (ARMA) and autoregressive integrated moving average model (ARIMA) 
were widely used in time series modeling and predicting. Yan (2004) recommend 
using logistic regression to establish relationships between characteristic variables 
and failure probability of the device, in the meantime suggested also 
applying ARMA model to predict the characteristic variables, and then 
using the trained logistic model for residual life prediction with the predicted 
value input. 

Volk recommended proportional intensity model (PIM) for life prediction and 
evaluating the effect of preventive maintenance (Volk et al. 2004). Liao compared 
the predicted performance of logistic regression methods with PHM models of a 
single device’s remaining life in the example of bearing (Liao et al. 2006). In the 
paper, root-mean- square value and Camber value of the monitored vibration signal 
are set as covariates of the PHM in the life prediction, and experimental results 
show that the prediction accuracy of PHM is better than that of logistic regression 
model. 

However, the life model based on statistics does neither consider the difference 
of failure mechanisms and changing of operational conditions nor fully utilize the 
relevant information before failure. Moreover, this kind of life analysis and predic- 
tion methods are based on probability statistics, supported by a large number of 
experimental data, without emphasizing on the individual subtle change. So the 
results of life prediction are the “average property” under the given conditions. 
Although this life analysis and prediction method can be used to effectively 
calculate the distribution of the product’s life through accumulated massive data, 
it can hardly meet individual device life research such as online bearing in a spatial 
state. More and more research work focuses on modeling of online performance 
reliability and RUL assessment of a critical system or machinery from real-time 
performance data sensing. 
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Remaining useful life modeling and prediction based on condition monitoring 
data is defined as predicting the time from the current time to the time that system 
failure occurs or to the time when a defined the system failure to reach. Jardine 
(2006) expressed remaining useful life as X t = {x t :T— t\T > t , Z{t )}, where 
T represents a random variable of life time, t represents the current time, and vector 
Z(t) represents the observation data available to the current time. 

Engel (2000) showed the prediction of the remaining useful life of the helicopter 
gearbox using polynomial model to extrapolate characteristic variables. Besides, he 
also pointed out that an effective life prediction method required not only the 
expected value of remaining useful life but also the uncertainty of predictions 
(expressed with confidence interval). Baruach and Chinnam recommended HMM, 
hidden Markov model, for diagnosis and prediction establishing independent 
HMMs for each health state during equipment failure, and then estimating the 
state -changing point of each sample, and finally figuring out the conditional distri- 
bution of state-changing points (Chinnam and Baruah 2005). 

Camci and Chinnam regarded hierarchical HMM as dynamic Bayesian network 
for diagnosis and prediction (Camci and Chinnam 2010). The methods can be used 
to estimate transition probabilities directly of health status of equipment. Consid- 
ered the continuity of the state identification and residual life prediction, Mullen 
applied the improved algorithm - hidden semi-Markov model - to build a frame- 
work for equipment’s residual life prediction which was verified by simulation and 
realized the residual life prediction on the basis of device degraded state recognition 
(Gokhale and Mullen 2004). In the paper (Bie and Wang 1997), analysis and 
comparison between the Monte Carlo method and the analytical method are 
elaborated, and then it gives a comprehensive introduction of the application of 
Monte Carlo method in the power system reliability assessment, such as the way to 
improving the convergence rate, the combination of Monte Carlo method with the 
analytical method, and how to deal with the loads in the Monte Carlo method. 

In recent years, the state-space model is widely used to model and predict the 
residual life of mechanical products and reliability evaluation in economic time 
series analysis. In general, state-space models consist of two equations: one is the 
state equation and the other is the observation equation. The state equation repre- 
sents the transition from the current state to the next state, namely, mutual trans- 
formational relation, and the observation equation stands for the interrelation 
between the actual observed vectors and the state vector. The establishment of 
the two equations provides a consistent modeling framework for adequately 
describing the motion characteristics of the dynamic system. 

However, there is a lag in the research of state estimation problem based on 
state-space model, and for quite a long time basically remains at a level of linear 
models based on Kalman filtering. In the 1990s, the particle filtering theory had 
gained a great development since Gordon put forward the recursive process of 
resampling (Gordon et al. 1993). The reference (Zhang et al. 2005) proposes a fault 
detection and isolation algorithm based on SIR (sequential importance resampling) 
particle filter likelihood function value. The algorithm divides the fault system into 
several fault subsets; every fault subset uses a particle filter to estimate, while all 
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particle filters run in parallel. The algorithm is simple and applicable to general 
nonlinear systems. 

Using the state-space method to establish the model for mechanical product life 
prediction is a hot research work in recent days. In terms of state estimate and 
model parameter estimate in a state-space model, there are limitations of the 
existing methods, causing prediction accuracy severely reduced. When applying 
state-space model in application for RUL assessment, there are still many technical 
problems to be solved. As for nonlinear non-Gaussian state-space model of RUL, 
either the theory or applied algorithm is still in the beginning stage. Comparative 
analysis among a variety of life prediction methods is made in Table 2. Advantages 
and disadvantages of each method together with corresponding scope of applica- 
tions are also listed (Lee et al. 2014). 


Table 2 The comparison of data-driven stochastic model 


Approach 

Advantage 

Disadvantage 

Application 

Life prediction 
based on 
probability 
theory 

Simple and easy to 
analyze 

Too much 
accumulated 
experience data 
required 

The initial design 
estimation 

Few parameters 
required and easy to 
get 

Only a general 
estimation without 
regard to the specific 
cases of working loads 
and failure 

Less important, small 
sized, mass produced 
components and parts 

Able to obtain 
confidence limits 

Inaccuracy prediction 
in terms of the 
individual 


Life prediction 
based on ARMA 
or ARIMA 

Simple to model, 
methods mature, and 
widely used 

Unable to be used for 
long-term prediction 

Suitable for linear 
time, invariant 
systems when the 
performance of its 
characteristics are 
smoothly changing 

No need for historical 
data or fully 
understanding the fault 
mechanism 

Unable to synthesize 
prior knowledge 

High efficiency in 
computing and 
allowed to be used 
online 

Sensitive to noise and 
the initial state; poor 
effects on the dynamic 
process of 
nonstationary 

Life prediction 
based on logistic 
regression (LR) 

No need to set the 
critical threshold 

Not feasible unless 
normal feature domain 
description and 
unpredictable behavior 
are both available 

Service performance 
evaluation for 
degradation failure 
equipment like 
machine tools 

Able to obtain the time 
dynamic 

characteristics without 
making too much 

Unable to do real-time 
prediction with 
off-line data 
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Table 2 (continued) 


Approach 

Advantage 

Disadvantage 

Application 


assumptions for 
equipment failure 
process and the 
distribution function 



Life prediction 
based on 
proportional 
hazards 

Variety of status 
information are 
integrated into account 

Complex computation 
of parameter 
estimation involving 
numerical integration 

Risk analysis of the 
joint characteristic 
variables 

The characteristic 
index able to change 
with time 

No need for fault 
history data 

Assumption that risks 
change with variables 
proportionally 

Analysis of data 
distribution, residual 
distribution unknown, 
or censored data 

Life prediction 
based on neural 
network 

Strong self-learning 
ability, nonlinear 
fitting ability, and 
good robustness 

No standard method to 
determine the structure 
of the neural network 

Pattern recognition or 
some certain signal 
processing 

Excellent memory 
ability and nonlinear 
mapping skill 

Sufficient sample data 
required for the 
parameter calibration 

Uncertainty of 
structure and weight 

Life prediction 
based on 
Bayesian 
networks 

Able to deal with 
uncertain problems 

Unable to model the 
unknown fault 

Used to solve 
uncertainty and 
relevance fault of 
complex equipment 

Able to avoid data 
overfitting 

Difficulty in 
computing for the 
unknown network 

Effective multisource 
information fusion and 
expression 

Relies on reliability of 
the prior knowledge 

Widely used in the 
intelligent systems of 
computer intelligence 
science, industrial 
control, medical 
diagnostics, and other 
fields 

Able to visualize the 
dependency links 
among each pair of 
variables 

Results are sensitive to 
the selection of prior 
distribution 

Life prediction 
based on support 
vector machine 

(SVM) 

Concrete realization of 
the structural risk 
minimization criterion 

Not suitable for large- 
scale data processing 

Small sample, 
nonlinear, and high- 
dimensional mode 
recognition, such as 
predicting porosity 
and clay content in oil 
well logging 

Simple structure, good 
promotion 

performance, and fast 
learning 

Theoretical defects in 
kernel function for 
nonlinear 

classification problems 

Only one minimal in 
optimization solution 

Slow solution speed 


(continued) 
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Table 2 (continued) 


Approach 

Advantage 

Disadvantage 

Application 

Life prediction 
based on 
Markov model/ 
hidden Markov 

Able to model 
different stages of 
degradation 

Unable to model the 
unknown fault 

Unable to directly 
observe the state, but 
able to observe the 
vector sequence 

model (HMM) 

No need for prior 
knowledge of fault 
mechanism, able to 
process incomplete 
data 

Complex hidden semi- 
Markov model is 
required if the fault 
time is not 
exponentially 
distributed 

Widely used in pattern 
recognition, speech 
recognition, behavior 
recognition, character 
recognition, and fault 
diagnosis and other 


Predictions with 
confidence intervals 

The model calculation 
increases with the 
number of states 

fields 

State-space 
prediction based 
on Kalman filter 

Able to do the linear 
unbiased minimum 
mean square error 
estimation 

State equation and 
measurement equation 
need to be defined 

Real-time online 
prediction for linear 
Gaussian system 


Prediction accuracy 
does not change with 
the prediction time 
interval; good 
robustness 

Noise level could 
influence the 
performance and 
stability of the 
algorithm 



Able to estimate the 
current state and also 
predict the future state 

Only works with linear 
system and Gaussian 
noise 


State-space 
prediction based 
on particle filter 

By finding a random 
sample to carry out the 
approximate 
calculation for the 
probability density 
function to obtain the 
state minimum 
variance distribution 

Large amount of 
sampling are required 
to avoid degradation 

Computation is more 
complex than Kalman 
filtering 

Real-time online 
prediction of strong- 
nonlinear, 
non-Gaussian noise 
system, such as radar 
tracking and robot 
localization 


High accuracy 

High-dimensional 
system and increasing 
particles complex 
computation 



RUL Online Assessment 

The online assessment of RUL of machinery usually implies that real-time 
performance observations are available across the time, and the RUL is estimated 
by the conditional probability of the performance observations of the unit. 
The online assessing algorithms are recursive, depending on preceding state obser- 
vations only. The state- space model is widely used with recursive algorithms in 
application. 
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Support Vector Machine (SVM) Methods 


Support vector machine (SVM) is a machine learning algorithm proposed in the 
1990s and based on Vapnik’s statistical learning theory. Support vector machine is 
not based on traditional empirical risk minimization principle, but VC (Vapnik- 
Chervonenkis) dimension theory and structural risk minimization principle in statis- 
tics. SVM can solve the structural problems of high-dimensional model with a limited 
number of samples and has good predictive performance. The initial SVM is mainly 
applied to pattern recognition problems. Through short-term research and develop- 
ment SVM also makes a good result in regression (Steinwart and Christmann 2008). 

SVM techniques have developed greatly in recent years, giving a rise of a new 
kind of learning machines that use the central concept of SVM techniques for a 
number of learning tasks. Kernel machines provide a modular framework that can 
be adapted to different tasks and domains by using different kernel function and the 
base algorithm. They are used in a variety of fields, including biomedicine and 
bioinformatics (Furey et al. 2000), image analysis and artificial vision (Guo 
et al. 2001), and other engineering fields (Taboada et al. 2007). SVMs have been 
used by researchers to solve classification and regression problems. In this research 
study, SVM for regression (SVR) is used as an automated learning tool with a 
different focus to successfully predict the SOC (state of charge) of a high-capacity 
lithium iron phosphate battery cell as a function of cell voltage, cell current, and cell 
temperature. SVM model is used an alternative to the traditional regression 
approaches. As a nonlinear estimator, SVM is more robust than a least squares 
estimator because it is insensitive to small changes. 


Support Vector Regression (SVR) Theory 

Vapnik (1998) introduced insensitive loss function into support vector machine, 
which can solve the problem of nonlinear regression. Support vector regression 
(SVR) is divided into linear regression and nonlinear regression. In most cases, the 
sample shows a nonlinear relationship. For the nonlinear case, the basic idea of 
support vector regression machine is that the sample points are mapped to high- 
dimensional feature space by a nonlinear mapping O : R' 2 — > //, and linear 
regression is applied in high-dimensional feature space to get the nonlinear regres- 
sion estimation in the original space (Hong et al. 2005). 

For nonlinear regression presented in Fig. 5, the estimation function is 


y=f(x) = (co,0(x))+b (1) 

where go is weight vector and is the bias, , <.,.> is the operation of dot product. 

The unknown coefficients go and b are estimated by a minimization of the 
following function: 




/ 

+ ^c(f( x d 

7 = 1 




86 Remanufacturing and Remaining Useful Life Assessment 


3163 


Fig. 5 Support vector 
regression 









y=( co ■ x )+b 



Fig. 6 The s-insensitive loss 
function 



In the above objective function, the first term is the regularization item, the 
second term is the empirical risk, and C is a positive number which determines the 
balance between empirical risk and the regularization parts. 

Empirical risk is measured by Vapnik’s s-insensitive loss function, which is 
shown in Fig. 6, defined as (Xu et al. 2005). 


C(f(x)-y,x) 


y ~f(x) 



y-f(x) I < 6 



y-f(x) I > £ 



where s is the width parameter of the differential gap (dead band) in Vapnik’s 
insensitive loss function. 

In Eq. 4, slack variables & and & are introduced, and the above minimization 
problem can be converted into the following problem, namely, 


min 



Ti- 


co, <&(*,-)) + b - y t < + e 

-b<$ + e 

1 2 l 

JL ^ j ••• j l 


co, <l>(x 

ft, ft > o, i -- 



s.t. 


(4) 
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The above constrained optimization problem is a typical quadratic programming 
problem and can be solved by Lagrange multiplier method. The Lagrange multi- 
plier a z and f] f are introduced, then 



Function L is minimized to at, b , 



then the extreme conditions of function L is 



According to Wolf dual theory, the original problem is transformed into its dual 
problem, namely, 


min 




Solving the dual problem, suppose the solution is a* = (at, at, . . ., aj), a = (a 1? 

/ 

a 2 , . . ., aj) then, oo = (a, — a*)d>(x z ). According to the Karush-Kuhn-Tucker 

i= 1 

(KKT) conditions (Fletcher 1987) of quadratic programming, a z a z = 0, that is to 
say, of a z and a z , at least one of them is zero. Only a few are not zero, and their 
corresponding sample points are the support vector. The regression estimation 
function is decided by the regression vector and has nothing to do with the support 
vector. The definition of inner product kernel function is introduced, and support 
vector regression estimation function can be written as 
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/ 

/(*> = E {a* - ai)K{X„X) + b 

i=\ 


yj -J2( a i- ai)K(Xi,Xj) + e ctj G (0, C) 

# 1 

where b = < 

- ai)K(Xi,Xj) +s a* € (0,C) 



SVM Model and Forecasting 

In support vector regression, the choice of kernel function and other parameters 
determines model’s regression results and generalization ability. There are several 
parameters in the solution algorithm of SVR and need to be determined. One is the 
type of kernel function and its parameters, the other is the penalty factor C, and 
another is the insensitive function parameter s. 

Kernel functions have a very important impact on the solution process of SVM. 
SVM model is characterized by the training set. Kernel function and different forms 
of kernel functions can generate different SVM regression models, and commonly 
used kernel functions are linear kernel, polynomial kernel function, radial Gaussian 
kernel function, and so on. Penalty factor C is used to balance the complexity of the 
model and empirical risk values making its generalization performance the best. The 
optimal C in different data subspace is different. In order to control the complexity of 
the model, the value of C should be generally small, but the value cannot be too small 
to avoid a big model experience error, s determines the minimum allowable fitting 
error of the learning machine, s can control the regression approximation error, so as 
to control the number of support vectors and model’s generalization ability. 

As the above parameters also determine the forecasting accuracy of support 
vector regression model, the most appropriate parameters need to be selected to 
obtain the best prediction results. It is an optimization problem, and methods such 
as the cross-validation method, the grid search algorithm, particle swarm algorithm, 
and genetic algorithm can be used to seek the optimal value (Chapelle 2002). 

Forecast is predicting the future based on the past and present state. In other words, 
it is to make scientific predictions and reasonable inference to the direction of the 
future developing trend and the possible future state (Xu et al. 2007). For a given time 
series {x t },i = 1, 2, . . ., N. Take the first r (r < N) data as training samples, and the 
rest of the data set as the test samples. In order to make more effective use of the data, 
phase space of the one -dimensional time series is reconstructed, and the series is 
transformed into a matrix form. The one-dimensional time series {x,} can be 
transformed into the following matrix form: 



/ Xi 

*2 

X m \ 


*2 

• 

• 

• 

*3 

• 

• 

• 

Xm -\- 1 

• • 

• • 

• • 

, Y 

\ Xy— m 
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Fig. 7 The concept of 
support vector machine 
supervised training process 



xl xm xm+1 




Training group 1 



Forecasting points 




x2 xm+1 xm+2 



Training group 2 




Forecasting points 


where X, = X™ = (x h x i+ \ , . . x i+m _i) and m is the forecasting embedding 
dimension. Figure 7 (Xu et al. 2007) is the actualization of support vector machine 
supervised training - ( X , Y) is the training pair for the machine input and output. 

Then the forecast regression function is 

r—m 

m = £ (a* -a)K{X h X,)+b (9) 

t= 1 

In Eq. 9, t = m + 1, m + 2, . . ., r; X( v ) is kernel function. Lagrange multiplier a \ ^ 
and offset b can be obtained from the following two quadratic programming 
problems: 


min 




r—m r—m 

+ (a* + <Xj) - Yji («* - «/) 

i = 1 7=1 


N—m 

J2(ai-a*) =0 

7=1 

0 < < C, / = m + 1, m + 2, . . . , r 



One-step ahead forecasting is the forecast of x r+1 , x r+ \, given input vector X r _ m+1 . 
Data vector used for forecasting of the next moment is the actual observed value 
with the m moment up to current time t, rather than the forecasting data, while the 
forecasting data points will be used as input moment for multiple-step ahead 
forecasting. According to Eq. 9 and X r _ m+] = (x r _ m+1 , x r _ m+2 , . . x r ), the 1-step 
ahead forecasting is 
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r—m 

Xr+ 1 = X («i - a*)K(Xi,X r - m+ 1 ) + b ( 11 ) 

Z =1 

In addition to forecast unknown points, it is important to be able to give a 
confidence interval under a certain confidence level. Here is the forecasting interval 
estimation based on ^-distribution (Chen and Zhou 2008). 

For any real value x and the forecasting value x, the relationship between them is 
x = x -j- e, and e is the error. Assume the overall error e ~ N( 0, a ), forecasting error 
set can be obtained from the training set: E— { e x , . . ., e n }, where E is a sample of e. 
Take e n+ \ = x n+ \ — x n+ \ as the forecasting error of a single point, which is from the 
error overall and is independent from the elements in E , then 


n 


Tl 1 C n - |_i 6 


n + 1 


5 


~ t(r,_ 


(«-0 


( 12 ) 


where 


1 n ] 

e = -Y / e i S 2 = - V 

n U n ix 


n 


—\ 2 
e\ - e 


Given that a is the probability of type I error, through solving P(\f]\ < X) = 1 — a, 
so \rj\ < t a , 2 (n - i), and then 



n -j - 1 _ In T 1 

^ ta/2(n— 1)^\/ ~ j" — 1 — ^ E ta/2(n—\) 



n — 1 


(13) 


Besides e n+ \ = x n+ \ — x n+ \ , then the forecasting interval of x„ +i is 


Xn+l E e — t a /2( n -\)S 


n + 1 






j,X„ + i + e + t a /2( n -\) 


12+1 
22 — 1 


(14) 


Case Study 

A case study is conducted for the milling tool’s RUL assessment. The test is carried 
out in OKUMA vertical three axis milling machine at the Molding Tool Graduate 
School of Dalian University of Technology shown in (Fig. 8). The tool to test is 
7792 VXD cow nose cutter, and the diameter, overhanging length, and blade 
number are 32 mm, 200 mm, and 3, respectively. Spindle speed, cutting depth, 
and feeding speed are 400 mm/min, 0.4 mm, and 1,000 rpm, respectively. The 
acoustic emission signal and the force signal are measured at the same time. The 
acoustic emission (AE) signal is collected every 10 s. The sampling frequency is 
2,048 kHz, and the sampling length is 512,000 of pulses. 


3168 


H.-c. Zhang et al. 



Fig. 8 OKUMA vertical three axis milling machine for the test 


Both amplitude and distribution of the acoustic emission signals change along 
with tools’ state from fresh to worn. Some signal features are noticeable and closely 
related to the wear, but the others are not. The acoustic emission signals are 
decomposed into 64 bands by wavelet packet (WP) transform, and the WP energy 
of different sampling time is calculated and normalized that is shown in Fig. 9. 
It can be clearly observed that the signal energy mainly concentrates on low 
frequency band and the maximum energy concentrates on Band 2. WP energy of 
Band 2 is used to estimate the tool wear and its change is shown in Fig. 10 with a 
nonmonotonically increasing trend. This trend may be caused by the increase of 
tool wear, which results in increasing contact area between the tool and the work 
piece. 

The calibration and the subsequent forecasting work of SVR in this study were 
performed by resorting to the LIBSVM software package. The LIBSVM software is 
freeware, and the source codes, written in C++, are open to public. This study has 
modified the codes for this tool wearing assessment. The grid search method with a 
cross-validation technique (Hsu et al. 2003) was used to derive the SVR model 
parameters, including the penalty parameter C and the kernel parameter y. The error 

o o 

tolerance e is set to 0.01. The search range is 2 ~ 2 and the search step is set at 


1 .0. A threefold cross-validation was used, and the parameters were obtained by the 
minimum root-mean- square error (RMSE) regarding each fold cross-validation. 
The model parameters (C, y) = (27.86, 0.06) were then determined by averaging the 
derived three sets of parameters from cross-validation, and the RMSE was 0.03 by 
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Fig. 1 1 The estimation result of SVR parameters 


simulating the calibration events. The estimation result of SVR parameters is shown 
in Figs. 11 and 12. 

Figure 13 presents the result of 1-step ahead forecasting after t = 165. At time t , 
several realizations of x u namely, x t ^ ( j = 1,. . ., r), can be obtained from the SVR 

residuals. A sample of r realizations of the forecast error ej can be obtained and used 
to “address” the deterministic forecast value obtained from the trained SVR model. 
The result is a distribution of r forecasts x t ^ , whose average is the deterministic 

prediction x t . The empirical distribution of x t ^ represents the probabilistic 

forecast. The probability distribution pertaining to each t variable can be constmcted 
using a plotting position formula (Vogel 1986). This work applied Hazen plotting 
position relationships to describe the WP energy distribution of t. A threshold for the 
normalized energy is set at 0.4. When the normalized energy value exceeds the 
threshold, the tool is considered failed. The results of reliability estimation at t = 
165, t = 185, and t = 200 are shown in Figs. 14a-c, respectively. Figure 15 is the 
forecast confidence interval based on t - distribution ( a = 0.05), and it can be seen that 
the reliability reduces gradually with the increase of forecast step. 
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Fig. 14 Predictive 
probability distribution 
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Fig. 15 Forecasting confidence intervals based on ^-distribution 


State-Space Method 
State-Space Model (SSM) 

State-space model (SSM) is a method of typical correlation analysis, which was first 
proposed by Akaike (1975) and further evolved by Mehra (1979). The applications 
in the analysis of mechanical products’ remaining useful life using SSM are 
increasing rapidly because the method can make some fairly complex issues into 
simple form. SSM provides a consistent analytical framework for processing 
practical problems on account of the similar model structure. 

The ideas to create an SSM are as follows: (1) introduce the concept of 
unobserved state variables; (2) establish a model for describing the status changes, 
which is called state equation; and (3) determine the observation equation, 
which contains the state transfer information. An SSM is built by two equations: 
one is the state equation and the other is the observation equation. The state 
equation presents the relationship between the current state and the next state, 
and the observation equation reflects the intrinsic relationship between the obser- 
vations and the state of the system. The mathematical definitions of SSM are given 
below: 

Definition 1: state vector x h the system state variables, used to reflect the intrinsic 

characteristics of the dynamic system at time /, is random and unobservable 

generally. x t in A-dimensional Euclidean space that is x, E R N \ where R Nx is the 
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state space. In general, the state sequence at time t = 1, s is denoted by x 1; 


and x 1: 


Ui, x 2 . 


,x s }- 


Definition 2: observation vector y t of the dynamic systems, which can be observed, 
is also random because of the noise. The observation sequence at time t = 1, . . ., 


n is denoted by y l:n and y l:s = {y u y 2 . 


,y n }- 


Assume the system states can seem as a first-order Markov process and 
observations under states are independent with each other. The general state-space 
model of the system (may be a nonlinear and non-Gaussian system) can be written as 


x 


t 


f t {x t -\,u t ,rj t ,0),t E T 


(15) 


y t = ht{x u u u e u O),teT 


(16) 


where Eqs. 15 and 16 are the state and observation equations, respectively, x, E R Nx 
is the system state vector, j, E R Nx is the observation vector, i#,- E R Nx is the 
input vector of the system, and 6 is the static parameters of the model. (e,- E 
R Nx ) and tji (r| e R Nx ) denote the observation noise and state noise, respectively; they 


are independent with each other. / 1 : R Nx x R Ne \-^R Nx is state function and h\ : R Nx 

xR N,] \-^R Ny is observation function. These two functions assumed known, depend 
on fi t (sometime it is omitted for simplicity). Prior distribution of the initial state x 0 
is assumed to be P(x 0 ). 


■N 


Model Estimation and Particle filtering 

Estimation can be divided into three categories based on the given information. 
They are prediction, filtering, and smoothing. For the state sequence x 1;5 : (1) if t > s, 
the future status can be predicted by the available state information, namely, 
prediction; (2) if t = s, the current status can be predicted by the obtained 
information, namely, filtering; (3) if t < s, the past state can be estimated by the 
available information, namely, smoothing. 

Bayesian filtering is a method of statistical inference. The principle of Bayesian 
filtering is to construct the posterior probability density of the state variables by 
using all the known information. The way is predicting the prior probability density 
of the state using state transition function, then get posterior probability density by 
updating it using the recent observation. The purpose of establishing SSM is to 
estimate the current state variables x t using observed information y l:t . The posterior 
probability density p(x t \yi :t ) will be updated once a new observation becomes 
available, so it will make the calculation inconvenient. However, recursive Bayes- 
ian filter, the basic state estimation theory of dynamic system, was introduced to 
update the recursive estimates, instead to store and reprocess the previous mea- 
surement data, which can save storage space and improve the operation speed. 

Recursive filtering sequence estimation can be used to update the state vector for 
discrete-time and parameter time- varying system. That is estimating the current 
state vector according to the current observation vector and several previous state 
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vectors without depending on all of previous data (Markov memory less property). 

Generally, this process is carried out in two steps. 

Predictions: to estimate the priori probability density of current state vector using 
the system state functions. Because of the effect of noise, there will be some 
deviation between the prior probability density and its actual status. Updated: to 
incorporate at time t + 1 , the new observations y t+ 1 into the estimator of the state 
vector to correct the obtained prior probability density and get the posterior 
probability density of the state vector by Bayesian method. 


The recursive formula of the conditional probability density function is 


Initial value: p(x 0 |yo) = p( x o) 
Prediction: p(x, |yi :t _i) = J 

Update: p(x t \y lt 


i 


P(y t \xt)p(x t \y i-,_i) 


where p(y t \yi:t-i) = J p(yt\ x t)p( x t\yi:t-i)dxt is called as normalized factor 


p(y,\yut-i 


p(y t \xt)p(xt\yi:t-i)dx t . 


As the probability density function in application is often not in a closed form, it 
is difficult to get the exact solution by an analytical method in practice. Only in 
linear constant coefficient system with zero mean and Gaussian white noise, 
Kalman filter method provides an optimal solution to the problems of prediction 
and updating. 

The ^-dimensional state equation and m-dimensional observation equation of the 
system can be expressed by 


x k — ®k,k-\ x k-l +r M -iW*-i 

y k = C k x k + V k 


(17) 


Assume that both the state noise W k and measurement noise V k are independent, 
standard white noise (sequence of uncorrelated random variables with zero mean), 

that is, E[W k ] = 0, co v[W h Wj\ = E[W k Wj T ] = Q k d kj , E[V k ] = 0, cov[V fo V,] = 
E[V k Vj T ] = R k 8 kj , and cov[W 7 /, Vj\ = E[W L V/] = 0., where Q k and R k are the 
covariance matrixes fomoise W k and V k respectively. 


The statistical properties of the initial state x 0 is 


E[x o] = xq , Var[xo\ —E (xq — xq ) (xq — xq ) 


- \T 


T 


T 


P o> cov[x 0 , W k ] = E[x 0 W k ] = 0, 


and cov[x 0 , V k ] = E[x 0 , V k ] 


0. 


The optimal linear filtering recursive procedure of the state x k can be obtained as 


follows: 


The filter equation is 


x k x k\k—l + K k (y k - Cxm-j) 


( 18 ) 
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where x k \ k _i = O k , k -ix k -i and the prior value is xq = io|o = *o- 

The gain equation is 


K k =P k \ k - l C k T (C k P k \ k - 1 C k T +R k ) 1 (19) 

where P k \ k - 1 = E 
matrix. 

Prediction error variance equation is 



T 


%k %k\k— 1 ) l %k %k\k—l 


is the prediction error variance 


Pk\k-1 = ®k,k- \Pk-\^> T k,k-\ +^k,k-lQk,k-lQk-l^ T k,k-l- (20) 

The filtering error variance equation is 

Pk = (I — K k C k )P k \ k -\ (21) 

and the prior value is P 0 = Var[x 0 \ . 

A recursive procedure commences at time t 0 = 0 by choosing To and P 0 to be the 
best estimates of the mean and the covariance of x 0 , respectively. The estimate for 
P l/0 is governed by the Eq. 20, then through Eq. 19 ,K X can be calculated. The state 
estimation x\ at t = 1 can be obtained through Eq. 18. Next, Py 0 was brought into 
Eq. 21 to calculate Pi. By repeated application of the recursive procedure, the real- 
time state estimation can be obtained. 

Particle filter is an approximation Bayesian filtering algorithms based on Monte 
Carlo simulation. Recursive Bayesian filtering can be done through nonparametric 
Monte Carlo simulation method, and the posterior probability can be effectively 
calculated for non-Gaussian, nonlinear, and high-dimensional data. The main idea 
is to use random independent samples called particles with weights to approximate 
the probability density p(x k \z k ). The mean of the samples can instead integral 
operation to obtain the minimum variance estimation of the state. As each sample 
(particle) representing a possible state of the system, the probability density func- 
tion of the particles gradually approaching that of the state with the increased 
number of particles, then the optimal Bayesian estimation results will be possibly 
obtained. Particle filter can be applied in nonlinear and non-Gaussian state-space 
model without constraints. Because of the advantages of the particle filtering 
technology, particle filter has a wide range of applications in time- varying param- 
eter modeling and becomes a hot research topic and application. 

The mathematical description of the particle filter algorithm is the following: for 
a stationary random process, assuming the posterior probability density of the 
system at time — 1 is p(x k _i\z k _i), select n random sample points according to 
certain principles. When the measurement information at time k is available, the 
posterior probability density p(x k \z k ) of the n particles can be updated. The process 
of particle filtering can also be divided into the two phases of the state estimation 
and the estimation correction. In the state estimation, a large number of samples 
will be selected from the probability distribution of x k _ i . The state of each particle 
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can be changed based on the state equation and the controlled variable; the original 
particles become predicted particle. In the correction phase, the predicted particle 
values are substituted into the observation equation, update the particle weights 
after noise processing, and make weight normalization. 

Hammersley, Morton, and Rosenbluth put forward the sequential importance 
sampling (SIS) method in the 1950s (Chen 2007). In the late 1960s, SIS method was 
applied in the control system applications. Scholars in various fields continued to 
study the SIS modes in the 1970s. The sequential importance sampling (SIS) 
algorithm is as follows: 


1 . Initial value t = 0, sampling particles {xo j) }f = x from prior distribution p(x 0 ), and 


the weight of each particles is Wq = for /= 1 ,2,. . .,7V. 


2. For t 


12 T 


i. For i = 1,2,. . ., TV, extracting particles x[ ?) ~ ^(x^lxo^ t -\yv.t) from important 


function q(x t lxQl_ x y\. t ), and assuming x$! t 


(0 


r (0 *(9 


ii. For i = 1,2,. . .TV, updating the weight by w) 


(0 


p(y t \ x t 

)p( x t 


q{ x t ] 

(0 \ 


w 


(0 

t- 1- 


iii. For i = 1,2,. . .,7V, normalizing the importance weight by w, = w\ l ^ w\ l \ 


(0 


N 


(0 


/■= 1 


3. The updated particle set {x^ : i = 1, . . ., TV} is to approximate the posterior 


N 


probability distributions of the state p(xo :t \y x . t ) E WtSlxo-j 

i=l 


Xq.) 1 and the 


N 


expectations of some functions such as E[g t (x o :t 



0 i ) 


.8(.) is the 


t= l 


Dirac delta function. 


After a number of iterations of the algorithm, the variance of the particle weights 
will gradually increase over the time, and a small number of particles are with large 
weight and most of the particles with small weights, which can be neglected, 
resulting in a type of particles’ “degradation.” The degradation of the particles in 
iteration process will lead to that the observed value Y of the system couldn’t be 
gotten after all the predictions. To this end, Gordon proposed the concept of 
resampling until 1993 (Gordon et al. 1993), which laid the base for the particle 
filter algorithm, and degradation was effectively inhibited (Storvik 2002). The 
concept of effective sampling scales is defined as N e jf = — where : 



w t 


(0 


i= 1 


l 


= 1, . . ., N are unnormalized weights of the particles. This formula is difficult to 
accurately estimate the actual calculation; the following approximated formula is 


generally used as N e ff 


l 


N 


where w t 


(0 


i 


1 , . . . , N are normalized 


2 
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Fig. 16 The diagram of resampling process 


weights. If N e ff < N th , then resampling should be adopted. In general, N th = 2 N/3 as 
a defined threshold number for sampling scales. The most commonly used 
resampling method can be introduced as follows: First, generate n random numbers 
{///:/= 1,2,. . .,n} in the [0, 1] uniform distribution, then use search method to find 

m— 1 m 

an integer number m satisfying the formula wj < ju t < wj. The new samples 

7=0 7=0 

Xk' n) are joined in the new set of sampling particles. The interval [0, 1] is divided 

l 

into n intervals by Aj = Wj(i = 1, 2, • • •, n). When the random number /// belongs 

7=0 

to the m th the interval I m = [2 m _i, 2], the samples are copied. Obviously, the samples 
with larger weight can be copied repeatedly, and some samples with smaller weight 
will be abandoned in the case of n sample number all the time. The weight of particles 
is now reassigned as 1/N. , and the resampling process is realized. 

The standard particle filter algorithm process with resampling is introduced and 
Fig. 16 (Zhao et al. 2013) is the schematic of this process: 


(i) Initialization: t = 0, choosing particles {xq } }f = i from prior distribution p(x 0 ), 
and the weight of each particles is w 1 ^ = jj for /= 1,2,. . .JSf. 


(ii) Importance sampling: for i— 1,2, . . ., N, extracting particles from q(x t \x^ t _iyi./), 


that is, x) 


(0 




q ( xMol-xy U ) ■ Assuming i® 


r (0 f (0 
:t-v x t 


for i— 1,2,.. ., N. 
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Updating the weight by w) 
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(iv) The posterior probability distributions of the state 


Although the particle filter algorithm has been widely adopted in many appli- 
cations, there are still some shortcomings. To improve the performance and stabil- 
ity of the particle filter further, many improvements have been proposed in recent 
years, mainly in two aspects: selecting the importance function and maintaining the 
particles diversity. Auxiliary particle filter (APF), local linearization methods, 
evolutionary particle filter algorithm, etc., are examples. 

In data-driven statistical modeling, the model structure and the mode parameters 
for a particular application are required to set through the understanding of the 
application and the available data. Then the model parameters’ estimation of a 
state-space model is necessary. Maximum likelihood estimation and maximum- 
posterior-likelihood estimation are the methods widely used in the aspect of 
parameter estimation due to its asymptotically optimal properties. However, there 
are few analytical approaches in solving the likelihood function, and it is calculated 
by means of mathematical optimization methods in practical application. But in 
some cases it is difficult to get the gradient of the likelihood function. To this end, 
the common method in the presence of latent variables is the expectation- 
maximization (EM) algorithm. The EM algorithm was first proposed by Dempster, 
Lair, and Rubin in 1977 (Dempster et al. 1977). EM algorithm can make a more 
complex optimization problem of likelihood function into a series of relatively 
simple functions by data augmented technology, which provide a framework for 
maximum likelihood estimation of unknown parameters with incomplete data 
(missing data) set, while the traditional methods such as Newton-Raphson estima- 
tion more complicated. EM algorithm belongs to iterative algorithm and consists in 
two steps for each iteration algorithm: expectation step called the E-step and 
maximization step called the M-step. 

State-space model with algorithm of particle filtering (modified MCMC method) 
is currently the most studied method for online RUL modeling and assessment due 
to the HMM property of the state model and MCMC effectiveness for nonlinear, 
non-Gaussian system. For a particular application, the state-space model 
(the state function and the observation function) is built based on the given system 
degradation dynamics and failure modes and failure mechanisms. The system 
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initiation conditions are also determined by the application. The available 
observation data will be used for model coefficients estimation through the EM 
algorithm by Monte Carlo simulation. The state of the space model and its 
estimate are used for the assessment of the degradation, the hidden stochastic 
process. The particle filter is applied to perform Bayes recursive estimation of 
posterior probability density function of the system state, given every updated 
observation. This is a recursive simulation approximation. The numerical 
calculation yet is very expensive at each step, once a new update observation is 
available. The accuracy of the algorithm is highly depending on the number of 
particles. As digital computer system has been improved greatly in power and 
speed, it makes the particle filter and state-space model possible in RUL online 
application. 


Online Degradation and RUL Predictions 

The purpose of estimating the model parameters and joint status of the state is to 
predict the future degradation of the individuals, that is the estimation of the state 
(x, 

joint posterior distribution at time t\ 


+ 1 I t + / 


yi . t ). Assuming the posterior sampling 0 ( t l } i is obtained, the 



The particles generated during the filtering estimation also represent the behav- 
ior of the individual unit degradation. Consequently, the continued evolution should 
be able to characterize the future degradation of the working unit. With the 
recursive estimation the /-step ahead prediction of the degrading state, i.e., 
(x t + i : t + i\y i : t ) 9 can be obtained by the following steps: 


For 7 = 1, . . ., /, 

For /=!,.. .,N, sample x^ l lj ~ p(x t + j\x ( t l) + j _ i, 0\ l) ) and x^ t+j = (x^ t+j _ i, x, + y) 


At the end of iteration, a sample set {x ( t l) t + j)\ 


N 


P(x t 


+ 1 \t + / 


yi . t ) can be estimated as 


available, and an estimation of 






(yX t 1 -j J ) 



Using the predictive distribution at time t + 1 as input for recursive estimation, 
the corresponding degradation prediction, for /-step ahead, can be calculated, 
namely 



( 24 ) 
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The object of prediction is to obtain time-to-failure (TTF) from current time, 
i.e., the RUL, which is the time from the current to the time the system fault 
indicator reaches its corresponding failure threshold. The RUL is a random 
variable, the result for RUL estimation is presented in the form of probability 
and probability distribution. Referring to (Orchard et al. 2005), given a 
predetermined failure threshold A for a state x, the two-sided criterion for system 
failure is defined as 


Cf \H low 2^ x 5^ H up j 

Where, Hi ow and H up are the upper and lower boundaries of the interval failure 
respectively. 

As shown in Fig. 17, H up and Hi ow are set symmetrically on both sides of A. The 
sample particle swarms of state estimation from t k to t k + / overlap with the hazard 
zone (the light shaded area). The sum of the normalized weights of all sample 
particles which locate in the light shaded area at any time step between t k and t k + / 
represents the probability of system failure occurring at the corresponding time 
step. The normalizing constant is the sum of weights of total sample particles 
which locate in the light shaded area from time t k to t k + /. Therefore, an 
approximation of the probability distribution of RUL can be obtained through a 
set of an equal interval discrete samples with their corresponding probabilities, 
that is 
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Fig. 1 7 The estimation principle of RUL (Orchard et al. 2005) 
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Where, w k+J - is the normalized weight of each particle at each prediction time, 

p(-) is the probability to failure when the particle value is within the range of the 
defined failure band. The mean of remaining useful life is therefore estimated to be: 


EruL — ^exp h ( 26 ) 

where, t ex p is the expected failure time. 

There are much research worked on RUL assessment based on the degradation 
assessment with online observations, namely p(xo :t \yi :t ) in Bayesian estimation. 
However, there are various concerns about online remaining useful life (RUL) 
assessment based on the monitoring data in practice. The brief discussion is 
summarized below: 


1 . The definition of failure by a performance threshold is basically by the experi- 
ence of the engineers or the analysis of old data, which is inaccurate because the 
life of the identical products is different even under the same working environ- 
ment. It is very subjective in the defining the threshold in current RUL assess- 
ment practice. 

2. The RUL is caused by system degradation. The underlying degradation of the 
system is a stochastic process, It is considered monotonic (no decreasing for 
small is better case), time-varying, and random. But the performance or the 
performance variable of the system could be up and down with fluctuation. 
Online signal in data acquisition usually shows a strong fluctuation in real 
application. 

3. It is difficult to measure the degradation signal directly in practical applications. 
Usually other signals highly related to the degradation of the system are mea- 
sured to assess the remaining useful life. The variables for online monitoring are 
regarded as covariates, and proportional hazards model are applied to assess the 
residual life. The advantage of this method is no need to define the failure 
threshold; however, when dealing with a variety of covariates, the model 
needs to be deeply study. 

4. The degradation of the system performance couldn’t be measured directly and 
usually some measurable physical variables that highly related to decline of the 
system degradation will be measured. There are two stochastic processes: 
system underlying degradation and the measurable performance. Hidden Mar- 
kov model (HHM) is proposed to study the hidden process from the observation 
of the measurable process. HMM has been study and applied widely, but there 
are no uniform approach to determined the direct relationship between observa- 
tions and states of the hidden process. And how to determine the parameters of 
the model is still remaining to discuss. 
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5. State space model (SSM), belongs to HMM category, is most studied method in 
predicting the remaining useful life. Kalman filtering method, widely used for 
constant coefficient linear systems with Gaussian noise, provides optimal esti- 
mation and forecasting results, but not directly for those nonlinear systems with 
non-Gaussian noise. No theoretical form can be applied in model parameter 
estimation and state estimation, and only Monte Carlo simulation method can be 
adopted for recursive estimate online. The definition of the baseline model is a 
difficult point in the term of SSM, the relationship between the observations and 
the state variables need to totally study and be modeled for an application. The 
state variables should be linearly independent and can express the system 
dynamics completely in an SSM. 


A Case Study 

Experimental Design and Setup 

The data set in this case study is taken from PHM Society (2010). Monitoring 
signals were obtained from the sensors (acceleration sensors, force sensors, acoustic 
emission sensors) mounted on the workpiece. Six cutters with three blades of the 
same type (Cl, C2, C3, C4, C5, and C6) were used in the milling test. All the six 
cutters were experimented on the same test workpiece for 315 times, milling cutting 
operations under the same setting condition. The vibration signals, force signals, 
and acoustic emission signals were measured during each milling process; the 
abrasion losses of Cl, C4, and C6 were also measured. Experimental data were 
saved in 315 sample files for each cutter; there are 6 sets of files in total for six 
cutters (C1-C6) respectively. 

The maximum wear values of tool Cl were used to establish the state-space 
model. As shown in Fig. 18, it is a nondecreasing process. Since the section of the 
data before milling time 125 showed great volatility, the data of milling cycle from 
125 to 315 were set as validation data. 


Modeling and Parameter Estimation 

According to the general degradation model about the cutting tool, the individual 
degradation process is modeled in a Wiener process with an unknown drift constant 
ft, that is, 


X(t) = fit -\- a B • B{t) (27) 

where B(t) is a Brownian motion, o B is the corresponding diffusion coefficients, and 
both ft and o B are unknown. 

The true value of the degradation cannot be measured directly; the observation 
Y(t) contains the error c R , i.e., 
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Fig. 18 The maximum wear value of toolCl 


7(0 = X(t) + • s(t) (28) 

where 8(0 is a Gaussian white noise and o R is the measurement error. Since the 
measurement system is generally known with a controllable precision, the value of 
o R in the model is assumed to be known. 

In order to use the particle filtering method for recursive estimation of 
model parameters and of prediction of degradation, a state-space model is 
established for the process, and the state of the model represents the unobservable 
degradation dynamics. Since the unit of interest is monitored through a digital 
computer DAQ system, a discrete-time state-space model is applied (Jin 
et al. 2013): 


X n +\ — X n + + (JB,n ’ W n (29) 

Y n = X n + gr, u • V n (30) 

Here, [3 n , o B n , and o R n are the values of the model parameters at step n , and W n 
and V n are normal random noise terms. 

In the Bayesian inference framework, a reasonable prior distribution is first 
assigned for the unknown parameters of the model. Since both the system dynamics 
and the degradation process are affected by some of the common factors, including 
the same working environments, there is likely some dependency existing between 
P and o B . Therefore, the joint distribution o B ) of ft and o B is considered as the 
prior distribution of the unknown parameters in the model. 
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'9 9 

Let mo = up, vq = 6p, v B = o B \ from the modern parametric Bayesian inference 

principle, the conjugate prior distributions for these two parameters are applied, 

i.e., a normal inverse-gamma distribution, which is conjugate to the Gaussian 
likelihood: 


(PWl) ~ N {mo, o\ / n 0 ) 

xo ( gb) ~ /G ( C/;; , /!;; ) 

where n 0 = v B /v 0 , IG(a , A) is the inverse-gamma density, a is the shape parameter 
and A is the scale parameter, and its probability density function is 

fix 

In order to get the prior distribution of the model parameters, the 
hyperparameters in the prior distribution have to be estimated, which can be 
obtained through the distribution fitting. The bootstrap method is applied to gener- 
ate multiple sampling for the estimation of the hyperparameters in the prior 
distribution. 

According to the degradation model assumption, 8 t = x t — x f _ \,i = 1, . . ., n is 
the degradation increment, which is an i.i.d. sample from N(j3, <j b ). Because o R in 
the model is assumed to be known, the degradation increment can be considered as 
an approximation of the observations, i.e., Sj = x t — x t _ i ~ y* — y* _ i, / = 1,2 , . . .,n. 
For a Wiener process, y 0 = 0; therefore, an estimation of /? and o B can be obtained by 
normal distribution fitting based on the n degradation increments. 

By the bootstrap method, it can generate k bootstrap samples based on the 
history data, i.e., k degradation process, and k estimations of /? and o B are available. 
The hyperparameters in the distribution can then be attained through distribution 
fitting, and the prior distributions of the model parameters are therefore calculated. 
The corresponding algorithm is described as follows: 

1. Generating k bootstrap samples based on historical data y,-, i = 1,2, . . ., n 

2. For each sample, according to the assumption Sj ~ N(J3 , o B ), getting an estima- 

2 2 b 

tion of P and o B through distribution fitting, i.e., ft and dp b = 1, 2, . . ., k 

9 9 9 

3. According to the assumption n o,i(^lc 5 ) ~ N(m 0 , o B ln § ), ^o,i (& B ) ~ IG(a B , A B ), 
getting the hyperparameters in the distribution through distribution fitting and 
obtaining the prior distribution of the model parameters 

Based on the discrete state-space model, the method of sufficient statistics is 
used to estimate the state and parameters jointly. The sufficient statistics and the 
conditional posterior distributions of the unknown parameters need to be identified. 
From the Bayesian inference theory, given the prior distributions of ft and o B , the 


a, A) 


/l 


r (a) 


x i a+l ) e X ! x x > 0 ,a > O.A > 0 


(31) 
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corresponding posterior distribution can be calculated based on the conjugate 
nature. After having obtained n observations, the corresponding posterior distribu- 
tion of (5 and a B is normal inverse gamma, with parameter values: 
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statistics and the parameters of these posterior distributions can be updated 
iteratively. 

The flow chart of the proposed method is shown in Fig. 19. Starting from the 
current degradation data of the tool, resampling samples are generated using the 
bootstrap method. According to the state-space model and model parameter distri- 
bution assumption, the priori estimation of model parameters can be obtained. Then 
the posteriori estimation of the model parameters can be performed by combining 
particle filter algorithm and the sufficient statistics. The state-space model with 
updated parameters is then identified based on the current degradation data. The 
tool future degradation can be predicted based on the model. Following the method 
of RUL prediction discussed above, the random variable of the tool RUL can be 
systematically evaluated. 


Results 

At milling cycles of K1 = 35, K2 = 75, K3 = 105, and K4 = 135, 900 bootstrap 
samples are generated respectively. In the particle filtering process, particle number 
is 1,000, the prior probability density function is adopted as the importance 
function, and the polynomial resampling method is applied. The parameter estima- 
tion results are listed in Table 3. 

The multistep prediction results are attained from the stochastic degradation 
model. The threshold for the soft failure is set at 0.15 mm. The prediction of the 
cutter degradation at K1 = 35, K2 = 75, K3 = 105, and K4 = 135 are shown in 
Fig. 20. The red lines represent the degradation process of each particle, and the two 
horizontal lines are the threshold band. The black dotted line is the estimated value 
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Fig. 19 The flowchart of the 
proposed method 



Table 3 The parameter estimations at different milling cycles 


Milling cycle 

A/? 

a 

A 


cr 

K1 - 35 

0.3768 

45.5507 

0.421 

0.0094 

0.5 

K2 - 75 

0.3779 

328.2323 

0.252 

0.00072 

0.5 

K3 - 105 

0.3456 

743.8327 

0.113 

0.00015 

0.5 

K4 - 135 

0.3505 

533.9785 

0.128 

0.00024 

0.5 


of the degradation, and the blue line is the degradation prediction. The 95 % 
confidence intervals for the predictions are also shown. 

When the wear value exceeds the predefined threshold, the tool is considered to 
fail, and the corresponding milling cycle is the TTF. The RUL probability density 
function at different milling cycles is shown in Fig. 21. 

Shown in Table 4 are the RUL and TTF results at different forecast origins and in 
Fig. 22 is the contrast of the predicted and the real values of RUL. It can be seen that 
the predicted value of the remaining life and the real value is very close, and it 
proved the model validity. 
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Fig. 20 Performance of the model at (a) K1 = 35, (b) K2 = 75, (c) K3 = 105, and (d) K4 = 135 


Summary 

Remanufacturing engineering has demonstrated a positive impact on energy sus- 
tainability, environmental protection, and global economic development. The the- 
ory and technology of remanufacturing has developed rapidly, and the market for 
remanufactured products continues to grow worldwide. The remaining useful life 
(RUL) of in-service machinery is a direct determinant of its remanufacturability. 
The selection of time to retire for remanufacturing is not only derived from the RUL 
but also from a comprehensive balancing of all effects regarding technology, 
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Fig. 21 The probability density function of RUL at different milling cycles 


Table 4 RUL and TTF results at different forecast origins 


Milling cycle 

Predicted TTF 

Real TTF 

Predicted RUL 

Real RUL 

K1 - 35 

155 

146 

120 

111 

K2-75 

148 

146 

73 

71 

K3 - 105 

155 

146 

50 

41 

K4 - 135 

151 

146 

16 

11 


economics, and environmental impact. This holistic approach optimizes the 
remanufacturing process. 

Remanufacturing is a complex system process. Remanufacturing theory pro- 
vides support for the following steps in the process: (1) determining the 
“remanufacturability” of an old part; (2) comprehensive balancing and judgment 
for remanufacturing process planning (RPP) design; (3) and assessing the quality 
and reliability, or lifetime, of the remanufactured part. An accurate and effective 
assessment of the RUL will result in a reasonable and correct evaluation of 
remanufacturability . 

RUL modeling can be categorized as physical model based and data-driven 
model based. The major RUL forecasting methods are support vector machine 
(SVM) and state-space model (SSM), as illustrated in the case studies. 
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Fig. 22 The contrast of the 
predicted and the real values 


of RUL 




The accuracy of RUL assessment in the data-driven model is affected by not only 
the model itself but multiple factors also such as data acquisition, signal processing, 
and measurement errors. In the search for a more accurate model for RUL assess- 
ment, a robust data acquisition system design is recommended to reduce noise 
effectiveness in RUL modeling. 
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Previous research studies have indicated that barriers to the remanufacturing 
process can be traced to the initial product design stage, and these have ignited 
the concept of Design for Remanufacturing (DfRem) as a much pursued design 
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activity. In this chapter, the definition and scope of DfRem activities are firstly 
introduced to provide the readers with an understanding of the topic. 
Next, a review on the design tools and methodologies for DfRem is presented. 
Five DfRem approaches have been identified and the strengths and limitations 
associated with each approach are analyzed. Although various tools and 
methodologies have been proposed for DfRem, few of them have been adopted 
by the industry. Therefore, a study on the factors that can help successful 
integration of DfRem into product development has been conducted. 
The identified factors are explained and organized according to the impact 
they have on different product development stages. Finally, future research 
activities and directions for DfRem are suggested for promoting the 
remanufacturing industry. 


Introduction to Remanufacturing and Design for 
Remanufacturing 

Faced with stringent environmental legislations and motivated by growing customers’ 
awareness of environmental issues, many organizations and companies have adopted 
the practice of sustainable development. To achieve sustainability, a closed-loop 
material flow needs to be formed. Reusing, remanufacturing, and recycling are 
currently the most commonly adopted end-of-life (EOL) strategies in a closed-loop 
system. Among these strategies, remanufacturing is gaining popularity. 

Remanufacturing is the process of bringing products back to sound working 
status, through the process of disassembly, sorting, inspection, cleaning, 
reconditioning, reassembly, and testing, as shown in Fig. 1 (Lund 1984). The idea 
of remanufacturing as an academic research topic began to emerge only in early 
1980s, with Robert Lund’s original remanufacturing study. Since then, there has 
been increasing academic interest in remanufacturing arising from its recognized 
benefits and potential role in changing our society. 

Previous research studies have indicated that barriers to the remanufacturing 
process can be traced to the initial product design stage (Amezquita and Bras 1996). 




Disassembly 



Cleaning 

Sorting 



Reconditioning 

Reassembly 

Testing 

l J 



Fig. 1 Remanufacturing processes 
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Table 1 DfRem activities 


Design for core collection 

Design for restoring 

Design for disassembly/reassembly 

Design for multiple lifecycles 

Design for inspection 

Design for standardization 

Design for cleaning 

Design for handling 

Design for access 

Design for upgrade 

Design for durability 

Eco-design 


Product features and characteristics may have positive or negative impacts on the 
efficiency of remanufacture, depending upon decisions made during the design 
process (Charter and Gray 2008). These have ignited the concept of Design for 
Remanufacturing (DfRem) as a much pursued design activity (Sundin 2004). The 
imperative for connecting design and remanufacture is further reinforced by Nasr 
and Thurston (2006), who stated that the full societal benefits of remanufacturing 
cannot be achieved unless DfRem is integrated with the product development 
process. 


DfRem Activities 

The definition of DfRem, as presented by Charter and Gray (2008), is “a combina- 
tion of design processes whereby an item is designed to facilitate remanufacture.” 
DfRem is not only a part of “Design for X” (DfX) mewthodology, where 
X represents one of the aims of the methodologies, it incorporates a series of DfX 
strategies, such as design for core collection, design for upgrade, design for 
disassembly, etc. (Charter and Gray 2008). Sundin (2004) suggested that DfRem 
stands for a collection of many tasks or considerations which prioritization may 
vary depending on the process needed of the products. Table 1 summarizes the 
design activities involved in the DfRem methodology. 


Desired Product Characteristics for DfRem 

Remanufacturing is often practiced by the original equipment manufacturers 
(OEMs), who remanufacture their own products; contracted remanufacturers, 
who remanufacture the products under contract from the OEMs or customers; or 
independent remanufacturers (IR), who buy used products to remanufacture and 
resell them. However, the ability to resolve the difficulties in remanufacturing is 
most often owned by the OEM, since they control the product design stage and can 
potentially control remanufacture. Before an OEM considers designing their prod- 
ucts for remanufacturing, they should examine whether their products possess the 
following qualities: 
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Product is made up of standard interchangeable parts (Lund 1998). 

The cost of obtaining and reprocessing the core is low compared to the 
remaining value-added (Lund 1998). 

Technology exists to restore product (Nasr and Thurston 2006). 

Product technology is stable over more than one life cycle (Lund 1998). 
Sufficient customer demand for the remanufactured product (Ayres et al. 1997). 
The core is durable and has high value (Charter and Gray 2008). 

Potential to be upgraded (Shu and Flowers 1999). 

There are channels for reverse flow of used product (Ayres et al. 1997). 


Tools and Methodologies for DfRem 

The goals of product design for remanufacturing are to alleviate the problems which 
may occur during remanufacturing operations and improve the efficiency and 
effectiveness of product remanufacturing. Various design tools and methodologies 
have been proposed to facilitate product design for remanufacturing. Based on their 
approaches, these tools and methodologies have been classified into five categories 
(Fig. 2). Each of them will be discussed in the following sections. 



Fig. 2 Approaches for design for remanufacturing 
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Table 2 References used in compiling list of guidelines for DfRem 


Reference 

Core 

return 

Disassembly 

Sorting 

and 

inspection 

Cleaning 

Refurbishing 

Reassembly 
and testing 

Amezquita 
et al. (1995) 


7 

V 

V 

V 

V 

Mabee et al. 
(1999) 


7 

7 

7 

7 

7 

MtGlothlin 
and Kroll 

(1995) 

7 






Shu and 
Flowers 
(1999) 

7 

7 

7 

7 

7 


S undin and 
Bras (2005) 


7 

7 

7 



S undin and 
Lindahl 
(2008) 


7 


7 

7 

7 

Charter and 
Gray (2008) 

7 

7 

7 

7 

7 

7 

Ijomah et al. 
(2007a, b, 
2009) 


7 

7 

7 

7 

7 

Yuksel (2010) 


7 

7 

7 

7 



Guidelines for DfRem 

The most commonly used and effective approach to facilitate product design for 
remanufacturing is through providing design guidelines to steer a design toward 
higher remanufacturability. It is noted that the design guidelines proposed from 
various literature and research articles have presented a complementary but some- 
times overlapping insight. An overview of the design guidelines for successful 
product remanufacturing is therefore conducted. The collated design guidelines will 
be presented in a generic and general manner and categorized according to the 
six steps that constitute the remanufacturing process, namely, core collection, 
disassembly, inspection and sorting, cleaning, refurbishment, and reassembly and 
testing. The results can be used to identify the opportunities for enhancing 
remanufacturing design, setting goals, and measuring progress. Table 2 summarizes 
the literature sources drawn for composing these guidelines. 

Design for Reverse Logistics 

End-of-life products usually need to be returned to the specific remanufacturing 
factory in order for remanufacturing to take place. If this process is not well dealt 
with, a large cost barrier could occur: 
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• Product structure. To facilitate core collection, the structure should be 
designed in such a way so as to minimize the occurrence of damage during 
transit. For products which movement requires the use of forklifts, sufficient 
clearance and support at the base should be provided. In addition, structures that 
protrude outside a regular geometric volume should be avoided, since they are 
prone to become damaged during the transportation and may also hinder 
stacking during storage (Shu and Flowers 1999). 

• Product description. Labels, graphical communication, and the form of the 
product should be placed on the exterior or interior surface of the product to 
communicate the information of the product. For example, radio-frequency 
identification (RFID) is frequently regarded as a form of label to allow a vast 
array of information to be held (Charter and Gray 2008). 

Design for Disassembly 

Disassembly is not a simple reversal of assembly. Many permanent techniques 
which have been developed to realize and fasten the assembly process, such as 
plugging, pressing, forming, sonic welding, and adhesive, can cause problems for 
the disassembly process (Mabee et al. 1999). Basically, there are four areas that 
need special attention in design for disassembly: 

• Joint selection. The selection of the types of joints would critically affect the 
efficiency of the disassembly process. Nonpermanent joints are generally pre- 
ferred since they are simple to loosen (Mabee et al. 1999), e.g., bolt joints are 
usually preferred over adhesives. 

• Plan for nondestructive disassembly. Disassembly should not be destructive 
(Bras and McIntosh 1999). After the disassembly, the components are expected 
to be separated without being damaged or cause damage to other parts of the 
product. In addition, it is desirable for the fasteners to be reused. 

• Prevent corrosion/rust. Corrosion and rust are the greatest hindrance reported 
in an automotive industry survey (Charter and Gray 2008). Prevention of 
corrosion and rust will lead to better isolation of parts from the elements, 
using the less or non-corrosive materials or switching to other fastening 
mechanisms. 

• Clear instructions for disassembly steps. The disassembly instructions should 
be properly displayed on the returned core to facilitate the disassembly process. 
This is particularly important for third-party remanufacturers, who do not have 
detailed specifications of the products. 

Design for Sorting and Inspection 

Depending on the various inspection results, parts are sorted into three classes, 
namely, reusable without reconditioning, reusable after reconditioning, and not 
reusable. Design guidelines to facilitate sorting and inspection include: 

• Features for easy identification. Parts fulfilling the same function should have 
identical or distinctly dissimilar features. For example, to differentiate the gears 
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that fulfill different functions, gears could be made of different color-coding 
systems or have a specific number on them to easily identify them (Mabee 

et al. 1999). 

• Features for easy assessment. Sundin and Bras (2005) stated that determining 
and accessing the point for testing should be made easy and the time required for 
the inspection of the parts should be minimized. Design features such as the 
sacrificial parts for indicating the component’s condition over time should be 
encouraged. Sensors can also be embedded to record the useful data and com- 
municate the information over time (Mehmet and Surendra 2010). 

Design for Cleaning 

Cleaning is the most energy- and labor-intensive process in remanufacturing 
(Gonzalez 1983). Therefore, it is important to take the cleaning process into 
consideration during design; otherwise, a simple cleaning operation can become 
too laborious, expensive, or even impossible: 

• Selection of texture and geometries. Texture and geometries that facilitate easy 
cleaning are encouraged, such as a relatively flat surface which has a lower 
tendency to trap dirt or collect residue from cleaning (Amezquita et al. 1995). 

• Minimization of cleaning methods. Structures that require fewer variation of 
cleaning methods are always preferred. In this way, the cleaning process can be 
simplified. The material of the product that requires special cleaning methods 
should be avoided as much as possible, so as to minimize the cleaning cost as 
well as waste generation (Shu and Flowers 1999). 

• Labels that can withstand cleaning processes. During the cleaning process, 
labels and instructions which carry the product information on the component 
should be prevented from being washed away, since this may cause problems in 
subsequent refurbishment and reassembly processes (Sundin and Bras 2005). 

Design for Reconditioning 

During the refurbishment process, parts will be restored geometrically and proper- 
ties to be restored with surface treatment. For those parts which cannot be reused, 
they will be replaced by new spare parts. The following aspects should be focused 
while designing for refurbishment: 

• Durability design. Bulky and slightly overdesigned components are preferred 
than products with thin and less material, as the former could provide more 
margin of materials to be worked on with during refurbishment of components 
(Shu and Flowers 1999). Surfaces should also be designed in such a way that 
they have strong wear resistance, since the product may need to go through 
several use cycles. Moreover, it is appropriate to increase the dimensions to 
maximize usage cycles since part wear tolerance and material removal must be 
considered in these areas (Mabee et al. 1999). 

• Minimize the impact of wear and failure. Failure and wear are to be isolated in 
small parts such as inserts wherever possible. It is desirable that wear and failure 
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can be concentrated in detachable, consumable parts to avoid undue expense 
(Shu and Flowers 1999). 

• Platform and modularity design. Platform design is used to cluster the com- 
ponents based on the technical and use life; the importance of the components, 
technology trends, changes in aesthetic preference; or the possibility of being 
defunct, which will allow the defunct aspects to be removed while retaining the 
useful aspects of the product. Moreover, platforms may also be created such that 
the next-generation product can use these platforms. Therefore, the interface and 
the part should be standardized (Charter and Gray 2008). 

Design for Reassembly and Testing 

Designing products for reassembly and final testing can be improved from the 
following two aspects: 

• Adjustments. During reassembly, the number of the adjustments should be kept 
low, with adjustments being easy to make and are independent from each other. 

• Upgradability. The design should be flexible enough to be able to adapt to 
future technology migration as well as accommodate new configurations of 
the part. 

Discussion 

The lists of design guidelines have provided an understanding of the barriers that 
may be encountered during remanufacturing processes, as well as directions to 
enhance the efficiency of product remanufacturing. Table 3 provides more detailed 
remanufacturing requirements, and their related design criteria are summarized. 
Remanufacturing requirements are gathered from the feedback of remanufacturers 
with respect to improving the efficiency of the remanufacturing process. The design 
criteria are interpreted and “translated” from the remanufacturing requirements, 
bringing abstract requirements to concrete design specifications. Table 3 aims to 
provide the product designers with the most comprehensive guidelines to enhance 
product design for remanufacturing. However, the designers may still need to make 
proper judgment during the design of each individual product. 

Though straightforward and comprehensive, the approach of design guidelines 
for DfRem has been criticized as overly daunting, since it is impossible for 
designers to consider all these criteria simultaneously and some of the 
remanufacturing design requirements are intrusive on traditional design (Zwolinski 
et al. 2006). In addition, there are other issues that the design guidelines do not fully 
address, such as the subjectivity and customization guidelines (Hatcher et al. 2011). 


Quantitative and Qualitative Assessment Tools for DfRem 

Much work has been reported on developing new quantitative and qualitative 
assessment tools to evaluate the remanufacturability-related product properties 
and provide design feedback to the product development team. 
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Remanufacturing 

process 

Remanufacturing requirement 

Design criteria 

Reverse logistics 

Basic description of the product 

Labels, graphical communication, 
packaging, or even the form of a 
product could be positioned on the 
packaging 


Avoid damages during 
transportation 

Sufficient clearance and support at 
the base 



Avoid structures extruding outside 

Disassembly 

Easy access to internal regions 

Time to remove items for access 


Easy to loosen joints/fasteners 

Number of items to remove for 
access 


Reduce the variation of the tools 
used 

Number of fastener to remove 


Prevent part damage during the 
disassembly process 

Number of different tools to unlock 
the joints 


Prevent the corrosion of parts 

Number of permanent joints 


Clear instruction of the products 
disassembly process 

Number of parts damaged 


Easy access to the fastener/joints 

Number of fasteners damaged 


Easy identification of the fastener 

Isolate the part from the elements 


Using one disassembly direction 

Use non-corrosive materials 


Multi-disassembly should be 
possible with one operation 

Disassembly layout/instructions 
provided 



Position of the parts 



Type of fasteners/joints 



Types of parts 



Position of the fasteners/joints 

Standardization of the fasteners/ 
joints 

Sorting and 
inspection 

Ease of classification of the 
components 

Parts are identical or grossly 
dissimilar 


Ease of assessing the condition of 
the components 

Standardization of the parts 


Request for more objective testing 
methods 

Small number of components and 
connections 


Tools to facilitate the sorting 
process 

Color-coding/numbering system for 
similar parts 


Ease in detecting wear and 
corrosion 

Small number of inspection tools 


Component information are 
clearly indicated (life cycle, 
composition, wear indicator, etc.) 

Simple part test 


Testing points are easy to access 

Description of life cycle, 
composition, and wear indicator is 
provided 


(i continued ) 



3204 


S.S. Yang et al. 


Table 3 (continued) 


Remanufacturing 

process 

Remanufacturing requirement 

Design criteria 

Cleaning 

Accessibility of the internal parts 

Number of cavities/corners difficult 
to clean 

Simple method for cleaning 

Surface roughness 

Simple inner and outside surfaces 

Total waste generated 

Standard cleaning methods 

Time to clean 

Less wastes and health concerns 

Total cleaning material used 

Less variation of the cleaning 
methods 

Specify cleaning methods 

Instruction for cleaning methods 

Labels and instruction are able to 
withstand the cleaning process 

Labels and instruction to 
withstand cleaning processes 

Type of materials 

Shape of the parts 

Reconditioning 

Parts are robust 

Bulky - overdesign 

Avoid subjective criteria 

Wear-resistant surface design 

Fewer parts for replacements 

Number of the usage cycles 

Avoid technological or aesthetical 
obsolescence 

Number of wear and failure prone 
positions 

Modularity updatable 

Number/cost of reparable 
components 

Clear information of the product 
displayed 

Technological cycle of core 
components 

Texture areas are refurbishable 

Aesthetical cycle of core 
components 


Component modularity 

Upgradability of components 

Contains a tracking method for life 

Number of discarded components 

Number of parts refurbished 

Number of parts replaced 

Reassembly and 
testing 

Ease for adjustments 

Number of adjustments 

Capable and adaptable for 
upgradability 

Time to reassemble 

Simple methods for testing 

Time of final testing 

Upgraded configurations assembly 
without modification 


Bras and Hammond (1996) have developed a number of matrices, including 
assembly and disassembly matrices, testing and inspection matrices, cleaning 
matrices, and part refurbishing and replacement matrices, to evaluate the properties 
of a product design toward remanufacturability. The indices of these matrices were 
strategically combined into a single remanufacturability assessment index to pro- 
vide feedback with respect to product designers on the remanufacturability of the 
product. Various case studies have been conducted, ranging from customer 
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electronics to automotive products, to validate the effectiveness and efficiency of 
the proposed assessment model. 

Shu and Flowers (1998) proposed a reliability analysis model for comparing 
different design alternatives for product remanufacturing. The proposed reliabil- 
ity model is capable of predicting system failure under the condition that failed 
parts are replaced with components of the same type or different types and the 
failure characteristics of the constituent parts in the series. Furthermore, an 
integration of this reliability model with life cycle cost optimization has been 
achieved to demonstrate the effects of component specifications on product life 
cycle performance and the use of this information to enable remanufacturing to 
be more cost-effective and viable. 

The “RemPro Matrix” proposed by Sundin (2004) has successfully identified the 
correlation between product attributes and the specific remanufacturing steps. This 
matrix suggests that designers of new products should address product properties, 
such as “accessibility,” “wear resistance,” “ability to disassemble,” as well as 
“controllability” through appropriate product design. The matrix also suggests 
that the prioritization of the product properties would differ depending on the 
process needs of the product. 

Zwolinski et al. (2006) have built a remanufacturable product profile (RPP) 
which encapsulates the knowledge of both remanufacturing context and 
remanufactured product properties. A quantitative assessment of the product prop- 
erties would be made to guide the product designers toward an existing product 
profile which properties are well adapted to remanufacturing. The specific infor- 
mation to improve the internal design of the product can also be provided by RPP. 

Du et al. (2012) have developed an integrative model that includes matrices for 
assessing the technology feasibility, environmental benefit, and economic benefit. 
The technology feasibility is evaluated based on all the remanufacturing processes. 
The economic benefit is assessed from the aspect of remanufacturing cost. Energy 
saving, material saving, and pollution reduction are used to evaluate the environ- 
mental benefit. A machine tool is used as a case study to illustrate the feasibility and 
validity of the proposed methodology. The result obtained from the proposed model 
can assist the decision maker to decide whether the core should be remanufactured, 
and serves as a feedback for product designers to improve the weak points that have 
been identified. 

While the trend of using quantitative or qualitative assessment tools for improv- 
ing product design for remanufacturing has received increasing research attention, 
most of these models and tools remain within the academic realm and have hardly 
been utilized in the industry today. Some of the reasons, as indicated by Hatcher 
et al. (2011), are that these design tools are quite complex and are only applicable at 
the late design stage when most of the decisions have already been made. More- 
over, the holistic life cycle thinking is often ignored in these methods, or the design 
tools may lead to a suboptimization situation. For example, a screw insert is favored 
in remanufacturing product design. However, introducing a new material, such as a 
metal part into the original material like plastics, may counter the recycling efforts 
(Shu and Flowers 1999). 


3206 


S.S. Yang et al. 


Life Cycle Thinking for DfRem 

Life cycle thinking for product design is a concept of addressing problems from a 
systematic or holistic life cycle perspective. In this concept, product design is 
conducted with a goal of reducing the undesirable impact throughout its life 
cycle. The EOL options, such as disposal, recycling, reuse, and remanufacturing, 
are fully addressed in life cycle thinking, in addition to the manufacturing and usage 
stages (Fava 1993). Among the methodologies that have been used to quantify the 
life cycle impact of a product design, life cycle analysis (LCA) and life cycle cost 
(LCCA) analysis are the most commonly adopted ones. 

LCA has been widely used to assess the environmental benefits of 
remanufacturing as compared with original manufacturing. A comprehensive 
LCA remanufacturing study conducted by Smith and Keoleian (2004) shows that 
a remanufactured engine could be produced with 68 % to 83 % less energy and 
73 % to 87 % fewer carbon dioxide emission, as compared to original manufactur- 
ing. The LCA remanufacturing study by Kerr and Ryan (2001) found that 
remanufacturing could reduce resource consumption and waste generation over 
the life cycle of a photocopier by up to a factor of three and greater reductions could 
be achieved if a product is designed for remanufacturing. Meanwhile, the monetary 
benefit of remanufacturing has been reported through the use of LCC analysis, e.g., 
remanufactured alternators (Erwin et al. 2012) and engine remanufacturing (Sahni 
et al. 2010). 

Besides being used to demonstrate the benefit of remanufacturing, life cycle 
thinking has also been proposed to evaluate and select different design alternatives. 
Shu and Flowers (1999) have proposed a DfRem framework for the selection of 
product fastening and joining methods by using the LCC analysis. The impact of 
joint selection on the remanufacturing stage relative to other life cycle stages was 
estimated in cost, which provides a quantitative and straightforward feedback to the 
product designers. 

With the emergence of sustainable development, some products are designed to 
be used for several life cycles before being finally retired from use. This has led to 
the change of life cycle thinking from a linear life cycle model to a multiple life 
cycle model. Sutherland et al. (2008) proposed a methodology to estimate the 
embodied manufacturing/remanufacturing energy across multiple use cycles. This 
model has demonstrated the potential energy benefits achievable through product 
remanufacturing as compared with new manufacturing, throughout multiple prod- 
uct life cycles. In addition, it has been used to analyze and compare the impact of 
different engine head designs over multiple life cycles. 

It should be noted that DfRem is only one of the factors that determines a product 
design. Design improvements interpreted from the remanufacturing point of view 
may have different impacts on other life stages of a product. Therefore, a proper 
assessment of the impact over the entire life cycle is necessary during product 
design for remanufacturing. For example, if the remanufacturing enhancement of a 
redesign has made a product more difficult to manufacturing, the redesign should 
not be adopted because commercial considerations would not allow it to be 
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produced in the first place. However, life cycle analysis has frequently been 
criticized to be very detailed, expensive, and time consuming. In addition, during 
the initial design stage, the product information is still incomplete, and this makes 
life cycle analysis difficult to be conducted. 


DfRem Through Disassembly Strategy Planning 

Disassembly, which allows the separation of the reusable and nonreusable 
components for further processing, is closely related to EOL strategy determina- 
tion and regarded as a new frontier of product design for remanufacturing. 
Many researchers have proposed different methodologies to measure the 
disassembability of a product and generate an optimum disassembly sequence. 
For example, Gungor and Gupta (1997) have proposed a disassembly sequence 
generation heuristic which could generate the optimum disassembly sequence for 
a product. Turowski and Tang (2005) have developed a Fuzzy Petri Net (FPN) 
model to represent mathematical uncertainty management and human factor in 
disassembly. The model could dynamically estimate the impact of these factors 
on the disassembly process and find an optimum disassembly path with the 
highest economic value. 

The factors that affect the EOF strategy include the disassembly sequence, the 
disassembly time, the disassembly cost, and the benefits from reusing and recycling 
the components (Jun et al. 2007). There is, therefore, a growing amount of research 
work on proposing methods for generating “recovery plans,” which attempts to 
balance the value of the reclaimed parts with the disassembly cost. Gonzalez and 
Adenso-Diaz (2005) have introduced a model which could determine the optimal 
EOF strategy for each component and the subsequent disassembly strategy that led 
to the highest profits. The result from the proposed method could be used to 
enhance the EOF design from the early stage of product development. 

Besides economic profits, other works have included the environmental benefit 
as a criterion for assessing the EOF strategy. Fee et al. (2010) have developed 
an EOF decision model for remanufacturing options. The maximization of the 
economic value and environmental benefit has been achieved in this model through 
an integrative approach. Possible design changes could be interpreted from the EOF 
decision model. 

The returned product/component quality, which is an essential consideration for 
EOF strategy determination, has been taken into account in the model proposed by 
Krikke et al. (1998) to generate a quality -dependent recovery and disassembly 
strategy. The optimization of the disassembly strategy was achieved through maxi- 
mizing the overall net profit using a two-phase dynamic programming algorithm. 

In addition, active disassembly design has been proposed by Chiodo and Ijomah 
(2012) to enable rapid, nondestructive self-disassembly of products at the EOF 
stage. The potential fusing of active disassembly and DfRem is discussed to address 
the goal of sustainable manufacturing. The applicability of the proposed method has 
been verified using an automotive electronic control unit as a case study. 
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The results obtained from disassembly strategy planning can serve as a feedback 
to enhance product design for remanufacturing. For example, if two easily 
remanufactured components are disposed of, redesign should be considered if 
there is a high disassembly cost to separate them. However, it should be noted 
that DfRem through disassembly planning is only a stepwise approach. To be more 
comprehensive, integration with product life cycling thinking can be a possible 
future research direction. 


Integrating DfRem with Existing Product Design Tools 

Some researchers have used existing design tools to improve the remanufac- 
turability of products. King and Burgess (2006) have adapted platform design for 
remanufactured products. Platform design is “the strategic architecture of common 
and parametric components that forms the basis for a product family.” To apply this 
concept to remanufacturing design, potential multiple life components will be 
regarded as common components which form the base platform, while single life 
components will form the parametric components, which increase product design 
variability. 

The Failure Mode and Effects Analysis (FMEA) has been modified to facilitate 
product design for remanufacturing (Sherwood and Shu 2000). The data from the 
waste-stream analysis of automotive remanufacture are used to derive values of the 
indices of occurrence, detectability, and repairability. These values are used as the 
input for FEMA. The outputs are the priorities of the factors that impede the reuse 
of parts, thus providing the designer with the insight on how to improve product 
design for remanufacturing. 

QFD is a proven methodology for translating consumer demands into appro- 
priate technical characteristics and specifications for product development and 
production. Yang et al. (2013) proposed a QFD model for remanufacturing based 
on three key modifications to the traditional QFD. The first modification expands 
the conventional scope of the “customers” to include the remanufacturers, envi- 
ronment concerns, cost factors, as well as product users. The second modification 
involves a hierarchical structuring of the engineering requirements and computing 
their weights. The third modification uses the fuzzy set theory to overcome the 
vagueness and impreciseness involved in the QFD decision-making process. 
Through this fuzzy QFD approach, the remanufacturing requirements would be 
mapped to the engineering requirements that the designers need to focus on in 
order to improve product remanufacturability. A case study for the automobile 
remanufacturing industry was used to illustrate the applicability of the proposed 
methodology. 

The DfRem trend has moved from developing abstract solutions, e.g., the 
DfRem matrix (Bras and Hammond 1996), to using the existing design tools for 
DfRem to provide detailed suggestions, e.g., platform design (King and Burgess 
2006). These problems associated with the integration of existing design tools with 
DfRem as most of them are not developed for DfRem purposes and the use of these 
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tools as a guide to carry out DfRem will need to be further explored (Hatcher 
et al. 2011). The review of the design tools has indicated that DfRem is most 
effective in the early design stage when few design decisions have been made and 
less technical data is available (Amezquita et al. 1995; Zwolinski et al. 2006). 


Integration of DfRem in Product Development 

Design for remanufacturing implies actions to be taken during the product devel- 
opment stage to enhance the remanufacturing efficiency of a product without 
compromising other essential product characteristics, e.g., cost and performance. 
It is practical and realistic to adapt DfRem for existing product development 
strategies rather than to expect an original product design model to change to 
accommodate DfRem (Hatcher et al. 2013). Therefore, it will be useful to identify 
factors for successful integration of DfRem into product development and provide 
an overview of addressing these factors to facilitate the integration of DfRem. 

Various research teams have embarked on the investigation of the factors that 
lead to this successful product development. The factors generally acknowledged to 
affect the success of product development are examined. A flowchart for a generic 
design process is shown in Fig. 3, which is adapted from Pahl and Beitz’s model 
(1996). It consists of three stages, namely, task identification and prioritization, 
product concept and solution generation, and design concept evaluation, which the 
development team must iterate and follow closely. 

Brown and Eisenhardt (1995) have developed a comprehensive model of factors 
affecting the success in product development and classified these factors into 
following areas, namely, market, customer, senior management, project leader, 
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communication, team organization, team composition, supplier involvement, and 
development process. This model has provided a good starting point and reference 
for investigating the factors that lead to the integration of DfRem into product 
development. 

Factors that have been considered as essential when integrating DfRem into 
product development are introduced in the next section according to the major 
impact they have on the product development processes. 


Task Identification and Prioritization Stage 

In the product development stage, the design team clarifies the design targets to be 
achieved by considering the various aspects, such as customer demand, legislation, 
environmental concern, technology trend, and cost consideration; the team assigns 
the importance to each of these aspects based on company business decision and 
priority. Evaluating and analyzing the performance of the existing products or 
competing products can add to knowledge input in this stage. 

In the task of integrating DfRem into the product development process, 
the following factors need to be addressed during the task identification and 
prioritization stage: 

• Investigate customer demand and market competitiveness 

Sufficient customer demand is always the kick-start factor for the integration 
of DfRem in product development. Market investigation should be carried out on 
the demand from the new product users and the types of market for 
remanufactured products, such as the market for low-cost spare parts, the market 
for “green consumers,” the market for “product service business system,” etc. 
(Seitz 2007). The needs identified through market investigation should be put as 
the forefront of the product design considerations. Besides, companies can take a 
proactive approach to inculcate the mind- set of their customers for 
remanufactured products and provide well-documented evidence to them that 
remanufactured products have equal, if not more superior, quality as compared 
with the original product, so as to grow a larger remanufacturing market 
(Subramoniam et al. 2009a and 2010). 

• Identify barriers in remanufacturing processes 

There is often a misalignment between product designers and remanufac- 
turers on the product design requirements for remanufacturing, which results in 
wasted effort during product remanufacturing (Hammond et al. 1998). There- 
fore, effective communication between remanufacturer and OEM design engi- 
neer needs to be promoted, so that the development team can remain informed of 
any barriers during remanufacturing processes and alleviate these problems 
through proper product redesign. In addition, frequent external communication 
can expose the project team to new information, e.g., latest remanufacturing 
technology, which can impact the way that designers design the product 
(Johansson 2002). 
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• Plan DfRem strategically 

This factor implies that DfRem should be addressed at the operational level 
and the strategic level. The indicators for determining the performance of a 
product remanufacturability need to be determined, e.g., the successful 
remanufacturing rate, processing cost, etc. These factors should be synthesized 
and balanced well with the existing product performance indicators, e.g., product 
lifetime, profit rate, etc., to ensure a smooth integration (Charter 1997; Ritzen 
2000; Takata et al. 2003). 


Product Concept Generation and Solution Stage 

This step is critical for product development, where product designers will deter- 
mine the mechanisms, principles, and solutions, based on the tasks defined in the 
first stage. To ensure the successful implementation of DfRem in this stage, the 
following points need to be addressed: 

• Education and training of product designers 

The mind-set that DfRem is considered throughout the concept development 
stage should be established through proper education and training (Hatcher 
et al. 2013; Magnusson and Johansson 1999; Karlsson 1997). Training and educa- 
tion can be carried out in various forms, e.g., plant visits to remanufacturing sites to 
raise the designer’s consciousness of the link between product development and 
remanufacturing activities, the training of the designers of using different 
eco-design methods and remanufacturing design-related tools, and providing 
remanufacturing design guidelines and good design solutions to support the learning 
of the product designers on remanufacturing issues (Subramoniam et al. 2009b). 

• Management support for DfRem 

Senior management support is acknowledged to be critical for the integration 
of DfRem. This support can garner strong financial and corporate backing for the 
project and the recruitment of professional team members to ensure the success- 
ful implementation of DfRem (Johansson 2002). Besides the senior management 
support, mid- and lower-level supervision is also vital for carrying out DfRem 
operations, for example, the involvement of remanufacturing experts in the 
product development activities to advise the design team on the ways to alleviate 
the barriers in the remanufacturing processes through proper design, analyze 
the impact of design alternatives on remanufacturing efficiency, or even inspire 
the design team to consider DfRem issues (Roy 1999). 

• Close supplier relationships 

A close relationship with suppliers can have a positive impact while the 
designers are searching for product design that is beneficial for remanufacturing. 
It is unlikely that the OEMs are responsible for producing all the components of 
their products, and therefore the expertise of the suppliers in the outsourced 
components can be a valuable input for product remanufacturing design 
(Johansson 2002; McAloone 1998). 
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• Cross-functional design team 

Product development requires communication and collaboration among dif- 
ferent design teams. The design considerations include productivity, aesthetics, 
cost, environmental performance, functional performance, and especially 
remanufacturability in this context. The complex product development process 
requires frequent cross-functional communication, which includes the sharing of 
data and information, design trade-off, and feedback of design from different 
perspectives. Although most of the time remanufacturing may not be on the top 
of the design priority list (Hatcher et al. 2013), the presence of the 
remanufacturing requirements in this stage is critical for ensuring that 
remanufacturing concerns are not neglected (Charter and Gray 2008). 

• DfRem tools are applied 

Different design tools and approaches have been proposed to facilitate product 
design for remanufacturing. Examples include the RemPro Matrix, DfRem guide- 
lines, QFD for remanufacturing, REPR02, etc. (Sundin 2004; Ijomah 2009; 
Yuksel 2010; Zwolinski et al. 2006). Introduction of the specific tools into product 
development or adaptation of existing design tools to include remanufacturing 
perspectives can be an effective way to support product DfRem integration. 


Design Concept Evaluations 

The design concept should be evaluated from the various aspects, such as eco- 
nomic, environmental, remanufacturing, etc., by using the indicators established in 
the task identification and prioritization stage. This is the last stage to ensure the 
satisfaction of the task for remanufacturing design and alter the product design 
before the final implementation stage. The factors to ensure that the DfRem issues 
are considered in this stage include the following: 

• Management support on the final design decisions 

Most importantly, the prioritization of DfRem with other design issues, such 
as functionality, cost, and manufacturability, is significantly influenced by man- 
agement support, since the final decision of product design is usually made by 
senior management (Hatcher et al. 2013). Therefore, a firm support on product 
design for remanufacturing from the senior management is indispensable for 
successful implementation of DfRem, especially in the final evaluation stage. 

• Tools for DfRem reviews 

The well-developed and established tools for evaluating life cycle perfor- 
mance can be adopted, e.g., LCA and LCC. However, modifications of these 
tools are necessary for evaluating the product performance for closed-loop life 
cycle, instead of the traditional single life cycle. Examples of modified versions 
of LCA tools have been reported by Sutherland et al. (2008) and Gehin 
et al. (2009). One of the advantages of adopting these existing evaluation tools 
is the familiarity that the product designers have with them and thus allowing for 
easy implementation of remanufacturing evaluation in this stage. 
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Fig. 4 Factors that affect the integration of DfRem with product development 

Figure 4 summarizes the deciding factors for the integration of DfRem 
with product development. As compared with Fig. 3, it can be observed that the 
factors that are important for product development have great impact on the 
integration of DfRem into product development, e.g., customer and market, 
management support, close supplier relationship, cross-functional design team, 
etc. This implies that the likelihood that the integration of DfRem can be high 
when a company’s product development management is successful. However, 
there are some specific factors in DfRem integration, e.g., the identification of 
special requirements from the remanufacturing sites, development of DfRem 
tools, and education and training of the product developers, which are the factors 
that a company would need to pay special attention to, if integration of DfRem is 
to be carried out. 

Challenge and Future Research Directions 

Despite the appealing benefits in carrying out DfRem, there are additional barriers 
and complications that a company may face. First of all, compared with other DfX 
issues, e.g., DFA, DfRem is usually not given priority since most OEMs’ main 
focus is on the manufacturing and usage phases. Whenever there is a conflict 
between DfRem and other issues, e.g., assembly and manufacturing, DfRem is 
usually placed in a lower priority as it is viewed as less useful in terms of time and 
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cost due to the lack of awareness among designers. Secondly, some OEMs delib- 
erately play down on remanufacturing through product design to stifle the indepen- 
dent remanufacturing activities. This is because OEMs do not have strong desire to 
enhance remanufacturability for benefitting the independent remanufacturers, who 
are viewed as strong competitors of their own products. Thirdly, DfRem guidelines 
involve a variety of design issues, which will form a new set of challenges that 
the manufacturers may not be ready to deal with. Lastly, the reluctance of a 
company to share their in-house methods, tools, and knowledge with the outside 
world also leads to the barrier between the academic and the industry world 
(Hatcher et al. 2011). 

It can be observed that though the number of tools and methodologies developed 
for DfRem is large, few of them have been adopted by the industry. Some of the 
reasons are that these tools require significant knowledge of remanufacturing, 
which are only available at the later design stage. In addition, these tools have 
been criticized to be complex, time consuming, and difficult to apply (Willems 
et al. 2008). Therefore, there is a strong need to develop effective methods 
and tools. 

Incorporating life cycle thinking into DfRem is a future research direction that 
has been addressed by many researchers (Shu and Flowers 1999; Ijomah 2009). 
Ignoring the holistic life cycle thinking in DfRem tools can sometimes lead 
to suboptimization situations, since DfRem is often in conflict with other DfX 
methodologies. To improve the effectiveness and applicability of DfRem tools, 
there is a need to consider the impact of remanufacturing enhancement of product 
features on all the other life cycle stages, such as manufacturing and usage stages. 

DfRem is often viewed to be under the umbrella of Design for Environment 
(DfE). As compared with DfRem, DfE is a more well-explored and developed 
research area. Therefore, the literature on DfE may provide the insight on the 
approaches that are likely to be applicable in DfRem development. However, it 
should be noted that not all the requirements for DfE are mutually compatible with 
DfRem. For example, there are conflicts between DfE and DfRem, e.g., DfRem 
may require components to be overdesigned such that subsequently 
remanufacturing operations such as machining and grinding can be performed, 
while DfE may require components to be designed with minimum use of materials 
so as not to waste resources. 

In this chapter, the methodologies and tools for DfRem have been mainly 
focused on the product design level, which is the level that is directly affected 
by the mechanical properties of the design. The process-related design issues, 
e.g., remanufacturing production, supply chain, core collection mechanism, etc., 
are not considered since they are usually beyond the control of product 
designers and are more influenced by the company organization structure 
and business strategy. Discussion on the opportunity, barrier, and effort for improv- 
ing the process design for remanufacturing could be found in the work by Bras and 
McIntosh (1999) and Umeda et al. (2012). Another future research direction is the 
focus on the integration of product and process design for remanufacturing. 
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Abstract 

The service industry has become an engine of regional and global economic 
development, and the employment and revenue of the service industry are 
increasing dramatically in recent years. This chapter discusses product-service 
supply chain (PS SC) design by outlining the PS SC implications and focusing on 
the crucial elements of PS SC design. Viewing the subject from the perspective 
of manufacturers, several presentations are used in practice, such as the supply 
chain for after-sales service; maintenance, repair, and operations; and product- 
service systems providing. It first explains what is the PSSC and then moves on 
to the prominent features and the structure of the PSSC. In a general sense, PSSC 
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design is designed toward the value of all supply chain members by configuring 
the service resource reasonably. From the viewpoint of supply chain design, five 
elements are elaborated in this chapter, which include the PSSC strategy, service 
facility location, outsourced service supplier selection, service network config- 
uration, and process design for PSSC. The approaches for the PSSC design are 
reviewed, suggested, and elaborated. 

Introduction 

One of the dominant trends in manufacturing industry is that the manufacturers are 
trying to extend the value chain and transform themselves from manufacturing and 
goods-oriented organizations to service -oriented organizations (Wang et al. 2011; 
Gebauer 2009; Gebauer et al. 201 1). In order to keep up with the trend, the innovative 
product-service strategy is increasingly popular to manufacturers (Phumbua and 
Tjahjono 2012). Enterprises have proclaimed and tried to undergo this transition, 
but great difficulties were found in reality (Salminen and Kalliokoski 2008). 
Although the product service has been paid attention, the product-service supply 
chain (PSSC) has demonstrated new features. The longer-lasting responsibility for 
their offered solutions and the willingness to react on changing demands of the 
customers lead to high demand for the service organization and the belonging supply 
chain (Sanchez and Mahoney 1996). Otherwise, it is difficult to ensure that partners 
have the necessary knowledge and resource to perform the service adequately, 
especially more advanced service for small partners (Ojanen et al. 2011). The 
product-service providers have serious intent to establish the PSSC to integrate 
resources of the supply chain members. PSSC management is becoming more and 
more important. There are huge opportunities for organizations to make an improve- 
ment in this area in terms of cost and value (Ellram et al. 2007). 

During the transformation, the PSSC needs to emphasize the benefits of 
suppliers’ and customers’ satisfaction simultaneously in actual applications. The 
successful PSSC relies heavily on the initial design of the service supply chain. 
Chopra and Meindl (2007) claimed that a supply chain design problem is comprised 
of the decisions regarding the number and location of production facilities, the 
amount of capacity at each facility, the assignment of each market region to one or 
more locations, and the supplier selection for subassemblies, components, and 
materials. PSSC design extends this definition to include the development of 
PSSC strategy, selection of the outsourced supplier, allocation of the service 
facility, configuration of the service network, and design of the PSSC process. 

This chapter aims at providing an introduction to PSSC and guiding you to 
understand the main elements of PSSC design. This chapter discusses the subject by 
outlining the PSSC and focusing on the crucial elements of PSSC design. Viewing 
the subject from the perspective of manufacturers, several presentations are used in 
practice, such as the supply chain for after-sales service; maintenance, repair, and 
operations; and product-service systems providing. It first explains what is the 
PSSC and then moves on to the prominent features and structure of the PSSC. In 
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a general sense, PSSC design is designed toward the value of all supply chain 
members by configuring the service resource reasonably. From the viewpoint of 
supply chain design, five elements are elaborated in this chapter. Firstly, PSSC 
strategy is introduced, which indicates the demand frequency of a product service 
has significant influence on the service supply chain strategy. Secondly, the 
approaches for service facility location are discussed. Thirdly, the suggested prin- 
ciples and most widely used methods for the outsourced service supplier selection 
are presented. Fourthly, considering the difference between the deliveries of a 
series of product service, it makes sense to configure the service network for 
any special product- service deliveries. Finally, the critical activities in PSSC are 
introduced and elaborated. 


Product-Service Supply Chain 
What Is Product-Service Supply Chain 

With regard to the product providers, especially the equipment providers, the 
PSSC providing the service related to product is always exiting and described 
as supply chain for after-sales service (Jalil et al. 2010; Cohn 2006); supply chain 
for maintenance, repair, and operations (Maclnnes and Pearce 2003; Driessen 
et al. 2010); and supply chain for PSS providing in literature (Erkoyuncu 
et al. 2010; Johnson and Mena 2008; Wang and Fu 2010). Erkoyuncu 
et al. (2010) defined the PSSC as “the network of suppliers, service providers, 
consumers and other supporting units that performs the functions of transaction of 
resources required to produce services; transformation of these resources into 
supporting and core services; and the delivery of these services to customers.” 
However, the motivation mechanism for the traditional SC has changed to the new 
value instead of customer demand (Baltacioglu et al. 2007). Customer value is the 
trigger of the SC for PSS. Porter and Millar (1985) define the customer value as a 
product or service that the customers are willing to pay for. In industrial applica- 
tions, the things that customers are willing to pay for can be described as produc- 
tivity, which is able to create added value for them in the productions. Given this, 
the chapter primarily proposes that: 

With the purpose of offering productivity to customers, the PSSC is a network 
which consists of manufacturers, service providers, customers, and other supporting 
units and performs the functions of service demand forecast, service resource 
allocation, and service delivery management, as well as supportive activity 
management. 


Structure of Product-Service Supply Chain 

Prior to proceeding with the design of PSSC, it is necessary to identify the role of 
the SC members. In the product- service providing, the customers, main provider, 
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Fig. 1 Structure of product-service supply chain 


other providers, and suppliers constitute the network for PSS delivery. The PS SC 
structure is drawn up in Fig. 1 to illustrate the role of the members. One of the 
distinguishing features of the PSSC is the flat structure of the service network, 
which determines that all the partners have the chance to provide service to 
customers directly. 

The customers, who pay for the service and help to realize its value, are the 
receivers of the product service in the first layer. The customers include not only the 
end users but also the product owners. An example is the energy saving service 
provided by Siemens Group, who offers the energy saving equipment for energy 
saving company. Meanwhile, the company provides energy saving service to 
factory or community. Therefore, the customers of Siemens Group include the 
energy saving company, factory, or community. In the complicated process of 
service delivery, the customers, who cooperate with other units on training, infor- 
mation sharing, and other aspects, play an active role but not just a passive receiver. 

The second layer of the PSSC is the providers including the main provider and 
other providers. The providers offer product service directly to customers. The main 
provider is the product manufacturer and also is the service strategy developer. The 
customer’s requirement and feedback are gained and applied to the life-cycle man- 
agement of the service strategy for the continuous improvement. The other providers 
undertake the service outsourced by the main provider, such as warehouse service, 
logistics service, technical support, and other supporting activities. 

The third layer is the suppliers who engage in the service strategy development 
and offer support to providers. Some of them offer support to customers directly 
and then they are transformed to be providers. As an example, the GE Aviation sells 
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the aircraft engines to aircraft manufacturers and renders innovative service to the 
airlines. The GE Aviation acts as the supplier and the service provider in air service 
delivery. 


Features of Product-Service Supply Chain 

With the adoption of the innovative service strategies, such as product-service 
systems (PSS) (Wang et al. 2011) and servitization (Johnson and Mena 2008), the 
PSSC demonstrates new features. A thorough investigation into the characteristics 
of the PSSC is helpful to deliver the service efficiently and effectively. The 
prominent features can be summarized as: 

1. Value co-creation. The innovative service strategies act as a solution for cus- 
tomers on the basis that the value proposition for customers, providers, suppliers, 
and other stakeholders are identified and realized. The value chain will be more 
clear and reconfigured under new business models. Value co-creation requires 
that benefits and risk should be shared exhaustively between the members. 

2. Activeness. The feature is manifested not only in active service offered by the 
providers but also in the active participation of the customer. Providing service 
actively will benefit both the providers and customers mutually. The customer 
value creation is more active rather than responsive. Moreover, the outsourced 
service supplier selection is subject to the customers. 

3. Uncertainty. It involves managing resource availability, capacity, and capability 
along the service network (Erkoyuncu et al. 2010). The uncertainty is due to the 
following reasons: (1) the interaction between the providers and the customers in 
a variety of ways, (2) the service complexity and the delivery urgency 
(Erkoyuncu et al. 2010), (3) the resource availability owing to the perishability 
of service, and (4) the customer’s participation because the resource of the 
customer makes contribution to the service delivery. 

4. Dynamic. The value chain reconfiguration and optimization will lead to changes 
in PSSC organization. The reusability of some resource, such as expert or skilled 
worker, leads to dynamic service resource management. 


Design of Product-Service Supply Chain 

Product-Service Supply Chain Strategy and Network Structure 

Fisher has divided the products into functional products and innovative products 
and indicated the need to establish an efficient supply chain and reaction supply 
chain according to the characteristics of products (Fisher 1997). The similar rule 
also exists in PSSC. A particular customer’s demand frequency for some product 
service differs from others during the whole life cycle of the product. For example, 
as to a customer, some services are required once for several years, like major 
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repairs for equipment, but some maybe required once every month or a few months, 
such as oil changing and painting maintenance for some products. During the 
product life cycle, customer demand frequency of product service has a profound 
effect on the service supply chain design strategy and network architecture, as 
shown in Fig. 2. 

If the demand for a product service is of high frequency, the development of 
efficient PSSC can reduce the service cost through the economies of scale; if the 
demand frequency is low, a rapid response PSSC can minimize customer service 
delay. In a general sense, the high-frequency service demands often are of unex- 
pectedness or can be planned but with a long time interval. The low-frequency 
service demands are usually can be planned or forecasted with short time intervals. 
Efficient PSSC mainly performs the physical function of the service supply chain 
by allocating service resources and transforming them into service ability at the 
lowest cost. As to the rapid response PSSC, it performs the market mediation 
function of a supply chain, i.e., to make quick response to the unpredictable 
customer service demand. The comparison between the rapid response and the 
efficient PSSC is shown in Table 1. Remarkably, the collaboration of the key 
partners is significant to meet the customer demand for the unpredictable product 
service. Moreover, because the demand for this kind of product service is often very 
little and usually customized, it is necessary to establish the cooperation framework 
with the goal of improving the speed and flexibility of the PSSC in the early stage. 
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Table 1 Comparison between rapid response and efficient product- service supply chain 


Elements 

Rapid response product- service supply 
chain 

Efficient product- service supply 
chain 

Objective 

Respond to product service with long 
time intervals or unpredictable product 
service as quick as possible and 
minimize customer wait or loss 
simultaneously 

Provide product service with 
short time intervals and 
predictable product service at the 
lowest cost 

Service 

resource 

management 

strategy 

Avoid the necessary service resources 
lying idle or being wasted, such as 
reduce spare parts inventory 

Maximize the efficiency of 
service resource, such as improve 
the spare parts inventory turnover 
ratio 

Process 

management 

Flexible process management in order 
to improve the response speed 

Standardized process 
management to ensure the quality 
of service 

Service delay 

Require a large number of service 
resources to shorten the service delay 

Shorten the service delay as 
much as possible without any 
increase in service cost 

Suppliers 

Highlight speed and flexibility 

Highlight service quality and 
service cost 

Network 

architecture 

Single- or two-level network 

Multilevel network 


Centralized service network and decentralized service network are the two basic 
structures of PSSC network. The centralized service network is often characterized 
by single or two levels. It improves the efficiency of the service resources through 
the centralized management. But the average distance from the service centers to 
the end customers is too long to make a rapid response. The decentralized service 
network has multilevel structure and improves service resource availability by 
placing it close to the customers. However, this will reduce the decision-making 
efficiency of PSSC and increase the management cost at the same time. In terms of 
the product service with high demand frequency, a lot of service resources are 
required. The service resources can be transported to customers with reasonable 
quantities, such as the spare parts, equipment, or other material resources, even the 
field service engineers. With regard to product service with low demand frequency, 
the necessary service resource is relatively small. The centralized management can 
avoid or reduce the idle and waste of service resources. 

The rapid response PSSC matches the centralized service network, and the efficient 
PSSC is the right type for the decentralized network. A case in point is the company IR 
which advocates providing advanced compressed air solutions for customers. One of 
the services offered by the company is the compressed air, and the customers are 
charged by the use of compressed air instead of the equipment. And the service 
supplier is responsible for the air compressor maintenance, repair, and even the 
operation. The frequency of the product service referring to a series of activities is 
continuous or infinite. Efficient PSSC should be established for this kind of product 
service. Its corresponding service network should be decentralized and multilevel. 
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Service Facility Location 

Requirement and Analysis for Service Facility Location 

As a critical part of the service network, the service facilities offer detection, 
maintenance, spare parts, customer experience, product modification, and other 
customized service to customers. The number and the location of service facilities 
have significant influence on customers’ satisfaction and provider’s competitive- 
ness. In many service and industrial applications of the facility location problem, 
the number of required facilities and the allocation of the customers to the facilities 
are the two major questions that need to be answered (Pasandideh and Niaki 2012). 
The decision is subject to a set of changing elements during the decision. The 
factors leading to a reasonable service facilities location solution include: 

1. The service revenue and the service quality are suggested to be considered 
simultaneously. The service provider pursues the cost minimization or profit 
maximization, while the customer attaches importance to service quality which 
largely depends on the distance between the service facility and customers. 

2. Owing to the flexible service tactics borrowed by different service facilities, 
such as different prices of the same kind of spare part, a demand point is allowed 
to receive service from just one service facility in a time period. 

3. Once a service facility is opened, it should not be closed unless that customer 
demand reduces significantly, because a lot of money is required to build a CSC. 

4. With regard to the industrial equipment, the service capacity of a service facility 
is restricted by the space, tools, personnel, and other resources. 

5. The service demand is dynamic in the planning horizon. The factors posing 
influence on the service demand include not only the number of the product in 
use but also the added product sale. 

6. Because of the dynamic customer demand, the total numbers of the service 
facilities are not predetermined and should be adjusted to real requirements. 

7. The potential locations for service facilities are different with each other in the 
service demand, building cost, operation cost, and other aspects. A reasonable 
solution is subject to those discrepancies. 

The special requirements constitute the preconditions of the solution for the 
service facility location formulation. 

Method for Service Facility Location 

Cost and time are the two main criteria affecting the decision to determine the 
right location for a facility (Boloori Arabani and Farahani 2012; Pasandideh 
et al. 2013). The dynamic models related to service facility location problems 
can be sorted into two groups: the single-objective dynamic models and the multi- 
objective models. Furthermore, in light of the prerequisite that whether a facility 
is allowed to be closed, the dynamic models also can be divided into explicitly 
dynamic models and implicitly dynamic models. The aforementioned survey 
reveals that the focus is on single-objective facility location problems and that 
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the mixed integer programming is the most popular approach in the formulation. 
Minimizing the total cost is the primary criterion in both the single-objective and 
multi-objective programming models. However, due to the different application 
backgrounds, the models should be developed referring to specified assumptions, 
such as the scenario with a facility serving more than one demand point or the 
closable facilities. 


Selection of the Outsourced Service Supplier 

Requirement and Analysis for the Outsourced Service 
Supplier Selection 

The contradiction between the limited service resource and the multiple geographi- 
cally distributed customers is increasingly acute. As a response to the phenomenon, 
service outsourcing is getting more popular with manufacturers (Reeves et al. 2010; 
Ellram et al. 2004). In order to improve the profitability and competitiveness, the 
manufacturers should rely on a high level of outsourcing and work closely with a few 
qualified strategic business partners. The service outsourcing enables companies to 
offer new services more quickly and promote them more efficiently (Bustinza 
et al. 2010). Consequently, outsourcing service supplier selection has been acknowl- 
edged as one of the cmcial problems in service supply chain management. 

To produce a service is to organize a solution to a problem which does not 
principally involve supplying a good (Gadrey et al. 1995). Under the PSSC envi- 
ronment, it is important to select the most suitable service providers in order to 
enhance the service efficiency (Zhang and Chen 2009). The features of the 
outsourced service supplier selection can be derived by the general comparison 
with the product supplier in manufacturing industry, as shown in Table 2. The 
outsourced service supplier selection is a strategic decision and aims for long-term 
success in business. 

The evaluation is of subjectivity but not only depends on the historical perfor- 
mance. That means there is not too much previous performance data available in the 
evaluation. The features of the outsourcing service suppliers should be highlighted 
in the evaluation criteria development. Moreover, the selection method should 
distinguish the current performance and the potential performance on service 
providing in the future. 

Criteria for the Outsourced Service Supplier Selection 

The manufacturers and the customers are with the same hope that the outsourced 
service supplier can provide services effectively, efficiently, and flexibly. Follow- 
ing the analysis of the requirement of the service supplier selection, six criteria are 
suggested and elaborated in Fig. 3. 

1. Service capability. Three elements determine the service capability: technical 
capability, multiservice providing capability, and service management 
capability. In general, the preferred service supplier should be with the advanced 
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Table 2 Comparison of the product supplier and the outsourcing service supplier 


Elements 

Product supplier 

Outsourcing service 
supplier 

Final deliverables 

Standardized and quantified 

Unstandardized and hard 
to quantify 

Delivery process 
controllability 

Controllable 

Uncertain, less 
controllable 

Supplier available for 
special requirement 

Many in general 

Very limited or only a 
few 

Supplier controllability 

More controllable 

Less controllable 

Supply chain relationships 

Transaction relationship or 
strategic partner relations 

Strategy partner relations 

Supplier replacement 

Low cost and low risk 

High cost and high risk 

Main criteria 

Historical performance 

The ability for promised 
performance 
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technology, capable of offering more kinds of service to customers, and be of 
strong ability in service innovation and service delivery management. 

2. Service resource. Service resource is the foundation of service providing. Firstly, 
the service infrastructure including the software, like the application of IT, and 
the hardware, like the equipment, should be considered. Secondly, the special 
accumulated knowledge as well as the experience in the field is also essential. 
Finally, the service network, such as the number and the location of service 
facilities, would influence the service level. 

3. Service quality. As a potential strategic supplier, the supplier’s service quality 
can be assessed by the service flexibility, service process management, contin- 
uous improvement of service quality, and customer satisfaction management. 

4. Service cost. The service cost consists of the explicit cost, .such as price; the 
inexplicit cost, such as cost resulted from collaboration; the flexibility of pay- 
ment, such as the payment terms; and the potential of continuous cost reduction. 

5. Service supply chain collaboration. The manufacturers, outsourced service sup- 
pliers, and customers have to work closely with each other on service delivery. 
The profit contribution of the outsourced service business determines the pro- 
vider’s willingness of engaging in the service supply chain collaboration. The 
cultural compatibility and the knowledge sharing, such as the infrastructure for 
knowledge gathering, analyzing, and sharing, have great effect on service supply 
chain integration. 

6. Service risk. The risk focuses on the customer’s risk, like the service quality risk 
and safety risk; the manufacturer’s risk, like the risk of service being outsourced 
again; the financial risk resulted from the payment terms; as well as the social 
and environmental risk, like the pollution in service providing and waste 
disposal. 

Method for the Outsourced Service Supplier Selection 

In general, the method for the supplier selection can be divided into two groups: the 
individual approaches and the integrated approaches. The individual approaches 
were slightly more popular than the integrated approaches in the outsourced service 
supplier selection. As the individual approach, the data envelopment analysis 
(DEA) method is the most popular, followed by the mathematical programming, 
analytic hierarchy process (AHP), case-based reasoning (CBR), and the fuzzy set 
theory (Ho et al. 2010; Wu and Barnes 2011). With regard to the integrated 
approach, it was noticed that the integrated AHP and fuzzy set are the two most 
prevalent approaches. Fuzzy set theory does allow simultaneous treatment of 
imprecise and precise variables. But the AHP does not consider the interactions 
among the various factors and also cannot effectively take into account risk and 
uncertainty in estimating the partner’s performance (Wu and Barnes 2011). 
Compared with AHP, ANP allows for complex interrelationships among decision 
levels and attributes by the feedback mechanism. The service supplier selection 
involves kinds of strategies, quantity, and uncertain criteria. The methods employed 
in the outsourced service supplier selection should be able to deal with various 
criteria and their complicated relations effectively. 
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Configuration of the Service Network 

Product- service network is not a simple collection of service centers. Since the 
product services differ with each other, the product- service network supporting the 
service delivery should be developed and should be configured individually. 
Product- service network configuration is a process of selecting the appropriate 
service network elements and developing the service network that matches a 
specific product service. In a general sense, the four parts, including the product 
service, service facility, network management, and the network support, are 
involved in configuring the product- service network, as shown in Fig. 4. Two 
service network configuration examples for the compressed air and the air com- 
pressor installation are shown in Fig. 4. 
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Product service includes the identification of the customer requirement and use 
models. Customer demand is the driving force behind the product service and can 
be grouped as continuous productivity, such as compressed air for 24 h; intermittent 
productivity, for instance, very limited compressed air for a day; and product, for 
example, a little but necessary compressed air for a week. The customer demand 
requirement determines that the use model is result oriented or availability oriented 
or function oriented (Meier et al. 2011). It is evident that the continuous produc- 
tivity involves the result-oriented or availability-oriented use model while the 
product involves the function-oriented use model. 

Referring to the area covered by a service center, it can be classified into global, 
regional, local, and the special service facility. The product service provided by the 
special service facility often involves the patent, proprietary technology or other 
knowledge related to intellectual property right protection. The special facility 
reflects the advantages of the product- service providers and can bring them extra 
profits. In addition, a fully qualified service center can provide all the product 
services, for instance, the global service center. Some service center only can 
provide limited product services, such as the special service center. In the case 
mentioned above, a fully qualified local service center is responsible for the 
compressed air providing, but the air compressor installation is outsourced to a 
service supplier who is able to offer limited product services. 

Product-service network management includes two aspects. One is business 
management which is also called administrative management, and the second is 
process management. Different business processes have different management 
methods that can be divided into fully global centralized management, partly 
global centralized management, self-management with agreement, and fully self- 
management. However, the higher in the degree of integration of product-service 
network management, the higher the cost of management and the lower efficiency 
of decision-making. Whereas the higher autonomy of the product-service network 
management, the lower the management cost accompanied by higher service risk. 
Therefore, with the changes in the service center management methods, the 
processes of the service centers fall into four types: standardized and monitored 
process, standardized process, customized and monitored process, and the cus- 
tomized processes. The local service center providing the compressed air is 
governed partly by the global service center or the main provider. The operations 
are standardized and monitored, especially the service delivery activities. The 
outsourced service supplier, who provides the air compressor installation, is self- 
management with an agreement which indicates that the supplier should follow 
the guidelines required by the main provider. Although the activities of the 
installation are customized, the product- service delivery process is monitored 
by the provider. 

Product- service network support includes technical support, engineering sup- 
port, information support, partnership relation support, and customer collaborative 
relationship. Since the compressed air providing belongs to the result-oriented use 
model, the kinds of activities should performed by the provider. Therefore, it is 
essential to get the technical and engineering support from the global, regional, and 
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the local service centers. Owing to many partners being involved, the kinds of 
partnership relationship can be developed. The customized or localized information 
system or particular functional module should be developed because it is a cus- 
tomized product service, and long-term or strategic customer relation should be 
established. Similarly, the critical elements of the service network are highlighted 
in Fig. 4. 


Process Design of the Product-Service Supply Chain Management 

According to whether a process is involved in direct service providing, the process 
of PSSC can be clarified into two groups: the functional process and the enabling 
process (Fig. 5). The functional process, which has a direct effect on the customer 
value realization, includes the demand management, service resource and capacity 
management, and service delivery management. Aiming for offering the manage- 
rial and technological support, the enabling process covers the performance 
management, customer relationship management, service provider and supplier 
relationship management, information technology application management, and 
service network management. 
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Functional Process Management 

Demand management is to understand and react to current and potential customer 
demand (Rainbird 2004). The product related service is provided actively but not 
just responded to failures, which mean that service providers should make full 
preparation for service provision beforehand rather than afterwards. Because 
keeping the service available at any moment is in the interests of both the 
providers and the customers. For example, any delay in replacing the broken 
part would cause extra expenses for both the providers and the customers. 
Therefore, accurate demand forecasting is incredibly important for preparing 
and optimizing service resource (Akkermans and Vos 2003). The service demand 
features identification is to distinguish the deliverables for a better understanding 
and more accurate forecasting of the customer demands. Generally speaking, the 
deliverables can be classified as three types. The first type is physical unit, such as 
spare part, detecting device, and tools. The second is the service unit, such as 
training, consultancy, technical support, detection, logistics, and stock. While the 
third is the combination of physical unit and service unit, like providing installa- 
tion service for free when selling spare parts. In order to avoid any misunder- 
standing, if there is no specification, the word “service” refers to all the three 
deliverables in the following part of the chapter. Therefore, it has the same 
importance as the service demand forecast. Hence, the service demand planning 
is an ultimate result, indicating the requirements of deliverables on strategic, 
tactical, and operational plans. 

The purpose of service resource and capacity management is to balance the 
customer needs and service capacity. Service outsourcing is a way of maximizing 
the benefits of the service resource and extending the service capacity. Because of 
the perishability of services, service resource management plays the role of the 
buffer. Effective and profitable product service depends on managing service assets 
and fulfilling service demands in a flexible and integrated manner (Cohen 
et al. 2006). Generally speaking, the service resources can be sorted as follows: 
material resource (e.g., spare parts, tools, and material), human resource (experts, 
technicians, installation staff, etc.), information resource (e.g., schedule informa- 
tion, equipment status information, and logistic information), knowledge resource 
(e.g., technology and experience), and other resource. The appropriate combined 
service resources form the service capability, and the resources have to be coordi- 
nated for the right time and right place (Meier et al. 2010). Since the service 
resource results from the service deliverables demand, their differences among 
the resources and features should be identified initially for planning the engineers, 
tools, and materials. And then, on the basis of the service demand forecast, the 
maximal service capacity would be obtained by allocating the service resource on 
time and space optimally. 

Service delivery is a providers and customers’ involvement process fulfilling the 
commitments to customers (West 2004). The service delivery may involve one 
provider and one customer or involve many providers and many customers simul- 
taneously. The deliverables determine the way of service delivery. It includes 
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on-site delivery, like air compressor installation; online delivery, for example, 
remote technological training; offline delivery (delivered by service factory), such 
as air compressor cleaning and upgrading. Therefore, the features identification of 
the deliverables is the first step for any customer satisfaction. Even for the air 
compressor overhaul, there are options for repairing the product in customers’ 
company or shipping back to the service factory. The service delivery plan should 
be formulated according to the service demand and the service resource plan. 
However, the customers’ participation increases the uncertainty of the service 
delivery, especially for the outsourced service. The application of the modular 
technique in PSS development and standardization of PS SC process management 
would ensure a high level of customer satisfaction (Wang et al. 2011). Whether the 
service is provided by the main provider or not, the service quality is controllable 
via monitoring the supply chain process. 

Enabling Process Management 

The customer is no longer regarded as a passive transaction-oriented actor but 
rather an active relationship-oriented actor with a long-term perspective in the 
interaction (Kindstrom and Kowalkowski 2009). Customer relationship manage- 
ment is concerned with the development of profitable, long-term relationships with 
customers and other key stakeholders (Payne and Frow 2005). Identifying the 
customer value and building trust are the key steps in customer relationship 
management. Therefore, it would be reasonable to assort the customers in light of 
their value propositions. In company IR which is an air compressor manufacturer in 
China, the customers are classified according to the customer value: the compressed 
air, the function of the air compressor, and the air compressor. It is evident that the 
customers requiring the compressed air have tight connections with the service 
providers. With regard to this kind of customers, the emphasis of the customer 
relationship management is not only the product life-cycle management but also the 
customer life-cycle management, which covers the stages from the service rela- 
tionship establishment to the time when the customer would not need the service or 
turns to other service providers. With effective customer interaction, the providers 
can find out more potential needs and present more innovative services. It not only 
improves the customer satisfaction but also wins their trust. 

As mentioned above, providers contribute to service delivery and contact with 
customers directly. The suppliers are indirect supporters and contribute to the 
supply chain by sharing their technologies, expertise, and other useful resource. 
Supplier relationship management involves setting up, developing, stabilizing, and 
dissolving relationships with suppliers (Moeller et al. 2006). The purpose of the 
service provider and supplier relationship management is to share the information, 
technologies, tools, and other necessary service resources freely, as well as the 
benefit and risk. As a result, the recognition of the partners’ value proposition would 
be helpful to build long-term mutual confidence and deepen the cooperation. On the 
one hand, appropriate selection and evaluation approach will promise the 
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performance of the partners. On the other hand, the performance of the suppliers 
and providers could be assessed by the feedback from customers and the service 
delivery process. 

The PSSC depends on the information being shared efficiently (Baltacioglu 
et al. 2007). The information technology (IT) is helpful to build the fundamental 
information flow of the PSSC. Decreasing the service cost and improving the 
service response time are the primary drivers for IT application in service delivery, 
such as the remote diagnosis and maintenance and online training. On the one hand, 
IT has proved to be an effective tool for PSSC, such as the demand forecast and 
logistics management (Baltacioglu et al. 2007; Ellram et al. 2004). On the other 
hand, it facilitates the service providing and enhances the transparency of the 
service delivery process. Furthermore, it provides a useful tool for several providers 
offering service corporately and timely, especially when dealing with the thorny 
problems in the emergency. 

The contradiction between the decentralized customers and the limited service 
resource is one of the main challenges in PSSC. An appropriate service network 
would decrease the service cost and improve customer service significantly. The 
service network consists of various terminal service facilities, such as the service 
centers, spare parts warehouse, and the service stations of partners. As for the main 
providers, the service facilities may be self-built, partner owned, or distributer 
owned. The service network management performs the network structure design, 
service facility location selection, service facilities function and capacity design, 
and the forward and reverse logistics management. 

Performance management has a profound influence on the improvement of 
customer satisfaction, service quality of providers, and the behavior of suppliers 
and other stakeholders. Appropriate performance metrics and measurement can 
help to implement the principle of value co-creation and strategic objectives of the 
PSSC. The performance management can be conducted at two levels: the service 
delivery level and the PSSC level. In the service delivery level, the customer value 
and customer satisfaction are the core criteria for evaluating the efforts of the 
providers. In the PSSC level, the customer satisfaction and the service cost 
are equally important considering the providers’ value proposition. But the time 
interval of the assessment of the latter is longer than the former. 


Summary 

In this chapter, the author introduces the implications, structure, and roles of the 
supply chain members, as well as the prominent features of PSSC. To realize PSSC 
design, the five crucial elements are explained, which include the PSSC strategy, 
service facility allocation, outsourced service supplier selection, service network 
configuration, and the process design for PSSC. The approaches for the PSSC 
design applications are reviewed, suggested, and elaborated. 
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Abstract 

This chapter presents development of enabling technologies that are able to 
assess the reliability of remanufactured products based on predictive modeling 
methods, to describe fast and accurate prediction algorithms that are able to 
predict condition of critical components or parts of manufactured products based 
on historical data. Machine health condition prediction of critical components 
under the situation of insufficient data, missing prior fault knowledge, and noisy 
measurement are studied using an enhanced online sequential leaming-fuzzy 
neural network. Meanwhile, Weibull model-based reliability analysis is inves- 
tigated in this chapter. Performance of various Weibull parameter estimation 
methods is compared using case studies. Results of this part of research have 
enabled the development of a product reliability analysis tool that is able to 
characterize the product failure modes, failure rate, and reliability profile. 


Introduction 

As the primary goal of remanufacturing is part reuse, understanding of the 
quality/condition of the returned cores is very important for decision-making in 
remanufacturing processes. Hence, condition assessment and fault isolation of the 
returned cores becomes one of the most critical activities that determine the success 
of a remanufacturing process. Existing practices in remanufacturing typically carry 
out defect inspection and fault diagnosis only for isolated parts/components after 
the returned cores are disassembled. This may impose additional challenges and 
cost on the remanufacturing process such as fluctuation of schedule for remedial 
processes or treatments in the shop floor due to unexpected defects/faults identified 
after disassembly. In addition, it depletes the opportunity to assess the condition and 
performance of the products systematically based on their field operational data in 
each lifecycle before they are returned as cores, which is very important to establish 
reliability models of the products (Mazhar et al. 2010). 

On the other hand, for valuable machineries, such as mining trucks, a large 
amount of operational data is already being collected, typically on log sheets or by 
a control system. This process is usually not regarded as part of a condition 
monitoring or diagnostic program. However, there is a lot of valuable machinery 
performance and condition information buried in such operational data. In prac- 
tice, the only challenge is how to extract useful information from such data. It is 
believed that the operational and inspection data collected on a machine, when 
properly interpreted, can produce an accurate picture of the machines health. In 
reality, the cores returned for remanufacturing may have experienced very dif- 
ferent working conditions, and their components/parts may have diverse ages and 
different stress and strains arisen by the users, but remanufacturing companies 
usually do not involve the historical and field operational data of the cores when 
they make decisions for remanufacturing processes in the shop floor. Taking the 
engine of a mining truck as an example, operational data often is adequate to 
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allow for calculations of engine efficiency and detection of its deviations, which 
can then be extended to evaluate the condition of the engines. Analysis such as the 
emerging operational trends of operational data not only can tell us that there is a 
performance problem with this engine, but it also can help to isolate fault sources 
well before the engine is sent for inspection in the remanufacturing workshop. 
Measures of this sort can greatly assist with prioritizing remanufacturing decisions 
and balance the time between reman cycles, and more importantly, it may simplify 
the reman workshop inspection procedure and minimize the inspection cost. 

It should be noted that most of the current practices in remanufacturing rely on 
rules of thumb or expert knowledge and lack rigorous reliability -based evaluation 
models to support remanufacturing shop-floor decision-making. Although recent 
years have seen some applications using visual and/or statistical analysis tools to 
assist the lifecycle assessment of remanufactured products, such tools remain 
inadequate in coping with large amount of field operational data with inherent 
variability, uncertainty, and nonlinearity. The proposed approach attempts to 
address critical issues for improving remanufacturing processes through effective 
analysis of field operational data. This research will fill up the gap in the current 
state of the art where existing remanufacturing practices lack rigorous and reliable 
analysis of operational data for support of remanufacturing shop-floor decision- 
making, despite the fact that an effective analysis of field operational data in various 
aspects of the products will provide invaluable information to facilitate sound 
decision-making and continuous improvement of remanufacturing processes. 
More specifically, the novelty of this research includes. 

A comprehensive approach to condition assessment and fault isolation through 
rigorous reliability-based evaluation models with progressive model learning 
capabilities; Fusion of statistical and machine learning techniques to form a fast 
real-time RUL prognosis tool that can scale well against large operational data with 
inherent variety and uncertainty in remanufacturing processes. 


Weibull Model for Analysis of Time-to-Failure Data in Product 
Life Cycle Management 

Management of products and materials at the end-of-life (EOL) is being recognized 
as an integral part of the product life cycle engineering. Among various EOL 
management strategies, remanufacturing as a sustainable manufacturing process 
has received more attention in recent years. In remanufacturing practices, under- 
standing and communicating the failure risk and reliability of a critical part, 
component, or subsystem plays a crucial role as it has a significant impact on the 
lifecycle management of the product and also determines the success of the 
remanufacturing process. In such a context, it is envisaged that many of the life 
cycle engineering techniques will have a significant impact on remanufacturing 
practices such as reliability and remanufacturability analysis of valuable parts and 
components, remaining useful life prediction and warranty cost of remanufactured 
products etc (Mazhar et al. 2007). 
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In life cycle engineering as well as in remanufacturing industry, estimation of 
product mean life is an important task as it provides valuable information for 
effective life cycle management, in particular, the core management and invento- 
ries in remanufacturing. Usually a product’s mean life is determined by analyzing 
its time-to-failure data from a wide range of the same category of products operated 
under the same conditions of use (Li 2004). Another important issue in life cycle 
engineering and remanufacturing is the quantitative analysis of product or compo- 
nent reliabilities, based on which the product or the component’s expected useful 
life can be estimated. Understanding the probability of product or component 
failures at different stages of its life can be very useful to make optimized decisions 
in life cycle management and remanufacturing practices (Mazhar et al. 2007). 


Weibull Model for Reliability Analysis 


In life cycle management, one of the simplest approaches to predicting failure is 
based on statistical reliability models of past failures (Gu and Li 2012). Reliability is 
defined as the probability that a product will continue to perform its intended function 
without failure for a specified period of time under stated conditions (Pham 2006; 
Calixto 2013). Usually, reliability predictions are used to estimate future failure based 
on past failure records by applying a probability distribution such as the exponential 
distribution. However, one of the principal shortcomings of using the exponential 
distribution is that it imposes a “Markov” assumption, meaning that the future 
prediction of a failure is independent of the history of the unit given the current 
measurement (Lourenco and Mello 2012). In this respect, Weibull distribution 
(Weibull 1951) for prediction provides an alternative reliability method as it relaxes 
the assumption of constant failure rates as well as the Markov assumption (Groer 
2000). In fact, the most common distribution function in EOL management is 
Weibull distribution due to its ability to fit a greater variety of data and life 
characteristics by changing its shape parameter (Artana and Ishida 2002). Today, 
Weibull analysis is the leading method in the world for fitting and analyzing life data. 
In most cases of application, Weibull distribution is able to provide the best fit of life 
data. This is due in part to the broad range of distribution shapes that are included in 
the Weibull family. Many other distributions are included in the Weibull family 
either exactly or approximately, including the normal, the exponential, the Rayleigh, 
and sometimes the Poisson and the Binomial (Abemethy 2006). 

Compared with classic statistical methods, Weibull analysis uses failure ref- 
erence and mean-time-to-failure (MTTF) to forecast failures, whereas statistical 
pattern analysis uses test data to identify a statistical pattern such as trend lines 
(Fitzgibbon et al. 2002). Another most salient feature to be noted for Weibull 
analysis is its ability to provide reasonably accurate failure analysis and failure 
forecasts with extremely small samples of life data, where most of other distri- 
butions fail to give meaningful result (usually when the sample size is smaller 
than 20) (Abernethy 2006). This feature of Weibull analysis makes it very 
valuable in remanufacturing decision-making practices because it is a common 
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case that the life data of very big and especially very expensive parts/components 
collected in remanufacturing process are either incomplete or small in size. 


Basics of Weibull Distribution 


In general, a typical Weibull probability distribution function (PDF) is defined by 


fit) 


P 

r 1 



( 1 ) 


where t > 0 represents time, P > 0 is the shape or slope parameter, and rj > 0 is the 
scale parameter of the distribution. Equation 1 is usually referred to as 
the 2-parameter Weibull distribution. Among the two parameters, the slope of the 
Weibull distribution, /?, is very important as it determines which member of the 
family of Weibull failure distributions best fits or describes the data. It also 
indicates the class of failures in the “bathtub curve” failure modes as shown in 
Fig. 1. The Weibull shape parameter fi indicates whether the failure rate is increas- 
ing, constant, or decreasing. If fi < 1, it indicates that the product has a decreasing 
failure rate. This scenario is typical of “infant mortality” and indicates that the 
product is failing during its “bum-in” period. If P = 1, it indicates a constant failure 
rate. Frequently, components that have survived burn-in will subsequently exhibit a 
constant failure rate. If P > 1, it indicates an increasing failure rate. This is typical 
for products that are wearing out. To summarize: 


• p < 1 indicates infant mortality. 

• p = 1 means random failures (i.e., independent of time). 

• P > 1 indicates wear-out failures. 


The information about the P value is extremely useful for reliability-centered 
maintenance planning and product life cycle management. This is because it can 
provide a clue to the physics of the failures and tell the analyst whether or not 
scheduled inspections and overhauls are needed. For instance, if P is less than or 
equal to one, overhauls are not cost effective. With p greater than one, the overhaul 
period or scheduled inspection interval can be read directly from the plot at an 


Fig. 1 The “bathtub curve” 
failure modes 


Failure 

rate 
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acceptable or allowable probability of failures. For wear-out failure modes, if the 
cost of an unplanned failure is much greater than the cost of a planned replacement, 
there will be an optimum replacement interval for minimum cost. 

On the other hand, the scale parameter, or spread, 77 , sometimes also called the 
characteristic life, represents the typical time-to-failure in Weibull analysis. It is 
related to the mean-time-to-failure (MTTF). In Weibull analysis, 77 is defined as the 
time at which 63.2 % of the products will have failed (Pasha et al. 2006). 

There are basically two fitting methods for parameter estimation in widespread 
use in reliability analysis, namely, the maximum likelihood estimation (MLE) 
and regression methods. MLE involves developing a likelihood function based on 
the available data and finding the values of the parameter estimates that maximize 
the likelihood function. Regression method generally works best with data sets 
with smaller sample sizes that contain only complete data (i.e., data in which all of 
the units under consideration have been run or tested to failure). This failure- 
only data is best analyzed with rank regression on time, as it is preferable to 
regress in the direction of uncertainty. In Weibull analysis, median-rank regres- 
sion (MRR) method which uses median ranking for regression fitting is often 
deployed to find out the shape and scale parameters for complete life data 
(Abernethy 2006). 

The probability of failure at time t , also referred to as the Weibull distribution or 
the cumulative distribution function (CDF), can be derived from Eq. 1 and 
expressed as 



Thus, the Weibull reliability at time t, which is 1 — F(t) = R(t ), is defined as 



1 - F(t) 




This can be written as 


1 

1 - W) 




Taking two times the natural logarithms of both sides gives an equation of a 
straight line: 


In In 



= pint — p\mj 



Equation 5 represents a straight line in the form of “y = ax + b” on 
log/log(Y) versus log(X), where the slope of the straight line in the plot is ft, 
namely, the shape parameter of Weibull distribution. Through the above transfor- 
mation, the life data samples can be fitted in the Weibull model and the two Weibull 
parameters can be estimated. 
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Table 1 Life data of a 
critical component in a 
diesel engine 


No. 

Failure time (h) 

1 

38,456 

2 

48,334 

3 

50,806 

4 

51,521 

5 

61,544 

6 

66,667 

7 

72,605 

8 

75,521 

9 

80,785 

10 

84,894 


The mean of the Weibull PDF, T, which is the MTTF in Weibull analysis, is 
given by 



where T is the gamma function. 

It is noted that when /3 = 1, MTTF is equal to rj. In fact, as a rough approxima- 
tion, in practices of Weibull analysis where is equal to or slightly larger than 1, the 
characteristic life can be approximated as MTTF. However, for that is much 
larger than 1, MTTF should be calculated using Eq. 6. This will be further discussed 
in the example elaborated in the next section. 


Weibull Analysis of Life Data: An Illustrative Case Study 

In the life cycle management of a certain type of heavy-duty diesel engine, it is 
required to quantify the life characteristics of a critical component in order to 
understand its reliability and remanufacturability. The engine manufacturer has 
provided a past record of 10 failure cases of the said component under normal use 
conditions. The complete life data, i.e., the failure time of each sample, is shown in 
Table 1. 

Assume that our objectives in this case study include: 

1 . Determine the Weibull parameters and derive the Weibull distribution model for 
the data given. 

2. Estimate the average life of the component (i.e., the MTTF or mean life). 

3 . Estimate the time by which 5 % of the components will fail or the time by which 
there is a 5 % probability that the component will fail. 

4. Estimate the reliability of the components after a given number of hours of 
operation. 
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Fig. 2 Weibull probability plot 


5. Estimate the warranty time for the component if the manufacturer does not want 
failures during the warranty period to exceed 5 %. 


In the following sections, Weibull analysis will be conducted to address the 
above objectives. 

In this case study, the 2-parameter Weibull analysis is deployed to analyze the 
life data characteristics of the diesel engine component. First of all, the parameters 
are estimated based on the 2-parameter Weibull analysis, in which the standard 
ranking method and median-rank regression are used to fit the given data in Table 1. 
As discussed earlier in section “Basics of Weibull Distribution,” regression method 
should be selected to fit the data when the data sample is small and contains 
complete life data. The fitting plot is shown in Fig. 2 and the two Weibull 
distribution parameters are calculated as follows: fi = 4.40 and r\ — 69,079.89 
(hours). 

After the two parameters of /? and r\ are determined, the Weibull PDF expressed 
by Eq. 1 can be obtained as shown below: 


/(0 


4.4 


t 


3.4 


69079 


69079.89 V69079.89 


79.89) 


4.4 


After simplification the Weibull PDF is plotted as shown in Fig. 3. 
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Fig. 3 Weibull probability density function 

As discussed earlier, the MTTF or mean life is a very important indicator of the 
life data characteristics in life cycle engineering. MTTF can be either approximated 
by the value of rj in cases where fi is slightly larger but close to 1 or calculated using 
Eq. 6 if is much larger than 1 or a more accurate value is required. In this case 
study, = 4.40 and therefore Eq. 6 is used to calculate the exact MTTF instead. 
The MTTF calculated is 62,952.73 h, and it is much smaller than the value of 
characteristic life rj , which is 69,079.89 h. 

As shown in Fig. 2, in the Weibull probability fitting plot of the case study, the 
x-axis represents time using a logarithm scale, and the probability of failure is 
displayed on the y-axis using a double log reciprocal scale. Such a Weibull 
probability plot is able to tell very important information about the characteristics 
of the failures. From the plot, the probability of failure at a given time, or vice versa, 
can be obtained. For example, it may be of interest to determine the time at which 
1 % of the population will have failed. For more serious or catastrophic failures, a 
lower risk may be required, for instance, six-sigma quality program goals often 
equate to 3.4 parts per million (PPM) allowable failure proportion. Such important 
information can be easily obtained from the Weibull probability plot. In this case 
study, for the red dot (point A) shown on the plot in Fig. 2, the x and y coordinates 
are x = 35,221.228 h and y = 5.053, respectively, which can be interpreted in the 
following way: the failure probability of the component at the time of 35,221.228 h 
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Time (Hr) 


Fig. 4 Weibull reliability plot 

is 5.053 %, or the average time by which 5.053 % of the components will fail is 
35,221.228 h. 

It is known that reliability analysis is a very important issue in life cycle 
engineering. In this case study, the Weibull reliability function can be calculated 
based on Eq. 3 and its plot is shown in Fig. 4. Figure 4 can be easily used to answer 
the estimate of reliability of the component after a certain number of hours of 
operation. For example, for point B shown on the plot in Fig. 4, the x and 
y coordinates are x = 32,090.306 h and y = 0.966, respectively, which can be 
interpreted in the following way: the reliability of the component after 
32,090.306 h of operation is 96.6 %. 

This is in fact the reverse interpretation of the coordinates of x and y in Fig. 4. 
For example, if the manufacturer does not want failures during the warranty period 
to exceed 3.4 % (i.e., the required reliability is 96.6 %), then the maximum warranty 
time promised to customers should not exceed 32,090.306 h, as shown by point B in 
Fig. 4. 

In life cycle engineering, failure rate is another important indicator of life data 
characteristics. Failure rate is usually defined as the frequency with which a 
product or component fails, and it is often expressed in failures per unit of time 
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Fig. 5 Failure rate versus time 


(e.g., per hour in this case study). The failure rate of a product usually depends on 
time, with the rate varying over the life cycle of the product, as shown in the 
“bathtub curve” failure modes in Fig. 1. The failure rate of the case study is 
calculated and shown in Fig. 5. The increasing failure rate shown in the figure 
confirms that the life data in Table 1 follow a wear-out failure mode. 


Condition Prediction Using Enhanced Online Learning 
Sequential-Fuzzy Neural Networks 

Machine health condition (MHC) prediction is useful for preventing unexpected 
failures and minimizing overall maintenance costs since it provides decision- 
making information for condition-based maintenance (CBM) (Vachtsevanos 
et al. 2006). Typically, MHC prediction methods can be divided into two catego- 
ries, namely, model-based data-driven methods (Jardine et al. 2006). Due to the 
difficulty of deriving an accurate fault propagation model (Yu et al. 2012;, 
Yu et al. 2011), researches have focused more on the data-driven method in recent 
years (Si et al. 2011). The neural network (NN)-based approach, which falls under 
the category of the data-driven method, has been considered to be very promising 
for MHC prediction due to the adaptability, nonlinearity, and universal function 
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approximation capability of NNs (Tian and Zuo 2010). Batch learning and 
sequential learning are two major training schemes of NNs. MHC prediction is 
essentially an online time-series forecasting problem which should perform real- 
time prediction while updating the NN. Thus, to save updating time and to maintain 
consistency of the NN, the sequential learning should be employed in such a 
problem. 

The most popular NNs applied to MHC prediction are recurrent NNs (RNNs) 
and fuzzy NNs (FNNs). In Gu and Li (2012), an extended RNN which contains both 
Elman and Jordan context layers was developed for gearbox health condition 
prediction. In Zhao et al. (2009), a FNN in Brown and Harris (1994) was applied 
to predict bearing health condition. In Wang et al. (2004), an enhanced FNN was 
developed to forecast MHC. Next, in Wang (2007) and Liu et al. (2009), a recurrent 
counterpart of the approach in Wang et al. (2004) and a multistep counterpart of the 
approach in Wang (2007) were presented to predict MHC, respectively. An interval 
type-2 FNN was also proposed to predict bearing health condition under noisy 
uncertainties in Chen and Vachtsevanos (2012). Note that the batch learning was 
employed in Tian and Zuo (2010), Zhao et al. (2009), and Chen and Vachtsevanos 
(2012). Common conclusions from Tian and Zuo (2010), Zhao et al. (2009), and 
Wang et al. (2004), Wang (2007), Liu et al. (2009), and Chen and Vachtsevanos 
(2012) are that the RNN usually outperforms the feedforward NN and the FNN 
usually outperforms the feedforward perceptron NN, feedforward radial basis 
function (RBF) NN, and RNN. Recently, to improve prediction performance 
under measurement noise, an integrated FNN and Bayesian estimation approach 
was proposed for predicting MHC in Chen et al. (2012), where a FNN is employed 
to model fault propagation dynamics offline and a first-order particle filter is 
utilized to update the confidence values of the MHC estimations online. In Chen 
et al. (2011), a high-order particle filter was applied to the same framework of Chen 
et al. (2012). A question in the approaches of Chen et al. (2011, 2012) is that the 
FNNs should be trained by the system state data (rather than the output data) which 
are assumed to be immeasurable. 

Extreme learning machine (ELM) is an emergent technique for training 
feedforward NNs with almost any type of nonlinear piecewise continuous hidden 
nodes (Huang et al. 2006). The salient features of ELM are as follows (Huang 
et al. 2006): (i) All hidden node parameters of NNs are randomly generated without 
the knowledge of the training data; (ii) it can be learned without iterative tuning, 
which implies that the hidden node parameters are fixed after generation and only 
output weight parameters need to be turned; (iii) both training errors and weight 
parameters need to be minimized so that the generalization ability of NNs can be 
improved; and (iv) its learning speed is extremely fast for all types of learning 
schemes. ELM demonstrates great potential for MHC prediction due to these salient 
features. Nonetheless, the original ELM proposed in Huang et al. (2006) is not 
appropriate for predicting MHC since it belongs to the batch learning scheme. To 
enhance the efficiency of ELM, online sequential ELM (OS -ELM) was developed 
in Liang et al. (2006) and was further applied to train the FNN in Rong et al. (2009). 
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Due to its extremely high learning speed, the OS-ELM-based FNN in Rong 
et al. (2009) seems to be suitable for MHC prediction. Yet, there are two drawbacks 
in Rong et al. (2009) as follows: (i) It is not good to yield generalization models 
since only tracking errors are minimized and (ii) it may encounter singular and 
ill-posed problems while the number of training data is smaller than the number of 
hidden notes. 

To further improve the efficiency of MHC prediction, a novel FNN with an 
enhanced sequential learning strategy is proposed in this paper. The design proce- 
dure of the proposed approach is as follows: Firstly, a ellipsoidal basic functions 
(EBFs) FNN is proposed; secondly, the FNN approximation problem is transformed 
into the bi-objective optimization problem; thirdly, an enhanced online sequential 
learning strategy based on the EFM is developed to train the FNN; and finally, a 
multistep direct prediction scheme based on the proposed learning strategy is 
presented for MHC prediction. The developed enhanced online sequential 
learning-FNN (EOSF-FNN) is applied to predict bearing health condition by the 
use of real-world data from accelerated bearing life. Comparisons with other 
NN-based methods are carried out to show the effectiveness and superiority of 
the proposed approach. 


Architecture of Fuzzy Neural Network 

For MHC prediction, the ft-input single-output system is considered. Yet, the 
following results can be directly extended to the multi-input multi-output 
(MIMO) system. The FNN is built based on an EBF NN. It is functionally 
equivalent to a Takagi-Sugeno-Kang (TSK) fuzzy model that is described by the 
following fuzzy rules (Wu et al. 2001): 


Rule R J : IF x\ is A\j and- • -and x n is A n j THEN y is no- 



where Xj G M andy G M are the input variable and output variable, respectively; A t j 
is the antecedent (linguistic variable) of the zth input variable in the yth fuzzy rule; 
Wj is the consequent (numerical variable) of the yth fuzzy rule, i = 1 , 2, . . ., nj = 1 , 
2, . . ., L; and L is the number of fuzzy rules. 

As illustrated in Fig. 6, there are in total four layers in the FNN. In Fayer 1, each 
node is an input variable x t and directly transmits its value to the next layer. In Fayer 
2, each node represents a Gaussian membership function (MF) of the corresponding 
Ajj as follows: 



where c /; - G M and <r z y G M + are the center and width of the zth MF in the jth fuzzy 
rule, respectively. Note that the MF in Eq. 8 is an EBF since all its widths o,y are 
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Layer 1 A n Layer 2 Layer 3 Layer 4 



Fig. 6 Architecture of fuzzy neural network 


different (Wu et al. 2001). In Layer 3, each node is an EBF unit that denotes a 
possible IF-part of the fuzzy rule. The output of the yth node is as follows: 



T 


where X = [xi,x 2 ,* • -,x n \ G 



n 


I C I/’ c 2h 


Cni\ G 



n 


and Gj 


[&lh °2 h 


°nj\ F 



n 


J ’ n J A ^ ^ J 175 -z/ 

. In the last layer, the output y is obtained by the weighted summation 


of 4>, as follows: 





/(x|fL, c, a) = d>(x|c, (j)W 



where /(•) : m«+ l ( 1 + 2 «) h ^R , O = [()>!, cj) 2 , • • •, E R L , c = [c 1? c 2 , • • •, c L ] T 
G R l ’ ”, o — [<7i, <r 2 , • • •, <r L ] r G R l ’ n , and Vk = [uq, w 2 , • • •, w L ] T G R L . 

For the TSK model, the THEN-part wj is a polynomial of x L which can be 
expressed as follows: 


Wj = aq 1/ + OtiyXi H b a*/** (11) 

where crq/, cri/, * * •> a n j E R are weights of input variables in the yth fuzzy rule. The 
following lemma shows the universal function approximation property of the 
proposed FNN. 

Lemma 1 (Lin and Cunningham 1995) For any given continuous function /(x) : 
2T-+R and arbitrary small constant £ € M + , there exists a FNN in Eq. 10 with 
proper parameters W, c, and g such that 



where V C R" is an approximation region. 
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Online Sequential Learning Strategy 

For training FNNs, consider a data set with N arbitrary distinct training samples: 

T" 1 

Nn = {(x ; ,y ; )} ;=1 , where x, = [x n , x n , ■ ■ x, n ] G R", y t G R, and / is the 

number of the sampling point. If a FNN with L hidden nodes can approximate 
these N samples with zero error, then there exist proper parameters W, c, and a such 
that 


<F(X/|c,c)lF = y. 


(13) 


for all / = 1, 2, . . N. Since wj in Eq. 11 can be rewritten into Wj 


j 


xT.CC; with 


le^J 


X/, 


1, X 


T 

I 


1 T ^ TTD/7 + 1 



rr r 1 

and a, = [a 0/ , aw, • • •, a,„] G R" , one gets 


v 


W 


T 


T 


T 


iT 


x te a 1 ,x te a 2 ,...,x te a L 


(14) 


Substituting Eq. 14 into Eq. 13 for all / = 1, 2, . . ., N, applying the definition of 
O, and making some manipulations, one gets 


X le(4>l a l + 4>2 a 2 + 

xL(<|)iai + c|>2 0t2 + 


+ 4>L«i) 


+ 4*2 a 2 + 


+ ^L 0 ^) . 


From the above expression, it is easy to show that 


x i>i. 

x L4h, 


X l4>2> 

x L 4> 2 . 


. x L4>l 
* x L4>l 
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which can be written into the following compact form: 


H(X,c,a)Q = Y 


(15) 


X 


T T T T 

[oci, 0 C 2 , • • •, ocj G 


where 

Q = 

H G 
rules given by 


iT 


X U X 2 ,---,X N G 



Nxn 


Y 


[yi, >’2, 


, y N f e 



N x 1 


and 



(w + 1)L x 1 



N x (n + 1 )L 


is the consequent parameter matrix and 
is the hidden matrix weighted by the fired strength of fuzzy 


x^/^x^ci.oi),- • -,x\ e f L (xi,c L ,a L ) 

x L/t ( x 2. Cl , (T! ),•••, x^/ L (x 2 , C L , o L ) 
X Nef l( X N, Cl, Oi), ■ ■ -,x T Ne f L (x N , c L ,a L ) 


H(X,c,a) 


(16) 


3254 


X. Li et al. 


From ELM theory, the parameters c and a in Eq. 16 can be randomly generated 
and fixed after generation, i.e., the updating of antecedent parameters is not 
necessary. Usually, the equality in Eq. 15 cannot be obtained due to the limitation 
of FNN scale. Consider the following minimizing problem: 


min ( I \HQ 

Q V 


F|| 2 + A||G|| 2 ) 



where II • II denotes the Euclidean norm and X is a real positive constant. The least- 
squares solution of Q in Eq. 17 is as follows: 




Q 


(H t H + Al) 'H r Y. 


(18) 


Now, give an initial data set Vo = {(•*/, From Eq. 18, one immediately 


gets 


Go = K-'HlY () 


(19) 


K 


0 


Hi H 0 + II 


( 20 ) 


where Eq 


3U 3^2’ 


iT 


3W 0 J , H 0 = H(X 0 , c, a), and X 
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Xi,X 2 ,---,X 



T 


Let y t be the estimation of y t with / 
is as follows: 


1 , 2 , 


. The FNN output at the initial phase 



where To 


3^2’ •••, y N 


iT 


oJ 


Then, present the (k + l)th chuck of new observations: N \+i = { (x/, 3 7 / ) } with 


/ 


I 


k 

j 


(Ay + 1, Z 


k 

j 


qNj + 2, • • •, Z 


k + i 

j = 


o Nj, where N f denotes the number of 


j 


observations in the ( k + l)th chunk. From Liang et al. (2006), one obtains the RLS 
solution for Q in Eq. 17 as follows: 


K 


k + 1 


K k + Hj _ , H 


k + 1 


(22) 


/\ 
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Q k + «rX, i Y k+t - h 
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where H k+ \ = H(X k+l ,c,o ) , X^ +1 
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The FNN output at the 


learning phase is as follows: 
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Y k+ 1 = H k+ 1 Q k+ j (24) 


where Y k+ i 



To avoid the singular problem for the matrix inversion of K k + pn Eq. 23 while 
N 0 < L, one makes P 0 = K 0 1 and applies the Woodbury identity to calculate P 0 as 
follows (Huynh and Won 2011): 


P 0 = I/X - Hi (XI + HoHl) l H 0 /\. (25) 

Similarly, to avoid the ill-posed problem so that the computational cost for the 
matrix inversion of K k+ i in Eq. 23 while N { <C L can be reduced, one makes 

i i A 

P k = K k and P k + | = K k + 1 and applies the updating law of Q k+ \ as follows: 

Pk+ 1 =Pk~ PkH T k+ , (/ + (26) 

Qk+ 1 = 0/. + ^ 'k+\Hk+\ (jk+ 1 — Hk+iQi^j ■ (27) 


Multistep Prediction Scheme 

MHC prediction is essentially an online time-series prediction problem which 
should carry out updating and prediction concurrently. To carry out multistep direct 
prediction, consider the nonlinear autoregressive with exogenous input (NARX) 
model as follows: 


y s ( k + r) =f(y s (k), y s (k — r) , y s (k-2r), 

x s (k), x,(k-r ), y s (k-2r), • 


. y s ( k 


nr 


x 



nr 



(28) 


step, n + 1 is the maximum lag, i.e., the order of the system. Then, give a time- 
series data set T = {(x 5 (/),y 5 (z))}^i , its initial set To = {(* s (/),y s (/))}”= i with 
zz 0 > (zz + l)r, and choose the root-mean- square error (RMSE) as the performance 
index. Based on the proposed learning strategy, the multi step direct prediction 
scheme of time-series is presented as follows: 


Step 1) Offline initialization: Obtain the initial training data set 


Af 


o 


{(*/, 3';)}Si> where 


«o 


(n + 1 )r and 


x/ 


x s (l),x s (l + r), ■ ■ ■ ,x s (l + nr 


y s (i)’y s (i + r )’---,y s {i + nr 


T 


(29) 


yi =y s 0 + (! +n)r). 


(30) 
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(a) Randomly generate parameters c and a. 

(b) Calculate H o = H(X o,c,o) by Eq. 16, where Xq 


Xi,x 2 ,- • -,X N 


iT 


oJ 


(c) 




Calculate Q 0 using Eq. 19 with Eq. 20 (if N 0 > L) or with Eq 
(if N 0 < L). 


25 


(d) Calculate the initial training performance: RMSE train (Y 0 , To) with Y 0 = H 


o 


Go and Y () 


y i. ?2 


}'n 


iT 


oJ 


(e) Predict the next r step’s time- series: 


TVo+r 


Hlxi, . ,c,a I Q n . 


‘N 0 -hr 




(f) Let V I0 = y No+1 and E 10 = J Wo+ i 

(g) Set the training step: k = 


Hlxl +1 ,c,a)Q 


0 


0. 


Step 2) Online sequential prediction: Present the ( k + l)th training data set 


M 


k + 1 


(*jv„+*+i> yN a +k+i) . where x No+k+1 and y No+k+l are given by 


Eqs. 29 and 30, respectively. 


(a) Calculate H k+ \ 


H lx^ +t+1 ,c,a) by Eq. 16. 


(b) Update the prediction performance by Eq. 31: RMSEp re d(E( J (- + iu, Yi k+ 1 u 
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(c) Update Q k+ \ using Eq. 23 with Eq. 22 (if N k+ i > L) or by Eq. 27 with 


Eq. 26 (if N k 


+ i 


< L). 


(d) Update the training performance by Eq. 31: RMSE tra i n (7^ + i, Y k +\) , Y k+ 1 


= [Y T k ,y No+k+l \‘ , Y k+ 1 = H(X k+u c,a)Q k+l , and X* +1 
(e) Predict the next r step’s time- series: 


iT 
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^‘Ic ’ ^A/o+^+l 


iT 


y (Nq +&+l)+r 


H x 
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(Vo +&+ 1 ) +r 




,c,a )Q 


k + 1 


(f) Set the training step /: = /:+! and go to Step 2. 


Simulation Studies 

The applied MHC monitoring data were collected from PRONOSTIA, an experi- 
mental platform dedicated to test and validate bearings fault detection, diagnostic, 
and prognostic approaches (Nectoux et al. 2012). As shown in Fig. 7, the 
PRONOSTIA is composed of three main parts: a rotating part, a degradation 
generation part, and a measurement part. The main objective of PRONOSTIA is 
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Thermocouple 

— J 


Fig. 7 Experimental platform PRONOSTIA 



Fig. 8 An example of the vibration raw signal 


to provide real experimental data that characterize the degradation of ball bearings 
along their whole operational life. This platform allows accelerating bearing deg- 
radation in only few hours. An example of the vibration raw signal gathered during 
a whole experiment is shown in Fig. 8. The non-trendable and nonperiodical 
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a b 



c d 



Fig. 9 Initial training response of the proposed approach (a) 1 step ahead (b) 2 steps ahead 
(c) 5 steps ahead (d) 10 steps ahead 


statistical properties of this type of signals increase the difficulty of MHC prediction 
(Porotsky 2012). 

In this study, two bearing data sets are chosen under the operating conditions: 
1,800 rpm speed and 4,000 N load to carry out simulation. For the NARX model in 
Eq. 28, set n = 1 and r — 1, 2, 5, or 10 and select x s as the standard deviation (STD) 
of each vibration data set which consists of 2,560 vibration signals and y s as the 5 % 
trimmed mean of the vibration signal. The prediction procedure is as follows: 

First, the offline initialization is carried out based on one data set to obtain an 
intimal FNN model; second, the online prediction is carried out based on another 
data set to forecast time-series of r steps ahead. To demonstrate the superiority of 
the proposed EOSL-FNN, the OS-ELM in Liang et al. (2006) and the NARX-NN 
are selected as the compared methods, where 10 notes are applied to the NARX-NN 
and 100 notes with 2 = 0.001 are applied to the EOSL-FNN and OS-ELM. Two 
performance indexes, namely, the RMSE and the mean absolute percentage error 
(MAPE), are defined as follows: 


RMSE(f, Y) 
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Fig. 10 Initial training errors of the proposed approach (a) 1 step ahead (b) 2 steps ahead (c) 
5 steps ahead (d) 10 steps ahead 


a 

MAPE = - (EL & - y*)/y) x 100% - ( 32 ) 

The accuracy index is defined as (100 % — MAPE). 

The initial training and online prediction performance of the proposed EOSL- 
FNN are depicted in Figs. 9, 10, 11, and 12. One observes that high training and 
predicting accuracy is obtained under small ahead step, and satisfied training and 
predicting accuracy is still obtained under large ahead step. The performance 
comparisons of all prediction methods in terms of the time, RMSE, STD, and 
accuracy are shown in Table 2. Note that the results are obtained from averaging 
10 times’ simulation results. One observes that both the EOSL-FNN and the 
OS -ELM are extremely faster (with small training and predicting time) and more 
stable (with small STD) than the NARX-NN, the EOSL-FNN performs similar or 
better (with small RMSE and accuracy) than the NARX-NN and OS -ELM, and the 
EOSL-FNN performs a little slower (with larger training and predicting time) than 
the OS-ELM since it contains more adjusting parameters. 
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a b 




Fig. 11 Online prediction response of the proposed approach (a) 1 step ahead (b) 2 steps ahead 
(c) 5 steps ahead (d) 10 steps ahead 


Summary 

In this chapter, both enhanced online sequential learning-fuzzy neural network 
(EOSL-FNN) and Weibull modeling are presented for predicting machine health 
condition and life cycle reliability analysis. 

The Weibull distribution is among the most popular in the field of life cycle 
engineering and reliability analysis because it is able to accommodate various types 
of failure data by manipulation of its parameters. The case study presented in this 
chapter has successfully shown that Weibull analysis can provide a simple and 
informative graphical representation of characteristics of life data, especially when 
the life data sample is small and other statistical tools fail to given useful information. 
Weibull analysis is able to answer many life cycle engineering problems such as 
mean life estimation, reliability of products at any operational time and warranty cost 
estimation, etc. The advantages of Weibull model in life data analysis can be 
extended to facilitate decision-making processes in many remanufacturing practices 
such as prediction of the number of cores returned for remanufacturing, estimation of 
spare parts or remedy resources needed for each failure mode, and so on. Future work 
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Fig. 12 Online prediction errors of the proposed approach (a) 1 step ahead (b) 2 steps ahead (c) 
5 steps ahead (d) 10 steps ahead 


of this study will include (1) the analysis of incomplete life data for the support of 
decision-making in product EOL management and remanufacturing processes and 
(2) comparison and optimization of parameter estimation methods in Weibull model. 
It is envisaged that the extension of this research will see more robust capabilities of 
Weibull analysis in life cycle engineering applications. 

The novel EOSL-FNN models have been developed and successfully applied to 
predict machine health condition. An online sequential learning strategy based on 
the ELM is developed to train the FNN. A multistep time-series direct prediction 
scheme is presented to forecast bearing health condition online. The proposed 
approach not only keeps all salient features of the ELM, including extremely fast 
learning speed, good generalization ability, and elimination of tedious parameter 
design, but also solves the singular and ill-posed problems caused by the situation 
that the number of training data is smaller than the number of hidden nodes. 
Simulation studies using real-world data from the accelerated bearing life have 
demonstrated the effectiveness and superiority of the proposed approach. Further 
work would focus on bearing long-term condition and remaining useful life pre- 
diction using online dynamic FNNs. 
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Abstract 

Unlike in traditional manufacturing, remanufacturers face uncertainty in quality, 
quantity, and frequency of returned products, making the remanufacturing 
processes less predictable and remanufacturing decision-making more difficult. 
The research on the use of embedded smart sensors in products to facilitate 
remanufacturing through monitoring and registering information associated with 
the products, e.g., their state-of-health, remaining service life, remanufacturing 
history, etc., has received increasingly high level of interests. This chapter first 
introduces the background of sensor-embedded products, including the essential 
parts of a typical smart sensor and product information model. Next, the current 
practices toward the development of embedded smart sensors in products are 
reviewed in detail in two aspects, namely, (1) embedding smart sensors in 
products and (2) representing and interpreting sensor data. A conceptual frame- 
work is presented to illustrate how sensor data gathered using smart sensors can 
be managed to facilitate product remanufacturing decision-making. Lastly, 
future research trends are given to address the challenges efficiently in using 
embedded smart sensors for facilitating remanufacturing processes and 
planning. 


Introduction 

Product remanufacturing is regarded as one of the most beneficial product end-of- 
life (EoL) alternatives for manufacturers to meet increasingly more stringent 
environmental regulations and government legislations for sustainable product 
development. It presents an environmentally more attractive option than material 
recycling as it retains the intrinsic values of a product, e.g., material and energy 
consumed during manufacturing. It differs from component reuse as it aims to 
return a used product to a condition with equivalent or even better performance than 
a new product. Uncertainty in the quality, quantity, and frequency of product 
returns has been identified as one of the prevalent issues faced by the remanufac- 
turers, and it has significant impact on the decision-making for product 
remanufacturing. Various research studies have addressed this issue, e.g., the effect 
of quality categorization of product returns (Aras et al. 2004; Zikopoulos and 
Tagaras 2007), customer incentives to promote core returns (Liang et al. 2009), 
optimal core acquisition quantities under quality uncertainty (Galbreth and Black- 
burn 2010), etc. Apart from this perspective, it has been identified that the presence 
of uncertainty in product returns is mainly due to a lack of information associated 
with the usage of these products (Klausner et al. 1998; Parlikad and McFarlane 
2007; Hribemik et al. 2011). Therefore, embedding smart sensors to gather useful 
information of a product has received considerable research interest, presenting a 
desirable solution such that the condition of the returns is traceable and predictable. 
Such information could be product identity, constituent components, remaining 
service life, remanufacturing history, etc. At the time of a product return, all the 
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information can be made available to the remanufacturers or recyclers for sound 
product EoL decision-making. 

There are two main areas of research on the use of smart sensors in facilitating 
EoL product recovery. The first area addresses issues in embedding smart sensors in 
products, and the second area investigates sensor data interpretation and manage- 
ment and the use of such data in EoL decision-making. In this chapter, section 
“Sensor-Embedded Products” briefly introduces sensor-embedded products, 
including the essential parts constituting a typical smart sensor, product information 
model essential for data management, and examples of smart sensors that are 
commercially available. Sections “Embedding Smart Sensors in Products” and 
“Sensor Data Representation, Fusion and Interpretation” review the research on 
the development of embedded smart sensors in products from two aspects, namely, 
embedding smart sensors in products and sensor data representation and interpre- 
tation. In section “Framework Based on Sensor Data for Product Remanufacturing 
Decision-Making,” a conceptual framework is presented to illustrate the use of 
sensor data gathered using smart sensors to facilitate product remanufacturing 
decision-making. Section “Challenges and Issues on the Use of Embedded Smart 
Sensors” summarizes the challenges in using embedded smart sensors to facilitate 
remanufacturing processes and planning, followed by section “Summary” the 
future research trends. 


Sensor-Embedded Products 

A sensor-embedded product contains sensors that monitor the product during its life 
cycle and collect the product life-cycle data. Such data have significant positive 
impact on closed-loop product life-cycle management, which benefits all the 
stakeholders in the entire product life cycle. For example, original equipment 
manufacturers (OEMs) can receive the design feedback to improve the current 
design; the maintenance teams can schedule for the potential maintenance tasks and 
prepare the spare parts accordingly; independent remanufacturers (IRs) can access 
the conditions of used equipment for effective remanufacturing processes planning; 
and end users or customers could be notified of potential equipment failure to 
prevent the production downtime. 


Product Life-Cycle Information 


Information required for effective EOL decision-making can be generally classified 
into internal and external information (Klausner et al. 1998; Harrison 2003; 
Parlikad and McFarlane 2007). External information includes information such as 
the market trends, legislative policies, corporate policies, etc., which are not 
directly related to a product but have considerable impact on the choice of the 
recovery options. Internal information can be categorized into static information 
and time-stamped historical information. Static information is associated with the 
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Fig. 1 Product data in a Product Life-cycle Information Requirements 

product life cycle 



intrinsic characteristics of a product, e.g., bill of material (BOM), design informa- 
tion, production processes, disassembly sequence, designed life-span, etc. Such 
information is normally determined at the product design and manufacturing stages, 
but subject to changes during the remanufacturing stage when modifications may 
have been made to the product design. Time- stamped historical information has 
dynamic characteristics associated with the use of a product and refers to product 
conditions that can be represented by a sequence of data with respect to time, e.g., 
the remaining life-span of a product/component in its current use cycle, cumulative 
service time, number of times a product has been remanufactured or reused, part 
repair/replacement history, etc. In particular, the remaining service life of a product 
can be deduced from data obtained through embedding sensors to monitor product 
use. The part repair/replacement history, which is normally made available during 
maintenance or servicing, can help identify the typical failures of a product due to 
factors such as design flaws, patterns of usage, etc. Figure 1 presents a summary of 
product information available during a product’s life cycle. 


Smart Sensors 

The development of microelectromechanical systems (MEMS) technology has 
enabled the manufacturing of smart sensors for various applications in smaller 
sizes and lower cost. A generic smart sensor for product life-cycle data acquisition 
contains a set of essential components (Zeid et al. 2004; Vadde et al. 2008), namely, 
(a) a sensing element to register the environmental parameters, e.g., temperature, 
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Table 1 Summary of smart sensors for product EoL recovery 


Sensor 

References 

Description/application 

Electronic data log 
(EDL) - prototype 

Klausner et al. (1998) 

Self-contain data 
acquisition 

Usage pattern 
interpretation 

Reuse potential of 
components 

RFID tags - 
prototype 

Parlikad and McFarlane (2007), (2010); 
Ilgin and Gupta (2010); Ilgin et al. (201 1); 
Ferrer et al. (2011); Ondemir and Gupta 
(2013) 

Real-time localization of 
returned product/ 
component 

Component 

identification 

Component 

classification 

Wireless data 
transmission 

Life-cycle unit 
(LCU) - prototype 

Seliger et al. (2003) 

Self-contain data 
acquisition 

Actuators supporting 
intelligent disassembly 

Maintenance scheduling 

Watchdog agent - 
commercial 

Lee et al. (2006) 

Comprehensive data 
processing capability 

E-maintenance 

Autonomous 
monitoring device - 
commercial 

Cheng et al. (2010) 

RFID tag as sensor ID 

Wireless data 
transmission 

Electronics health 
monitoring and 
management 


pressure, etc., and convert them into suitable signal forms; (b) a microprocessor to 
process the received signals; (c) a memory to store the received sensor data and 
output from the microprocessor; (d) a data transmitter to transmit the data from the 
smart sensor to the communication network; (e) a power supply from product 
power source or a separate battery; and (f) a sensor identification (ID). There are 
cases where the sensing elements are not physically integrated with the data 
processing unit, e.g., an engine’s electric control unit (ECU) receives data from 
various engine sensors to monitor and regulate the engine ’ s performance (Fleming 
2001, 2008). Existing research has explored the use of embedded sensors to 
improve the efficiency of product life-cycle management. Table 1 summarized 
some examples of embedded smart sensors. 

An example is a data-recording device known as electronic data log (EDL) 
(Klausner et al. 1998). It measures and stores the parameters that indicate the 
degradation of the motor of a consumer product during the use stage. The data 
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obtained through EDL can provide valuable insights into the usage patterns of the 
products. Simon et al. (2001) reported the use of self-contained data acquisition 
units in washing machines for monitoring the usage parameters and error condi- 
tions, which can be accessed either during servicing or at EoL stage. Seliger 
et al. (2003) developed a microsystem prototype called the life-cycle unit (LCU) 
for product and component monitoring. The LCU can acquire and analyze the data 
of the status of a product or a component and inform both the users and servicing 

T'A/r 

personnel of the maintenance schedules in advance. Watchdog Agent 
(Djurdjanovic et al. 2003; Lee et al. 2006) is a commercially available smart 
sensing system known for its applications in predictive condition-based mainte- 
nance, where the performance degradation in a process, product, or component 
is assessed through multiple sensor-based diagnostic and prognostic techniques. 
Pecht (2008) has conducted a comprehensive survey on the commercial sensor 
systems used in prognostics and health management for electronic products and 
systems. 

Radio -frequency identification (RFID) tags have been used to replace barcodes 
as product/component IDs to provide easy access to retrieving, updating, and 
managing product information in an entire life cycle (Kiritsis et al. 2003; Parlikad 
and McFarlane 2007). Kulkarni et al. (2007) investigated the practical and eco- 
nomic impact of using RFID in alleviating the quality uncertainty associated with 
the remanufacturing processes. Ferrer et al. (2011) reported an application of RFIDs 
where active RFID can be used for easy identification and localization of compo- 
nents within a remanufacturing facility and passive RFID can be permanently 
tagged onto components of remanufacturable products at the beginning of their 
service life. The limitation of the RFID is the storage capacity as it cannot store the 
large amount of data that has been gathered over a period of time. An alternative is 
to use a RFID tag as an ID to identify a specific smart sensor with embedded 
memory chip for data storage. Cheng et al. (2010) introduced an autonomous 
monitoring device for prognostics including RFID tags for sensor identification 
and wireless data transmission and sealed processor and memory chip for signal 
processing and storage. 


Embedding Smart Sensors in Products 

The conditions of the returned products would affect the viability and profitability 
of remanufacturing. Sensors can be installed onto a product for condition monitor- 
ing and data collection such that these data can be analyzed for EoL decision- 
making. Embedding smart sensors in products requires considerable domain 
expertise. The general principle is that the performance of the product should not 
be compromised in terms of functionality and reliability when sensors are embed- 
ded in the product. The two primary concerns of embedding smart sensors are 
(1) sensor selection and (2) sensor placement and installation. Table 2 summarized 
some recent research work on these two directions. 
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Sensor Selection 

A selection of sensors to be installed in a product is a prerequisite for product life- 
cycle data monitoring and collection. Some essential considerations for selecting 
the sensors are the parameters to be monitored, the electrical and physical charac- 
teristics of the sensors, reliability and performance, data transmission options, 
protective measures on the sensors, and cost (Zeid et al. 2004; Cheng et al. 2008). 
It is desirable to have a smart sensor system with multiple sensing abilities, 
miniature size and light weight, low power consumption, considerable long 


Table 2 Research work on embedding smart sensors for condition monitoring 


Area of 
research 

Authors (year) 

Research 

issues/ 

considerations 

Methodologies 

Applications/ 

validation 

Parameter 

identification/ 

sensor 

selection 

Al-Habaibeh 
and Gindy 
(2000) 

Number of 
sensors 

Automated sensory and 
signal processing 
selection 

Milling 

cutter 

The cost and 
time for the 
design of 
monitoring 
system 

Taguchi’s orthogonal 
arrays 

Pattern recognition 

Mushini and 
Simon (2005) 

Parameter 

uncertainty 

Optimization 
algorithm: exhaustive 
search, probabilistic 
search, genetic 
algorithm 

Aircraft gas 

turbine 

engines 

Cost of the 

selected 

sensors 

Santi 

et al. (2005) 

Diagnostics 

performance 

Failure modes and 
effects analysis 

Boost stage 
rocket engine 

Risk reduction 
potential of 
selected sensor 
set 

Genetic algorithm 

Cheng 
et al. (2008); 
Kumar 
et al. (2010) 

General 

guidelines 

Failure modes and 
effects analysis 

Electronics 

products 

Borguet and 
Leonard 
(2008) 

Sensor noise 
(sensitivity) 

Information theory 

Turbofan 

engine 

Sensor 

redundancy/ 

orthogonality 

Fisher information 
matrix 

Naive brute force 
technique 

Wang 

et al. (2012) 

Multiple 

monitoring 

points 

Principal component 
analysis 

Wind turbine 

Dimension of 
data set 

Multivariant analysis 


(i continued ) 
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Table 2 (continued) 


Area of 
research 

Authors (year) 

Research 

issues/ 

considerations 

Methodologies 

Applications/ 

validation 

Sensor 

placement/ 

installation 

Sheng 
et al. (2006) 

Data 

acquisition 

quality 

Finite element analysis 

Rolling 

bearings 

Sensor 

coverage 

Effective-independence 

method 

Zhang and 
Vachtsevanos 

(2007) 

Sensor 

reliability 

Failure modes and 
effects analysis 

Five-tank 

system 

Fault 

detectability/ 

isolation 

Fault detectability 
metric 

Cost for 
installation 

Graph-based model for 
fault propagation 

Chen 

et al. (2012) 

Sensor 

sensitivity 

Genetic algorithm 

Gearbox 
(gears and 
bearings) 

Sensor 

coverage 

Particle swarm 
optimization 

Fault propagation graph 

Pourali and 
Mosleh 

(2012) 

Location of 
sensors 

Bayesian belief 
networks 

Simulation 

Logical or 
functional 
placement 

Logical diagram 
representation of 
system 

Al-Habaibeh 
et al. (2005); 
Alkhadafe 
et al. (2012) 

Sensor stability 

Initial optimization 
procedure 

Gearbox; 
milling cutter 

Sensor 

reliability 

Spectral kurtosis 
analysis 

Fourier transformation 
and wavelet 


range, large onboard memory capacity, fast onboard data processing and high rate 
data transmission, high reliability, and low cost. Pecht (2008) reviewed the com- 
mon sensors and their sensing principles as well as the required attributes of sensor 
systems for health monitoring of electronics products. Fleming (2001, 2008) 
conducted a comprehensive survey on the most significant sensors used in 
present-day automotive applications. 

Figure 2 outlines a general procedure for the selection of suitable sensors. The 
design of the components defines their functionality and physical properties, which 
will help in the identification of the potential causes of components failures during 
their use stage. Subsequently, suitable parameters to be monitored can be deter- 
mined accordingly for the identification of these potential failures. Finally, the type, 
operating range, size, and life-span of the sensors can be determined based on the 
parameters to be monitored and the expected life-span of the components. 
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Fig. 2 A general procedure 
for sensor selection 



Component Classification 

Not all components in a product have equal life-span. Some components can last 
considerably longer than other components. For example, in an automotive engine, 
components with longer life-span, e.g., engine block, crankshaft, and connecting 
rods, which normally have significant residual values that can be retained, are 
usually reused after proper reconditioning. Critical components with shorter life- 
span, e.g., engine pistons, will always be replaced with new parts (Smith and 
Keoleian 2004). In addition, not all components in a product have equal intrinsic 
value. For example, washer and bearings have lower intrinsic values. However, 
these components isolate wear and absorb vibrations from the more expensive shaft 
or housing, which are considered as “ideal” components from a remanufacturing 
viewpoint (Kalyan-Seshu and Bras 1997). For components which have longer life- 
span and relatively high intrinsic value, it is important to track the number of times 
they have been reused and update their cumulative service history. For components 
with relatively shorter life-span and lower intrinsic value, they may need to be 
monitored continuously to determine whether they have reached a stage for 
replacement or remanufacturing. The components of a product can be classified 
accordingly based on their useful life-span and intrinsic value. This classification 
system leads to appropriate EoL options for the various classes in this system, 
which in turn facilitates the identification of suitable life-cycle parameters to be 
monitored and the sensors for monitoring these parameters. 

Parameters Identification 

The parameters to be monitored should be able to demonstrate the mechanisms of an 
anticipated failure mode reliably or indicate the effects of the failure. These param- 
eters can be classified into two categories, namely, ambient conditions and operating 
status. Ambient conditions, such as temperature, humidity, pressure, load, etc., 
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are parameters that have direct or indirect impact on the resulting condition. The 
operating status, e.g., vibration, speed, etc., indicate directly the current state-of- 
health of the product. The relationship between the parameters and the failures can 
be established through various methods, e.g., the Taguchi’s method (Al-Habaibeh 
and Gindy 2000), failure modes and effects analysis (FMEA) related methods (Santi 
et al. 2005; Vichare and Pecht 2006; Kumar et al. 2010), etc. Some examples of 
using the FMEA approach to identify the parameters for assessing the state-of-health 
of electronic products are hard drives (Vichare and Pecht 2006) and computer server 
systems (Kumar et al. 2010). Once the parameters for condition monitoring have 
been determined, a set of sensors can be defined accordingly. 

Optimal Sensor Selection 

Sensor selection for product condition monitoring is an optimization problem, 
particularly for complex systems or processes in which multiple monitoring points 
are present and a large number of sensors have to be installed (Subrahmanya 
et al. 2008; Joshi and Boyd 2009). Many research studies have been reported on 
optimal sensor selection for higher diagnostic performance with justifiable cost. 
Al-Habaibeh and Gindy (2000) proposed an automated sensor selection approach 
for milling cutter monitoring in which the number and types of sensors required can 
be optimized without compromising the performance in the identification of the 
cutter faults. Mushini and Simon (2005) compared three search and optimization 
methods, namely, exhaustive search, probabilistic search, and genetic algorithm 
(GA), to find an optimal set of sensors that can offer the best measurement set with 
minimized parameter uncertainty for aircraft gas turbine engines. Santi et al. (2005) 
developed a sensor selection approach for robust health diagnostics. This approach 
adopts FMEA to determine candidate sensor sets by identifying system critical 
faults and fault signatures, and applies GA to search for a suitable set of sensors for 
optimal heath diagnostics based on criteria such as speed of detection, probability 
of correct fault source isolation, and overall risk reduction potential. Wang 
et al. (2012) introduced the principal component analysis (PC A) method for sensor 
selection optimization addressing the presence of multiple monitoring points. The 
PCA method was utilized to identify an essential number of sensors sufficient for 
wind turbine condition assessment. 


Sensor Placement and Installation 

The placement and installation of smart sensors is an issue that needs to be 
addressed for condition monitoring. Embedding a smart sensor would result in 
modification to the current product design and may weaken its physical strength and 
structural stability. Many research studies deal with structural health monitoring 
while addressing optimal sensor placement. Few recent studies listed in Table 1 
have been reported on sensor localization optimization by addressing other sensor 
attributes that have considerable impact on the performance in fault isolation and 
condition monitoring, e.g., sensing reliability (Al-Habaibeh et al. 2005; Alkhadafe 


90 Use of Embedded Smart Sensors in Products to Facilitate Remanufacturing 3275 


et al. 2012), installation cost (Zhang and Vachtsevanos 2007), sensing sensitivity 
(or signal-to-noise ratio) and sensing coverage (Sheng et al. 2006; Chen et al. 2012), 
etc. A graph-based model can be used to depict the cause-effect relationship for 
fault propagation (Zhang and Vachtsevanos 2007). Finite element analysis can be 
used to model the changes in the mechanical and structural properties of the 
components due to smart sensor installation (Sheng et al. 2006; Chen et al. 2012). 
The optimization methods include GA (Chen et al. 2012), Bayesian belief networks 
(Pourali and Mosleh 2012), etc. 

Sensors are usually made of different materials from that of the components to 
be monitored. Currently, no study has addressed sensors embedding methods from 
the disassembly point of view. It would be desirable for sensors to have plug-and- 
use mechanism, allowing for fast and safe disassembly in the case of product EoL 
recovery or faulty sensor replacement. 

Sensor Data Representation, Fusion, and Interpretation 

With the embedded smart sensors, product usage data can be monitored, collected, 
and sent to the data server located at the remanufacturing center. Similarly, when a 
used product is returned to a remanufacturing plant, the associated product infor- 
mation should be made available to the remanufacturers. However, to make use of 
such data in facilitating product EoL decision-making, two primary concerns need 
to be addressed, namely, (2) sensor data representation and (2) data fusion and 
interpretation. 


Sensor Data Representation and Management 

An efficient information system should be available so that relevant data can be 
retrieved in real time and authorized parties can access data of interest safely and 
conveniently. Different data storage methods can be used for different types of data 
to achieve easy data retrieval and update (Kiritsis et al. 2003). 

XML tools can be used to access and manipulate the static attribute data as the 
existing industry- standard XML schema allows unambiguous access to particular 
properties of an object and can be applied across all industry sectors. Relational 
databases can be used to handle historical data (Harrison 2003; Parlikad and 
McFarlane 2007). Yang et al. (2009) developed an XML-based tree- view structure 
for product life-cycle data acquisition and provided an information engine for 
online data analysis and data retrieval. 

The unintuitive representation of product information to the remanufacturers is a 
critical issue. For example, disassembly during remanufacturing is often labor 
intensive with low level of factory automation. Therefore, to improve the efficiency 
of the disassembly process, intuitive representations of such information can be 
provided to the disassembly technicians to guide the disassembly operations. 
Augmented reality-assisted visualization tools have been explored, where virtual 
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Fig. 3 Fusion and interpretation of sensor data 


cues in the forms of texts, images, CAD drawings, video clips, etc., can be provided 
to assist the manual assembly tasks (Yuan et al. 2008; Zhang et al. 2011). This 
concept can be applied in disassembly, where the disassembly sequence can be 
organized in a disassembly tree structure. 


Data Fusion and Interpretation 

A product can have multiple sensors embedded. The sensor data are often in different 
units, requiring robust multisensor fusion for assessing the actual condition of the 
product, i.e., fault diagnosis and failure prognosis. The difference between the two 
lies in that the former involves identifying and quantifying the damage that has 
occurred, while the latter is concerned with trying to predict the damage that is yet 
to occur (Sikorska et al. 2011). The outcome of fault diagnosis and failure prognosis 
would serve as an anchor point for any decision made for product EoL recovery. 
Figure 3 illustrates a generic procedure for sensor data fusion and interpretation. 

Sensor Data Fusion 

Multisensor fusion has been an intensive research area, and many previous studies 
and reports have focused on condition monitoring in machine tools, manufacturing 
processes, etc. (Du et al. 1995a, b). It is reported that data fusion structures can be 
generalized into three types, namely, raw signal-level, feature-level, and decision- 
level fusion (Niu et al. 2010). 

• Raw signal-level fusion: Raw data from all sensors are collected and combined 
directly and a feature/signature is extracted from the data set. The data acquisition 
and fusion are performed in a centralized manner, provided that the sensors are only 
responsible for sensing and have no processing capability (Spencer et al. 2004). 
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Fusion at this level may produce good results since the data could be redundant and 
has not been processed/compressed; however, it may fail to generate relevant results 
due to the presence of noises and other imperfections in the measurements. One 
limitation of signal-level fusion is that data should be of the same or similar physical 
quantities, e.g., vibration signal cannot be fused with temperature signal. In addition, 
the data should be calibrated under the same measuring unit. This would restrict the 
applications of signal-level fusion in complex real environments which can have 
many physical quantities (Niu et al. 2010). 

• Feature-level fusion : Features from the same sensor type are first extracted, 
processed, and represented by feature vectors. These feature vectors are normal- 
ized and combined into a compound feature vector, which forms a basic repre- 
sentation of the data set. At each instance, the compound feature vector is passed 
to a pattern classification model for subsequent decision-making. Feature-level 
fusion allows individual sensor to perform simple computations, e.g., feature 
extraction, which reduces the computational burden of the central processor for 
data storage and processing. 

• Decision-level fusion: A decision would first be made based on the single-source 
data obtained from each sensor with necessary data processing techniques (e.g., 
feature extraction and pattern recognition). A final decision can be generated by 
fusing individual decision made with respect to each sensor using decision-level 
fusion techniques, e.g., Bayesian method, Dempster-Shafer theory, etc. Fusion 
at this level allows the sensors to have self-contained data processing capability, 
representing the most distributed fusion architecture, in which ubiquitous com- 
puting techniques can be well suited. 

There is no universal approach capable of solving all data fusion problems. Hence, 
the data fusion algorithms should be selected according to the requirements and 
issues encountered in the specific applications. Jardine et al. (2006) reviewed the 
signal processing techniques for waveform data in different domains, e.g., time 
domain, frequency domain, and time-frequency domain. Niu et al. (2010) listed a 
number of signatures of signals that can be used for pattern recognition with respect 
to these data domains. Khaleghi et al. (2013) conducted an analytical survey of recent 
developments in the data fusion domain. Common but challenging issues pertaining 
to data fusion, such as imperfect data due to the presence of noise and measurement 
uncertainty, spurious data from faulty sensor, conflicting data from different sensor 
types, etc., were studied. Various data fusion algorithms have been reviewed, e.g., 
probabilistic-based fusion, evidential belief reasoning, Dempster-Shafer evidence 
theory, and hybrid fusion techniques, and comparisons have been made to analyze 
their suitability in addressing the various data fusion issues. 

Fault Diagnosis and Failure Prognosis 

Diagnosis is to detect the failure that has occurred in a component (or subsystem) 
and isolate and identify the root of the failure, based on the data collected by the 
embedded sensors. Prognosis is to estimate the time at which a component will fail 
to operate at its stated specifications based on its current condition as well as the 
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future load and environmental exposure, i.e., the prediction of the remaining useful 
life (RUL) of the component. RUL is a commonly used parameter to assess the 
reliability (or reusability) of a used component. The mean lifetime of a product 
( T PM ) is governed by the critical component with the shortest life-span. Commonly 
used reliability indices are mean-time-to-failure (MTTF), mean-time-to-repair 
(MTTR), etc., which are normally provided by the product manufacturers. How- 
ever, experimental conditions used to determine these indices may not reflect the 
actual working conditions. Therefore, these indices may need to be rated to consider 
the actual working conditions. The lifetime of a product/component can be defined 
in other forms, e.g., the mileage of a car, the amount of petrol consumed for an 
automotive engine (Smith and Keoleian 2004), the number of starts and stops for an 
electric motor (Kara et al. 2008), etc. At the component level, it can be defined by 
the number of reusable cycles, e.g., the camera core for single -use cameras (Geyer 
et al. 2007). The RUL can be defined in similar forms accordingly. 

Figure 4 outlines a general procedure for RUL estimation of a component based 
on the monitored condition. If there is no fault signal, the RUL can be approximated 
based on reliability models (e.g., MTTF or MTTR) empirically. If a fault signal has 
been detected, the causes will first be isolated and identified through fault diagnosis, 
and the failure prognosis process will be triggered at the same time. 

There has been a substantial amount of research work dedicated to failure 
diagnostics based on data fusion (Engel et al. 2000; Schwabacher 2005; Roemer 
et al. 2006; Schwabacher and Goebel 2007; Pecht 2008; Saxena et al. 2008; Heng 
et al. 2009; Si et al. 2011; Sikorska et al. 2011). In general, prognostic approaches 
can be divided into three categories, namely, statistical data-driven, physics-of- 
failure (PoF) -based, and hybrid approaches. 



Fig. 4 A generic diagnosis-prognosis process for RUL estimation 
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• Statistical data-driven approaches : This set of approaches estimate the RUL of a 
component in a probabilistic manner based on the past observed data and statistical 
models without relying on any physics or engineering principles. The observed data 
includes past recorded failure data and condition monitoring data, e.g., degradation 
signals, operational, performance, environmental data, etc. The advantage of sta- 
tistical data-driven methods is that the mathematical properties of the estimated 
RUL can be analyzed. However, the effectiveness of these methods is restricted by 
the availability of the observed data, particularly for newly commissioned systems 
that lack such observed data. Si et al. (2011) reviewed the data-driven approaches 
for RUL estimation, including regression-based, Brownian motion-based, and 
Markovian-based models. Goebel et al. (2008) compared three regression tech- 
niques, namely, neural networks, relevance vector machine, and Gaussian process 
regression; they showed that the estimation outcome significantly relies on the 
quality and the availability of the observed data. The same research group has 
developed metrics for evaluating various prognostic techniques and the suitability 
in various prognostic applications (Saxena et al. 2008, 2009). 

• Phy sic s-of -failure -based approaches : Physics-of-failure (PoF)-based RUL esti- 
mation involves building the mathematical models of the system physics, the 
knowledge of the product’s life-cycle loads, and relating them with the potential 
failure modes, failure mechanisms, and failure sites (Pecht 2008). The 
PoF-based approaches require less data than that of data-driven-based 
approaches; however, their performance highly depends on the accuracy of the 
established physics models. In addition, these models normally are defect 
specific and often too stochastic and complex to model. This makes PoF-based 
RUL estimation less practical in cases when destructive intervention is required 
to identify the failure modes, mechanisms, and effects. Heng et al. (2009) 
reviewed some PoF-based prognostics approaches, e.g., crack-growth modeling, 
fatigue spall initialization and progression modeling, etc., and the requirements 
to apply these models. 

• Hybrid approaches : A hybrid approach fuses the PoF-based prognostics models 
with the condition monitoring data to produce a more reliable and robust RUL 
estimation. It aims to utilize the strength of both types of approaches and draw a 
more comprehensive picture by integrating the PoF model, empirical failure data, 
and past and current operational and environmental data being monitored. Goebel 
et al. (2006) adopted Dempster-Shafer regression techniques to fuse the indepen- 
dent RUL estimates to achieve a more accurate and robust RUL prediction. 

Framework Based on Sensor Data for Product Remanufacturing 
Decision-Making 

Existing research studies have started to investigate the use of product information 
from embedded smart sensors in product EoL recovery decision-making. Table 3 
summarized some recent developments in this direction. For instance, RFID 
technologies can be implemented to address disassembly-related issues, 
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Table 3 Recent development in product EoL decision-making with aid of embedded sensors 


Theme 

References 

Description/issues addressed 

Application 

EoL recovery 
decision 

Klausner 

et al. (1998); Simon 
et al. (2001) 

Usage pattern monitoring 

Electric motors 

Component reuse and 
cost-benefit analysis 

Parlikad and 
McFarlane (2007) 

Product information 
categorization 

Photocopier 

Modeling the impact of 
product info on EoL decisions 

Parlikad and 
McFarlane (2010) 

EoL vehicles (ELV) recovery 

Automobile 

clutch 

Sensor network for product 
life-cycle data sharing 

Probabilistic model for 
cost-benefit analysis 

Ondemir and Gupta 

(2013) 

Recovery decisions without 
disassembly/inspection 

Air-conditioner 

EoL decision subject to 
disassembly-to-order and 
repair-to-order 

Disassembly 

Ilgin and Gupta 
(2010); Ilgin 
et al. (2011) 

Cost-benefit analysis 

Washing 

machine 

EoL decision subject to 
disassembly-to-order 

Maintenance 

Lee et al. (2006) 

Smart prognostics algorithms 

Roller bearings, 

industrial 

network 

“Peer-to-peer” 

communication 

Condition-based maintenance 

Remanufacturing 

jobshop 

management 

Ferrer et al. (201 1) 

Real-time localization of 
returned product/component 

Electronics 

maintenance 

facility 

Discrete-event simulation 


e.g., disassembly-to-order, in remanufacturing jobshop operations (Ferrer 
et al. 2011; Ondemir and Gupta 2013); EDL enables the decision-makers to access 
the usage patterns of the returned products or components (Klausner et al. 1998). 
Figure 5 presents a conceptual framework on the use of product information to 
assist remanufacturers during remanufacturing operations, i.e., products collection, 
disassembly, cleaning, sorting, grading, reconditioning, machining, component 
replacement, and reassembly. 

If the time -to -failure data of a component is available, the mean lifetime of a 
component ( T CM ) can be estimated by using the Weibull analysis (Kara 2010) given 
in Eq. 1, where rj is defined as the life at which 63.2 % of units will fail and fi 
identifies the mode of failure, i.e., fi < 1 means infant mortality, /? = 1 indicates 
random failure, and > 1 describes wear-out failure, can be obtained from the 
Weibull distribution function given by Eq. 2, where F(t) represents the fraction of 
components failing and t is the time-to-failure (Kara 2010). 
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Disassembly 
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of parts 
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solution 
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parts 
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[ 

Layout, 
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Remanufacturing 
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Fig. 5 Conceptual framework on the use of sensor data to facilitate product remanufacturing 
decision-making 



( 1 ) 

( 2 ) 


The actual usage life ( T 0 ) of a component of a returned product can be estimated 
based on the data obtained during the use phase of the product. Some frequently 
used estimation techniques (Kara et al. 2005; Kara 2010) are the regression 
analysis, Kriging techniques, artificial neural networks, etc. Once the actual usage 
life has been determined, the remaining useful life of a component can be obtained 
using Eq. 3 (Kara 2010). It can also be obtained based on the number of 
remanufacturable (reusable) cycles Axeman? given in Eq. 4: 



N 


reman 


Floor 


T 


RUL 


T 


PM 



Assuming that the historical data of a large number of returned products is 
available, (1) the mean lifetime of a critical component (J PM ) and (2) the mean 
lifetime of the core (T C m) can be derived based on the analysis in Eqs. 1 and 2. 
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Based on the life-cycle data of a returned product and its components, the following 
information can be determined and used to assist in decisions on recovery, inspec- 
tion, assembly, disassembly, etc. They are (1) the number of times a component has 
been remanufactured (A reman ) or directly reused (A reuse ), (2) the maximum number 
of times a component can be remanufactured (A mreman ) or directly reused (A mreuS e), 
and (3) the remaining useful life of a component (T RUL ). 


Collection and Presorting 


Both OEMs and independent remanufacturers have their own criteria for accepting 
cores and the corresponding core refund guidelines to facilitate used products 
acquisition. OEMs normally exhibit a more generous acceptance threshold, while 
independent remanufacturers may have more stringent core acceptance criteria, e. 
g., they may not accept a used product if the product condition is beyond 
remanufacturing. The end users will receive alerts to return their products when 
these products are reaching their EoL stage whenever the lifetime data of the 
products is uploaded to the server. The remanufacturers will be able to determine 
the quality of the incoming returns based on the sensor data. A general rule for core 
acceptance is given by Eq. 5, where D co \ = 1 means the acceptance of the core, 
while D co{ = 0 refers to rejection. 


f 1, if (Erul > Tpm) and (A mreman ^ A reman ) 

| 0, otherwise 



Once the cores have been accepted, the product type, the technology used in the 
product, and the production date will be identified to sort these cores for storage and 
other subsequent operations. 


Disassembly 


Actual disassembly of a return is not necessarily an exact reversal of its assembly 
sequence due to various factors, e.g., irreversible welded joints may have been 
used, degradation of components and damages to components during use, missing 
components, product upgrade during maintenance and remanufacturing tasks, etc. 
Therefore, the disassembly sequence should consider the changes that have 
occurred to the product during its entire lifetime. The index for ease of disassembly 
of a product is determined by the joining methods used, e.g., mechanical fastening, 
welding, gluing, riveting, etc., and the physical damages to the product, e.g., 
corrosion, deformation, etc. In addition, the choice of product recovery plays an 
important role in disassembly. Some components, e.g., electric motors, can be 
reused directly without the need for further disassembly (Kaehernick and Kara 
2008). The cores, e.g., the crankshaft and engine block, require proper 
reconditioning so that they can be reused in a remanufactured product, such that 
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they have to be disassembled completely. The disassembly level of a component 
based on different recovery options can be determined using Eq. 6, in which D dis = 
0 means no further disassembly is needed, D dis = 1 refers to complete disassembly, 
and D dis = 2 refers to the case where the component should be disassembled 
properly for disposal or material recycling. 

( 0, if(r RUL > Tpm) and (N mreuse ^ N reuse ) 

T^dis = \ 1, else if(r RUL > Tpm) and (N mreman ^ N reman) (6) 

I 2, otherwise 


Cleaning 

Cleaning is one of the most environmentally unfriendly processes in 
remanufacturing. The contamination level of a component and the cleanliness to 
be achieved affect the cleaning technology and cleaning equipment that can be used 
(Schweinstig 2010). It is important to know the compatibility between the materials 
used in the components and the cleaning agents. Hence, product information 
needed for cleaning include (a) the bill of material of a component, (b) design 
features of the components, (c) required cleanliness level, and (d) the company 
policies on the disposal of wastes generated during cleaning. The static information 
can be retrieved as they are available during the design stage. The contamination 
level of a component can be graded based on technician’s expertise through which 
suitable cleaning methods can be adopted. 


Inspection and Grading 

Inspection is required to measure and detect the current condition of a component. 
For components with significant physical defects that cannot be recovered, they can 
be sorted for material recycling. Other information related to material fatigue, 
functional degradation, etc., which cannot be detected through simple visual 
inspection, can be monitored using embedded smarted sensors and assessed to 
determine the RUL of the components. Table 4 shows a simple method for 
components grading based on the physical defects, the RUL of the components, 
and the number of times they have been reused or remanufactured. In general, the 
components can be graded into three categories (Steinhilper 1998), namely, 


Table 4 Grading of components based on conditions 


Conditions of components 

Grading decision 

Significant identifiable physical defects 

Not remanufacturable 

No obvious physical defects 

Trul >— T PM 

T > T 

1 mruse 1 ruse 

Directly reusable 

T > T 

± mreman ^ ± reman 

Reusable after repair 

Trul < T pm 

Not remanufacturable 
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(a) directly reusable, (b) reusable after proper repair or reconditioning, and 
(c) cannot be repaired or reconditioned. 

Based on these classifications, companies may apply different sorting criteria. 
The maintenance history can help identify components or parts that require frequent 
maintenance. In addition, the cores can be further classified based on the number of 
times they have been directly reused or remanufactured. However, there exists a 
trade-off between the accuracy of the grading and the economic profitability of 
remanufacturing (Zikopoulos and Tagaras 2008). 


Repair and Reconditioning 

To restore a used part to a like-new condition, the specifications of the 
corresponding new part, e.g., geometric features, material, surface property, reli- 
ability, etc., should be known a priori by the remanufacturers. The current condition 
and the failure mode of the used parts affect the reconditioning strategies. For 
example, a damaged or worn part can be restored by removing the damaged area or 
adding new material to the worn area, depending on the severity of the damage or 
wear (Bras 2008). The remanufacturing history as well as the performance and 
reliability of the previous remanufactured versions of a component will provide 
feedback on the effectiveness of the reconditioning methods. 


Reassembly and Reliability Test 

The reassembly sequence may be the same as the original new product if there is no 
significant upgrade during remanufacturing. The original standards and reliability 
of the product should be known in advance. OEMs can have access to these data, 
and the independent remanufacturers will be able to access these data if they are 
within the closed-loop supply chain, where such information may be readily 
available. Otherwise, detailed inspection and significant expertise will be needed 
to extract such information from a new product. 


Design Feedback to Manufacturers 

Through collecting used products, manufacturers can obtain feedback on the 
reliability and durability of their products (Bras and McIntosh 1999). The mainte- 
nance record and remanufacturing history can help identify components and parts 
that require frequent maintenance under certain working and environmental condi- 
tions. First, these information can be accumulated and used by the remanufacturers 
during subsequent inspecting and sorting of the used components and parts. Second, 
the information can assist the remanufacturers identify critical components that 
have the most significant impact on the useful life of a product. Through analyzing 
the failure causes, the failure rates, and their relations with the working conditions, 
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remanufacturers can provide feedback to the design teams on the weaknesses of the 
current design. Kara et al. (2008) proposed to assess the useful life of products 
based on the critical design parameters, which can be identified at the early design 
stage, as a means for the products to be redesigned. However, it should be noted that 
components should not be designed only for remanufacturing, and other require- 
ments, e.g., product functionality and initial manufacturability, etc., should be 
considered as well (Bras 2008). 


Challenges and Issues on the Use of Embedded Smart Sensors 

The advances in sensor technologies as well as data fusion and interpretation 
algorithms pave the way for the use of embedded smart sensors in products to 
facilitate effective EoL decision-making. Nevertheless, there are challenges and 
issues which may prevent the use of these sensors, as summarized in Fig. 6. 

As discussed in section “Embedding Smart Sensors in Products,” smart sensors 
should be selected carefully. The data transceiver should comply with the commu- 
nication protocol depending on whether the data needs to be transmitted via wired 
or wireless communication networks. The selection of the memory chip needs to 
consider the data volume to be stored between two consecutive data uploading 
intervals. The cost of embedding sensors in products should be economically 
justifiable (Klausner et al. 1998; Ilgin and Gupta 2010; Parlikad and McFarlane 
2010) as it is normally borne by the manufacturers since there is no direct benefit 
offered to the end users. The challenge lies in the modular design of these sensors 


Selection of sensors 

• Type, operating range, size 

• Sensor components 

• Communication protocol 
Installation of sensors 
Justification of investment 


Production 

Design 

Manufacturing 

Z 


Data interpretation 

• Remanufacturing history 

• Remaining lifetime 

Data representation 

• Representation method 

• Possible change in 
disassembly 

Data security and privacy 


Data transmission 

• Communication network 

• Automatic/manual 

Cooperation among all 
parties of interest 

• Data coordination 
Data security and privacy 



Returns 



Remanufacturing 

Disassembly 

Cleaning, inspection, machining 
Re-assembly 


Disposal options for 
embedded sensors 

Disassembly depth 


Fig. 6 Challenges in using embedded smart sensors in product life-span monitoring 
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such that they can be disassembled and replaced easily. A faulty sensor should not 
affect the function and reliability of the products. 

In product distribution and use stages, data from the embedded smart sensors 
have to be harnessed by the manufacturing or remanufacturing sites. For stationary 
equipment or machines, e.g., CNC machines, power generators, etc., wired data 
transmission can be adopted. For other products, e.g., automotive engines, data can 
be transmitted to the servers at the manufacturing or remanufacturing sites when- 
ever the products are brought back to these sites with data transmission facilities. 
There are two data transmission modes, namely, manual mode and automatic mode. 
The manual mode will need the end users to set up the connection between the 
sensors and the server. In this mode, proper authorization and access control need to 
be established to protect the data from unauthorized viewing, modification, etc. In 
addition, ensuring data validity incurs cost (Rostad et al. 2005). In the automatic 
mode, the transmission can be initiated whenever a sensor is detected and recog- 
nized by the transmitter. 

Sensors may need to be disassembled and disposed of, e.g., a malfunctioned 
embedded sensor has to be replaced, or a component has reached its end of life. In 
the second situation, the disposal or recycling of the component may require the 
embedded sensor to be separated from the component since sensors are often made 
of different materials than that of the component. This would require specific 
disassembly steps. In addition, the disposal of the sensors should not violate any 
environmental regulations, which may incur additional cost for the remanufac- 
turers/recyclers. 

Summary 

The remanufacturing process is often labor intensive and relies heavily on the 
expertise of the employees due to wide-ranging variations in the return conditions 
of the cores. The use of embedded sensors has presented the potential to assist the 
remanufacturers in making more reliable decisions at each stage of the 
remanufacturing process. However, product condition monitoring using embedded 
sensors, particularly sensor data fusion and interpretation, remains challenging in 
the remanufacturing industry. This chapter has reviewed the current practices 
toward the development of embedded smart sensors in products in two primary 
aspects, namely, embedding smart sensors in products and representing and 
interpreting sensor data. Sensor selection and sensor placement/installation are 
the two most relevant issues requiring careful considerations to meet the target 
performance for condition monitoring. Multisensor data fusion and interpretation 
for efficient fault diagnosis and failure prognosis are reviewed briefly. A conceptual 
framework has been developed for the use of the sensor data in facilitating 
remanufacturing operations and decision-making at each remanufacturing stage. 

With the advancement and development in smart sensor technologies, individual 
sensors can have powerful computation capacity and the ability to communicate 
with other sensors or the server via wireless networks. With these features, future 
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smart sensor systems can adopt ubiquitous computing in product condition moni- 
toring and management. By allowing all decision-makers to access the product life- 
cycle data easily and safely, the application of embedded smart sensors to facilitate 
product EoL recovery decision-making can be further investigated and understood. 
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Abstract 

A special feature of remanufacturing business is the existence of large 
proportion of replacement customers. This is due to the fact that many 
durable product markets are highly saturated and customers who return their 
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end-of-life products need to do replacement purchase. At the same time, 
pricing strategies have been widely adopted by remanufacturing companies 
to balance supply and demand. In this study, the joint decision of acquisition, 
trade-in, and selling price is considered. The objective is to maximize the 
expected profit. It is shown that a remanufacturing firm should offer higher 
rebates to replacement customers when this customer segment has high return 
quality and high price sensitivity. The optimal pricing policies under uncertain 
return yield rate are studied. The profitability of different pricing schemes is 
also investigated. 


Introduction 


In recent years, there has been an increasing concern on closed-loop supply 
chains and green supply chains. Due to both economic incentives and legislation 
regulation, more and more companies are involved in product recovery business. 
Remanufacturing is one of the various product recovery options. Successful practices 
of remanufacturing can be found in industries like automotive, construction, mining, 
aerospace, etc. A special feature of remanufacturing business is the correlation 
between supply and demand. This phenomenon is due to the large proportion of 
replacement customers. As reported by Lund and Hauser (2010), many 
remanufactured products are used for replacement. A possible explanation would 
be that customer wants to avoid the switching cost of changing to a different product. 

This study is motivated by these special characteristics of remanufacturing 
practices. A remanufacturing system is considered with the existence of replace- 
ment customer segment and uncertain return yield. The remanufacturing company 
acquires used products from previous customers through acquisition programs. The 
supply of return flow is price dependent. Demand comes from both replacement 
customer and first-time buyer which is also price dependent. Replacement customer 
will return their old products and get trade-in rebates for new purchases. The 
demand can be satisfied by either remanufacturing used products or manufacturing 
new ones. 

This model represents the remanufacturing practice of many durable products. 
For highly saturated markets, a significant portion of purchase could be replace- 
ment. A practical example can be found in Caterpillar, which is the world’s largest 
manufacturer of construction and mining equipment and diesel and natural gas 
engines. Customers who return their end-of-life products will get a cashback from 
Caterpillar. The company also offers trade-in rebates to those replacement cus- 
tomers. In this article, the problem with deterministic and random yield rate is 
studied. The effect of different pricing schemes is also investigated. 

The rest of the study is organized as follows. The relevant literature is reviewed 
in section “Related Literature.” In section “Model,” the model is described in detail 
and the optimal pricing policy is presented. To get managerial insights, numerical 
study is provided in section “Numerical Study.” Finally, conclusions and future 
research directions are discussed in section “Conclusions.” 
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Related Literature 

This study is mainly related to the three streams of operations research: consumers’ 
replacement decisions, return acquisition in remanufacturing, and systems with 
random yield. 

Some studies in remanufacturing assume supply and demand are independent. 
However, a notable feature of remanufacturing business is the correlation between 
returns and sales. For durable products like engines or transmissions, many cus- 
tomers need to do replacement after the products they are using reach the end-of- 
service life. Consumers’ replacement or repurchase behavior has been widely 
discussed in marketing research. Customers’ replacement decisions not only 
depend on their own attitudes and perceptions (Bayus 1991) but also are affected 
by companies’ advertisements and product developments (Winer 1997). It is 
suggested that marketers can mitigate consumers’ loss aversion by accepting the 
old product as a trade-in (Novemsky and Kahneman 2005). Unlike these studies 
which focus on descriptive and empirical analysis of consumer’s replacement 
behavior, in this work, it is assumed that companies can use trade-in rebates as a 
pricing tool to differentiate replacement customers and first-time buyers. Optimal 
pricing strategies are then discussed with different yield conditions. 

Due to the increasing concern on closed-loop supply chain, there is an extensive 
literature on remanufacturing, reverse logistics, and other related problems. 
Detailed review of quantitative models and business aspects can both be found 
Fleischmann et al. (1997) and Guide and Van Wassenhove (2003). More recent 
reviews are also available (Souza 2008; Guide and Van Wassenhove 2009). 

One of the important issues in closed-loop supply chain is the product acquisi- 
tion management, which has been widely discussed in both practice and academia. 
To stimulate product return, firms can either facilitate the reverse channel or 
provide monetary incentives to existing customers. In an early work of Guide and 
Jayaraman, a framework for product acquisition management is investigated 
(Guide and Jayaraman 2000). Game theory model is used to study the efficiency 
of different reverse channels in a supply chain setting (Savaskan et al. 2004). 

Guide et al. consider a remanufacturing planning problem in which returns can 
have different quality levels (Guide et al. 2003). They assume that return supply 
from each quality class and product demand are both price dependent. A single- 
period framework is developed to determine the optimal pricing policy. Recently, 
the joint acquisition, pricing, and inventory management problem is studied in a 
multiperiod setting (Zhou and Yu 2011). But both papers ignore the fact that higher 
acquisition price may lead to higher demand due to the existence of replacement 
customers. Despite the extensive discussion of product acquisition management, 
few studies investigate the effect of replacement purchase on remanufacturing 
business. Some researchers consider infinite -horizon model in which previous 
customers can make repeated purchase in future periods (Debo et al. 2006). Others 
study the joint pricing problem of new and remanufactured products under the 
existence of green segment customers (Atasu et al. 2008). They assume return from 
previous sales can affect future demand. But they take return quantity as a fixed 
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fraction of previous sales and do not consider the acquisition decisions. In a closely 
related work to this study, Ray et al. assume firms can influence customers’ return 
and repurchase decisions by offering different rebates and prices (Ray et al. 2005). 
However, instead of considering return products as supply for future production, 
they model return revenue as a deterministic function of products’ remaining 
lifespan. 

This work is also related to the research stream on systems with random yields. 
Yano and Lee provide a comprehensive review of this problem (Yano and Lee 1995). 
More recent work includes Hsu and Bassok (1999), Bollapragada and Morton (1999), 
Li and Zheng (2006), Inderfurth and Transchel (2007), and Tang et al. (2012). In 
remanufacturing planning, there are several works considering the effect of uncertain 
yield. Inventory system with deterministic demand and random yield is firstly 
analyzed (Ferrer 2003). Zikopoulos and Tagaras study a remanufacturing system 
where return supply comes from two collection sites, both with uncertain yield rate 
(Zikopoulos and Tagaras 2007). It is shown that in some situations, it is optimal to 
collect from only one site. Bakal and Akcali develop a single-period model to 
determine the optimal acquisition and selling price (Bakal and Akcali 2006). 
Mukhopadhyay and Ma study the joint procurement and production problem of a 
hybrid system, and both demand and return yield rate are random (Mukhopadhyay 
and Ma 2009). Zhou et al. adopt a different approach where return flows can 
have different quality levels, but the remanufacturing process is perfectly reliable 
(Zhou et al. 2011). 

This work differs from the existing studies in that replacement customers are 
considered as a different customer segment. Unlike those models which consider 
repeated purchase as an uncontrollable process, in this study, it is assumed that 
replacement demand can be actively controlled by the trade-in rebates. 


Model 

Assumption 

In this analysis, a single-period remanufacturing business model is considered. A 
remanufacturing company acquires end-of-use products from existing users and 
sells remanufactured products to both new and replacement customers. It is 
assumed the market is monopolistic and the company has pricing power. Further- 
more, to make the price discrimination policy possible, it is required to assume 
there is no efficient secondary market. For new customers who are first-time buyers, 
their demand is modeled as a linear function of selling price p , oo( p) = a— bp, where 
a,b> 0. Replacement customers are current users who need to replace their end-of- 
life products. Their repurchasing decisions also depend on prices. Since end-of-life 
product can be used for remanufacturing, companies usually offer trade-in rebates 
for those replacement purchases. Therefore, the demand of replacement customers 
is considered as a linear function of repurchasing price/, 6(f) = 8—yf where <5, 
y > 0. The difference between p and / is the trade-in rebates offered to the 
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replacement customers. There is also a return flow from end-of-use products from 
existing users. It is assumed that acquisition return depends on acquisition price 
r and can be modeled as rj(r) = a + /?r, where a, /? > 0. For simplicity, rj , co, and 6 are 
used to represent the corresponding functions. Figure 1 illustrates the material flow 
of such a hybrid system. 

After return products are acquired (through both trade-in and acquisition), they 
are disassembled to check whether they can be remanufactured; the cost is denoted 
as inspection d. Since acquisition return and replacement return are from different 
customer segments, the yield rate of return products can be either identical or 
different. The case when yield conditions are identical is firstly considered; and 
the aggregate yield rate is denoted by p , which is a random variable observed only 
after inspection. In this case, the remanufacturing quantity is min{ co + 0, p(0 + rj ) }, 
with unit remanufacturing cost c r . Worn-out returns and excess reusable returns are 
disposed with zero disposition cost. When returns are insufficient to satisfy demand, 
the company needs to manufacture new products at unit cost c, where c > c r . Later 
the case when acquisition return and replacement return are of different yield 
conditions is also discussed. Table 1 summarizes the notations which will be used. 


Identical Yield Rate 

Given the model described above, in this section, the pricing problem when the 
yield rate is identical between acquisition return and replacement return is formu- 
lated. Firstly the case with deterministic yield is considered, which means that the 
percentage of remanufacturable cores is fixed and known. The assumption is then 
relaxed to incorporate random yield condition. The optimal decisions are charac- 
terized for both cases. 
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Table 1 Notations 


Notations 

Explanations 

r 

Acquisition price 

f 

Selling price for replacement customer 

P 

Selling price for new customer 

rj(r) 

Supply of returns, rj(r) = a + pr 

>(P) 

Demand of new customer, o)( p ) = a— bp 

9(f) 

Demand of replacement customer, 0( f) = 5—yf 

d 

Unit inspection (preprocessing) cost of returns 

c r 

Unit remanufacturing cost 

c 

Unit manufacturing cost 

p 

Aggregate yield rate of return products 

G(-), g(-) 

CDF and PDF of random yield rate p 


Deterministic Yield Rate 

It is assumed that firm always recognizes that there exists replacement customer 
segment. When the firm decides not to offer trade-in programs, both replacement 
and new customers buy products at price p. Meanwhile, replacement and acquisi- 
tion customers will sell their old products to the firm at price r. In this case, demand 
function of replacement customers can be characterized as 0(p, r) = 8 — y(p — r). 
The company decides p and r simultaneously to maximize its profit. Such a pricing 
strategy is named as uniform pricing. This pricing strategy represents the case when 
product sales and return collection lack coordination, for example, the reverse 
channel is outsourced to a third-party collector. It is not the main focus of this 
study but serves as a benchmark for the price discrimination strategy. For more 
details of such a uniform pricing, readers can refer to Ray et al. (2005) and 
Savaskan et al. (2004). The pricing problem can be formulated as follows: 


Max Yl v (r,p) = co(p)p + 0(p,r)(p - r - d) -rj(r)(r + d) 

-c(w(p)+0(p,r)-p(0(p,r)+ri(r))) 
subject to p(t](r) + 9(p,r)) < 0(p,r) + a>(p) 


- c,p(rj(r) +9{p,r j) 


The optimal pricing decisions have two possible forms: 


{ (fuo,Puo) when (a - be - (c + d)y + S) + (2 y(c - c r ) + 

d(p + y)-8- a)p - (c - c r ) (/? + y)p 2 > 0 

(rui,Pui) otherwise. 

where (r v 0 , puo ) solves the first-order condition and ( r m , p V i) is the optimal 
solution when constraint is binding. 

When the firm decides to offer trade-in to replacement customers, it charges p to 
new customers and/to replacement customers and pay r for each acquisition return. 
Since the company can choose whether to manufacture or not, two different 
scenarios are obtained. First, if the company chooses to manufacture, 
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remanufacturable return is then less than total demand, oo(p) + 0(f) > p(rj(r) + 
0(f)), and part of the demand will be satisfied by manufacturing. The profit is the 
sales revenue minus acquisition, inspection, remanufacturing, and manufacturing 
cost. Second, when demand is only filled by remanufacturing, i.e., co(p) + 0(f) = 
p(rj(r) + 0(f)), then there is no manufacturing cost. Additionally, the company 
should assure p—f> r to make the trade-in price attractive to replacement 
customers. Consequently, the pricing problem is formulated as follows: 

Max]l = co(p)p + 0(f) (f - d) - rj(r) (r + d) - c r p(rj(r ) + 0(f) ) 

-c(w{p) + 0(f) - p(0(f) + t](r))) 
subject to p(p(r) + 0(f)) < 0(f) + oa(p) 

r+f <p 

The first constraint makes sure that manufacturing cost occurs only when 
reusable return is insufficient. The second constraint means that trade-in rebates 
should be greater than or equal to the acquisition price. Otherwise replacement 
customers would sell their end-of-life products at price r and purchase new products 
at price p. When the second constraint is binding, the above problem becomes 
equivalent to the uniform pricing case. We first solve the relaxation problem 
without considering the second constraint. After that, we identify the condition 
for which the pricing policy violates this constraint. 

Proposition 1 Without considering the second constraint, the optimal pricing 
policy under deterministic return yield rate is 

{ ( r o,/o ,Po) when (a -be - (c + d)y + 5) + (2 y (c - c r ) + 

d(J5 + y) - 8- a)p - (c - c r ) (f + y)p 2 > 0 

CufuPi) otherwise. 

where (. r 0 ,f 0 ,p 0 ) = ^ \ (c - c r )p, c rg* +s -\(c- c r )p, solves the 

first-order condition and (r\, f\, p x ) is the solution when the first constraint is 
binding. 

Proposition 1 shows that either the optimal solution satisfies first-order condition 
or the first constraint is binding. 

Corollary 1 Uniform pricing policy should be chosen when ^ | | > 0. 

Proof By checking the optimal solution in Proposition 1 , we can find the condition 
when the second constraint is binding. 

According to Corollary 1, price discrimination policy is implementable only 
when ^ — f — | < 0 . Since y, b, and f represent the price sensitivity of each 

customer segments, the result suggests that a higher rebate to replacement 
customers is preferable when this segment has higher price sensitivity, while the 
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sensitivities of new customers and acquisition customers are low. An interesting 
question is that given ^ — f — | < 0, what is the profit improvement of price 

discrimination? Define value of price discrimination as n( r *> P *) — 
Ylu( r u> Pu )• Corollary 2 gives the result under deterministic yield rate. 

Corollary 2 Given j — f — | < 0, the value of price discrimination is 

(£>ay + cc/fy — b/35) 2 

4bfy{fy + b{fi + y)) 

Corollary 2 can be easily proved by substituting (r*,/*, /?*) and (r^, into the 
corresponding profit functions. This corollary shows that the value of price dis- 
crimination is independent of unit manufacturing cost c, unit remanufacturing cost 
cy, and yield rate p. We can verify that for both pricing strategies the demand and 
return volume are the same. Since unit remanufacturing cost, unit manufacturing 
cost, and yield rate only affect the production cost, the production cost will keep 
unchanged. This explains why the profit difference is independent in c, cy, and p. 
The implication is that benefits of price discrimination come from better targeting at 
different customer segments, instead of the cost savings from production. 

Random Yield Rate 

When yield rate is random, depending on the pricing decisions and the realization 
of yield rate, the firm’s profit has two expressions: 



( cop + Of — rjr — d(6 + rj) — cy (go + 6 ) , whenco + 6 < p 0 (6 + TJ ) 

\ cop + Of - pr - d(6 + p) - c(co + 0 ) + p 0 (c - cy) ( 6 + rj ) , otherwise 

where p 0 denotes the realization of yield rate. 

The expected profit function under uncertain yield rate will be 

£[n (r>f>p)\ = cop + 0(f - d ) - t 7 ( 7 * + d)- c r E [min{<w + 0,p(rj + 6 ) }] 

-cE[{(w + 0) -p{ii + 0)) + ] 

= m(p - cy) + 0(f — c r - d) — t](r + d) — (c — cy) E[co + 0 - p(0 + t])) + ] 

The random yield rate distributes on [A, B] (0 < A < B < 1), with CDF G( ), PDF 
g(-), and mean value p. Depending on the pricing decisions, the relation between 
return and sales has two different cases: 


Case 1: (6 + rj)A <6 + co<(6 + rj)B 




cop + 0{f — d) 






- t](r + d) - c, (oi + 6) 

(0 + co — p(0 + t]))g(p) dp 
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Case 2: ( 6 + rj)B <6 + 00 

E [Ih( r ’f’P)} = m P + &(f ~ d) — t](r + d) + fi{c — c r )(9 + tj) — c(co + 9) 


Note that the case (0 + rj)A > 0 + rj is omitted. In practice, such situation is 
obviously suboptimal. When reusable return is greater than the total demand, the 
company can always reduce the acquisition price to increase the profit. 

For Case 2, reusable return is less than the total demand for any yield realization. 
Hence, manufacturing is always needed. The optimization problem is similar to the 
situation of deterministic yield rate, and the optimal decision only depends on the 
mean value of yield rate. 

For Case 1, the expected profit function is similar to that of a classical 
newsvendor problem with price-dependent demand. However, there is a major 
difference between this model and newsvendor problem. Classic newsvendor 
model usually assume random demand and perfectly reliable supply. While in 
this remanufacturing problem, we assume a deterministic demand and uncertain 
yield rate. Because of this difference, the profit function in Case 1 shows a different 
property compared to that of a newsvendor model. 

Lemma 1 E [Y[( r >f P)] is differentiable. 

Proposition 2 The expected profit function is jointly concave in r, /“ and p. 

Lemma 1 and Proposition 2 shows that there exists an optimal pricing decision 
and the optimal solution can be found determined efficiently by gradient methods. 

Corollary 3 Given two different yield conditions pi and p 2 which are distributed 
on [A, B] with p x < st p 2 , the optimal expected profit E [Q* {p \ )] < E [Q* (pf )\ • 

For the uniform pricing problem, it is equivalent to add a linear constrain 
r + / = p to the above problem. Since the concavity of the profit function has 
been proved, the optimal (r u p u ) can be obtained similarly. By definition, it is 
indisputable that E [n( r *, /*, /?*)] > E i no-,, p u )]. However, because of the 
complexity of the problem, a closed-form solution is not obtainable. An interesting 
question is that whether Corollary 2 still holds for the random yield rate problem; 
numerical results are presented in section “Numerical Study.” 


Different Yield Rate of Returns 

Zikopoulos and Tagaras investigate a reverse supply chain with two collection sites 
and different return qualities (Zikopoulos and Tagaras 2007). They derive the 
condition under which is optimal to use only one site. In this study, acquisition 
returns come from end-of-use products, which means costumers no longer need 
such product. On the other hand, replacement returns occur only when the products 
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Table 2 Integration area 


Range of 
to and rj 

Range of 6 

Integration limits 

go > r/B i 

6> 0 

Ai < pi < B\, A 2 < p 2 < B 2 

6(1 — A 2 ) < rjBi — go 

Ai < pi < g(1 ~y )+<B for A 2 <p 2 <B 2 

rjA\ < go 

< nB\ 

0(1 — B 2 ) > rjB\ — go 

Ai < pi <Bi for A 2 < p 2 < B 2 

9( 1 — B 2 ) < r/B i — co < 

6( 1 - A 2 ) 

A , 0(1 ~ p 2 ) + oo rjB\ — go 

A\<Pi< for 1 < P 2 < 

11 0 

B 2 , and A i < p x <B\ for A 2 < p 2 < 

1 nB\ - go 

0 

0(1 — B 2 ) > rjBi — go 

Ai Ei P\ E B\ for A 2 < p 2 < B 2 

0(1 — A 2 ) < r]Ai — go 

Null 

go < r]A\ 

0(1 — B 2 ) < 11 A i — go < 
0(1 — A 2 ) < rjBi — go 

A, < Pl < for A 2 <p 2 < 1 - 2^ 

YjAi — GO < 0(1 — B 2 ) < 
0(1 — A 2 ) < rjB i — go 

Ai <Pi< ( \ )+ for A 2 <p 2 <B 2 

0(1 — B 2 ) < ii A | — go < 
rjBi — go < 0(1 — A 2 ) 

0(1- p -2) A go rjB\ — go 

Ai <Pi < fori < p 2 < 

rj 0 

B 2 , and A i < p x <B 2 for A 2 < p 2 < 

1 r jB x — GO 

0 

o)Ai — go < 0(1 — B 2 ) < 
rjBi — go < 0(1 — A 2 ) 


fail or excess their useful life. Due to the difference in return causation, a reasonable 
conjecture would be that acquisition return and replacement return may have 
different quality conditions. In this subsection, the assumption of identical yield 
rate is relaxed to investigate how it affects firm’s profitability. 

Let pi and p 2 denote the yield rate of acquisition return and replacement return, 
respectively. When remanufacturing company adopts price discrimination policy, 
it is possible that acquisition returns have higher yield rate but receive lower 
rebates. It is assumed that this will not change the supply function of acquisition 
returns. The reason is twofold. Firstly, yield condition only comes after inspec- 
tion; customers cannot get this information in advance. Secondly, the higher 
rebates to the replacement customers are used to encourage repurchase, and 
pure return customers should not enjoy this benefit. The expected profit function 
then becomes 


F[I1 ( r >f>p)\ = a>P + 0(f - d) - rj(r + d) - c r E[mm{co + 0,p x rj+ p 2 0}\ 


cE (go + 0 — p x rj — p 2 0) 


+ 


O) (p - c r ) + 0(f — c r - d) - rj(r + d) - (c — c r )E [(go + 0 - p { 6 - p 2 rj) 


+ 


Assume p t is distributed on [A z , Bj ], (0 < A z < B t < 1), with CDF G z (•), PDF g z (-), 
and mean value p t (i = 1,2). Depending on the pricing decisions, the integration 
area of E[(oo + 6 — p\6 — p 2 rj) + ] is shown in Table 2. 
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Due to the complexity of the expected profit function, it is not straightforward to 
show the concavity of the profit function. Therefore, numerical results are reported 
using MATLAB with interior-point optimization technique. 


Numerical Study 

Unlike most manufacturing systems, in remanufacturing, the quality level of supply 
flow is highly variable and uncertain. In this section, computational experiments are 
conducted based on the model described above. The purpose of the numerical study 
is twofold. Firstly, since it is difficult to attain closed-form solution for random 
yield rate problem, numerical results are used to investigate the advantages and 
limitations of different pricing policies. Secondly, sensitivity analyses are carried to 
find how the optimal decisions change according to different parameter settings. 
This would help managers make decisions when facing different market conditions. 

Uniform distribution is used to describe the uncertainty of yield rate condition. 
However, it is not to claim that uniform distribution is more suitable to model the 
usability of return products. In literature, several distributions have been adopted 
for study. Weibull distribution is used by Lo et al. and Wee and Chen 
(Lo et al. 2007; Wee et al. 2007). Bakal and Akcali use normal distribution in 
their analysis (Bakal and Akcali 2006). Uniform distribution has been used by 
Mukhopadhyay and Ma (2009 and Tang et al. (2012). For the numerical study, the 
following data sets are assigned as base value throughout this section: 

a = 150, b = 3, S = 100, y = 3, a = 10, /? = 10, c r = 5, d = 2, c = 30. 


Effects of Random Yield Rate 
Identical Yield Rate 

Firstly, sensitivity analysis is conducted with identical yield rate. In this situation, 
acquisition return and replacement have the same yield rate condition. Analytic 
results are presented in section “Identical Yield Rate.” For the sensitivity analysis, 
the standard deviation of yield rate a is fixed at 1/75, and the mean value // varies 
from 0.3 to 0.8. Figure 2 shows how the pricing decisions change accordingly. 
According to Fig. 2, the optimal acquisition price first increases with //, then after a 
threshold, the price slightly decreases as // further increases, while the optimal 
replacement purchase is always decreasing in // within the range of computational 
experiment. When yield rate is low, the optimal selling price for new customers is 
independent of //, but as // further increases, the optimal price decreases to attract 
more first-time buyers. The result suggests that when expected return yield is low, 
the firm would choose to acquire less cores (( 6 + rj)B < 6 + go) and demand is 
satisfied by both manufacturing and remanufacturing. More specifically, if the 
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Fig. 2 Expected yield rate and pricing decisions 


reusable return is less than the replacement, then all the new demands are satisfied 
by manufacturing. Consequently, in this case, selling price p is independent of yield 
condition. When p is high, supply of reusable return is ample, and the firm then 
reduces the selling price to attract more customers. 

Corollary 2 shows that when yield rate is identical and deterministic, the value 
of price discrimination is independent in c, cy, and p. However, the conclusion 
under random yield is not easy to draw. Hence, computational experiments are 
used to verify whether this result still holds in random yield situation. Firstly, the 
yield variance a is fixed at 1/75 to study the effect of expected yield rate p. 
Afterward, p is fixed at 0.5 and a is varied to see how standard deviation affects 
expected profit. Figure 3 illustrates the profit difference of these two pricing 
schemes under different yield rate conditions. Under both pricing schemes, the 
expected profit is increasing with p and decreasing with a. These two observations 
are consistent with intuitions as higher yield rate saves acquisition cost and lower 
randomness leads to higher profits. It can be also observed that to offer a trade-in 
program is especially favorable when the expected yield rate is low and the 
variance of yield rate is large, as the percentage of profit improvement is higher 
in such cases. On the other hand, managers should also take into account the 
related cost of such a market decision. 

Another observation is that, under random yield rate, the profit difference 
between the two pricing strategies is stable with respect to both p and o. More- 
over, although not shown here, numerical results also reveal that the profit 
difference is independent of c and cy, which is consistent with the case of 
deterministic yield rate. 
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Fig. 3 Value of price discrimination 


Different Yield Rates 

As stated in section “Different Yield Rate of Returns,” the difference in return 
causation may lead to different return quality conditions. This section investigates 
the situation when p x and p 2 are independent random variables. Table 3 summarizes 
how pricing decisions are affected by different yield rate conditions. When p x is 
low, the acquisition price r is unchanged to different levels of p 2 . When p x is in the 
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Table 3 Pricing decisions 
with respect to different 
yields 


Hi 

yW 2 

ir, f, p) 

E[U] 

0.4 

0.4 

(3.50, 27.67, 40.00) 

598.83 


0.6 

(3.50, 25.17,40.00) 

702.58 


0.8 

(3.50, 22.67, 40.00) 

843.83 

0.6 

0.4 

(5.07,27.17,38.74) 

862.82 


0.6 

(5.10, 25.06,38.70) 

960.03 


0.8 

(4.61,22.49,38.14) 

1090.06 

0.8 

0.4 

(5.16, 25.68, 36.42) 

1107.96 


0.6 

(5.00, 23.85, 36.22) 

1198.79 


0.8 

(4.68, 22.08, 35.83) 

1305.09 


middle level, r first increases in p 2 and then decreases. When p x is high, r decreases 
in p 2 . The replacement price /is decreasing in both p x and / 1 2 . The selling price 
p behaves similar to the identical yield case, which remains unchanged when return 
supply is insufficient to fulfill replacement returns, and then decreases as return 
yield further increases. 

Define the profit difference with and without a trade-in rebate as the profit gain 
from price discrimination. Figure 4 shows that when p x and p 2 are independent 
random variables, the profitability of price discrimination policy is affected by not 
only return yield condition but also replacement customers’ price sensitivity. As 
indicated by Fig. 4, the profit gain is increasing in the expected yield rate of 
replacement return while decreasing in that of acquisition return. This result is 
different from the case when these two yield rates are identical. As Corollary 2 and 
Fig. 3 show that when yield rates are identical, the profit gain remains the same with 
respect to the different yield rate levels. The result highlights the importance of 
identifying return yield conditions of different customer segments. Another observa- 
tion is that the profit gain is affected by replacement customers’ price sensitivity y. 
This is consistent with Corollary 1, which suggests that when y is low, the company 
has less incentive to offer higher rebates to replacement customers. 

The numerical study implies that when p 2 and y are high, a greater rebate to 
replacement customer can stimulate the replacement sales and acquire more reus- 
able returns. Therefore, remanufacturing managers are recommended to use price 
discrimination policy under such circumstances. 

Although not shown here, the numerical study also reveals that the yield 
randomness does affect the profitability, but the effect is minor compare with the 
above two factors. 


Quality-Dependent Rebate Policy 

Up to this point, it is assumed that remanufacturing company will rebate all the 
return customers without considering the inspection results of return products. 
While in practice, some companies do check the reusability of return products 
and pay the rebates based on inspection results. Ray et al. discuss the relation 
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Fig. 4 Value of price discrimination, yield rate, and price sensitivity 


between products’ residual value and replacement decisions in a deterministic 
environment (Ray et al. 2005). They assume the perceived residual value of product 
depends on the remaining useful lifespan, and both customers and the 
remanufacturing firm are fully aware of this information. 

In this study, the actual yield condition can only be observed after inspection; it 
is hard to imagine that return customers would know this information in advance, 
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while it is assumed that the remanufacturing firm is well experienced and knows the 
yield distribution. Since return customers do not have knowledge on the yield 
distribution, their return decisions only depend on the nominal acquisition price r. 
It is assumed that the acquisition return function is the same as the case when 
rebates are paid to all return customers. 

For the replacement customers, their demand may have two possible situations. 
Firstly, if replacement customers behave the same as acquisition return customers, 
the replacement demand function is also unchanged. In this case, the expected profit 
for the remanufacturing company becomes 


E[Uri(>' J ^ P)\ = cop + 0(fi 2 f + (1 — p 2 )p ~ d) — t](p\r + d) 

— c r E[min{co + 0,p { f] + /? 2 0}] — cE\(cq + 0 — p x ri — p 2 0) 


+ 


where pi and p 2 are the expected yield rate of acquisition and replacement returns. 

Secondly, although the return decisions are induced by the nominal rebates, the 
replacement customers can make their purchasing decisions after yield information 
is revealed. In addition, we assume the customers are homogeneous in price 
sensitivity with respect to the return yield condition. Therefore, the expected profit 
for the remanufacturing company becomes 


£[H ;2 ('■>/»] = cop + p 2 0(f) {f - c r ) + (1 - p 2 )0(p)p - 0(f)d - t](pp- + cl) 

- c r E[mm{w + (l - p 2 )0(p),pit]}\ -cE[{w+{\ - p 2 )0{p) ~ P\n) + _ 

Figure 5 demonstrates how different rebate policies and yield conditions affect the 
firm ’ s expected profit. If both replacement and acquisition customers only look at the 
nominal prices, then a quality-dependent rebate policy will improve the firm’s profit 
significantly. Furthermore, the firm will offer higher rebates to attract more replace- 
ment customers and increase the selling price p to generate more profit from customers 
whose return cannot be remanufactured. This effect makes the firm more profitable 
when p 2 is low. While for the second situation, the expected profit increment is much 
less. In this case, the replacement purchasing decision differs regarding to the yield 
realization. Although the company saves rebate costs for unusable return, part of the 
replacement customers can only buy at price p and their demand is deterred. 

Our numerical results suggest that the profitability of such quality -based rebate 
policy is largely affected by the return customers’ response. In remanufacturing 
research, few studies consider the effects of different rebate policies. The actual 
reaction of return customers may fall between the above two scenarios. Further 
empirical studies are required to justify the assumption of customers’ return decisions. 


Summary 

Matching supply and demand is the major concern of managers who are dealing 
with remanufacturing business. Most studies in remanufacturing systems have 
assumed that supply and demand are two independent flows. This assumption is 
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Fig. 5 Performance of quality-dependent rebate policy 


reasonable for new product manufacturing and sales. However, due to the existence 
of replacement customers, it does not hold for remanufacturing business. 

This study investigates the pricing decisions of a remanufacturing firm who are 
facing both new and replacement demands. A single-period model is developed to 
evaluate the benefit of adopting a price discrimination policy. It is the first attempt 
to study the effect of replacement customers in remanufacturing business. 


3308 


L. Jing et al. 


For deterministic yield condition, it is shown that the price discrimination policy is 
applicable if replacement customers have high price sensitivity, while new cus- 
tomers and acquisition customers have low price sensitivity. When yield rate is 
uncertain, due to the complexity of the problem, a closed-form solution is not 
attainable. Computational experiments are conducted to compare the profits of 
different pricing schemes. Factors like yield rate conditions and customers’ price 
sensitivities are investigated. The numerical results show that both factors are 
crucial for the firm. The price discrimination policy to replacement customer is 
no worse than the uniform pricing policy in every case. Furthermore, price dis- 
crimination policy is significantly better off when the yield rate of replacement 
return is high. The payment scheme of return rebates also affects firm’s profit. 
According to the numerical study, when rebates are only offered to reusable returns, 
the firm is significantly better off if replacement and acquisition customers both 
make decisions only based on the nominal repurchasing price. 

The present model has assumed deterministic demand function; in practice, 
however, the demand information is usually imperfect. Consequently, it is mean- 
ingful to incorporate random demand into the model. The company will then decide 
on both pricing strategy and production quantity. It would increase the complexity 
of the model, but such a model is similar to the newsvendor problem with endog- 
enous demand, which has been extensively studied. The existing results will 
facilitate the analysis with a remanufacturing problem setting. 

There are several other possible extensions for this model. One is to relax the 
assumption of independence of new customer and replacement customer. In prac- 
tice, replacement customers may choose to purchase a new product without 
returning their old one. The demand from this customer segment will then depend 
on both/ and p. It is expected that the optimal pricing policy would be different, but 
the price discrimination policy should preserve its profitability. A limitation of this 
model is that the yield rate is taken as the fraction of reusable returns. In practice, 
return products are usually under different quality conditions and require different 
remanufacturing costs. The current model would be more realistic if multiple type 
returns can be incorporated. Besides, one can also consider that the remanufactured 
products are imperfect substitutes of brand-new products. It is interesting to see 
how cannibalization effect will change the pricing decisions in such cases. 


Appendix 

Proof of Lemma 1 Define //(x, p) = 0(f) + co(p) — p(rj(r) + 0(f)) and K(x) = 

0 ^)+^) > where x = (r, f p). It is obvious that £[Fl(x)] is differentiable for 

//(x, B) < 0 and //(x, B) > 0. The only thing that needs to be proven is whether 
£[II(x)] is differentiable at //(x, B) = 0. Let x 0 = { x I //(x, B) = 0 } ; it can be shown 
that the partial derivatives at x 0 exist and are continuous. 

Denote A as a vector so that //(x 0 + A, B) > 0 and //(x 0 — A, B) < 0. Consider the 
special case where Ai = te- x = ^ (3 ^ ^ ^ ^ (3 ^ ^ i € {r,f,p}. 
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£[n( x o)] - £[n( x o - A r )] = ri(r 0 - t) (r 0 - t + d) - t](r 0 ) (r 0 + d) 

/ j-Jf(xo) \ 

+ (c r - c) I H(x 0 ,p)g{p)dp I 

*/C(x 0 -A r ) 

H(x 0 - A r ,p)g(p)dp) 

Ja 

Let IA r l — ■» 0: 



where K(x 0 — A r ) < E, < K(x 0 ) and f > K(x 0 ) as IA r l — > 0. 
Then we can obtain 



The last equality comes from the fact that K(x 0 ) = B. It is easy to show that 



£[n(xo + A r )] - £[n(xo)] 

A,. 


= -t](r 0 ) - rf r (r 0 ) (r 0 + d) + (c - 



Similarly, it can be proven that the partial derivatives exist and are continuous 
with respect to /and p. Hence, £[n(x)] is differentiable at x 0 . 

Poof of Proposition 2 It is easy to show that E[Y[(r, /, p)] is concave when 
( 6 + rj)A < 6 + co < (6 + rf)B. For ( 6 + rj)A < 6 + go < (6 + rj)B, it is necessary to 
show E [— n (r,/, p)] is convex in r,/, and p. Applying Sylvester’s criterion, it is 
equivalent to prove 


1. H i 


_ d 2 E[-Y\{r,f,p)] q 

dr 2 
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Defin eC = c ^g(<$>),x 

f}(a>+6 

rj+d 

andy — Since c > c r , it 


> 0 


that C > 0. It can be obtained that Hi = 2/3 + Cx 2 > 0, H 2 = 4 fy + 2 C/3y 2 + 2C/x 2 > 
0, and // 3 = 4 Cby 2 /3 + 4C/?x 2 y + 8/?/fy + 4Cb 2 /3y > 0. Therefore, the expected profit 
function is concave on (6 + rj)B < 6 + go. Combing Proposition 1, it can be 
concluded that E\]J(r,f ’ /?)] is concave. 
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Abstract 

There has been a growing interest in remanufacturing during the past decade, 
since it offers many advantages to our economy. However, the qualification and 
quantification of the benefits of remanufacturing compared to original 
manufacturing remain confusing to us due to the difficulties of data collection 
in complex production processes and the lack of accurate and convinced eval- 
uation method. Life cycle assessment (LCA) is a “cradle to grave” approach for 
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assessing industrial products and systems, which enables to estimate the cumu- 
lative environmental impacts resulting from all stages in a product life cycle. In 
this book, taking a diesel engine as a case study, a comprehensive LCA is 
conducted for remanufactured diesel engines, aiming to identify the negative 
impact on the environment during the whole life cycle and to analyze the 
potential that remanufacturing had in terms of energy savings and environment 
protections. In order to demonstrate the environmental benefit of 
remanufacturing, the environmental impacts achieved in the study are compared 
with a newly manufactured counterpart. The results show that remanufacturing 
of a diesel engine has lesser contribution to all the environmental impact 
categories when compared to its original manufacturing; the greatest benefit is 
EP which is reduced by 79 %, followed by GWP, POCP, and AP which can be 
reduced by 67 %, 32 %, and 32 %, respectively. 

Introduction 

Background 

Resource and Environmental Problem 

Global emissions of carbon dioxide (C0 2 ) - the main cause of global warming - 
increased by 3 % in 20 1 1 , reaching an all-time high of 34 billion tons in 2011. In 
2011, China’s average per capita C0 2 emissions increased by 9 % to 7.2 1 C0 2 (Jos 
et al. 2012). The International Energy Outlook 2013 (IEO2013) projects that world 
energy consumption will grow by 56 % between 2010 and 2040, and the industrial 
sector continues to account for the largest share of delivered energy consumption; 
the world industrial sector still consumes over half of global delivered energy in 
2040 (IEA 2013). Statistics show that according to the present automobile growth, 
the volume of the end-of-life automobiles will reach up to six million by 2015, and 
the large quantity of the discarded cars and engines will lead to resource waste and 
environment pollution. Increasingly serious resource consumptions and environ- 
ment problems have attracted more and more attention by the society and busi- 
nesses. The government is establishing legislations and policies to encourage 
manufacturers to conduct green design and to explore methods for minimizing 
the effects of their activities on the environment (Zhang and Yu 1999; Kaebemick 
et al. 2003; MlastasPaul and Zimmemm 2003). 

The rapid depleting metal resources bring about a rigorous challenge to car 
components manufacturers and halt economic development of China. Statistics 
show that most of the emissions are given out by the processes associated manufactur- 
ing industry, among which the metal processing operations have a major share in the 
energy consumption. For instance, heavy-duty tmck engines with a large amount of 
steel and aluminum contribute significantly towards C0 2 emissions. Besides, more 
than 80 % of industrial raw materials are dependent on the supply of mineral resources 
from within China, and resource reserve shortage has become a major restriction for 
the development of the equipment manufacturing industry. 
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Modem manufacturing industry is facing the problem of how to reduce the 
environmental impacts of manufacturing. Some experts have put forwards four 
steps: understand the sources of the impact, quantify the environmental impact, 
identify improving opportunities, and then apply impact reduction strategies and 
assess the effectiveness. Quantifying the environmental impact bridged the preced- 
ing and the following when considering reducing the environmental impacts and 
quantifying the differences between a new strategy and traditional mode. There are 
two major problems in quantifying an impact in manufacturing: (1) manufacturing 
is not a stand-alone system and (2) both inputs and outputs of manufacturing are 
closely linked to other systems and processes. During environmental impact con- 
trol, the impact may shift from one process to another or from one life cycle stage to 
another. Due to the two complexities, life cycle assessment (LCA) has to be used 
for a comprehensive and reliable assessment. 

Development of Remanufacturing 

Before remanufacturing, material recycling is always applied as product end-of-life 
strategy. Material recycling could return the consumed product to their original raw 
material form to be used again, but it requires added labor, energy, and processing 
capital to recover the raw materials. Normally, the relative costs of material, labor, 
energy and the contribution of plant and equipment are the major concerns in 
product manufacturing. Remanufacturing could preserve much of this value while 
adding a second life to the product. In contrast, recycling shreds the product in an 
attempt to recover only the material value. Little or none of the other residual values 
in the product are retained. Reuse could save the labor from original processing and 
also retain the function and the design. The material recovery value chain of 
recycling, remanufacturing, and reuse is shown in Fig. 1. 

Remanufacturing could repair degraded components and put the product back into 
service, thus retaining the value of the extracted and refined materials (Kumar 
et al. 2007). Steinhilper said remanufacturing can avoid between 38 % and 53 % of 
carbon dioxide generated from new production in the 2010 International BIG R Show 
(Abby 2011). The remanufacturing of vehicles dates back to the 1940s. In 1947, a 
take back scheme called “Exchange Parts Program” was launched by Volkswagen to 
meet raw material shortages after World War II. The program significantly reduced 
the material and energy consumption for a large proportion of the components. 

Engine remanufacturing is a process of recovering the performances of the used 
engine after serious remanufacturing processes based on remanufacturing standard. 
Being different from the original engine manufacturing and traditional engine 
overhaul, engine remanufacturing begins with used engine reverse logistics, taking 
repairable components as processing objects, going through disassembly, cleaning, 
inspection, repairing, and reassembly processes. Once the product is disassembled 
and the parts are recovered, the process concludes with an operation not too 
different from the original manufacturing. Disassembled parts are inventoried, 
just like purchased parts, and made available for final assembly. It is being realized 
that a diesel engine remanufacturing too has better environmental performance than 
its original manufacture because of the fact that the materials’ shaping processes 
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Fig. 2 Framework of product life cycle assessment 


such as molding, casting, etc. can be avoided. Professor Xu from the National Key 
Laboratory for Remanufacturing said that in essence, parts remanufacturing can 
save over 70 % of material costs, cut energy consumption by 60 %, and lower 
overall cost by 50 % (Xu 2007). Since 2008, China has been trying to set up several 
auto parts remanufacturing bases under the direction of the National Development 
and Reform Commission. However, qualification and quantification of the benefits 
of diesel engine remanufacturing compared to original manufacturing remain 
unsolved due to the difficulties of data collection in complex production processes 
and the lack of accurate and convincing evaluation methods. 

Remanufacturing a qualitative transition of engine, it could give a second service 
life to an engine with advantages of high quality and efficiency and low cost and 
pollution. Remanufacturing improves sales volume and profit for enterprises as well 
as brings about considerable environmental benefits. 


Life Cycle Assessment (LCA) Method 

Life cycle assessment (LCA) is a “cradle to grave” approach for assessing industrial 
products and systems, which enables the estimation of the cumulative environmen- 
tal impacts resulting from all stages in a product life cycle, often including impacts 
not considered in more traditional analyses (EPA 2006). 

According to the ISO 14040 and 14044 standards, an LCA consists of the 
following four components (see in Fig. 2): 

1 . Goal and scope definition - Determine the type of information that is needed to 
add value to the decision-making process. From EPA 2006, the following six 
basic decisions should be made at the beginning of the LCA process to make 
effective use of time and resources: 

(a) Define the goal(s) of the project. 
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(b) Determine what type of information is needed to inform the decision- 
makers . 

(c) Determine the required specificity. 

(d) Determine how the data should be organized and the results displayed. 

(e) Define the scope of the study. 

(f) Determine the ground rules for performing the work. 

2. Life cycle inventory analysis - Quantify energy and raw material requirements, 
atmospheric emissions, waterborne emissions, solid waste, and other releases for 
the entire life cycle of a product, process, or activity. EPA 1995 defined the 
following four steps of a life cycle inventory: 

(a) Develop a flow diagram of the processes being evaluated. 

(b) Develop a data collection plan. 

(c) Collect data. 

(d) Evaluate and report results. 

Life cycle inventory (LCI) analysis is the most labor-intensive, time-consum- 
ing, and costly process. Inventory analysis involves the collection of data and 
calculations in order to quantify the inputs and outputs to the product system 
over its entire life cycle (ISO 1999). 

Currently, the most commonly used inventory analysis methods include 
simplified LCI, process-based LCI, matrix-based LCI, economic input-output 
LCI, hybrid LCI, etc. The comparisons of the major LCI approaches are shown 
in T able 1 . 

3. Life cycle impact assessment - Assess the potential human and ecological 
effects of energy, material usage, and environmental releases, as identified in 
the inventory analysis. 

The following steps comprise a life cycle impact assessment: 

(a) Selection and definition of impact categories - identifying relevant environmen- 
tal impact categories (e.g., global warming, acidification, terrestrial toxicity) 

(b) Classification - assigning LCI results to the impact categories (e.g., classi- 
fying carbon dioxide emissions to global warming) 

(c) Characterization - modeling LCI impacts within impact categories using 
science-based conversion factors (e.g., modeling the potential impact of 
carbon dioxide and methane on global warming) 

(d) Normalization - expressing potential impacts in ways that can be compared 
(e.g., comparing the global warming impact of carbon dioxide and methane 
for the two options) 

(e) Weighting - emphasizing the most important potential impacts 

In conclusion, LCA is conducted to calculate the final environmental impacts 
indicator by 
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where El is the final environmental impact indicator, G, is the value of zth substance 
in life cycle inventory, El { is characterization factor of zth substance to kth indicator, 
k — 1 ~ 5, R k is the reference value of kth indicator, V k is the weight factor of kth 
indicator, m is the number of substance related to Mi indicator, and n is the number 
of the indicators. 

4. Life cycle interpretation - Identify, quantify, check, and evaluate information 
from the results of LCI and LCIA and communicate them effectively (ISO 
1998). Within the ISO standard, the following steps to conducting a life cycle 
interpretation are identified and discussed: 

(a) Identification of the significant issues based on the LCI and LCIA 

(b) Evaluation which considers completeness, sensitivity, and consistency 
checks 

(c) Conclusions, recommendations, and reporting 

An LCA can help decision-makers select the product or process which results in 
the least impact to the environment. In this paper, a comparative life cycle assessment 
is conducted for an originally manufactured diesel engine and compared with its 
remanufactured counterpart, aiming to identify the negative impact on the environ- 
ment during the whole life cycle and analyze the potential that remanufacturing 
possesses in terms of energy savings and environmental protections. 


Technical Processes of Engine Remanufacturing 

Technical process flows of engine remanufacturing (shown in Fig. 3) includes 

disassembly, classification and cleaning, inspection, repairing, reassembly, etc. 

1 . Full-Scale Disassembly 

Disassembly can be defined as the systematic separation of an assembly into 
its components, subassemblies, or other groups (Fambert and Gupta 2005). It is 
an important process in material and product recovery since it allows for the 
selective separation of desired parts and materials. During the engine disassem- 
bly, the quick wear parts, such as the piston assembly, main shaft bushing, oil 
seal, rubber hose, and cylinder head gasket, are discarded directly. These 
components always cannot be remanufactured or with no remanufacturing 
value, and they will be substituted by the new parts when reassembling. The 
major components after engine disassembly are shown in Fig. 4; some quick 
wear parts are shown in Fig. 5. 

2. Components Cleaning 

All the parts coming from the disassembly process are cleaned, and the 
cleaning process involves washing away dirt and dust from the parts as well as 
degreasing, deoiling, derusting, and freeing the parts from old paint (Steinhilper 
1998). Several cleaning methods can be applied according to the different 
materials and contaminations, including pyrogenic decomposition, chemical 
cleaning, ultrasonic cleaning, and liquid spraying. 
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Fig. 4 Major components after disassembly 



Fig. 5 Quick wear components 

3. Inspection and Identification 

Inspection of disassembled and cleaned parts is required to determine their 
reusability and reconditionability. According to Steinhilper (Steinhilper 1998), 
there are two important aspect of inspection in remanufacturing: 
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Fig. 6 Used diesel engine 


• Specification of criteria and condition characteristics required for the deter- 
mination of the condition of the components 

• Development and application of suitable and affordable testing equipment 
The testing equipment used for the testing of new parts is generally used after 

reconditioning the disassembled parts. During inspection, the components are 
sorted after testing, those which can be reused directly, such as inlet pipe 
assembly, manifold, oil pan, and timing gear covers etc., are loaded into the 
warehouse for reassembly; the failure components which can be repaired, such 
as cylinder block assembly, connection rod assembly, crankshaft assembly, fuel 
injection pump assembly, and cylinder head assembly etc., are prepared for 
remanufacturing . 

4. Repairing for the Components Which Can Be Remanufactured 

Several methods and technologies can be applied when repairing the failure 
parts, for example, the advanced surface technology applied in surface dimen- 
sion restoration to achieve a better performance compared with original parts, or 
mechanical manufacturing technology applied to reprocess the remanufactured 
parts to satisfy the tolerance scope for assembly. 

5. Reassembly 

The parts are reassembled into a remanufactured product using the same 
power tools and equipment used in the assembly of new parts (Steinhilper 1998). 

Then the remanufactured engine will go through testing, coating, and package 
processes. The effect drawings of the used engine before and after 
remanufacturing are shown in Figs. 6 and 7. 
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Fig. 7 Remanufactured diesel engine 


Table 2 Technical 
parameters of WD615.87 
diesel engine 


Parameter 

Quantity 

Unit 

Weight 

850 

Kg 

Volume 

9,726 

ml 

Rated power 

213 

kw 

Rated speed 

2,200 

r/min 

Torsion 

1,160 

N • m 

Torque speed 

1,100-1,600 

r/min 


Case Study: LCA-Based Evaluation of Diesel Engine Block 
Remanufacturing 

Goal and Scope Definition 

The goal of this study is to analyze the energy consumptions and environmental 
impacts of original remanufacturing of a diesel enginewith the perspective of total 
life cycle. Resource and energy consumptions and air/water emissions are carried 
out and five environmental impacts which are Global Warming Potential (GWP), 
Acidification Potential (AP), Eutrophication Potential (EP), and Abiotic Depletion 
Potential (ADP) are assessed in this LCA. 

The engine evaluated in this study is WD615.87 with in-line 6-cylinder, water- 
cooled, and turbocharged engine having a total displacement of 9.726 L. In this LCA, 
functional unit is defined as “300,000 km driven using a WD6 15-87 diesel engine.” The 
major technical parameters of the diesel engine under analysis are shown in Table 2. 

A cradle to gate boundary scope was selected when analyzing the life cycle of the 
remanufactured diesel engine, beginning with the used engine recycled back to the 
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workshop through disassembly, cleaning, refurbishing, and reassembly. Due to time 
constraint and technical restrictions, it is difficult to track the usage information of a 
remanufactured diesel engine; it is assumed according to the remanufacturer’s assur- 
ance. A remanufactured engine has the quality as good as a new engine and, therefore, 
meets the same fuel requirements as an originally manufactured engine. Moreover, as 
for the period of end-of-life disposal, the remanufactured engines are recycled back for 
another remanufacturing period; therefore, the phase end-of-life disposal is excluded 
from the evaluation scopes. The components considered in our analysis include the six 
parts which can be manufactured in the workshop, including cylinder block, cylinder 
head, crankshaft, connection rod, gearbox, and the accessories which are purchased 
from outside but can also be remanufactured. It was investigated that an engine can be 
remanufactured three to five times. As the failure modes and repair methods are 
usually different each time and the remanufactured engine has not reached service 
life, the given remanufacturing cycle has been considered as the first one. Figure 8 
shows the system boundary of this life cycle assessment. 


Life Cycle Inventory Analysis 
Data Resources 
Materials Production 

The materials consumed in the engine components manufacturing are mainly steel, 
cast iron, and aluminum. As for remanufacturing, there are some additional mate- 
rials such as kerosene, copper, nickel and diesel for refurbishing the components. 
The respective quantities of the main materials used in manufacturing/ 
remanufacturing are shown in Table 3. 

The raw materials need to be extracted and refined from the minerals and then 
undergo various remanufacturing processes to rebuild the engine parts. Energy and 
resources are used for this purpose. Aluminum, cast iron, and diesel are the three 
major materials of diesel engine remanufacturing, which bring about large amounts 
of energy consumptions and environmental emissions. The data related to energy 
requirements, air/water emissions of materials, mining, and production phases are 
referred from the Chinese Life Cycle Database (CLCD) developed by IKE, China 
(Liu et al. 2010). The CLCD database can reflect the average production levels 
existing currently. The inventory of unit material production is shown in Table 4. 


Table 3 Main materials used in manufacturing/remanufacturing 


Materials for manufacturing 

Quantity (kg) 

Materials for remanufacturing 

Quantity (kg) 

Steel 

188.19 

Nickel 

0.388 

Cast iron 

578.83 

Cast iron 

9 

Aluminum 

39.9 

Aluminum 

10 

Alloy 

32.92 

Diesel 

14.91 

/ 

/ 

Kerosene 

8.8 
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Table 4 Inventory of unit material production 


Inventory 

(kg) 

Production 
of nickel 

Production 
of diesel 

Production of 
aluminum 

Production of 
kerosene 

Production of 
cast iron 

Coal 

1.48E+01 

8.58E-02 

1.25E+01 

1.27E+01 

1.11E+00 

Crude oil 

1.60E+00 

1.21E+00 

5.11E-01 

1.38E+00 

4.78E-02 

N atural 

gas 

1.25E-01 

6.04E-04 

1.68E-01 

1.04E-01 

1.11E-03 

CO 

2.23E-02 

4.02E-04 

5.76E-03 

1.91E-02 

4.84E-04 

co 2 

2.85E+01 

3.75E-01 

2.25E+01 

2.45E+01 

2.21E+00 

so 2 

1.00E+00 

2.62E-03 

7.74E-02 

8.65E-01 

4.68E-03 

NO x 

1.25E-01 

6.04E-04 

5.56E-02 

1.03E-01 

2.33E-03 

ch 4 

8.33E-02 

2.05E-02 

6.37E-02 

7.16E-02 

5.15E-03 

h 2 s 

6.25E-02 

4.71E-06 

4.69E-04 

4.68E-02 

1.11E-05 

HCL 

5.69E-03 

3.04E-05 

4.84E-03 

4.89E-03 

5.22E-05 

CFCs 

2.15E-08 

6.60E-10 

3.16E-10 

1.84E-08 

1.33E-02 

BOD 

6.02E-02 

7.38E-03 

9.77E-03 

5.18E-02 

5.72E-03 

COD 

6.25E-02 

8.65E-03 

1.48E-02 

5.33E-02 

5.98E-03 

nh 4 

2.35E-04 

2.01E-04 

1.62E-04 

2.27E-04 

3.02E-05 


Table 5 Inventory of the truck transportation process/tkm 


Inventory 

Coal 

Crude 

oil 

Natural 

gas 

CO 

co 2 

S0 2 

NO x 

Mass (kg) 

4.04E- 

03 

4.91E- 

02 

8.13E-04 

1.79E- 

02 

1.26E- 

01 

2.03E- 

04 

2.03E- 

03 

Inventory 

ch 4 

H 2 S 

HCL 

CFCs 

COD 

nh 4 

Dust 

Mass 

(kg) 

8.53E- 

04 

1.96E- 

07 

1.48E- 

06 

9.48E- 

05 

3.93E- 

04 

1.79E- 

05 

9.350E- 

05 


Reverse Logistics of the Used Diesel Engines 

According to the investigation, the used diesel engines for remanufacturing are all 
recycled back from the CNHTC 4S shop by truck (carrying capacity: 10 1); there are 
about 170 4S shops in the mainland; the average distance D avg covered for the old 
engine recycling is estimated by Eq. 2: 


m 

Y A X Ni 

Avg = ^~= (2) 

K total 

where R to ta i is the total recovery number of the used engines, Dj is the recycling 
distance of the one used engines in the /th 4S shop, and N f is the number of the used 
engines recovered by the /th 4S shop. 

Then, the average distance can be obtained by the investigation, and Z) avg = 800 
km. It is assumed that the truck consumes gasoline only and the transportation inner 
the plant is ignored, the energy consumption and emissions of unit distance when 
recycling can be obtained by CLCD (shown in Table 5). 
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Fig. 9 Detailed flow diagram of the cylinder block remanufacturing process {Reman. 1 and 
Reman.2 refer to the cylinder liner substitution and the cylinder liner brush plating.) 


Engine Disassembly 

When the used engines are recycled back to the workshop, they are usually 
disassembled by high-pressure air rifle; the average time for one engine disassem- 

Q 

bly is 300 min, and it will consume 30 m compressed air, which equals 1 .2 kg when 
converted to standard coal. 

Parts Remanufacturing 

The volumes of the materials for the component remanufacturing are quantified in 
the section of materials production. The energy consumptions for the six parts are 
measured during their remanufacturing processes. The detailed method for data 
gathering is stated in section “Data Collection in Parts Remanufacturing.” 

Air/Water Emissions 

The data for the air/water emissions have been discussed in detail in the data 
collection sheets. The different gases involved are C0 2 , CO, H 2 S, N 2 0, and 
chloro fluorocarbons (CFC). The water pollution emissions contains ammonia 
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Fig. 10 Detailed flow diagram of the crankshaft remanufacturing process ( Clean 1,2 : high- 
pressure water jet cleaning; Reman. 1\ silk hole repairing, polishing; Inspection: magnetic powder 
inspection; Reman. 2: crankshaft neck, connecting rod journal laser plating; Post-processing : 
crankshaft neck milling) 


nitrogen, Biological Oxygen Demand (BOD) and Chemical Oxygen Demand 
(COD). The data about the energy demand and environmental emissions are all 
obtained from the CLCD fundamental database. 

Data Collection in Parts Remanufacturing 

Remanufacturing processes are generally composed of several stages: disassembly, 
cleaning, testing, repair, inspection, updating, component replacement, and 
reassembly (Sherwood and Shu 2000). The flow diagram of the six parts 
remanufacturing processes are illustrated for data gathering and the resource and 
energy consumption of each part are collected from its remanufacturing line. 
Figures 9, 10, 11, 12, and 13 illustrate the data collection process of the cylinder 
block, cylinder head, crankshaft, connection rod, gearbox, and flywheel. 

Table 6 summarized the electricity and material consumptions of engine com- 
ponents remanufacturing; the main materials consumed in the engine 
remanufacturing are nickel, aluminum, cast iron, and kerosene and diesel. Energy 
and natural recourse consumption and environmental emissions generated in these 
materials production can be obtained by CLCD. 
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Fig. 11 Detailed flow diagram of the connection rod remanufacturing process ( Clean 1,2 : high- 
pressure water jet cleaning; Reman. 1 : bush substitution, boring, and milling; Inspection : magnetic 
powder inspection; Reman. 2: big hole nano brush plating; Post-processing : polishing and quilted 
grinding of the big hole) 


Usage 

According to the remanufacturer’s assurance, a remanufactured engine has the 
quality as good as a new engine and, therefore, meets the same fuel requirements 
as an originally manufactured engine. It is assumed that the diesel engine is used in 
a truck, the energy consumed in the usage is mainly diesel fuel production, and the 
emissions are generated in the diesel engine operation. The diesel fuel consumed in 
the usage is calculated as follows: 

Driving distance: 300,000 km, as is defined in the functional unit 
Fuel efficiency: 24 ~ 26 L/100 km, using the average 25 L/100 km (Lambert and 
Gupta 2005) 

Density of diesel: 0.85 kg/L 

Mass of the diesel: 3,000 x 25 x 0.85 = 63,750 kg 

The energy inputs and emission outputs of 63,750 kg diesel production is cited 
from the unit “diesel production” in CLCD, and the air/water emissions of the 
engine operation is cited from the unit of “operation, passenger car, diesel” in the 
public ecoinvent 2.0 database. 
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Fig. 12 Detailed flow diagram of the cylinder head remanufacturing process ( Clean 1 : high- 
temperature decomposition; Clean 2: dedicated cleaning machine; Reman.', valve pipe substitu- 
tion, valve processing, and surface grinding) 


Life Cycle Inventory 

The energy consumptions for remanufactured diesel engines along with different 
life cycle stages are shown in Table 7. Figure 14 is illustrated to show the inventory 
results more vividly, and logarithmic processing is conducted in order to normalize 
the result to a more tractable range. 

Three kinds of natural resources - coal, crude oil, and natural gas - are consid- 
ered in the production of remanufactured diesel engines. It is obvious that the usage 
period will consume the most energy and generate the most air emissions; com- 
paratively the used engine reverse logistics will bring about little environmental 
load. The crude oil and C0 2 are the biggest inventory substances, followed by coal, 
CH 4 , NO x , COD, and S0 2 . 


Life Cycle Impact Assessment 

Although much more can be learned about the processes by considering the life 
cycle inventory data, an LCIA provides a more meaningful basis to make compar- 
isons. Based on the life cycle inventory data, LCIA is conducted for the environ- 
mental impacts mentioned above according to ISO 14042 (Yang et al. 2002). 
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Fig. 13 Detailed flow diagram of the gearbox and flywheel remanufacturing process 


Table 6 Energy and resource consumption of engine components remanufacturing 


Components 

Cleaning 

Inspection 

Reman. 

Post-processing 

(kWh) 

(kWh) 

(kWh) 

(kWh) 

Crankshaft 

1.611 

0.886 

3.269 

6.615 

Connection rod 

0.483 

2.8 

5.14 

0.105 

Cylinder block 

3.608 

0.7 

2.8875 

17.694 

Cylinder head 

3.085 

13.44 

161.9 

/ 

Gearbox 

0.1 

/ 

0.595 

/ 

Flywheel 

0.162 

/ 

0.595 

/ 

Else 

24.76 

/ 

2.243 

/ 


At each process in remanufacturing, the inventory data sets, including resource 
extraction and air/water emissions, were collected and classified into the impact 
categories. Subsequently, through characterization and normalization processing, 
the environmental impacts were calculated for each category. 

Classification 

The LCI results are organized and combined into the impacts categories by classi- 
fication. The main impact categories to be investigated under this project are Global 
Warming Potential (GWP), Acidification Potential (AP), Eutrophication Potential 
(EP), Photochemical Ozone Creation Potential (POCP), and Abiotic Depletion 
Potential (ADP). 

Characterization 

Characterization provides a way to directly compare the LCI results with each 
impact category. Based on the inventory data, the results of LCI, such as raw 
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Table 8 Characterization results of diesel engine remanufacturing 


Environmental 

impacts 

Substances 

Remanufacturing 
quantity (kg) 

Characterization factor 

Remanufacturing 

ADP 

Steel and 
cast iron 

9 

1 .66E-6 

Kg Sbeq 

2.98E-03 

CML2002 

Aluminum 

10 

2.53E-5 

Nickel 

0.388 

4.18E-3 

Coal 

567.16 

8.08E-7 

Crude oil 

60.06 

9.87E-6 

N atural 
gas 

5.40 

7.02E-6 

GWP 

n 

o 

to 

1266.26 

1 

Kg C0 2 eq 

2886.24 

IPCC2007 

ch 4 

4.32 

25 

NO x 

4.98 

320 

CO 

10.33 

2 

AP 

so 2 

11.43 

1 

Kg S0 2 eq 

15.82 

CML2002 

NO x 

4.98 

0.7 

H 2 S 

0.44 

1.88 

HCL 

0.29 

0.88 

EP 

nh 4 

0.02 

3.44 

Kg N0 3 eq 

0.32 

CML2002 

COD 

1.15 

0.23 

POCP 

CO 

10.33 

0.03 

Kg C 2 H 4 eq 

0.31 

CML2002 


material consumption, energy consumption, and air/water emissions, were 
converted into impact indicators by multiplying the characterization factor with 
IPCC, CML, and WMO methodologies (Yang et al. 2002; WMO 1992). Table 8 
shows the results of characterization of the remanufacturing processes. 


Normalization and Weighting 

Normalization expresses the potential impacts in ways that can be compared with 
an equivalent value, and weighting assigns weights to the different impact catego- 
ries based on their perceived importance or relevance, which are based on the 
characterization results. Normalization and weighing of five environmental impacts 
of remanufacturing are shown in Table 9. The results show that the environmental 
impacts of manufacturing and remanufacturing are 1.72 and 0.86, respectively (not 
including ADP). 


Interpretation 

Life cycle interpretation is a systematic technique to identify, quantify, check, and 
evaluate information from the results of the LCI and LCIA. 
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Table 9 Normalization and weighing of environmental impacts of remanufacturing 


Environmental impacts 

Equivalent value a 

Remanufacturing 

WF b 

Result 

GWP 

8,700 

0.34 

0.83 

0.862 

AP 

36 

0.44 

0.73 


EP 

62 

5.16E-03 

0.73 


POCP 

0.65 

0.48 

0.53 



Equivalent value of the national standardization, 1990, China 
b Weighting factors according to the reduction target, 2000, China 


Table 10 Environmental impacts of different life cycle stages of remanufacturing after 
normalization 


Environmental 

impacts 

Processes of engine remanufacturing 

Materials 

production 

Old engine reverse 
logistic 

Component 

remanufacturing 

Usage 

GWP 

0.12 

0.05 

0.16 

53.18 

AP 

0.30 

0.03 

0.11 

17.28 

EP 

3.39E-03 

1.37E-03 

2.90E-04 

2.79 

POCP 

0.01 

0.46 

0.01 

33.66 


Fig. 15 Environmental 
impacts of the different 
remanufacturing life cycle 
stages 


100.00 



■ Materials Production ■ Components remanufacturing 

■ Old engine reverse logistic ■ Usage 


Contribution analysis is conducted in order to quantify the contribution of the 
life cycle stages or groups of processes compared to the total result and examined 
for relevance (EPA 2006). Environmental impacts of different life cycle stages after 
normalization are shown in Table 10. 
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Figure 1 5 illustrates the results of the environmental impacts as are presented in 
Table 10. From the results, it can be seen that during the life cycle of diesel engine 
remanufacturing, the usage brings about most environmental impacts, especially 
GWP and AP; production of materials brings about larger environmental impacts 
with regards to GWP and AP; and old diesel engine reverse logistics can bring 
about less environmental impacts except POCP. 


Comparisons with Newly Manufactured Engine 

In order to demonstrate the environmental benefit of remanufacturing more vividly, 
the results of the study are compared with an LCA case of new manufactured diesel 
engine with the same type (Li et al. 2013). 

The total energy inputs and emissions outputs during the life cycle of diesel 
engine manufacturing and remanufacturing are shown in T able 1 1 . 

Figure 16 illustrates the environmental emissions of diesel engine manufacturing 
and remanufacturing before usage more vividly. 

Remanufacturing offers significant savings in coal and natural gas consump- 
tions, which are 73.85 % and 71.1 %, respectively. On the other hand, it causes a 
little more crude oil consumption due to the production of kerosene, diesel mate- 
rials, and gasoline fuels which are consumed in used engine remanufacturing. 

Table 11 compares the environmental emissions during diesel engine 
manufacturing and remanufacturing, which shows that the remanufacturing process 
results in significant reductions in the most relevant air/water emission categories. 
For example, the production of a new diesel engine produces 4.84 t of carbon 
dioxide, while diesel engine remanufacturing produces only 1.25 t of C0 2 . It should 
be noted that remanufacturing brings about more H 2 S emissions from fuel com- 
bustion in old engine reverse logistics. 


Table 11 Total energy inputs and emissions outputs during the life cycle of diesel engine 
manufacturing and remanufacturing 


Categories 

Manufacturing 

Remanufacturing 

Energy savings 

Resources (kg) 

Coal 

2,703.74 

707.71 

1,996.03 

Crude oil 

104.13 

66.24 

37.89 

Natural gas 

24.81 

7.17 

17.64 

Air emissions (kg) 

CO 

15.37 

10.33 

5.04 

co 2 

4,844.01 

1,250.33 

3,593.68 

so 2 

14.44 

11.43 

3.01 

NO x 

11.83 

4.72 

7.11 

ch 4 

13.42 

4.21 

9.21 

h 2 s 

0.03 

0.44 

-0.41 

HCL 

0.84 

0.29 

0.55 

Water emissions (kg) 

BOD 

5.23 

0.95 

4.28 

nh 4 

0.05 

0.02 

0.03 
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■ Manufacturing ■ Remanufacturing 


Fig. 17 Environmental impacts of manufacturing and remanufacturing 


The environmental impacts of the manufacturing and remanufacturing strategies 
are compared and presented in Fig. 17. It is evident that remanufacturing of a diesel 
engine has lesser contribution towards all the environmental impact categories 
when compared with its manufacturing equivalent. The greatest benefit regarding 
environmental impacts is EP, which is reduced by 79 %, followed by GWP, POCP, 
and AP which can be reduced by 67 %, 32 %, and 32 %, respectively. 


Summary 

This study conducted a comparative LCA for a remanufactured diesel 
engine produced by China SINOTRUK. The results obtained could be used in 
the future for engine designing from a life cycle perspective. The energies 
consumed in the engine component remanufacturing processes are collected in 
the remanufacturing line, and all of them are showed in detailed process flows. 
Due to time constraints and technical restrictions, it is difficult to track the usage 
information of a remanufactured diesel engine; therefore, the usage of a 
remanufactured engine is regarded as the same with a new manufactured diesel 
engine, and accurate energy consumptions during the usage period of 
remanufactured engines require more detailed investigation and survey to guar- 
antee the quality of the LCA data. 

During the life cycle of diesel engine remanufacturing, the usage brings about 
most environmental impacts, especially GWP and AP; production of materials 
brings about larger environmental impacts with regards to GWP and AP; and old 
diesel engine reverse logistics can bring about less environmental impacts 
except POCP. 

Being different from material recycling, remanufacturing “recycles” the value 
originally added to the raw material, including the cost of labor, energy, and 
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manufacturing operations. However, recycling requires added labor, energy, and 
processing capital to recover the raw materials. Remanufacturing could make 
greater economic contribution per unit of product than recycling by cutting down 
energy consumption and resources used for processing. From the analysis provided 
in this paper, it can be concluded that remanufacturing of a diesel engine has lesser 
involvement towards all the environmental impact categories when compared to its 
manufacturing alternate. The greatest reduction is EP, which is reduced by 79%. 

The results in Fig. 15 show that usage and production of the materials, mainly 
aluminum and cast iron, brings about serious environmental problems. Future work 
will focus on building greater efficiencies into the remanufacturing processes and 
greener energies, reusing a greater percentage of end-of-life components, and 
developing more sustainable and energy-efficient materials for diesel engine. In 
the life cycle of remanufactured diesel engines, the environmental impacts are 
largely determined by diesel consumption, electric power, and material consump- 
tions; thus, subsequent analyses should focus on these aspects for further 
optimization. 
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Abstract 

Conventionally, corporations focused on economic value creation for share- 
holders. However, sustainable business practices require considering sustainable 
value added to all stakeholders. The overall sustainable value added can be 
evaluated by measuring economic, environmental, and societal values created 
for all the stakeholders. While economic value assessment methods are com- 
monly used and well established, there are challenges in defining and 
establishing methods for environmental and societal value assessment. 

Manufacturing is one of the key sectors for achieving economic growth. 
Applying the sustainable value framework in manufacturing applications 
requires a total product life-cycle approach that considers the four product life- 
cycle stages (pre-manufacturing, manufacturing, use, and post-use) and the 6R 
(Reduce, Reuse, Recycle, Recover, Redesign, and Remanufacture) approach to 
create sustainable value for all stakeholders through sustainable manufacturing. 
In order to evaluate how effectively sustainable manufacturing creates sustain- 
able value, there is a need for a structured approach for sustainability 
assessment. 

This chapter focuses on developing a sustainability performance evaluation 
methodology for manufacturing through the introduction of sustainability met- 
rics that quantify and measure sustainable value in a comprehensive manner 
incorporating numerous factors related to creating sustainable values in sustain- 
able manufacturing activities. The methodology defines sustainability metrics 
that cover economic, environmental, and social value added for products and 
manufacturing processes. The methodology also presents the process of normal- 
izing, weighting, and aggregating the measurements for the sustainability met- 
rics to evaluate the overall product sustainability index ( ProdSI ) and process 
sustainability index (P roc SI). The application of the ProdSI and ProcSI meth- 
odologies is demonstrated by a case study to evaluate the sustainability perfor- 
mance of an automotive component. 


Introduction 

Sustainable development is defined as development that meets “the needs of the 
present without compromising the ability of future generations to meet their own 
needs” (UNWCED 1987). Meeting the needs of the present depends on the use of 
our resources, and being able to meet the needs of future generations requires 
sustaining these resources. Sustainable development is also shown as the “process 
of achieving human development ... in an inclusive, connected, equitable, prudent, 
and secure manner” (Gladwin et al. 1995). Accordingly, a sustainable corporation is 
one that “contributes to sustainable development by delivering simultaneously 
economic, social, and environmental benefits” also known as the triple bottom 
line (TBL) (Hart and Milstein 2003; Elkington 1998). Corporate contribution to 
sustainable development is known as sustainable value (Figge and Hahn 2004). 
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Although there is universal agreement on the importance of sustainable develop- 
ment, developing methods for applying sustainable development and assessing 
sustainable value are still a challenge. 

Laszlo (2008) states that sustainable value addition must take into account 
the overall benefits or costs to both the shareholders and other stakeholders 
(i.e., anyone that can help or hurt a business including producers, consumers, the 
society, and the environment). Accordingly, sustainable value is generated only 
when business practices deliver value to shareholders without transferring it away 
from other stakeholders (Badurdeen et al. 2009). Any other cases resulting in 
transferring benefits from one group to another, or away from both, are considered 
unsustainable business practices. Ueda et al. (2009) presented three models for 
value creation based on the nature of interactions between two major stakeholders 
(producers and consumers) and the environment, which they also consider as a 
stakeholder. The three models are class I, providing value model; class II, adaptive 
value model; and class III, co-creative value model. The class III model shows 
sustainable value creation based on the premise of the producers interacting with 
the consumers and the environment (both natural and social) to create sustainable 
value. Although these two models consider the principles of sustainable develop- 
ment by addressing value creation for all stakeholders and consider the environ- 
ment, they present a conceptual approach and focus only on products. 

The definition of sustainable value must incorporate several different domains. 
One domain defines who the value is created for. In this context, value must be 
created for shareholders and other stakeholders. Accordingly, sustainable value is 
generated only when value is created simultaneously for shareholders and other 
stakeholders (Laszlo 2008; Badurdeen et al. 2009). Another domain defines what 
type of value is being created. The type of value created can be in terms economic 
value, environmental value, and societal value. Sustainable value creation requires 
increased value in all these categories. The third domain defines where the value is 
being created. Sustainable value is created at different levels: the product level, the 
process level, the enterprise level, and the system (or supply chain) level. Another 
domain can be defined in terms of area of application of sustainable development 
and can be classified, for example, along different industrial sectors such as 
agriculture, transportation, construction and building, energy, and manufacturing. 
This chapter focuses on the application of sustainable development in manufactur- 
ing for sustainable value creation. 

Manufacturing contributes to 16.5 % of total GDP worldwide and 12.4 % within 
the USA according to the World Bank data (The World Bank 2013). Aside from 
being a major value-adding contributor, manufacturing has been the engine for 
economic growth and has the highest effect on economic growth in the industry. 
Thus, to promote sustainable development, the value-generating capability through 
manufacturing should become a major focus. As mentioned previously, sustainable 
development must not only focus on economic growth, but also consider sustain- 
able value creation (incorporating economic, environmental, and societal values) 
for all stakeholders. Sustainable manufacturing is defined as “the creation of 
manufactured products that use processes that minimize negative environmental 
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impacts, conserve energy and natural resources, are safe for employees, communi- 
ties, and consumers and are economically sound” (U.S. Department of Commerce 
2009). Consequently, sustainable manufacturing requires a total product life-cycle 
approach that considers the four product life-cycle stages (pre-manufacturing, 
manufacturing, use, and post-use) and the 6R (Reduce, Reuse, Recycle, Recover, 
Redesign, and Remanufacture) (Jawahir et al. 2006) approach to create economic, 
environmental, and societal values for all stakeholders. 

In order to evaluate how effectively sustainable manufacturing creates sustain- 
able value, there is a need for a structured approach for sustainability assessment. 
According to the domains of sustainable development, quantitative sustainability 
assessment must incorporate the economic, environmental, and societal aspects of 
sustainable development. Although economic value assessment methods are well 
established, there are still challenges in defining and establishing quantitative 
methods to assess environmental and societal values. In addition, the quantified 
sustainability assessment must be done at different levels in manufacturing by 
assessing product sustainability, process sustainability, enterprise sustainability, 
and system sustainability. 

This chapter focuses on developing a sustainability performance evaluation 
methodology for manufacturing through the introduction of sustainability metrics 
that quantify and measure sustainable value in a comprehensive manner incorpo- 
rating all the different factors related to creating sustainable value in sustainable 
manufacturing activities, as presented in Fig. 1. 

Feng et al. (2010) presented a review of prominent metrics and indicators for 
sustainability assessment in manufacturing. They classified the different methodolo- 
gies based on the level of technical detail (from low to high) and the application 
domain (product, process, facility, corporation, sector, country, and world). This 
work summarized the various methodologies that have been developed by a wide 
range of entities including corporations (e.g., Ford), international organizations (e.g., 
OECD), government organizations (e.g., NIST), and standards organizations (e.g., 
ISO). The categorization of these different methodologies is presented in Fig. 2. 

Sustainable value assessment can be done at the product, process, plant, and system 
levels (Badurdeen et al. 2013). However, there could be difficulties in applying the 
sustainability assessment methods reviewed by Feng et al. (2010) at these levels. Most 
methods presented are not comprehensive as they focus on only part of the product’s 
life-cycle or a limited part of the TBL aspects. To have a comprehensive assessment, 
the sustainability content for both shareholders and all other stakeholders from the 
three aspects of the TBL must be considered without any aspect being overlooked or 
repetitive accounting. Furthermore, sustainability performance evaluation must gen- 
erate measures to assist decision makers to more effectively assess manufacturing 
improvement efforts that can increase sustainable value; it must lend itself for 
integration with other tools and techniques used to improve the manufacturing 
performance. This is only possible through a quantitative assessment approach that 
can evaluate sustainable value creation along all the aspects covered in Fig. 1. 

To assess the sustainable value creation in manufacturing, one needs to evaluate 
the sustainability performance of the manufacturing processes and products. 
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Fig. 2 Categorization of sustainability evaluation methodologies (Feng et al. 2010) 


Conforming to the definition of sustainable manufacturing from NACFAM 
(National Council for Advanced Manufacturing) addressing the product and 
manufacturing process (NACFAM 2012), the target of the research work summa- 
rized in this chapter is to develop a set of metrics and a framework through which 
those metrics can be used to evaluate the sustainability performance of products and 
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Fig. 3 Sustainable 
manufacturing metric 
hierarchy (Badurdeen 

et al. 2013) 




manufacturing processes. In the methodology proposed here, a set of comprehen- 
sive and quantifiable measurements considering the economic, environmental, and 
societal aspects of various factors in manufactured products and their manufactur- 
ing processes is identified. The total life-cycle behavior of both the product and the 
intermediate material flows is considered in this comprehensive sustainability 
evaluation where the 6Rs for closed-loop material flow are also considered. The 
metrics are quantitative, and the measured data need to go through a process of data 
collection, normalization, weighting assignment, and aggregation to generate local 
or global conclusions. 

The sustainability evaluation of products and manufacturing processes is the 
major focus in this chapter. However, such an evaluation can be expanded or 
aggregated, as needed, as shown in Fig. 3, to assess sustainability performance at 
narrower levels within the organization (e.g., machine level) or at a much broader 
level (e.g., enterprise and supply chain levels). 

The remainder of this chapter is organized as follows. The section “Sustainable 
Value Creation Through Sustainable Manufacturing” explains how sustainable 
value is generated through sustainable manufacturing and the key aspects that 
must be incorporated in evaluating sustainable value. The section “Methodology” 
presents the methodology to define sustainability metrics that cover economic, 
environmental, and social values added for products and manufacturing processes. 
This section also presents the process of normalizing, weighting, and aggregating 
the measurements for the sustainability metrics to evaluate the overall product 
sustainability index ( ProdSI ) and process sustainability index ( ProcSI ). The section 
“Case Study” demonstrates the application of the ProdSI I ProcSI methodology 
using a case study to evaluate the sustainability performance of an automotive 
component. Concluding remarks are presented in the section “Conclusions.” 


Sustainable Value Creation through Sustainable Manufacturing 

The U.S. Department of Commerce defines sustainable manufacturing as “the 
creation of manufactured products that use processes that minimize negative 
environmental impacts, conserve energy and natural resources, are safe for 
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employees, communities, and consumers and are economically sound” 
(U.S. Department of Commerce 2009). In addition to the US Department of 
Commerce definition on sustainable Manufacturing, NACFAM emphasizes the 
need for considering manufacturing of “sustainable” products and the “sustainable 
manufacturing” of all products (NACFAM 2012). Adapting the U.S. Department of 
Commerce definition and the NACFAM modification, Jawahir and Jayal (2011) 
stressed that sustainable manufacturing must demonstrate reduced negative envi- 
ronmental impact, offer improved energy and resource efficiency, generate mini- 
mum quantity of wastes, and provide greater operational safety and personal health, 
while maintaining and/or improving the product and process quality. 

Sustainable manufacturing creates sustainable value by applying three key 
concepts: economic, environmental, and societal consideration; the total life- 
cycle approach; and the 6R approach. The following sections provide an explana- 
tion of how the three concepts are related to sustainable value creation and how they 
should be considered in quantifiable sustainable value assessment methods, along 
with the total life-cycle consideration and the 6R approach. 


Economic, Environmental, and Societal Aspects 

Sustainable manufacturing extends beyond the conventional practices that focus on 
economic performance (e.g., costs) to also focus on environmental (e.g., resource 
use and wastes) and societal (e.g., health and safety) aspects. These three aspects 
are in line with creating sustainable value, which can be evaluated by measuring the 
economic, environmental, and the societal values added (Badurdeen et al. 2009). 

Manufacturing requires resources. According to sustainable development prin- 
ciples, these resources must be conserved for use by future generations. In 
addition, manufacturing generates wastes and emissions which impact both the 
environment and the society. Sustainable manufacturing must minimize these 
negative impacts. 

Therefore, quantifying sustainable value generation through sustainable 
manufacturing requires the quantification of environmental and societal impacts 
in addition to economic performance. The higher the economic benefit and the 
lower the adverse environmental and societal impacts, the greater the sustainable 
value created through manufacturing. 


Total Life-Cycle Consideration 

When evaluating the impacts of economic, environmental and societal aspects in 
discrete product manufacturing, the total life cycle, covering the four life-cycle 
stages (pre-manufacturing, manufacturing, use, and post-use), as illustrated in 
Fig. 4, must be considered. 

Various stakeholders are involved during different life-cycle stages of a product. 
Since sustainable value creation must consider impacts on all stakeholders, 
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Fig. 4 Total life-cycle approach for sustainability assessment 


comprehensive assessment of sustainable value must consider the total product life- 
cycle. Thus, the sustainability assessment of each factor related to product or 
process sustainability should be an aggregate of the overall benefits or impacts 
that occur throughout the four life-cycle stages. 


6R Approach 

The 6R approach (, Reduce , Reuse , Recycle , Recover , Redesign , and Remanufacture) 
promotes a multiple life-cycle concept (Jawahir et al. 2006), as shown in the closed- 
loop material flow system in Fig. 5. Recover is the activity of collecting end-of-life 
products for subsequent post-use activities. Redesign of the product in view of 
simplifying future post-use processes is another important element that incorporates 
environmental considerations at the design stage of both products and processes. It 
also offers an opportunity for redesigning the next-generation products using 
recovered materials and residues. Remanufacture involves the manufacturing pro- 
cesses utilizing recovered and reconditioned materials and components. It can be 
used to restore old products to like new condition, offering similar or even better 
performance than that of the original products, thus saving natural resources, 
energy, and cost and reducing the waste generation (Steinhilper 1998). 

The 6R approach is important because it allows moving from the cradle-to-grave 
concept, which involves only a single life-cycle and the first three stages, to a 
multiple life-cycle emphasis (Jawahir et al. 2006). Through the 6R activities and by 
consideration of multiple product life-cycles, closed-loop material flow is achieved 
to recover the economic, environmental, and societal value remaining in products at 
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Fig. 5 The 6R concept for a near-perpetual material flow (Jawahir et al. 2006) 


the end-of-life. In addition, applying reduce throughout the product life-cycle 
minimizes the adverse environmental and societal impacts of manufacturing oper- 
ations and maximizes the economic benefits. Therefore, there is a direct relation- 
ship between 6R activities and creating sustainable value; hence, the effectiveness 
of the 6R activities must be evaluated in the quantified assessment of sustainable 
value. 

Overall sustainable development not only focuses on economic growth, but also 
considers sustainable value creation (incorporating economic, environmental, and 
societal values) for all stakeholders. In addition, the total product life cycle must be 
considered to aggregate the value creation for all stakeholders across the four 
stages. Finally, the 6R approach must be incorporated to evaluate sustainable 
value creation across multiple life-cycles. Using a framework that integrates these 
aspects, sustainable value can be quantified and assessed in sustainable manufactur- 
ing applications. 

The following section presents a framework that applies a metrics-based 
approach for quantified sustainable value assessment in sustainable manufacturing. 
The framework focuses on sustainability assessment at the product and process 
levels and incorporates the above aspects. 


Methodology 

Following the generic framework presented in the previous section, the methodol- 
ogy to quantify and measure the sustainability performance of products [the product 
sustainability index ( ProdSI )] and processes [process sustainability index ( ProcSI )] 
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in sustainable manufacturing practices is presented here. The hierarchical approach 
to identify product and process sustainability metrics incorporating the three key 
aspects of sustainable manufacturing and the complete sets of product and process 
sustainability metrics are presented. The process of normalizing, weighting, and 
aggregating these metrics to compute the ProdSI and ProcSI is also presented. 


Hierarchical Approach to ProdSI and ProcSI Development 

While generic definitions of sustainable products and sustainable manufacturing 
provide general guidance in identifying factors or elements that can evaluate 
product sustainability, their identification and quantitative evaluation for assessing 
sustainability performance of a specific product or manufacturing process are 
challenging and complex tasks (Fiksel et al. 1998). This is due to the wide range 
of aspects to be considered in evaluating product and process sustainability, the 
difficulties in quantifying many sustainability aspects (especially the social 
aspects), and the inherently heterogeneous nature of the data needed for sustain- 
ability evaluation which makes it difficult to combine them for an overall 
assessment. 

At this point, it is important to make a distinction must be made between 
performance metrics and indicators. While an indicator provides qualitative or 
quantitative information about performance of a specific phenomenon, environ- 
ment, or area, a performance metric provides a quantitative measure that is required 
for overall product sustainability assessment. The approach presented here con- 
siders sustainability performance metrics which must be measurable, relevant and 
comprehensive, understandable and meaningful, manageable, reliable, accessible, 
and measurable in a timely manner (Feng et al. 2010). 

The proposed ProdSI and ProcSI methodologies have a hierarchical structure 
that breaks product and process sustainability down to individual metrics through a 
five- and four-level process, respectively. These levels are index {ProdSI /ProcSI), 
sub-index, cluster, sub-cluster (for ProdSI only), and individual metric, as 
presented in Fig. 6. 

To address the challenges of defining product sustainability metrics, a top-down 
approach was followed. This hierarchical approach ensures that the individual 
metrics are comprehensive and cover all major aspects of product sustainability. 
The identified sustainability metrics are however generic and can be applied to any 
type of product or manufacturing process by customization. The five-level hierar- 
chical structure developed can be described as follows: 

• ProdSI - the overall aggregated product sustainability performance index 

• Sub-index - the three aspects of the TBL: economy, environment, and society 

• Cluster - major elements or factors of product sustainability within each of the 
three TBL categories 

• Sub-cluster - decomposition of clusters to more specific aspects of product 
sustainability 
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Fig. 6 Hierarchical structure 
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• Individual metric - a quantifiable and measurable attribute or property related 
to a single parameter or indicator in each sub-cluster that is measured through 
out the total product life-cycle 

The evaluation of the overall ProdSI and ProcSI is done using a bottom-up 
approach to aggregate the individual metrics to provide an overall product 
and process sustainability assessment. The ProdSI and ProcSI are calculated 
through a series of operational steps including data collection for individual 
metrics measurement and data collection and data normalization, weighting, 
and aggregation. The product and process sustainability metrics are 
presented next. 


Product Sustainability Metrics 

By expanding the six previously identified major product sustainability elements of 
environmental impact, societal impact, functionality, resource utilization and econ- 
omy, manufacturability, and recyclability and remanufacturability, a more compre- 
hensive set of 13 clusters was developed for product sustainability evaluation. 
These clusters are categorized under the three categories of the TBL: economy, 
environment, and society, as illustrated in Fig. 7. 

The complete set of individual product metrics under the economy, society, and 
environment subclusters is presented in Tables 1-3, respectively. These metrics can 
be customized for the product being evaluated, considering its functionality and 
performance. Further, in order to comprehensively evaluate the product sustain- 
ability, the measurement of each of these metrics must be made across the four 
product life-cycle stages (pre-manufacturing, manufacturing, use, and post-use), 
depending on their applicability. 
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Process Sustainability Metrics 

Following the criteria mentioned before, a comprehensive set of metrics for 
manufacturing process sustainability assessment was identified. The chosen metrics 
are categorized under six clusters that represent the process-related elements of 
sustainable manufacturing: manufacturing cost, energy consumption, waste man- 
agement, environmental impact, operator safety, and personnel health. A descrip- 
tion of the complete set of individual process metrics under each cluster is presented 
in Tables 4-9. Similarly, these metrics can be customized for a specific manufactur- 
ing process. 


ProdSI and ProcSI Evaluation Process 

The product and process sustainability metrics provide individual measures, but do 
not directly provide an overall assessment of product or process sustainability. The 
proposed ProdSI and ProcSI methodologies aggregate the metrics to provide an 
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Table 1 Economy sub-cluster: individual metrics 


Cluster 

Subcluster 

Individual metrics 

Initial investment 

Capital cost 

Equipment cost 
Facility cost 


R&D cost 

R&D cost 


Employee training 

Initial training cost 

Direct/indirect costs and 

Labor cost 

Labor cost 

overheads 

Materials 

Material cost 
Packaging cost 


Energy 

Energy cost 


Logistics 

Transportation cost 
Warehouse cost 


Product operational 

Recovery cost 


cost 

Product ownership cost 
Average disassembly cost 


Legal costs 

Environmental regulations 
violation 

Other costs related to legal issues 

Benefits and losses 

Market value 

Sales price 
Profit 


Quality losses 

Defective/retumed products loss 
Warranty cost 


overall product and process sustainability assessment, respectively. The details of 
data normalization, weighting, and aggregation methods are presented in the 
following subsections. 

Normalization 

Due to the heterogeneous nature of the sustainability metrics, the physical mea- 
surements of individual metrics cannot be directly aggregated. Therefore, all the 
individual metrics must be converted to a single normalized scale. In the ProdSI 
and ProcSI methodology, the individual metrics are normalized to a single scale 
from 0 to 10, where 0 represents the worst case and 10 represents the best case. 
Generally, a score of 0-4 would indicate a “poor” status, “average” with a score of 
4-6, “good” with a score of 6-8, and “excellent” with a score of 8-10. 

A single standard normalization method that can be applied for all metrics does 
not exist; the normalization of each individual metric is case specific and depends 
on several factors including the unit of measure, the limits of the measured value, 
whether the individual metric is positively or negatively correlated with overall 
product sustainability, and the existence of benchmarks or standard reference points 
for normalization. Establishing reference points is essential to normalize the dif- 
ferent units of measurement. Benchmarks can be set up based on earlier generations 
of the same product, standards, regulations, or expert opinions. The normalization 
can be done using a continuous scale or a discrete scale from 0 to 10. 
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Table 2 Society sub-cluster: individual metrics 


Cluster 

Subcluster 

Individual metrics 

Product quality and durability 

Product repair and 

Reparability 


maintenance 

Maintainability 


Product reliability 

Failure rate 
Life span 


Return and recall 

Return rate for product 
defects 

Product recall rate 

Functional performance 

Major product 
specifications 

(Product specific) 


Product customizability 

(Product specific) 


Product functional 
Effectiveness 

(Product specific) 


Ease of operation 

(Product specific) 

Product EOL management 

Ease of EOL activities 

Ease of EOL product 
disposal for the user 

Ease of EOL product 
recovery 


Product EOL societal 
impact 

Product EOL societal impact 

Product safety and health impact 

Safety 

Injury rate 

Product safety specifications 


Health 

(Product specific) 

Product societal impact regulations 

Product EOL regulation 

Product EOL regulation 

and certification 

compliance 

compliance 


Product EOL 
certification 

Product EOL certification 


Weighting and Aggregation 

Weightings are assigned for each element in the ProdSI and ProcSI (individual 
metrics, subclusters, clusters, and subindices) to balance the normalized values 
based on their relative importance or level of impact. Typically, a higher weighting 
is assigned to elements with a higher importance or impact level which must be 
determined considering many aspects such as regional variations in legislation, 
expert opinions, monetary valuation, and consumer value requirements. Weighting 
is a very sensitive process, and it affects the accuracy of the sustainability assess- 
ment. Therefore, it is important to be objective when assigning weights to the 
elements when computing the ProdSI and ProcSI. 

Currently, there are no universal or standard weighting methods that can accu- 
rately capture the relative importance of sustainability metrics or indicators. As 
such, the most suitable of several weighting methods can be applied. The first is 
assigning equal weighting, which is considered simple and transparent. However, 
the shortcoming of equal weighting is that it does not truly reflect the relative 
importance of the aggregated elements. The second is soliciting experts’ opinions 
using surveys and questionnaires. Once the surveys and questionnaires are 
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Table 3 Environment sub-cluster: individual metrics 


Cluster 

Sub-cluster 

Individual metrics 

Material use and efficiency 

Product material content 

Total product material use 

Recycled material ratio of 
product 

Mass of restricted/ hazardous 
material 

Total packaging material use 

Recycled packaging material 
ratio 


Material utilization 

Material utilization 


Regulations and certification 

Regulation compliance 
Certification 

Energy use and efficiency 

Energy from renewable sources 

Solar 

Hydro 

Wind 

Other 


Energy from nonrenewable 

Coal 


sources 

Petroleum 
Nuclear 
Natural Gas 
Other 


Energy regulations/certification 

Energy regulation compliance 
Energy certification 


Energy efficiency 

Energy efficiency 

Other resources use and 

Water use 

Mass of water used 

efficiency 

Recycled water use 

Ratio of recycled water used 


Other natural resources 

Other natural resources used 


Natural resource regulations/ 

Natural resource regulation 


certification 

compliance 

Natural resource certification 

Wastes & emissions 

Gaseous emissions 

Greenhouse gases 
Hazardous gaseous emissions 


Solid waste 

Mass of solid waste landfilled 

Reused/recycled hazardous 
waste 

Disposed hazardous solid waste 


Liquid waste 

To hydrosphere 
Reused/recycled liquid waste 
Disposed hazardous liquid waste 


Other waste & emissions 

Heat 

Noise 

Light 

Radioactive emissions 


Waste management regulations/ 

Waste management regulation 


certification 

compliance 

Waste management certification 


0 continued ) 
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Table 3 (continued) 


Cluster 

Sub-cluster 

Individual metrics 

Product end-of-life 

EOL product/material Recovery 

Ease of product disposability 
Product disassemblability 
Ratio of EOL product recovered 


EOL product 

Product reusability 


Reuse 

Ratio of EOL product reused 


EOL product remanufacturing 

Product remanufacturability 

Product redesign 

Ratio of product remanufactured 


EOL product recycling 

Product recyclability 

Ratio of product/material 
recycled 


Product EOL regulations/ 

EOL regulation compliance 


certification 

EOL certification 


Table 4 Manufacturing cost cluster with its sub-clusters and individual metrics 


Subcluster 

Individual metric 

Measurement method 

Direct 

cost 

Labor cost 

Total employee payment to machining positions/total 
number of product units made 


Operation energy 
cost 

Total cost for energy consumed in machine operation/ 
total number of product units made 


Consumable -related 
cost 

Total cost of consumables/total number of product units 
made 


Cutting tool-related 
cost 

(Total cost for purchasing new tools + cost for regrinding 
used tools - cost of recycling used tools)/total number of 
product units made 


Packaging-related 

cost 

(Total cost for purchasing new packages + used package 
treatment fee)/total number of product units made 


Scrap cost 

Total cost of scrapped product units/total number of 
product units made 


Cost of by-product 
treatment 

Total cost for by-product treatment (which is not covered 
above)/total number of product units made 


Training cost 

Total training cost/number of employees 

Indirect 

cost 

Indirect labor cost 

Total indirect labor cost/total number of product units 
made 


Maintenance cost 

Total cost for equipment maintenance/total number of 
product units made 


Audit and legal cost 

Total cost of audits, legal services, and litigation/total 
number of product units made 


Cost of PPE and 
safety investment 

Total cost of PPE and equipment/total number of product 
units made 

Capital 

cost 

Cost of depreciation 

Total depreciation of storage and fixed facilities/total 
number of product units made 


Cost of jigs/fixtures 
investment 

Total cost of jigs and fixtures/total number of product 
units made 
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Table 5 Energy consumption cluster with its sub-clusters and individual metrics 


Subcluster 

Individual metric 

Measurement method 

Production 

In-line electricity 
consumption 

Total electricity consumption of all units and 
equipment in the line/total number of product 
units made 

In-line fossil fuel 
consumption 

Total fossil fuel consumption of all units and 
equipment in the line/total number of product 
units made 

Transportation 

Transportation electricity 
consumption 

Total energy consumption of all 
transportation equipment in the beginning or 
end of the line/total number of product units 
made 

Transportation fossil fuel 
consumption 

Total fossil fuel consumption of all 
transportation equipment in the beginning or 
end of the line/total number of product units 
made 

Facilities 

Electricity consumption on 
maintaining facility 
environment 

Total energy consumption of all 
environmental maintenance units and 
equipment/total number of product units 
made 

Fossil fuel consumption on 
maintaining facility 
environment 

Total energy consumption of all 
environmental maintenance units and 
equipment/total number of product units 
made 

Production 
supply system 

Electricity consumption of 
concentrated supply system 

Total energy consumption of all supply 
system equipment/total number of product 
units made 

Fossil fuel consumption of 
concentrated supply system 

Total fossil fuel consumption of all supply 
system equipment/total number of product 
units made 

Maintenance 

Electricity consumption on 
maintenance 

Total electricity consumption for 
maintenance operations/total number of 
product units made 

Fossil fuel consumption on 
maintenance 

Total fossil fuel consumption for 
maintenance operations/total number of 
product units made 

Efficiency 

Energy efficiency 

Useful equivalent energy output from the 
process/total energy input 

Renewable 

energy 

Percentage of renewable 
energy used 

Total consumption of renewable energy/total 
energy consumption 


collected, weighting can be assigned by simply averaging the weights assigned by 
different experts or by following more complex statistical mechanisms such as the 
analytic hierarchy process (AHP), previously developed for product sustainability 
evaluation (Gupta et al. 2010). 

Once the weights are assigned, the normalized metrics are aggregated to calcu- 
late the scores for the subclusters, clusters, subindices, and ProdSI/ProcSI. The 
aggregation follows a bottom-up approach as presented in (Eqs. 2-4). 
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Table 6 Waste management cluster with its sub-clusters and individual metrics 


Subcluster 

Individual metric 

Measurement method 

Consumables 

Ratio of consumables 
recovered 

Mass of recovered consumables/total mass of used 
consumables 


Ratio of consumables 
reused 

Mass of reused consumables/total mass of used 
consumables 


Ratio of consumables 
recycled 

Mass of recycled consumables/total mass of used 
consumables 


Mass of disposed used 
consumables 

Mass of used consumables going to landfill/total 
number of product units made 

Packaging 

Ratio of used 
packaging recovered 

Mass of recovered packaging/total mass of used 
packaging material 


Ratio of used 
packaging reused 

Mass of reused packaging/total mass of used 
packaging material 


Ratio of used 
packaging recycled 

Mass of recycled packaging/total mass of used 
packaging material 


Mass of disposed used 
packaging 

Mass of used packaging going to the landfill/total 
number of product units made 

Used raw 
material (chips) 

Ratio of used raw 
material recovered 

Mass of used raw material recovered/total mass of 
used raw material 


Ratio of used raw 
material reused 

Mass of used raw material reused/total mass of 
used raw material 


Ratio of used raw 
material recycled 

Mass of used raw material recycled/total mass of 
used raw material 


Mass of disposed used 
raw material 

Mass of used raw material going to landfill/total 
number of product units made 

Scrap parts 

Ratio of scrap parts 
recovered 

Mass of scrap part recovered/total mass of scrap 
parts 


Ratio of scrap parts 
remanufactured 

Mass of remanufactured scrap part/total mass of 
scrap parts 


Ratio of scrap parts 
recycled 

Mass of recycled scrap part/total mass of scrap 
parts 


Mass of disposed scrap 
parts 

Mass of scrap part going to the landfill/total 
number of products made 


Sub-duster Level 



where 

Z// is the mth individual metric under sub-cluster Y fj 

v m ./ 

is the /th sub-cluster under /th cluster X, 

Wij m is the weighting factor for individual metrics Z,y 

/i is the number of individual metrics under the sub-cluster 7 z y 
Njj is the normalized constant for the mth individual metrics under 
sub-cluster 7 z y 



93 Sustainable Value Creation in Manufacturing at Product and Process. . . 


3361 


Table 7 Environmental impact cluster with its sub-clusters and individual metrics 


Subcluster 

Individual metric 

Measurement method 

Energy 

GHG emission from energy 
consumption of the line 

Total energy consumption/total number of 
product units made 


Percentage of renewable 
energy used 

Total renewable energy used/total energy 
consumption 

Water 

Total water consumption of 
the line 

Total water consumption/total number of product 
units made 

Restricted 

material 

Mass of restricted materials 
in disposed consumables 

Mass of restricted materials in disposed 
consumables/total number of product units made 


Mass of restricted material in 
disposed packaging 

Mass of restricted material in used packaging/ 
total number of product units made 


Mass of restricted material in 
disposed raw materials 

Mass of restricted materials in raw material 
going to landfill/total number of product units 
made 


Mass of restricted material in 
scrap parts going to landfill 

Mass of restricted material in scrap parts going to 
landfill/total number of product units made 

Disposed 

waste 

Mass of non-collected solid 
wastes 

Total mass of non-collected solid wastes/total 
number of product units made 


Mass of non-collected liquid 
wastes 

Total mass of non-collected liquid wastes/total 
number of product units made 


Mass of non-collected 
gaseous wastes 

Total mass of non-collected gaseous wastes/total 
number of product units made 


Mass of solid wastes going to 
landfill 

Total mass of solid wastes going to landfill/total 
number of product units made 


Mass of liquid waste 
disposed 

Total mass of liquid wastes going to landfill/total 
number of product units made 

Noise 

pollution 

Noise level outside the plant 

Noise level measured outside the plant 

Heat 

Heat generation 

Heat generated by the manufacturing line/total 
number of product units made 


Cluster Level 




7=1 


where 

Xj is the ith cluster 

w t j is the weighting factor for the sub-cluster E /; 
k is the number of sub-clusters under the cluster X, 


Subindex Level (for Product Only) 

Qp = Yj w ' X i 

1=1 



( 3 ) 
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Table 8 Operator safety cluster with its sub-clusters and individual metrics 


Subcluster 

Individual metric 

Measurement method 

W orking 
environment 
conditions (safety) 

Exposure to corrosive/ 
toxic chemicals 

Number of points with corrosive or toxic 
chemicals/total number of employees (break 
down to chemical list) 

Exposure to high- 
temperature surfaces 

Total number of high-temperature points 
exposed to the operator/total number of 
employees 

Exposure to high-speed 
components and 
splashes 

Total number of points with high-speed 
components exposed to the operator/total 
number of employees 

Exposure to high- 
voltage electricity 

Total number of points with high-voltage 
electricity exposed to the operator/total 
number of employees 

Other threatening 
exposure 

Total other exposed points with hazardous 
effects (splash, sparks, high-energy laser, 
etc.)/total number of employees 

Injuries 

Injury rate 

Total injuries/total number of product units 
made 


Table 9 Personnel health cluster with its sub-clusters and individual metrics 


Subcluster 

Individual metric 

Measurement method 

Working environment 
conditions (health) 

Chemical 

concentration 

Chemical concentration in the working 
environment (break down to the chemical list) 

Mist/dust level 

Microparticle concentration in the working 
environment 

Noise exposure 

Noise level in the working environment 

T emperature 

Temperature level in the working environment 

Other hazardous 
exposure 

Hazardous exposure level in the working 
environment 

PLI 

Physical load 
index 

Measured physical load index (Hollman 
et al. 1999) 

Absentee rate 

Health-related 
absenteeism rate 

Health-related absenteeism rate 


where 

Q P is the pth subindex 

w t is the weighting factor for the cluster X f 

s is the number of clusters under sub-index Q P 

Prod SI Level 


Prod SI = Y. vi ),Qp 

p = i 


(4) 
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where 

w p is the weighting factor for the sub-index Q P 
t is the number of sub-index 

The normalization and weighting processes are usually associated with subjec- 
tive judgments (Singh et al. 2012). This affects the sensitivity and accuracy of the 
product sustainability assessment. Sensitivity analysis and expert evaluations can 
help reduce the effects of the subjectivity of normalization and weighting and 
increase the accuracy of the assessment. 


Case Study 

Case Study Overview 

The case study aims at comprehensively identifying and developing the metrics 
needed for evaluating sustainability performance of manufactured products and their 
manufacturing processes. The objectives of the case study are summarized as follows: 

Manufacturer 

• Evaluation of sustainability rating for selected component and process 

• Identification of areas with potential for sustainability improvements 

Academic Team 

• Validation of the initial set of metrics for product and process sustainability 

• Identifying areas where current metrics are lacking and expanding to a compre- 
hensive set of metrics for sustainability assessment 

The product under investigation is an automobile power train component. The 
manufacturing process flow chart is shown in Fig. 8. 

Not all the data were provided by manufacturer, either due to the lack of 
measurement or due to confidentiality, and were estimated based on information 
available in public sources like automotive industry reports (Dreher et al. 2009; 
Environmental Affairs Co-ordination Office 2002; Schmidt and Taylor 2006; 
Toyota North America 2010), government agencies, and scientific literature. 


Case Study Results 

Product Sustainability Assessment 

This section presents the results from applying the total life-cycle-based product 
sustainability assessment methodology to the automobile component and the 
resulting ProdSI. Due to lack of information, most data collected for the automobile 
component relates to its manufacturing stage. Information for other stages are 
extracted from public resources as mentioned in the previous section and then 
adapted to the component. 
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ig. 8 Process flow chart for the component manufacturing line 
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The results of sustainability measurements and the corresponding scores are 
presented and discussed next based on each cluster of the metric elements: econ- 
omy, environment, and society. Then, a summary of product sustainability assess- 
ment is presented. 

Economy: Cost 

Because cost data were withheld by the manufacturer, values estimated based on 
public sources or estimation are used in this category. 

Therefore, the costs related to labor and to training are based on average wage 
estimation. The information on labor hours consumed for each of the tasks and 
public data of average automobile worker pay rates are known. The energy cost is 
estimated by multiplying the amount of energy consumed and price per unit of 
energy. The costs for materials, transportation, and warehousing are based on 
estimates as well. Costs for operation and maintenance/repair are based on 
published data about a medium-size sedan (IntelliChoice 2011). 

All cost measurements are in dollars spent per unit of product manufactured and 
are aggregated to the subcluster level without normalization. The normalized scores 
are given at the subcluster level and equal weights are assigned to these scores. 
Metrics with no data, namely, capital and legal cost, were not included in the 
calculation. A final score of 7.53 has been calculated for the cluster Cost based 
on equal weight. 

Economy: Innovation 

In this cluster, the material consumption efficiency is based on in-line measure- 
ment. The other two individual metrics - R&D cost and average disassembly cost - 
are left out due to lack of data. Linear normalization is applied to the material 
consumption efficiency. Since the material consumption efficiency is 79.1 % in 
general, a final score of 7.91 is given to the cluster Innovation. 

Economy: Profitability 

In this cluster, the profit is based on published data for a medium-size sedan. The 
rest of the metrics are not considered, as data for the reuse, remanufacturing, or 
recycling of the component is not available. 

Economy: Product Quality 

In this cluster, 12 years is considered as a benchmark for the average life span of a 
vehicle provided by the U.S. Department of Transportation. Reliability score of 
6.09 is given according to market ranking. Data for the other metrics within this 
cluster were not provided. 

Environment: Material Use and Efficiency 

In this cluster, the scores of both recycled material use ratio and recycled packaging 
material use ratio are rated on a scale from 0 to 10 for 0 % to 100 %, respectively. A 
score of 10 is given based on the use of 100 % recycled material, according to the 
supplier. Also, there are no restricted materials used to make the component; thus, a 
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full score is given to this metric. The mass of material used measured was the 
average mass of the raw workpiece. 

Since there are no plastic materials integrated into the product, no data are 
given to the associated individual metric. It should be noted that the mass of 
packaging material is estimated and the mass of transportation plastic tray is 
unknown. 

Environment: Energy Use and Efficiency 

In this cluster, the data for manufacturing energy use is from a direct measurement, 
in kilowatt hour per unit (kWh/unit) of product manufactured. It should be noted 
that only the manufacturing stage is considered for this metric. Data for the product 
energy use (e.g., use stage of life-cycle) is allocated based on the total energy use 
across the four life-cycle stages of the automobile and the weight for the component 
as a ratio of the overall weight of the automobile. The energy reduction ratio is from 
published reports, considering overall manufacturing operations. Again, product 
recycling, reuse, and remanufacturing are not properly traced by this case company, 
and the associated information is unknown. 

The score of 6.64 is given to manufacturing energy use and product energy use 
as a reference value. The final score is aggregated and generated from those three 
individual metrics. 

Environment: Water Use and Efficiency 

In this cluster, all the data are for a medium-size sedan based on published reports. 
Linear normalization is applied to the metric of recycled water use. The three 
metrics equally share the weight leading to a final score of 7.30. 

Environment: Residues 

In this cluster, full scores (10 in score column) are given to the metrics solid 
waste stream and liquid waste stream as there is no waste disposed (0 in mea- 
surement column). Furthermore, both landfill reduction and environmental regu- 
lation compliance receive full credit because the research team was informed that 
no waste is sent to landfill and no environmental regulations have been violated, 
respectively. The measurement for the gaseous emissions during the use stage 
is calculated from the total gaseous emission for a medium-size sedan provided 
by the UK Department for Transport (United Kingdom Department for 
Transport 2013). 

Since the post-use data are unknown, the metrics for product recycling, reuse, 
and remanufacturing are not aggregated to the final score. A final score of 9.33 is 
calculated without considering the end-of-life (EOL) metrics, which are left blank. 
To have a more sound and comprehensive product sustainability assessment, 
considering product EOL is a necessity. 

Environment: Product End-of-Life Management 

All measurements in this cluster come from published reports. It must be empha- 
sized that low scores are given due to no cosideration of the product EOL. 
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Table 10 Score aggregation under the sub-index of economy 


SUB-CLUSTER 

CLUSTER 

SUB-INDEX 

Sub-Cluster 

Value 

Weight 

Cluster 

Value 

Weight 

Sub-Index 

Value 

Weight 

Manufacturing 

Cost 

6.13 

0.33 

Cost 

7.53 

0.25 

Economy 

6.75 

0.33 

Use Cost 

6.46 

0.33 

Post-management 

Cost 

10.00 

0.33 

Product 

Development 

7.91 

1.00 

Innovation 

7.91 

0.25 

Profit 

5.49 

1.00 

Profitability 

5.49 

0.25 

Product Life 

6.09 

1.00 

Product Quality 

6.09 

0.25 

Product Quality 
Losses 

— 

— 





Color Coding 

0 5 10 


Society: Education and Customer Satisfaction 

Data related to customer satisfaction and product repairs and returns are not 
available. Therefore, the final score of 7.70 is based on the metric of employee 
training and development. 

Society: Product End-of-Life Management and Product Safety and Societal 
Well-Being 

Most data are unavailable as they relate to the product EOL stage, for which the 
company did not have information readily available. There are no product 
processing injuries for the manufacturing line, and a full score is given to 
the corresponding metric and the perfect score of 10 is generated solely based 
on that. 

ProdSI 

Economic subindex sustainability performance is visually presented by the color- 
coded table shown in Table 10. 

As most data are extracted from public sources, there is an opportunity to 
improve the accuracy of the ProdSI by using actual data related to the component 
and the automobiles bearing the component. 

Environmental sub-index sustainability performance is visually presented by the 
color-coded table shown in Table 11. 

Results indicate that the product EOL management is the weakest subcluster that 
needs further consideration and improvement. 

Societal sub-index sustainability performance is visually presented by the color- 
coded table shown in Table 12. 

The three sub-indices, combined with equal weighting, give a final score for 
the ProdSI that is 7.59 (out of a full score of 10). The final product sustainability 
index ( ProdSI ) in Table 13 once again reveals that the element of product end-of- 
life management is the area that requires the most efforts for improvement. 
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Tablel 1 Score aggregation under the sub-index of environment 


SUB-CLUSTER 

CLUSTER 

SUB-INDEX 

Sub-Cluster 

Value 

Weight 

Cluster 

Value 

Weight 

Sub-Index 

Value 

Weight 

Product 

Materials 

8.06 

0.50 

Material Use and 
Efficiency 

8.83 

0.20 

Environment 

7.15 

0.33 

Packaging 

Materials 

9.55 

0.50 


Manufacturing 
Energy Use 

7.37 

0.50 

Energy Use and 
Efficiency 

6.64 

0.20 

Use-phase 
Energy Use 

5.90 

0.50 


Post-use 
Phase Energy 
Use 




water Use 
and Efficiency 

7.30 

1.00 

Water Use and 
Efficiency 

7.30 

0.20 

Residues 

10.00 

0.33 

Residues 

9.33 

0.20 

Residue 

Reduction 

8.00 

0.33 


Post-use 

Phase 

Emissions 




Compliance 

10.00 

0.33 


Product End- 
of-Life 

Management 

2.93 

1.00 

Product End-of-Life 
Management 

2.93 

0.20 


Table 12 Score aggregation under the sub-index of society 


SUB-CLUSTER 

CLUSTER 

SUB-INDEX 

Sub- 

Cluster 

Value 

Weight 

Cluster 

Value 

Weight 

Sub- 

Index 

Value 

Weight 

Education 

7.70 

1.00 

Education 

7.70 

0.50 




Customer 

Satisfaction 









Product 

Repairs 

and 

Returns 



Customer Satisfaction 



Society 

8.85 

0.33 

Product 

Recovery 

Incentives 



Product End-of-Life 
Management 



Product 
Safety and 
Societal 
Well-being 

10.00 

1.00 

Product 

Safety and Societal 
Well-being 

10.00 

0.50 





For better visualization, the results can also be represented in the form of spider 
charts, as shown in Figs. 9 and 10 below. The discontinuity in the charts is due to 
nonavailability of data for those subclusters/clusters. 

Process Sustainability Assessment 
Machining Cost 

The research team did not have access to most of the cost data related to the component 
studied. The labor cost and training cost are estimated based on the number of labor 


93 Sustainable Value Creation in Manufacturing at Product and Process. . . 


3369 


Table 13 Score aggregation for ProdSI 


CLUSTER 

SUB-INDEX 

ProdSI 

Cluster 

Value 

Weight 

Sub-Index 

Value 

Weight 

ProdSI 

Cost 

7.53 

0.25 

Economy 

6.75 

0.33 

7.59 

Innovation 

7.91 

0.25 

Profitability 

5.49 

0.25 

Product Quality 

6.09 

0.25 

Material Use and Efficiency 

8.83 

0.20 

Environment 

7.15 

0.33 

Energy Use and Efficiency 

6.64 

0.20 

Water Use and Efficiency 

7.30 

0.20 

Residues 

9.33 

0.20 

Product End-of-Life 
Management 

2.93 

0.20 

Education 

7.70 

0.33 

Society 

8.85 

0.33 

Customer Satisfaction 



Product End-of-Life 
Management 



Product Safety and 
Societal Well-being 

10.00 

0.33 


Sub-clusters 



Society 

8.85 


Product 
Safety and Societal- 


Manufacturing Cost 


Use Cost 


Product Recovery Incentives 


Product Repairs and Returns 


Post-management Cost 


Product Development 


Economy 

6.75 


Customer Satisfaction 


Profit 


Education 


Product Life 


Product End-of-Life 
Management 


Product Quality Losses 


Compliance 


Product Materials 


Post-use Phase Emissions 


Packaging Materials 


Residue Reduction 


Manufacturing Energy Use 


Use-phase Energy Use 
Post-use Phase Energy Use 


Environment 

7.15 


Residues 
water Use and Efficiency 


Fig. 9 Spider chart for sub-clusters within ProdSI 


hours (known) consumed for each of the tasks and public data on average automobile 
worker pay rate. The operation energy cost and coolant-related costs are estimated by 
multiplying the amount consumed and the unit price of purchasing. Scrap loss is 
calculated based on number of scrap parts made and raw material price. 
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Clusters 


Cost 



Product i 


Society 

8.85 


Residues 


Water Use and 
Efficiency 


Environment 

7.15 


Fig. 10 Spider chart for clusters within ProdSI 

All cost measurements are in dollars spent per unit of product manufactured. 
Thus, they are aggregated without any weighting. The normalization takes place at 
the subcluster level. Since financial data were not available, a score of 7 was 
assumed for the machining cost cluster. Benchmark data can be applied for more 
accurate normalization. 

Energy Consumption 

Most of the energy consumption data are directly measured in kilowatt hours per 
unit (kWh/unit) of product manufactured. The data are not normalized until the 
cluster level. A reference energy consumption amount was set for manufacturing 
one piece of the component (in $), based on the total amount of energy consumed to 
manufacture a car as described in published reports. Based on the reference points 
selected, the energy consumption data are normalized and then aggregated to 
generate the final score of this cluster. 

Waste Management 

The major solid waste streams of the manufacturing line are the machining chips 
and scrapped parts. While a significant coolant losses were present the exact coolant 
loss streams were not identified because the coolant system was shared with other 
manufacturing lines. 

The company adopts a “zero landfill” policy which makes for a good recovery 
and recycling practice. Therefore, full scores are given for the corresponding 
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individual metrics. Benchmarks are needed for a more objective scoring of reuse 
and waste generation metrics. 

Environmental Impact 

GHG emissions are tracked in terms of the carbon content for the electricity 
consumed. The only stream of restricted material use related to the coolant loss. 
Solid waste is tracked, but the liquid waste and gaseous waste streams are not 
identified. 

“Zero landfill” policy results in full scores for all the corresponding individual 
metrics. A reference for the water usage is set by value allocation based on the total 
amount of water consumed to manufacture a car as reported in published reports. 
The result shows that manufacturing the component is not a heavy water- 
consuming process. The major problem in this cluster is the high greenhouse gas 
emission (GHG), due to the fact that the major source of electrical power in the 
local electricity grid being coal. The plant does not utilize any form of renewable 
energy. The reference point is given by monetary value allocation based on the total 
amount of GHG emission during the manufacturing of a car as stated in published 
reports. 

Personal Health 

Measurements concerning the working conditions are tracked, based on the bench- 
marks set by safety regulations from agencies or organizations including EPA, 
OSHA, and NIOSH (NIOSH 1992, 1998a, b). The records of operator absenteeism 
were referred to and no health-related absenteeism was found. 

Noise at the manufacturing site is close to the threshold limit of 85 dB, 
while hearing protection is not enforced. In this case, the reduction effect of 
the personal protective equipment (PPE) is not considered; thus, a poor score is 
given. Aside from that, it was unexpected to see from the physical load index 
(PLI) questionnaire results that a highly automated manufacturing process still 
involves considerable physical stress to the operators. The operators often had to 
bend their bodies to lift heavy workpieces. It is likely that the bodyguards on the 
equipment are restrictive and limit the ability of operators to access workpieces 
and tools. 

Operator Safety 

All exposures to hazards are well shielded. According to the safety records of the 
line, no injuries occurred during the period of investigation. As a result, full scores 
are given to all the measurements within this cluster. 

ProcSI Results 

The aggregated scores for the ProcSI clusters are shown in Table 14. 

The results are also represented in the form of spider chart, as shown in Fig. 1 1 
below. 

The final score for the ProcSI is 7.61 out of 10. 
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Table 14 Score aggregation for ProcSI 


CLUSTER 

ProdSI 

CLUSTER 

VALUE 

WEIGHT 

VALUE 

Machining cost 

7.00 

0.17 

7.61 

Energy consumption 

4.73 

0.17 

Waste management 

7.98 

0.17 

Environmental Impact 

8.42 

0.17 

Personal health 

7.12 

0.17 

Operator safety 

10.00 

0.17 


Color Coding 

0 5 10 


Sub-clusters 


Machining 

cost 



Fig. 1 1 Spider chart for clusters within ProcSI 


Concluding Remarks for the Case Study 

The component under investigation and its manufacturing processes received a 
fairly good score, considering that a score of 8 is given indicating industry-leading 
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performance, based on the value creation evaluation, market behavior and customer 
impressions. However, there are some concerns that must be addressed. 

First, the uncertainty and difficulty of the assessment should be considered. Data 
collection was not easy, and many measurements lack detailed data from the 
manufacturer. Estimating from other sources and ignoring the influences of the 
metrics due to missing data can reduce the reliability of the evaluation made. This 
emphasizes the significance of data collection and management effort needed for a 
realistic evaluation. 

On the other hand, the scope of this objective sustainable value creation assess- 
ment has not been taken by current manufacturers, one indication is that, in this 
study the data provided focus on the manufacturing stage while the data in other 
life-cycle stages are inadequate. Here is an example of the resulting effects. The 
influence of energy consumption on the economic value creation might be rela- 
tively small due to the inexpensive local price of electricity. Under this situation, 
the manufacturer showed inferior performance in energy related metrics, especially 
when compared with their performance in other metrics. It implies that even the 
market-leading manufacturers have difficulties taking the ideas of sustainable value 
creation and sustainable manufacturing concepts into their applications, thus the 
full potential of sustainable value is not retained. 


Summary 

Sustainable development is driving corporations to shift from conventional busi- 
ness practices that focus on value creation for shareholders to sustainable business 
practices that focus on value creation for all stakeholders. Corporate contribution to 
sustainable development is known as sustainable value. Sustainable value can be 
divided to economic, environmental, and societal value, which is in line with the 
triple bottom-line concept introduced by Elkington (1998). 

This chapter presented the ProdSI and ProcSI methodologies for sustainability 
assessment at the product and process levels. The ProdSI and ProcSI methods are 
metrics-based and have ninety and seventy-five metrics categorized into thirteen 
and six clusters, respectively. The sustainability clusters for product and 
manufacturing processes were summaries of past research. These metrics were 
introduced and the process to normalize, weight, and aggregate the measurements 
for these metrics to compute the ProdSI and ProcSI was explained. Finally, the 
application of the methodology was demonstrated through a case study to evaluate 
ProdSI and ProcSI for an automotive component. During this case study, an 
automobile component and its manufacturing processes are taken into consider- 
ation, where the whole procedure of preliminary target setting, data collection and 
allocation for each of the metrics, normalization of measurements, weighting, and 
aggregation is demonstrated. Such a case study can be taken as an example of self- 
assessment for sustained improvement, and benchmarked comparison requires 
further effort at each of the steps in the procedure to generate universal standards/ 
baseline of evaluation. Future work will focus on applying the ProdSI and ProcSI 
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methodologies to refine and customize the individual metrics’ sets for specific 
industries and application. It will also address weighting and normalization by 
involving industry experts, exploring different weighting methods and performing 
sensitivity analysis to examine the effect on overall sustainability evaluation. 
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Abstract 

This chapter introduces a method for deciding optimal options in end-of-life 
(EoL) product recovery. It utilizes multiple factors on EoL product condition 
and EoL product recovery values for better decision making in the planning of 
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EoL product recovery for optimal eco-performance. These factors are measur- 
able and closely tied to the product characteristics. The merits of the method can 
be seen from three perspectives. Firstly, embedded information and resources of 
returned products are fed back to the product life cycle chain as a closed loop for 
continuous improvement in product design and manufacturing. Secondly, EoL 
product recovery options for reusability, remanufacturability, and recyclability 
can be optimally determined. And thirdly, it quantifies the benefits of incorpo- 
rating EoL product recovery into manufacturing processes in terms of 
manufacturing costs, material utilization, and energy consumption. A case 
study on a crankshaft from a refrigerator reciprocating compressor is presented 
to demonstrate the merits of the method. 


Introduction 

By the end of this year, there will be more than 100 million personal computers and 
more than 500 million mobile phones being disposed globally. Thanks to growth of 
middle-class population around the world, demand for industrial product has gained 
momentum throughout the world especially in emerging market. At the same time, 
the life span for these products has been reduced gradually over the years due to 
consumer behavior. For example, people in developed countries tend to change 
personal computer as fast as 4 years of average lifetime, while those in developing 
countries change in 5-6 years of average lifetime (Yu et al. 2010). 

According to the UNEP, 20-50 million tons of e- waste is generated each year. 
The amount is estimated to double in the next decade. Moreover, generation of 
e-waste is growing three times faster than any other type of municipal waste on 
global level. The rapid growth of e- waste in the last few decades is mainly because 
of prosperous growth in electronic and electrical industries and the fast advance- 
ment of technologies. Consequently, the quick growing development affects 
consumption habits. Life cycle of products gets shorter and results in escalating 
e- waste. This has resulted in replacing electrical and electronic equipment (EEE) 
products which still have a long life span. Furthermore, majority of the e- waste are 
ended up in landfills or incinerators. In the USA, about 75-80 % of e- waste ready 
for EoL management ended up in landfills (Kahhat et al. 2008). It has now become 
one of the fastest growing waste components in municipal solid waste stream, and it 
could be a source of hazardous waste that adds to environmental burden and human 
health risk (Gaidajis et al. 2010; Ongondo et al. 201 1; Terazono et al. 2006; Widmer 
et al. 2005; UNE Programme 2007a, b). To make things worse, 50 % more 
resources are being extracted than our planet can replenish today (Almond 
et al. 2012; UNE Protection 2012). 

What does human do with millions of obsolete products? Most of the obsolete 
products are being recycled in one way or another. However, the methods are 
unsystematic, not effective (in terms of energy and cost), and hazardous due to 
mishandling and end up as landfills. This is because the team who is responsible for 
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the recycling does not have the knowledge of the product and has no recycling 
process, no support of advance technology, no established recycling channel, and 
no connection with the manufacturer. All the reasons mentioned above are critical 
criteria for an effective recovery process which can retrieve values from the end-of- 
life (EoL) product. Therefore, regulatory and manufacturer have been racing to 
propose product and process to counter this trend, which the trend supports toward 
sustainable manufacturing. 

To save the resources and serve the planet better, an effective solution is 
proposed to predict the life cycle status of the product, determine the product 
condition, and design a management system to handle it at EoL stage. 


Definitions 

• EoL product condition is referring to the state or quality of the product at the 
point of return. Usually, there are few critical parameters which can indicate the 
product condition, and this has to be determined during the stage where the 
product is designed and manufactured by the manufacturer. For most mechanical 
parts, the remaining life can be estimated by comparing the date of manufactured 
and predicted life span modeled by the manufacturer from reliability test. The 
information such as wear-out life also gives hint to predict remaining useful life. 
This parameter shall be able to be measured accurately and precisely by certain 
equipment. Some manufacturers even design or customize this equipment in 
order to achieve certain efficiency when measuring the EoL product condition. 
The measured parameters could be used to simulate the EoL product condition 
by using certain model and assumptions. Then, user can categorize the product 
according to the product condition. 

The silver lining behind the cloud of problems are journals which develop 
methods to quantify the value of returned products - in the form of either 
recovery value or EoL value. In a case study on television sets, product and 
environmental costs (converted from carbon emissions) are summed to attain an 
overall product life cycle cost (PLCC). This value is then put through Weibull 
analysis by means of historical data. Ultimately, it allows users to compare 
between making a new product and reusing a returned piece (Anityasari 
et al. 2005). While the formulation is comprehensive, the method falls short in 
developing quantity-based comparison between new production and other 
recovery options like remanufacturing and recycling. 

• EoL value refers to the quality value at EoL phase and includes technical 
condition, remaining useful life, and material selling price. 

• Recovery value is the value gained from EoL treatment as compared to making 
new product. It includes cost savings, GHG emission savings, and technical 
condition savings. 

• Residue value is the remains value after recoverable value has been retrieved 
from EoL product. 
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• Product recovery is the process of restoring EoL product to a former and better 
state or condition. A product that can recover certain range of value at EoL stage 
is called product recovery value. 

• Product recovery cost is the cost required to recover EoL product, and this cost 
varies depending on the product condition. Both the product recovery value and 
product recovery cost are determined to support product recovery decision. 

• Product recovery decision is the decision-making process of selecting the best 
recovery option. The recovery options are, namely, reuse, remanufacture, recy- 
cle, and dispose (incinerate). 


End-of-Life Product Recovery 


End-of-life (EoL) product recovery enables an organization to help reduce the 
environmental impacts of their products while maintaining profit margin. Some 
common ways of treating EoL product are landfill and recycling. However, these 
recovery treatments are more for short-term solution because they achieve 
low-value recovery efficiency. Recovery treatments such as landfill and recycle 
are usually adopted by an organization who intends to solve the EoL product 
without early planning during the product design or manufacture. Therefore, it is 
proven that high-efficiency product recovery can be achieved when EoL product 
treatment is being considered or planned during the product development stage. In 
consequence, product manufacturer has to be involved starting from the birth of 
the product till EoL stage. In other term, it is called design for EoL product 
recovery. 

In order to design for EoL product recovery, it is important to understand the 
possible types of EoL products, which can be categorized according to the product 
condition. Based on the EoL product condition, one can determine the product 
recovery options appropriately. 

The objective of this topic is to: 


• Describe the possible types of EoL products 

• Evaluate the product recovery options 

• Identify new product information to support product recovery types of EoL 
product return 


Classification of EoL Product Condition 

Returning of EoL products is acquired from production rejects, supply chain return, 
end-of-lease return, warranty return, and consumer return: 

i. Production rejects are the defective products that might be caused by 
manufacturing process or assembly errors. Thereby, these erroneous parts are 
not suitable for further assembly into product. 
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Table 1 EoL product 
condition 


Types of return 

Product condition 

Production reject 

Unused (with error) 

Supply chain return 

Unused (new) 

End-of-lease return 

Predictable (inaccurate) 

Warranty return 

Predictable 

Consumer return 

Unpredictable 


ii. Supply chain return is the unsold products returned from distributor or retailer. 
It might cause product obsolescence due to excess purchase or shipment 
because of errors in sales estimation. 

iii. End-of-lease return refers to the returning of leased product when the lease 
schedules are expiring. At the end of the leasing period, the old equipment is 
often replaced by a new unit and usually comes with maintenance service in the 
contract. 

iv. Warranty return refers to the return of products by consumers due to defects in 
material and workmanship or fail units within a certain period of time specified 
by the manufacturer or retailer. 

v. Consumer return mainly refers to the end of product usage. Consumers might 
return the old unit as trade-in when they purchase a new unit for replacement. In 
other case, consumer sells it as raw material to the third-party waste recycler. 

Product condition of the EoL product is classified based on the types of return. 

The products are mainly categorized in unused, predictable, and unpredictable 

status as shown in Table 1. 

i. Unused (with error ) product condition is typically caused by inconsistent 
manufacturing process that resulted in inaccurate part dimension. The defective 
unused product could be also due to mistake in assembly or destructive 
dismantling when the product performance is rejected as a whole. 

ia. Unused (new) product condition, on the other hand, refers to the condition of 
the parts that are not affected when the product is rejected from product 
assembly. Similarly, products return from supply chain (distributor) are unused 
and in new condition with remaining shelf life. 

ii. Predictable product condition for a warranty return product is possible since 
the product has been used in a limited timeframe. The condition is predicted 
with reference to the historical data on failure rates of similar products in the 
earlier stages of product development. The condition of leasing product is 
somehow predictable as the up-to-date product information is available during 
product maintenance. However, the complete identity of the product could not 
be confirmed as there is a mixture of old and new parts in the reconditioned 
equipment. Therefore, it is rather difficult to accurately predict overall product 
performance. 

iii. Unpredictable product condition is identified from the consumer returns which 
there are unknown operating condition and environment during usage stage. 
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Thereby, the condition of the product could have large variation. For instance, 
the condition of a 5 -year-old machine could be better than a similar 2-year-old 
machine under different operating frequency. 

With the source of product returns, EoL product condition can be further 
streamline so that it can be quantified more accurately. 


The Methodology for Product Recovery 

Figure 1 shows the methodology of product assessment at the point of return. When 
a product is first received, the date of return must be recorded so that such 
information can be referred to the date of manufacture to determine its used life. 
The used life of the product is an important indicator to judge product reusability. If 
the used life has over the designed life, where the reliability of the part is not 
covered in design phase, the part is not appropriate for reuse. Otherwise, remaining 
useful life of the part shall be determined based on the historical or statistical data in 
product development phase. To better assess the part for recovery, specific type of 
material needs to be identified for the right treatment. Valuable parts are chosen so 
that the embedded resources are worthwhile to retrieve. Subsequently, the returned 
part shall be classified based on the types of return channel. The condition of 
returned product from different channel is explained in the following section. 
Consumer returns include end-of-lease return, warranty return, and end-of-use 
return. The operating condition is commonly known for the leased product, which 
provides the details of usage trend. It eases the observation effort to identify the 
product condition. On the other hand, if there is lack of usage information, product 
condition can be determined through indentifying the critical degrading area, 
measuring the level of degradation, and then comparing with the manufacturing 
data. Several recovery options can be explored such as reuse, remanufacture, or 
recycle. Finally, recovery value is computed with the particular processes based on 
the recovery option. 

The EoL products can be categorized according to the few key characteristics 
that are used to describe the condition of the product. For example, when an EoL 
product is collected, it will be sorted out based on the types of return and product 
model. Then, the EoL product will be going through verification, assessment, and 
determination of critical breakdown component and severity as referred to the data 
available in design stage. If the EoL product meets the reusable specification, the 
product will be reused. Otherwise, the EoL product will be decomposed into 
subassembly parts - mechanical part, electrical part, and consumables as shown 
in Fig. 2. Then, each of the subassembly part can be further tested to determine the 
quality. Finally, the component can be graded as good, moderate, poor, and very 
poor with reference to the characteristics generated in design and development 
stage. The result of “good” reflects that significant value can be recovered from EoL 
product. On the other hand, “very poor” regards to zero value; in fact additional 
resources are required for handling the EoL product. 
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Fig. 1 Product recovery methodology 

EoL Product Measurement 

Product condition in this study implies the state of product’s health; more impor- 
tantly it indicates the potential cause of product failure. The purpose of scrutinizing 
the product condition is mainly to identify how significant the remaining product 
value can be recovered and what is the best alternative recovery option to salvage 
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Fig. 2 EoL product assessment flow 


Fig. 3 Crankshaft (http:// 
www.asia.ru/en/ProductInfo/ 

490872.html) 

the product. Types of products such as consumer durables (household appliances, 
cars, personal computer, etc.) are used frequently for short interval of time (within a 
day), while industrial and commercial products (equipment) are generally contin- 
uously operating for 24 h a day. However, this analogy does not confirm longer 
usage life in either case until measurement is executed at the point of return. Most 
of the products mentioned above are commonly made of electronics and mechan- 
ical component. This study focuses on the condition of mechanical component, 
where there is a list of failure mechanism, such as fatigue, corrosion, wear, and 
fretting, to name a few. Crankshaft of refrigerator compressor is presented to 
illustrate the concept of determining EoL product condition as shown in Fig. 3. 
The crankshaft is part of the compressor that functions to convert the linear piston 
motion into rotation. 

The condition of the above mechanical parts are measured in wear-out life (i.e., 
in terms of cycle), change of dimension from the original measurement, and 
cleanliness level. Figure 4 shows the part assessment flow, where the wear-out 
life of the product is first determined. If the usage life has over the design life, the 
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Fig. 4 Mechanical part assessment flow 


part will be sorted out as nonreusable part, which the recovery options are recycle or 
dispose. Otherwise, the part will be checked for the dimension changed. If the part 
dimension is within the reusable limit (design specification), the part is suitable for 
reuse after cleaning. On the other hand, the part will be considered for 
remanufacturing if the dimension changed is over the limit. Determination of 
cleanliness level of the part is necessary for cost analysis. Besides, carbon emission 
from recovery activities is computed to interpret the impact of particular recovery 
option to environment. All the information allows the decision maker to decide on 
the recovery options based on cost and carbon emission impact to the environment. 


Wear-Out Life 

Wear-out life measures the age of the product at the point of return; moreover, the 
information allows manufacturer to estimate remaining life of a product or com- 
ponent after it is collected from the user. The bathtub curve is widely used in 
reliability engineering. It describes a particular form of the hazard function which 
comprises three parts. The first part is a decreasing failure rate (early failures), the 
second part is a constant failure rate (random failures), and the third part is an 
increasing failure rate (wear-out failures). 

The life spans of a product or component usually follow one of the curves described 
above. This curve can be predicted and plotted by manufacturer during product 
development phase by collecting and analyzing data through running reliability testing. 
The manufacturer can also do the same for a subsystem. The widely used methods to 
determine the reliability of the product are tests such as accelerated life test, endurance, 
temperature cycling, and so on. 
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This information will help manufacturer to determine the remaining “safe to 
use” period when the component is collected from known period of usage. Every 
product has a unique serial number which can be traced back to the manufactured 
date. However, to accurately predict the remaining life of the product, it is impor- 
tant to consider also the usage behavior, frequency of usage within a period of time, 
and unexpected operation of the product outside product specification. The out- 
come from wear- out life valuation enables the decision maker to make quick 
decision on whether to reuse the product or component. 


Change of Dimension 

Change of dimension is one of the common mechanisms observed in failure 
machines or machine parts, where it results in loss of material by mechanical 
removal. This information not only informs the condition of the product; more 
importantly it provides a clue to cost computation based on different recovery 
option and thus estimates the optimal profit. The critical problem to solve here 
involves searching for the best method to detect loss of material and how to patch 
the loss of material. 

There are many methods to detect loss of material and many are costly. In any 
case, the method selected must be the most cost-effective and accurate. A simpler 
method to measure dimension change is using profilometer, where the stylus moves 
relative to the contact of surface. This method could be economical; however, it is 
time taken. Another advanced method for thickness measurement called electron 
energy loss spectroscopy (EELS) is introduced. The theory behind this method is 
electron scattering and energy loss. With the known original dimension, analysis 
method could be programmed and absolute dimension changed values can be 
accurately determined in milliseconds. Nevertheless, the equipment could be sev- 
eral times more expensive than profilometer. Taking recovery of high value product 
and large volume into consideration, return on investment of the advanced machine 
will be recovered in the long term. 


Cleanliness Level 

Cleanliness level of a returned product is another important indicator to quantify a 
product condition. The contaminated or dirty surface causes bad heat dissipation, 
lowers flow rate, and thus affects the internal pressure and so on. With a layer of 
contaminant or dirt covers on the surface, it is impossible to patch the loss material 
for reuse. Moreover, the dirtier a part is, the longer time and higher cost to clean the 
part is required. 

In industry practice, all parts that require cleaning are treated as the worst case. 
Thereby, the part cleaning line in production is planned in such a way that higher 
chemical washing concentration is used to clean all parts. In another case, manufac- 
turer might fix a longer washing time in order to make sure all dirt is totally cleaned 
up. Nonetheless, the concentration of chemical agent and time taken to wash associ- 
ated with cost. Hence, inspection of cleanliness level is introduced so that part cleaning 
process could be done cost efficiently. A manual way to identify part cleanliness level 
is via visual inspection, which the labor sorts out the dirty part according to his 
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Table 2 Part cleanliness measure 


Cleanliness 

level 

Cleanliness measurement and definition 

Action taken 

1. Very 
dirty 

Total weight — part weight (3 or more units of gross 
contaminant) 

2 wiping 
time + wash 

2. Dirty 

Total weight — part weight (2 unit or gross contaminant) 

1 wiping 
time + wash 

3. Clean 

Total weight — part weight (1 unit of gross contaminant) 

Wash 


experience. A more accurate and direct method to recognize the cleanliness of part 
could be using gravimetric measurement, where a highly sensitive scale can detect 
gross contaminant. To automate the process, this scale can be integrated with EELS to 
measure the change of part dimension and at the same time attain the cleanliness status. 
As shown in Table 2, the lowest level of cleanliness represents very dirty, while the 
highest level of cleanliness stands for clean. A wiping step is introduced to clean the 
part with the lower level of cleanliness instead of having multiple wash cycle. 


Types of Product Recovery 

There are numerous strategies for the treatment of EoL products. A list of the 
potential treatment options is as follows: 

• Reuse 

• Repair 

• Remanufacturing 

• Refurbishing 

• Recycling 

• Composting 

• Incineration 

• Landfill 

The options to choose depend on the conditions of the EoL products as described 
in the previous section. Each option shall be designed and optimized in the 
objective to recover as much value as possible from the EoL product at a minimum 
cost and minimum environmental impact. Figure 5 as follows shows the assessment 
flow for EoL reciprocating compressor. A crankshaft is used to examine the method 
to decide product recovery options among reuse, remanufacture, recycle, and 
dispose: 

i. Reuse 

In reuse, the part is inspected and channeled to assembly and distribution. 
Reuse only involves evaluation cost; however, the recovered value is lower 
than remanufacture part as it is recovered as a functional used product. 
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Fig. 5 Assessment flow for compressor part 

ii. Remanufacture 

In remanufacturing, the part is cladded before undergoing fine machining, 
cleaning, and inspection. The cladding process is affected by the amount of 
wear experienced by the part, and it is averaged for the purpose of this study. 
After these, the part is finally assembled and distributed. Remanufacturing 
costs involve EoL product evaluation cost and also cost to repair/remanufac- 
ture the EoL product so that it is comparable to a new product. 

iii. Recycle 

In recycling, the part is transported to be recycled, before it gets a new lease of 
life and follows the production of a new part. 

iv. Disposal 

In disposal, the part is transported to be landfilled, before it gets a new lease of 
life and follows the production of a new part. The part selected is iron-casted, 
thereby disposal will follow the process of actual recycling, as opposed to 
traditional incineration. The value that can be recovered from disposal is the 
least as there is cost involved to manage the disposal, while the only benefit is 
reduced environmental impact. 


New Product Information to Support Product Recovery 

Identifying the environmental aspects and fulfilling all the performances through- 
out the life cycle may be complex. However, with the closed-loop product life cycle 
framework in Fig. 6, knowledge that feedback to manufacturing engineers will able 
to define the changes for efficient and cost saving process, while design engineers 
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Fig. 6 Product design information for EoL product recovery 

define the degree of change in succeeding the integration of all aspects in producing 
attractive product cost and maintaining product functionality. 

As referred to the framework below, when a used product is collected, the 
return date will be captured and the product will be tested for its performance for 
product remaining life estimation. While at the design stage, useful information 
such as product specification, product design quality, and reliability data play 
relevance to this cycle. Product specification dictates the acceptance level for EoL 
product recovery options, product quality corresponds to the recovery value, and 
reliability data are to confirm if the functions of the product are statistically 
significant. Therefore, discovery of new product design data and EoL product 
status enables decision makers to predict product design life span and desired 
quality. As, for example, taking into consideration the returned product status and 
match to the product designed quality, it enlightens product designer the depth of 
innovation suitable for upcoming designs. On another note, the design-EoL cycle 
also sees varied decisions made depending on the recovery option chosen. As 
brought up by Thierry et al. (1995), through sustainability, products can be 
designed for simplified disassembly - so that relatively transient components 
are isolated for easy access during dismantling and reassembly. Compatible 
parts can also be modularized. This framework can be extended and implied in 
determining which of the recovery options is more appropriate for the given 
product. If the remaining quality is relatively good, remanufacturing could be 
desired over discard. Hence, the particular product should be designed for 
remanufacturability. In other words, this closed-loop method inherently eases 
the recovery option in the product design. 
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Fig. 7 EoL product recovery steps 


Design for EoL Product Recovery 

Figure 7 outlines the steps for product recovery. With the input of product condi- 
tion, decision maker decides on selecting the recovery process to obtain the optimal 
recovery value. Finally, the product is restored, where the recovered product is the 
output of the recovery process. EoL product information is not merely a decision 
input; more importantly, the knowledge learned enables better design for EoL 
product recovery. If the EoL product information is utilized wisely, product recov- 
ery process could achieve more efficiently and productively. 

The design of product recovery process shall enable each product recovery cost 
to be highly efficient so that one can optimize resources required to recover the EoL 
product according to the product nature. This section shall examine the design for 
each of the following process: 

• Reusability 

• Remanufacturability 

• Recyclability 


Reusability 

Reusability indicates that the EoL product can be reused from evaluation of EoL 
product condition, though the EoL product is lagging behind a new product in terms 
of quality and reliability. In this case, the EoL product does not require any 
refurbish process; it only requires test to determine that the EoL product condition 
passes the threshold of reusability. Therefore, the cost involves only the EoL 
product condition test. 

The design of EoL product condition test in most cases requires the following to 
achieve high optimization: 

• Accuracy 

• Precision 

• Short test time 

• Less material or manpower required 

Automation becomes an obvious option, and thus the test has to be planned, 
designed, and maintained. Today, there is a lot of high-end equipment which allows 
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user to do different kinds of measurement quickly without much manpower 
required as long as the setting is preset. Besides, these machines can communicate 
with other machines to enable data sharing or data analysis. This will enable the 
user to quickly interpret the EoL product condition as reusable. 


Remanufacturability 

Remanufacturability specifies that the EoL product is appropriate for reprocess to 
become as new product. The cost incurred here is higher than reusability as 
mentioned above due to the extra process required to reprocess the EoL product, 
and therefore the value recovered has to be higher too to make economic sense. In 
most of the cases, remanufacturability is only viable if the EoL product can be 
repackaged as a new product as the customer is unwilling to pay good price for a 
used product. 

In order to design a product for remanufacturability, one can apply the automa- 
tion strategy of assessing the EoL product condition as mentioned in the section on 
reusability above. This assessment shall be able to provide information such as 
which component is not working according to the specification. After the assess- 
ment, the EoL product has to go through reprocess so that it can recover the 
functionality, quality, and reliability comparable to a new product. In order to 
achieve this, one has to consider the following steps (Fig. 8). 


Recyclability 

Recyclability in this context refers to the decomposition of the EoL product and 
conversion of the subcomponent back to raw material. The EoL product is recycled 
in this case, is largely due to irreparable or rectifiable condition. Another reason for 
recycling is when the cost for repair is too high that it does not make any economic 
sense. Therefore, the most critical process design for recyclability would be deter- 
mining the EoL product condition, decomposition, and conversion. 

During the EoL product condition determination, one has to design the key 
parameters or conditions that can quickly indicate the high cost of repair. For example, 
the X-ray machine that shows the crack on mechanical parts can be costly to repair. 
Therefore, first to detect the costly component will cut short the process to decide on 
recycling. 

Secondly, decomposition of the EoL product can be time consuming and resulted 
in high cost. This can be improved if manufacturer has considered decomposition 
during product development and manufacturing. For example, manufacturer can 
design the product to be in modular form so that certain easy to fail parts can be 
grouped together for easy isolation or replacement during decomposition process. 

Lastly, the subcomponents shall be converted to raw materials if it is confirmed 
not to be repaired or reused. The conversion usually involves high cost as a lot of 
energy is required to convert end product to raw material. To overcome this, one has 
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Fig. 8 Design steps for remanufacturability 

to design this process so that the conversion is being done in high volume to achieve 
the economy of scale. Besides, adopting high technology method can also help to 
reduce the effort of conversion. 


Residue Value 

As explained earlier, residue value is the remains value after recoverable value has 
been retrieved from EoL product. In any case, residues are the waste resources. 
Thereby, residue value should keep as minimum as possible. As for the EoL product 
that cannot be further recovered by using the three methods mentioned above, it has 
to be discarded. This conclusion can be due to the following: 

• The cost or time required to recover does not make economic sense. 

• There isn’t a good method to recover the EoL product due to technology constraint. 

• The recover method will cause high impact to the environment. 

Therefore, it is important to design the management of residue value. Usually 
there will be cost incurred to manage the residue value in order not to cause impact 
to the environment. 


Marketability 

The commercialization of recovered EoL product is equally important as the design 
of product recovery process, and this has to be considered or designed as early as 
possible during the development of the product. One can plan the market of 
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recovered EoL product for each category including reuse, remanufacture, and 
recycle to maximize the recovered value. In practical, one has to consider the 
potential demand, market, and profits that can be generated from each category of 
recovered EoL product. Thus, it is important to differentiate the recovered EoL 
product characteristic so that one can market or price it accordingly. 

For reusability, the right market should be the secondhand market because it is a 
used product and the quality grade is lower than a new product. It is important to 
counter the two biggest challenges to serve this market, which are pricing and 
quality. Customer will refuse to spend above certain level for a used product, and 
thus one has to strike a balance between the pricing and the quality guaranteed. For 
example, from data collection in the lab, one can determine the remaining life of 
EoL recovered product under reusability category and propose an optimized war- 
ranty period to comfort the anxiety of customer buying a secondhand product. 

The difference between recovered EoL product from reuse and remanufacturing 
has to be carefully crafted so that customer sees values in both options. Manufac- 
turer has to position of recovered EoL product from remanufacturing at higher 
grade compare to reuse. This is because more cost is usually spent during the EoL 
product recovery process for remanufacturing such as replacement of faulty or 
degraded subcomponent. Therefore, manufacturer could market recovered EoL 
product from remanufacturing close to new product in terms of pricing and quality. 

As for the recycled product, it is usually recovered in the form of raw material or 
at a very low subassembly level. Therefore, one has to observe the raw material 
market prices and demand to justify the investment or operation of recycling EoL 
product. At the same time, the cost of disposal and local regulatory can help to 
support the recycling option. Recycling sometimes can do more harm to environ- 
mental impact as it requires reverse engineering and high energy consumption. 


Cost and Environmental Impact Analysis 

This section will be focusing on saving in terms of cost and C0 2 , which is used as 
the indicator for environmental burden. In this study, the product ending in disposal 
is the default path in manufacturing; given OEMs act in accord with the take-back 
regulations. 


Life Cycle Cost Analysis 

Recovering the EoL product always involves a cost, and this cost can vary 
depending on the choice of product recovery process. A simple rule of thumb is 
that the product recovery cost shall not exceed the cost of manufacturing a new 
product or the cost of disposal. Thus, it is critical that one considers all the costs 
incurred for recovering the EoL product from EoL product collection, logistic, EoL 
product evaluation, recovery process, recovered EoL product packaging and 
marketing, etc. 
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The cost calculation for a new product can be seen in Eq. 1 , where the cost of 
manufacturing a new part is one of the cost components in the disposal cost (Eq. 2) 
and recovery cost (Eq. 3). Operational cost in Eq. 4 is one of the cost components in 
manufacturing and recovery cost: 
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where : 


C new = New product cost 
C dis = Disposal cost 

C rec -/ = Recovery cost of option i, i = 1 (Reuse); i = 2 (Reman.); 
(Recycle) 

C op = Operational cost (including direct material, process, direct labor) 
C oh = Overhead cost (including indirect labor, rent, utilities) 

Cproc = Procurement cost (including collection, transport, take back) 

C dep = Machine depreciation cost (assume straight line depreciation 
5 years) 


C 

C 


mat 


Material cost 


process-y 


Cost of process j, j 




over 


Life Cycle (Environmental Impact) Assessment 

Life cycle assessment (LCA) is a widely used technique for assessment of envi- 
ronmental impacts throughout the entire product life cycle. Of the entire life cycle 
of crankshaft (from refrigerator compressor) as a case study, LCA is applicable to 
the material extraction, transportation, and manufacturing process of the product. 
The main contributors to ecological burden are energy usage (in the form of 
electricity) in material extraction and manufacturing process, as well as fuel 
consumption in transportation. These inputs are shown in Table 3. 

In view of the above inputs, the following values, unique to this case study, are 
keyed into the SimaPro program for each of the reprocessing options: 

1. 1 kWh of electricity in Singapore is comprised of (Tan et al. 2010): 

i. 0.758 kWh natural gas, burned in power plant 

ii. 0.219 kWh electricity, oil, at power plant 

iii. 0.023 kWh electricity, waste, at municipal waste incineration plant 


94 Product Characteristic Based Method for End-of-Life Product Recovery 


3395 


Table 3 Inputs for 
resource consumption 


Input 

Types 

Unit 

Electricity 

kWh 

Material 

Kg 

Transport type 

tkm 

Fuel 

/ 


2. Power consumed for recycling using induction furnace: 700 kWh/tonne 
(Gandhewar et al. 2011) 

3. Transportation used: transport, lorry, 3. 5-7. 5 tonne, EURO 3 

Global warming is used as the impact category; thus the environmental impact to 
be obtained is the sum of the carbon dioxide (C0 2 ) emission from all types of 
energy consumed. 

The environmental impact caused by recovery activities is calculated using 
equation (5) 


Total C0 2 emission 


(Electricity consumption x electricity grid emission factor) 
+ (Fuel consumption x GHG emission factor) 

(5) 


where 

the electricity grid emission factor (Singapore NEA 2012) is 
1 kW electricity = 0.5716 kg C0 2 e/kWh 

and the direct GHG emission factor (Guidelines to Defra 2012) is 
llitre of diesel = 2.6763 kg C0 2 e/1 

The environmental impact caused by carbon emission from making new product 
is calculated using Eq. 6, carbon emission from product disposal is calculated with 
Eq. 7, and carbon emission from product recovery options is determined using 
Eq. 8. Carbon emission from recovery operation in Eq. 9 is one of the carbon 
components in all of the above equations: 
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Fig. 9 The normalized chart of environmental impact model for crankshaft 


where: 


El 

El 


new 


dis 


EI 


rec— / 


Environmental impact caused by making new product 
Environmental impact caused by product disposal 
= Environmental impact caused by recovery option i, 


i 


EI 


1 (Reuse); i = 2 (Reman.); i = 3 (Recycle) 

= Environmental impact caused by process j,j = 1,2,3... 


process — j 

The LCA is done using SimaPro program, and ReCiPe (midpoint) is utilized as 
the assessment method. Among the three cultural perspectives, egalitarian is chosen 
for it represents long-term and conservative environmental mindset. Furthermore, 
the environmental impact calculation is reflected in kg C02 equivalent, which is 
located under the “climate change” impact category (Flowers et al. 2003). LCA is 
done for all options - reuse, remanufacture, recycle, and dispose. In the end, the 
procedure generates various environmental impact indicators, as shown in Fig. 9. 


Decision Making in Selecting EoL Product Recovery Option 

Rapid changing technologies on the product front has aroused consumer pursue for 
the latest goods. If the assertion holds true, more products would have shorter life 
span and this resulted in unwanted products surging high. Therefore, caution is 
needed for an equal fast response from the waste products. On top of that, products 
such as consumer durables are at the competitive edge in the market. Manufacturer 
could merely earn a narrow profit or even lose without a prudent manufacturing 
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planning. To meet the challenges, manufacturers have to select appropriate 
manufacturing strategies including recovery options, manufacturing processes, 
workpiece, machinery, and so on. The decision-making process is more challenging 
today as more aspects are taking into consideration. One of the most common 
problems faced by decision maker is assessing wide range of alternative options and 
choosing one based on the conflicting criteria. The section as follows illustrates the 
process of decision making using simple and logical methods to select the most 
optimal option by eliminating the unsatisfactory alternatives to better strengthen the 
existing decision-making procedures. 


Decision-Making Process 

Decision making is one of the most important actions engaged in planning and 
selecting strategies to accomplish company goal. Firstly, decision making involves 
selecting an option from a number of alternatives - reuse, remanufacture, recycle, 
or discard the EoL product to achieve cost-effective manufacturing. Secondly, 
decision making is a process that implicates more than simply a final decision 
among the alternatives - what is the product reliability and impact to the environ- 
ment if EoL product is reused, remanufactured, recycled, or discarded. Lastly, the 
decision made is related to action taken that the decision maker engages in to 
achieve the goal - to develop EoL product recovery production line. A general 
decision-making process is explained in the following steps (Fig. 10). 

Step 1: Defining Problem 

Original equipment manufacturer (OEM) is required to manage the EoL product 
(with the premise of take-back regulations enforced). Within the company each 
subunit is expected to have targets, such as reducing raw material by adding 
recycled material from EoL product, finding new market for the recovered product, 
developing new approach of logistic planning, and so on. Generating these targets 
becomes the basis for identifying the problem, deciding on the actions taken, and 
evaluating the outcomes. Overall, understand the problem situation is important as 
it affects the quality of the decision. 

Step 2: Identifying Requirements 

Requirements are the conditions that must meet in accordance with the problems 
set. These requirements are the boundary describing the possible solutions to the 
decision problem. In order to retrieve the maximum embedded value from the EoL 
product efficiently, one should determine the condition of the product at point of 
return. Information such as product design specification and product reliability data 
are used as benchmark so that the exact quantitative form of requirement can be 
stated. The quality of a reusable product should be as good as new, however less 
reliable. As for a remanufactured product, remain or change of dimension is 
allowed as long as it meets as new condition. With the exact quantitative form of 
requirement on hand, it can prevent the ensuing debates on judgmental evaluation. 
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Identifying requirements 



Fig. 10 Decision-making steps 

Step 3: Establishing Goal 

Goals are the broader statement of desired outcomes toward which effort is 
directed. In this context, the end goals in decision making are maximizing the 
profit (via optimal retrieve EoL product value), minimizing the environmental 
impact, and maximizing (to meet) the product technical condition. 

Step 4: Generating Alternatives 

Once the goals have been identified, the next step in the decision-making process is 
to generate alternatives to the goal. Manufacturer has to search for alternative 
means of reaching the goals. In this step, relevant information and the likely 
consequences must be gathered. As such, manufacturer must seek as much infor- 
mation as possible pertaining to the likelihood that each alternative would result in 
the achievement of various outcomes. For example, manufacturer should consider 
the solution of setting up a recycling line or engage the third-party recycler could 
result more cost-efficient. Moreover, the extent of generating alternatives is 
bounded by the importance of decision, cost, and value of additional information 
needed to evaluate the alternative and number of people affected by the decision 
(Zopounidis 2011). The more important, extensive, and greater number of people 
involve in the decision, the higher cost of evaluating, and lengthy time are required. 
In this context, the recovery alternatives for EoL product are reuse, remanufacture, 
recycle, and dispose. 
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Fig. 1 1 Hierarchy of generating criteria and sub-criteria 


Step 5: Identifying Criteria 

With the end goals in mind, criteria among the alternatives can be identified. This step 
is necessary as the criteria indicate how well each alternative accomplishes the goals. 
As shown in Fig. 1 1, the sub-criteria are grouped into a set of related criteria. Grouping 
the criteria is particularly helpful in scmtiny whether the set of criteria is rightly 
adapted to the problem. Besides, it is straightforward for the process of calculating 
criteria weights using some decision analysis methods. Another advantage of group- 
ing the criteria is it aids in visualizing the emergence of higher- level affair. 

Step 6: Selecting Decision Analysis Method 

There are a number of decision analysis methods for solving a decision problem. 
Nonetheless, choosing the particular method depends on the complication of the 
problem as well as the objective of the decision maker. Sometimes it requires 
integration of few methods to solve a problem. References of further readings are 
recommended (Munier 201 1). In the case study, two subgoals are identified; thereby 
multi-attribute utility theory (MAUT) is applied to the decision-making analysis. 

Step 7: Evaluating Value or Business Model Against Goals 

After getting the criteria for each alternative, manufacturer could develop value or 
business model to evaluate the alternatives. In the process of evaluating the 
business model, manufacturer should verify (1) feasibility of the alternative, 
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(2) satisfactory of the requirement, and (3) impact to the people. In order to validate 
the first aspect, manufacturer must confirm that the costs of handling product 
recovery are always lower than making a new product. The handling process 
costs for each alternative are considered based on the process flow shown in 
Fig. 5. Besides, costs that include operational, overhead, procurement, and machine 
depreciation are calculated using Eqs. 1, 2, 3, and 4. It is necessary to specify the 
calculations on different cases. For example, 

Alternative A: Value = Revenue — Costs 
Alternative B: Value = Revenue — Costs — Taxes 

If the problem involves certain time period, it is appropriate to use the notion of 
net present value (NPV) to model the value: 



cash flow at time t 

(l+r)' 


where r is the discount rate and n is the number of years. 

In addition to feasibility study on each alternative, manufacturer must achieve 
the cost-benefit requirement. For instance, cost of remanufacture has to be 50 % 
lower than a new product price. Finally, the chosen alternative must be acceptable 
to the person who has to commit to the consequences of the decision. 

Step 8: Analyzing the Sensitivity and Uncertainty 

Next, sensitivity analysis is worked on the business model to assess the sensitivity of 
potential changes and errors. This analysis helps decision maker to identify which 
parameters are the main drivers of the model’s outcome. In the case study, one-way 
sensitivity analysis is done to assess the impact change in certain parameter that will 
have on each alternative’s outcome. The model’s results are shown graphically in the 
form of tornado diagram (Fig. 12). To achieve the most profit by integrating the 
recovery option to production line, remanufacturing cost, recycling cost, 
remanufactured product selling price, and recycled material selling price are the 
key influential parameters. Deterministic analysis is acquired to determine the most 
influence parameters, followed by stochastic analysis to further optimize the result. 

To simplify the process, Monte Carlo simulation is applied to study the uncer- 
tainty conditions, followed by plotting the cumulative probability graph. The ana- 
lyses ease the decision maker to understand the risk profile of each alternative 
(Fig. 13). The results in the figure show that alternative 1 has the least uncertainty 
condition, which is also the least risky option. Alternative 2 dominates alternative 3; 
in other words, the returns (NPV) of alternative 2 are always better than alternative 
3. However, alternative 3 tends to have the largest fluctuation (risk) as compared to 
the other 2 options. Overall, if the decision maker is a risk taker, remanufacture of 
EoL product will be the best option. As for the risk averse decision maker, setting up 
the reuse recovery production line should be the safest choice. 
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Fig. 12 Combined tornado diagram 


Step 9: Evaluating Effectiveness on Decision 

The last step in the decision-making process is evaluating the effectiveness (qual- 
ity) of the decision. All the process can be done using simulation tool based on 
user’s input, which the simulated results assist in observing the trend. However, in 
actual case, when an implemented decision does not produce the desired outcomes, 
revision in part of the process is needed. Typically, inadequate definition of 
problem is the major flaw. After readdressing the problem formulation, user will 
go through the same process, thus generating new perspective of analysis. 

Summary 

With an eye on sustainable manufacturing, attention is shifted to reverse supply 
chains to reduce the organizations’ carbon footprint and preserve our physical 
environment. But perhaps above all, from a market perspective, sustainability 
requires gathering more business logic and favorable bottom lines to trigger the 
change. The proposed framework which allows reuse, remanufacture, and recycle 
of EoL product can boost the value retrieved from return goods through proper 
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Cumulative Chart for All Alternatives 
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Fig. 13 Combined risk diagrams for all alternatives 


planning and design. Each of the options discussed above essentially increases 
product knowledge, creates channels for information feedback, lowers production 
costs and selling prices, improves customer relationships, and widens presence in 
new and old markets. 

In conclusion, the proposed methodology is able to stimulate and support 
product and process innovation, thus enhancing bottom-line performance, deliver- 
ing cost benefits to the company, and ensuring sustainability. It serves to create new 
knowledge, increase failure detection, and reduce risks, as well as provide alterna- 
tive solutions with integration of public policy. 
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Abstract 

Waste Electrical and Electronic Equipment (WEEE) is one of the most signif- 
icant waste products in modem societies. Disassembly is a critical step to reduce 
Electrical and Electronic Equipment (EEE) waste. In the past two decades, 
despite disassembly has been applied to support recycling and remanufacturing 
of WEEE products worldwide, full disassembly of WEEE is rarely an ideal 
solution due to high disassembly cost. Selective disassembly, which prioritizes 
operations for partial disassembly according to the economic considerations, is 
becoming an important but still a challenging research topic in recent years. In 
order to address the issue effectively, in this chapter, space interference matrix is 
generated based on a product model to represent the space interference relation- 
ship between each component, and all feasible disassembly sequences can be 
obtained by analyzing the space interference matrix with a matrix analysis 
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algorithm. Then, a particle swarm optimization (PSO)-based selective disassem- 
bly planning method embedded with customizable decision-making models is 
applied, which is capable to achieve optimized selective disassembly sequences 
for products. Finally, industrial cases on liquid crystal display (LCD) televisions 
are used to verify and demonstrate the effectiveness and robustness of the 
developed research. 


Introduction 

The mounting demand for new products has brought more production activities 
worldwide in recent years. The rapid development, however, has been hindered by 
the increasing concerns of the scarcity of natural resources and environmental 
issues. It has been estimated that the required bio-capacity of two Earths is 
necessary to satisfy the need of the development in 2050 according to current 
production and consumption trends (Jovane et al. 2008). On the other hand, more 
and more products after services are filled up in landfills. Among them, Electrical 
and Electronic Equipment (EEE) after services, that is, Waste Electrical and 
Electronic Equipment (WEEE), is becoming one of the major and challenging 
waste streams in terms of quantity and toxicity. For instance, there is approximately 
seven million tons of WEEE generated in Europe per year (Walther et al. 2010). In 
China, 1.1 million tons of WEEE is generated per year (Hicks et al. 2005). Due to 
the rapid technical innovations and shorter usage life cycle of EEE, WEEE is 
growing much faster than any other municipal waste streams. In order for the 
Earth to be cleaner, end-of-life (EoL) recovery strategies are critical to shape the 
future of WEEE life cycle management patterns. Among the strategies, 
remanufacturing is viewed as a “hidden green giant” and attracting escalating 
attentions of researchers and practitioners (Kopacek and Kopacek 1999; Duflou 
et al. 2008; Kernbaum et al. 2009; Hatcher et al. 2011). Remanufacturers seek to 
bring some components of products after their services back into “as new” condi- 
tions by carrying out necessary disassembly, overhaul, and/or repairing operations 
for reuse to extend life cycles. There are two driving forces for industries in 
adopting the relevant technologies and practices, i.e., stricter legislative pressure 
for environmental protection and better profit margins from remanufacturing. The 
explanations are expanded below: 

• The WEEE Directive has been enacted and implemented from 2003 in Europe, 
and the equivalent Directives have been developed in different countries of the 
world. Further proposals for the tighter WEEE Directives have been suggested to 
regulatory bodies with an aim to make products and components after services 
more recyclable, reusable, and remanufacturable. According to the WEEE 
Directives, a producer (manufacturer, brand owner, or importer)’ s responsibility 
is extended to the postconsumer stage of WEEE, instead of stopping at selling 
and maintenance (i.e., extended producer responsibility - EPR; Mayers 2007; 
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Sander et al. 2007). The EPR is aimed at encouraging producers especially 
manufacturers to provide cradle -to -grave support to reduce environmental 
impacts such that they work closely with remanufacturing industries to recover 
maximum values and reduce environmental toxicity/hazardousness. For 
instance, the remanufacturing legislative initiatives are underway in the EU 
and the USA to ensure original equipment manufacturers (OEMs) and suppliers 
to provide free access to remanufacturing information facilities in global chains 
(Giuntini and Gaudette 2003). 

• Good remanufacturing planning and management can effectively balance eco- 
nomic and environmental targets and close gaps between the shorter innovation 
cycles of EEE and the extended lives of components of WEEE. 
Remanufacturing industries in the EU and worldwide have been recently grow- 
ing quickly because of better economic return values. There are a number of 
successful cases in industries, including single-use cameras (Eastman Kodak and 
Fujifilm), toner cartridges (Xerox), personal computers (IBM, HP, Toshiba, 
Reuse network - Germany), photocopiers (Fuji Xerox - Australia, Netherlands, 
and UK), commercial cleaning equipment (Electrolux), washing machines 
(ENVIE - France), mobile phones (Nokia, ReCellular, USA; Greener Solution, 
UK), etc. 

Disassembly planning, which is used to determine sensible disassembly opera- 
tions and sequencing, is critical in remanufacturing. Effective disassembly planning 
can significantly improve the recycling and reuse rates of components and materials 
from WEEE to ensure maximum value recovery. For a set of WEEE, there could be 
a number of different sequences of disassembly operations constrained technically 
and geometrically between the components of the WEEE, leading to the different 
decision-making models according to the perspectives and criteria of stakeholders 
(Kara et al. 2006). As thus, it becomes difficult for remanufacturers to solely depend 
upon their experiences to plan disassembly operations so as to recover a larger 
proportion of components and fulfill environmental targets at a reasonable cost. In 
the past years, research has been carried out to address the issues of disassembly. 
The previous research can be generally summarized as the following two 
categories: 

• Disassembly for design. Disassembly approaches for EEE such as consumer 
electronic products have been developed to use smart materials like shape 
memory polymers (SMPs) in the design of embedded releasable fasteners to 
facilitate the disassembly processes of the products (Masui et al. 1999; Chiodo 
et al. 2001; Jones et al. 2004; Braunschweig 2004; Hussein and Harrison 2008; 
Ijomah and Chiodo 2010). Design for remanufacturing/disassembly principles 
have been spread among Japanese manufacturers since products with the prin- 
ciples are more profitable in this context than those that were not designed with 
this purpose (Duflou et al. 2008; Sundin et al. 2009; Dindarian et al. 2012). 

• Disassembly planning and operation sequencing. Typical disassembly opera- 
tions based on manual, semiautomatic, and automatic processes and the 
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Fig.l A main flow of the developed approach 

associated toolkits were summarized (Duflou et al. 2008). Based on disassembly 
operations and the precedence constraint relationships among the disassembly 
operations, sequencing rules and intelligent and/or meta- heuristic reasoning 
algorithms were applied to deduce an optimal plan from a large pool of candidate 
solutions (Kara et al. 2006; Santochi et al. 2002; Lambert 2002; Kuo 2012). In 
recent years, remanufacturers are facing many challenges to disassemble WEEE 
due to their high customization and diversity, high integration level, and more 
complex assembly processes. Current economic analyses have demonstrated that 
full disassembly is rarely an optimal solution and necessary owing to high 
disassembly cost. Selective disassembly, which prioritizes operations to imple- 
ment partial dismantling of WEEE so as to take account of the legislative and 
economic considerations and meet the specific requirements of stakeholders, is a 
promising alternative and has therefore become a new research trend (Duflou 
et al. 2008; Renteria et al. 2011; Ryan et al. 2011). 

Attributing to booming personalized and mass-customized EEE, there is still 
challenge in applying the developed methods to the increasingly diversified and 
personalized WEEE to make sensible decisions and meet different stakeholders’ 
perspectives. This chapter presents a new method conducted in the area, and the 
main flow of the method is shown in Fig. 1. A summary of the developed approach 
is given below: 

• A space interference matrix is used to represent the space relationship of each 
component of WEEE in six directions in a Cartesian coordinate system. By this 
way, all the space interference relationships between components of WEEE can 
be digitally recorded and can be analyzed in the next step. 

• A matrix analysis algorithm is developed to find out all the feasible disassembly 
sequences of WEEE by analyzing the six space interference matrices in a 3D 
environment. It is capable to find out all the feasible disassembly sequences of 
WEEE, and the result can be used as a solution space to support a disassembly 
planning method to search optimized result. 

• A particle swarm optimization (PSO)-based selective disassembly 
planning method with customizable decision-making models has been 
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developed. The method is adaptive to various types of WEEE, flexible for 
customized decision modeling and decision making for different stakeholders, 
and capable for handling complex constraints and achieving better 
economic value and environmental protection requirements during disassembly 
planning. 

• In the end, industrial case studies on liquid crystal display (LCD) televisions are 
used to verify and demonstrate the effectiveness of the developed method in 
different application scenarios. 


Methodology 

Space Interference Matrix and Matrix Analysis Algorithm 

In the past two decades, there are many research articles published for disassembly 
research of WEEE. In the literature (Ying et al. 2000; Lambert 2003; Carrell 
et al. 2009), some detailed reviews on the research were made. Almost all those 
researches focused on the optimal disassembly solution searching. However, before 
applying disassembly planning and optimization techniques in real industrial cases 
such as LCD televisions, which is one of the main products of WEEE nowadays, it 
cannot evade the issue of the feasible solution space generation for further search 
and optimization (Wang et al. 2014). There are two reasons: (1) in real practice, if 
the disassembly sequence is obtained by searching all the disassembly sequences 
instead of the solution space, the result could hardly be used as there are some 
geometrical constraints to specify precedent relationships between disassembly 
operations, and (2) LCD televisions are normally assembled by many components 
with complex shapes. For an assembly LCD television with N components, the total 
disassembly sequences could be as much as N! = N x (N— 1). . .2 x 1. It is too 
difficult to search the best disassembly sequence within an acceptable runtime. In 
this section, an effective approach was developed to address the issues of the 
solution space generation of WEEE. 

Space Interference Matrix 

Firstly, based on a CAD product model, six space interference matrices are gener- 
ated in six directions separately in a 3D environment. It can be used to represent the 
space interference relationship of components of a product: 
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Fig. 2 Matrices in six directions to represent the space interference relationship 


Fig. 3 Product with four 
components 



In the matrix, the element E t in each row and column is one of the components in 
the product. The element tg shows the space interference relationship between 
components i and j in six directions (X + , X , Y + , Y , Z + , Z") in a 3D environment. 
If space interference exists between components i and j in one direction, the 
element t t j in the matrix is “7” in this direction. Otherwise, it is “ 0 .” 

An example is used here to explain the space interference relationship between 
“A” and “7?” (shown in Fig. 2). As the object “5” is in the X + direction of the object 
“A,” and the object “A” is in the X" direction of the object “7>,” the element t AB in the 
X + direction matrix is “7,” and the element t BA in the X" direction matrix is “7,” all 
other results are “0.” 

For example, Eqs. 2, 3, 4, 5, 6, and 7 are used to represent the space interference 
relationship between four components of a product (shown in Fig. 3): 
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Matrix Analysis Algorithm 

Based on the obtained space interference matrices in six directions in above, a 
matrix analysis algorithm is then developed to find all the feasible disassembly 
sequences of the product. Figure 4 shows the flow of the algorithm. 

An example is used here to explain the details of the developed matrix analysis 
algorithm. Firstly, Eq. 8 is generated by combining Eqs. 2, 3, 4, 5, 6, and 7 in six 
directions: 
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Fig. 4 Flowchart of the matrix analysis algorithm 
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Fig. 5 Feasible disassembly sequence analysis for the product 
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The Boolean operator “OR” is used here for the above equation at any row to 
determine whether a component can be freely disassembled in a direction. Equation 
9 is obtained below: 
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The result “111111” represents the relationship between one component and all 
remaining components of the product in six directions (X + , X , Y + , Y", Z + , Z"). If the 
result is always “1,” it means the component could not be disassembled in any 
direction; if the result includes “0,” it means the component can be disassembled 
from that direction. The example in Fig. 5 can be used to explain the concept. In Eq. 2, 
components “A” and “B” could not be disassembled in any direction as the result is all 
“1”; component “C” can be disassembled in Z + direction as the result is “0” in this 
direction; and component “D” can be disassembled in both ZA and Z directions. 

Here, component “C” is disassembled in ZA direction firstly, and the remaining 
combined space interference matrix is shown below: 
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Fig. 6 All the feasible disassembly sequences for the product 


From Eq. 10, only component “D” can be disassembled in both ZA and Z 
directions. Here, component “D” is disassembled in Z + direction, and the remaining 
combined space interference matrix is shown below: 



A B 

000000 010011 
100011 000000 


Result 

010011 

100011 



From Eq. 11, components “A” and “Z?” can be disassembled in three directions. 
After disassembling “A” in X + direction, the product has been disassembled 
completely. Loop the above analysis processing until all the feasible disassembly 
sequences of the product are obtained. Based on the above analysis and the 
developed matrix analysis algorithm, the total feasible disassembly sequences for 
the product is 192 (30 + 30 + 30 + 30 + 30 + 30 + 6 + 6) (shown in Fig. 6). 

The obtained result of all feasible disassembly sequences for the product can be 
used as a solution space to support our developed PSO-based selective disassembly 
planning method to search the optimized disassembly sequence based on customer 
requirements. The details are shown in the following section. 


Customizable Decision-Making Model and PSO-Based Selective 
Disassembly Planning Approach 

Customizable Decision-Making Modeling for Selective Disassembly 
Planning 

Disassembly of WEEE involves different stakeholders, such as environmental 
regulators and remanufacturers. The different levels of targets will lead them to 
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Fig. 7 Criteria used to develop different decision-making models to address various users’ needs 


adopt or develop different decision-making models. For instance, according to the 
WEEE Directive, WEEE regulators will check whether remanufacturing compa- 
nies are able to recycle at least 75 % of WEEE by weight and remove/recover all the 
hazardous materials. In other words, at least 75 % of WEEE are required to be 
dismantled to a component level, and all the components containing hazardous 
materials need to be taken apart from WEEE for further recycling and processing. 
Apart from fulfilling these fundamental environmental targets, remanufacturers 
would also improve the economic efficiency by prioritizing valued components 
during disassembly. In Fig. 7, an example of LCD WEEE is used to illustrate the 
above scenario. 

In order to develop a selective disassembly planning method that is suitable for 
stakeholders to process various types of WEEE and meet their specific require- 
ments, it is imperative to define customizable decision-making models. The models 
(disassembly indices and objective) developed in this research are described below. 

Disassembly Indices 

In the following formulas, two symbols will be used frequently and they are 
explained here first: 

n The number of the total disassembly operations in a plan of a set of WEEE 
m The number of the disassembly operations in a selective disassembly plan 
Position(Oper(/)) The position {sequence) of the ith disassembly operation in a 
disassembly plan 

• Selective Disassembly Plan {DP) and Disassembly Operation (Oper(/)) 

A set of WEEE can be fully disassembled using a disassembly plan. The number 
of all the operations in the plan is n. A selective disassembly plan {DP), which 
consists of a set of disassembly operations, is a part of the above complete 
operations. The number of the selected operations is m , and the i th operation is 
denoted as Oper(/). DP can be represented as 
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yyi 

DP = U (Oper(/), Position (Oper(/))) (12) 

z= 1 v 

where U represents the set of disassembly operations and m < n. 

For instance, there are a set of disassembly operations Oper(l), Oper(2), Oper 
(3), Oper(4), and their positions in DP are 4, 2, 1, 3 (e.g., Position(Oper(l)) = 4), so 
that the sequence of the operations in DP is Oper(3), Oper(2), Oper(4), Oper(l). 

Meanwhile, Oper(/) has some properties related to the environmental and 
economic targets defined as follows. 

• Hazardousness (//(Oper(/))) and Hazardousness Index (Index_//) 

//(Oper(/)) of the i th disassembly operation is to indicate the level of hazard- 
ousness contained in the component (s) removed by the operation from the WEEE. 
It can be represented in a qualitative means, i.e., high, relatively high, medium, and 
low, and converted to a quantitative means accordingly, such as (5, 3, 1,0) for 
(high, relatively high, medium, low). Index_// of a set of WEEE is to indicate the 
accumulated hazardousness contained in the component(s) removed by the disas- 
sembly operations in the WEEE. Index_// can be computed as below: 


m 

Index_// = (//(Oper(/)) * Position(Oper(/))) (13) 

7=1 

A smaller Index_// will be beneficial. The function of multiplying //(Oper(/)) 
and its position Position(Oper(/)) in DP is to ensure that the disassembly operations 
with higher hazardousness (i.e., //(Oper(/))) are arranged earlier in DP to achieve a 
smaller Index_//. 

For instance, the hazardousness of Oper(l), Oper(2), Oper(3), Oper(4) is (high, 
low, medium, relatively high), respectively, which can be converted to (5, 0, 1, 3). 
The positions of the operations in DP are (4, 2, 1, 3). Therefore, the hazardousness 
index of DP is (5*4 + 0*2 + 1*1 + 3*3) = 30. If the positions of the operations are 
rearranged as (1, 4, 3, 2), then the hazardousness index is (5*1 + 0*4 + 1*3 + 3*2) = 
14. The latter is lower than the earlier since the operations with higher hazardous- 
ness are arranged earlier in the latter. In objective defined later on, a weighted 
minimum hazardousness index will be pursued to ensure the operations to remove 
the most hazardous components will be arranged as early as possible to improve the 
efficiency of hazardousness removal in a selective disassembly plan. 

• Potential Recovery Value (V(Oper(/))), Disassembly Time (T(Oper(/))), and 

Potential Value Index (Index_V) 

V(Oper(/)) of the i th disassembly operation is to indicate the potential recovery 
value of the component(s) disassembled from the WEEE by the operation. The 
disassembled component(s) could be reusable so that V(Oper(/)) can be represented 
as the depreciation value of the equivalent new component(s). T(Oper(/)) represents 
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the time spent for the disassembly operation Oper(7). Index_V of a set of WEEE is 
to indicate the accumulated potential value index by the disassembly operations in 
the WEEE. Index_E can be computed as below: 

m 

Index, V = ((E(Oper(/))/r(Oper(/)) * Position (Oper(/))) (14) 

i=\ 

A smaller Index_V will be beneficial. E(Oper(/))/r(Oper(/)) represents 
the potential value recovery efficiency of Oper(/). The function of multiplying 
E(Oper(/))/T(Oper(/)) and its position Position(Oper(/)) in DP is to ensure that 
the disassembly operations with higher E(Oper(/))/T(Oper(/)) are arranged earlier 
to achieve a smaller Index_V so as to achieve a higher efficiency of potential value 
recovery for a selective disassembly plan. 

• Weight Removal (W(Oper(/))) and Weight Removal Index (Index_W) 

W (Oper(/)) is to indicate the level of the removed weight by the i th disassembly 
operation from the WEEE. It can be represented by the weight of the component 
(s) disassembled by the operation. Index_W of a set of WEEE is to indicate the 
accumulated weight removal index by the disassembly operations in the WEEE. 
Index_W can be computed as below: 

m 

Index_W = (W(Oper(/)) * Position(Oper(/))) (15) 

i= 1 

Similarly, a smaller Index_W will be beneficial. The function of multiplying W 
(Oper(/)) and its position Position(Oper(/)) in DP is to ensure that the disassembly 
operations with higher W(Oper(/)) are arranged earlier to achieve a smaller Index_W in 
order to improve the efficiency of weight removal in a selective disassembly plan. 

Disassembly Constraints 

During the process of disassembly, there are some technical constraints to specify 
precedent relationships between disassembly operations. An example in Fig. 8 is used 
to illustrate the concept. There is a single disassembly direction for components 
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A and B. Geometrically, it can dismantle either the joining mechanism between 
component B and housing first (Oper(l)) or the joining mechanism between 
components A and B first (Oper(2)). However, from the technical point of view, it 
is recommended to remove the joining mechanism between component B and 
housing first, considering that the disassembly of the second joining mechanism 
needs more operation space. Therefore, Oper(l) is constrained to be prior to 
Oper( 2 ) technically. 

Decision-Making Objective 

Disassembly decision-making will be modeled as a constraint-based optimization 
problem. The objective can be customized to address the different requirements of 
stakeholders through providing weight setting by users. The objective is 
represented below: 

Mimimise(Index_//, Index_E, Index_Vk) 

= Minimise (w\ * Index_// + W 2 * Index_V + W 3 * Index_Vk) (16) 

where uq — w 3 are the weights. The setting of weights can be used to reflect 
importance. A higher weight means more attentions will be paid to that index, 
and a zero value means such the index will not be considered. In order to 
rationalize the model, the three indices are required to be normalized to be in the 
same measurement scale. The late case studies can illustrate the normalization 
process. 

Improved Particle Swarm Optimization Algorithm 

The different selection and optimization sequencing of disassembly operations 
for a set of WEEE usually brings forth a large search space. Conventional 
algorithms are often incapable of optimizing the problem. To address it effectively, 
some modern optimization algorithms, such as genetic algorithm (GA) and 
simulated annealing (SA), have been developed to quickly identify an optimized 
solution in a large search space through some evolutional or heuristic strategies. 
In this research, an improved algorithm based on a modem intelligent algorithm, i. 
e., PSO, has been applied to facilitate the search process. Moreover, the improved 
PSO has been also compared with GA and SA for this disassembly planning 
problem to show the characteristics of the algorithms. For more details of GA 
and SA implementation, refer to (Li et al. 2002; Li and McMahon 2007; 
Reddy et al. 1999). 

A classic PSO algorithm was inspired by the social behavior of bird flocking and 
fish schooling (Kennedy and Eberhart 1995). Three aspects will be considered 
simultaneously when an individual fish or bird (particle) makes a decision about 
where to move: ( 1 ) its current moving direction (velocity) according to the inertia of 
the movement, ( 2 ) the best position that it has achieved so far, and (3) the best 
position that all the particles have achieved so far. In the algorithm, the particles 
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form a swarm, and each particle can be used to represent a potential disassembly 
plan of a problem. The velocity and position of a particle (disassembly plan) will be 
computed below: 

V' +i = w * V\ + Cl * Rand(l) * ( P\ - X') + c 2 * Rand(l) * (V - X') (17) 

X' +1 =X' + V' +1 (18) 

Xj = (Xn,X i2 ,...,X iN ) (19) 

V t = (ViuV i2 , . . . ,V iN ) (20) 

Here, i is the index number of particles in the swarm, t is the iteration number, 
and V and X are the velocity vector and the position vector of a particle, respec- 
tively. For an N-dimensional problem, V and X can be represented by N particle 
dimensions as formulas Eqs. 14 and 15 show. P t is the local best position that the 
i th particle has achieved so far; P g is the global best position that all the particles 
have achieved so far; W is the inertia weight to adjust the tendency to facilitate 
global exploration (smaller w) and the tendency to facilitate local exploration to 
fine-tune the current search area (larger w); Rand(l) returns a random number in 
[0, 1]; and c x and c 2 are two constant numbers to balance the effect of P { and P g . 

In each iteration, the position and velocity of a particle can be adjusted by the 
algorithm that takes the above three considerations into account. After a number of 
iterations, the whole swarm will converge at an optimized position in the search 
space. A classic PSO algorithm can be applied to optimize the disassembly plan- 
ning models in the following steps: 

1. Initialization: 

• Set the size of a swarm, e.g., the number of particles “ Swarm_Size ” and the 
max number of iterations “ Iter_Num .” 

• Initialize all the particles (a particle is a disassembly plan DP) in a swarm. 
Calculate the corresponding indices and objective of the particles according 
to formulas Eqs. 12, 13, 14, 15, and 16 (the result of the objective is called 
fitness here). 

• Set the local best particle and the global best particle with the best fitness. 

2. Iterate the following steps until “Iter_NunT is reached: 

• For each particle in the swarm, update its velocity and position values. 

• Decode the particle into a disassembly plan in terms of new position values 
and calculate the fitness of the particle. Update the local best particle and the 
global best particle if a lower fitness is achieved. 

3. Decode the global best particle to get the optimized solution. 

However, the classic PSO algorithm introduced above is still not effective in 
resolving the problem. There are two major reasons for it: 
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• Due to the inherent mathematical operators, it is difficult for the classic PSO 
algorithm to consider the different arrangements of operations, and therefore the 
particle is unable to fully explore the entire search space. 

• The classic algorithm usually works well in finding solutions at the early stage of 
the search process (the optimization result improves fast), but is less efficient 
during the final stage. Due to the loss of diversity in the population, the particles 
move quite slowly with low or even zero velocities, and this makes it hard to 
reach the global best solution. Therefore, the entire swarm is prone to be trapped 
in a local optimum from which it is difficult to escape. 

To solve these two problems and enhance the capability of the classic PSO 
algorithm to find the global optimum, new operations, including crossover and 
shift, have been developed and incorporated in an improved PSO algorithm. Some 
modification details are depicted below: 

1. New operators in the algorithm: 

• Crossover. Two particles in the swarm are chosen as parent particles for a 
crossover operation. In the crossover, a cutting point is randomly determined, 
and each parent particle is separated as left and right parts of the cutting point. 
The positions and velocities of the left part of parent 1 and the right part 
of parent 2 are reorganized to form child 1 . The positions and velocities of 
the left part of parent 2 and the right part of parent 1 are reorganized to form 
child 2. 

• Shift. This operator is used to exchange the positions and velocities of two 
operations in a particle in a random position so as to change their relative 
positions in the particle. 

2. Escape method. 

During the optimization process, if the iteration number of obtaining the same 
best fitness is more than 10, then the crossover and shift operations are applied to 
the best particle to escape from the local optima. 

A general diagram to show the above flow is shown in Fig. 9. 


Case Studies on LCD Televisions 

Televisions can be generally classified into five groups: CRT, LCD, PDP, OLED, 
and RP. The LCD televisions have been developed quickly over the past decades, 
and they are now sharing the biggest market (e.g., the global market figures for the 
LCD televisions are forecasted to surpass $80 billion in 2012; Ryan et al. 201 1). An 
LCD television produces a black and colored image by selectively filtering a white 
light. The light is typically provided by a series of cold cathode fluorescent lamps 
(CCFLs) at the back of the screen, although some displays use white or colored 
LED. The LCD televisions studied here are produced by the Changhong Electronics 
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Fig. 9 The general workflow of the PSO-based disassembly plan optimization 

Company, Ltd. from China, which is the biggest television producer in China. The 
company provides information about LCD televisions of the type of LC24F4, such 
as the bill of materials (BoMs), the exploded view, the mass of each part, and the 
detailed assembly processes. The structure of the LCD television is shown in 
Fig. 10a, b. The typical exploded view of an LCD television is shown in (c). 
As shown in (d), an LCD television is typically assembled by three main parts: 
front cover assembly part, back cover assembly part, and base assembly part. All 
feasible disassembly sequences for these three parts are generated in the following 
section. 


Solution Space Generation 
Base Assembly Part 

The base assembly part of the LC24F4 LCD television is shown in Fig. 11. It is 
composed of nine parts: (A) metal fixing plate, ( B ) metal washer 1, (C) metal washer 
2, ( D ) top metal support, (. E ) cylindrical metal support 1, (F) cylindrical metal 
support 2, (G) toughened glass seat, (//) steel plate, and (/) rubber gasket. The 
space interference matrices to represent the base assembly part in six directions are 
shown below: 
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d 

Complete machine 

— Front cover assembly part (1) 

— Surface cover (1-1) 

— Remote control receiver board (1-2) 
— Control buttons board (1-3) 

— Main board (1-4) 

— Power supply board (1-5) 

— LNB board (optional) (1-6) 

— DVD rom (optional) (1-7) 

— Back cover assembly part (2) 

— Base assembly part (3) 


Fig. 10 The LC24F4 LCD television and its structures (a) Front view of the LCD television 
framework (b) Back view of the LCD television framework (c) Typical exploded view of the LCD 
television structure (d) Part of the BoMs of the LCD television 



Fig. 11 The base assembly part of the LC24F4 LCD television: (a) base assembly part, (b) 
components A, B, C, (c) components D, E, F, and (d) components G, H, I 
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After combining the above six matrices and using Boolean operator “OR” in 
rows, the obtained result is as follows: 




A 

B 

c 

D 

E 

F 

G 

H 

/ 

Result 

A 

- 000000 

000010 

000010 

000010 

000010 

000010 

111110 

111110 

111100“ 

111110 

B 

000001 

000000 

000010 

000010 

000010 

000010 

111110 

111110 

111100 

linn 

C 

000001 

000001 

000000 

000010 

000010 

000010 

111110 

111110 

111100 

linn 

D 

000001 

000001 

000001 

000000 

111101 

111101 

111101 

111101 

000000 

111101 

E 

000001 

000001 

000001 

111110 

000000 

111101 

000001 

000001 

000000 

111111 

F 

000001 

000001 

000001 

111110 

111110 

000000 

111110 

111110 

111100 

111111 

G 

111101 

111101 

111101 

000010 

000010 

000010 

000000 

111101 

000001 

111111 

H 

111101 

111101 

111101 

000010 

000010 

000010 

111110 

000000 

111100 

111111 

I 

. 000000 

000000 

000000 

000000 

000000 

000000 

000010 

111100 

000000 _ 

111110 



3424 


G. Jin and W. Li 



Fig. 12 The front assembly part of the LC24F4 LCD television: (a) front assembly part, (b) 
components J, K, L, M, (c) components N, O, P, Q, and (d) components R, S, T 


Based on the developed matrix analysis algorithm, there are a total of 918 fea- 
sible disassembly sequences for the base assembly part. 


Front Cover Assembly Part 

The front cover assembly part of the LC24F4 LCD television is shown in Fig. 12. It is 
composed of 11 parts: (/) control button, ( K ) power switch, (L) side loudspeaker, (M) 
control receiver board, (N) positive loudspeaker, (O) power supply board, (P) main 
board, ( Q ) metal board, ( R ) metal mounting plate, (, S ) surface frame, and (T) LCD screen. 

The space interference matrices to represent the front cover assembly part in six 
directions are shown below: 
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After combining the above six matrices and using 
rows, the obtained result is shown below: 


Boolean operator “OR” in 
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Based on the developed matrix analysis algorithm, there are a total of 7096320 
feasible disassembly sequences for the front assembly part. 
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Fig. 13 The back cover assembly part of LCD television 


Back Cover Assembly Part 

The back cover assembly part of an LC24F4 LCD television is composed of three 
parts: (U) back cover, (V) cover plate, and (W) support (shown in Fig. 13). 

The space interference matrices to represent the back cover assembly part in six 
directions are shown below: 
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After combining the above six space interface matrices and using Boolean 
operator “OR” in rows, the combined matrix is as follows: 



U V W 

000000 110111 101111 

111011 000000 100000 

011111 010000 000000 


Result 

linn 

111011 

011111 


Based on the developed matrix analysis algorithm, the number of feasible 
disassembly sequences for the back cover assembly part is 4. 



95 Life Cycle Management of LCD Televisions - Case Study 


3427 


Table 1 Comparison between our developed method and others 

Our developed method: 450 x 7096320 x 4 = 2.6058e +1 ° (all feasible disassembly sequences) 
Others: 23! = 23 x 22. . .2 x 1 = 2.5852e +22 (all disassembly sequences) 

Searching range reduced: 2.5852e +22 /3776400 = 9.9209e +n times 


Fig. 14 The component/ 
material composition of the 
LCD television 



■ Metal 

■ Plastic 

■ PCB 

■ LCD screen 

■ Glass 

■ Loudspeaker 

■ Others 


Based on the above analysis, the developed solution space generation approach 
can find that the value of all the feasible disassembly sequences of the LC24F4 LCD 
television is 2.6058e +1 ° = 918 x 7096320 x 4(base assembly part x front cover 
assembly part x back over assembly part). Compared with all disassembly 
sequences, which is 23! = 23 x 22. . .2 x 1 = 2.5852e +22 , the searching range for 
a disassembly planning algorithm to find the optimized disassembly sequence of the 
LC24F4 LCD television is reduced to about 9.9209e +n times (shown in Table 1). All 
the results from the above have been generated using the algorithm in MATLAB 
language. It is obvious that the developed approach can dramatically reduce the 
searching range and obtain all feasible disassembly sequences of the LC24F4 LCD 
television, which can be used as a solution space to support our developed 
PSO-based selective disassembly planning method to achieve better economic 
value and environmental protection requirements within an acceptable runtime. 


Selective Optimizations and Comparisons 

The mass of the LC24F4 LCD television is 5963.8 g, and the main component/ 
material composition is shown in Fig. 14, in which the percentage is represented in 
terms of the ratio of mass. Among the component/material compositions, the PCBs 
(printed circuit boards, which are mainly main boards and power supply boards) 
and LCD screens are quite complex. Other components/materials include cables, 
wires, pins, switches, and rubbers. The cables, wires, pins, and switches consist of 
plastics that are usually polyvinyl chloride (PVC) and nonferrous mainly copper 
(Cu) and aluminum (Al). 
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Fig. 15 The disassembly constraint graph 

Based on the BoMs of the LC24F4 LCD television, the process of disassembly can 
be planed. Figure 15 is used to represent the constraints of the disassembly plan and 
called the disassembly constraint graph. Except for the disassembly constraint graph, 
there are several other methods to represent the disassembly constraints, such as 
disassembly tree, state diagram, and and/or graph (Lambert and Gupta 2005). In the 
graph, nodes represent operations and arcs represent the precedence constraint 
relationships between operations. Meanwhile, each operation is defined with several 
properties, such as disassembly operation number, disassembly operation time, 
component(s) (name, amount, and mass) to be disassembled by each operation, and 
potential recovered component(s)’ mass, value, and hazardousness. Firstly, one of the 
disassembly plans of the LC24F4 LCD television is (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 
13, 14, 15, 16, 17, 18, 19, 20). This plan is called “an initial plan” to be used in the 
following scenarios for the comparisons with an optimized plan for a better under- 
standing of the optimization process. Table 2 lists the properties of the disassembly 
process according to the disassembly operation number. 

Scenario 1 for Selective Optimization 

It is aimed to determine a selective optimization disassembly plan (part of the full 
disassembly plan) to meet the environmental protection targets (100 % hazardous- 
ness removal and 75 % component disassembled for the whole WEEE) and achieve 
the optimized potential recovery value (all the three weights in formula Eq. 15 were 
set 1). The input data is shown in Table 2. 

In Fig. 16a, the disassembly planning selection and optimization process is 
shown. During the computation process, results were normalized, i.e., the index 
result of each operation was converted as the percentage of the overall results of all 
the operations. The results in the Y-axis were also accumulated for the operations. 
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Table 2 Disassembly operations and some properties of the LC24F4 LCD television 






Potential 


Disassembly 

Time 



value 

Hazardousness 

operations 

(s) 

Components 

Mass (g) 

(Yuan) 

removal 

1. Unscrew and 

864 

Base part 

1.8 

0.0119 

Low 

remove base part 


M4xl2 

1.6 

0.0106 


2. Unscrew and 

864 

[4xlOBTECh 

11.2 

0.0739 

Low 

remove cover plate 


Cover plate 

23.0 

0.1840 




3xl0KTHCh 

0.6 

0.0004 


3. Remove back cover 

43.2 

Support 

15.6 

0.1248 

Low 

part 


structure 




4. Disassemble back 

21.6 

Back cover 

723.8 

1.7904 

Low 

cover part 


Insulation 

board 

25.0 

0.2280 


5. Remove wire with 

864 

Wire with pin 

50.0 

0.1000 

Low 

pin 






6. Remove power 

43.2 

Power switch 

5.0 

0.0100 

Low 

switch part 


part 




7. Remove control 

43.2 

Control button 

3.7 

0.0050 

Low 

button part 


Control button 
part 

5.5 

0.0050 


8. Unscrew and 

129.6 

Main board 

196.0 

0.7908 

Relatively high 

remove main board 


M3x8GB/ 
T90744 

3.0 

0.0021 




Insulating 

washer 

3.0 

0.0100 


9. Unscrew and 

864 

Loudspeaker 

60.0 

1.3000 

Low 

remove loudspeaker 


part 




part 


M3x8GB/ 
T90744 

2.0 

0.0040 


10. Unscrew and 

86.4 

Power supply 

118.0 

0.6466 

Medium 

remove power supply 


board 




board and insulating 
board 


Insulating 

board 

25.0 

0.1520 




M3x8GB/ 
T90744 

0.5 

0.0033 




M4x8GB/ 
T90744 

0.6 

0.0004 


11. Unscrew and 

864 

Metal support 

183.0 

1.2078 

Low 

remove metal support 


M4x8GB/T818 

24 

0.0158 


12. Unscrew 

864 

[4x8BTHCh 

7.2 

0.0475 

Low 



Clamping bush 

24.0 

0.1584 


13. Remove 
loudspeaker 

[43.1 

Loudspeaker 

77.8 

0.0600 

Low 

14. Remove remote 

21.6 

Remote control 

3.0 

04000 

Medium 

control receiver board 


receive board 





(continued) 
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Table 2 (continued) 


Disassembly 

operations 

Time 

(s) 

Components 

Mass (g) 

Potential 

value 

(Yuan) 

Hazardousness 

removal 

15. Separate surface 

21.6 

Surface frame 

270.8 

1.1000 

High 

frame and LCD screen 


LCD screen 

2900.0 

9.6684 




Metal 

mounting plate 

639.0 

1.2170 


16. Disassemble 

64.8 

Power switch 

5.0 

0.0100 

Low 

power switch part 


Power wire 

75.5 

0.1000 




Wire with pin 

5.0 

0.0100 


17. Disassemble 

64.8 

Loudspeaker 

152.0 

0.6000 

Low 

loudspeaker part 


Support 

95.0 

0.0200 




Washer 

2.0 

0.0070 




[4x8BTHCh 

24 

0.0158 


18. Disassemble base 

864 

Metal washer 1 

10.0 

0.0660 

Low 

part 


Metal washer 2 

10.0 

0.0660 




Metal fixing 
plate 

15.0 

0.0990 




M4xl2GB/ 

T818 

24 

0.0158 


19. Disassemble brace 

864 

Metal support 

25.0 

0.1650 

Low 

part 


Plastic support 
1 

30.0 

0.2400 




Plastic support 
2 

20.0 

0.1600 




M4xl2GB/ 

T818 

24 

0.0158 


20. Disassemble seat 
part 

64.8 

Toughened 
grass seat 

150.0 

0.3300 

Low 



Steel plate 

50.0 

0.0640 




Rubber gasket 

20.0 

0.0200 



The hazardousness removal, weight removal, and potential recovery value for 
the initial plan and an optimized plan are shown in (b), (c), and (d), respectively. In 
(b), a 100 % hazardousness removal target will be achieved after 13 disassembly 
operations for the optimized plan. In (c), a target to achieve 75 % component 
disassembled by weight (of the total weight of the WEEE) took 6 operations for the 
optimized plan. In (d), the result of potential recovery value divided by spent time 
for each operation is shown, which is a target to achieve the most potential recovery 
value within the shortest time. To meet the environmental protection targets of 
removing 100 % components with hazardous materials and 75 % components by 
weight to be disassembled, the first 1 3 disassembly operations were selected from 
the optimized plan as the selective optimized plan. Meanwhile, the potential 
recovery value and spent time for this plan were optimized in this selective plan. 
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a 


Optimisation Process 


b 


Hazardousness Removal 



Iterations 

The disassembly planning optimisation process 
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Fig. 16 Disassembly planning optimization with customizable decision-making models (all 
weights are 1) 

In (b) and (c), it is shown that the initial plan will take 15 disassembly operations 
to achieve 100 % hazardousness removal and also 15 operations for 75 % compo- 
nents by weight to be disassembled. Therefore, 15 operations are necessary to 
achieve the environmental protection targets. Therefore, the optimized plan will 
have 2 less operations. The potential value/time in (d) can be separated and 
interpreted in (e) and (f). It shows that with the selective optimized plan, the 
potential recovery values during the disassembly process are 86.7 % (of the total 
potential value of all the disassembled components in the WEEE) for 1 3 operations 
and 38.8 % and 85.8 % for the initial plan after 13 and 15 operations, respectively. 
With the selective optimized plan, the time spent during the process was 62.7 % 
(of the total time spent to disassemble the WEEE) for 13 operations and 69.4 % and 
77.6 % for the initial plan after 13 and 15 operations, respectively. 
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3 Hazardousness Removal 


C/) 

<*> 



Hazardousness removal during disassembly 


Spent Time 
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Operations 

Spent time during disassembly 


b Weight Removal 



d Potential recovery value 


CD 



Operations 


Potential recovery value during disassembly 


Fig. 17 Disassembly planning optimization with customizable decision-making models (weights 
are 1, 0.5, 1) 


Therefore, if the first 13 operations are selected for both plans, it can be observed 
that significant potential value is recovered (86.7 % vs. 38.3 %) while less time 
spent with the optimized solution (62.7 % vs. 69.4 %). If the first 13 operations and 
15 operations are selected for both plans respectively, a better potential recovery 
value (86.7 % vs. 85.8 %) while about 15 % time of the total disassembly time can 
be saved with the optimized solution (62.7 % vs. 77.6 %). Fifteen percent labor time 
of disassembling a single set of LCD WEEE stands for 200 s and about 6 h for 
100 sets of the LCD WEEE. 

Scenario 2 for Selective Optimization 

It is aimed to prioritize the environmental protection targets (100 % hazardousness 
removal and 75 % component disassembled for the whole WEEE) (the weights for 
the hazardousness index and weight removal index in formula Eq. 15 were set 1 and 
the weight for potential recovery value 0.5). The input data is shown in Table 2. 

In Fig. 17a, a 100 % hazardousness removal target will be achieved after 
10 disassembly operations for the optimized plan with this weight setting. In (b), 
a target to achieve 75 % component disassembled by weight (of the total weight of 
the WEEE) took seven operations for the optimized plan with this weight setting. 
Therefore, 10 disassembly operations are needed for the selective optimized plan, 
compared to 13 operations in scenario 1. In (c), the time spent for the 10 operations 
is 50.0 % of the total time for the WEEE, which can be compared to the related 
results in scenario 1, which were 62.7 % and 69.4 % of the total time spent to 
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disassemble the WEEE for the optimized plan with all the weights set to 1 and the 
initial plan for 13 operations, respectively. In (d), the potential recovery value is 
77.4 % of the total potential value of the WEEE for this setting, while the potential 
recovery values are 86.7 % and 38.8 % of the total potential value of all the 
disassembled components in the WEEE for the optimized plan and the initial 
plan in scenario 1, respectively. It can be clearly observed that with the prioritized 
considerations of hazardousness and weight removal, less operations and time are 
needed accordingly, while the potential recovery value has to be traded off (from 

86.7 % to 77.4 %). 


Summary 

WEEE has been increasingly customized and diversified, and the selective disas- 
sembly planning of WEEE to support remanufacturing decision-making is an 
important but challenging research issue. In this paper, an effective selective 
disassembly planning method has been developed to address the issue systemati- 
cally. The characteristics and contributions of the research include: 

• Space interference matrix has been used to represent the space interference 
relationship of each component in six directions in a Cartesian coordinate system 
for WEEE. By this way, all the space interference relationships between com- 
ponents of WEEE can be digitally recorded and can be analyzed in the next step. 

• A matrix analysis algorithm has been developed to obtain all the feasible 
disassembly sequences of WEEE by analyzing the six space interference matri- 
ces in a 3D environment. It is capable to obtain all the feasible disassembly 
sequences of WEEE, and the result can be used as a solution space to support a 
disassembly planning method to achieve better economic value for WEEE 
within an acceptable runtime. 

• An improved PSO algorithm-based selective disassembly planning method with 
customizable decision-making models has been developed. In the method, the 
customizable decision-making models embedded with adaptive multi-criteria to 
meet different stakeholders’ requirements have been designed to enable the 
method flexible and customizable in processing WEEE effectively. 

• Based on the intelligent optimization algorithms, the developed method is capable 
to process complex constraints for different types of WEEE based on a generic and 
robust process and achieve selective optimized disassembly plans efficiently. 

• Industrial cases on the LC24F4 LCD television have been carried out to verify 
the effectiveness and generalization of the developed research. Different appli- 
cation scenarios and targets have been set to validate and demonstrate that this 
research is promising for practical problem solving. 

Future work will include developing an intelligent automated selective disas- 
sembly system with industrial robotic manipulator, cameras, and sensors for WEEE 
such as LCD televisions. 
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Abstract 

This paper seeks to address an approach called the social network analysis 
method (SNAM) to evaluate the effect of resource scalability on networked 
manufacturing system. Considering the case of networked manufacturing mode, 
we have proposed a framework of SNAM for generating the collaborative 
networks. The collaborative networks have been obtained by transferring the 
input data in the form of an affiliation matrix to the UCINET and Netdraw 
software packages. Subsequently, we have conducted various tests to analyze the 
collaborative networks for finding the network structure, size, complexity and its 
functional properties. In this paper, a social network based greedy k-plex 
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algorithm has been applied to evaluate the scalability effect on different data sets 
of networked manufacturing system. Experimental studies have been conducted 
and comparisons have been made to demonstrate the efficiency of the proposed 
approach. 


Introduction 


Recently scalability in a manufacturing system is considered as an area of research 
for enhancing the techniques and methodologies to meet the challenges of the 
emerging manufacturing paradigm. In other words, manufacturing systems seal- 
ability can further enhance the manufacturing systems operations by providing 
further optimization. According to the nature of the undertaken problem, we have 
considered scalability as “the design of a manufacturing system and its machines 
with adjustable structure that enable system adjustment in response to market 
demand changes,” (Koren 2010). However, most of the existing manufacturing 
organizations still exhibit rigid organizational structures and their deterministic 
approach cannot support the above mentioned requirements. Researchers’ attention 
to a large extent have been focused on an alternative to the traditional manufactur- 
ing system which can meet high flexible manufacturing operations. Several next 
generation manufacturing systems such as holonic manufacturing systems 
(Valckenaers et al. 1994), fractal factory (Okino 1993), networked manufacturing 
systems and bionic manufacturing systems (Ueda 1993; Liu et al. 2002) can be 
conceptualized as a network of elements that are adaptable to environmental 
changes in particular when market demand causes turbulent fluctuations. Over the 
past few years, many studies focused on the newly emerged manufacturing para- 
digm, networked manufacturing or network based manufacturing, which has the 
capability to achieve the requirements and functionalities of global manufacturing 
(Zhou et al. 2010). 

Networked manufacturing encapsulates the information and knowledge from 
product design to manufacturing which enables resource sharing between geo- 
graphically distributed enterprises to achieve competitive advantages that would 
be difficult to attain with an individual enterprise (Wiendahl et al. 2007). 
Networked manufacturing has the ability to change its production mode from 
make-to-stock to make-to-order. Due to the customized manufacturing environ- 
ment and competition of delivery times between different manufacturing enter- 
prises, the objective of resource scalability and its effect on the manufacturing 
system is becoming a critical task. Many manufacturers nowadays try to further 
push optimizing the performance of the manufacturing system by implementing the 
scalability function. Different issues related to scalability and its relationship with 
some of the critical features of recent manufacturing systems are adaptability, 
flexibility, reconfigurability, etc. (Putnik et al. 2013). A detailed literature review 
on scalability as an area of research on manufacturing systems is detailed in 
(Wasserman 1994). 
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Although many approaches and models have been developed in the recent past, 
we have identified that there is still a need to address some more issues particularly 
relevant to the networked manufacturing systems. First, in networked manufacturing 
systems (NMS) the structure of the network does not affect the performance of the 
manufacturing system. In other words, the network structure has no influence on 
analyzing the performance of the manufacturing system. With the obtained networks 
from social network analysis method (SNAM) on manufacturing systems, there is a 
possibility to analyze the scalability and its performance on the manufacturing system 
(Newman 2002). Second, in NMS, the size, scope, and complexity of the network are 
not defined. On the contrary, with the SNAM on NMS, there is a possibility to define 
the network size and its functional properties such as centrality measures and network 
complexity in a much better way (Mendes et al. 2004). Moreover, there is hardly any 
information regarding the communication flow inside the network structure and the 
descriptive statistics that can be used to extract some information about the speed/ 
nature of the structure. Various types of topologies and how these topologies affect 
the search space for exploiting the desired solution is discussed in (Neukum and 
Ivanov 1994). In their work, authors have presented the descriptive statistics such as 
average distance, diameter, and distribution sequence of various topologies and found 
that the series of statistics directly affects the performance of the topologies. 

However, much work has been done on a wide range of problems ranging from 
natural phenomena to military (Lu and Hamilton 1991; Crovella and Bestavros 
1996; Roberts and Turcotte 1998; Zhang et al. 2013). A framework to predict the 
missing quality of service values of the manufacturing services by combining social 
network and collaborative filtering techniques is presented (Newman and Park 
2003). However, there is limited work that has applied a social network kind of 
analysis on manufacturing problems, in particular NM-problems. In order to give 
voice to the challenge, in this paper we have analyzed the existing NMS with social 
network method (SNM) to find the reconfiguration effect of various performance 
measures of the system. The detailed description of the analysis, method, and 
framework is presented in later sections. 

Since efficiency is a significant part of networked manufacturing problems, the 
proposed methodology and its characteristics better serves the existing traditional 
networked manufacturing approach in many ways. The fundamental difference 
between social networks and non-social networked systems with two important 
properties are discussed in (Newman 2005). First, the degrees of adjacent vertices 
in networks are positively correlated in social networks but negatively correlated in 
most other networks (Watts and Strogatz 1998). Second, high levels of clustering 
are possible with social networks, whereas in many non-social networks clustering 
would be expected on the basis of pure chance (Heddaya 2002). In this paper, a case 
in the context of networked manufacturing is taken. Later, we have shown how a 
manufacturing execution system data can be extracted and viewed as a network 
connected with a number of nodes. Later, we map the attributes of the manufactur- 
ing system as elements of connecting nodes and the connections between the 
elements act as interactions where the actual material flows on different resources. 
Moreover, a framework has been developed and a social network analysis method 


3442 


V.K. Manupati et al. 


has been conducted to find the effect of resource scalability and its effect on 
networked manufacturing system. 

The remainder of this paper is organized as follows. In section “Problem 
Description,” we give a detailed description of a case with the basic assumptions. 
In section “Framework of the Proposed SNAM Approach,” we presented a frame- 
work and the logical steps of the execution of a case with proposed SNM. The 
detailed SNAM to find the functional properties of the network has been discussed 
in section “Social Network Analysis Model.” The scalability feature to the 
networked manufacturing system has been introduced and with the help of clique 
based social network algorithm the time scale has been measured and its results are 
presented in section “Scalability with Social Network Analysis Algorithm.” The 
paper concludes with section “Conclusion and Future Work” which suggests the 
directions of the future work. 


Case Study 

We consider a customized manufacturing environment where different customers order 
multiple products and it is denoted as n. Each product corresponds to a different 
sequence of operation steps and set of alternative process plans. Consequently, the 
products with alternative process plans constitute different operations which are to be 
processed on a set of alternative machines. However, in networked manufacturing 
environment, the machines with different capabilities are distributed geographically to 
perform various operations of the products. Hence, the transportation time between the 
two corresponding machines take part as a significant role for assignment of machines 
to production tasks. Due to alternative process plans, machines, and operations 
sequences the problem is much more complex and it is considered as a challenging 
problem in today’s manufacturing environment. As part of our objective in this paper, 
network size, scalability, and modularity are considered as one such performance 
measure to conduct SNAM to find the effect of resource scalability on the above 
mentioned networked manufacturing system. The above mentioned problem makes 
several assumptions that are worth highlighting, (a) Products preemption is not 
allowed; (b) The operation of a product on a machine should not be interrupted until 
it is finished; (c) We have considered the transportation time between the machines. 
With this kind of manufacturing system, after an immediate completion of the opera- 
tion of a job on a machine it is immediately transported to the succeeding machine for 
its process, (d) A machine can handle only one product at a time, (e) All machines, 
products with operations and process plans are simultaneously available at time zero. 


Framework of the Proposed SNAM Approach 

It is evident from the literature that not much work has been done on finding the 
scalability issues in networked manufacturing systems particularly by considering 
SNAM. In order to respond to the aforementioned issues, it is necessary to conduct 
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Fig. 1 Scalability with 
respect to resouce and 
capacity (Heddaya 2002) 


▲ 



Resources 


further optimization for economic stability. From Fig. 1 the details of scalability 
and its influence in relation with capacity and resources on system behavior are 
shown. The X-axis in the graph represents resources and Y-axis represents capacity, 
where resources and capacity increase the graph follows a straight line which means 
the mentioned performance measures are linearly increasing. 

In this paper, we try to prove the system stability with the help of a case by 
increasing the resource scalability. Moreover, we try to find the linear pattern by 
increasing the resources scalability to find the effectiveness of the proposed SNAM 
model. In the study, we propose a step by step process of SNAM and its imple- 
mentation on a networked manufacturing problem with a framework as illustrated 
in Fig. 2. The flowchart has been divided into three steps: (1) network modeling, 
(2) social network analysis method, and (3) evaluation of manufacturing system by 
considering. 

Details of the collected data and the description of the method steps are elabo- 
rated in the following sections. 


Social Network Analysis Model 

The method shows, how the manufacturing execution data can be extracted and 
viewed as a network with nodes. However, it is very difficult to get the real world 
data of process planning and scheduling problem. Therefore, we have used input 
data from (Zhou et al. 2010) for conducting various tests with SNAM to obtain 
different characteristics of the network. The SNAM is categorized into two steps: 
(a) network modeling, and (b) network analysis. The detailed description of these 
two steps is mentioned in the following sections. 
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Fig. 2 Framework to find the resource scalability effect on networked manufacturing system 


Network Modeling 

A network consists of a set of nodes connected with ties indicating interaction 
(Newman 2005). This section presents how the manufacturing system execution 
data can be conceived as networks. The collected data from the literature is listed as 
an affiliation matrix, whose rows and columns represent the attribute (machines, 
jobs, operations, alternative process plans) information. The detailed case and its 
description is shown in Table 1. 

Later, the matrix is analyzed using the modeling algorithm in the Ucinet software 
package and for visualizing, the obtained results are submitted to the Netdraw software 
package. The obtained network from the affiliation matrix is called a collaboration 
network. This collaboration network is more interesting and meaningful than the 
simple network in terms of its characteristics, size, etc. The above procedure has 
been repeated for the remaining scenarios and obtained different collaboration net- 
works. The collaborative network and its details are depicted in Fig. 3. 

The nodes in the network represent different attributes of the manufacturing 
system and for distinguishing each attribute we mentioned nodes with different 
colors. For example, in Fig. 3 the nodes with blue color indicate different opera- 
tions, and the red-accent color indicates different machines participating in 
performing the task. We showed each different job node with a different color. 
Before analyzing the network, there is a need to run some preliminary analysis to 
describe the overall nature of the network. As part of that, we map the network with 
the elements in the manufacturing system. The detailed description of the prelim- 
inary and the detailed statistical analysis of the obtained networks are specified in 
the following section. 
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Table 1 Input data for the 6x6 problem (Zhou et al. 2010) 


Job 

PP 

01 

02 

03 

04 

05 

06 

J1 

PP1,1 

11,2} 

{3,4,5} 

{6} 




[6 5] 

[7, 6, 6] 

[8] 

PP1,2 

{1,3} 

{2,4} 

{3,5} 

{4,5,6} 



[4, 5] 

[4, 5] 

[5, 6] 

[5, 5, 4] 

J2 

PP2,1 

{2} 

{1,3} 

{2, 4,6} 

{3,5} 

{2,4} 

{4,6} 

[4] 

[2, 3] 

[4, 3, 5] 

[2, 4] 

[3,4] 

[3,5] 

J3 

PP3,1 

{2,3} 

{1,4} 

(2,5) 

{3,6} 



[5, 6] 

[6, 5] 

[5, 6] 

[6, 5] 

PP3,2 

Ul 

{3,4} 

{5} 




[9] 

[8, 8] 

[9] 

PP3,3 

{2,3} 

{4} 

{3,5} 

{4,6} 

{2,4} 


[7, 6] 

[7] 

[4, 6] 

[5, 5] 

[6, 4] 

J4 

PP4,1 

{1,2} 

{3,4} 

{6} 




[7, 8] 

[7, 6] 

[9] 

PP4,2 

{1,3} 

{2} 

{3,4} 

{5,6} 



[4, 3] 

[4] 

[4, 5] 

[3, 5] 

J5 

PP5,1 

Ul 

{2,4} 

{3} 

{5} 

{4,6} 


[3] 

[4, 5] 

[4] 

[3] 

[5,4] 

PP5,2 

{2,4} 

{5} 

{3,6} 




[5, 6] 

[7] 

[9, 8] 

J6 

PP6,1 

{1,2} 

{3,4} 

{2,5} 

{3} 

{4,5} 

{3,6} 

[3, 4] 

[4, 3] 

[5, 3] 

[4] 

[4, 6] 

[5,4] 

PP6,2 

{1,3} 

{2,3} 

{2,4} 

{6} 



[4, 4] 

[5,6] 

[6, 7] 

[7] 

PP6,3 

{1,2,3} 

{4,5} 

{3,6} 


[3, 5, 8] 

[7, 10] 

[9, 9] 


Network Analysis 

The goal of the network analysis is to reveal the information of the structure of the 
collaboration networks for potential synergies. In order to obtain the information of 
the structure, important properties of descriptive statistics such as average distance, 
diameter, and modularity of different networks have been tested. Moreover, with 
the probability distribution the size and complexity of the network has been 
identified. 

Distance of a Network 

In this paper, we have submitted the input data to the Ucinet and then obtained the 
results of average distance for different sets. Based on properties of the network and 
their descriptive statistics, i.e., average distance, it is clear that the graph structure 
and its complexity increases with an increase in resources. In Table 2 below, the 
measures of average distance for different scenarios is shown, which depend on the 
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Opr 6 


Mac 6 


Fig. 3 Collaborative networks of 6 by 6 problem 


Table 2 Scenarios used in 
the study and the associated 
graph statistics 


Data 

Average distance 

6 by 6 

1.343 

6 by 10 

1.353 

6 by 14 

1.417 

6 by 18 

1.475 

6 by 22 

1.512 

6 by 26 

1.554 

6 by 30 

1.598 

6 by 34 

1.624 


information about the structure and the speed of communication flow. The average 
distance measures the average number of edges between any two nodes where the 
average number of cycles of influence is needed to broadcast information through- 
out the graph. 

Complexity Analysis 

Once a network is generated, it can be proven to be complex if the connections 
between the work systems follow a well known power law distribution. In this 
section, the mathematics behind the power law distribution and its implementation 
to the obtained network has been presented. 

The probability distribution function for a normalized degree centrality of the 
collaborative networks follows power law and it is represented in Eq. 1 . 


p(x) = Cx~ a 


( 1 ) 
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Where, C is constant and a is an exponent and the value of a is assumed as zero 
with a > 0. We observe that while x approaches zero, the probability of x diverges. 
Hence, there must be some lowest value at which the power law function should be 
obeyed. From the descriptive statistics the value of x min is found for normalization. 
Normalizing the constant C and its solution in Eq. 1 gives: 



OO 

r* 


oo 

r* 


p(x)dx = C 


Ami 


X 


— (X) 


dx 


mm 


u 
Ami 


min 





— (X~\~ 1 


OO 


- Ami 


min 




l)x 


a— l 
min 




a — 1 


-Enin 




a = 1 + n 



-tm in 




In Eq. 2, we can observe that the value of x changes to x min with a > 1, 
otherwise the right side of the equation would diverge. If the value of a > 1, then 
Eq. 2 gives Eq. 3. Thus the correct normalized expression of power law is 
represented in Eq. 4 and we have plotted the power law distribution p(x) in a 
log-log graph with the exponent a with the quantities x h i = 1 . . .n are the observed 
values of parameters x and x min . 

Figure 4 shows, the log-log plots of different data sets with normalized distri- 
bution of the connections strength, i.e., the number of connections the nodes occurs. 
From the plots it is evident that the distribution obeys the power law having 
observed exponents of a = 1.628 to 1.855 for 6 by 6 to 6 by 35 of all data sets. 
For in-depth analysis of the power law distribution one can refer to (Newman 

2005). 


Fig. 4 Power law 
distribution of 6 by 6 with 


1.628 
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Table 3 Comparison of different scenarios and their different performance measures 


Data 

Average 

distance 

Size of the network 
(power law) 

Scalability (greedy K-plex 
(clique) algorithm) 

Modularity 

6 by 6 

1.343 

1.628 

Scalable 

0.007 

6 by 10 

1.353 

1.670 

Scalable 

0.008 

6 by 14 

1.417 

1.691 

Scalable 

0.022 

6 by 18 

1.475 

1.712 

Scalable 

0.058 

6 by 22 

1.512 

1.731 

Scalable 

0.076 

6 by 26 

1.554 

1.743 

Scalable 

0.084 

6 by 30 

1.598 

1.823 

Scalable 

0.102 

6 by 34 

1.624 

1.855 

Scalable 

0.122 


Scalability with Social Network Analysis Algorithm 

The proposed SNAM is implemented with a case that is expressed with different 
manufacturing scenarios. Different tests were performed on several graph sizes and 
data sets, by fixing the number of multiple jobs to six and by increasing the number 
of resources to each data set. 

Table 3 shows the different performances of generated collaborative graphs, 
whose size complexity increases with the increase in resource scalability. In this 
work, we try to find the increase in number of cliques according to size of the graph. 
Thereby, we use social network based greedy k-plex algorithm to find the scalabil- 
ity with respect to time complexity. From Table 3, column 4, it is clear that the 
number of cliques increases rapidly with the size of the graph and thereby the time 
complexity, i.e., O(n) to execute the graph also increases. An algorithm for 
detecting community structure of social networks based on priority knowledge 
and modularity is used. Column 5 in Table 3 clearly depicts the increase in 
modularity with increase of resources in the data. 


Summary and Future Work 

The paper presents a social network analysis method (SNAM) that can evaluate the 
effect of resource scalability in the context of a networked manufacturing system. 
To prove the effectiveness of the proposed method we have described a case with 
various complex scenarios. More importantly, we have defined a conceptual model 
with the help of a framework that fulfills the desired objective. In particular, 
solutions to queries involving SNAM such as “How can the collaborative networks 
be acquired from the manufacturing execution data?”, “How can the size of the 
network and its functional properties be extracted?”, and “How can the extracted 
properties influence the behavior of the network?” are provided. 
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To implement the above proposed method, first the manufacturing execution 
data has to be converted into an affiliation matrix to be inputted to the UCINET 
software package. Later, with the obtained results, the Netdraw software package 
has been used to generate the collaborative networks. It is critical to conduct 
various tests such as scalability tests and complexity analysis on the network to 
identify the characteristics, nature, and size of the collaborative network. Moreover, 
we have mapped the structure of the network with the attributes in the manufactur- 
ing system. Essentially, with different experimental settings, the effect of resource 
scalability on a networked manufacturing system has been tested on different 
performance measures. To validate the role of scalability and to find the effective- 
ness of the proposed methodology we used a social network based greedy algorithm 
to see whether different scenarios follow the same pattern. Results from Table 3 
clearly show as the resources increase, the size, complexity, and modularity of the 
graph increase, thus following a linear increment in time complexity. 

In future work, one can find the gain in performance measures that can be 
achieved with resource scalability. Moreover, some issues which are critical for 
manufacturing system designs such as cost of sharing resources (contention), 
diminishing returns at higher loads (saturation), and negative return on investment 
(coherency delays) can be identified with the proposed method. 
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Abstract 

Job shop scheduling problem (JSSP) is a typical scheduling problem that aims to 
generate an optimal schedule to assign all the operations to the production equip- 
ments. JSSPs can be categorized into single objective JSSP (SOJSSP) and multiple 
objective JSSP (MOJSSP) based on the optimization objectives considered. 
SOJSSP involves generating schedules to allocate operations to different machines 
considering only one objective, while MOJSSP considers more than one objective 
in the scheduling process. SOJSSP and MOJSSP are typical NP-hard optimization 
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problems which have significant values in real production. Intelligent Water Drops 
(IWD) is a new type of meta-heuristics which shows excellent ability of solving 
optimization problems. In this research, IWD is improved and customized to solve 
SOJSSP and MOJSSP problems. Experiments have been conducted, and the results 
show that the enhanced algorithms can solve these two types of problems better 
compared with current literature. To the best of the authors’ knowledge, this is 
among the first research employing IWD for solving SOJSSP and MOJSSP. 


Introduction 

Scheduling is an optimization process of allocating limited resources or machines 
over time to perform a set of tasks while satisfying multiple constraints and goals. 
Scheduling plays an important role in the manufacturing realm. It can be used by 
high-level production planning systems to check their capacity; it also provides 
visibility of future plans in the job shops for the suppliers and customers to adjust 
their actions; it can be used to evaluate the performance of job shop personnel and 
management; besides, it can provide greater degrees of freedom to avoid future 
problems (Aytug et al. 2005). Scheduling is well recognized by the academia as 
well as the practitioners, and it has been extensively studied in recent years. 

In a job shop, machines or resources are structured according to the processes 
they perform, where machines with the same or similar material processing capa- 
bilities are grouped together to form work-centers. The machines are usually 
general-purpose machines that can accommodate a large variety of part types. A 
part moves through different work-centers based on its process plan. Normally, job 
shops are most suitable for small lot size production (Chryssolouris 2006). 

There are many advantages of job shop processing, and these advantages 
become more obvious when there is greater variety in the jobs, and these jobs 
have different processing sequences. This research focuses on job shop scheduling. 
The advantages of job shop scheduling are as follows: 

(1) Each operation can be assigned to a machine to achieve the best production rate 
or the best quality. 

(2) The load can be distributed to the machines evenly. 

(3) It is easier to accommodate machine breakdowns. 

The scheduling problem is proven to be typically NP-hard; the computation time 
increases exponentially with the problem size. It is time consuming to search for an 
optimal solution in the huge solution space, especially when the problem is com- 
plex. Therefore, JSSP is among the most difficult (Reza and Saghafian 2005). 

JSSP is a typical NP-hard optimization problem which is difficult to find the exact 
solution within a reasonable computation time (Garey et al. 1976). There are two 
widely used approaches for solving JSSP, namely, the exact methods and the heuristic 
methods. Exact methods, such as mathematical approaches and dynamic programming, 
are computationally intensive and can only solve small-scale problems. Heuristic 
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methods are used to find near-optimal solutions within limited computational time. 
They usually aim to find a “good” solution instead of an optimal one. Meta-heuristics 
are high-level heuristics. For solving difficult combinatorial optimization problems, 
meta-heuristics has been proven to be one of the most powerful heuristic approaches 
(Hartmann and Kolisch 2000). The most popular meta-heuristics used to solve the JSSP 
in recent years include the Tabu search method (TS) (Pezzella and Merelli 2000), 
genetic algorithm (GA) (Hong et al. 2009; Pan and Han-Chiang 2009; Vilcot and 
Billaut 2008; De Giovanni and Pezzella 2010), simulated annealing (SA) (Suresh and 
Mohanasundaram 2006), ant colony optimizer (ACO) (Blum and Sampels 2004; Seo 
and Kim 2010), shifting bottleneck (SB) (Balas and Vazacopoulos 1998), artificial 
neural networks (ANN)(Adibi et al. 2010), and particle swarm optimization (PSO) 
(Zhang et al. 2009; Lin et al. 2010; Sha and Lin 2010; Ge et al. 2008). 

In 2007, another promising meta-heuristics called intelligent water drops (IWD) 
algorithm was proposed (Shah-Hosseini 2007). IWD algorithm is the most recent 
swarm-based nature-inspired optimization algorithm. IWD algorithm has found 
successful applications in several optimization problems, such as the travelling 
salesman problem (TSP) (Shah-Hosseini 2007, 2009a), robot path planning prob- 
lem (Duan et al. 2008, 2009), ft-queen puzzle (Shah-Hosseini 2009a), and the 
multidimensional knapsack problem (MKP) (Shah-Hosseini 2009a, b). The exper- 
imental results of these research work demonstrate that the IWD algorithm is very 
promising for solving optimization problems, and more research is required to 
improve its efficiency or/and adapt it to other engineering problems. 

In this research, the OIWD algorithm is successfully customized to solve the 
SOJSSP and MOJSSP. To the best of the author’s knowledge, it is the first research 
work on the application of the IWD algorithm to solve SOJSSP and MOJSSP. In this 
research, the OIWD algorithm is improved through five schemes, namely, (1) diverse 
soil and velocity initialization is employed to increase the diversity of the solution 
space; (2) conditional probability computation scheme is designed to further improve 
the diversity of the solution space; (3) bounded local soil update is proposed to make 
full use of the guiding information and control the convergence rate of finding a path; 
(4) elite global soil update is proposed to retain the good information of the results 
obtained; and (5) a combined local search is used for improving the search quality. The 
enhanced IWD algorithm is employed to solve the SOJSSP and MOJSSP. The quality 
and the efficiency of the enhanced IWD algorithm are tested in the experiments. 

The rest of the paper is organized as follows: section “Problem Formulation” 
presents the problem formulation. Section “Solution Methodologies” presents the 
solution methodologies. Section “Experimental Evaluation” describes experimen- 
tal evaluation. Section “Conclusion” concludes the paper. 


Problem Formulation 

In a JSSP problem, a set of machines M — {Mj I j — 1, 2, . . ., m) and a set of 
jobs J — [Ji I i = 1,2, . . ., n } are considered. Each job has a sequence of operations 
O = [O k \k =1,2,...,/}, and these n jobs (i.e., all the operations of these n jobs) 
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have to be processed on m machines. Job splitting is not allowed, and the operations 
are non-pre-emptible, which means temporary interruption of an operation is not 
allowed after it has started. Each machine only performs one operation at a time, 
and each operation is performed only once on one machine. 

JSSP aims to find a feasible assignment (schedule) of all the operations on the 
given machines with optimized objectives. Depending on the goals of the decision 
makers, different objectives are used. In the single objective JSSP problem, only 
one criterion (objective) is considered, while multi- objectives are considered in the 
MOJSS problem. In this paper, makespan is the objective considered for SOJSSP. 
Unlike SOJSSP, more than one objective is explored simultaneously in the 
MOJSSP as merely considering one objective is not sufficient for some situations. 
The optimization goal for MOJSSP is to find a set of best compromising solutions in 
the form of alternative trade-offs instead of generating a single optimum. Simulta- 
neous consideration of several objectives in MOJSSP is more challenging. 

The optimization goals for SOJSSP and MOJSSP considered in this research are 
as follows: 


Makespan (C max ): This is the time interval between the time at which the 
schedule begins and the time at which the schedule ends. Thus, the makespan 
of a schedule is equal to max [C z ], where i = 


1 , . . ., m. 


Tardiness (T z ): The tardiness T z of a job / z is the nonnegative amount of time by 
which the completion time exceeds the due date d h 7) = max[0, (C z — d z )]. The 
difference between the completion time and due date for each job. 

Mean flow time (F): This is the average flow time of a schedule, and it is defined 

E yi 

1 Fj . This criterion implies that the cost is directly related to 

the average time to process a single job. The flow time (F t ) is also referred to as the 
cycle time. It is the amount of time job / z spends in the shop floor. It is the time 
interval between the release time r z and the completion time C z of job / z : F, = C z — r z . 


Solution Methodologies 

In this section, the original IWD (OIWD) is customized to solve SOJSSP and 
MOJSSP. An overview of the OIWD algorithm is first given in section “Overview 
of the OIWD Algorithm.” The schemes for improving the OIWD algorithm are 
presented in section “Schemes for Improving the OIWD Algorithm.” A brief 
description of the disjunctive graph is then given in section “Problem Representa- 
tion Using Modified Disjunctive Graph” as the IWD algorithm for scheduling is 
represented on the disjunctive graph in this research. The enhanced IWD algorithm, 
EIWD for SOJJP, is presented in section “Enhanced IWD Algorithm (EIWD) for 
SOJSSP,” and the proposed MOJSS-IWD algorithm for MOJSSP is introduced in 
section “Modified IWD Algorithm Based on Scoring Function for MOJSSP.” The 
MOJSS-IWD algorithm is based on the IWD algorithm and a score function is 
embedded into its local search process. 
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Overview of the OIWD Algorithm 

The IWD algorithm is inspired by the movement of natural water drops which flow 
in rivers, lakes, and seas. It is a population-based meta-heuristics where the IWDs 
construct a better solution through cooperation with each other. This algorithm can 
be applied to solve optimization problems (Shah-Hosseini 2009a). As pointed out 
by Shah-Hosseini, a stream can find an optimum path considering the conditions of 
its surroundings to reach its ultimate goal, which is often a lake or a sea. In the 
process of reaching for the destination, the water drops and the environment 
react to each other as the water drops move through the river bed. The water 
drops can change the environment (river beds) in which they are flowing; the 
environment can also influence the moving directions of the water drops. The 
gravitational force of the earth powers the IWDs moving toward the destination. 
If there are no barriers or obstacles, the IWDs will move in a straight path to the 
destination. However, in the real scenario, as there are different types of obstacles 
when IWDs are forming their paths, the real path of the IWDs may be different 
from the ideal path. In a river path, many twists and turns (meanders) can be 
observed. However, by considering the distance to the destination and the environ- 
mental constraints, the constructed path seems to be an optimal one 
(Duan et al. 2008, 2009). 

In the OIWD algorithm, the IWDs are associated with two attributes, namely, the 
amount of soil and the velocity of the IWDs. The velocity enables the water drops to 
transfer soil from one place to another. Faster water drops can gather and transfer 
more soil from the river beds. Besides, the velocity of the IWDs is also affected by 
the path condition. The amount of soil in a path has impact on the IWDs’ soil 
collection and movement. A path with less soil allows the IWDs to move faster 
along that path, and the IWDs can attain a higher speed and collect more soil from 
that path, while a path with more soil is the opposite. 

In the IWD algorithm, the movement of IWDs from the source to the destination 
is performed in discrete finite-length time steps. When an IWD moves from one 
location to the next one, the increase in its velocity is proportional (nonlinearly) to 
the inverse of the soil of the path between the two locations, and the soils of the 
IWDs increase because the IWDs remove some soil from the path they travel. The 
soil increase is in inverse proportion to the time needed for the IWDs passing 
between the two locations. The time duration to travel from one location to the 
second location depends on the distance between these two locations and the 
velocity of the IWDs. In the OIWD algorithm, the undesirability of a path is 
reflected by the amount of soil in the path. When an IWD has to choose a path 
among several candidate paths, it would prefer an easier path, i.e., a path with less 
soil than with more soil. The IWDs select a path based on a probabilistic function. 
The IWD algorithm uses a parameterized probabilistic model to construct solutions, 
and the values of the parameters are updated in order to increase the probability of 
constructing high-quality solutions. 

The IWD algorithm has been tested using several standard optimization bench- 
mark problems. It can find good solutions for TSP (Shah-Hosseini 2007, 2009a), 


3456 


S.H. Niu et al. 


and it can also solve robot path planning (Duan et al. 2008, 2009), the ft-queen 
puzzle (Shah-Hosseini 2009a), and the MKP (Shah-Hosseini 2009a, b) with optimal 
or near-optimal solutions. 

In this research, the OIWD algorithm is further improved with five schemes; 
the rationale behind the proposed schemes is to increase the diversity of the 
solution space as well as improve the search quality of the IWDs. The detailed 
description of the schemes is given in section “Schemes for Improving the OIWD 
Algorithm.” 


Schemes for Improving the OIWD Algorithm 

As a meta-heuristic algorithm, IWD suffers from two problems, viz., (1) it has an 
earlier convergence and (2) the initial solution and the diversity of the solution 
space often affect its search quality. The OIWD algorithm is enhanced through five 
schemes to form the EIWD for solving SOJSSP and the MOJSSP-IWD to solve 
MOJSSP in this research. 

(1) Scheme 1: Diverse Soil and Velocity Initialization 

In the OIWD algorithm, all the edges are set with the same amount of initial 
soil, and all the IWDs have the same initial velocity. In the modified algorithm, 
the initial amount of soil of each edge is randomly set, and the initial velocity 
of every IWD is also randomly chosen. This different initial soil and 
velocity setting provides the modified IWD algorithm with a diverse initial 
solution space. 

(2) Scheme 2: Conditional Probability Computation 

When an IWD is at node i in the disjunctive graph, the probability of choosing 
node j is represented by p™ D (j). The OIWD algorithm computes this probability 
based on the soil on the edges. In the EIWD algorithm, to increase the convergence 
speed of the IWDs, namely, the speed of finding a best path, the probability is 
computed based on the soil of the edges and the processing time p t ( j) of the 
candidate nodes (operations). To further improve the diversity of the search pro- 
cess, a piecewise function (Eq. 1) is employed to determine this probability 
(conditional probability computation). A random number cp dec E (0, 1) is used to 
determine the method to be used for computing the probability. cp dec is compared 
with (p o = 0.5; the probability of choosing node j is determined by comparing the 
results of cp dec and cp$. rn E (0, 1) is a random number to add randomness to the 
probability, r is a variable which represents the relative importance of the soil of 
the edge to the processing time of the next operation, its default value is 1, which 
indicates these two variables are equally important. The rationale behind 
the conditional probability computation lies in broadening the possible solution 
search space 
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e s = 0.01 is to prevent a possible division by 0. By repeatedly applying the above 
rule, each IWD builds its own path. 


(3) Scheme 3: Bounded Local Soil Update 


The soil updating model is one of the most important components of an IWD 
algorithm. To make full use of the guiding information and controlling of the 
convergence rate of finding a path, a bounded soil-updating model is proposed. 
This model differs from the soil-updating model in the OIWD algorithm by applying 
a lower and an upper bound to the soil-updating process. Let Asoil max and Asoil min be 
the upper and lower bound values of the soil changes when the IWDs pass through 
any edge in the disjunctive graph. The lower bound (a small positive constant) 
prevents the algorithm from slow convergence, while the upper bound prevents the 
algorithm from getting to the local optima too quickly. More precisely, the edge (/, j) 
soil updating and IWD soil updating use the following formulas: 


! (1 - p L ) * soil(/J) - /? L Asoil min if Asoil(/J) < Asoil min 

(1 - p L ) * soil(/J) -/? L Asoil max if Asoil(i,j) > Asoil max (2) 

(1 — p L ) * soil(zj) — y9 L Asoil(/J) otherwise 

( soil /M) + Asoil min if Asoil (ij) < Asoil min 

soil"™ = < soil"™ + Asoil max if Asoil(/J) > Asoilmax (3) 

[ soil"™ + Asoil (ij) otherwise 

soil"™ represents the soil that an IWD carries. p L = 0.9 is the local soil-updating 
parameter. The upper bound and the lower bound of the soil updating are set based 
on the value of the soil on the edge and the soil in the IWDs. 

(4) Scheme 4: Elite Global Soil Update 

In the OIWD algorithm, only the soil in the best iteration solution S 1B is updated. 
In the modified algorithm, the path of the best iteration schedule S IB and the paths 
corresponding to the IWDs in an elite IWD group are also updated. By doing so, 
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better information in each iteration is retained. This elite group contains 7V e]ite IWDs 
which solutions are among the best A e i ite in all IWDs. The number of elite IWDs is 
determined by a coefficient oc%, and the number of elite IWDs, N e i ite , is calculated 
as A^eiite = * Niwd- When an iteration is completed, the soil on the disjunctive 

graph is updated using Eq. 4. 


soil ( i , y ) = (1 +Piwd) * soil (i,j) 


1 


PlWD 


(/- 1 ) 


soilf 75 


W(i,j)eS eh ^,ke[l...N elke ] 


(4) 


In Eq. 4, p mD is a global soil updating parameter, and / is the number of 
operations in each job. For the k- th IWD in the elite group, soil^^ is the soil it 

carries andS 611 ^ is its corresponding schedule, a is set by the author; a larger a leads 
to more information being retained on the path of the IWDs. Through this scheme, 
more information of the latest iteration can be used to increase the search efficiency. 


(5) Scheme 5: Combined Local Search 


After an iteration, all the IWDs can find feasible schedules. A scheme, which is a 
local search that combines both breadth and depth searches, is proposed to improve 
the obtained feasible schedules. The combined local search methods for SOJSSP 
and MOJSSP are introduced in detail in sections “Enhanced IWD Algorithm 
(EIWD) for SOJSSP” and “Modified IWD Algorithm Based on Scoring Function 
for MOJSSP,” respectively. 


Problem Representation Using Modified Disjunctive Graph 

To implement the IWD algorithm for SOJSSP and MOJSSP, the JSSP is represented 
as a modified disjunctive graph G dis which resembles rivers as in the IWD algorithm. 
A disjunctive graph G dis = < A, C, D > can be used to represent a JSSP (Yamada 
2003; Balas 1969). A disjunctive graph G dis has a node set A, a disjunctive edge set D, 
and a conjunctive edge set C. Each operation has a corresponding node in the node set 
N. Besides, N contains two dummy operations (source node and sink node) with zero 
processing time. For the conjunctive edge set C, it contains directed edges connecting 
the neighbor operations of the same job. Such edge links can represent the precedence 
constraints of the / operations of the same job. The disjunctive edge set D contains 
undirected edges which connect consecutive operations processed on the same 
machine. These edges are undirected ones against each other, which represent the 
unsolved precedence of the operations. Both the disjunctive edges and conjunctive 
edges emanate from the operation nodes, and their lengths represent the processing 
times of the operations where they emanate. Thus, the lengths of the outgoing edges 
from the same node are the same. 

In the schedule construction process, one direction of the disjunctive edge pairs 
should be determined in order to change each undirected disjunctive edge to a 
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Table 1 The processing 
time (in unit time) for each 
operation (3 x 3 job) 



1st operation 

2nd operation 

3rd operation 

Job 1 

86 

60 

10 

Job 2 

68 

28 

38 

Job 3 

33 

67 

96 


Table 2 The machine 
to process each operation 
(3x3 job) 



1st operation 

2nd operation 

3rd operation 

Job 1 

2 

3 

1 

Job 2 

2 

1 

3 

Job 3 

3 

1 

2 



Fig. 1 Disjunctive graph for the 3x3 job described in Tables 1 and 2 


directed conjunctive edge. In this research, the IWDs will follow the next node using 
the probability of the next node that is calculated using IWD algorithm. The 
processing order of all the conflicting operations that require the same machine is 
determined by fixing the directions of all the disjunctive edges and a complete 
schedule is obtained. The optimization objective is the length of the longest path 
(critical path) in the newly constmcted graph. This path is acyclic with the source 
node as the start node and the sink node as the ending node. For a 3 x 3 job indicated 
in Tables 1 and 2, its disjunctive graph representation is shown in Fig. 1. There are 
two dummy nodes, namely, s and t, which represent the source node and the sink 
node, respectively. Each operation is represented by a node, and the nodes in one row 
form a job, e.g., the nodes in the first row (node 1 1, node 12, and node 13) represent 
job 1, and the node (operation) with number 12 ( 0 12 for ease of representation) 
represents the second operation of job 1. The operations belonging to the same job are 
connected by the conjunctive edges according to their processing order. The first 
operation of each job is connected to node s, while the last operation of each job is 
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Fig. 2 Modified disjunctive graph for the 3x3 job in Tables 1 and 2 

connected to node t. The operations which are performed on the same machine are 
connected by disjunctive edges. For example, 0 12 , 0 2 3 , and 0 31 are connected by 
disjunctive edges as they are performed on machine 3. 

In this research, the objective is to solve the SOJSSP and MOJSSP depicted in 
the disjunctive graph using the modified IWD algorithm. The disjunctive graph 
depicts the environment for the IWDs, and the IWDs flow on the edges of the graph. 
Each IWD travels on the graph gradually along the edges from source to sink. After 
the completion of the iterations, all the IWDs will reach the sink. The solutions are 
represented by the edges that the IWDs have visited. 

The basic idea of the IWD algorithm is to set up a graph and let the IWDs travel 
through the graph. IWDs travel from a start node to a destination node. During the 
travel, the soils of the edges are modified as the IWDs pass through these edges. The 
soil and velocity of the IWDs are modified as well. To apply the modified IWD 
algorithm to SOJSSP and MOJSSP, a modified disjunctive graph is used. Figure 2 
shows the modified disjunctive graph for the 3 x 3 job described in Tables 1 and 2. 

Besides the edges in a standard disjunctive graph (as shown in Fig. 1), new edges 
are added (shown as dash edges). For each node (operation) O x , all the operations 
that are possible subsequent to O x in the schedule are identified. Dash edges are 
formed to connect O x and the potential subsequent operations in the schedule. For 
example, the possible subsequent operations of 0 22 in a schedule are On, 0 12 , 0 13 , 
0 23 , 0 31 , 0 32 , and 0 33 , and 0 22 is connected to 0 31 , O n , 0 12 , and 0 33 by dash 
edges (for the rest of the operations, no dash edges are formed as they are already 
connected to 0 22 .). Figure 3 shows the dash edges of 0 22 . 

In the modified disjunctive graph, two soil values are attached to each disjunc- 
tive edge, one for each direction. The IWDs choose the next edge to visit based on 
the probability calculated from Eq. 1 using the soil on the path and the processing 
time. In the modified algorithm, a group of IWDs is used. In each iteration, each 
IWD starts from node s and visits every node in the modified disjunctive graph until 
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Fig. 3 Modified disjunctive graph for the 3 x 3 job in Tables 1 and 2 {dash edges for 0 2 i ) 


it reaches node t. The path an IWD has passed produces a schedule. The visiting 
sequence of the nodes in the path corresponds to the order of the operations in a 
schedule. Each IWD will find its path on the modified disjunctive graph. 

For an IWD to select the next node to visit, the algorithm which is called G&T 
algorithm proposed by Giffler and Thompson (Giffler and Thompson 1960) is used, 
and Scheme 2, i.e., the conditional probability computation scheme described in 
section “Schemes for Improving the OIWD Algorithm,” is employed for priority 
computation. The G&T algorithm is used to ensure an active schedule is obtained. 
Assume IWD) starts from node s. The set of operations that can be scheduled is 
£1 [O n, 0 2 i, O 3 j}. 0 31 will be scheduled as it is the only operation which meets the 
requirement of the G&T algorithm. 0 3 { s successor 0 32 will be added and £2 
becomes { O n , 0 2 i , 0 32 ). 0 21 has the smallest finishing time and thus all the 
operations that are performed on machine 2 and with start time less than 0 2 i s 
finishing time will be the candidates. The candidate operations are { O n , 0 21 }. The 
priority (probability) of each candidate operation will be determined using Scheme 
2. The priority of an operation O x (either O n or 0 2 i) is correlated with (1) the soil 
on the edge between the latest scheduled operation ( 0 3 i ) and O x and (2) the 
processing time of O x . Assume O n is selected, the soil of edge ( 0 31 , O n ) will be 
updated using the Scheme 3, i.e., the bounded local soil update described in section 
“Schemes for Improving the OIWD Algorithm.” The velocity and soil of IWDx will 
be updated as well. From O n , IWDi continues its path (select next node to visit, 
update soil and velocity) until it reaches the sink node. 


Enhanced IWD Algorithm (EIWD) for SOJSSP 

To facilitate the operation of the EIWD algorithm, the JSSP is represented as a 
disjunctive graph G dis which resembles rivers as in the OIWD algorithm. The entire 
procedures of the EIWD are shown in Algorithm 1 . As shown in Algorithm 1 , the 
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EIWD algorithm contains N IWD iter iterations (Line 3-Line 21). In each iteration, 
Niwd IWDs travel from the source node to the sink node in G dis . The path of an IWD 
can produce a feasible solution (schedule). The soils on the edges where the IWDs 
pass, the soils of the IWDs, and the velocities of the IWDs are updated during the 
travelling of the IWDs (Line 6-Line 10). After each iteration, the soils on N e i ite 
IWDs’ paths are updated (Line 13-Line 15). Next, a group of best solutions S BD are 
chosen, and a combined local search is performed to further improve these solutions 
(Line 16-Line 17). After a local search, a best iteration solution S 1B is identified, 

'pn 

and the global best solution S is updated (Line 18-Line 19). After all the 
iterations, another local search is performed on S TB (Line 22). The brief descriptions 
of the functions in Algorithm 1 are presented next. 


Algorithm 1 EIWD for single objective JSSP (JSSP disjunctive graph G dis ) 
1: Initialize an IWDs group A, // A population of IWDs 
2: initialization () ; //scheme 1 
3: while (k < N IWD _ iter ) do 
4 : for (each time step t) do 
5: for (each IWD g E A which 


feasible solution has not been 


discovered) do 

6: (i, j) = 

Vel mD * = 


7 

8 
9 



( IWD a ) ; //scheme 2 


updateVelocity (Vel 


9 
IWD 


8 


) ; 


Asoil (i, j) = computeDeltaSoil ((i, j), 


soil (i, j) 


updateEdgeSoil ( ( i , j) , 


IWDg) ; 

Asoil ( i , 


j ) ) ; 


//scheme 3 


10 : soil /w ^= update IWDS oil (soil 7 ^^, Asoil(zj) ) ; //scheme 3 

11 : end for 

12 : end for 

13 : for 


14 : 


(tfelite IWDS) dO 

globalSoilPropagation ( ) ; //scheme 4 


15 : end for 


16 : 


17: S 


S BD = setupBestSolutionGroup (); 
JB =combinedLocalSearch (S BD ) ; //scheme 5 


IB 


18: updatePathSoil (S ) ; 


TB 


19 : update ( S ) ; 
2 0 : k+ + ; 


21 : 


22 : S 


end while 

TB = combinedLocalSearch ( S TB 


) ; 


(1) initialization (): This function initializes the static and dynamic parameters, 
such as the soil of each edge and the velocity of each IWD. A scheme 
(Scheme 1) is proposed to increase the diversity of the initial solution space, 
and it has been discussed in section “Schemes for Improving the OIWD 
Algorithm.” 

(2) selectNextEdge (IWDa): Lor the g-th IWD, choose the next node to visit in the 
schedule operation list according to the probability calculated. A conditional 
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probability computation scheme (Scheme 2) is designed to replace the prob- 
ability computation part in the OIWD algorithm (section “Schemes for 
Improving the OIWD Algorithm”). 

(3) update Velocity (Vel /vl/Z)? )- For the g-th IWD moving from node z to node j on 
the disjunctive graph, update its velocity as follows: 

vel /TO * (t + 1 ) = vel m (0 + — 2 - 

b v + c v * soil (/,/) 

\q\ 1wd s [t + 1) is the velocity of the IWD g after updating, soil(/, y) is the soil 
on the edge linking node z and node j , and a v , b v and c v are the updating 
parameters to ensure that the value of the velocity is increased in the 
same scale of magnitude as the original velocity. If the value of the 
velocity increase is too big, the IWDs may be trapped in the local optima; if 
the value of the velocity increase is too small, the IWDs may need more time 
to obtain a schedule. Besides, b v also guarantees that the equation is not 
divided by 0. 

(4) computeDeltaSoil ((/, /), IWD g ) For the g-th IWD, calculate the amount of 
soil that it loads from the edge (i,j) as follows: 


Asoil(z'J) 



b s + c s * time 2 (zj; \t\ IWDs 



timefz, /; vel /H/Z) 0 = — 7 Pt[J) , wn v is the time taken for an IWD to travel on 

V ’• y ’ / max ( £ v , \t\ WD s ) 

the edge (z, j) with the velocity vel /WjD? , and p t (j ) is the processing time of 
operation j. s v = 0.0001 guarantees that the equation is not divided by 0. a s , b s , 
and c s are the updating parameters to ensure that the value of the soil is 
increased in the same scale of magnitude as the original soil. If the value of 
the soil is increased significantly, the IWDs may be trapped in the local 
optima; if the value of the soil is increased marginally, the IWDs would 
need more time to obtain a schedule. Besides, b s also guarantees that the 
equation is not divided by 0. 

(5) updateEdgeSoil ((/,/), Asoil(z,y)) and updatelWDSoil (soil /WZ)g , Asoil(zJ)): 
For the g-th IWD, update the soil of the edge it traverses and the soil contained 
in the IWD. Scheme 3 is proposed to utilize the guiding information, i.e., the 
amount of soil on the path, and control the convergence rate (section 
“Schemes for Improving the OIWD Algorithm”). 

(6) globalSoilPropagation (): Update the soils of the edges included in the 
current elite IWDs’ solutions (A e ii te elite IWDs). This is Scheme 4 and has 
been discussed in section “Schemes for Improving the OIWD Algorithm.” 

(7) setupBestSolutionGroup (): This is used for local search; a solution group S BD 
is set up for recording the best N BD solutions during the local search process. 
After each iteration in the EIWD algorithm, the schedules generated using the 
Njwd IWDs are chosen to conduct the local search; these are chosen either 
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randomly or based on the quality of the IWDs, where schedules of IWDs with 
the top quality are selected. Here, the quality of an IWD is determined by the 
makespans of the schedules that it can produce, and the IWDs which paths can 
result in a shorter makespan have a higher quality. A random number cp' dec £ 
(0, 1) is used in the selection of the IWDs. cp' dec is compared with (p' 0 = 0.5. 
(p ' o is set as 0.5 as two IWD selection methods are provided. If cp' dec > qj 0 , N BD 
IWDs are randomly picked up to conduct the local search, otherwise, the top 
N B d IWDs are chosen. The parameter N BD is set by the author; a larger N BD 
leads to more IWDs being selected to conduct the local search. 

(8) combinedLocalSearch (): This is a local search method (Scheme 5) which 
combines breadth search and depth search schemes in the search neighbor- 
hood. The input of this function can be a group of solutions (as S BD ) or a single 
solution (as S ), and the output is an improved solution. 

After the IWDs for the local search have been selected, a local search is 
performed on these IWDs. As mentioned above, after each iteration in the 
EIWD algorithm, N BD IWDs are chosen to conduct the local search. To 
facilitate the local search, the neighborhood structure and the Tabu list 
designed by Nowicki and Smutnicki (1996) are adopted, where the operations 
in the critical path are exchanged. The iteration number N iter LS and the Tabu 
list size N size tabu can be specified by the decision makers based on the size of 
the job. A global solution group S BD is maintained, which keeps the best N BD 
solutions (one for each IWD) discovered during the local search. The initial 
solutions in S BD come from the solutions of the abovementioned N BD IWDs. 
Within each iteration in the combined local search, for each solution, s x 
solution in S BD , its neighborhood is searched using two search schemes, 
namely, the breadth search and the depth search. The neighborhood is gener- 
ated by swapping the first two and the last two operations in a block except the 
first and the last block on the critical path. For the first block on the critical 
path, only the first two operations are swapped, and for the last block on the 
critical path, only the last two operations are swapped. Every block on 
the critical path consists of operations being processed on the same machine, 
and the two consecutive blocks contain operations being processed on differ- 
ent machines. There are Ao epth of rounds of depth search, and A^ readth neighbors 
of the solution s x are identified within each round, and the best one is used 
to update s x . After the combined local search is completed, the best solution in 
S BD is used to update S 1B . The values of A^epth and A^eadth are determined based 
on experiments; too large or too small a value will result in low-quality results. 

(9) updatePathSoil ( S 1B ): To update the soil of the path associated with the best 
iteration solution S 1B . 

r T'f> 'TD 

(10) update ( S ): Update the global best solution S by the best iteration solution 

S TB if q ( S TB ) < q 
S IB otherwise 

defined as the makespan of the given schedule. A schedule with a smaller 
makespan is a better-quality schedule. 



. q0 is a quality function which is 


IB 


S ID using 5 


TB 
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Modified IWD Algorithm Based on Scoring Function for MOJSSP 

The objective of MOJSS is to generate feasible schedules that attempt to optimize 
several objectives, and these schedules form the Pareto optimal solution set. 
Different objectives have been studied for the MOJSS problem, and the techniques 
to handle multiple objectives can be classified into two categories: 

(1) Transform the multi- objective problem into a mono -objective problem by 
aggregating the different objectives into a weighted sum. The weighted com- 
bination of several scheduling objectives serves as the performance criterion. 
For the generated schedule s, the weighted sum fitness function F can be 

k 

represented as F = WjFj (s ) . F ( (s) is the ith criterion of schedule s, and w t 

i = 1 

is the weight of the ith criterion. It is possible that the weights among the 
criteria are known before generating the schedules, and these weights are 
usually given by the decision makers according to the situations when decisions 
are required. 

(2) Converge toward the Pareto front while achieving diversified solutions 
scattered all over the Pareto front. For the MOJSS problem, some basic 
concepts are introduced to discuss the solution methodologies: 


(a) Pareto dominance: A feasible solution x\ is said to Pareto dominate over 
another feasible solution x 2 , denoted as x\ y x 2 , if and only if 

Vje{l,2,...,n}, fj(x i) <fj(x 2 ) 

a/c€ {1,2, f k (xi)<f k (x 2 ) 

(b) Pareto optimal solution: A feasible solution x\ is said to be a Pareto optimal 
solution if and only if there is no feasible solution x 2 satisfying v 2 >- X\. 

(c) Pareto optimal set: The set containing all the Pareto optimal solutions is defined 
as the Pareto optimal set. 

(d) Non-dominated solution set: The set containing all non-dominated solutions 
obtained from a certain algorithm is defined as the non-dominated solution set. 

(e) Optimal Pareto front: The optimal Pareto front (in the objective space) is 
formed by those objective vectors corresponding to the solutions in the optimal 
Pareto set. 

In this section, the OIWD algorithm is customized to meet the characteristics 
and requirements of MOJSS, and a Pareto schedule-checking process is embedded 
into the customized IWD algorithm, which is referred to as MOJSS-IWD. 

The OIWD algorithm is modified to solve the MOJSS problem. The modified 
algorithm, MOJSS-IWD, is shown in Algorithm 2. In MOJSS-IWD, a global Pareto 
set P is maintained. For each objective considered, an external iteration cycle is 
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executed to select the best schedules. In this iteration cycle, an internal iteration 
cycle with N 1WD iter iterations is executed (Line 6-Line 26). Each internal iteration 
contains two steps, namely, (1) identification of the initial schedules and (2) a 
Pareto local search on the schedules identified in Step (1). In Step (1), N IWD IWDs 
travel from the source node to the sink node in G dis . The path generated by an IWD 
is a feasible solution (schedule). The soils on the edges where the IWDs travel, the 
soils of the IWDs, and the velocities of the IWDs are updated during the travel of 
the IWDs (Line 9-Line 13). Next, the soils on N e i ite IWDs’ paths are updated (Line 
16-Line 18). After these updates, a group of good solutions S BD are chosen, and a 
Pareto local search is performed to further improve these solutions (Line 19-Line 
20). After the execution of the Pareto local search, the best solution S IB is identified 
among the group of good solutions S BD , a dominance check is conducted, and the 

'T'f> 

global best solution S is updated (Line 22-Line 24). After the completion of the 

r T'f> 

internal iteration cycle, a new Pareto local search is performed on S (Line 27). 
The Pareto local search in MOJSS-IWD is based on a scoring function. The 
schedule that yields the lowest value based on this scoring function is selected 
during the local search. After this Pareto local search, a dominance check is 

r Tf> 

performed on S to check whether it can be added into the Pareto set P. Next, 
the Pareto set P is updated and reported as the final results. 


ALGORITHM 2 MOJSS-IWD (disjunctive graph G dis ) 

1 : Initialize a Pareto set P; // A population of IWDs 

2 : for each optimization objective do 

3 Initialize an IWDs group A; 

4: initialization () ; 

5 : k : = 0 ; 

6 : while ( k < N IWD _ iter ) do 

7 : for (each time step t) do 

8: for (each IWD g E A which feasible solution has not been 
discovered) do 

9: (i, j )= selectNextEdge ( IWD g ) ; 

10: \q[ iwd s= updateVelocity ( \el IWDg ) ; 

11: Asoil (i, j ) = computeDeltaSoil ( ( i , j) , IWD g ) ; 

12 : soil ( i , j ) = updateEdgeSoil ( (i , j ) , Asoil ( i , j ) ) ; 

13: soil /vl/Z)? = updatelWDSoil (soil /vl/Z)? , Asoil(/J) ) ; 

14 : end for 

15 : end for 


16: for ( N ellt& IWDs ) do 

17: globalSoilPropagation ( ) ; 
18 : end for 


19: S BD = setupBestSolutionGroup (); 
20: S XB =ParetoLocalSearch (S BD ) ; 

21 : updatePathSoil ( S IB ) ; 

22: dominancechecking {S IB ) ; 

23 : update (P) ; 
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24 : update ( S TB ) ; 

2 5: k++ ; 

2 6 : end while 

27: S TB = ParetoLocalSearch (S TB ) ; 

28: dominanceChecking (S TB ) ; 

2 9 : update (P) ; 

3 0 : end for 
31 : return P; 

Pareto local search and Pareto non-dominated solution set generating method are 
introduced in sections “Pareto Local Search” and “Pareto Non-dominated Solution 
Set Generating Method.” 

Pareto Local Search 

The Pareto local search combines a breadth search scheme and with a depth search 
scheme to search the solution space, where the search is based on a scoring function 
to evaluate the schedules. For each schedule, the sum of the three objective values is 
computed, and this sum serves as the score to rank the schedules. 

(a) Selection of IWDs for Performing the Local Search 

The Pareto local search is conducted for a group of solutions S BD (line 20 in 
Algorithm 2) (, S 1B =ParetoLocal Search (S BD )) and a single solution S TB (line 
21 Algorithm 2) (S TB = ParetoLocalSearch (S TB )). In case when the input of the 
Pareto local search is a group of solutions (schedules) S BD , the schedules are 
selected randomly or based on the scores of the schedules from the scoring function 
where schedules with the smallest value (a smaller score means a higher quality) are 
selected. To select schedules, a random number q?' dec E (0, 1) is used, (p’ dec is 
compared with (p' 0 = 0.5. If (p' dec > (p'o, N BD schedules will be randomly selected 
for the local search, where N BD = (p% * N 1WD . Otherwise, the top N BD schedules are 
selected. 

(b) Performing the Local Search 

After all the schedules have been selected, a local search is carried out. The 
neighborhood structure and the Tabu list designed by Nowicki and Smutnicki 
(1996) are adopted. A neighborhood is formed by exchanging the operations in the 
critical path. Based on the size of the job, the decision maker specifies the iteration 
number N iter LS and the Tabu list size N size tabu . During the local search, a global 
solution group S BD is maintained to keep the good N BD solutions (one for each IWD) 
discovered. For each solution s x in S BD during the iterations of the Pareto local search, 
two search schemes are used to search its neighborhood, namely, a breadth search and 
a depth search. To generate the neighborhood, the first two and the last two operations 
in a block are swapped except the first block and the last block on the critical path. 
Every block on the critical path consists of operations being processed on the same 
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machine, and two consecutive blocks contain operations being processed on different 
machines. Only the first two operations in the first block on the critical path and the 
last two operations in the last block are swapped. There are A^epth rounds of depth 
search, and A^readth neighbors of the solution s x are identified within each round, with 
the best solution being used to update s x . After the Pareto local search, the best 
solution in S BD is used to update S IB . Based on experiments, the value of A^E, epth and 
breadth are determined; too large or too small a value will result in low-quality 
results. 

Pareto Non-dominated Solution Set Generating Method 

The makespan (C max ), tardiness (T,), and mean flow time (F) are the objectives 
considered in this research. The aim is not to generate a single optimum, but to 
find a set of solutions which are in the form of alternative trade-offs. A Pareto 
set P is maintained to store the non-dominated schedules. During the execution 
of the MOJSS-IWD algorithm, whenever a new schedule s is generated, this 
new schedule is checked to determine whether (a) it is dominated by any 
existing schedule in P or (b) it dominates any existing schedule in P. For (a), 
s is rejected, and for (b), those dominated schedules in P are removed. 
When checking the new schedule, its scoring function is called. The value of 
the scoring function is used to conduct the dominance checking. Thus, the 
newly generated schedule s will be stored in the Pareto set P when it is not 
dominated by any schedule in the Pareto set P. After checking, P is updated and 
reported as the result. 

Experimental Evaluation 

The EIWD and MOJSSP-IWD algorithms have been implemented on a PC with an 
Intel Core 2 Duo L7700 1.8GHz CPU and 2GB RAM. Experiments have been 
conducted on the benchmark data for JSSP in the OR-Library (Beasley 1990); 
43 instances were tested among which three instances (FT06, FT 10, FT20) 
are designed by Fisher and Thompson (FT instances) (Fisher and Thompson 
1963) and 40 instances (LA01-LA40) are designed by Lawrence (LA instances) 
(Lawrence 1984). 


Experimental Evaluation of EIWD for SOJSSP 

The parameters (with their values) used in the experiments are listed in Table 3. For 
the parameters, through experiments and theoretical study, some observations can 

be obtained. For N IWD , N IWDJten breadth, ^Depth, and N BD , a larger value will result 
in better solutions but longer computation time. Trade-off values are obtained based 
on experiments for these parameters. For N size tabu , experiments show that a Tabu 
list that is too large or too small will result in low-quality results. N iter LS is set to be 
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Table 3 Parameters in the experiments for EIWD 


Parameter 

Value 

Parameter 

Value 

Parameter 

Value 

Parameter 

Value 

Niwd 

20 

N IWD_iter 

100 

N Breadth 

50 

N Depth 

200 


0.01 

£v 

0.0001 

Asoil max 

90 

Asoil m i n 

10 

N size tabu 

10 

Niter LS 

3,000 

Nbd 

10 



av 

1 

Bv 

0. 01 

Cv 

1 

as 

1 

bs 

0.01 

Cs 

1 

Pl 

0.9 

PlWD 

0.9 


Table 4 Computational results of FT and LA test instances 




TSSB 

HGA-Param 

HIA 

MPSO 


EIWD 

Pezzella and 
Merelli 
(2000) 

Goncalves 
et al. (2005) 

Ge et al. 
(2008) 

Lin et al. 
(2010) 

Average 

deviation 

0.000584148 

0.001014975 

0.003915762 

0.003151033 

0.001378487 

Relative 

average 

deviation 

1 

1.74 

6.70 

5.39 

2.36 

BKS found/ 
total 

instances 

37/43 

36/43 

31/43 

32/43 

35/43 


3,000 as experiments showed that for most cases where the value is larger than 
3,000, the results will not improve further, but the computational time is increased. 
The parameters in the last two rows of Table 3 ensure that the values of the velocity 
updated and the soil updated are changed in the same scale of magnitude as the 
original velocity and the original soil. 

The EIWD algorithm is compared with the TS algorithm (Pezzella and Merelli 
2000), GA algorithm (Gongalves et al. 2005), and the PSO algorithm (Lin 
et al. 2010; Ge et al. 2008). Table 4 shows the comparison results for FT and LA 
test instances. TSSB, HGA-Param, HIA, and MPSO are the names of the algorithms 
from Pezzella and Merelli (2000), Goncalves et al. (2005), Ge et al. (2008), and Lin 
et al. (2010), respectively. In this table, the relative average deviation represents the 
ratio of the average deviation for TSSB, HGA-Param, HIA, and MPSO with respect 
to that of EIWD. EIWD can find 37 best known solutions among the 43 instances, 
and the optimal results are better than that of TSSB, HGA-Param, HIA, and MPSO. 
From Table 4, it can be seen that the results of EIWD is closest to the best known 
solutions. Its deviation is about 1.74 times smaller than TSSB, 6.70 times smaller 
than HGA-Param, 5.39 times smaller than HIA, and 2.36 times smaller than MPSO. 
Thus, EIWD can find better results as compared with the TS Algorithm (Pezzella 
and Merelli 2000), GA (Gonsalves et al. 2005), and PSO (Lin et al. 2010; Ge 
et al. 2008). It does not only find more best known solutions (BKS) but also results 
with a smaller average deviation. 
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Table 5 Parameters in the experiments for MOJSSP-IWD 


Parameters 

Value 

Niwd 

Number of IWDs 

50 

N IWD 

Number of IWDs 

50 

N IWDJter 

Number of iterations in algorithm MOJSS-IWD 

100 

N Breadth 

Number of neighbors (schedules) generated in a single round of 
breadth local search 

50 

N Depth 

Number of rounds of breadth search in depth local search 

200 


A parameter prevents a possible division by 0 

0.01 

s v 

A parameter prevents a possible division by 0 

0.0001 

Asoil max 

Upper bound for soil changes in any edge (/, j) 

90 

Asoil min 

Lower bound for soil changes in any edge (/, j) 

10 

Nsizetabu 

Tabu list size 

40 

NiterLS 

The iteration number of local search 

3,000 

N BD 

The number of good solutions chosen to conduct a Pareto local search 

10 

a v 

IWD velocity updating parameters 

1 

b v 

IWD velocity updating parameters 

0. 01 

c v 

IWD velocity updating parameters 

1 

a s 

IWD soil updating parameters 

1 

b s 

IWD soil updating parameters 

0.01 

Cs 

IWD soil updating parameters 

1 

Pl 

Local soil updating parameter 

0.9 

P IWD 

Global soil updating parameter 

0.9 


Experimental Evaluation of MOJSSP-IWD for MOJSSP 

Experiments have been conducted using the same test instances for MOJSSP, and 
the research results are compared with the research conducted by Suresh and 
Mohanasundaram (2006). The makespan (C max ), tardiness (T,), and the mean flow 

time (F) are considered in the experiments in this research. These three objectives 
are conflicting, and achieving good results with respect to one objective may 
degenerate the results with respect to the other objectives. Suresh and 
Mohanasundaram used Pareto archived simulated annealing (PASA) to solve the 
JSSP with the objectives of minimizing the makespan and the mean flow time of 
jobs. The schedules generated by PASA does not consider the tardiness (7)) 
objective. Simultaneous consideration of the three objectives in this research is 
more challenging than considering two objectives in the case of PASA. 

The parameters (with their values) used in the experiments to test MOJSSP-IWD 
for MOJSSP are listed in Table 5. The values of the parameters are set based on trial 
and error experiments and theoretical study. The experimental results for MOJSSP- 
IWD are shown in Table 6. In Table 6, “NA” means “not applicable” and it 
indicates those cases where the number of schedules in the Pareto optimal set 
obtained by MOJSS-IWD is less than that obtained by PASA. There are a total of 
six test instances among all the 43 benchmark instances in the category that “the 
number of schedules in the Pareto optimal set obtained from MOJSSP-IWD is less 
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LA10 

12859.87 

13177.2 

LA23 

13060.34 

12849.87 

LA36 

24944.94 

25397.4 






LA09 

7909.35 

7908.2 

LA22 

12746.66 

12681.94 

LA35 

26181.99 

26724.63 




LAO 8 

24526.53 

25722.6 

LA21 

16927.14 

18083.27 

LA34 

63284.62 

63265.73 




LA07 

14948.33 

14902.13 

LA20 

25526.24 

23993 

LA33 

NA 

NA 




LAO 6 

9528.2 

9482.07 

LA19 

26471.93 

24331.3 

LA32 

NA 

NA 




LA05 

7243.1 

7229 

LA18 

28758.46 

26203.8 

LA31 

3085.37 

3102.27 




LA04 

13975.9 

13966.9 

LA17 

24489.2 

24254.6 

LA30 

26577.5 

26507.95 




LAO 3 

20716.8 

20704.3 

LA16 

40673.33 

39639.6 

LA29 

41473.35 

40833.65 




LAO 2 

14031.9 

13951.3 

LA15 

NA 

NA 

LA28 

37260.95 

37201.4 




LA01 

12094.9 

11987 

< 

H-l 

4124.7 

4191.5 

LA27 

23887.6 

23882.9 

LA40 

16742.9 

16687.73 

FT20 

NA 

NA 

LA13 

25044.8 

24982.05 

LA26 

NA 

NA 

LA39 

16469.67 

17583.4 

FT 10 

NA 

NA 

LA12 

15291.4 

13869.5 

LA25 

7530.06 

7481.07 

LA38 

19194.19 

19137.2 

FT06 

529.51 

529 

LA11 

14010.15 

14050.4 

LA24 

10874.06 

11009.07 

LA37 

43082.06 

44145.53 

Instance 

PASA 

MOJSS- 

IWD 

Instance 

PASA 

MOJSS- 

IWD 

Instance 

PASA 

MOJSS- 

IWD 

Instance 

PASA 

MOJSS- 

IWD 






































3472 


S.H. Niu et al. 


than that obtained from PAS A.” For the rest of the 43 benchmark test instances 
(37 in total), the makespan and mean flow time of their schedules are summed up 

C max + F^J . Among the 37 instances in Table 6, the MOJSSP-IWD algorithm 

outperforms the PASA algorithm for 27 test instances in terms of the quality of the 
Pareto non-dominated set, and the PASA performs better than MOJSSP-IWD for 
10 test instances. MOJSSP-IWD outperforms PASA when the sum 

C max H- F^j for the Pareto non-dominated set generated from MOJSSP-IWD 

is smaller than that generated from PASA. 

In general, the MOJSSP-IWD algorithm can generate better results than PASA. 

When the three objectives (makespan C max , tardiness T h and mean flow time F) are 
considered such that the problem becomes more challenging, it becomes more 
obvious that MOJSSP-IWD is a promising approach for solving the multi- objective 
scheduling problem. 


Summary 

In this paper, a new meta-heuristics, viz., the IWD algorithm, is enhanced to solve 
SOJSSP and MOJSSP, where the JSSP is modelled as a modified disjunctive graph 
that resembles rivers in the IWD algorithm. In this research, the OIWD algorithm is 
improved by introducing five schemes, viz., (1) diverse soil and velocity, (2) con- 
ditional probability computation, (3) bounded local soil, (4) elite global soil update, 
and (5) a combined local search. The enhanced algorithms are the EIWD and the 
MOJSSP-IWD algorithms. The optimization objective considered for SOJSSP is 
makespan in this research, while MOJSSP with the consideration of three objec- 
tives, namely, the makespan, tardiness, and mean flow time, has been studied. The 
research goal for MOJSSP is to find a set of solutions in the form of alternative 
trade-offs in the Pareto optimal set, and a new method is proposed to generate the 
Pareto non-dominance set. A scoring function and a Pareto schedule-checking 
process are embedded in the MOJSS-IWD algorithm. Experiments have been 
conducted using 43 standard benchmark instances from the OR-Library to validate 
the effectiveness and efficiency of the enhanced algorithms. 

Experimental results show that EIWD can generate better results by finding 
additional best known solutions and generate schedules with a smaller deviation 
from the best known solutions. Experimental results also indicate that MOJSSP- 
IWD can generate better results in general. Considering that the three objectives 
have been explored, the question under study is very challenging, and MOJSSP- 
IWD is a promising algorithm for solving MOJSSP. 
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Abstract 

The network-based manufacturing offers various advantages in current competi- 
tive atmosphere by way to reduce the short manufacturing cycle time and to 
maintain the production flexibility. In this paper, a multi- objective problem 
whose objectives are to minimize the makespan and maximize the machine 
utilization for generating feasible process plans of multiple jobs in the context of 
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network-based manufacturing system has been addressed. A mobile agent-based 
negotiation approach is proposed to the integration of manufacturing functions in a 
distributed manner, and the fundamental framework to support the functionality of 
the approach is presented in detail. With the help of an illustrative example along 
with varied production, environments that include production demand fluctuations 
are described, and the proposed approach has been validated. Finally, the compu- 
tational results are analyzed to the benefit of the manufacturer. 


Introduction 

Global competition renders adequate return on investment, only to those who can 
provide and develop, the innovative and intricate products with high quality, less 
process iterations, and cost effective. An effective practice observed by enter- 
prises across the globe in the past decades is to collocate the design, manufactur- 
ing, and marketing engineer teams into close physical proximity. Recent advances 
in information technology and communication technology have profoundly 
influenced the manufacturing research and its applications. In order to inculcate 
dynamism in manufacturing and its functions, a dynamic adaptive control of a 
manufacturing system is indispensable. Thus, it is essential to change the trend of 
traditional manufacturing system to integration approach. This change in phe- 
nomena in manufacturing system reduces the complexity and cost and thus 
increases its flexibility and enhances the fault tolerance. To support the above 
requirements and their functionalities, a recently emerged manufacturing para- 
digm known as networked manufacturing or network-based manufacturing has 
been adopted. 

Process planning and scheduling are the two significant functions to be engaged 
to process various operations of the jobs in a manufacturing system. These func- 
tions specify the decision maker of how, when, and in which sequence the opera- 
tions of the parts are allocated to the manufacturing resources. To realize this, some 
researchers have realized that there is a greater need to integrate both the functions 
to achieve better performance of the system. The fundamental idea of the integra- 
tion of process planning and scheduling functions has been introduced by 
Chryssolouris and Chan (1985). The abovementioned integration approach has 
been used by Khoshnevis and Chen (1990) to enhance the shop floor performance. 
Consequently, a feedback mechanism has been introduced for effective coordina- 
tion between various resources. 

The mobile agent-based approach in a distributed manufacturing environment is 
distinct from agent and multi-agent systems. Rather than having all the features of 
agent and multi-agent systems, it has also some additional character such as 
mobility attribute which helps to transport the messages throughout the network 
(Chou et al. 2010). Thus, the mobile agent gains leverage over other class of agents. 
In the present research, first, a mathematical model has been developed to present 
various performance measures of the system, and then, a framework of a mobile 
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agent-based negotiation scheme for integrating process planning and scheduling in 
a network-based manufacturing environment is proposed. The functionalities of 
these architectures are detailed in later sections. Subsequently, to verify the feasi- 
bility of a proposed approach different simulation experiments and comparisons 
have been made by varying the production demand. 

In section “Problem Description,” a detailed description of the problem with the 
basic assumptions and a developed mathematical model along with the constraints 
is given. In section “Framework of the Proposed Mobile Agent-Based Negotiation 
Scheme,” a framework of mobile agent-based negotiation scheme to integrate 
the process planning and scheduling functions is presented. Section “Negotiation 
Process” explains the mapping of negotiation-based schedule scheme. The exper- 
imentation with an illustrative example having different complex scenarios is 
illustrated, and their results are presented in section “Illustrative Example.” In 
section “Results and Discussion,” the results and their discussions are detailed. 
Finally, conclusions are drawn and future work delineated. 


Problem Description 

A series of jobs ordered by different customers are considered and denoted by n. 
Each job consists of a set of strategies which corresponds to its alternative process 
plans, and each process plan contains a series of sequential operations. Conse- 
quently, the jobs with alternative process plans are processed for different opera- 
tions on a set of alternative machines. However, in the networked manufacturing 
the machines are geographically distributive which can perform different opera- 
tions of the jobs; this can be one of the complex tasks of the present problem. 
Although, the transportation time between two corresponding machines acts as a 
crucial role for process planning and scheduling tasks. For large-scale problems, it 
is difficult to find perfect solutions in a reasonable time. Moreover, the 
abovementioned problem is further extended by increasing the number of products 
with product mix and production demand fluctuations. This extension in the 
problem leads to a much larger search space; thus, it is quite difficult to find the 
optimal/near-optimal solutions in a reasonable time. Due to flexibility in networked 
manufacturing, the integration approach has the potential to generate the near 
optimal process plans. 

Assumptions for the abovementioned problem: 

(1) Job preemption is not allowed; (2) when an operation of a job is being 
processed on a machine, it cannot be interrupted until finished; (3) each machine 
can handle only one job at a time; (4) transportation time is considered. The system 
is designed in such a way that, after an immediate completion of the operation of a 
job on a machine, the job is immediately transported to the succeeding machine on 
its process; (5) all jobs and machines are simultaneously available at the time zero. 
Based on the abovementioned problem with several assumptions, the proposed 
model and its mathematical model are represented: 
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Parameters 


N 

Total number of jobs 

K 

Total number of machines 

Gj 

Total number of alternative process plans of job j 

TJ. 

U jop 

oth operation in the pth alternative process plan of job j 

Qjp 

Number of operation in the pth alternative process plan of the job j 

Tj 

Makespan of 7 th job from the set of possible process plans 

Pvt • 

J ' l jopm 

Processing time of the operation Uj op on machine m 

Trt 

Transportation time 

M 

Any machine 

V 

Maximal number of generations 

W 

Number of generation for job scheduling game 

st (j, o, p, mi) 

Starting time of operation 7 th job and pth process plan and oth operation on 
machine 1 (one of the machines on which oth operation of pth process plan of 
7 th job is done) 

trt 

(j, p, % m 2 ) 

Transporting time for 7 th job and pth process plan from machine 1 to machine2 

wt ( j , o, p, m ) 

Waiting time of operation 7 th job and pth process plan and < 9 th operation on 
machine 

fU, t) 

Fitness function for each individual in the population 

A 

A very large positive number 

C J 

Completion time of job j 

r 

^ jopm 

Earliest completion time of operation U jop on machine m 


Decision Variables 



1 the p th flexible process plan of job j is selected 
0 otherwise 


YjopQrsm 1 the operation Uj op precedes the operation U q ps on machine m 

0 otherwise 


7. 

^ jopm 


1 if machine m is selected for Uj op 
0 otherwise 


Objectives: 

1. Maximum time required to complete all the jobs: 


Min makespan 7\ = Max 


je[l,N],oe 


1 ,Q, 


r 

^ jopm 


JP1 


,p £ 1, Gj ,m G [l,K 
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2. Maximization of machine utilization: 


m 


£ mpt k 


Max machine utilization = Ma xmu 


k = 1 


m 


£ ( mct k - mstk ) 


A— I 


Subject to: 


1. For the first operation in the alternative process plan p of job j: 




2. For the last operation in the alternative process plan p of job j : 



A(1 




< r 

_ ^ jopm 


je[l,N], P e 




3. The different operations of one job can not be processed simultaneously: 



4. Each machine can handle only one job at a time: 


r 

^ jopm 


c 


Qrsk 


T 7 t(l H - AY jopQrsm ^ P t 


j,Q£[\,N],r£ l,Q: \,o,p,S £ 1 ,G h 


P 


jopm 

,m G [1 ,K 



5. Only one alternative process plan can be selected of job j : 


J2 x jp = 1 

pe [l.Gy] 


( 7 ) 
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6. Each machine can only process one operation at a time: 



V z, 


m= 1 


jopm 



j, e [l,JV],o6 [l,Qj ],pe [1 ,Gj 




Framework of the Proposed Mobile Agent-Based 
Negotiation Scheme 

Contrary to the traditional classic message passing-based negotiation, the proposed 
mobile agent-based negotiation scheme has several great advantages (Lange and 
Oshima) (1998). It has the ability to send an agent to a target system for performing 
the job locally, thereby enhancing the flexibility of agents by exchanging huge 
number of messages. Moreover, it can adapt dynamically corresponding to the 
changed environment and also can combine with the other systems without any 
difficulty. Figure 1 shows a general representation of the application of mobile 
agent framework to system integration. The physical links can be the interactions 
between agents that interconnect each other for proper negotiation. The proposed 
framework has been broadly divided into four types of functional modules: (1) man- 
ager, (2) task agent (T-agent), (3) mobile agent (M-agent), and (4) resource agent 
(R-agent), which communicates the information among them for distributed prob- 
lem solving. The detailed functionalities of these agents have been described in the 
following sections: 


The Manager Agent 

A situation of resource-oriented bidding mechanism is assumed where machine 
control agents generate a bid based on resource capability to part order (Baker 
1998). The process starts with submission of jobs by various customers whose data, 
i.e., number of jobs, operations to be performed for each job, process plans, and 
processing time for performing each operation etc., has been stored in a shop 
database. 

Upon arrival of jobs, the knowledge manager registers all the resources in the 
shop database and reports the information to the negotiation scheduler agent 
whenever required. However, the function of negotiation scheduler is to generate 
a schedule for the initialization of negotiation process. It is the core function of the 
shop manager and one of the complicated tasks due to the presence of several 
concurrent events from distributed systems. The complete information about the 
resource data is converted into an extensible markup language (XML) based on 
ontology, and the knowledge manager interprets the information through an 
XML-parsing system to enhance the interoperability between distributed systems. 
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Fig. 1 Flowchart of mobile agent-based scheme 


The Task Agent 

The task agent encapsulates the information for negotiation, i.e., the type of 
negotiation, the goal of the negotiation process, to whom the agent has to coordi- 
nate, etc., for execution of a job submitted by the manager. Rather than encapsu- 
lating the information, the task agent has some more functions such as creating a 
mobile agent to travel through the network according to routing schedule and assign 
the bids to a bid evaluator for reviewing the best bid. After identifying the best bid, 
it is reported to the task agent. The task agent receives the bids submitted by the bid 
evaluator and sends the awarding messages to the resource agents; in response of 
the awarding message, the task agent receives the conformation message from the 
resource agent, and then the task agent is destroyed. 
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The Mobile Agent 

The actual negotiation process starts with the creation of a mobile agent, capable of 
travelling through the entire network, i.e., it acts both as a virtual controller and as a 
resource controller. Before it begins to travel, it inspects the status of the resources and 
delivers the generated bids while it travels from one resource agent to another. The 
mobile agent travels the network of resource controllers with the help of a routing 
schedule to collect the bids from resource agents. When it enters into the resource 
agent, it does not get the status of corresponding resource directly due to security 
issues. Hence, the communicator, which acts as a bridge between various agents, starts 
the conversation with the resource agent to examine the status of the resources. 
Depending upon their status, the communicator transfers the diverse information to 
the target agent. The mobile agent, after completion of its travel according to the 
specified routing schedule, reports the bid list to the task agent; then it destroys itself. 


The Resource Agent 

The resource/machine agent plays a major role in the performance of a shop floor. 
In the present problem, the machines are distributed geographically to perform 
different operations of the task agent. Moreover, it receives the awarding messages 
from the mobile agent, and in response the resource agent communicates the task 
acceptance message to the mobile agent. In the proposed negotiation scheme, only 
the task agent is concerned about the negotiation process where as the resource 
agent concentrates on providing necessary information to the part agent. The 
information of preassigned process plans, status of the resources, buffer capacity, 
etc., are stored in a resource database having information in a common database 
(XML-based) format using ontology. In case of machine failure due to tool break, 
motor failure etc., the tasks scheduled at the failed machine will have to be rerouted. 
This can be achieved through resource agent by sending the resource fault message 
to the manager who reschedules and cancels the tasks with the help of ontology 
knowledge base at its perusal. 

Negotiation Process 

Considering the above-described problem, a negotiation-based schedule scheme is 
presented and developed to realize the effectiveness of different performance 
measures. Figure 1 illustrates the logical framework of the negotiation schedule 
whose process is broadly divided into the following three kinds of phases: (1) ini- 
tialization phase, (2) routing phase, and (3) awarding phase. 

The scheduling process starts with the initialization phase; here, the bid manager 
checks whether all the resource agents are participating in the ongoing negotiation 
process or not. In the routing phase, the N-agents made by the previous phase travel 
across the resource agents to collect the bids. Thereafter, it returns back to the task 
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agent and provides the information about the collected bids from the resource 
agents for making list of bids. Later, the N-agents generate a bid according to the 
given policy of negotiation and filters out the bid by pre-evaluation. The completed 
list of bids is reported to the task agent. The third and the final stage of the 
negotiation process starts with the T-agent which evaluates the bids received 
from the N-agents, and it selects the ones that are determined to be most appropri- 
ate. With these best bids, the T-agent makes a task and offers the awarding message 
to the R-agents that process the selected bid. Then, the T-agent revises information 
related to resource of the shop database in order that the resource may take part in a 
new negotiation process. The selected R-agents assume the responsibility of exe- 
cution of the awarding task and send the task acceptance message to the T-agent. 
When the T-agent receives the message, it destroys itself, and the negotiation 
process is completed. 

Illustrative Example 

To illustrate the effectiveness and performance of the proposed approach in this 
paper, three representative instances (represented by problem n x m) based on 
practical data have been selected as a test bed to compute. Three problem instances 
(problem 6x6) are taken from (Zhou et al. 2010) and (Manupati et al. 2012) is used 
here, and these have been extended with varied production demand fluctuations. 
Table 1 shows the setting of the production demand fluctuations in which three terms 
of production are used and in which the product mix is set differently in each term. 


Results and Discussion 

In this paper, the two conflicting objectives - makespan and machine utilization - 
are chosen. In order to find the process plans and scheduling plans for minimum 
makespan, one needs to run the heuristic algorithms for multiple generations. The two 
algorithms have been coded in MATLAB software, and the problem is tested on 
Intel® Core™2 Duo CPU T7250 @2.00GHz, 1 .99 GB of RAM. Figure 2 summarizes 
the Gantt charts obtained from the simulation results of the objective function, i.e., 
average makespan for nine terms. Each production environment in each term was 
simulated 20 times on average. Figure 2a-c illustrate the maximum completion time 
of all six products with different product mix which has been processed with six 
machines which are situated geographically. Figure 3 portrays percentage of machine 


Table 1 The makespan 
values, process plans, and 
job schedules for 6 x 6 
problem 


Production amount in each job 


Term 

Job 

J1 

J2 

J3 

J4 

J5 

J6 

1 

6x6 

10 

10 

10 

10 

10 

10 

2 

6x6 

10 

15 

10 

15 

10 

0 

3 

6x6 

05 

10 

15 

05 

10 

15 
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Fig. 2 Gantt charts for the average makespan values for all the three terms 
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Fig. 3 Average machine utilization for all the three terms of a 6 x 6 problem 
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that has been used effectively utilized for processing of different jobs. The obtained 
simulation results show its consistency for each term; therefore, on an average for all 
terms, 20 simulation experiments have been considered. 


Summary 

In this paper, a mobile agent-based approach has been developed to generate 
optimal process plans in the context of network-based manufacturing environment. 
A mathematical model has been developed and the analysis of different perfor- 
mance measures of the system has been observed. Illustrative examples with 
different variations have been demonstrated to conduct experimentation with the 
proposed mobile agent-based approach. Results show that the proposed approach is 
very effective for integration of process planning and scheduling problem in 
particular for distributed manufacturing environment. 
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coolant flow rates, 997 
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Cutting tools 

multipoint, 814, 824. ( See also Multipoint 
cutting tools; Single-point cutting tools) 
single -point, 814. ( See also Multipoint 
cutting tools; Single-point cutting tools) 
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2454, 2461, 2463 
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Dedicated manufacturing systems (DMS), 

375, 381, 386 
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Defects, 570, 586-587, 2418 
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Delta compliant manipulator, 2238 
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Dendritic segregation, 277-278 
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Deposition, 2707 

Deposition rate, 609-610 
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Detonation gun spraying, 2827 

Diamond robot, 2095 

Dielectric coatings, 2844-2845 
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Differential kinematics, 1811 
Diffractive optical element (DOE), 1486 
Diffusion bonding, 587-588 
Diffusion coefficient, 2918 
Digital compass, 2319-2320 
Digital thread, 2486 
Dilution, 632 

Dimensional synthesis 2102 
Direct-current, 2894-2897, 2899-2900 
Direct digital manufacturing (DDM), 

2472-2473, 2476, 2478-2479 
in aerospace, 2475 
in automotive, 2476-2478 
calibration fixture, 2478. ( See also 
Calibration fixture) 
camera mount, 2475 
Direct drive pumps, 1657 
Directed energy deposition, 2514 
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Direct hard tooling, 2537-2538 

Direct kinematics, 1789-1790 

Direct manufacturing, 2530 

Direct numerical control (DNC), 853 

Direct rapid tooling, 2541 

Direct soft tooling, 2533 

Disassembly planning, 3407, 3415, 3432 

Discharge crater, 1568 

Discharge current, 1557, 1559, 1565 

Discharge delay time, 1571-1572 

Discharge duration, 1553, 1559, 1560, 1566, 

1571, 1576-1577 
Discharge energy, 1560, 1566 
Discharge gap, 1559, 1575 
Discharge interval, 1560 
Discharge location, 1553, 1570, 1572-1573 
Discharge voltage, 1557, 1562 
Displacement modeling, 2100 
DoD Cell printing, 2578-2579 
2-DOF CDR, 2205-2207, 2211 

2- DOF orientation, 2206-2207 

3 - DOF orientation, 2207-2209 
3-DOF CDR, 2207 

Double compound linear spring mechanism, 

2248 

Double sided incremental forming (DSIF), 

416-417,441 
Dow 17, 3044-3045 
Drag finishing, 1065 
Drilling, 1644-1645 
Drilling machine, 824 
Drop-on-demand 3D inkjet printing, 

2568-2569, 2883 
Drug delivery, 2883-2885 
Dual control stick system, 2479 
Ductile cast iron, 371-372 
Ductile mode machining, 928-929 
Ductile removal modes, 1083-1085 
DYNACUT, quick milling optimization 

software, 892 

Dynamic calibration, 2114-2115 
Dynamic fuzzy neural networks, 3250. 

See also Fuzzy neural networks 
Dynamic model, 2149 
Dynamics equation, 1891-1894 

E 

Ejected scaffold, 2575 

Elastic emission machining (EEM), 1055-1056 
Elastic-plastic analysis, 3082-3087 
Electrical discharge machining (EDM), 

1553, 1556 


Electric-arc spray process, 2822 
Electric gripper, 2037 
Electric motor, 2053 

Electrochemical microfabrication, 2872-2880 
Electrochemical process, 2853 
Electrocrystallization, 29 1 8-29 1 9 
Electrodeposition, 2894, 2902-2904 
Electrohydrodynamic jet printing (E-jetting), 

2571-2573 
Electromagnet, 2039 
Electroplating, 2895-2896, 2343 
Elettric80 system, 2343 
Embedded smart sensors, 3270, 3285-3286 
End-effector, 2035-2053 
End milling, 835 
End-of-life (EoL), 3380 
Energy sustainability, 3141-3142 
Environmental impact, 3141-3142, 3315, 

3318. 3336 

Environment friendly machining, 1129, 3141, 

3315. 3318. 3336 
Errors, 1998 

E- spinning, 2571 
Eulerian simulation, 1170-1171 
Euler’s theorem, 1719-1722 
Explosion welding, 589-590 
Extended Kalman filters (EKF), 2333 
Exteroceptive sensors, 2319-2324 
digital compass, 2319-2320 
infrared and sonar, 2320-2321 
vision camera, 2321-2324 
Extrusion, 182-186, 191, 193-194, 

199, 216-220, 225, 227-228 
Extrusion foaming process, 132-134, 

2559-2560 

Extrusion systems, 2559 

F 

Face mill, 836 
Failure analysis, 3140 
Feed drives, 851-853 
Feeding, 365-366 
Fettling, 314 

Fiber orientation, 2574-2575 
FIB induced deposition (FIBID), 1409-1411 
Fibre optic gyros (FOG), 2326 
Filtration, cutting fluids, 1000-1001 
cyclone separator, 1001 
settling tank, 1000 

Finishing, 2446-2450, 2454, 2456, 2465 

Finite element analysis, 748 

Finite element method (FEM), 1159 
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Finite element simulation, 299 

Fixed-path time minimization, 1882-1883 

Fixture, 2422 

Flame, 106-107 

Flame spray, 2820-2822 

Flexible manufacturing systems (FMS), 861 

Flexible roll forming, 159, 303-306 

Flexure properties, 158-160 

Flow forming, 455 

Fluid dynamics, 354 

Fluidized ALD reactor, 2979, 2981 

Fluoride anodizing, 3037-3038 

Focused ion beam (FIB) technique, 1104 

Focused ion beam (FIB) technology, 1277, 

1392-1394, 1409-1412 
advantages, 1403-1409 
imaging, 1393-1395 
induced deposition, 1409-1411 
irradiation, 1412-1419 
sputtering, 1395-1403 
Force-closure analysis, 2181-2186, 2189 
Force control, 2058, 2184, 2186, 2189, 

2348-2349, 2353, 2355, 2367, 
2373-2374, 2376, 2380, 2382-2383, 
2398, 2447, 2449-2450, 2453, 2465 
Force-torque sensor, 2058 
Forge welding, 570 

Forging, 175-183, 191, 193-194, 197-198, 

201, 213, 215, 217, 225, 229 
Formability, 234, 238-243 
Formability in IF, 421-422 
Forward kinematics, 2139 
Forming limit diagram (FLD), 239, 2139 
Fracture properties, 163 
Fracture toughness, 908 
Frequency, 606 

Friction stir welding (FSW), 587, 2416-2418 
Friction welding, 572-574 
Fuel cells, 2880 

Functionally gradient materials (FGMs), 

2846 

Functional models, 2521-2522 
Furnace types, 314, 316, 319-321, 327 
channel furnace, 321 
coreless induction furnace, 327 
crucible furnaces, 316 
cupola furnaces, 319-322 
electric arc furnaces, 327 
electric furnaces, 320-322 
medium frequency coreless furnaces, 314 
for melting cast irons, 314, 319-326 
reverberatory furnaces, 316 
shaft furnace, 316 


Fused deposition modeling (FDM), 2472, 

2479-2480, 2560 

Fuzzy neural networks, 3249-3252 

G 

Galvanostatic mode, 2915-2916 

Gantry mounted robot, 2420 

Gas held ion source (GFIS), 1321-1323 

Gas metal arc welding (GMAW), 2409-2412 

Gas tungsten arc welding (GTAW), 604, 606, 

609, 2409-2412 

Gear and rack actuation, 2042-2044 
Generalized force, 1848-1849 
Generalized predictive control (GPC), 2119 
General 6R manipulator, 1800 
Geometric constraints, 1874, 1877-1878, 2109 
Geometrical calibration, 2109 
Geometry -independent models, 2138-2139 
Glasses, 928-929 

Global conditioning index (GCI), 2103 
Global time minimization, 1883 
Glue effect, 2973 
Gold wire, 653 
Gough platform, 2092 
Grain refinement, 383 
Grain size, 687, 2899-2900, 2919 
Grain-boundary strengthening, 687 
Graphite, 253-254 
Graphite cast iron, 372-374 
Graphitization potential, 321 
Gravity segregation, 278 
Grease removal, 3036-3037 
Green packaging, 75 
Grey cast iron, 368-371 
Grid search, 1878-1879 
Grinding wheels, 841-842, 1646, 3035 
actuator, 2053-2056 
cylindrical grinding, 842-845 
sensor, 2056-2058 
surface grinding, 841-842 
transmission mechanism, 2041 
types, 2037-2041 
Gripper, 2037, 2041, 2053, 2056 
Groove test for IF, 421-423 
Guarded tele-operated robot, 2303 

H 

HAE process, 2536, 3045 
Hardness, 585, 904, 2906, 2909-2912, 

2919-2920 
Hard tooling, 2536 
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Heat flux, 1553, 1566 
Heat input, 606 

Heat transfer and solidification, 354-360 
high pressure die casting, 358-360 
sand mold, 280, 354-357 
Heat treatment, 279-281 
Helium ion microscope (HIM), 1321, 1323, 

1327, 1331-1333 

High energy beam, 618 

High mix low volume production, 2472 

High-velocity oxy-fuel (HVOF), 2825-2826 

Holonomic WMR, 2316-2317 

Horizontal boring machine, 819 

Horizontal shaping machine, 820 

Hot isostatic pressing (HIP), 544-547 

Hot pressing (HP), 543-546 

Hot spots, 404 

Hybrid, 612 

Hybrid kinematic machine tool, 860 
Hybrid laser arc welding (HLAW), 

2415-2416 

Hybrid machining process (HMP), 1109-1124 
types, 1112 

Hydroxyapatite (HA) coating, 2499, 2578 

I 

IGES, 2499 

Incremental sheet metal forming (ISMF), 

413-420, 434 

Indirect hard tooling, 2538-2539 
Indirect rapid tooling, 2540-2541 
Indirect soft tooling, 2534-2536 
Indoor positioning system (iGPS), 2329-2332 
advantages, 2329 
applications, 2329 
landmark markers, 2331 
RFID reader, 2329 
trilateration, 2329 

ultrasonic transmitter and receivers, 2330 
wireless networks, 2331-2332 
Inductive sensor, 2056-2057 
Industrial robots, 855-288, 1885, 2349-2350, 

2355, 2360, 2367 

Industrial robot programming, 2078, 2087 
methods, 2078 
process, 2087 

Inertial measurement unit (IMU), 2325-2326 
Infrared sensor, 2320 
Ingot preheating, 280 
Injection molding, 131 
Inserts, 2541, 2545 
copper tool, 2545 


die, 2541 

hotwork steel tool, 2545 
Inspection, 586-587 
Integrated remanufacturing design, 

3209-3210 

Integrated robot vision, 2000 

Intelligent control, 2120 

Intelligent water drops optimization, 3452 

Intensifier pumps, 1656 

Intermetallic layer/Intermetallics, 717 

Inverse kinematics model (IKM), 1807, 1846, 

2110, 2141 

Investment casting process, 2535, 2544 

Ion beam deposition systems, 1338 

Ion beam etching, 1519-1520 

Ion beam etching systems, 1337-1338 

Ion beam figuring (IBF), 1278, 1343, 1344 

Ion beam instruments, 1318 

Ion beam implantation systems, 1340 

Ion beam nano-lithography instruments, 1333 

Ion beam nanomanufacturing (IBNM), 1277 

Ion beam technology, 1280 

Ion implantation, 1431 

Ion optics, 1323-1326 

IRB340 Flex Picker, 2092 

Iron, 919 

Iteration, 2151 

backward, 2151-2152 
forward, 2151 

Iteration convergence method, 1171 

J 

Jacobian analysis, 2100-2101 
Jacobian computation, 1829-1834 
forward, 1829 

Jacobian matrix, 1831, 1835 
parallel manipulator, 1834 
serial manipulator, 1829-1834 
Jacobian matrix, 1895 
JC flow-stress model, 1175 
Job shop scheduling, 3452 
Joining process, 571, 589 
Joint space control vs. Cartesian space control, 

2120-2121 

Joint speed linear control, 2116 

K 

Kinematic design, WMRs, 2304-2308 
castor wheel, 2305-2306 
centered steering wheel, 2305 
fixed wheel, 2304-2305 
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spherical wheel, 2308 
Swedish wheel, 2306-2307 
Kinematic redundancy, 1846, 1875, 1884 
Kiva system, 2341 

L 

Lagrange’s equation, 1859 
Lagrangian simulation, 1167-1169 
Land robots, 2302 
Lapping, 1053, 1054 
Laser, 612-613 
Laser beam, 2684 

Laser beam welding (LBW), 2412-2415 
Laser composite surfacing, 2662-2667 
Laser conduction welding, 621, 1605 
Laser machining, 1605 
Laser-material interactions, 1603-1604 
Laser processing, 1617 
Laser scanners, 2324 
Laser surface alloying, 2657-2262 
Laser surface cladding, 2667-2669, 2673 
Laser surface engineering, 2641, 2672 
Laser surface melting (LSM), 2651-2656, 

2658 

Laser technique, 2678 

Laser tracker, 2489-2490 

Laser transformation hardening, 2643-265 1 

Layered manufacturing, 743, 2472 

Lead- free solder, 724 

Leaf- spring compound linear spring 

mechanism, 2235 
Leap frog, 2491 
Levitrack reactor, 2985 
Life cycle assessment (LCA), 3317-3320 
Light-weight robot, 1861 
Li-ion battery, 2883 
Linear actuator, 2054 
Linear computed torque control, 2116 
Linear control, 2116-2118 
Linear cutting motion, 794-795 
Line vector, 1752 
Linkage actuation, 2042 
Liquid crystal display (LCD), 3420-3421 
Liquid metal ion source (LMIS), 1319-1321 
Lithium ion batteries (LIB), 2991 
Lithography, Galvanoformung, Abformung 

(LIGA), 2877 

Load transfer vehicle, 2343 
Long edge mill, 836 
Loose abrasive machining, 1052 
Low-temperature deposits, 2915 
Low temperature plasma, 107 


Lubricants for forming, 273-274 
Lubrication/cooling, 981. See also 

Cutting fluids 

M 

Machinability, 936 

Machine health condition (MHC) prediction, 

3249 

Machine tools, 812, 849-850, 855-860 
feed drives, 851-853 
industrial robot, 855-858 
parallel and hybrid kinematic machine 

tools, 859-860 
PKMTs, 859 
RMS, 861 

spindle drives, 850-851 
Machine utilization, 3483-3484 
Machining, 801-802, 813, 945 

heat flow in, 802 
measurands, 945 
nomenclature, 801 
Machining centres, 847-849 
Machining dynamics, 867-896 

DYNACUT, quick milling optimization 
software, 892 

input parameters/database information, 892 
long depth milling stabilization, mold 
base, 894 

overall geometry and finished product 
quality, 895 

quick milling vibration solver 
configuration, 893 
schematic representation, 870 
S45 cutting with 16 mm diameter/four 
flutes end mill, 891 

Machining economics, 934-935, 983, 1127 
Machining process, 982, 1129 

environmentally friendly machining, 1129 
regions of heat generation, 983 
Machining time, 817, 826, 834, 840 
broaching, 840 
drilling, 826 
lathe, 817 
milling, 834 

Magnesium alloys, 317-318, 388-391 
melting, 317-318 
Magnetic actuator, 2055 
Magnetic compound fluid (MCF), 1062-1063 
Magnetic field assisted finishing (MAF), 1058 
Magnetic float polishing, 1059-1060 
Magnetic fluids (MFs), 1059 
Magnetic gripper, 2039-2040 
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Magnetic recording heads, 1057, 2879 
Magneto-rheological abrasive flow finishing 

(MRAFF), 1057-1058 
Magneto-rheological finishing (MRF), 

1060-1061 

Magneto-rheological fluid-based slurry, 

1061-1062 

MAGOXID-COAT, 3046-3047 
Makespan, 3483 
Malleable cast iron, 375 
Manual tele-operated robots, 2303 
Manufacturing, solid state welding processes 

in, 569 

Manufacturing yield design, 404 
Map-based localization approaches, 

2332-2333 

Maraging steels, 250, 252 

Mass scaling, 1165 

Material and energy flows, 1130 

Material engineering, 2704 

Material extrusion, 2510-2511 

Material jetting, 25 1 1-2512 

Material removal mechanisms and modes, 

1078-1079 

Material removal rate, 903, 1040, 1556, 1576 
Materials removal behavior, 1040 
Materials selection, 255, 261 
bulk metal forming, 255-260 
sheet metal forming, 260-265 
MATLAB/SIMULINK, 1915 
Matrix exponentials, 2003 
Measurement system, 1999 
Mecanum wheel, 2307 
Mechanical gripper, 2037-2038 
Mechanical properties, 586 
Media, 2449-2450, 2456-2459, 2465 
Medium-carbon low-alloy steel, 250 
MEMs, 2989, 3453 
MEMS-based micro- actuator, 2239 
Meta-heuristics, 328, 3453 
Metal casting processes, 328 
permanent, 328 
Metal forming, 172-229, 473 
Metal inert gas (MIG) welding, 605 
Metallization, 3010-3011 
Metal nitrides, 2967, 2970-2971 
Metal oxide, 2967 

Methodology of research, 2680, 2688 
Metrology, 2488-2489, 2491 

considerations for part measurement, 

2491-2493 

destructive internal, 2489 
non destructive internal, 2488 


Microcellular foams, 142 
Micro-cutting, 1094-1096 
cutting force, 1095 
grain size, 1095 
vs. macro-cutting, 1094 
steel, 1096 

Micro-dispensing, 2577 
Micro-electrical-mechanical systems (MEMs) 

gyroscope, 2326-2327 
Micro electro mechanical system (MEMS), 

1090, 2879 

Microfabrication, 2874-2880 
Micro grooves, 1454, 1498 
Micro optical elements (MOE), 1403 
Micro-machining, 1090, 1577 
Micro-milling, 1090, 1101 
Micro-segregation, 277 
Microsoft Kinect, 2323 
Microstructure, 574, 578, 1095 
Micro tools, 1101-1104, 1473-1509 
SEM image, 1493 
Micro ultrasonic machining 
configurations, 1073 
development of, 1075-1078 
ductile and brittle removal modes, 

1083-1085 

machining of microfeatures, 1071, 1078 
material removal mechanisms and modes, 
1078-1079 

monitoring and control, 1074-1075 
SEM image, 1073 
tools, 1073 
vs. USM, 1071 

workpiece holding method, 1075 
Micro- valve dispenser, 2576 
Microwave sintering (MWS), 538-540, 

828, 1645 

Milling, 828-839, 1645 

Minimum depth of cut, 1104 

Mobile robot, 2302. See also Wheeled mobile 

robots (WMRs) 

Mobile-agent, 3476 
Modeling, 173, 210, 214 
Model predictive control (MPC), 2119 
Modular CDR, 2175-2176, 2181-2184, 

2186 

components, 2134 
design, 2132 
representation, 2133 
simulation software, 2158-2159 
Modular robot, 2132-2134, 2158 
Modulus of microcellular foams, 145 
Mold casting mold processes 
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Mold casting mold processes, disposable, 

339-349 

Mold/Die manufacturing, 311-313 
Mold filling and solidification, 349-353 
Molecular layer deposition (MLD), 2972 
Monocrystalline, 1429 
Monolithic compound linear spring 

mechanism, 2235 

Monte Carlo localization (MCL), 2333 
Morphology of Ni-Co film, 2909-2912 
Morphology of Ni films, 2906-2909 
Motion planning, 1874 
Moving robot, 2419 

Multi-bodied CDRs, 2178-2181, 2186-2197 
Multilayer thin film, 3074-3077, 3483 
Multi-objective, 3483 

Multiple objective job shop scheduling, 3452 
Multipoint cutting tools, 824, 827-828, 839 
broaching machine, 839-840 
cutting edge, 824 
dies, 827-828 
drilling machine, 824-827 
milling machine, 828-839 
taps, 827-829 

Multi stage forming, 431-432 
N 

Nanocomposite foams, 134-135 
Nano-fabrication, 1407-1409, 2571 
Nanofibers, 2571 
Nanofillers, 67, 134-136, 138 
Nano-hole fabrication, 1416 
Nanojoining, 708 

Nano machining of ion implanted materials 

(NilM), 1278 
Nanomanufacturing, 1280 
Nanomaterial reinforced composite, 708 
Nanomaterials assisted joining, 708 
Nanostructured coatings, 2846-2847, 2778 
Nanotechnology, 2778 
Neobotix system, 2344 
Netdraw software package, 3444 
Networked based manufacturing, 3475-3485 
Networked manufacturing, 2149 
Newton-Euler equation, 1859, 1863, 2149 
Nickel, 922-923 
Ni-Co films, 2906-2909 
Ni films, 2906, 2919-2920 
Nitridation, 3005-3006, 3009-3010 
Non-ferrous alloys, 254-255 
Nonholonomic WMRs, 2316-2319 
Brokett Theorem, 2318 


definition, 2317 
kinematic constraints, 2317 
posture stabilization, 2319 
trajectory tracking, 2318-2319 
Non linear control, 2118-2120 
Notch flexure joints, 2249 
Nucleating agents, 137-138 
Nucleation, 135, 137, 143 
Numerical control, 853 
Numerically controlled plasma chemical 

vaporization machining (NC-PCVM), 

1298 

Nylon 6 nanocomposites, 84, 85 

O 

Occupancy grid maps, 2334 

Octree hierarchical space decomposition, 

1209-1211 

Odometry, 2324-2325 
Off-centered steering wheel, 2305 
Off-line programming, 1997 
OLED, 2990-2991 
Omnidirectional WMR, 2312-2313 
OmniMove, 2342 
Omni-wheel, 2307 

On-demand manufacturing, 2472, 2474 
On-line pre-processing implementation, 

2014-2016 

Online sensing data, 3150 
Open cell polymer foam, 165-166 
Optical applications, 2990 
Optical trackers, 2490 

Optimization, 1874, 1881-1882, 3452-3454 
Organic finishing, 3048 
Orthogonal cutting, 1157-1159 
Orthogonal vs. oblique cutting, 794 
Osteointegration, 2578 
Oxidation, 2997-3207. See also Nitridation 
Oxide inclusions, 2818 

P 

Packaging materials, 2992 

Parallel kinematic machine tools (PKMTs), 

859 

Parallel linear spring mechanism, 2247 
Parallel robot (PR), 2122, 2383 
Parameter optimization, 2383, 2385, 

2388-2389, 2391-2392, 2394-2396, 
2399 

Partial heating method, 719 
Particle filters, 2332 
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Path shortcut, 1876-1881 
dehnition, 1877 
optimization, 1881 
planning, 1877-1878 
Path matrix ,213 6-2 138 
Path shortcut, 1882 

PD plus gravity controller, 1896-1899, 1918 
Perfactory system, 2558 
Permanent magnet, 2040 
Phenol formaldehyde resins, 7 
Photoelectric/opto-electronic sensor, 2057 
Photogrammetry, 2490, 3010 
Photolithography, 3010-3011 
Physical-fit models, 2519-2521 
Physical vapour deposition (PVD), 2727-2735, 

2737-2750 
adhesion, 2740 
AFM, 2746 

application, 2737-2739 
deposition method, 2727 
GDOS, 2743 

mechanical properties, 2737 
SEM, 2743-2744 
synthesis of coatings, 2734, 2743 
TEM, 2743 

Piezo-actuated printhead, 2579 

Piezoelectric actuators, 2055-2056 

Pillow effect, 436-437, 2338 

Pioneer 3-DX differential drive robot, 2309 

Pioneer 3 DX platform, 2338 

Plain strain stretching, 429 

Planarization CMP, 1591 

Planar motion compliant manipulators, 1591 

Planar motion compliant manipulators, 2237 

Planing machine, 823-824 

Plasma, 106-108, 596, 598, 611-612, 1565 

cladding, 611-612 
deposition, 107 
low temperature, 107 
spraying, 2822-2824 
treatment, 105, 1409 
Plasma ALD, 2976-2977, 2979 
reactor, 2979 

Plasma-assisted polishing (PAP), 1299 
Plasmonic lens, 1408-1409 
Plastics, 101, 116 
in car, 8 1 
coatings, 101, 116 

surface treatment and modification, 101 
Ploughing, 1093 
Plunger, 322-323 
Pneumatic actuation, 1984-1987 
Pneumatic gripper, 2037 


Pocket milling, 835 
Point cloud registration, 2019, 2021 
Poisson’s ratio, 3066, 3069 
Polarity, 599-600, 1577 
Polishing, 1053, 2652, 3035 
Poly(lactide-co-glycolide) (PLGA), 46-47 
Poly aniline (PANI), 60 
Polycaprolactone (PCL), 2574 
Polygon mesh, 2495-2496 
Polymer, 924-925, 2972 
Polymer brushes, 63-66 
Polymer coatings, 2845-2846 
Porosity (ies), 2816-2818 
cast iron, 375 

Post casting processing, 366 

Potential held method, 1878 

Potentio static mode, 2915 

Powder bed fusion, 2514 

Powder metallurgy (P/M) tool steel, 249 

Powder systems, 2428, 2558-2559 

Power sources, 601-602, 2428 

PR2, 2339-2340 

Precipitates, 579-580 

Precision, 604 

Preston’s equation, 1052 

Price discrimination, 3049, 3298, 3304, 3307 

Primers, 3049 

Process Sustainability Index (ProcSI), 

3351-3353, 3371, 3372 
case study, 3368 
evaluation process, 3354-3355 
hierarchical approach, 3352-3353 
Product condition, 3381 
Product customization, 2473 
Product development, 3213 
Productivity, 608-609 
Product-of-exponentials, 2003-2005 
Product recovery, 3377-3402 
Product service, 3224, 3231, 3233 
Product Sustainability Index (ProdSI), 

3351-3352, 3354-3363 
case study, 3363-3367 
evaluation process, 3354-3363 
hierarchical approach, 608, 3352 
Profile milling, 835 
Properties, laser, 2683 
Proprioceptive sensors, 2324-2328 
Adept Lynx, 2327-2328 
gyroscope, 2326-2327 
IMU, 2325-2326 
odometry, 2324-2325 
Proximity sensor, 2056, 2434 
Pulse current, 2903, 2906 
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Pulse discharge, 1561-1564 
Pulse generators, 1557-158 
Pulsing, 606 

Q 

Quantum effects, 686 
Quattro robot, 2092 
Quenching, 385 
QuickCNC GUI, 1217 

R 

Random yield, 3298-3299, 3301-3304 
Rapidly-exploring random trees (RRT), 

1879-1881 

Rapid manufacturing (RM), 2472-2473, 2530. 

See also Direct Digital Manufacturing 
(DDM) 

Rapid prototyping (RP), 2472, 2507-2510, 

2516, 2518, 2527-2528, 2568-2582 
Rapid tooling (RT), 2528-2530 
6R approach, 3350-3351 
Reaction injection molding (RIM) 132 
Reactive plasma spraying, 2824 
Reconhgurable machine tools, 860-864 
Reconhgurable manufacturing system (RMS), 

861-862 

Reconhgurable modular manipulator, 

2130-2131 

Reconhgurable robotic workcell, 2159-2162 
Recursive dimension reduction algorithm, 

2150,2184 

Recursive Newton-Euler algorithm, 2150 
Recursive Newton-Euler method, 1856, 1871, 

2149-2152 

Redundant manipulators, 1779, 1791, 1803 
Regressor matrix, 1893-1894, 1899, 1909 
Reliability, 722, 3152 
Remaining useful life (RUL) assessment, 

3137-3190 

Remanufacturability, 3140, 3146 
Remanufacturing, 3196-3197, 3265-3287 
Remanufacturing decision-making, 3279-3282 
Remanufacturing engineering, 3140 
Remanufacturing time point, 3 142, 3 148-3 150, 

3196, 3266, 3292, 3315, 3320, 3324 
Repairing, 2830, 2843-2844 
Residual stress, 1569, 2819, 2835 
Resistance spot welding (RSW), 2406-2409 
Resistance welding, 588-589 
Return acquisition, 3300. See also 

Acquisition return 


Reverse engineering, 2485-2503 

Reverse post processing, 1193 

Revolute and prismatic actuator modules, 2096 

Rians WeldTM, 2440 

Roboforming, 443 

Robot, 2448-2453 

Robotino robot 2313, 2315 

Robot joint control, 1943 

Roll forming 

calibration method, 291-292 
dehnition, 286 
design of, 290 

hnite element simulation, 299-303 
hexible, 303-306 
hower design, 292-295 
machine conhguration, 296-299 
process in, 286-289, 290 
strategies in, 290-291 
strip width calculation, 291-292 
tool design, 296-299 

Rolling, 173-177, 186, 189, 191, 193-194, 

197-199, 201, 208-209, 214-215, 217 
Rolling contact fatigue failure modes, 2835 
Roll-to-roll spatial ALD reactor, 2993 
Roomba robot, 2336-2337 
Rope and pulley actuation, 2044-2045 
Rotary ALD reactor, 2979, 2982 

S 

Sampling-based algorithms, 2335 
Sand-blasting, 278 
Sandwich composites, 148 
Sandwich ladle, 322 
Scheduling, 3452, 3465 
Screw actuation, 2042 
Sealants, 2833 
Sealing, 3047 

Selective laser sintering, 637, 2559 
Self-assembled monolayers (SAM), 306, 2975 
Self-driving blank holder system, 306 
Semiconductor ALD applications, 2989 
Semi- synthetic cutting huids, 1131 
SEMORS, 2158 

Semi-synthetic cutting huids, 2158 
Sensor, 2056-2058 

Sensor-based error compensation 

method, 2017 
Sensor fusion, 2368, 3276 
Sensor selection, 3271-3273 
Serial manipulators, 1807 
Service supply chain 
collaboration, 3229 
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Service supply chain ( cont .) 

enabling process management, 3234-3235 
functional process management, 

3233-3234 

strategy and network structure, 3224 
structure of, 1560, 3222 
Servo feed control, 1560-1561 
Shake-out/part removal, 314 
Shaping machine, 820-823 
SHARP sensor, 2320 
Shear, 738, 757 
Sheet lamination, 2512 
Sheet metal, 484 
Shielding gas, 600-601, 625 
Shoulder milling, 835 
Side milling, 832, 975 
structure of, 975 
Signal processing, 959-960 
chatter detection, 973 
Silica mold, 2544 
Silicon, 929-930, 2857-2860 
Simulation, 188, 191, 197, 200-203, 209-214, 

218 

Simultaneous localization and mapping 

(SLAM), 2333 
Sine law for IF, 420 
Single crystal, 1322, 1435 
Single ion beam system, 1318 
Single objective job shop scheduling, 

345 1-3472 

Single particle impact, 1083 
Single-point cutting tools, 814-815, 

824, 828, 839 
boring machine, 819-820 
centre lathe, 815-819 
closed-loop mechanisms, 1843-1845 
geometry, 814 
planing machine, 823-824 
shaping machine, 820-823 
Single point incremental forming (SPIF), 

414-415, 430 

Single walled carbon nanotube (SWNT), 57 
Singularities, 1842-1845 
cuspidality concept, 1845 
higher-order analysis, 1845 
parallel manipulators, 1844-1845 
serial manipulors, 1842 
Sinking EDM, 1554-1556, 1570 
Sinter forging (SF), 548-553 
Sintering, 553 
Size effect, 1092-1094 
Slab milling, 832 
Slip sensor, 2058 


Slot milling, 832 

Slotting machine, 821-822 

Social network analysis, 3439-3449 

Soft tooling, 2532 

Software, 2495, 2497 

Software packages, 2438 

Solar cells, 2882, 2991-2992 

Solid end mills, 836 

Solidification, 314 

Solid model, 2496-2497 

Solid state /Batch foaming process, 142, 143 

Solid state welding/bonding, 590 

Soluble oils, 1131 

Spark plasma sintering (SPS), 548 

Spatial ALD reactor, 2983-2988 

Spatial S-R-U CDR, 2191 

Spherical parallel manipulator, 1864-1871 

Spindle drives, 850-85 1 

Spin forming, 468-470 

Spiral tool path, 419, 444, 446 

Spring back in IF, 434, 436 

Sputtering 

coating, 2943, 2945, 2948 
decorative coating, 2945 
deposition, 2937, 2938, 2945 
functional coating, 2945 
magnetron, 2955 
metallization, 2946 
PVD, 2953 

reactive sputtering, 2937, 2940 
unbalanced magnetron sputtering, 2937 
Stamping, 457 

State-space model (SSM), 3173-3174 
Static calibration, 2114 
Static forces, 1848-1849 
Statics, 1850-1851 

Statics and dynamics models, 2103-2104 

Stationary robot, 2418-2419 

Steel, 919, 2499 

Steel alloys, 399 

Steel-bonded carbides, 252 

Steel casting 

deoxidation practice, 325 
equipment, 326 
gas porosity, 326 
reoxidations, 326 
sulphides, 326 
STEP, 2499 

Stereolithography (STL), 2498, 

2556-2557 

Stewart platform, 2092, 2198 
Stiffness, 2906 

Stiffness analysis, 2198-2199, 2203 
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Stopping and range of ion in matter (SRIM), 

1287, 1392 
Straddle milling, 832 
Strain, 3074, 3107 

Strain hardening modulus, 3074, 3103, 3107 

Strain-rate testing, 733 

Strengthening mechanism, 2920-2924 

Stress, 3055-3129 

Substrates, 2973 

Summit robot, 2310 

Supercapacitors, 2880-2881 

Superchair robot, 2313, 2315 

Supply chain design, 3219-3235 

Support vector machine model, 3162 

Surface cleaning, 278-279 

Surface coating techniques, 2807 

Surface contouring, 835 

Surface cross-linking, 104, 107, 112 

Surface degradation, 112 

Surface engineering, 2798 

Surface-enhanced Raman scattering (SERS), 

1303-1305 

Surface finish in IF, 440 
Surface grinding machine, 841-842 
Surface modification, 107-108 
Surface morphology, 2577-2578 
Surface quality, 1096-1098 
Surface roughness, 1556, 1576, 1706 
Surface tension, 113-115 
Surface treatment, 102-103 
chemical, 103 
physical, 102 
reactive gas, 103 
Survivability test, 2580-2582 
Sustainable manufacturing, 3348-3350 
Swedish wheel, 2306-2308 
Swisslog robot, 2303 
Synchronous drive WMR, 2311 
Syntactic foam, 148-150 
Synthetic cutting fluids, 1131 
System integrators, for robotic welding, 2441 

T 

Tactile sensing, 2434 

Tactile sensor, 2058 

Tagnite surface treatment, 3045-3046 

Tangential acceleration, 1722 

Taps, 827-828 

Task-space PD plus gravity, 1920 
Technologies, 2704 
Temperature cycling, 746-747 
Temperature distribution, 1565-1567 


Tensile properties, 85, 155-158 
Thermal ALD, 2976 
Thermal properties, 138 
Thermal spraying, 2808-2810, 2819, 2846 
Thermo-mechanical properties, 3064-3065, 

3098 

Thin coatings, 2763 
Thin films, 3055-3129 

Three-dimensional printing (3DP), 2508, 2552, 

2554. See also Additive manufacturing 
3D roll forming, 303 
Through thickness shear, 425, 2971 
TiN, 297 1 

Tissue regeneration, 2575 
Titanium, 922 
TOF camera, 2323 
Tolerance charting, 1269 
Tool, 2447, 2449-2450, 2455-2462, 2464 
Tool and die materials 243, 1576-1577 
Tool electrode, 1552, 1577 
Tool electrode wear, 1576-1577 
Tool materials, 2789-2792 
Topological synthesis, 2096 
Top paints, 3049 
Total heating method, 718-719 
Touch sensing, 2001 
Track mounted robot, 2420 
Trajectory, 1876-1877, 1881 
definition, 1877 
optimization, 1881 

Transferred plasma arc process, 2825 
Transformation Matrices, 2001-2002 
Triangulation, 2320 

Transport of ions in matter (TRIM) program 

1392, 2320 
Tricycle WMR, 23 1 1 
Triple beam system, 1330 
Triple bottom-line (TBF), 3344 
Tundish cover, 322 
Tungsten carbide, 93 1 
Turning, 1644 
Turntable, 2424 
Twin- wire, 609 
Twist, 440 

Two-column planing machine, 823 
Two point incremental forming (TPIF), 

415-416, 440 

Two step sintering (TSS), 540-543 

U 

Ucinet software package, 3444 
Ultra-precision machining, 1497 
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Ultrasonic machining (USM) 
advantages, 1070 
basic elements, 1066-1068 
capabilities and applications of, 1069-1070 
micro-USM, 1070-1073. ( See also Micro 
ultrasonic machining) 
process parameters and perfomance 
measures, 10681069 
working principle, 1066-1067 
Ultrasonic sensor, 2057, 2320 
Ultrasonic welding, 571-572 
Underactuated gripper, 2046 
Underwater robots, 2302 
Unicycle, 2318 
Universal wheel, 2307 
Unmanned ground vehicles (UGVs), 

2302, 2313 

V 

Vacuum gripper, 2038-2039 
Vacuum suction, 2054-2055 
Variable stiffness device, 2199-2200 
Velocity propagation, 1816 
Vat photopolymerization, 2513 
Velocity 

differential movement, 1811-1814, 1820 
differential relationships, coordinate 
frames, 1820-1824 
inverse kinematics, 1846 
Jacobian computation ( See Jacobian 
computation) 

kinematic characteristics, 1838 
kinematic redundancy, 1846 
propagation, 1816-1817 
relationships, 1814-1816 
representation, 1811 
singularities. See Singularities 
Vertical boring machine, 819-820 
Vertical shaping machine, 820-823 
Vibratory finishing, 1064-1065 
Vision-guided robotic tow tracker, 2342 
Visual odometry, 2322 

W 

Waste electrical and electronic equipment 

( WEEE) , 3406, 3414-315, 3432 
Water-based cutting fluids, 1131 
Wave soldering, 719, 2837 
Wear resistance, 2837 


Weibull plot, 747 
Weld/arc data monitoring, 2437 
Welding process, 569-590 
Weld seam tracking, 2434 
Wettability, 113-115, 717 
Wheeled mobile robots (WMRs) 
castor wheel, 2305 
centered steering wheel, 2305 
degree of maneuverability, 2314-2316 
description, 2302 

differential drive ( See Differential drive 
WMR) 

fixed wheel, 2304-2305 
nonholonomy ( See Nonholonomic WMRs) 
spherical wheel, 2308 
Swedish wheel, 2306-2307 
White cast iron, 374 
Wire EDM (WEDM), 1554 
Wire feeders, 2432 

WMRs. See Wheeled mobile robots (WMRs) 
Workpiece clamping system, 1075-1076 
Workpiece handling system, 2422 
Workpiece holding method, 1075 
Workspace analysis, 2101-2102, 2204 
Workspace performance measures, 2212 
Workspace representation, 2205-2209 
Workspace volume, 2210-2212 
Wrought tool steel, 243 

X 

X-ray photoelectron spectroscopy (XPS), 3013, 

3015-3017 


Y 

Yield start stress, 3083, 3103 
Yield strength, 905-906 
Young’s modulus, 3066-3067, 3073 


Z 

Zinc alloys, 319, 396-398 
melting, 319 
Zinc oxide (ZnO), 2970 
ZnS, 2972 
Zr film on SiC 

electrical properties, 3020-3021 
growth mechanism, 3021-3023 
physical characteristics, 3011-3020 
vs. Si substrate, 3023-3026 



